This is an archive of the discontinued LLVM Phabricator instance.

Differential D39050

Add index-while-building support to Clang
Abandoned · Public

Authored by jkorous on Oct 18 2017, 6:09 AM.

Details

Reviewers

klimek
akyrtzi
bkramer
ioeric
nathawes

Summary

Adds a new -index-store-path option that causes Clang to additionally collect and output source code indexing information to the supplied path. This is done by wrapping the FrontendAction otherwise set up by the invocation with a WrappingIndexRecordAction. This action simply delegates to the wrapped action, but additionally multiplexes in its own IndexASTConsumer to collect symbol information from the AST and tracks the source file and module dependencies of the translation unit (via the IndexDependencyProvider class).

When the action completes, it writes this information out to the supplied index store path in the form of a unit file, which stores the dependency information of the translation unit, and record files, which store the symbols and symbol occurrences seen in each source file. These are written out in the LLVM Bitstream format.

For a better (and more detailed) description of these changes, see the design document at: https://docs.google.com/document/d/1cH2sTpgSnJZCkZtJl1aY-rzy4uGPcrI-6RrUpdATO2Q/edit?usp=sharing

and the mailing list discussion 'RFC: Adding index-while-building support to Clang'
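
The wrapping approach described in the summary can be sketched with simplified stand-in types. This is only an illustration of "delegate to the wrapped action and collect on the side": `Action`, `CountingAction` and `WrappingRecordAction` are hypothetical names and do not correspond to Clang's `FrontendAction` API, and the `Output` vector merely stands in for the index store.

```cpp
#include <memory>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for a frontend action; not Clang's FrontendAction.
struct Action {
  virtual ~Action() = default;
  // Called once per declaration name seen while "compiling".
  virtual void handleDecl(const std::string &Name) = 0;
  virtual void finish() {}
};

// The action the invocation would have run anyway.
struct CountingAction : Action {
  int Seen = 0;
  void handleDecl(const std::string &) override { ++Seen; }
};

// Wraps another action: forwards every event to it, records the symbol
// names on the side, and "writes" them out when the action finishes.
class WrappingRecordAction : public Action {
  std::unique_ptr<Action> Wrapped;
  std::vector<std::string> Collected;
  std::vector<std::string> &Output; // stand-in for the index store

public:
  WrappingRecordAction(std::unique_ptr<Action> W,
                       std::vector<std::string> &Out)
      : Wrapped(std::move(W)), Output(Out) {}

  void handleDecl(const std::string &Name) override {
    Wrapped->handleDecl(Name); // delegate to the wrapped action
    Collected.push_back(Name); // multiplex in our own collection
  }

  void finish() override {
    Wrapped->finish();
    Output = Collected; // data is written only once the action completes
  }
};
```

The wrapped action is unaware of the recording, which is the property the summary relies on: the normal compilation behaves as before and the index data is a side output.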

Event Timeline

arphaman added inline comments. Oct 31 2017, 4:07 PM

lib/Index/IndexRecordHasher.cpp
103 ↗	(On Diff #120916)	Should you hash the return type as well?
204 ↗	(On Diff #120916)	You can use `Qualifiers::Const` here or make your own enum instead of raw constants.

Thanks @arphaman! I'll work through your comments and update.

include/clang/Index/IndexDataStoreSymbolUtils.h
13 ↗	(On Diff #120916)	They're used by IndexRecordWriter below to convert from libIndex's representation of things to the index store's.

nathawes added inline comments. Nov 6 2017, 6:49 PM

lib/Index/IndexRecordHasher.cpp
103 ↗	(On Diff #120916)	The return type doesn't affect the function's USR, so there's no need to consider it when hashing the function decl. The hashing here is happening per decl occurrence (source offset + role set + Decl) collected by the index AST walker, so changing the return type will still change the record hash when any decl occurrences it contains are hashed in.
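
A minimal sketch of the per-occurrence hashing described above, under stated assumptions: the `Occurrence` struct, the `combine` mixing function and `hashRecord` are hypothetical and are not the code in IndexRecordHasher.cpp. The Decl is represented only by its USR string, and occurrences are sorted by offset before hashing, in line with the later remark in this thread that sorting keeps report order from changing the hash.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <functional>
#include <string>
#include <tuple>
#include <vector>

// Hypothetical, simplified occurrence: source offset + role set + Decl,
// with the Decl represented here by its USR string.
struct Occurrence {
  std::string USR;
  unsigned Offset;
  std::uint32_t Roles;
};

// Boost-style hash mixing; an arbitrary choice for this sketch.
inline std::size_t combine(std::size_t Seed, std::size_t Value) {
  return Seed ^ (Value + 0x9e3779b9u + (Seed << 6) + (Seed >> 2));
}

// Hash of a record: every occurrence is hashed in, after sorting so that
// the order in which the AST walker reported them does not matter.
inline std::size_t hashRecord(std::vector<Occurrence> Occs) {
  std::sort(Occs.begin(), Occs.end(),
            [](const Occurrence &A, const Occurrence &B) {
              return std::tie(A.Offset, A.USR, A.Roles) <
                     std::tie(B.Offset, B.USR, B.Roles);
            });
  std::size_t H = 0;
  for (const Occurrence &O : Occs) {
    H = combine(H, std::hash<std::string>{}(O.USR));
    H = combine(H, std::hash<unsigned>{}(O.Offset));
    H = combine(H, std::hash<std::uint32_t>{}(O.Roles));
  }
  return H;
}
```

In this model a changed return type is not hashed through the function's own USR; it would show up only through the occurrences it contributes (for example, a reference to the new type at some offset), which matches the explanation given in the comment.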

Based on @arphaman's feedback:

Pulled the index store related diagnostics out into their own category/diagnostic group
Removed the CLANG_PROJECT_INDEX_PATH env var check.
Swapped "/" used in a few places as a separator/root with the equivalent llvm::sys::path call.
Fixed the typo/convention/documentation issues and simplifications pointed out so far

malaperle added a subscriber: malaperle. Nov 7 2017, 7:19 PM

malaperle added inline comments.

lib/Index/IndexUnitWriter.cpp
212 ↗	(On Diff #121833)	extra semi-colon (noticed this warning while compiling)

malaperle added inline comments. Nov 7 2017, 8:58 PM

lib/Index/IndexingAction.cpp
592	As a first attempt, I tried to use index::createIndexDataRecordingAction in combination with ASTUnit::LoadFromCompilerInvocationAction, but one problem is that right before it calls EndSourceFileAction in LoadFromCompilerInvocationAction, it calls transferASTDataFromCompilerInstance, which means that the SourceManager in CompilerInstance is nulled out as it gets "transferred" to the AST. So this line crashes in this case. To be fair, at this point I don't need the ASTUnit, so I can look at executing the action differently, but I thought I'd point it out!

hokein added a subscriber: hokein. Nov 8 2017, 4:41 AM

malaperle added inline comments. Nov 8 2017, 8:19 AM

lib/Index/IndexRecordWriter.cpp
155 ↗	(On Diff #121833)	I'm getting quite a few of those while indexing Clangd; it looks like they come from some LLVM/Support headers:
Index: Duplicate USR! c:@N@std@ST>2#NI#Nb@__try_lock_impl
Index: Duplicate USR! c:@N@llvm@ST>1#T@DenseMapInfo
Index: Duplicate USR! c:@N@llvm@ST>1#T@isPodLike
Index: Duplicate USR! c:@N@llvm@N@detail@ST>1#T@unit
Index: Duplicate USR! c:@N@llvm@ST>2#T#T@format_provider
Index: Duplicate USR! c:@N@llvm@ST>2#T#T@format_provider
Index: Duplicate USR! c:@N@llvm@ST>1#T@PointerLikeTypeTraits
Index: Duplicate USR! c:@N@llvm@ST>1#T@simplify_type
Index: Duplicate USR! c:@N@std@ST>1#T@atomic
Index: Duplicate USR! c:@N@llvm@ST>1#T@isPodLike
Index: Duplicate USR! c:@N@llvm@ST>1#T@DenseMapInfo
I think it would be good to have at least the file name in the log. I also assume those duplications are issues that would have to be fixed in USRGenerator (i.e. in separate patches)?

malaperle added inline comments. Nov 8 2017, 1:40 PM

include/indexstore/IndexStoreCXX.h
75 ↗	(On Diff #118854)	I know this is an old revision but I thought I should ask for the future patch... Would it be possible to not use "blocks"? This will affect portability of the code. I'm not familiar with blocks but I would think it would be possible to replace with C++11 lambdas, or something else that's standard. I was just playing around with this code and since I used GCC, it did not work.

ioeric added a subscriber: ioeric. Nov 9 2017, 12:58 AM

In D39050#900830, @arphaman wrote:

I think this patch should be split into a number of smaller patches to help the review process.

Things like tools/IndexStore, DirectoryWatcher and other components that are not directly needed right now should definitely be in their own patches.
It would be nice to find some way to split the implementation into multiple patches as well.

+1.

This is a lot of work (but great work!) for one patch. Smaller/incremental patches help reviewers understand and (hopefully) capture potential improvement of the design. I would really appreciate it if you could further split the patch.

Some comments/ideas:

The lack of tests is a bit concerning.
I think the implementation of the index output logic (e.g. IndexUnitWriter and bit format file) can be abstracted away (and split into separate patches) so that you can unit-test the action with a custom/mock unit writer; this would also make the action reusable with (potentially) other storage formats.
I would suggest that you start with a patch that implements the index action and just enough components so that you could test the action.

Thanks!

In D39050#920451, @ioeric wrote:

In D39050#900830, @arphaman wrote:

I think this patch should be split into a number of smaller patches to help the review process.

Things like tools/IndexStore, DirectoryWatcher and other components that are not directly needed right now should definitely be in their own patches.
It would be nice to find some way to split the implementation into multiple patches as well.

+1.

This is a lot of work (but great work!) for one patch. Smaller/incremental patches help reviewers understand and (hopefully) capture potential improvement of the design. I would really appreciate it if you could further split the patch.

Thanks for taking a look @ioeric! I'll have a go at splitting it further.

Some comments/ideas:

The lack of tests is a bit concerning.

I moved all the code for reading the index store data into a separate patch (to come after this one) in order to slim this one down for review, and most of the tests went with it because they're based around reading and dumping the stored data for FileCheck. The original version of this patch has them all (https://reviews.llvm.org/D39050?id=118854). The ones that remain here are just those checking that the unit/record files are written out and that the hashing mechanism is producing distinct record files when the symbolic content of the source file changes.

I think the implementation of the index output logic (e.g. IndexUnitWriter and bit format file) can be abstracted away (and split into separate patches) so that you can unit-test the action with a custom/mock unit writer; this would also make the action reusable with (potentially) other storage formats.

The added IndexRecordAction and existing IndexAction use the same functionality from libIndex to collect the indexing data, so I'm not sure mocking the unit writer to unit test IndexRecordAction would add very much value – writing the index data out is the new behavior. The existing tests for IndexAction (under test/Index/Core) are already covering the correctness of the majority of the collected indexing info and the tests coming in the follow-up patch (seen in the original version of this patch) test it's still correct after the write/read round trip.

I would suggest that you start with a patch that implements the index action and just enough components so that you could test the action.

Thanks!

phosek added a subscriber: phosek. Nov 9 2017, 10:15 PM

I think the implementation of the index output logic (e.g. IndexUnitWriter and bit format file) can be abstracted away (and split into separate patches) so that you can unit-test the action with a custom/mock unit writer; this would also make the action reusable with (potentially) other storage formats.

The added IndexRecordAction and existing IndexAction use the same functionality from libIndex to collect the indexing data, so I'm not sure mocking the unit writer to unit test IndexRecordAction would add very much value – writing the index data out is the new behavior. The existing tests for IndexAction (under test/Index/Core) are already covering the correctness of the majority of the collected indexing info and the tests coming in the follow-up patch (seen in the original version of this patch) test it's still correct after the write/read round trip.

Thanks for the clarification! I still think it's useful to decouple the IndexAction from the bit format file so that it could be reusable elsewhere. For example, I can see the index action be useful to clangd for building in-memory index.

I also tried applying your original patch locally but couldn't get it to work mostly due to portability issues (e.g. blocks and if (APPLE) in make files). AFAIK, many folks compile clang with GCC and/or without APPLE, so it's important that you get the portability right from the very beginning. Thanks!

Index-while-build is awesome! I'm looking forward to your patches!

Hey Eric,

In D39050#921748, @ioeric wrote:

I think the implementation of the index output logic (e.g. IndexUnitWriter and bit format file) can be abstracted away (and split into separate patches) so that you can unit-test the action with a custom/mock unit writer; this would also make the action reusable with (potentially) other storage formats.

The added IndexRecordAction and existing IndexAction use the same functionality from libIndex to collect the indexing data, so I'm not sure mocking the unit writer to unit test IndexRecordAction would add very much value – writing the index data out is the new behavior. The existing tests for IndexAction (under test/Index/Core) are already covering the correctness of the majority of the collected indexing info and the tests coming in the follow-up patch (seen in the original version of this patch) test it's still correct after the write/read round trip.

Thanks for the clarification! I still think it's useful to decouple the IndexAction from the bit format file so that it could be reusable elsewhere. For example, I can see the index action be useful to clangd for building in-memory index.

As Nathan mentioned, we believe the indexing action, as it exists in trunk, is decoupled enough to be useful; for example, Marc was already able to use it and write out the indexing data in a completely different format for his fork of clangd. Of course, we are definitely interested in any additional refactorings that would structure things better, and we are eager to see and discuss follow-up patches from anyone interested in improving the code, but could we treat these as potential follow-up improvements?

We are eager to provide the functionality so others can start experimenting with it; I'd propose that we discuss ideas for refactoring the code as follow-ups. What do you think? Getting the initial functionality in and iterating on it, while gaining more experience with applying it to various use cases, is a common operating mindset of the llvm/clang projects.

I also tried applying your original patch locally but couldn't get it to work mostly due to portability issues (e.g. blocks and if (APPLE) in make files). AFAIK, many folks compile clang with GCC and/or without APPLE, so it's important that you get the portability right from the very beginning. Thanks!

Nathan will look into making using blocks optional, providing additional function pointer+context APIs where appropriate and having the common implementation using lambdas.
As for the APPLE-specific parts, the only truly Darwin-specific part is the one using FSEvents; the other 'if (APPLE)' checks can likely be removed. We would generally need help from people with Linux expertise to provide the equivalent of the FSEvents functionality, but this is a small part of the overall feature and is not important for getting the index-while-building data.

But these things are not part of the current patch, we can discuss again with the follow-up patches that will contain those things.
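
The "function pointer + context" shape mentioned above, with a lambda-friendly layer on top, might look roughly like the following. This is a sketch under assumptions, not the libIndexStore API: `Store`, `store_units_apply_fn` and `forEachUnit` are hypothetical names chosen for illustration.

```cpp
#include <string>
#include <vector>

// Hypothetical store type for this sketch; not the real libIndexStore API.
struct Store {
  std::vector<std::string> UnitNames;
};

// C-style entry point: a plain function pointer plus an opaque context
// pointer, so no blocks support is required from the compiler.
// The callback returns false to stop the iteration early.
inline bool store_units_apply_fn(const Store &S, void *Ctx,
                                 bool (*Fn)(void *Ctx, const char *Name)) {
  for (const std::string &Name : S.UnitNames)
    if (!Fn(Ctx, Name.c_str()))
      return false;
  return true;
}

// C++ convenience wrapper: accepts any callable (e.g. a lambda) and routes
// it through the function-pointer API by passing its address as context.
template <typename Callable>
bool forEachUnit(const Store &S, Callable Receiver) {
  auto Trampoline = [](void *Ctx, const char *Name) -> bool {
    return (*static_cast<Callable *>(Ctx))(Name);
  };
  return store_units_apply_fn(S, &Receiver, Trampoline);
}
```

A caller could then write `forEachUnit(S, [&](const char *Name) { ...; return true; })` with GCC or any other standard C++11 compiler, while a blocks-based overload could remain available where the extension is supported.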

Index-while-build is awesome! I'm looking forward to your patches!

In D39050#922597, @akyrtzi wrote:

Hey Eric,

In D39050#921748, @ioeric wrote:

I think the implementation of the index output logic (e.g. IndexUnitWriter and bit format file) can be abstracted away (and split into separate patches) so that you can unit-test the action with a custom/mock unit writer; this would also make the action reusable with (potentially) other storage formats.

The added IndexRecordAction and existing IndexAction use the same functionality from libIndex to collect the indexing data, so I'm not sure mocking the unit writer to unit test IndexRecordAction would add very much value – writing the index data out is the new behavior. The existing tests for IndexAction (under test/Index/Core) are already covering the correctness of the majority of the collected indexing info and the tests coming in the follow-up patch (seen in the original version of this patch) test it's still correct after the write/read round trip.

Thanks for the clarification! I still think it's useful to decouple the IndexAction from the bit format file so that it could be reusable elsewhere. For example, I can see the index action be useful to clangd for building in-memory index.

As Nathan mentioned, we believe the indexing action, as it exists in trunk, is decoupled enough to be useful; for example, Marc was already able to use it and write out the indexing data in a completely different format for his fork of clangd. Of course, we are definitely interested in any additional refactorings that would structure things better, and we are eager to see and discuss follow-up patches from anyone interested in improving the code, but could we treat these as potential follow-up improvements?

We are eager to provide the functionality so others can start experimenting with it; I'd propose that we discuss ideas for refactoring the code as follow-ups. What do you think? Getting the initial functionality in and iterating on it, while gaining more experience with applying it to various use cases, is a common operating mindset of the llvm/clang projects.

To be honest, I want this functionality to get in as much as you do, and I'm more than happy to prioritize the code review for it :) But the current patch size makes reviewing really hard (e.g. I would never have caught the blocks issues had I not tried running the original patch myself). I'm not sure it's really common practice to check in a big chunk of code without careful code review and leave potential improvements as follow-ups. I'm sure @klimek would have thoughts about this.

If the index action is already flexible enough, would you mind splitting the code for the index action out so that we can start reviewing it? Given that the current patch has very few tests, I guess it wouldn't be too much worse to split out the action without proper test.

To be honest, I want this functionality to get in as much as you do, and I'm more than happy to prioritize the code review for it :) But the current patch size makes reviewing really hard (e.g. I would never have caught the blocks issues had I not tried running the original patch myself). I'm not sure it's really common practice to check in a big chunk of code without careful code review and leave potential improvements as follow-ups. I'm sure @klimek would have thoughts about this.

To be clear, I didn't mean to imply we don't want careful code review, we are really happy for people to point out issues. For example the building problems on linux are serious issues that we will fix and we are grateful for your feedback!

If the index action is already flexible enough, would you mind splitting the code for the index action out so that we can start reviewing it? Given that the current patch has very few tests, I guess it wouldn't be too much worse to split out the action without proper test.

To clarify, the index action Nathan and I are referring to, is the indexing action that exists currently in trunk and is the source of the index symbols, feeding index symbols to an abstract IndexDataConsumer. See here: https://llvm.org/svn/llvm-project/cfe/trunk/include/clang/Index/IndexingAction.h
This is what Marc used to get the index symbols and store them in his own format. Tests for this functionality are in: https://llvm.org/svn/llvm-project/cfe/trunk/test/Index/Core/

If the index action is already flexible enough, would you mind splitting the code for the index action out so that we can start reviewing it? Given that the current patch has very few tests, I guess it wouldn't be too much worse to split out the action without proper test.

To clarify, the index action Nathan and I are referring to, is the indexing action that exists currently in trunk and is the source of the index symbols, feeding index symbols to an abstract IndexDataConsumer. See here: https://llvm.org/svn/llvm-project/cfe/trunk/include/clang/Index/IndexingAction.h
This is what Marc used to get the index symbols and store them in his own format. Tests for this functionality are in: https://llvm.org/svn/llvm-project/cfe/trunk/test/Index/Core/

Ah, sorry, I was referring to IndexRecordAction and its friends (record readers/writers). I didn't notice the newly added index action and really didn't mean to ask you to refactor the existing code. Apologies for the miscommunication!

What I wanted to propose is that we could decouple the reading/writing of records/units from the bit file format, so that the record output is not tied to a single output format (e.g. bit format, directory-based), which would make the compiler more flexible. This might already be the case, but it's not really easy to tell from the current patch...
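
The decoupling proposed above could be sketched as an abstract sink that the recording action writes to, with the bitstream writer being just one implementation. Everything below is hypothetical: the patch under review has no `UnitDataSink`, `InMemorySink` or `finishTranslationUnit`; the names only illustrate how a mock or in-memory implementation could be swapped in for tests or for an in-memory index.

```cpp
#include <string>
#include <vector>

// Hypothetical interface; the patch under review has no class by this name.
struct UnitDataSink {
  virtual ~UnitDataSink() = default;
  virtual void writeUnit(const std::string &MainFile,
                         const std::vector<std::string> &Dependencies) = 0;
};

// In-memory implementation, e.g. for unit tests or an in-memory index.
struct InMemorySink : UnitDataSink {
  struct Unit {
    std::string MainFile;
    std::vector<std::string> Dependencies;
  };
  std::vector<Unit> Units;

  void writeUnit(const std::string &MainFile,
                 const std::vector<std::string> &Dependencies) override {
    Units.push_back({MainFile, Dependencies});
  }
};

// Stand-in for the end of the recording action: it only knows the
// interface, not the storage format behind it.
inline void finishTranslationUnit(UnitDataSink &Sink,
                                  const std::string &MainFile,
                                  const std::vector<std::string> &Deps) {
  Sink.writeUnit(MainFile, Deps);
}
```

A bitstream-backed sink would implement the same interface, so the choice of storage format would be made by whoever constructs the action rather than by the action itself.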

Hi! I got a bit further in my experiment in integrating this in Clangd. I put some comments (in the first more complete revision). But since the scope of this patch changed, if you feel like we should take the discussions elsewhere, please let me know! Thanks!

include/indexstore/IndexStoreCXX.h
84 ↗	(On Diff #118854)	From what I understand, this returns the beginning of the occurrence. It would be useful to also have the end of the occurrence. From what I tested in Xcode, when you do "Find Selected Symbol in Workspace", it highlights the symbol name in yellow in the list of matches, so it must use that LineCol and then highlight the matching name. This works in many situations, but other occurrences won't have the name of the symbol. For example: "MyClass o1, o2;" If I use "Find Selected Symbol in Workspace" on the MyClass constructor, it won't be able to highlight o1 and o2. Do you think it would be possible to add that (EndLineCol)? If not, how would one go about extending libindexstore in order to add additional information per occurrence? It is not obvious to me so far. We also need other things; for example, in the case of definitions, we need the full range of the definition so that users can "peek" at definitions in a pop-up. Without storing the end of the definition in the index, we would need to reparse the file.
374 ↗	(On Diff #118854)	As part of this dependency tracking mechanism, I haven't found a way for it to provide information about the files including a specific header. So given a header (or source file in odd cases), I do not see a way to get all the files that would need to be reindexed if that header changed. Is that something you achieve outside the index? Or perhaps this is something I missed in the code.
377 ↗	(On Diff #118854)	Could there be a bit of explanation about what a File dependency is versus a record and a unit? All units and records are file dependencies, right? Are there any files that are neither records nor units?

Thanks for the feedback @malaperle!

include/indexstore/IndexStoreCXX.h
84 ↗	(On Diff #118854)	Our approach to related locations (e.g. name, signature, body, and doc comment start/end locs) has been to not include them in the index and derive them from the start location later. There's less data to collect and write out during the build that way, and deriving the other locations isn't that costly usually, as in most cases you 1) don't need to type check or even preprocess to get the related locations, as with finding the end of o1 and o2 in your example, and 2) usually only need to derive locations for a single or small number of occurrences, like when 'peeking' at a definition. Are there cases where you think this approach won't work/perform well enough for the indexer-backed queries clangd needs to support?
374 ↗	(On Diff #118854)	The unit files store the paths of the header/source files they depend on as 'File' dependencies. So any unit file with a 'File' dependency on a header/source file that was modified may need to be re-indexed. To support finding which specific files include or are included by a given header (rather than which units somehow transitively include it) we also store the file-to-file inclusions in the unit file (retrieved via IndexUnitReader's foreachInclude method below).
377 ↗	(On Diff #118854)	I'll rename this to SourceFile and add some comments to explain. Unit file dependencies separate the source dependencies into 'File' dependencies and 'Record' dependencies. The 'File' dependencies track the paths of the header/source files seen in the translation unit, while the 'Record' dependencies track which record files have the symbolic content seen in those source files – the header/source file path doesn't appear anywhere in the record file. This separation lets us depend on a single record file corresponding to multiple source files (e.g. when two source files have the same symbolic content), and on a single source file corresponding to multiple record files (e.g. when a single header is included multiple times with different preprocessor contexts changing its symbolic content).
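
As a rough model of the unit-file contents described in the two replies above, the queries "which units may need re-indexing after a file changed" and "which files directly include this header" can be sketched as follows. The `UnitInfo` struct and both functions are hypothetical; the field names are illustrative and say nothing about the on-disk format or the IndexUnitReader API.

```cpp
#include <algorithm>
#include <string>
#include <utility>
#include <vector>

// Hypothetical in-memory view of one unit file.
struct UnitInfo {
  std::string Name;
  std::vector<std::string> FileDeps;   // header/source paths seen in the TU
  std::vector<std::string> RecordDeps; // record files with the symbol data
  // File-to-file inclusions: (includer, included).
  std::vector<std::pair<std::string, std::string>> Includes;
};

// Units with a 'File' dependency on Path may need to be re-indexed.
inline std::vector<std::string>
unitsToReindex(const std::vector<UnitInfo> &Units, const std::string &Path) {
  std::vector<std::string> Result;
  for (const UnitInfo &U : Units)
    if (std::find(U.FileDeps.begin(), U.FileDeps.end(), Path) !=
        U.FileDeps.end())
      Result.push_back(U.Name);
  return Result;
}

// Files that directly include Header, according to one unit's inclusions.
inline std::vector<std::string> directIncluders(const UnitInfo &U,
                                                const std::string &Header) {
  std::vector<std::string> Result;
  for (const auto &Inc : U.Includes)
    if (Inc.second == Header)
      Result.push_back(Inc.first);
  return Result;
}
```

Note that record files are looked up through `RecordDeps` only; as the reply explains, a record does not contain the source path, so the path-to-record association lives entirely in the unit.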

ioeric added a reviewer: ioeric. Nov 28 2017, 6:10 AM

First round of comments, mostly around indexing actions and file records; I haven't started reviewing the data writing and storing code. I think it might make sense to split the index writing and storing logic into a separate patch, which should be possible if writeUnitData is abstracted into an interface.

include/clang/Frontend/CompilerInstance.h
188	It might make sense to define an alias for `std::function<std::unique_ptr<FrontendAction>(const FrontendOptions &opts, std::unique_ptr<FrontendAction> action)>`, which is used multiple times.
include/clang/Frontend/FrontendOptions.h
377	It might make sense to also have documentation for these options here.
include/indexstore/IndexStoreCXX.h
84 ↗	(On Diff #118854)	I agree that we should try to keep the serialized symbol size minimal, but I think it's worthwhile to also store end locations for occurrences because 1) it's cheap, 2) it's not necessarily easy to compute without parsing for occurrences like `a::b::c` or `a::b<X>`, 3) it would be useful for many LSP use cases.
lib/Frontend/CompilerInstance.cpp
1178	nit: no braces around one liners.
lib/FrontendTool/ExecuteCompilerInvocation.cpp
175	Could you comment on what this does? The `Act` above is already wrapped. Why do we need `setGenModuleActionWrapper` to `createIndexDataRecordingAction` again? Also, `createIndexDataRecordingAction` doesn't seem related to `GenModule`.
lib/Index/FileIndexRecord.cpp
23 ↗	(On Diff #121833)	Why?
37 ↗	(On Diff #121833)	Please comment when this would happen.
39 ↗	(On Diff #121833)	Why do we need `Decls` to be sorted by offset? If we want this for printing, it might make sense to just do a sort there.
lib/Index/FileIndexRecord.h
24 ↗	(On Diff #121833)	Please add documentation.
41 ↗	(On Diff #121833)	Is this clang-formatted? You might want to run git-clang-format on the whole patch.
lib/Index/IndexingAction.cpp
314	Again, you don't need the full `IndexingContext` and `RecordOptions` here.
324	Note that `getDecomposedExpansionLoc` can also return invalid decomposed loc.
338	Do we want better error handling here?
357	Please provide documentation.
534	Can we get this state from the base class instead of maintaining another state, which seems to be identical?
554	Just `StringRef BuildNumber = RepositoryPath;`
731	Please provide a brief documentation for this class.
733	Again, it doesn't seem necessary for this class to have information about all record options. It seems that you only need `RecordSystemDependencies` here.
743	readability nit: avoid using `auto` if the return type is short to spell but hard to infer from the value expression. Same elsewhere.
792	Could you add a comment explaining why we are not allowing searching?
799	It's a bit worrying that `IndexDataRecorder` and `IndexContext` reference each other. If you only need some information from the `IndexingContext`, simply pass it into `Recorder`. In this case, I think you only need the `SourceManager` from the `ASTContext` in the recorder to calculate whether a file is a system header. I see you also cache result of `IndexingContext::isSystemFile` in the indexing context, but I think it would be more sensible for the callers to handle caching for this call.
801	nit: no braces around one liners.
809	nit: redundant empty line
867	Just `auto pair = getIndexOptionsFromFrontendOptions(FEOpts);` and then use `pair.first` and `pair.second`? Same below.
lib/Index/IndexingContext.h
60	Please define the scope of this class to avoid throwing random states into it, which usually happens to a "context" class.
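
For the first comment in this round (on CompilerInstance.h), an alias for the repeated `std::function` type could look like the sketch below. The alias name `ActionWrapperFn`, the `RecordingAction` class, `makeIndexWrapper` and the `IndexStorePath` field are assumptions made for illustration, and `FrontendOptions`/`FrontendAction` are minimal stand-ins so that the example compiles on its own; the real types live in clang/Frontend.

```cpp
#include <functional>
#include <memory>
#include <string>
#include <utility>

// Minimal stand-ins so the sketch is self-contained.
struct FrontendOptions {
  std::string IndexStorePath; // assumed field name for -index-store-path
};
struct FrontendAction {
  virtual ~FrontendAction() = default;
  virtual std::string name() const { return "base"; }
};

// Hypothetical alias; the review only asks that some alias exist.
using ActionWrapperFn = std::function<std::unique_ptr<FrontendAction>(
    const FrontendOptions &Opts, std::unique_ptr<FrontendAction> Action)>;

// Illustrative wrapping action that owns the action it wraps.
struct RecordingAction : FrontendAction {
  std::unique_ptr<FrontendAction> Wrapped;
  explicit RecordingAction(std::unique_ptr<FrontendAction> W)
      : Wrapped(std::move(W)) {}
  std::string name() const override {
    return "recording(" + Wrapped->name() + ")";
  }
};

// Returns a wrapper that wraps only when an index store path is set.
inline ActionWrapperFn makeIndexWrapper() {
  return [](const FrontendOptions &Opts,
            std::unique_ptr<FrontendAction> Action)
             -> std::unique_ptr<FrontendAction> {
    if (Opts.IndexStorePath.empty())
      return Action;
    return std::unique_ptr<FrontendAction>(
        new RecordingAction(std::move(Action)));
  };
}
```

With the alias in place, declarations that store or accept the wrapper can spell `ActionWrapperFn` instead of repeating the full `std::function` signature.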

malaperle mentioned this in D40548: [clangd] Symbol index interfaces and an in-memory index implementation. Dec 4 2017, 1:24 PM

malaperle added inline comments. Dec 7 2017, 9:53 AM

include/indexstore/IndexStoreCXX.h
84 ↗	(On Diff #118854)	There are a few reasons I think it's better to store the end loc. When doing "find references", computing the end loc of each occurrence will be costly. Imagine having thousands of occurrences and for each of them having to run logic to find the end of the occurrence. The AST and preprocessor are the best tools I know to figure out the proper end loc. Not using them means having to write a mini-preprocessor with some knowledge about the language semantics to cover some cases.
MyClass |o1, o2;
Here, I have to stop at the comma. So it's basically take any alphanumeric character, right?
bool operator<(const Foo&, const Foo&)
Ret operator()(Params ...params)
No, in those cases, we have to take < and the first ().
In the case of body start/end locations, similarly, it can be non-trivial.
void foo() { if (0) { } }
We have to count the balanced { } until we finish the body.
#define FUNC_BODY {\
\
}
void foo() FUNC_BODY
Oops, where's the body? We need more special logic for this, etc. I think overall, it puts a lot of burden on the client of libIndexStore, a burden that would be much more work and less accurate than using the AST/Preprocessor while indexing.
374 ↗	(On Diff #118854)	Thanks! I'll play around with this a bit more with this new information.
377 ↗	(On Diff #118854)	It's more clear now, thanks!
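
To make the concern about deriving end locations concrete, here is a deliberately naive body-end finder of the kind a client might write without the AST or preprocessor. It is a hypothetical helper (`findBodyEnd` is not part of any library discussed here) and is shown only to illustrate where the brace-counting approach from the comment works and where it breaks down.

```cpp
#include <cstddef>
#include <string>

// Naive body-end finder: starting at the first '{' at or after From,
// returns the offset of the matching '}' or npos if there is none.
// It ignores comments, string literals and macros on purpose.
inline std::size_t findBodyEnd(const std::string &Text, std::size_t From) {
  std::size_t Open = Text.find('{', From);
  if (Open == std::string::npos)
    return std::string::npos;
  int Depth = 0;
  for (std::size_t I = Open; I < Text.size(); ++I) {
    if (Text[I] == '{')
      ++Depth;
    else if (Text[I] == '}' && --Depth == 0)
      return I;
  }
  return std::string::npos;
}
```

For `void foo() { if (0) { } }` the function finds the final brace, but for `void foo() FUNC_BODY` it finds nothing because the body comes from a macro, and a `}` inside a string literal ends the scan too early. Handling those cases correctly is the extra burden on clients that the comment describes.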

@malaperle, to clarify we are not suggesting that you write your own parser, the suggestion is to use clang in 'fast-scan' mode to get the structure of the declarations of a single file, see CXTranslationUnit_SingleFileParse (along with enabling skipping of bodies). We have found clang is super fast when you only try to get the structure of a file like this. We can make convenient APIs to provide the syntactic structure of declarations based on their location.

But let's say we added the end-loc; is it enough? If you want to implement 'peek the definition' like Eclipse, then it is not enough: you also need to figure out whether there are documentation comments associated with the declaration and show those too. Also, what if you want to highlight the type signature of a function? Then just storing the location of the closing brace of its body is not enough. There can be any number of things you may want to get from the structure of the declaration (e.g. the parameter ranges), but we could provide an API to gather any syntactic structure info you may want.

I would encourage you to try CXTranslationUnit_SingleFileParse|CXTranslationUnit_SkipFunctionBodies, you will be pleasantly surprised with how fast this mode is. The c-index-test option is -single-file-parse.

nathawes added inline comments. Dec 7 2017, 3:42 PM

lib/FrontendTool/ExecuteCompilerInvocation.cpp
175	It's to wrap any GenerateModuleActions that get created as needed when/if Act ends up loading any modules, so that we output index data for them too. I'll add a comment.
lib/Index/FileIndexRecord.cpp
39 ↗	(On Diff #121833)	It's mostly for when we hash them, so that ordering doesn't change the hash, but it's also for printing. The IndexASTConsumer doesn't always report symbol occurrences in source order, due to the preprocessor and a few other cases. We can sort them when the IndexRecordDataConsumer's finish() is called rather than as they're added to avoid the copying from repeated insert calls if that's the concern.
lib/Index/IndexingAction.cpp
534	I don't see this state in either base class (WrapperFrontendAction and IndexRecordActionBase). WrappingIndexAction and WrappingIndexRecordAction both have this, though. Were you thinking a new intermediate common base class between them and WrapperFrontendAction?
799	Good point. The IndexingContext was actually already calling IsSystemFile before it calls IndexDataRecorder's handleDeclOccurrence and handleModuleOccurrence anyway, so I'll change it to pass that through as an extra param and remove IndexDataRecorder's dependency on the IndexingContext.
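A minimal sketch of the ordering concern discussed for lib/Index/FileIndexRecord.cpp above: if occurrences are sorted by offset at the point where they are hashed, the order in which they were reported does not change the hash. The `Occurrence` struct, the `hashOccurrences` helper and the FNV-1a-style mixing are assumptions made for illustration, not the patch's actual types or hashing scheme.

```cpp
#include <algorithm>
#include <cstdint>
#include <string>
#include <tuple>
#include <vector>

// Hypothetical stand-in for a recorded symbol occurrence.
struct Occurrence {
  unsigned Offset; // byte offset within the file
  std::string USR; // illustrative symbol identifier
};

// Returns a sorted copy; the caller's container keeps its insertion order.
std::vector<Occurrence>
getOccurrencesSortedByOffset(std::vector<Occurrence> Occs) {
  std::sort(Occs.begin(), Occs.end(),
            [](const Occurrence &A, const Occurrence &B) {
              return std::tie(A.Offset, A.USR) < std::tie(B.Offset, B.USR);
            });
  return Occs;
}

// Illustrative FNV-1a-style hash computed over the sorted occurrences.
std::uint64_t hashOccurrences(const std::vector<Occurrence> &Occs) {
  std::uint64_t H = 1469598103934665603ULL;
  auto Mix = [&H](unsigned char Byte) {
    H ^= Byte;
    H *= 1099511628211ULL;
  };
  for (const Occurrence &O : getOccurrencesSortedByOffset(Occs)) {
    for (int Shift = 0; Shift < 32; Shift += 8)
      Mix(static_cast<unsigned char>((O.Offset >> Shift) & 0xFF));
    for (char C : O.USR)
      Mix(static_cast<unsigned char>(C));
    Mix(0); // separator between occurrences
  }
  return H;
}
```

Sorting a copy once at hash time avoids the repeated insert calls mentioned above, at the cost of one extra copy per record.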

Worked through the comments from @ioeric and split the code for writing out the collected indexing data into a separate patch.

Herald added a subscriber: mgrang. Dec 7 2017, 4:02 PM

nathawes added a child revision: D40992: Add index-while-building support to Clang - Part 2. Dec 7 2017, 4:29 PM

In D39050#948500, @akyrtzi wrote:

@malaperle, to clarify we are not suggesting that you write your own parser, the suggestion is to use clang in 'fast-scan' mode to get the structure of the declarations of a single file, see CXTranslationUnit_SingleFileParse (along with enabling skipping of bodies). We have found clang is super fast when you only try to get the structure of a file like this.

Thank you, that sounds very useful. I will try that and get some measurements.

We can make convenient APIs to provide the syntactic structure of declarations based on their location.

Perhaps just for the end-loc since it's pretty much guaranteed to be needed by everyone. But if it's very straightforward, perhaps that's not needed. I'll try and see.

But let's say we added the end-loc, is it enough ? If you want to implement the 'peek the definition' like Eclipse, then it is not enough, you also need to figure out if there are documentation comments associated with the declaration and also show those. Also what if you want to highlight the type signature of a function, then just storing the location of the closing brace of its body is not enough. There can be any arbitrary things you may want to get from the structure of the declaration (e.g. the parameter ranges), but we could provide an API to gather any syntactic structure info you may want.

That's a very good point. I guess in the back of my mind, I have the worry that one cannot extend what is stored, either for a different performance trade-off or for additional things. The fact that both clang and clangd have to agree on the format so that index-while-building can be used seems to make it inherently not possible to extend. But perhaps it's better to not overthink this for now.

Thanks a lot for the changes! Some more comments inlined.

Please mark addressed comments as done so that reviewers know what to look at :) Thanks!

include/clang/Frontend/CompilerInstance.h
187	nit: LLVM variable names start with upper-case letters.
include/clang/Index/IndexingAction.h
31	This should be removed? Some forward declarations above are not used as well.
lib/Driver/Job.cpp
306	Could you share this code with line 278 above, which already has a nice comment?
lib/Index/FileIndexRecord.cpp
39 ↗	(On Diff #121833)	I would leave the sorting to the point where records are hashed to avoid making the record stateful. Consider changing `getDeclOccurrences` to `getOccurrencesSortedByOffset`; this should make the behavior more explicit.
lib/Index/FileIndexRecord.h
51 ↗	(On Diff #126065)	s/isSystem/IsSystem/ Also, I wonder if we can filter out system decls proactively and avoid creating file index records for them. We could also avoid propagating `IsSystem` here.
lib/Index/IndexingAction.cpp
395	`IsSystemFileCache &SysrootPath`? What is this parameter?
484	Please document this class. This can be easily confused with `IndexActionBase` which has a similar name. Same for `IndexAction`/`IndexRecordAction` and `WrappingIndexAction`/`WrappingIndexRecordAction`. I think these pairs (especially the wrapping actions) share some common logic and could probably be merged.
510	This does a lot of stuff... please document the behavior!
534	I thought this could be a state in the `WrapperFrontendAction` since both derived classes maintain this state, but after a closer look, this seems to depend on both base classes. I'm not a big fan of maintaining state in multi-stage classes (e.g. `FrontendAction`), which could be confusing and hard to follow; I think `IndexRecordActionBase::finish(...)` should be able to handle the case where no index consumer is created (i.e. no record/dependency/... is collected). Also, `IndexRecordActionBase` (and the existing `IndexActionBase`) should really be a component instead of a base class since none of its methods is `virtual`.
602	nit: no need for braces. Same below.
614	In the previous patch, `writeUnitData` does several things including handling modules, dependencies, includes and index records, as well as writing data. It might make sense to add an abstract class (`UnitDataCollector`?) that defines interfaces which make these behavior more explicit. We can then have users pass in an implementation via `createIndexDataRecordingAction` which would also decouple the data collection from data storage in the library.
649	I'm a bit nervous about propagating the entire `FrontendOptions` into the index library. I would simply expose `getIndexOptionsFromFrontendOptions` and have callers parse `FrontendOptions` and pass in only index-related options.
lib/Index/IndexingContext.h
41	This name is really confusing... `Is*` is usually used for booleans. Simply call this `SystemFileCache`.
53	How does this affect the existing cached results? Do you need to invalidate them?
64	I think it would be more straightforward to have context own the cache. If `setSysrootPath` is the problem, it might make sense to propagate it via the context or, if necessary, create a new cache when a new `SysrootPath` is set.
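The suggestion for lib/Index/IndexingContext.h (have the context own the cache and create a new cache when a new `SysrootPath` is set) could look roughly like the sketch below. The class names and the prefix-based "is system file" check are assumptions for illustration only; the real check in the patch is not shown in this thread.

```cpp
#include <string>
#include <unordered_map>
#include <utility>

// Hypothetical cache of "is this path a system file?" answers.
class SystemFileCache {
  std::string SysrootPath;
  std::unordered_map<std::string, bool> Cache;

public:
  explicit SystemFileCache(std::string Sysroot)
      : SysrootPath(std::move(Sysroot)) {}

  bool isSystemFile(const std::string &Path) {
    auto It = Cache.find(Path);
    if (It != Cache.end())
      return It->second;
    // Assumption: a file is "system" if it lives under the sysroot.
    bool IsSystem = !SysrootPath.empty() &&
                    Path.compare(0, SysrootPath.size(), SysrootPath) == 0;
    Cache.emplace(Path, IsSystem);
    return IsSystem;
  }
};

// Hypothetical context that owns the cache outright.
class IndexingContextSketch {
  SystemFileCache SysFileCache;

public:
  explicit IndexingContextSketch(std::string Sysroot)
      : SysFileCache(std::move(Sysroot)) {}

  // Replacing the cache drops every answer computed for the old sysroot,
  // so no stale result can survive the change.
  void setSysrootPath(std::string Sysroot) {
    SysFileCache = SystemFileCache(std::move(Sysroot));
  }

  bool isSystemFile(const std::string &Path) {
    return SysFileCache.isSystemFile(Path);
  }
};
```

Because the cache is rebuilt rather than patched, the question of whether existing cached results need invalidating goes away.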

Thanks for taking another look @ioeric – I'll work through your comments and update.

nathawes marked 45 inline comments as done. Dec 18 2017, 2:05 PM

nathawes added inline comments.

lib/Index/FileIndexRecord.h
51 ↗	(On Diff #126065)	If the -index-ignore-system-symbols flag is set system decls are filtered out in IndexingContext::handleDeclOccurrence and aren't reported to the IndexDataConsumer, so FileIndexRecords won't be created. The IsSystem here is for clients that want index data for system files, but want to be able to distinguish them from regular files.

I've refactored the indexing/dependency data collection out from the writing with the new IndexUnitDataConsumer class, and made other smaller changes to address the feedback from @ioeric.

Fix out of date header comment in FileIndexData.h

Thanks a lot for further cleaning up the patch! It is now much easier to review. I really appreciate it!

Some more comments on the public APIs and the layering of classes. There are a lot of helper classes in the implementation, so I think it's important to get a clear layering so that they could be easily understood by future contributors :)

Also, with the IndexUnitDataConsumer abstraction, it seems to be feasible now to add some unit tests for createUnitIndexingAction. With all the recent major changes, I think it's important that we have some degree of testing to make sure components actually work together.

include/clang/Frontend/CompilerInstance.h
187	`opts` and `action` are still lower-case.
include/clang/Index/DeclOccurrence.h
39	Nit: indentation. Tip: `git-clang-format` against the diff base can format all changed lines in your patch.
include/clang/Index/IndexUnitDataConsumer.h
1 ↗	(On Diff #127568)	IIUC, this is the index data for a translation unit, as opposed to an AST. If so, consider calling this `UnitIndexDataConsumer` to match `(AST)IndexDataConsumer` which is parallel to this. We might want to rename them to be either `index::UnitDataConsumer` vs `index::ASTDataConsumer` or `index::UnitIndexDataConsumer` vs `index::ASTIndexDataConsumer` . I am inclined to the first pair as `index` is already implied in the namespace.
67 ↗	(On Diff #127568)	Comment? Why do we actually need this?
include/clang/Index/IndexingAction.h
44	We are now mixing functionalities for Unit indexing and AST indexing actions in the same file. We might want to separate these into two headers, e.g. `UnitIndexingAction.h` and `ASTIndexingAction.h`. This would make it easier for users to find the right functions :)
61	Please add documentation for each field. It's not trivial what each field is for, especially some fields seem to be optional and some seem to be mutually exclusive.
62	These pointers suggest the life time of this struct is tied to some other struct, which makes the struct look a bit dangerous to use. Should we also carry a reference or a smart pointer to the underlying object that keeps these pointers valid? Would it be a `CompilerInstance` (guessing from `IndexUnitDataConsumerFactory` )?
78	What is the intended user of this function? It's unclear how users could obtain a `ConsumerFactory` (i.e. `UnitDetails`) without the functionalities in `UnitDataConsumerActionImpl` . (Also see comment in the implementation of `createIndexDataRecordingAction`.)
92	This is likely only useful for compiler invocation. I would put it in the compiler invocation code.
lib/Driver/Job.cpp
216	nit: Comment should start with an overview of what the function does. Returns a directory path that is ... Also, consider calling this `getDirAdjacentToModCache`. `buildDir` can be ambiguous.
225	Please clang-format the code. Without indentation, this looks like a no-op statement.
lib/Index/IndexingAction.cpp
93	Use `class` for interfaces.
93	Does `CI` here have to be the same instance as the one in `createIndexASTConsumer`? Might be worth documenting.
148	nit: Move this after `Impl->createIndexASTConsumer(CI)`. Do we need to reset this flag? Calling `CreateASTConsumer` multiple times on the same instance seems to be allowed?
254	This seems to be related to files. Maybe `FileIndexDataCollector`?
261	`override`
265	Simply `begin`, if the class is called `FileIndexDataCollector` . Similar below to match iterator naming convention.
276	I think this should be `public` as this is still implementing `IndexDataConsumer`.
334	I'd simply do: if (FileIncludeFilter == UnitIndexingOptions::FileIncludeFilterKind::UserOnly) if (isSystem...) return;
348	Same here. This should be `public`
365	The naming convention for the callback interfaces is `forEach*` e.g. `forEachFileDependency`. s/visitor/Callback/ (same below).
369	`forEachInclude`
372	`forEachModuleImport`
380	This is two classes in one, which is difficult to understand. Could you split it into `FileIndexDependencyCollector` and `FileIndexDependencyProvider` and have `FileIndexDependencyCollector` return a provider on finish (e.g. `Provider consume();`; you might want to copy/move the collected data into the provider). It would be easier to justify the behavior (e.g. what happens when you access the provider while the collector is still working?)
384	What does `Entries` contain? What files are added?
512	Instead of passing `ParentUnitConsumer`, consider checking the `Mod` before calling the function.
515	Non-factory static method is often a code smell. Any reason not to make these static methods private members? With that, you wouldn't need to pass along so many parameters. You could make them `const` if you don't want members to be modified.
522	Why is this overload public while others are private? Aren't they all used only in this class?
542	Any reason to close the anonymous namespace here? Shouldn't outlined definitions of `UnitDataConsumerActionImpl`'s methods also be in the anonymous namespace?
769	I think the inheritance of `IndexUnitDataConsumer` and the creation of factory should be in user code (e.g. implementation for on-disk persist-index-data should come from the compiler invocation code `ExecuteCompilerInvocation.cpp` or at least a separate file in the library that compiler invocation can use), and the user should only use `createUnitIndexingAction` by providing a factory. Currently, `createUnitIndexingAction` and `createIndexDataRecordingAction` are mostly identical except for the code that implements `IndexUnitDataConsumer` and creates the factory. The current `createIndexDataRecordingAction` would probably only be used by the compiler invocation, and we can keep the generalized `createUnitIndexingAction` in the public APIs.
776	The `UnitInfo` is ignored? What do we actually need it for?
780	`Base` doesn't seem to be a very meaningful name here.
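To make the collector/provider split suggested for line 380 concrete, here is a rough sketch. It borrows the class names proposed in the comment, but the members, the `addFile` method and the rvalue-qualified `consume()` are assumptions, not code from the patch. It also uses the `forEach*` callback naming requested for line 365.

```cpp
#include <functional>
#include <string>
#include <utility>
#include <vector>

// Read-only view of the collected dependencies.
class FileIndexDependencyProvider {
  std::vector<std::string> Files;

public:
  explicit FileIndexDependencyProvider(std::vector<std::string> Deps)
      : Files(std::move(Deps)) {}

  void forEachFileDependency(
      const std::function<void(const std::string &)> &Callback) const {
    for (const std::string &F : Files)
      Callback(F);
  }
};

// Mutable side: gathers files while indexing is in progress.
class FileIndexDependencyCollector {
  std::vector<std::string> Files;

public:
  void addFile(std::string Path) { Files.push_back(std::move(Path)); }

  // Rvalue-qualified so that handing out a provider visibly ends the
  // collector's life: callers must write std::move(Collector).consume().
  FileIndexDependencyProvider consume() && {
    return FileIndexDependencyProvider(std::move(Files));
  }
};
```

With this shape a provider cannot exist while its collector is still being filled, which answers the question of what happens on concurrent access by making it impossible to express.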

ioeric added inline comments. Dec 19 2017, 3:08 PM

include/clang/Index/IndexUnitDataConsumer.h
1 ↗	(On Diff #127568)	Sorry, asking you to also rename `IndexDataConsumer` is probably too much and out of the scope of this patch. I'm fine with `UnitIndexDataConsumer` or `UnitDataConsumer` or something similar for now without touching `IndexDataConsumer` :)

(I think I forgot to update the patch status :)

This revision now requires changes to proceed. Jan 3 2018, 5:54 AM

@ioeric I should have an updated patch up shortly with your inline comments addressed + new tests. Thanks again for reviewing!

include/clang/Index/IndexUnitDataConsumer.h
67 ↗	(On Diff #127568)	From here, my understanding is that it's an optimization to avoid the vtable being included in multiple translation units. I'm not sure if that's actually a problem, I was just following IndexDataConsumer's lead. Added a comment.
include/clang/Index/IndexingAction.h
78	Sorry, I'm not sure what you mean here. Users shouldn't need to know anything about `UnitDataConsumerActionImpl`, they just need to provide a lambda/function reference that takes a `CompilerInstance&` and a `UnitDetails` and returns an `IndexUnitDataConsumer` (or `UnitIndexDataConsumer` once I rename it). This gets called once per translation unit to get a distinct data consumer for each unit, i.e. for the main translation unit as well as for each of its dependent modules that the main unit's data consumer says should be indexed via `shouldIndexModuleDependency(...)`.
92	There's another public `index::` API for writing out index data for individual clang module files in the follow up patch that takes a `RecordingOptions` and is used externally, from Swift. This function's useful on the Swift side to get the `RecordingOptions` from `FrontendOptions` it has already set up.
lib/Index/IndexingAction.cpp
148	Oops. Yes, we do :-)
515	Sorry, there's missing context – they're used from another public API that's in the follow-up patch. I'll bring that over and make these top-level static functions, since they don't belong exclusively to IndexDataConsumerActionImpl.
522	Same as above – this is called from a public `index::` API in the follow-up patch.
769	`IndexUnitDataRecorder` here is just a stub I added when I split the patch up – the follow-up revision has it in a separate file. I'll move the separate files to this patch and stub out the method bodies with TODOs instead. I've made `createIndexDataRecordingAction` call `createUnitIndexingAction` to remove the duplication, and pulled it, `RecordingOptions` and `getRecordingOptionsFromFrontendOptions` to a new header (`RecordingAction.h`) that `ExecuteCompilerInvocation.cpp` uses. Does that sound ok?
776	It should be passed to IndexUnitDataRecorder to write out info about the unit itself. This was just me splitting the patch badly.
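The factory contract described for line 78 (called once for the main unit, then once for each dependent module that the main unit's consumer accepts via `shouldIndexModuleDependency(...)`) can be sketched as below. Every type here is a simplified stand-in with a `Sketch` suffix; the real factory also receives a `CompilerInstance&`, which is omitted, and the `.pcm` output name and the "Sys" prefix policy are purely illustrative.

```cpp
#include <functional>
#include <memory>
#include <string>
#include <vector>

// Stand-in for the per-unit details handed to the factory.
struct UnitDetailsSketch {
  std::string OutputFile;
  bool IsModule;
};

// Stand-in for the unit data consumer interface.
class UnitDataConsumerSketch {
public:
  virtual ~UnitDataConsumerSketch() = default;
  virtual bool shouldIndexModuleDependency(const std::string &ModuleName) = 0;
};

// Illustrative policy: skip modules whose name starts with "Sys".
class UserModulesOnlyConsumer : public UnitDataConsumerSketch {
public:
  bool shouldIndexModuleDependency(const std::string &ModuleName) override {
    return ModuleName.rfind("Sys", 0) != 0;
  }
};

using ConsumerFactory = std::function<std::unique_ptr<UnitDataConsumerSketch>(
    const UnitDetailsSketch &)>;

// Requests one consumer for the main unit and one per accepted module.
// Returns how many consumers the factory was asked to create.
unsigned indexUnit(const ConsumerFactory &Factory,
                   const UnitDetailsSketch &Main,
                   const std::vector<std::string> &Modules) {
  unsigned Created = 0;
  std::unique_ptr<UnitDataConsumerSketch> MainConsumer = Factory(Main);
  ++Created;
  for (const std::string &M : Modules) {
    if (!MainConsumer->shouldIndexModuleDependency(M))
      continue;
    UnitDetailsSketch ModDetails{M + ".pcm", true};
    std::unique_ptr<UnitDataConsumerSketch> ModConsumer = Factory(ModDetails);
    (void)ModConsumer; // a real action would feed index data to it
    ++Created;
  }
  return Created;
}
```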

Applied the various refactorings suggested by @ioeric
Extended c-index-test with a new option to print out the collected unit indexing data, and
Added tests for the unit indexing functionality using the new option
Fixed formatting

Nice! Thanks for making the refactoring and adding tests! I think this is good to go now.

I'm not very familiar with code outside of the index library (Driver, Basic etc), but the changes seem reasonable to me. Feel free to get another pair of eyes for them ;)

include/clang/Index/RecordingAction.h
43	Add a FIXME that this is not implemented yet.
lib/Index/IndexingAction.cpp
769	Sounds good. Thanks for the explanation!

This revision is now accepted and ready to land. Jan 19 2018, 4:10 AM

I'm wondering if there is any further plan for this? ;)

In D39050#1004937, @ioeric wrote:

I'm wondering if there is any further plan for this? ;)

I'd like to comment on the amount of data that will be stored but that can be done outside this review. I still have a few things to figure out before reaching a conclusion.

@ioeric I'm working on a few other priorities over the next few weeks, sorry, but should get back to this relatively soon after that.
I would just land it, but I expect some downstream breakage I want to make sure I have time to fix.

@malaperle Sounds good – I'll keep an eye out for it!

In D39050#949185, @malaperle wrote:

In D39050#948500, @akyrtzi wrote:

@malaperle, to clarify we are not suggesting that you write your own parser, the suggestion is to use clang in 'fast-scan' mode to get the structure of the declarations of a single file, see CXTranslationUnit_SingleFileParse (along with enabling skipping of bodies). We have found clang is super fast when you only try to get the structure of a file like this.

Thank you, that sounds very useful. I will try that and get some measurements.

We can make convenient APIs to provide the syntactic structure of declarations based on their location.

Perhaps just for the end-loc since it's pretty much guaranteed to be needed by everyone. But if it's very straightforward, perhaps that's not needed. I'll try and see.

But let's say we added the end-loc, is it enough ? If you want to implement the 'peek the definition' like Eclipse, then it is not enough, you also need to figure out if there are documentation comments associated with the declaration and also show those. Also what if you want to highlight the type signature of a function, then just storing the location of the closing brace of its body is not enough. There can be any arbitrary things you may want to get from the structure of the declaration (e.g. the parameter ranges), but we could provide an API to gather any syntactic structure info you may want.

That's a very good point. I guess in the back of my mind, I have the worry that one cannot extend what is stored, either for a different performance trade-off or for additional things. The fact that both clang and clangd have to agree on the format so that index-while-building can be used seems to make it inherently not possible to extend. But perhaps it's better to not overthink this for now.

I did a bit more of experimenting. For the end-loc, I changed my prototype so that the end-loc is not stored in the index but rather computed "on the fly" using SourceManager and Lexer only. For my little benchmark, I used the LLVM/Clang/Clangd code base which I queried for all references of "std" (the namespace) which is around 46K references in the index.

With end-loc in index: 3.45s on average (20 samples)
With end-loc computed on the fly: 11.33s on average (20 samples)
I also tried with Xcode but without too much success: it took about 30 secs to reach 45K results and then carried on for a long time and hung (although I didn't try to leave it for hours to see if it finished).

From my perspective, the extra time is quite substantial, and saving an integer per occurrence doesn't seem worth it in this case.

For computing the start/end-loc of function bodies, I tried the SingleFileParseMode and SkipFunctionBodies separately (as a start). The source I use this on looks like this:

#include "MyClass.h"

MyClass::MyClass() {
}

void MyClass::doOperation() {
}

With SingleFileParseMode, I get several errors:

MyClass.cpp:5:1: error: use of undeclared identifier 'MyClass'
MyClass.cpp:8:6: error: use of undeclared identifier 'MyClass'

Then I cannot obtain any Decl* at the position of doOperation. With SingleFileParseMode, I'm also a bit wary that not processing headers will result in many inaccuracies. From our perspective, we are more willing to sacrifice disk space in order to have more accuracy and speed. For comparison, the index I worked with containing all end-loc for occurrences and also function start/end is 201M for LLVM/Clang/Clangd which is small to us.

With SkipFunctionBodies alone, I can get the Decl* but FunctionDecl::getSourceRange() doesn't include the body, rather, it stops after the arguments.
It would be very nice if we could do this cheaply but it doesn't seem possible with those two flags alone. What did you have in mind for implementing an "API to gather any syntactic structure info" ?

In D39050#1021204, @malaperle wrote:
For computing the start/end-loc of function bodies, I tried the SingleFileParseMode and SkipFunctionBodies separately ( as a start). The source I use this on looks like this:

Given the discussion in https://reviews.llvm.org/D44247, I think we can do without the start/end-loc of function bodies and try some heuristics client-side. We can always revisit this later if necessary.

However, for the end-loc of occurrences, would you be OK with this being added? I think it would be a good compromise in terms of performance, simplicity and index size.

In D39050#1036249, @malaperle wrote:
In D39050#1021204, @malaperle wrote:
For computing the start/end-loc of function bodies, I tried the SingleFileParseMode and SkipFunctionBodies separately ( as a start). The source I use this on looks like this:
Given the discussion in https://reviews.llvm.org/D44247, I think we can do without the start/end-loc of function bodies and try some heuristics client-side. We can always revisit this later if necessary.

However, for the end-loc of occurrences, would you be OK with this being added? I think it would be a good compromise in terms of performance, simplicity and index size.

@malaperle Just to clarify, what's the particular end-loc we're talking about here? e.g. for a function call, would this be the end of the function's name, or the closing paren?
For the end of the name, couldn't this be derived from the start loc + symbol name length (barring token pastes and escaped new lines in the middle of identifiers, which hopefully aren't too common)?
I can see the value for the closing paren though.

@akyrtzi Are the numbers from Marc-Andre's experiment what you'd expect to see and is there anything else to try? I'm not familiar with those modes at all to comment, sorry. I assume any API to gather syntactic structure info would be based on those modes, right?

In D39050#1036394, @nathawes wrote:

@malaperle Just to clarify, what's the particular end-loc we're talking about here? e.g. for a function call, would this be the end of the function's name, or the closing paren?
For the end of the name, couldn't this be derived from the start loc + symbol name length (barring token pastes and escaped new lines in the middle of identifiers, which hopefully aren't too common)?
I can see the value for the closing paren though.

I mean the end of the name referencing the symbol, so that it can be highlighted properly when using the "find references in workspace" feature. There are cases where the name of the symbol itself is not present, for example "MyClass o1, o2;" (o1 and o2 reference the constructor), references to overloaded operators, etc.

In D39050#1037008, @malaperle wrote:

In D39050#1036394, @nathawes wrote:

@malaperle Just to clarify, what's the particular end-loc we're talking about here? e.g. for a function call, would this be the end of the function's name, or the closing paren?
For the end of the name, couldn't this be derived from the start loc + symbol name length (barring token pastes and escaped new lines in the middle of identifiers, which hopefully aren't too common)?
I can see the value for the closing paren though.

I mean the end of the name referencing the symbol, so that it can be highlighted properly when using the "find references in workspace" feature. There are cases where the name of the symbol itself is not present, for example "MyClass o1, o2;" (o1 and o2 reference the constructor), references to overloaded operators, etc.

Ah, I see – thanks! I was thinking all occurrences whose symbol name didn't actually appear at their location were marked with SymbolRole::Implicit, but that only seems to be true for the ObjC index data.

Hey Marc,

The fact that both clang and clangd have to agree on the format so that index-while-building can be used seems to make it inherently not possible to extend

I don't think "not possible to extend" is quite correct, we can make it so that the format allows optional data to be recorded.

On the topic of recording the end-loc, I agree it's not much data overall, but it will be useful to examine the uses closely and to figure out whether it's really required and whether it is at the same time inadequate for other uses.

I changed my prototype so that the end-loc is not stored in the index but rather computed "on the fly" using SourceManager and Lexer only.

I assume you used SingleFileParseMode+SkipFunctionBodies for this, right ?

For my little benchmark, I used the LLVM/Clang/Clangd code base which I queried for all references of "std" (the namespace) which is around 46K references in the index.

This is an interesting use case, and I can say we have some experience because Xcode has this functionality without requiring the end-loc for every reference.
So what it does is that it has a 'name' to look for (say 'foo' for the variable foo) and if it finds the name in the location then it highlights, otherwise if it doesn't find it (e.g. because it is an implicit reference) then it points to the location but doesn't highlight something. The same thing happens for operator overloads (the operators get highlighted at the reference location).
For implicit references it's most likely there's nothing to highlight so the end-loc will most likely be empty anyway (or same as start-loc ?) to indicate an empty range.
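A small sketch of the name-matching behavior described above: given a start location and the symbol's name, highlight the name if it is present at that location, and otherwise fall back to an empty range that only points at the location. `HighlightRange` and `computeHighlight` are hypothetical names, and this is not Xcode's actual implementation. The sketch deliberately does no identifier-boundary check, so a name that is a prefix of a longer identifier would still match.

```cpp
#include <cstddef>
#include <string>

struct HighlightRange {
  std::size_t Begin;
  std::size_t Length; // 0 means "point here, highlight nothing"
};

// Highlights Name if it appears in Text exactly at Offset; otherwise
// returns an empty range at Offset (e.g. for an implicit reference).
HighlightRange computeHighlight(const std::string &Text, std::size_t Offset,
                                const std::string &Name) {
  if (Offset <= Text.size() && Text.compare(Offset, Name.size(), Name) == 0)
    return {Offset, Name.size()};
  return {Offset, 0};
}
```

For `MyClass o1, o2;` a reference to the constructor recorded at the offset of `o1` yields an empty range, since the name `MyClass` is not at that location.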

With SingleFileParseMode, I get several errors:

Good point, the parser definitely needs recovery improvements in C++.

With SkipFunctionBodies alone, I can get the Decl* but FunctionDecl::getSourceRange() doesn't include the body

This seems strange, there's an EndRangeLoc field that should have been filled in, not exactly sure if it is a bug or omission.

Going back to the topic of what use cases end-loc covers, note that it still seems inadequate for peek-definition functionality. You can't set it to body-end loc (otherwise occurrences highlighting would highlight the whole body which I think is undesirable) and you still need to include doc-comments if they exist.

In D39050#1037796, @akyrtzi wrote:

Hey Marc,

The fact that both clang and clangd have to agree on the format so that index-while-building can be used seems to make it inherently not possible to extend

I don't think "not possible to extend" is quite correct, we can make it so that the format allows optional data to be recorded.

That would be good. How would one go about asking Clang to generate this extra information? Would a Clang Plugin be suitable for this? I don't know much about those but perhaps that could be one way to extend the basic behavior of "-index-store-path" in this way?

I changed my prototype so that the end-loc is not stored in the index but rather computed "on the fly" using SourceManager and Lexer only.

I assume you used SingleFileParseMode+SkipFunctionBodies for this, right ?

No, sorry the end-locs I meant there is for occurrences. Only the lexer was needed to get the end of the token. So for "MyClass o1, o2;" o1 and o2 get highlighted as references to the MyClass constructor.
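As a rough illustration of computing an occurrence's end on the fly from its start offset, the sketch below scans forward over identifier characters. It is a simplification standing in for the Lexer-based approach described above: it handles plain identifiers only (not operators, token pastes or escaped newlines), and `endOfIdentifier` is a hypothetical helper, not a Clang API.

```cpp
#include <cctype>
#include <cstddef>
#include <string>

// Returns the offset one past the identifier that starts at Start.
// If no identifier character is at Start, returns Start (empty range).
std::size_t endOfIdentifier(const std::string &Buffer, std::size_t Start) {
  std::size_t End = Start;
  while (End < Buffer.size() &&
         (std::isalnum(static_cast<unsigned char>(Buffer[End])) ||
          Buffer[End] == '_'))
    ++End;
  return End;
}
```

Applied to `MyClass o1, o2;` at the start offsets of `o1` and `o2`, this gives the ranges that would be highlighted as references to the constructor.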

For my little benchmark, I used the LLVM/Clang/Clangd code base which I queried for all references of "std" (the namespace) which is around 46K references in the index.

This is an interesting use case, and I can say we have some experience because Xcode has this functionality without requiring the end-loc for every reference.
So what it does is that it has a 'name' to look for (say 'foo' for the variable foo) and if it finds the name in the location then it highlights, otherwise if it doesn't find it (e.g. because it is an implicit reference) then it points to the location but doesn't highlight something.

I think it's useful to highlight something even when the name is not there. For example in "MyClass o1, o2;" it feels natural that o1 and o2 would get highlighted.

The same thing happens for operator overloads (the operators get highlighted at the reference location).

It does? I can only seem to do a textual search. For example, if I look at "FileId::operator<", if I right-click in the middle of "operator<" and do "Find selected symbol in workspace", it seems to start a text based search because there are many results that are semantically unrelated.

For implicit references it's most likely there's nothing to highlight so the end-loc will most likely be empty anyway (or same as start-loc ?) to indicate an empty range.

I think for those cases the end of the token is probably suitable. Can you give examples of which implicit references you have in mind? Maybe another one (other than the constructor mentioned above) could be a function call like "passMeAStdString(MyStringRef)", here the "operator std::string" would be called and MyStringRef could be highlighted, I think it would make sense to the user that it gets called by passing this parameter by seeing the highlight.

Going back to the topic of what use cases end-loc covers, note that it still seems inadequate for peek-definition functionality. You can't set it to body-end loc (otherwise occurrences highlighting would highlight the whole body which I think is undesirable) and you still need to include doc-comments if they exist.

I think maybe I wasn't clear, I was thinking about two end-locs: end-locs of occurrences and end-locs of bodies. The end-loc of occurrences would be used for highlight when searching for all occurrences and the end-loc for bodies would be used for the peek definition. I think we can disregard end-locs of bodies for now.

malaperle added a subscriber: simark. Mar 16 2018, 11:51 AM

That would be good. How would one go about asking Clang to generate this extra information? Would a Clang Plugin be suitable for this?

That's an interesting idea that we could explore, but I don't have much experience with that mechanism to comment on.

Only the lexer was needed to get the end of the token

Ok, that's interesting, not sure why Xcode is so fast to highlight, did you reuse same SourceManager/Lexer/buffers for occurrences from same file ? We'd definitely add the end-loc if we cannot come up with a mechanism to highlight fast enough without it.

I think it's useful to highlight something even when the name is not there. For example in "MyClass o1, o2;" it feels natural that o1 and o2 would get highlighted.

To clarify, even with implicit references the start loc points to something. In this case the implicit references can have start locs for the o1 and o2 identifiers and the end result for the UI will be the same (o1 and o2 get highlighted) even without having end-locs for all references.

It does? I can only seem to do a textual search.

The example I tried is the following. If you could file a bug report for the test case that did not work as you expected it would be much appreciated!

class Something1 {
public:
    Something1() {}
    ~Something1() {}
    operator int() {
        return 0;
    }

    friend int operator <<(Something1 &p, Something1 &p2) {
        return 0;
    }
};

void foo1(Something1 p1, Something1 p2) {
    p1 << p2;
    p1 << p2;
}

here the "operator std::string" would be called and MyStringRef could be highlighted

Even without end-loc, the start loc could point to MyStringRef and you could highlight it.

In D39050#1040501, @akyrtzi wrote:

That would be good. How would one go about asking Clang to generate this extra information? Would a Clang Plugin be suitable for this?

That's an interesting idea that we could explore, but I don't have much experience with that mechanism to comment on.

Only the lexer was needed to get the end of the token

Ok, that's interesting, not sure why Xcode is so fast to highlight, did you reuse same SourceManager/Lexer/buffers for occurrences from same file ? We'd definitely add the end-loc if we cannot come up with a mechanism to highlight fast enough without it.

I don't think Xcode is quite fast; it's about 10 times slower (although I'm not sure it really finished) than when I use my branch that has the end-loc. I would try end-locs in Xcode if I could, to compare :) So I don't really know where the bottleneck is in Xcode. Comparing oranges to oranges, it's 4 times slower without end-locs compared to with end-locs on my branch. It does use the same SourceManager for the 46K references and I verified that it uses the same buffers, etc.
I'll put the numbers here again for readability.

For my little benchmark, I used the LLVM/Clang/Clangd code base which I queried for all references of "std" (the namespace) which is around 46K references in the index.

With end-loc in index: 3.45s on average (20 samples)
With end-loc computed on the fly: 11.33s on average (20 samples)
I also tried with Xcode but without too much success: it took about 30 secs to reach 45K results and then carried on for a long time and hung (although I didn't try to leave it for hours to see if it finished).
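
To make the trade-off concrete, here is a minimal sketch (not the code from the branch being benchmarked) of what "end-loc computed on the fly" amounts to: for every occurrence the client has to open the source buffer and scan forward from the start offset. `Occurrence`, `identifierEnd` and `highlightedText` are hypothetical names, and the identifier scan is a simplified stand-in for going through Clang's Lexer, which also has to deal with operators, macros and so on.

```cpp
#include <cassert>
#include <cctype>
#include <cstddef>
#include <string>

// Hypothetical occurrence record: StartOffset is always stored, EndOffset is
// only meaningful when the index also stores end locations.
struct Occurrence {
  std::size_t StartOffset;
  std::size_t EndOffset; // 0 means "not stored in the index".
};

// Simplified stand-in for re-lexing: scan forward over identifier characters.
std::size_t identifierEnd(const std::string &Buffer, std::size_t Start) {
  std::size_t End = Start;
  while (End < Buffer.size() &&
         (std::isalnum(static_cast<unsigned char>(Buffer[End])) ||
          Buffer[End] == '_'))
    ++End;
  return End;
}

// Returns the text to highlight for an occurrence. With a stored end offset
// no scanning is needed; without it the buffer must be inspected.
std::string highlightedText(const std::string &Buffer, const Occurrence &Occ) {
  std::size_t End = Occ.EndOffset != 0
                        ? Occ.EndOffset
                        : identifierEnd(Buffer, Occ.StartOffset);
  return Buffer.substr(Occ.StartOffset, End - Occ.StartOffset);
}
```

With the buffer `"MyClass o1, o2;"`, an occurrence starting at offset 8 with no stored end yields `o1` after a scan, whereas an occurrence carrying both offsets needs no access to the buffer contents beyond the final `substr`.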

I think it's useful to highlight something even when the name is not there. For example in "MyClass o1, o2;" it feels natural that o1 and o2 would get highlighted.

To clarify, even with implicit references the start loc points to something. In this case the implicit references can have start locs for the o1 and o2 identifiers and the end result for the UI will be the same (o1 and o2 get highlighted) even without having end-locs for all references.

It's the same but slower. IMO, the trade off is not great. It's entirely subjective but I think 4-10 times slower in order to save an integer per occurrence is not worth it from my point of view.

Even without end-loc, the start loc could point to MyStringRef and you could highlight it.

(Same here, it's doable but faster if already in the index.)

It does? I can only seem to do a textual search.

The example I tried is the following. If you could file a bug report for the test case that did not work as you expected it would be much appreciated!

Sure, I'll give that a try and isolate it as much as I can. BTW, does it work for you on the LLVM code base?

Updated to apply on top-of-tree.

tschuett added a subscriber: tschuett. Jul 25 2018, 11:23 PM

gribozavr added a subscriber: gribozavr. Mar 5 2019, 12:55 AM

gribozavr added inline comments.

include/clang/Frontend/FrontendOptions.h
377	Please end comments with a period.
380	Would it make more sense to flip this boolean to positive? "IndexIncludeSystemSymbols"?
lib/Index/IndexingAction.cpp
102	Please don't duplicate the information from the signature in comments. No need to say that this function returns an IndexASTConsumer (twice, in the first sentence and in the \returns clause), the code already says that. Also, "The compiler instance used to process the input" does not mean much to me either.
154	No semicolon.
163	No semicolon.
186	No semicolon.
275	Please don't duplicate type information from the signature in the comment.
283	I don't understand... this is not really the user-specified output file.
303	Please don't duplicate type information from the signature in the comment.
lib/Index/IndexingContext.h
40	Please add a period at the end of the comment.
44	DirEntries => IsSystemDirEntry?
46	Triple slashes for doc comments.
46	Unclear how a boolean can keep track of the last check. Did you mean "Whether the file is a system file or not. This value is a cache." If so, please rename the variable to something like IsSystemFileCache.
test/Index/Core/index-source.mm
2	No need to specify check-prefixes=CHECK.
test/Index/Core/index-unit.mm
1	This test is very difficult to read... it is just a dump of random internal data structures... what do you think about converting it to a unit test?
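
The IndexingContext.h remarks above suggest that the boolean is really a cached answer for the most recently checked file. A minimal sketch of such a single-entry cache follows, assuming that reading is correct. `SystemFileCache`, the integer `FileID` and the `Lookup` callback are hypothetical stand-ins, not the types used in the patch.

```cpp
#include <cassert>
#include <functional>
#include <utility>

// Hypothetical single-entry cache: remembers the answer for the most recently
// queried file so repeated queries for the same file skip the lookup.
class SystemFileCache {
public:
  using FileID = int; // Stand-in for a real file identifier type.

  explicit SystemFileCache(std::function<bool(FileID)> Lookup)
      : Lookup(std::move(Lookup)) {}

  bool isSystemFile(FileID FID) {
    if (!HasCachedEntry || FID != LastFileChecked) {
      IsSystemFileCache = Lookup(FID); // Slow path: ask the real source.
      LastFileChecked = FID;
      HasCachedEntry = true;
    }
    return IsSystemFileCache;
  }

private:
  std::function<bool(FileID)> Lookup;
  FileID LastFileChecked = 0;
  bool HasCachedEntry = false;
  /// Whether LastFileChecked is a system file. This value is a cache.
  bool IsSystemFileCache = false;
};
```

Naming the member after what it caches, as in the sketch, makes it clear that the boolean is only valid together with the remembered file.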

Herald added a subscriber: jdoerfert. Mar 5 2019, 12:55 AM

akyrtzi added a reviewer: jkorous. Mar 6 2019, 10:07 AM

mgrang added inline comments. Mar 6 2019, 10:17 AM

lib/Index/FileIndexData.cpp
31	Please use range-based llvm::sort instead of std::sort: llvm::sort(Sorted); See https://llvm.org/docs/CodingStandards.html#beware-of-non-deterministic-sorting-order-of-equal-elements
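
As a rough illustration of the concern behind this comment: when the comparison looks at only one field, the relative order of elements that compare equal is unspecified under `std::sort`, so the output can depend on the input order. The sketch below shows one way to make the result deterministic by adding a tie-breaker. `Occ` and `SymbolID` are hypothetical simplifications, not the struct from the patch, and the tie-breaker is an assumption about what could distinguish two occurrences.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <tuple>
#include <vector>

// Hypothetical, simplified occurrence: the struct in the patch also carries
// roles, a declaration pointer and relations.
struct Occ {
  unsigned Offset;
  unsigned SymbolID; // Stand-in for whatever else distinguishes occurrences.
};

// Comparing on Offset alone leaves the order of equal-offset occurrences
// unspecified. Adding a tie-breaker makes the result independent of the
// order in which the occurrences were collected.
void sortDeterministically(std::vector<Occ> &Occs) {
  std::sort(Occs.begin(), Occs.end(), [](const Occ &L, const Occ &R) {
    return std::tie(L.Offset, L.SymbolID) < std::tie(R.Offset, R.SymbolID);
  });
}
```

Two vectors holding the same occurrences in different orders end up identical after `sortDeterministically`, which is the property the coding standard asks for.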

jkorous commandeered this revision. Mar 6 2019, 10:44 AM

jkorous edited reviewers, added: nathawes; removed: jkorous.

Herald added a subscriber: dexonsmith. Mar 6 2019, 10:44 AM

It's time to officially abandon these patches in favor of a new push for upstreaming index-while-building.

Current reviews in progress
https://reviews.llvm.org/D58749
https://reviews.llvm.org/D58418

RFC
http://lists.llvm.org/pipermail/cfe-dev/2019-February/061432.html

I'll address comments for this patch in the new set of patches.

@gribozavr I haven't put up this part of the code for the new round of review yet. I will keep this in mind.

@mgrang This already landed in edbbe470f66 as clang/lib/Index/FileIndexRecord.cpp but luckily the implementation isn't using sort() at all. Thanks for pointing this out anyway!

akyrtzi added inline comments. Mar 6 2019, 11:36 AM

include/clang/Frontend/FrontendOptions.h
380	@jkorous I noticed this name can be misleading, it may seem as if what this does is "avoid indexing system symbol occurrences" but what it actually does is "avoid indexing symbol occurrences from system files". We should rename it to "IndexIgnoreSystemHeaders" or "IndexIncludeSystemHeaders" per Dmitri's suggestion.

Diff 153190

include/clang/Basic/AllDiagnostics.h

[15 lines not shown]
  #define LLVM_CLANG_BASIC_ALLDIAGNOSTICS_H

  #include "clang/AST/ASTDiagnostic.h"
  #include "clang/AST/CommentDiagnostic.h"
  #include "clang/Analysis/AnalysisDiagnostic.h"
  #include "clang/CrossTU/CrossTUDiagnostic.h"
  #include "clang/Driver/DriverDiagnostic.h"
  #include "clang/Frontend/FrontendDiagnostic.h"
+ #include "clang/Index/IndexDiagnostic.h"
  #include "clang/Lex/LexDiagnostic.h"
  #include "clang/Parse/ParseDiagnostic.h"
  #include "clang/Sema/SemaDiagnostic.h"
  #include "clang/Serialization/SerializationDiagnostic.h"
  #include "clang/Tooling/Refactoring/RefactoringDiagnostic.h"

  namespace clang {
  template <size_t SizeOfStr, typename FieldType>
[11 lines not shown]

include/clang/Basic/CMakeLists.txt

  macro(clang_diag_gen component)
  clang_tablegen(Diagnostic${component}Kinds.inc
  -gen-clang-diags-defs -clang-component=${component}
  SOURCE Diagnostic.td
  TARGET ClangDiagnostic${component})
  endmacro(clang_diag_gen)

  clang_diag_gen(Analysis)
  clang_diag_gen(AST)
  clang_diag_gen(Comment)
  clang_diag_gen(Common)
  clang_diag_gen(CrossTU)
  clang_diag_gen(Driver)
  clang_diag_gen(Frontend)
+ clang_diag_gen(Index)
  clang_diag_gen(Lex)
  clang_diag_gen(Parse)
  clang_diag_gen(Refactoring)
  clang_diag_gen(Sema)
  clang_diag_gen(Serialization)
  clang_tablegen(DiagnosticGroups.inc -gen-clang-diag-groups
  SOURCE Diagnostic.td
  TARGET ClangDiagnosticGroups)
[28 lines not shown]

include/clang/Basic/Diagnostic.td

[139 lines not shown]
  // Definitions for Diagnostics.
  include "DiagnosticASTKinds.td"
  include "DiagnosticAnalysisKinds.td"
  include "DiagnosticCommentKinds.td"
  include "DiagnosticCommonKinds.td"
  include "DiagnosticCrossTUKinds.td"
  include "DiagnosticDriverKinds.td"
  include "DiagnosticFrontendKinds.td"
+ include "DiagnosticIndexKinds.td"
  include "DiagnosticLexKinds.td"
  include "DiagnosticParseKinds.td"
  include "DiagnosticRefactoringKinds.td"
  include "DiagnosticSemaKinds.td"
  include "DiagnosticSerializationKinds.td"

include/clang/Basic/DiagnosticGroups.td

[327 lines not shown]
  def MethodSignatures : DiagGroup<"method-signatures">;
  def MismatchedParameterTypes : DiagGroup<"mismatched-parameter-types">;
  def MismatchedReturnTypes : DiagGroup<"mismatched-return-types">;
  def MismatchedTags : DiagGroup<"mismatched-tags">;
  def MissingFieldInitializers : DiagGroup<"missing-field-initializers">;
  def ModuleBuild : DiagGroup<"module-build">;
  def ModuleConflict : DiagGroup<"module-conflict">;
  def ModuleFileExtension : DiagGroup<"module-file-extension">;
+ def IndexStore : DiagGroup<"index-store">;
  def NewlineEOF : DiagGroup<"newline-eof">;
  def Nullability : DiagGroup<"nullability">;
  def NullabilityDeclSpec : DiagGroup<"nullability-declspec">;
  def NullabilityInferredOnNestedType : DiagGroup<"nullability-inferred-on-nested-type">;
  def NullableToNonNullConversion : DiagGroup<"nullable-to-nonnull-conversion">;
  def NullabilityCompletenessOnArrays : DiagGroup<"nullability-completeness-on-arrays">;
  def NullabilityCompleteness : DiagGroup<"nullability-completeness",
  [NullabilityCompletenessOnArrays]>;
[668 lines not shown]

include/clang/Basic/DiagnosticIDs.h

[34 lines not shown; context: enum {]
  DIAG_SIZE_LEX = 400,
  DIAG_SIZE_PARSE = 500,
  DIAG_SIZE_AST = 150,
  DIAG_SIZE_COMMENT = 100,
  DIAG_SIZE_CROSSTU = 100,
  DIAG_SIZE_SEMA = 3500,
  DIAG_SIZE_ANALYSIS = 100,
  DIAG_SIZE_REFACTORING = 1000,
+ DIAG_SIZE_INDEX = 100,
  };
  // Start position for diagnostics.
  enum {
  DIAG_START_COMMON = 0,
  DIAG_START_DRIVER = DIAG_START_COMMON + DIAG_SIZE_COMMON,
  DIAG_START_FRONTEND = DIAG_START_DRIVER + DIAG_SIZE_DRIVER,
  DIAG_START_SERIALIZATION = DIAG_START_FRONTEND + DIAG_SIZE_FRONTEND,
  DIAG_START_LEX = DIAG_START_SERIALIZATION + DIAG_SIZE_SERIALIZATION,
  DIAG_START_PARSE = DIAG_START_LEX + DIAG_SIZE_LEX,
  DIAG_START_AST = DIAG_START_PARSE + DIAG_SIZE_PARSE,
  DIAG_START_COMMENT = DIAG_START_AST + DIAG_SIZE_AST,
  DIAG_START_CROSSTU = DIAG_START_COMMENT + DIAG_SIZE_CROSSTU,
  DIAG_START_SEMA = DIAG_START_CROSSTU + DIAG_SIZE_COMMENT,
  DIAG_START_ANALYSIS = DIAG_START_SEMA + DIAG_SIZE_SEMA,
  DIAG_START_REFACTORING = DIAG_START_ANALYSIS + DIAG_SIZE_ANALYSIS,
- DIAG_UPPER_LIMIT = DIAG_START_REFACTORING + DIAG_SIZE_REFACTORING
+ DIAG_START_INDEX = DIAG_START_REFACTORING + DIAG_SIZE_REFACTORING,
+ DIAG_UPPER_LIMIT = DIAG_START_INDEX + DIAG_SIZE_INDEX,
  };

  class CustomDiagInfo;

  /// All of the diagnostics that can be emitted by the frontend.
  typedef unsigned kind;

  // Get typedefs for common diagnostics.
[276 lines not shown]
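
The change above appends a new block of diagnostic IDs by giving it a size, starting it where the previous block ends, and moving the upper limit past it. The following toy version shows the same arithmetic with made-up component names and sizes; `SIZE_A`, `START_NEW`, `isInNewRange` and the rest are illustrative only and do not exist in Clang.

```cpp
#include <cassert>

// Toy version of the range layout: each component owns a contiguous block of
// IDs, and each START is the previous START plus the previous SIZE.
enum {
  SIZE_A = 300,
  SIZE_B = 100,
  SIZE_NEW = 100, // The newly appended component.
};

enum {
  START_A = 0,
  START_B = START_A + SIZE_A,
  START_NEW = START_B + SIZE_B,       // New block begins where B ends.
  UPPER_LIMIT = START_NEW + SIZE_NEW, // Limit moves up by the new size.
};

// True when ID falls inside the block reserved for the new component.
constexpr bool isInNewRange(int ID) {
  return ID >= static_cast<int>(START_NEW) &&
         ID < static_cast<int>(UPPER_LIMIT);
}

static_assert(START_NEW == 400, "new block starts where B ends");
static_assert(UPPER_LIMIT == 500, "limit grows by the new block's size");
```

Because the new block is appended after the last existing one, the starting positions of all earlier blocks stay the same.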

include/clang/Basic/DiagnosticIndexKinds.td

This file was added.

				//==--- DiagnosticIndexKinds.td - indexing diagnostics --------------------===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//

				//===----------------------------------------------------------------------===//
				// Indexing Diagnostics
				//===----------------------------------------------------------------------===//

				let Component = "Index" in {

				let CategoryName = "Index Store Issue" in {

				def err_index_store_dir_create_failed : Error<"failed creating the index store "
				"directory: %0">;
				def err_index_store_file_status_failed : Error<"failed file status check: %0">;
				def err_index_store_record_write_failed : Error<"failed writing record '%0': "
				"%1">;
				def err_index_store_unit_write_failed : Error<"failed writing unit data: %0">;

				def remark_index_producing_module_file_data : Remark<"producing index data for "
				"module file '%0'">,
				InGroup<IndexStore>;

				}

				} // end of Indexing diagnostics

include/clang/Driver/Job.h

[29 lines not shown]
  class Tool;

  // Re-export this as clang::driver::ArgStringList.
  using llvm::opt::ArgStringList;

  struct CrashReportInfo {
  StringRef Filename;
  StringRef VFSPath;
+ StringRef IndexStorePath;

- CrashReportInfo(StringRef Filename, StringRef VFSPath)
- : Filename(Filename), VFSPath(VFSPath) {}
+ CrashReportInfo(StringRef Filename, StringRef VFSPath,
+ StringRef IndexStorePath)
+ : Filename(Filename), VFSPath(VFSPath), IndexStorePath(IndexStorePath) {}
  };

  /// Command - An executable path/name and argument vector to
  /// execute.
  class Command {
  /// Source - The action which caused the creation of this job.
  const Action &Source;

[152 lines not shown]

include/clang/Driver/Options.td

[322 lines not shown]
  def objcmt_migrate_designated_init : Flag<["-"], "objcmt-migrate-designated-init">, Flags<[CC1Option]>,
  HelpText<"Enable migration to infer NS_DESIGNATED_INITIALIZER for initializer methods">;
  def objcmt_whitelist_dir_path: Joined<["-"], "objcmt-whitelist-dir-path=">, Flags<[CC1Option]>,
  HelpText<"Only modify files with a filename contained in the provided directory path">;
  // The misspelt "white-list" [sic] alias is due for removal.
  def : Joined<["-"], "objcmt-white-list-dir-path=">, Flags<[CC1Option]>,
  Alias<objcmt_whitelist_dir_path>;

+ def index_store_path : Separate<["-"], "index-store-path">, Flags<[CC1Option]>,
+ HelpText<"Enable indexing with the specified data store path">;
+ def index_ignore_system_symbols : Flag<["-"], "index-ignore-system-symbols">, Flags<[CC1Option]>,
+ HelpText<"Ignore symbols from system headers">;
+ def index_record_codegen_name : Flag<["-"], "index-record-codegen-name">, Flags<[CC1Option]>,
+ HelpText<"Record the codegen name for symbols">;
+
  // Make sure all other -ccc- options are rejected.
  def ccc_ : Joined<["-"], "ccc-">, Group<internal_Group>, Flags<[Unsupported]>;

  // Standard Options

  def _HASH_HASH_HASH : Flag<["-"], "###">, Flags<[DriverOption, CoreOption]>,
  HelpText<"Print (but do not run) the commands to run for this compilation">;
  def _DASH_DASH : Option<["--"], "", KIND_REMAINING_ARGS>,
[2,657 lines not shown]

include/clang/Frontend/CompilerInstance.h

[178 lines not shown; context: class CompilerInstance : public ModuleLoader {]
  /// the stream to a buffer_ostream. These are the buffer and the original
  /// stream.
  std::unique_ptr<llvm::raw_fd_ostream> NonSeekStream;

  /// The list of active output files.
  std::list<OutputFile> OutputFiles;

  /// Force an output buffer.
  std::unique_ptr<llvm::raw_pwrite_stream> OutputStream;

[Inline comment, ioeric (unsubmitted, done): nit: LLVM variable names start with upper-case letters.]
[Inline comment, ioeric (unsubmitted, done): `opts` and `action` are still lower-case.]
[Inline comment, ioeric (unsubmitted, done): It might make sense to define an alias for `std::function<std::unique_ptr<FrontendAction>(const FrontendOptions &opts, std::unique_ptr<FrontendAction> action)>`, which is used multiple times.]
+ typedef std::function<std::unique_ptr<FrontendAction>(
+ const FrontendOptions &Opts, std::unique_ptr<FrontendAction> Action)>
+ ActionWrapperTy;
+
+ /// \brief An optional callback function used to wrap any
+ /// GenerateModuleActions created and executed when loading modules.
+ ActionWrapperTy GenModuleActionWrapper;
+
  CompilerInstance(const CompilerInstance &) = delete;
  void operator=(const CompilerInstance &) = delete;
  public:
  explicit CompilerInstance(
  std::shared_ptr<PCHContainerOperations> PCHContainerOps =
  std::make_shared<PCHContainerOperations>(),
  MemoryBufferCache *SharedPCMCache = nullptr);
  ~CompilerInstance() override;
[248 lines not shown; context: public:]
  bool hasPreprocessor() const { return PP != nullptr; }

  /// Return the current preprocessor.
  Preprocessor &getPreprocessor() const {
  assert(PP && "Compiler instance has no preprocessor!");
  return *PP;
  }

- std::shared_ptr<Preprocessor> getPreprocessorPtr() { return PP; }
+ std::shared_ptr<Preprocessor> getPreprocessorPtr() const { return PP; }

  void resetAndLeakPreprocessor() {
  BuryPointer(new std::shared_ptr<Preprocessor>(PP));
  }

  /// Replace the current preprocessor.
  void setPreprocessor(std::shared_ptr<Preprocessor> Value);

[340 lines not shown; context: public:]
  bool hadModuleLoaderFatalFailure() const {
  return ModuleLoader::HadFatalFailure;
  }

  GlobalModuleIndex *loadGlobalModuleIndex(SourceLocation TriggerLoc) override;

  bool lookupMissingImports(StringRef Name, SourceLocation TriggerLoc) override;

+ void setGenModuleActionWrapper(ActionWrapperTy Wrapper) {
+ GenModuleActionWrapper = Wrapper;
+ };
+
+ ActionWrapperTy getGenModuleActionWrapper() const {
+ return GenModuleActionWrapper;
+ }
+
  void addDependencyCollector(std::shared_ptr<DependencyCollector> Listener) {
  DependencyCollectors.push_back(std::move(Listener));
  }

  void setExternalSemaSource(IntrusiveRefCntPtr<ExternalSemaSource> ESS);

  MemoryBufferCache &getPCMCache() const { return *PCMCache; }
  };

  } // end namespace clang

  #endif
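
The `ActionWrapperTy` alias above describes a callback that takes the frontend options and an action and hands back a (possibly wrapped) action. The sketch below shows that shape with toy types, assuming the wrapper delegates to the original action and then does its own work, as the revision summary describes. `ToyOptions`, `ToyAction`, `CompileAction`, `WrappingAction` and `wrapIfIndexing` are hypothetical stand-ins and not Clang classes.

```cpp
#include <cassert>
#include <functional>
#include <memory>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-ins for the real Clang classes, kept tiny on purpose.
struct ToyOptions {
  std::string IndexStorePath;
};

struct ToyAction {
  virtual ~ToyAction() = default;
  virtual void execute(std::vector<std::string> &Log) = 0;
};

struct CompileAction : ToyAction {
  void execute(std::vector<std::string> &Log) override {
    Log.push_back("compile");
  }
};

// Delegates to the wrapped action and then does its own extra work.
class WrappingAction : public ToyAction {
  std::unique_ptr<ToyAction> Wrapped;

public:
  explicit WrappingAction(std::unique_ptr<ToyAction> Wrapped)
      : Wrapped(std::move(Wrapped)) {}

  void execute(std::vector<std::string> &Log) override {
    Wrapped->execute(Log);
    Log.push_back("index");
  }
};

// Same shape as the alias in the patch: options in, action in, action out.
using ActionWrapperTy = std::function<std::unique_ptr<ToyAction>(
    const ToyOptions &Opts, std::unique_ptr<ToyAction> Act)>;

// Wraps only when an index store path was requested; otherwise returns the
// action unchanged.
std::unique_ptr<ToyAction> wrapIfIndexing(const ToyOptions &Opts,
                                          std::unique_ptr<ToyAction> Act) {
  if (Opts.IndexStorePath.empty())
    return Act;
  return std::unique_ptr<ToyAction>(new WrappingAction(std::move(Act)));
}
```

Storing the callback as a `std::function` lets the owner hold any callable with this signature, including a plain function such as `wrapIfIndexing` or a lambda that captures extra state.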

include/clang/Frontend/FrontendOptions.h

[368 lines not shown; context: ObjCMT_MigrateAll = (ObjCMT_Literals | ObjCMT_Subscripting |]
  ObjCMT_MigrateDecls | ObjCMT_PropertyDotSyntax)
  };
  unsigned ObjCMTAction = ObjCMT_None;
  std::string ObjCMTWhiteListPath;

  std::string MTMigrateDir;
  std::string ARCMTMigrateReportOut;

+ /// The path to write index data to
[Inline comment, ioeric (unsubmitted, done): It might make sense to also have documentation for these options here.]
[Inline comment, gribozavr (unsubmitted, not done): Please end comments with a period.]
+ std::string IndexStorePath;
+ /// Whether to ignore system files when writing out index data
+ unsigned IndexIgnoreSystemSymbols : 1;
[Inline comment, gribozavr (unsubmitted, not done): Would it make more sense to flip this boolean to positive? "IndexIncludeSystemSymbols"?]
[Inline comment, akyrtzi (unsubmitted, not done): @jkorous I noticed this name can be misleading, it may seem as if what this does is "avoid indexing system symbol occurrences" but what it actually does is "avoid indexing symbol occurrences from system files". We should rename it to "IndexIgnoreSystemHeaders" or "IndexIncludeSystemHeaders" per Dmitri's suggestion.]
+ /// Whether to include the codegen name of symbols in the index data
+ unsigned IndexRecordCodegenName : 1;
+
  /// The input files and their types.
  std::vector<FrontendInputFile> Inputs;

  /// When the input is a module map, the original module map file from which
  /// that map was inferred, if any (for umbrella modules).
  std::string OriginalModuleMap;

  /// The output file, if any.
[61 lines not shown; context: public:]
  FrontendOptions()
  : DisableFree(false), RelocatablePCH(false), ShowHelp(false),
  ShowStats(false), ShowTimers(false), ShowVersion(false),
  FixWhatYouCan(false), FixOnlyWarnings(false), FixAndRecompile(false),
  FixToTemporaries(false), ARCMTMigrateEmitARCErrors(false),
  SkipFunctionBodies(false), UseGlobalModuleIndex(true),
  GenerateGlobalModuleIndex(true), ASTDumpDecls(false),
  ASTDumpLookups(false), BuildingImplicitModule(false),
- ModulesEmbedAllFiles(false), IncludeTimestamps(true) {}
+ ModulesEmbedAllFiles(false), IncludeTimestamps(true),
+ IndexIgnoreSystemSymbols(false), IndexRecordCodegenName(false) {}

  /// getInputKindForExtension - Return the appropriate input kind for a file
  /// extension. For example, "c" would return InputKind::C.
  ///
  /// \return The input kind for the extension, or InputKind::Unknown if the
  /// extension is not recognized.
  static InputKind getInputKindForExtension(StringRef Extension);
  };

  } // namespace clang

  #endif // LLVM_CLANG_FRONTEND_FRONTENDOPTIONS_H

include/clang/Index/DeclOccurrence.h

This file was added.

				//===--- DeclOccurrence.h - An occurrence of a decl within a file ---------===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//

				#ifndef LLVM_CLANG_INDEX_DECLOCCURRENCE_H
				#define LLVM_CLANG_INDEX_DECLOCCURRENCE_H

				#include "clang/Basic/LLVM.h"
				#include "clang/Index/IndexSymbol.h"
				#include "llvm/ADT/ArrayRef.h"
				#include "llvm/ADT/SmallVector.h"

				namespace clang {
				class Decl;

				namespace index {

				struct DeclOccurrence {
				SymbolRoleSet Roles;
				unsigned Offset;
				const Decl *Dcl;
				SmallVector<SymbolRelation, 3> Relations;

				DeclOccurrence(SymbolRoleSet R, unsigned Offset, const Decl *D,
				ArrayRef<SymbolRelation> Relations)
				: Roles(R), Offset(Offset), Dcl(D),
				Relations(Relations.begin(), Relations.end()) {}

				friend bool operator<(const DeclOccurrence &LHS, const DeclOccurrence &RHS) {
				return LHS.Offset < RHS.Offset;
				}
				};

				} // namespace index
				[Inline comment, ioeric (unsubmitted, done): Nit: indentation. Tip: `git-clang-format` against the diff base can format all changed lines in your patch.]
				} // namespace clang

				#endif

include/clang/Index/IndexDataConsumer.h

Show All 36 Lines	public:

virtual void initialize(ASTContext &Ctx) {}		virtual void initialize(ASTContext &Ctx) {}

virtual void setPreprocessor(std::shared_ptr<Preprocessor> PP) {}		virtual void setPreprocessor(std::shared_ptr<Preprocessor> PP) {}

/// \returns true to continue indexing, or false to abort.		/// \returns true to continue indexing, or false to abort.
virtual bool handleDeclOccurence(const Decl *D, SymbolRoleSet Roles,		virtual bool handleDeclOccurence(const Decl *D, SymbolRoleSet Roles,
ArrayRef<SymbolRelation> Relations,		ArrayRef<SymbolRelation> Relations,
SourceLocation Loc, ASTNodeInfo ASTNode);		SourceLocation Loc, bool IsInSystemFile,
		ASTNodeInfo ASTNode);

/// \returns true to continue indexing, or false to abort.		/// \returns true to continue indexing, or false to abort.
virtual bool handleMacroOccurence(const IdentifierInfo *Name,		virtual bool handleMacroOccurence(const IdentifierInfo *Name,
const MacroInfo *MI, SymbolRoleSet Roles,		const MacroInfo *MI, SymbolRoleSet Roles,
SourceLocation Loc);		SourceLocation Loc, bool IsInSystemFile);

/// \returns true to continue indexing, or false to abort.		/// \returns true to continue indexing, or false to abort.
virtual bool handleModuleOccurence(const ImportDecl *ImportD,		virtual bool handleModuleOccurence(const ImportDecl *ImportD,
SymbolRoleSet Roles, SourceLocation Loc);		SymbolRoleSet Roles, SourceLocation Loc,
		bool IsInSystemFile);

virtual void finish() {}		virtual void finish() {}

private:		private:
virtual void _anchor();		virtual void _anchor();
};		};

} // namespace index		} // namespace index
} // namespace clang		} // namespace clang

#endif		#endif
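
The right-hand column adds an `IsInSystemFile` flag to each `handle*Occurence` callback. As a rough sketch of how a consumer might use that flag, the block below models a reduced callback with hypothetical stand-in types (`Decl`, `DataConsumer`, and `UserSymbolCollector` are illustrative; the real signature also takes roles, relations, a location, and AST node info):

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical stand-in for clang::Decl.
struct Decl {
  std::string Name;
};

// Reduced model of the consumer interface: only the declaration and the new
// IsInSystemFile flag are kept.
class DataConsumer {
public:
  virtual ~DataConsumer() = default;

  /// \returns true to continue indexing, or false to abort.
  virtual bool handleDeclOccurence(const Decl *, bool /*IsInSystemFile*/) {
    return true;
  }
};

// Collects names of declarations that occur outside system files.
class UserSymbolCollector : public DataConsumer {
public:
  std::vector<std::string> Names;

  bool handleDeclOccurence(const Decl *D, bool IsInSystemFile) override {
    assert(D && "expected a declaration");
    if (!IsInSystemFile)
      Names.push_back(D->Name);
    return true; // keep indexing
  }
};
```

Passing the flag in means the consumer does not need to recompute system-file status from the source location itself.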

include/clang/Index/IndexDiagnostic.h

This file was added.

				//===--- IndexDiagnostic.h - ------------------------------------*- C++ -*-===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//

				#ifndef LLVM_CLANG_INDEX_INDEXDIAGNOSTIC_H
				#define LLVM_CLANG_INDEX_INDEXDIAGNOSTIC_H

				#include "clang/Basic/Diagnostic.h"

				namespace clang {
				namespace diag {
				enum {
				#define DIAG(ENUM, FLAGS, DEFAULT_MAPPING, DESC, GROUP, SFINAE, NOWERROR, \
				SHOWINSYSHEADER, CATEGORY) \
				ENUM,
				#define INDEXSTART
				#include "clang/Basic/DiagnosticIndexKinds.inc"
				#undef DIAG
				NUM_BUILTIN_INDEX_DIAGNOSTICS
				};
				} // end namespace diag
				} // end namespace clang

				#endif // LLVM_CLANG_INDEX_INDEXDIAGNOSTIC_H
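
The header above builds its enum by defining `DIAG` and then including a generated `.inc` file. A self-contained sketch of the same X-macro technique follows; since the generated file is not shown here, a hypothetical list macro `SKETCH_INDEX_DIAGS` with two invented entries takes its place, and the two-argument `DIAG` is a simplification of the nine-argument form in the patch:

```cpp
#include <cassert>
#include <string>

// Hypothetical diagnostic list standing in for DiagnosticIndexKinds.inc.
// The entries are invented for illustration.
#define SKETCH_INDEX_DIAGS(X)                                      \
  X(err_sketch_store_unwritable, "cannot write index store")       \
  X(warn_sketch_store_stale, "index store is stale")

// First expansion: one enumerator per entry, plus a trailing count.
enum SketchDiag {
#define DIAG(ENUM, DESC) ENUM,
  SKETCH_INDEX_DIAGS(DIAG)
#undef DIAG
  NUM_SKETCH_INDEX_DIAGNOSTICS
};

// Second expansion of the same list: a parallel table of descriptions.
static const char *const SketchDiagText[] = {
#define DIAG(ENUM, DESC) DESC,
    SKETCH_INDEX_DIAGS(DIAG)
#undef DIAG
};

static_assert(sizeof(SketchDiagText) / sizeof(SketchDiagText[0]) ==
                  NUM_SKETCH_INDEX_DIAGNOSTICS,
              "one description per enumerator");

// Looks up the description for a diagnostic.
std::string describe(SketchDiag D) {
  assert(D < NUM_SKETCH_INDEX_DIAGNOSTICS);
  return SketchDiagText[D];
}
```

Expanding one list under different `DIAG` definitions keeps the enum and any tables derived from it in step, which is presumably why `DiagnosticIDs.cpp` includes the same `.inc` file again further down.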

include/clang/Index/IndexingAction.h

	//===--- IndexingAction.h - Frontend index action -------------------------===//			//===--- IndexingAction.h - Frontend AST indexing action ------------------===//
	//			//
	// The LLVM Compiler Infrastructure			// The LLVM Compiler Infrastructure
	//			//
	// This file is distributed under the University of Illinois Open Source			// This file is distributed under the University of Illinois Open Source
	// License. See LICENSE.TXT for details.			// License. See LICENSE.TXT for details.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef LLVM_CLANG_INDEX_INDEXINGACTION_H			#ifndef LLVM_CLANG_INDEX_INDEXINGACTION_H
	#define LLVM_CLANG_INDEX_INDEXINGACTION_H			#define LLVM_CLANG_INDEX_INDEXINGACTION_H

	#include "clang/Basic/LLVM.h"			#include "clang/Basic/LLVM.h"
	#include "llvm/ADT/ArrayRef.h"			#include "llvm/ADT/ArrayRef.h"
	#include <memory>			#include <memory>
				#include <string>

	namespace clang {			namespace clang {
	class ASTContext;			class ASTContext;
	class ASTReader;			class ASTReader;
	class ASTUnit;			class ASTUnit;
	class Decl;			class Decl;
	class FrontendAction;			class FrontendAction;

	namespace serialization {			namespace serialization {
	class ModuleFile;			class ModuleFile;
	}			}

	namespace index {			namespace index {
	class IndexDataConsumer;			class IndexDataConsumer;

				ioeric (Done): This should be removed? Some forward declarations above are not used as well.
	struct IndexingOptions {			struct IndexingOptions {
	enum class SystemSymbolFilterKind {			enum class SystemSymbolFilterKind {
	None,			None,
	DeclarationsOnly,			DeclarationsOnly,
	All,			All,
	};			};

	SystemSymbolFilterKind SystemSymbolFilter			SystemSymbolFilterKind SystemSymbolFilter =
	= SystemSymbolFilterKind::DeclarationsOnly;			SystemSymbolFilterKind::DeclarationsOnly;
	bool IndexFunctionLocals = false;			bool IndexFunctionLocals = false;
	};			};

				/// Creates a frontend action that provides decl occurrence information from the
				ioeric (Done): We are now mixing functionalities for Unit indexing and AST indexing actions in the same file. We might want to separate these into two headers, e.g. `UnitIndexingAction.h` and `ASTIndexingAction.h`. This would make it easier for users to find the right functions :)
				/// AST to the given \c IndexDataConsumer.
				///
	/// \param WrappedAction another frontend action to wrap over or null.			/// \param WrappedAction another frontend action to wrap over or null.
	std::unique_ptr<FrontendAction>			std::unique_ptr<FrontendAction>
	createIndexingAction(std::shared_ptr<IndexDataConsumer> DataConsumer,			createIndexingAction(std::shared_ptr<IndexDataConsumer> DataConsumer,
	IndexingOptions Opts,			IndexingOptions Opts,
	std::unique_ptr<FrontendAction> WrappedAction);			std::unique_ptr<FrontendAction> WrappedAction);

	void indexASTUnit(ASTUnit &Unit, IndexDataConsumer &DataConsumer,			void indexASTUnit(ASTUnit &Unit, IndexDataConsumer &DataConsumer,
	IndexingOptions Opts);			IndexingOptions Opts);

	void indexTopLevelDecls(ASTContext &Ctx, ArrayRef<const Decl *> Decls,			void indexTopLevelDecls(ASTContext &Ctx, ArrayRef<const Decl *> Decls,
	IndexDataConsumer &DataConsumer, IndexingOptions Opts);			IndexDataConsumer &DataConsumer, IndexingOptions Opts);

	void indexModuleFile(serialization::ModuleFile &Mod, ASTReader &Reader,			void indexModuleFile(serialization::ModuleFile &Mod, ASTReader &Reader,
	IndexDataConsumer &DataConsumer, IndexingOptions Opts);			IndexDataConsumer &DataConsumer, IndexingOptions Opts);

				ioeric (Done): Please add documentation for each field. It's not trivial what each field is for, especially some fields seem to be optional and some seem to be mutually exclusive.
	} // namespace index			} // namespace index
				ioeric (Done): These pointers suggest the lifetime of this struct is tied to some other struct, which makes the struct look a bit dangerous to use. Should we also carry a reference or a smart pointer to the underlying object that keeps these pointers valid? Would it be a `CompilerInstance` (guessing from `IndexUnitDataConsumerFactory`)?
	} // namespace clang			} // namespace clang

	#endif			#endif
				ioeric (Not Done): What is the intended user of this function? It's unclear how users could obtain a `ConsumerFactory` (i.e. `UnitDetails`) without the functionalities in `UnitDataConsumerActionImpl`. (Also see comment in the implementation of `createIndexDataRecordingAction`.)
				nathawes (Not Done): Sorry, I'm not sure what you mean here. Users shouldn't need to know anything about `UnitDataConsumerActionImpl`, they just need to provide a lambda/function reference that takes a `CompilerInstance&` and a `UnitDetails` and returns an `IndexUnitDataConsumer` (or `UnitIndexDataConsumer` once I rename it). This gets called once per translation unit to get a distinct data consumer for each unit, i.e. for the main translation unit as well as for each of its dependent modules that the main unit's data consumer says should be indexed via `shouldIndexModuleDependency(...)`.
				ioeric (Done): This is likely only useful for compiler invocation. I would put it in the compiler invocation code.
				nathawes (Not Done): There's another public `index::` API for writing out index data for individual clang module files in the follow up patch that takes a `RecordingOptions` and is used externally, from Swift. This function's useful on the Swift side to get the `RecordingOptions` from `FrontendOptions` it has already set up.

include/clang/Index/RecordingAction.h

This file was added.

				//===--- RecordingAction.h - Frontend index recording action --------------===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//

				#ifndef LLVM_CLANG_INDEX_INDEXRECORDINGACTION_H
				#define LLVM_CLANG_INDEX_INDEXRECORDINGACTION_H

				#include "clang/Basic/LLVM.h"
				#include "clang/Index/UnitIndexingAction.h"
				#include "llvm/ADT/ArrayRef.h"

				namespace clang {
				class CompilerInstance;
				class FrontendAction;
				class FrontendOptions;

				namespace serialization {
				class ModuleFile;
				}

				namespace index {

				struct RecordingOptions : UnitIndexingOptions {
				std::string DataDirPath;
				bool RecordSymbolCodeGenName = false;
				};

				RecordingOptions
				getRecordingOptionsFromFrontendOptions(const FrontendOptions &FEOpts);

				/// \brief Creates a frontend action that collects dependency, file inclusion
				/// and decl occurrence information for the translation unit and persists it to
				/// an index store.
				///
				/// FIXME: Not implemented yet.
				///
				/// \param WrappedAction another frontend action to wrap over or null.
				std::unique_ptr<FrontendAction>
				ioeric (Not Done): Add a FIXME that this is not implemented yet.
				createIndexDataRecordingAction(RecordingOptions RecordOpts,
				std::unique_ptr<FrontendAction> WrappedAction);

				/// Collects dependency, file inclusion and decl occurrence information for a
				/// \c ModuleFile and persists it to an index store. Does \b not check if
				/// the store already has up-to-date information for the provided module file.
				///
				/// FIXME: Not implemented yet.
				void recordIndexDataForModuleFile(serialization::ModuleFile *ModFile,
				RecordingOptions RecordOpts,
				const CompilerInstance &CI);

				} // namespace index
				} // namespace clang

				#endif

include/clang/Index/UnitIndexDataConsumer.h

This file was added.

				//===--- UnitIndexDataConsumer.h - Abstract unit index data consumer ------===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//

				#ifndef LLVM_CLANG_INDEX_UNITINDEXDATACONSUMER_H
				#define LLVM_CLANG_INDEX_UNITINDEXDATACONSUMER_H

				#include "clang/Basic/SourceLocation.h"
				#include "clang/Index/DeclOccurrence.h"
				#include "clang/Index/IndexSymbol.h"
				#include "llvm/ADT/ArrayRef.h"

				namespace clang {
				namespace serialization {
				class ModuleFile;
				}

				namespace index {

				/// Consumer for the index data associated with a translation unit.
				class UnitIndexDataConsumer {
				public:
				virtual ~UnitIndexDataConsumer() = default;

				/// Called for each file dependency of the translation unit.
				virtual void handleFileDependency(const FileEntry *FE, bool IsSystem) {}

				/// Called for each file include in the translation unit.
				virtual void handleInclude(const FileEntry *Source, unsigned Line,
				const FileEntry *Target) {}

				/// Called for each module imported by the translation unit.
				virtual void handleModuleImport(const serialization::ModuleFile &Mod,
				bool IsSystem) {}

				/// Determines whether to collect the index data associated with the given
				/// dependency of this translation unit or not.
				///
				/// \param Mod the module file of the dependency.
				/// \returns true to collect index data for \c Mod.
				virtual bool
				shouldIndexModuleDependency(const serialization::ModuleFile &Mod) {
				return false;
				}

				/// Called with the decl occurrences in each file and AST file dependency,
				/// sorted by offset.
				///
				/// \returns true to cancel consuming data for this translation unit. Finish
				/// will not be called.
				virtual bool
				handleFileOccurrences(FileID FID,
				ArrayRef<DeclOccurrence> OccurrencesSortedByOffset,
				bool IsSystem) {
				return false;
				}

				/// Called when there is no more data to handle.
				virtual void finish() {}

				private:
				// avoid duplicate vtables
				virtual void _anchor();
				};

				} // namespace index
				} // namespace clang

				#endif
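
The documentation above says that `handleFileOccurrences` returns true to cancel, and that `finish()` is then not called. The block below is a minimal sketch of a caller honoring that contract. It uses hypothetical stand-ins (`Occurrence`, `UnitConsumer`, `deliver`, `CappedConsumer`); the actual driver loop lives in the patch's implementation files and may differ:

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-ins for DeclOccurrence and a per-file batch of them.
struct Occurrence {
  unsigned Offset;
};
typedef std::vector<Occurrence> FileOccurrences;

// Reduced model of the unit consumer: only the two callbacks involved in
// cancellation are kept.
class UnitConsumer {
public:
  virtual ~UnitConsumer() = default;

  /// \returns true to cancel consuming data; finish() is then skipped.
  virtual bool handleFileOccurrences(const FileOccurrences &, bool) {
    return false;
  }
  virtual void finish() {}
};

// Hypothetical driver: feeds each file's occurrences to the consumer and
// calls finish() only if no callback asked to cancel.
// \returns true if all data was delivered.
bool deliver(UnitConsumer &C, const std::vector<FileOccurrences> &Files) {
  for (const FileOccurrences &F : Files)
    if (C.handleFileOccurrences(F, /*IsSystem=*/false))
      return false; // cancelled, so finish() is not called
  C.finish();
  return true;
}

// Example consumer that cancels once it has seen more than Limit occurrences.
class CappedConsumer : public UnitConsumer {
public:
  explicit CappedConsumer(std::size_t Limit) : Limit(Limit) {
    assert(Limit > 0);
  }

  std::size_t Seen = 0;
  bool Finished = false;

  bool handleFileOccurrences(const FileOccurrences &Occs, bool) override {
    Seen += Occs.size();
    return Seen > Limit;
  }
  void finish() override { Finished = true; }

private:
  std::size_t Limit;
};
```

Note the polarity: here true means stop, whereas the `IndexDataConsumer` callbacks earlier in the diff return true to continue.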

include/clang/Index/UnitIndexingAction.h

This file was added.

				//===--- UnitIndexingAction.h - Frontend unit indexing action -------------===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//

				#ifndef LLVM_CLANG_INDEX_UNITINDEXINGACTION_H
				#define LLVM_CLANG_INDEX_UNITINDEXINGACTION_H

				#include "clang/Basic/LLVM.h"
				#include "clang/Index/IndexingAction.h"
				#include "llvm/ADT/ArrayRef.h"

				namespace clang {
				class CompilerInstance;
				class FileEntry;
				class FrontendAction;
				class Module;

				namespace serialization {
				class ModuleFile;
				}

				namespace index {
				class UnitIndexDataConsumer;

				struct UnitIndexingOptions : IndexingOptions {
				enum class FileIncludeFilterKind {
				None,
				UserOnly, // only record includes inside non-system files.
				All,
				};

				bool IncludeSystemDependencies = true;
				FileIncludeFilterKind FileIncludeFilter = FileIncludeFilterKind::UserOnly;
				};

				/// \brief Information about a translation unit useful for indexing.
				///
				struct UnitDetails {
				const CompilerInstance &CI; ///< The owning compiler instance.

				Module *UnitModule; ///< The corresponding \c Module (module units only).
				std::string ModuleName; ///< The \c Module name (module units only).
				const FileEntry *RootFile; ///< The root \c FileEntry (non-module units only).

				std::string OutputFile; ///< The output file path.
				StringRef SysrootPath; ///< The "virtual system root" path.
				bool IsSystemUnit;
				bool IsModuleUnit;
				bool IsDebugCompilation;
				};

				/// Factory function type for producing UnitIndexDataConsumers for a given
				/// translation unit
				typedef std::function<std::unique_ptr<UnitIndexDataConsumer>(
				UnitDetails UnitInfo)>
				IndexUnitDataConsumerFactory;

				/// \brief Creates a frontend action that provides dependency, file inclusion
				/// and decl occurrence information for the translation unit, and optionally its
				/// module dependencies.
				///
				/// Decl occurrence information is provided per-file, sorted by offset.
				///
				/// \param ConsumerFactory provides a \c UnitIndexDataConsumer to use for a
				/// translation unit.
				/// \param WrappedAction another frontend action to wrap over or null.
				std::unique_ptr<FrontendAction>
				createUnitIndexingAction(IndexUnitDataConsumerFactory ConsumerFactory,
				UnitIndexingOptions UnitIndexOpts,
				std::unique_ptr<FrontendAction> WrappedAction);

				/// Collects and provides dependency, file inclusion and decl occurrence
				/// information for a \c ModuleFile to a \c UnitIndexDataConsumer constructed
				/// from the provided \c IndexUnitDataConsumerFactory.
				void indexModuleFile(serialization::ModuleFile &Mod, const CompilerInstance &CI,
				IndexUnitDataConsumerFactory UnitConsumerFactory,
				UnitIndexingOptions Opts);

				} // namespace index
				} // namespace clang

				#endif
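
`IndexUnitDataConsumerFactory` is a `std::function` that yields a fresh consumer for each unit. The block below sketches that factory pattern with hypothetical stand-ins (`UnitInfo` for `UnitDetails`, plus `UnitConsumer`, `LoggingConsumer`, `makeLoggingFactory`, and `indexUnits`); it assumes, following the review discussion, that the factory is invoked once for the main unit and once per indexed module dependency:

```cpp
#include <cassert>
#include <functional>
#include <memory>
#include <string>
#include <utility>
#include <vector>

// Hypothetical, reduced stand-in for UnitDetails.
struct UnitInfo {
  std::string OutputFile;
  bool IsModuleUnit;
};

// Reduced stand-in for UnitIndexDataConsumer.
class UnitConsumer {
public:
  virtual ~UnitConsumer() = default;
  virtual void finish() {}
};

// Same shape as the typedef in the patch: unit details in, owned consumer out.
typedef std::function<std::unique_ptr<UnitConsumer>(UnitInfo)> ConsumerFactory;

// Example consumer that records the unit's output file when it finishes.
class LoggingConsumer : public UnitConsumer {
public:
  LoggingConsumer(std::string Out, std::vector<std::string> &Log)
      : Out(std::move(Out)), Log(Log) {}
  void finish() override { Log.push_back(Out); }

private:
  std::string Out;
  std::vector<std::string> &Log;
};

// Builds a factory whose consumers all append to the same log. The log must
// outlive the factory and every consumer it creates.
ConsumerFactory makeLoggingFactory(std::vector<std::string> &Log) {
  return [&Log](UnitInfo Info) -> std::unique_ptr<UnitConsumer> {
    return std::unique_ptr<UnitConsumer>(
        new LoggingConsumer(Info.OutputFile, Log));
  };
}

// Hypothetical driver: requests one distinct consumer per unit.
void indexUnits(const ConsumerFactory &Factory,
                const std::vector<UnitInfo> &Units) {
  assert(Factory && "factory must be callable");
  for (const UnitInfo &U : Units)
    Factory(U)->finish();
}
```

Returning a `std::unique_ptr` gives each unit its own consumer state, so data for the main unit and for a module dependency cannot be mixed up by accident.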

include/clang/module.modulemap

Show First 20 Lines • Show All 63 Lines • ▼ Show 20 Lines	module Clang_Diagnostics {
requires cplusplus		requires cplusplus

module All { header "Basic/AllDiagnostics.h" export * }		module All { header "Basic/AllDiagnostics.h" export * }
module Analysis { header "Analysis/AnalysisDiagnostic.h" export * }		module Analysis { header "Analysis/AnalysisDiagnostic.h" export * }
module AST { header "AST/ASTDiagnostic.h" export * }		module AST { header "AST/ASTDiagnostic.h" export * }
module Comment { header "AST/CommentDiagnostic.h" export * }		module Comment { header "AST/CommentDiagnostic.h" export * }
module Driver { header "Driver/DriverDiagnostic.h" export * }		module Driver { header "Driver/DriverDiagnostic.h" export * }
module Frontend { header "Frontend/FrontendDiagnostic.h" export * }		module Frontend { header "Frontend/FrontendDiagnostic.h" export * }
		module Index { header "Index/IndexDiagnostic.h" export * }
module Lex { header "Lex/LexDiagnostic.h" export * }		module Lex { header "Lex/LexDiagnostic.h" export * }
module Parse { header "Parse/ParseDiagnostic.h" export * }		module Parse { header "Parse/ParseDiagnostic.h" export * }
module Sema { header "Sema/SemaDiagnostic.h" export * }		module Sema { header "Sema/SemaDiagnostic.h" export * }
module Serialization { header "Serialization/SerializationDiagnostic.h" export * }		module Serialization { header "Serialization/SerializationDiagnostic.h" export * }
module Refactoring { header "Tooling/Refactoring/RefactoringDiagnostic.h" export * }		module Refactoring { header "Tooling/Refactoring/RefactoringDiagnostic.h" export * }
}		}

module Clang_Driver {		module Clang_Driver {

lib/Basic/DiagnosticIDs.cpp

	Show First 20 Lines • Show All 83 Lines • ▼ Show 20 Lines
	VALIDATE_DIAG_SIZE(SERIALIZATION)			VALIDATE_DIAG_SIZE(SERIALIZATION)
	VALIDATE_DIAG_SIZE(LEX)			VALIDATE_DIAG_SIZE(LEX)
	VALIDATE_DIAG_SIZE(PARSE)			VALIDATE_DIAG_SIZE(PARSE)
	VALIDATE_DIAG_SIZE(AST)			VALIDATE_DIAG_SIZE(AST)
	VALIDATE_DIAG_SIZE(COMMENT)			VALIDATE_DIAG_SIZE(COMMENT)
	VALIDATE_DIAG_SIZE(SEMA)			VALIDATE_DIAG_SIZE(SEMA)
	VALIDATE_DIAG_SIZE(ANALYSIS)			VALIDATE_DIAG_SIZE(ANALYSIS)
	VALIDATE_DIAG_SIZE(REFACTORING)			VALIDATE_DIAG_SIZE(REFACTORING)
				VALIDATE_DIAG_SIZE(INDEX)
	#undef VALIDATE_DIAG_SIZE			#undef VALIDATE_DIAG_SIZE
	#undef STRINGIFY_NAME			#undef STRINGIFY_NAME

	} // namespace anonymous			} // namespace anonymous

	static const StaticDiagInfoRec StaticDiagInfo[] = {			static const StaticDiagInfoRec StaticDiagInfo[] = {
	#define DIAG(ENUM, CLASS, DEFAULT_SEVERITY, DESC, GROUP, SFINAE, NOWERROR, \			#define DIAG(ENUM, CLASS, DEFAULT_SEVERITY, DESC, GROUP, SFINAE, NOWERROR, \
	SHOWINSYSHEADER, CATEGORY) \			SHOWINSYSHEADER, CATEGORY) \
	Show All 9 Lines
	#include "clang/Basic/DiagnosticLexKinds.inc"			#include "clang/Basic/DiagnosticLexKinds.inc"
	#include "clang/Basic/DiagnosticParseKinds.inc"			#include "clang/Basic/DiagnosticParseKinds.inc"
	#include "clang/Basic/DiagnosticASTKinds.inc"			#include "clang/Basic/DiagnosticASTKinds.inc"
	#include "clang/Basic/DiagnosticCommentKinds.inc"			#include "clang/Basic/DiagnosticCommentKinds.inc"
	#include "clang/Basic/DiagnosticCrossTUKinds.inc"			#include "clang/Basic/DiagnosticCrossTUKinds.inc"
	#include "clang/Basic/DiagnosticSemaKinds.inc"			#include "clang/Basic/DiagnosticSemaKinds.inc"
	#include "clang/Basic/DiagnosticAnalysisKinds.inc"			#include "clang/Basic/DiagnosticAnalysisKinds.inc"
	#include "clang/Basic/DiagnosticRefactoringKinds.inc"			#include "clang/Basic/DiagnosticRefactoringKinds.inc"
				#include "clang/Basic/DiagnosticIndexKinds.inc"
	#undef DIAG			#undef DIAG
	};			};

	static const unsigned StaticDiagInfoSize = llvm::array_lengthof(StaticDiagInfo);			static const unsigned StaticDiagInfoSize = llvm::array_lengthof(StaticDiagInfo);

	/// GetDiagInfo - Return the StaticDiagInfoRec entry for the specified DiagID,			/// GetDiagInfo - Return the StaticDiagInfoRec entry for the specified DiagID,
	/// or null if the ID is invalid.			/// or null if the ID is invalid.
	static const StaticDiagInfoRec *GetDiagInfo(unsigned DiagID) {			static const StaticDiagInfoRec *GetDiagInfo(unsigned DiagID) {
	Show All 23 Lines
	CATEGORY(LEX, SERIALIZATION)			CATEGORY(LEX, SERIALIZATION)
	CATEGORY(PARSE, LEX)			CATEGORY(PARSE, LEX)
	CATEGORY(AST, PARSE)			CATEGORY(AST, PARSE)
	CATEGORY(COMMENT, AST)			CATEGORY(COMMENT, AST)
	CATEGORY(CROSSTU, COMMENT)			CATEGORY(CROSSTU, COMMENT)
	CATEGORY(SEMA, CROSSTU)			CATEGORY(SEMA, CROSSTU)
	CATEGORY(ANALYSIS, SEMA)			CATEGORY(ANALYSIS, SEMA)
	CATEGORY(REFACTORING, ANALYSIS)			CATEGORY(REFACTORING, ANALYSIS)
				CATEGORY(INDEX, REFACTORING)
	#undef CATEGORY			#undef CATEGORY

	// Avoid out of bounds reads.			// Avoid out of bounds reads.
	if (ID + Offset >= StaticDiagInfoSize)			if (ID + Offset >= StaticDiagInfoSize)
	return nullptr;			return nullptr;

	assert(ID < StaticDiagInfoSize && Offset < StaticDiagInfoSize);			assert(ID < StaticDiagInfoSize && Offset < StaticDiagInfoSize);


lib/Driver/Driver.cpp

Show First 20 Lines • Show All 1,283 Lines • ▼ Show 20 Lines	if (StringRef(TempFile).endswith(".cache")) {
// In some cases (modules) we'll dump extra data to help with reproducing		// In some cases (modules) we'll dump extra data to help with reproducing
// the crash into a directory next to the output.		// the crash into a directory next to the output.
VFS = llvm::sys::path::filename(TempFile);		VFS = llvm::sys::path::filename(TempFile);
llvm::sys::path::append(VFS, "vfs", "vfs.yaml");		llvm::sys::path::append(VFS, "vfs", "vfs.yaml");
}		}
}		}

// Assume associated files are based off of the first temporary file.		// Assume associated files are based off of the first temporary file.
CrashReportInfo CrashInfo(TempFiles[0], VFS);		CrashReportInfo CrashInfo(
		TempFiles[0], VFS,
		C.getArgs().getLastArgValue(options::OPT_index_store_path));

std::string Script = CrashInfo.Filename.rsplit('.').first.str() + ".sh";		std::string Script = CrashInfo.Filename.rsplit('.').first.str() + ".sh";
std::error_code EC;		std::error_code EC;
llvm::raw_fd_ostream ScriptOS(Script, EC, llvm::sys::fs::CD_CreateNew);		llvm::raw_fd_ostream ScriptOS(Script, EC, llvm::sys::fs::CD_CreateNew);
if (EC) {		if (EC) {
Diag(clang::diag::note_drv_command_failed_diag_msg)		Diag(clang::diag::note_drv_command_failed_diag_msg)
<< "Error generating run script: " + Script + " " + EC.message();		<< "Error generating run script: " + Script + " " + EC.message();
} else {		} else {

lib/Driver/Job.cpp

Show First 20 Lines • Show All 66 Lines • ▼ Show 20 Lines	IsInclude = llvm::StringSwitch<bool>(Flag)
.Cases("-idirafter", "-internal-isystem", "-iwithprefix", true)		.Cases("-idirafter", "-internal-isystem", "-iwithprefix", true)
.Cases("-internal-externc-isystem", "-iprefix", true)		.Cases("-internal-externc-isystem", "-iprefix", true)
.Cases("-iwithprefixbefore", "-isystem", "-iquote", true)		.Cases("-iwithprefixbefore", "-isystem", "-iquote", true)
.Cases("-isysroot", "-I", "-F", "-resource-dir", true)		.Cases("-isysroot", "-I", "-F", "-resource-dir", true)
.Cases("-iframework", "-include-pch", true)		.Cases("-iframework", "-include-pch", true)
.Default(false);		.Default(false);
if (IsInclude)		if (IsInclude)
return !HaveCrashVFS;		return !HaveCrashVFS;
		if (StringRef(Flag).startswith("-index-store-path"))
		return true;

// The remaining flags are treated as a single argument.		// The remaining flags are treated as a single argument.

// These flags are all of the form -Flag and have no second argument.		// These flags are all of the form -Flag and have no second argument.
ShouldSkip = llvm::StringSwitch<bool>(Flag)		ShouldSkip = llvm::StringSwitch<bool>(Flag)
.Cases("-M", "-MM", "-MG", "-MP", "-MD", true)		.Cases("-M", "-MM", "-MG", "-MP", "-MD", true)
.Case("-MMD", true)		.Case("-MMD", true)
.Default(false);		.Default(false);
▲ Show 20 Lines • Show All 123 Lines • ▼ Show 20 Lines	rewriteIncludes(const llvm::ArrayRef<const char *> &Args, size_t Idx,
assert(NumArgs == 2 && "Not expecting more than two arguments");		assert(NumArgs == 2 && "Not expecting more than two arguments");
StringRef Inc(Args[Idx + NumArgs - 1]);		StringRef Inc(Args[Idx + NumArgs - 1]);
if (!getAbsPath(Inc, NewInc))		if (!getAbsPath(Inc, NewInc))
return;		return;
IncFlags.push_back(SmallString<128>(Args[Idx]));		IncFlags.push_back(SmallString<128>(Args[Idx]));
IncFlags.push_back(std::move(NewInc));		IncFlags.push_back(std::move(NewInc));
}		}

		/// Returns a path to a directory named \c DirName adjacent to the module
		ioeric (Done): nit: Comment should start with an overview of what the function does. Returns a directory path that is ... Also, consider calling this `getDirAdjacentToModCache`. `buildDir` can be ambiguous.
		/// cache directory:
		/// <...>.cache/<DirName>
		static llvm::SmallString<128>
		getDirAdjacentToModCache(StringRef DirName, CrashReportInfo *CrashInfo) {
		llvm::SmallString<128> RelModCacheDir = llvm::sys::path::parent_path(
		llvm::sys::path::parent_path(CrashInfo->VFSPath));
		llvm::sys::path::append(RelModCacheDir, DirName);

		return RelModCacheDir;
		ioeric (Done): Please clang-format the code. Without indentation, this looks like a no-op statement.
		}

void Command::Print(raw_ostream &OS, const char *Terminator, bool Quote,		void Command::Print(raw_ostream &OS, const char *Terminator, bool Quote,
CrashReportInfo *CrashInfo) const {		CrashReportInfo *CrashInfo) const {
// Always quote the exe.		// Always quote the exe.
OS << ' ';		OS << ' ';
printArg(OS, Executable, /*Quote=*/true);		printArg(OS, Executable, /*Quote=*/true);

ArrayRef<const char *> Args = Arguments;		ArrayRef<const char *> Args = Arguments;
SmallVector<const char *, 128> ArgsRespFile;		SmallVector<const char *, 128> ArgsRespFile;
if (ResponseFile != nullptr) {		if (ResponseFile != nullptr) {
buildArgvForResponseFile(ArgsRespFile);		buildArgvForResponseFile(ArgsRespFile);
Args = ArrayRef<const char *>(ArgsRespFile).slice(1); // no executable name		Args = ArrayRef<const char *>(ArgsRespFile).slice(1); // no executable name
}		}

bool HaveCrashVFS = CrashInfo && !CrashInfo->VFSPath.empty();		bool HaveCrashVFS = CrashInfo && !CrashInfo->VFSPath.empty();
		bool HaveIndexStorePath = CrashInfo && !CrashInfo->IndexStorePath.empty();
for (size_t i = 0, e = Args.size(); i < e; ++i) {		for (size_t i = 0, e = Args.size(); i < e; ++i) {
const char *const Arg = Args[i];		const char *const Arg = Args[i];

if (CrashInfo) {		if (CrashInfo) {
int NumArgs = 0;		int NumArgs = 0;
bool IsInclude = false;		bool IsInclude = false;
if (skipArgs(Arg, HaveCrashVFS, NumArgs, IsInclude)) {		if (skipArgs(Arg, HaveCrashVFS, NumArgs, IsInclude)) {
i += NumArgs - 1;		i += NumArgs - 1;
Show All 31 Lines	void Command::Print(raw_ostream &OS, const char *Terminator, bool Quote,
}		}

if (CrashInfo && HaveCrashVFS) {		if (CrashInfo && HaveCrashVFS) {
OS << ' ';		OS << ' ';
printArg(OS, "-ivfsoverlay", Quote);		printArg(OS, "-ivfsoverlay", Quote);
OS << ' ';		OS << ' ';
printArg(OS, CrashInfo->VFSPath.str(), Quote);		printArg(OS, CrashInfo->VFSPath.str(), Quote);

// The leftover modules from the crash are stored in		// Provide an empty dir path for the future generated module cache to
// <name>.cache/vfs/modules		// leave the leftover modules from the crash untouched for pcm inspection
// Leave it untouched for pcm inspection and provide a clean/empty dir		SmallString<128> RelModCacheDir =
// path to contain the future generated module cache:		getDirAdjacentToModCache("repro-modules", CrashInfo);
// <name>.cache/vfs/repro-modules
SmallString<128> RelModCacheDir = llvm::sys::path::parent_path(
llvm::sys::path::parent_path(CrashInfo->VFSPath));
llvm::sys::path::append(RelModCacheDir, "repro-modules");

std::string ModCachePath = "-fmodules-cache-path=";		std::string ModCachePath = "-fmodules-cache-path=";
ModCachePath.append(RelModCacheDir.c_str());		ModCachePath.append(RelModCacheDir.c_str());

OS << ' ';		OS << ' ';
printArg(OS, ModCachePath, Quote);		printArg(OS, ModCachePath, Quote);
}		}

		if (CrashInfo && HaveIndexStorePath) {
		SmallString<128> IndexStoreDir;

		if (HaveCrashVFS) {
		// Provide a new index store, leaving the old one from the crash untouched
		ioeric (Done): Could you share this code with line 278 above, which already has a nice comment?
		IndexStoreDir = getDirAdjacentToModCache("index-store", CrashInfo);
		} else {
		IndexStoreDir = "index-store";
		}

		OS << ' ';
		printArg(OS, "-index-store-path", Quote);
		OS << ' ';
		printArg(OS, IndexStoreDir.c_str(), Quote);
		}

if (ResponseFile != nullptr) {		if (ResponseFile != nullptr) {
OS << "\n Arguments passed via response file:\n";		OS << "\n Arguments passed via response file:\n";
writeResponseFile(OS);		writeResponseFile(OS);
// Avoiding duplicated newline terminator, since FileLists are		// Avoiding duplicated newline terminator, since FileLists are
// newline-separated.		// newline-separated.
if (Creator.getResponseFilesSupport() != Tool::RF_FileList)		if (Creator.getResponseFilesSupport() != Tool::RF_FileList)
OS << "\n";		OS << "\n";
OS << " (end of response file)";		OS << " (end of response file)";
▲ Show 20 Lines • Show All 137 Lines • Show Last 20 Lines
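
`getDirAdjacentToModCache` strips two path components from the crash VFS path and appends a directory name. The sketch below reproduces that computation with plain string handling. `parentPath` is a simplified, hypothetical stand-in for `llvm::sys::path::parent_path` that only understands `/` separators, and `reproIndexStore` is an illustrative wrapper around the `HaveCrashVFS` branch. It assumes the VFS path has the form `<name>.cache/vfs/vfs.yaml`, as the `Driver.cpp` hunk suggests; under that assumption the result is `<name>.cache/<DirName>`.

```cpp
#include <cassert>
#include <string>

// Simplified stand-in for llvm::sys::path::parent_path: drops the last
// '/'-separated component, or returns an empty string if there is none.
static std::string parentPath(const std::string &P) {
  std::string::size_type Slash = P.find_last_of('/');
  return Slash == std::string::npos ? std::string() : P.substr(0, Slash);
}

// Mirrors the two parent_path() calls followed by append() in the patch.
std::string dirAdjacentToModCache(const std::string &VFSPath,
                                  const std::string &DirName) {
  assert(!DirName.empty());
  std::string Dir = parentPath(parentPath(VFSPath));
  return Dir.empty() ? DirName : Dir + "/" + DirName;
}

// Chooses the index store directory for the reproducer script: next to the
// crash cache when a VFS path is available, else a relative default.
std::string reproIndexStore(const std::string &VFSPath) {
  if (VFSPath.empty())
    return "index-store";
  return dirAdjacentToModCache(VFSPath, "index-store");
}
```

Either way the reproducer writes to a fresh directory, so the index store left behind by the crashing invocation is not overwritten.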

lib/Driver/ToolChains/Clang.cpp

Show First 20 Lines • Show All 3,723 Lines • ▼ Show 20 Lines	#endif
// Pass the path to compiler resource files.		// Pass the path to compiler resource files.
CmdArgs.push_back("-resource-dir");		CmdArgs.push_back("-resource-dir");
CmdArgs.push_back(D.ResourceDir.c_str());		CmdArgs.push_back(D.ResourceDir.c_str());

Args.AddLastArg(CmdArgs, options::OPT_working_directory);		Args.AddLastArg(CmdArgs, options::OPT_working_directory);

RenderARCMigrateToolOptions(D, Args, CmdArgs);		RenderARCMigrateToolOptions(D, Args, CmdArgs);

		if (Args.hasArg(options::OPT_index_store_path)) {
		Args.AddLastArg(CmdArgs, options::OPT_index_store_path);
		Args.AddLastArg(CmdArgs, options::OPT_index_ignore_system_symbols);
		Args.AddLastArg(CmdArgs, options::OPT_index_record_codegen_name);

		// If '-o' is passed along with '-fsyntax-only' pass it along the cc1
		// invocation so that the index action knows what the out file is.
		if (isa<CompileJobAction>(JA) && JA.getType() == types::TY_Nothing) {
		Args.AddLastArg(CmdArgs, options::OPT_o);
		}
		}

// Add preprocessing options like -I, -D, etc. if we are using the		// Add preprocessing options like -I, -D, etc. if we are using the
		arphaman (Done): What is this environment variable used for? And why does it imply the other two flags?
// preprocessor.		// preprocessor.
//		//
// FIXME: Support -fpreprocessed		// FIXME: Support -fpreprocessed
if (types::getPreprocessedType(InputType) != types::TY_INVALID)		if (types::getPreprocessedType(InputType) != types::TY_INVALID)
AddPreprocessingOptions(C, JA, D, Args, CmdArgs, Output, Inputs);		AddPreprocessingOptions(C, JA, D, Args, CmdArgs, Output, Inputs);

// Don't warn about "clang -c -DPIC -fPIC test.i" because libtool.m4 assumes		// Don't warn about "clang -c -DPIC -fPIC test.i" because libtool.m4 assumes
// that "The compiler can only warn and ignore the option if not recognized".		// that "The compiler can only warn and ignore the option if not recognized".

lib/Driver/ToolChains/Darwin.cpp

Show First 20 Lines • Show All 430 Lines • ▼ Show 20 Lines	void darwin::Linker::ConstructJob(Compilation &C, const JobAction &JA,
// -filelist linker option.		// -filelist linker option.
llvm::opt::ArgStringList InputFileList;		llvm::opt::ArgStringList InputFileList;

// The logic here is derived from gcc's behavior; most of which		// The logic here is derived from gcc's behavior; most of which
// comes from specs (starting with link_command). Consult gcc for		// comes from specs (starting with link_command). Consult gcc for
// more information.		// more information.
ArgStringList CmdArgs;		ArgStringList CmdArgs;

		Args.ClaimAllArgs(options::OPT_index_store_path);
		Args.ClaimAllArgs(options::OPT_index_ignore_system_symbols);
		Args.ClaimAllArgs(options::OPT_index_record_codegen_name);

/// Hack(tm) to ignore linking errors when we are doing ARC migration.		/// Hack(tm) to ignore linking errors when we are doing ARC migration.
if (Args.hasArg(options::OPT_ccc_arcmt_check,		if (Args.hasArg(options::OPT_ccc_arcmt_check,
options::OPT_ccc_arcmt_migrate)) {		options::OPT_ccc_arcmt_migrate)) {
for (const auto &Arg : Args)		for (const auto &Arg : Args)
Arg->claim();		Arg->claim();
const char *Exec =		const char *Exec =
Args.MakeArgString(getToolChain().GetProgramPath("touch"));		Args.MakeArgString(getToolChain().GetProgramPath("touch"));
CmdArgs.push_back(Output.getFilename());		CmdArgs.push_back(Output.getFilename());

lib/Frontend/CompilerInstance.cpp

#include "clang/Frontend/FrontendAction.h"		#include "clang/Frontend/FrontendAction.h"
#include "clang/Frontend/FrontendActions.h"		#include "clang/Frontend/FrontendActions.h"
#include "clang/Frontend/FrontendDiagnostic.h"		#include "clang/Frontend/FrontendDiagnostic.h"
#include "clang/Frontend/LogDiagnosticPrinter.h"		#include "clang/Frontend/LogDiagnosticPrinter.h"
#include "clang/Frontend/SerializedDiagnosticPrinter.h"		#include "clang/Frontend/SerializedDiagnosticPrinter.h"
#include "clang/Frontend/TextDiagnosticPrinter.h"		#include "clang/Frontend/TextDiagnosticPrinter.h"
#include "clang/Frontend/Utils.h"		#include "clang/Frontend/Utils.h"
#include "clang/Frontend/VerifyDiagnosticConsumer.h"		#include "clang/Frontend/VerifyDiagnosticConsumer.h"
		#include "clang/Index/IndexingAction.h"
#include "clang/Lex/HeaderSearch.h"		#include "clang/Lex/HeaderSearch.h"
#include "clang/Lex/PTHManager.h"		#include "clang/Lex/PTHManager.h"
#include "clang/Lex/Preprocessor.h"		#include "clang/Lex/Preprocessor.h"
#include "clang/Lex/PreprocessorOptions.h"		#include "clang/Lex/PreprocessorOptions.h"
#include "clang/Sema/CodeCompleteConsumer.h"		#include "clang/Sema/CodeCompleteConsumer.h"
#include "clang/Sema/Sema.h"		#include "clang/Sema/Sema.h"
#include "clang/Serialization/ASTReader.h"		#include "clang/Serialization/ASTReader.h"
#include "clang/Serialization/GlobalModuleIndex.h"		#include "clang/Serialization/GlobalModuleIndex.h"
	compileModuleImpl(CompilerInstance &ImportingInstance, SourceLocation ImportLoc,
Instance.setFileManager(&ImportingInstance.getFileManager());		Instance.setFileManager(&ImportingInstance.getFileManager());
Instance.createSourceManager(Instance.getFileManager());		Instance.createSourceManager(Instance.getFileManager());
SourceManager &SourceMgr = Instance.getSourceManager();		SourceManager &SourceMgr = Instance.getSourceManager();
SourceMgr.setModuleBuildStack(		SourceMgr.setModuleBuildStack(
ImportingInstance.getSourceManager().getModuleBuildStack());		ImportingInstance.getSourceManager().getModuleBuildStack());
SourceMgr.pushModuleBuildStack(ModuleName,		SourceMgr.pushModuleBuildStack(ModuleName,
FullSourceLoc(ImportLoc, ImportingInstance.getSourceManager()));		FullSourceLoc(ImportLoc, ImportingInstance.getSourceManager()));

		// Pass along the GenModuleActionWrapper callback
		auto WrapGenModuleAction = ImportingInstance.getGenModuleActionWrapper();
		arphaman (Done): Please start your variable names with uppercase (http://llvm.org/docs/CodingStandards.html#name-types-functions-variables-and-enumerators-properly).
		Instance.setGenModuleActionWrapper(WrapGenModuleAction);

// If we're collecting module dependencies, we need to share a collector		// If we're collecting module dependencies, we need to share a collector
// between all of the module CompilerInstances. Other than that, we don't		// between all of the module CompilerInstances. Other than that, we don't
// want to produce any dependency output from the module build.		// want to produce any dependency output from the module build.
Instance.setModuleDepCollector(ImportingInstance.getModuleDepCollector());		Instance.setModuleDepCollector(ImportingInstance.getModuleDepCollector());
Inv.getDependencyOutputOpts() = DependencyOutputOptions();		Inv.getDependencyOutputOpts() = DependencyOutputOptions();

ImportingInstance.getDiagnostics().Report(ImportLoc,		ImportingInstance.getDiagnostics().Report(ImportLoc,
diag::remark_module_build)		diag::remark_module_build)
<< ModuleName << ModuleFileName;		<< ModuleName << ModuleFileName;

PreBuildStep(Instance);		PreBuildStep(Instance);

// Execute the action to actually build the module in-place. Use a separate		// Execute the action to actually build the module in-place. Use a separate
// thread so that we get a stack large enough.		// thread so that we get a stack large enough.
const unsigned ThreadStackSize = 8 << 20;		const unsigned ThreadStackSize = 8 << 20;
llvm::CrashRecoveryContext CRC;		llvm::CrashRecoveryContext CRC;
CRC.RunSafelyOnThread(		CRC.RunSafelyOnThread(
[&]() {		[&]() {
GenerateModuleFromModuleMapAction Action;		std::unique_ptr<FrontendAction> Action(
Instance.ExecuteAction(Action);		new GenerateModuleFromModuleMapAction);
		if (WrapGenModuleAction)
		ioeric (Done): nit: no braces around one liners.
		Action = WrapGenModuleAction(FrontendOpts, std::move(Action));
		Instance.ExecuteAction(*Action);
},		},
ThreadStackSize);		ThreadStackSize);

PostBuildStep(Instance);		PostBuildStep(Instance);

ImportingInstance.getDiagnostics().Report(ImportLoc,		ImportingInstance.getDiagnostics().Report(ImportLoc,
diag::remark_module_build_done)		diag::remark_module_build_done)
<< ModuleName;		<< ModuleName;

lib/Frontend/CompilerInvocation.cpp

	static InputKind ParseFrontendArgs(FrontendOptions &Opts, ArgList &Args,
Opts.ObjCMTWhiteListPath = Args.getLastArgValue(OPT_objcmt_whitelist_dir_path);		Opts.ObjCMTWhiteListPath = Args.getLastArgValue(OPT_objcmt_whitelist_dir_path);

if (Opts.ARCMTAction != FrontendOptions::ARCMT_None &&		if (Opts.ARCMTAction != FrontendOptions::ARCMT_None &&
Opts.ObjCMTAction != FrontendOptions::ObjCMT_None) {		Opts.ObjCMTAction != FrontendOptions::ObjCMT_None) {
Diags.Report(diag::err_drv_argument_not_allowed_with)		Diags.Report(diag::err_drv_argument_not_allowed_with)
<< "ARC migration" << "ObjC migration";		<< "ARC migration" << "ObjC migration";
}		}

		Opts.IndexStorePath = Args.getLastArgValue(OPT_index_store_path);
		Opts.IndexIgnoreSystemSymbols = Args.hasArg(OPT_index_ignore_system_symbols);
		Opts.IndexRecordCodegenName = Args.hasArg(OPT_index_record_codegen_name);

InputKind DashX(InputKind::Unknown);		InputKind DashX(InputKind::Unknown);
if (const Arg *A = Args.getLastArg(OPT_x)) {		if (const Arg *A = Args.getLastArg(OPT_x)) {
StringRef XValue = A->getValue();		StringRef XValue = A->getValue();

// Parse suffixes: '<lang>(-header|[-module-map][-cpp-output])'.		// Parse suffixes: '<lang>(-header|[-module-map][-cpp-output])'.
// FIXME: Supporting '<lang>-header-cpp-output' would be useful.		// FIXME: Supporting '<lang>-header-cpp-output' would be useful.
bool Preprocessed = XValue.consume_back("-cpp-output");		bool Preprocessed = XValue.consume_back("-cpp-output");
bool ModuleMap = XValue.consume_back("-module-map");		bool ModuleMap = XValue.consume_back("-module-map");

lib/FrontendTool/CMakeLists.txt

	set(LLVM_LINK_COMPONENTS			set(LLVM_LINK_COMPONENTS
	Option			Option
	Support			Support
	)			)

	set(link_libs			set(link_libs
	clangBasic			clangBasic
	clangCodeGen			clangCodeGen
	clangDriver			clangDriver
	clangFrontend			clangFrontend
				clangIndex
	clangRewriteFrontend			clangRewriteFrontend
	)			)

	if(CLANG_ENABLE_ARCMT)			if(CLANG_ENABLE_ARCMT)
	list(APPEND link_libs			list(APPEND link_libs
	clangARCMigrate			clangARCMigrate
	)			)
	endif()			endif()

lib/FrontendTool/ExecuteCompilerInvocation.cpp

#include "clang/Config/config.h"		#include "clang/Config/config.h"
#include "clang/Driver/Options.h"		#include "clang/Driver/Options.h"
#include "clang/Frontend/CompilerInstance.h"		#include "clang/Frontend/CompilerInstance.h"
#include "clang/Frontend/CompilerInvocation.h"		#include "clang/Frontend/CompilerInvocation.h"
#include "clang/Frontend/FrontendActions.h"		#include "clang/Frontend/FrontendActions.h"
#include "clang/Frontend/FrontendDiagnostic.h"		#include "clang/Frontend/FrontendDiagnostic.h"
#include "clang/Frontend/FrontendPluginRegistry.h"		#include "clang/Frontend/FrontendPluginRegistry.h"
#include "clang/Frontend/Utils.h"		#include "clang/Frontend/Utils.h"
		#include "clang/Index/RecordingAction.h"
#include "clang/Rewrite/Frontend/FrontendActions.h"		#include "clang/Rewrite/Frontend/FrontendActions.h"
#include "clang/StaticAnalyzer/Frontend/FrontendActions.h"		#include "clang/StaticAnalyzer/Frontend/FrontendActions.h"
#include "llvm/Option/OptTable.h"		#include "llvm/Option/OptTable.h"
#include "llvm/Option/Option.h"		#include "llvm/Option/Option.h"
#include "llvm/Support/DynamicLibrary.h"		#include "llvm/Support/DynamicLibrary.h"
#include "llvm/Support/ErrorHandling.h"		#include "llvm/Support/ErrorHandling.h"
using namespace clang;		using namespace clang;
using namespace llvm::opt;		using namespace llvm::opt;
	if (CI.getFrontendOpts().ProgramAction != frontend::MigrateSource &&
if (FEOpts.ObjCMTAction != FrontendOptions::ObjCMT_None) {		if (FEOpts.ObjCMTAction != FrontendOptions::ObjCMT_None) {
Act = llvm::make_unique<arcmt::ObjCMigrateAction>(std::move(Act),		Act = llvm::make_unique<arcmt::ObjCMigrateAction>(std::move(Act),
FEOpts.MTMigrateDir,		FEOpts.MTMigrateDir,
FEOpts.ObjCMTAction);		FEOpts.ObjCMTAction);
}		}
}		}
#endif		#endif

		if (!FEOpts.IndexStorePath.empty()) {
		auto WrapWithIndexRecordAction =
		[&](const FrontendOptions &opts,
		ioeric (Done): Could you comment on what this does? The `Act` above is already wrapped. Why do we need `setGenModuleActionWrapper` to `createIndexDataRecordingAction` again? Also, `createIndexDataRecordingAction` doesn't seem related to `GenModule`.
		nathawes (Not Done): It's to wrap any GenerateModuleActions that get created as needed when/if Act ends up loading any modules, so that we output index data for them too. I'll add a comment.
		std::unique_ptr<FrontendAction> WrappedAction) {
		auto RecordOpts =
		index::getRecordingOptionsFromFrontendOptions(FEOpts);
		return index::createIndexDataRecordingAction(
		RecordOpts, std::move(WrappedAction));
		};

		// Wrap the main action as well as any GenerateModuleActions created while
		// loading modules
		Act = WrapWithIndexRecordAction(FEOpts, std::move(Act));
		CI.setGenModuleActionWrapper(WrapWithIndexRecordAction);
		}

// If there are any AST files to merge, create a frontend action		// If there are any AST files to merge, create a frontend action
// adaptor to perform the merge.		// adaptor to perform the merge.
if (!FEOpts.ASTMergeFiles.empty())		if (!FEOpts.ASTMergeFiles.empty())
Act = llvm::make_unique<ASTMergeAction>(std::move(Act),		Act = llvm::make_unique<ASTMergeAction>(std::move(Act),
FEOpts.ASTMergeFiles);		FEOpts.ASTMergeFiles);

return Act;		return Act;
}		}
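
Reviewer note: the hunk above wraps the main action and then registers the same wrapper on the CompilerInstance, so that GenerateModuleActions created on demand in compileModuleImpl also emit index data. The following is a minimal, self-contained sketch of that callback pattern. All types in it (`Action`, `BuildAction`, `RecordingAction`, `Instance`) are hypothetical stand-ins for illustration, not Clang's FrontendAction or CompilerInstance, and the module build is run before the main action only to keep the sketch short.

```cpp
#include <cassert>
#include <functional>
#include <memory>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for a frontend action; not a Clang class.
struct Action {
  virtual ~Action() = default;
  virtual void run(std::vector<std::string> &Log) = 0;
};

struct BuildAction : Action {
  std::string Name;
  explicit BuildAction(std::string N) : Name(std::move(N)) {}
  void run(std::vector<std::string> &Log) override {
    Log.push_back("build " + Name);
  }
};

// Delegates to the wrapped action, then records index data.
struct RecordingAction : Action {
  std::unique_ptr<Action> Wrapped;
  explicit RecordingAction(std::unique_ptr<Action> W) : Wrapped(std::move(W)) {}
  void run(std::vector<std::string> &Log) override {
    Wrapped->run(Log);
    Log.push_back("record index data");
  }
};

using ActionWrapper =
    std::function<std::unique_ptr<Action>(std::unique_ptr<Action>)>;

// Hypothetical stand-in for the compiler instance holding the wrapper.
struct Instance {
  ActionWrapper GenModuleWrapper; // empty when indexing is disabled

  // Builds a module, applying the wrapper only if one was registered.
  void buildModule(const std::string &Name, std::vector<std::string> &Log) {
    std::unique_ptr<Action> Act = std::make_unique<BuildAction>(Name);
    if (GenModuleWrapper)
      Act = GenModuleWrapper(std::move(Act));
    Act->run(Log);
  }
};

// Runs one module build plus the main action and returns the event log.
inline std::vector<std::string> runWithIndexing(bool EnableIndexing) {
  std::vector<std::string> Log;
  Instance CI;
  std::unique_ptr<Action> Main = std::make_unique<BuildAction>("main.cpp");
  if (EnableIndexing) {
    ActionWrapper Wrap =
        [](std::unique_ptr<Action> A) -> std::unique_ptr<Action> {
      return std::make_unique<RecordingAction>(std::move(A));
    };
    Main = Wrap(std::move(Main)); // wrap the main action
    CI.GenModuleWrapper = Wrap;   // and any module builds
  }
  CI.buildModule("ModTop", Log);
  Main->run(Log);
  return Log;
}
```

With indexing enabled, both the module build and the main action are followed by a recording step; with it disabled, the wrapper is empty and neither is.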

lib/Index/CMakeLists.txt

	set(LLVM_LINK_COMPONENTS			set(LLVM_LINK_COMPONENTS
	Core			Core
	Support			Support
	)			)

	add_clang_library(clangIndex			add_clang_library(clangIndex
	CodegenNameGenerator.cpp			CodegenNameGenerator.cpp
	CommentToXML.cpp			CommentToXML.cpp
				FileIndexData.cpp
	IndexBody.cpp			IndexBody.cpp
	IndexDecl.cpp			IndexDecl.cpp
	IndexingAction.cpp			IndexingAction.cpp
	IndexingContext.cpp			IndexingContext.cpp
	IndexSymbol.cpp			IndexSymbol.cpp
	IndexTypeSourceInfo.cpp			IndexTypeSourceInfo.cpp
				UnitIndexDataRecorder.cpp
	USRGeneration.cpp			USRGeneration.cpp

	ADDITIONAL_HEADERS			ADDITIONAL_HEADERS
	IndexingContext.h			IndexingContext.h
	SimpleFormatContext.h			SimpleFormatContext.h

	LINK_LIBS			LINK_LIBS
	clangAST			clangAST
	clangBasic			clangBasic
	clangFormat			clangFormat
	clangFrontend			clangFrontend
				clangLex
	clangRewrite			clangRewrite
	clangSerialization			clangSerialization
	clangToolingCore			clangToolingCore
	)			)

lib/Index/FileIndexData.h

This file was added.

				//===--- FileIndexData.h - Index data per file --------------------------===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//

				#ifndef LLVM_CLANG_LIB_INDEX_FILEINDEXRECORD_H
				#define LLVM_CLANG_LIB_INDEX_FILEINDEXRECORD_H

				#include "clang/Basic/SourceLocation.h"
				#include "clang/Index/DeclOccurrence.h"
				#include "clang/Index/IndexSymbol.h"
				#include "llvm/ADT/ArrayRef.h"
				#include "llvm/ADT/SmallVector.h"
				#include <vector>

				namespace clang {
				class IdentifierInfo;

				namespace index {

				/// Stores the declaration occurrences seen in a particular source or header
				/// file of a translation unit
				class FileIndexData {
				private:
				FileID FID;
				bool IsSystem;
				std::vector<DeclOccurrence> Decls;

				public:
				FileIndexData(FileID FID, bool IsSystem) : FID(FID), IsSystem(IsSystem) {}

				std::vector<DeclOccurrence> getDeclOccurrencesSortedByOffset() const;

				FileID getFileID() const { return FID; }
				bool isSystem() const { return IsSystem; }

				/// Adds an occurrence of the canonical declaration \c D at the supplied
				/// \c Offset
				///
				/// \param Roles the roles the occurrence fulfills in this position.
				/// \param Offset the offset in the file of this occurrence.
				/// \param D the canonical declaration this is an occurrence of.
				/// \param Relations the set of symbols related to this occurrence.
				void addDeclOccurence(SymbolRoleSet Roles, unsigned Offset, const Decl *D,
				ArrayRef<SymbolRelation> Relations);
				void print(llvm::raw_ostream &OS) const;
				};

				} // end namespace index
				} // end namespace clang

				#endif

lib/Index/FileIndexData.cpp

This file was added.

				//===--- FileIndexData.cpp - Index data per file ------------------------===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//

				#include "FileIndexData.h"
				#include "clang/AST/ASTContext.h"
				#include "clang/AST/DeclTemplate.h"
				#include "llvm/ADT/SmallString.h"
				#include "llvm/Support/Path.h"

				using namespace clang;
				using namespace clang::index;

				void FileIndexData::addDeclOccurence(SymbolRoleSet Roles, unsigned Offset,
				const Decl *D,
				ArrayRef<SymbolRelation> Relations) {
				assert(D->isCanonicalDecl() &&
				"Occurrences should be associated with their canonical decl");

				Decls.emplace_back(Roles, Offset, D, Relations);
				}

				std::vector<DeclOccurrence>
				FileIndexData::getDeclOccurrencesSortedByOffset() const {
				std::vector<DeclOccurrence> Sorted(Decls);
				std::sort(Sorted.begin(), Sorted.end());
				mgrang (Not Done): Please use range-based llvm::sort instead of std::sort: `llvm::sort(Sorted);` See https://llvm.org/docs/CodingStandards.html#beware-of-non-deterministic-sorting-order-of-equal-elements
				return Sorted;
				}

				void FileIndexData::print(llvm::raw_ostream &OS) const {
				OS << "DECLS BEGIN ---\n";
				for (auto &DclInfo : Decls) {
				auto D = DclInfo.Dcl;
				SourceManager &SM = D->getASTContext().getSourceManager();
				SourceLocation Loc = SM.getFileLoc(D->getLocation());
				PresumedLoc PLoc = SM.getPresumedLoc(Loc);
				OS << llvm::sys::path::filename(PLoc.getFilename()) << ':' << PLoc.getLine()
				<< ':' << PLoc.getColumn();

				if (auto ND = dyn_cast<NamedDecl>(D)) {
				OS << ' ' << ND->getNameAsString();
				}

				OS << '\n';
				}
				OS << "DECLS END ---\n";
				}
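
Reviewer note: the concern raised on getDeclOccurrencesSortedByOffset is that std::sort leaves the relative order of equal elements unspecified, so two occurrences at the same offset may come out in a different order from run to run or platform to platform. The sketch below illustrates one way to make the result deterministic. It uses std::stable_sort on a hypothetical `Occurrence` struct so that it compiles without LLVM headers; this is a substitute chosen for the illustration, not the `llvm::sort` call the reviewer asked for, and `Occurrence` is not the patch's DeclOccurrence.

```cpp
#include <algorithm>
#include <cassert>
#include <string>
#include <vector>

// Hypothetical stand-in for DeclOccurrence: ordered by file offset only.
struct Occurrence {
  unsigned Offset;
  std::string Name;
};

inline bool operator<(const Occurrence &L, const Occurrence &R) {
  return L.Offset < R.Offset;
}

// Returns a sorted copy; the input vector is left untouched.
// stable_sort keeps insertion order for occurrences with equal offsets.
inline std::vector<Occurrence>
sortedByOffset(const std::vector<Occurrence> &Decls) {
  std::vector<Occurrence> Sorted(Decls);
  std::stable_sort(Sorted.begin(), Sorted.end());
  return Sorted;
}
```

An alternative with the same effect is to extend the comparison with a tie-breaker so that no two distinct occurrences compare equal.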

lib/Index/IndexingAction.cpp

//===- IndexingAction.cpp - Frontend index action -------------------------===//		//===- IndexingAction.cpp - Frontend index action -------------------------===//
//		//
// The LLVM Compiler Infrastructure		// The LLVM Compiler Infrastructure
//		//
// This file is distributed under the University of Illinois Open Source		// This file is distributed under the University of Illinois Open Source
// License. See LICENSE.TXT for details.		// License. See LICENSE.TXT for details.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "clang/Index/IndexingAction.h"		#include "clang/Index/IndexingAction.h"
		#include "clang/Index/RecordingAction.h"
		#include "clang/Index/UnitIndexingAction.h"

		#include "FileIndexData.h"
#include "IndexingContext.h"		#include "IndexingContext.h"
		#include "UnitIndexDataRecorder.h"
		#include "clang/Basic/FileManager.h"
#include "clang/Frontend/CompilerInstance.h"		#include "clang/Frontend/CompilerInstance.h"
#include "clang/Frontend/FrontendAction.h"		#include "clang/Frontend/FrontendAction.h"
		#include "clang/Frontend/FrontendDiagnostic.h"
#include "clang/Frontend/MultiplexConsumer.h"		#include "clang/Frontend/MultiplexConsumer.h"
		#include "clang/Frontend/Utils.h"
#include "clang/Index/IndexDataConsumer.h"		#include "clang/Index/IndexDataConsumer.h"
		#include "clang/Index/IndexDiagnostic.h"
		#include "clang/Index/UnitIndexDataConsumer.h"
#include "clang/Lex/Preprocessor.h"		#include "clang/Lex/Preprocessor.h"
#include "clang/Serialization/ASTReader.h"		#include "clang/Serialization/ASTReader.h"
		#include "llvm/Support/Path.h"

using namespace clang;		using namespace clang;
using namespace clang::index;		using namespace clang::index;

		void UnitIndexDataConsumer::_anchor() {}
void IndexDataConsumer::_anchor() {}		void IndexDataConsumer::_anchor() {}

bool IndexDataConsumer::handleDeclOccurence(const Decl *D, SymbolRoleSet Roles,		bool IndexDataConsumer::handleDeclOccurence(const Decl *D, SymbolRoleSet Roles,
ArrayRef<SymbolRelation> Relations,		ArrayRef<SymbolRelation> Relations,
SourceLocation Loc,		SourceLocation Loc,
		bool IsInSystemFile,
ASTNodeInfo ASTNode) {		ASTNodeInfo ASTNode) {
return true;		return true;
}		}

bool IndexDataConsumer::handleMacroOccurence(const IdentifierInfo *Name,		bool IndexDataConsumer::handleMacroOccurence(const IdentifierInfo *Name,
const MacroInfo *MI,		const MacroInfo *MI,
SymbolRoleSet Roles,		SymbolRoleSet Roles,
SourceLocation Loc) {		SourceLocation Loc,
		bool IsInSystemFile) {
return true;		return true;
}		}

bool IndexDataConsumer::handleModuleOccurence(const ImportDecl *ImportD,		bool IndexDataConsumer::handleModuleOccurence(const ImportDecl *ImportD,
SymbolRoleSet Roles,		SymbolRoleSet Roles,
SourceLocation Loc) {		SourceLocation Loc,
		bool IsInSystemFile) {
return true;		return true;
}		}

namespace {		namespace {

class IndexASTConsumer : public ASTConsumer {		class IndexASTConsumer : public ASTConsumer {
std::shared_ptr<Preprocessor> PP;		std::shared_ptr<Preprocessor> PP;
IndexingContext &IndexCtx;		IndexingContext &IndexCtx;
	protected:
void HandleTopLevelDeclInObjCContainer(DeclGroupRef DG) override {		void HandleTopLevelDeclInObjCContainer(DeclGroupRef DG) override {
IndexCtx.indexDeclGroupRef(DG);		IndexCtx.indexDeclGroupRef(DG);
}		}

void HandleTranslationUnit(ASTContext &Ctx) override {		void HandleTranslationUnit(ASTContext &Ctx) override {
}		}
};		};

class IndexActionBase {		/// Abstracts the core logic shared between \c IndexAction and
protected:		/// \c WrappingIndexAction frontend actions.
		ioeric (Done): Use `class` for interfaces.
		ioeric (Done): Does `CI` here have to be the same instance as the one in `createIndexASTConsumer`? Might be worth documenting.
std::shared_ptr<IndexDataConsumer> DataConsumer;		class IndexActionImpl {
IndexingContext IndexCtx;		public:
		virtual ~IndexActionImpl() = default;
IndexActionBase(std::shared_ptr<IndexDataConsumer> dataConsumer,
IndexingOptions Opts)
: DataConsumer(std::move(dataConsumer)),
IndexCtx(Opts, *DataConsumer) {}

std::unique_ptr<IndexASTConsumer>
createIndexASTConsumer(CompilerInstance &CI) {
return llvm::make_unique<IndexASTConsumer>(CI.getPreprocessorPtr(),
IndexCtx);
}

void finish() {		/// Called at the beginning of processing a single input, this creates the
DataConsumer->finish();		/// IndexASTConsumer object to use.
}		///
		/// \param CI The compiler instance used to process the input
		/// \returns the created IndexASTConsumer.
		gribozavr (Not Done): Please don't duplicate the information from the signature in comments. No need to say that this function returns an IndexASTConsumer (twice, in the first sentence and in the \returns clause), the code already says that. Also, "The compiler instance used to process the input" does not mean much to me either.
		virtual std::unique_ptr<IndexASTConsumer>
		createIndexASTConsumer(CompilerInstance &CI) = 0;

		/// Callback at the end of processing a single input.
		///
		/// \param CI The compiler instance used to process the input. It will be the
		/// same instance as provided in \c createIndexASTConsumer.
		virtual void finish(CompilerInstance &CI) = 0;
};		};

class IndexAction : public ASTFrontendAction, IndexActionBase {		class IndexAction : public ASTFrontendAction {
		std::unique_ptr<IndexActionImpl> Impl;

public:		public:
IndexAction(std::shared_ptr<IndexDataConsumer> DataConsumer,		IndexAction(std::unique_ptr<IndexActionImpl> Impl) : Impl(std::move(Impl)) {}
IndexingOptions Opts)
: IndexActionBase(std::move(DataConsumer), Opts) {}

protected:		protected:
std::unique_ptr<ASTConsumer> CreateASTConsumer(CompilerInstance &CI,		std::unique_ptr<ASTConsumer> CreateASTConsumer(CompilerInstance &CI,
StringRef InFile) override {		StringRef InFile) override {
return createIndexASTConsumer(CI);		return Impl->createIndexASTConsumer(CI);
}		}

void EndSourceFileAction() override {		void EndSourceFileAction() override {
FrontendAction::EndSourceFileAction();		FrontendAction::EndSourceFileAction();
finish();		Impl->finish(getCompilerInstance());
}		}
};		};

class WrappingIndexAction : public WrapperFrontendAction, IndexActionBase {		class WrappingIndexAction : public WrapperFrontendAction {
bool IndexActionFailed = false;		std::unique_ptr<IndexActionImpl> Impl;
		bool CreatedASTConsumer = false;

public:		public:
WrappingIndexAction(std::unique_ptr<FrontendAction> WrappedAction,		WrappingIndexAction(std::unique_ptr<FrontendAction> WrappedAction,
std::shared_ptr<IndexDataConsumer> DataConsumer,		std::unique_ptr<IndexActionImpl> Impl)
IndexingOptions Opts)		: WrapperFrontendAction(std::move(WrappedAction)), Impl(std::move(Impl)) {
: WrapperFrontendAction(std::move(WrappedAction)),		}
IndexActionBase(std::move(DataConsumer), Opts) {}

protected:		protected:
std::unique_ptr<ASTConsumer> CreateASTConsumer(CompilerInstance &CI,		std::unique_ptr<ASTConsumer> CreateASTConsumer(CompilerInstance &CI,
StringRef InFile) override;		StringRef InFile) override {
void EndSourceFileAction() override;		auto OtherConsumer = WrapperFrontendAction::CreateASTConsumer(CI, InFile);
};		if (!OtherConsumer)
		return nullptr;

} // anonymous namespace		std::vector<std::unique_ptr<ASTConsumer>> Consumers;
		ioeric (Done): nit: Move this after `Impl->createIndexASTConsumer(CI)`. Do we need to reset this flag? Calling `CreateASTConsumer` multiple times on the same instance seems to be allowed?
		nathawes (Not Done): Oops. Yes, we do :-)
		Consumers.push_back(std::move(OtherConsumer));
		Consumers.push_back(Impl->createIndexASTConsumer(CI));
		CreatedASTConsumer = true;

void WrappingIndexAction::EndSourceFileAction() {		return llvm::make_unique<MultiplexConsumer>(std::move(Consumers));
		};
		gribozavr (Not Done): No semicolon.

		void EndSourceFileAction() override {
// Invoke wrapped action's method.		// Invoke wrapped action's method.
WrapperFrontendAction::EndSourceFileAction();		WrapperFrontendAction::EndSourceFileAction();
if (!IndexActionFailed)		if (CreatedASTConsumer) {
finish();		CreatedASTConsumer = false;
		Impl->finish(getCompilerInstance());
}		}
		};
		gribozavr (Not Done): No semicolon.
		};
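
Reviewer note: the new WrappingIndexAction calls `Impl->finish` only when an AST consumer was actually created for the current input, and clears the flag afterwards so that the same action object can process a further input. A minimal sketch of that guard follows; `Impl` and `WrappingAction` are hypothetical simplifications, not the classes in this patch, and the wrapped action's success is modelled as a plain bool.

```cpp
#include <cassert>
#include <memory>
#include <utility>

// Hypothetical stand-in for IndexActionImpl: counts finish() calls.
struct Impl {
  int Finished = 0;
  void finish() { ++Finished; }
};

class WrappingAction {
  std::shared_ptr<Impl> TheImpl;
  bool CreatedConsumer = false;

public:
  explicit WrappingAction(std::shared_ptr<Impl> I) : TheImpl(std::move(I)) {}

  // Returns false when the wrapped action produced no consumer,
  // in which case no index consumer is attached either.
  bool createConsumer(bool WrappedSucceeds) {
    if (!WrappedSucceeds)
      return false;
    CreatedConsumer = true;
    return true;
  }

  // Finishes indexing only for inputs that got a consumer, then resets
  // the flag so the next input starts from a clean state.
  void endSourceFile() {
    if (CreatedConsumer) {
      CreatedConsumer = false;
      TheImpl->finish();
    }
  }
};

// Processes two inputs with one action and reports how often finish() ran.
inline int finishCount(bool FirstSucceeds, bool SecondSucceeds) {
  auto I = std::make_shared<Impl>();
  WrappingAction A(I);
  A.createConsumer(FirstSucceeds);
  A.endSourceFile();
  A.createConsumer(SecondSucceeds);
  A.endSourceFile();
  return I->Finished;
}
```

Without the reset, a second input whose wrapped action fails would still trigger `finish`, which is the case the review exchange above is about.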

std::unique_ptr<ASTConsumer>		/// An implementation for \c IndexAction or \c WrappingIndexAction that provides
WrappingIndexAction::CreateASTConsumer(CompilerInstance &CI, StringRef InFile) {		/// decl occurrence information from the AST.
auto OtherConsumer = WrapperFrontendAction::CreateASTConsumer(CI, InFile);		class DataConsumerActionImpl : public IndexActionImpl {
if (!OtherConsumer) {		protected:
IndexActionFailed = true;		std::shared_ptr<IndexDataConsumer> DataConsumer;
return nullptr;		IndexingContext IndexCtx;
}

std::vector<std::unique_ptr<ASTConsumer>> Consumers;		public:
Consumers.push_back(std::move(OtherConsumer));		DataConsumerActionImpl(std::shared_ptr<IndexDataConsumer> Consumer,
Consumers.push_back(createIndexASTConsumer(CI));		IndexingOptions Opts)
return llvm::make_unique<MultiplexConsumer>(std::move(Consumers));		: DataConsumer(std::move(Consumer)), IndexCtx(Opts, *DataConsumer) {}

		std::unique_ptr<IndexASTConsumer>
		createIndexASTConsumer(CompilerInstance &CI) override {
		IndexCtx.setSysrootPath(CI.getHeaderSearchOpts().Sysroot);
		return llvm::make_unique<IndexASTConsumer>(CI.getPreprocessorPtr(),
		IndexCtx);
}		}

		void finish(CompilerInstance &CI) override { DataConsumer->finish(); }
		};
		gribozavr (Not Done): No semicolon.

		} // anonymous namespace

std::unique_ptr<FrontendAction>		std::unique_ptr<FrontendAction>
index::createIndexingAction(std::shared_ptr<IndexDataConsumer> DataConsumer,		index::createIndexingAction(std::shared_ptr<IndexDataConsumer> DataConsumer,
IndexingOptions Opts,		IndexingOptions Opts,
std::unique_ptr<FrontendAction> WrappedAction) {		std::unique_ptr<FrontendAction> WrappedAction) {
		auto ActionImpl =
		llvm::make_unique<DataConsumerActionImpl>(std::move(DataConsumer), Opts);
if (WrappedAction)		if (WrappedAction)
return llvm::make_unique<WrappingIndexAction>(std::move(WrappedAction),		return llvm::make_unique<WrappingIndexAction>(std::move(WrappedAction),
std::move(DataConsumer),		std::move(ActionImpl));
Opts);		return llvm::make_unique<IndexAction>(std::move(ActionImpl));
return llvm::make_unique<IndexAction>(std::move(DataConsumer), Opts);
}		}


static bool topLevelDeclVisitor(void *context, const Decl *D) {		static bool topLevelDeclVisitor(void *context, const Decl *D) {
IndexingContext &IndexCtx = *static_cast<IndexingContext*>(context);		IndexingContext &IndexCtx = *static_cast<IndexingContext*>(context);
return IndexCtx.indexTopLevelDecl(D);		return IndexCtx.indexTopLevelDecl(D);
}		}

static void indexTranslationUnit(ASTUnit &Unit, IndexingContext &IndexCtx) {		static void indexTranslationUnit(ASTUnit &Unit, IndexingContext &IndexCtx) {
Unit.visitLocalTopLevelDecls(&IndexCtx, topLevelDeclVisitor);		Unit.visitLocalTopLevelDecls(&IndexCtx, topLevelDeclVisitor);
}		}
	void index::indexModuleFile(serialization::ModuleFile &Mod, ASTReader &Reader,
IndexCtx.setASTContext(Ctx);		IndexCtx.setASTContext(Ctx);
DataConsumer.initialize(Ctx);		DataConsumer.initialize(Ctx);

for (const Decl *D : Reader.getModuleFileLevelDecls(Mod)) {		for (const Decl *D : Reader.getModuleFileLevelDecls(Mod)) {
IndexCtx.indexTopLevelDecl(D);		IndexCtx.indexTopLevelDecl(D);
}		}
DataConsumer.finish();		DataConsumer.finish();
}		}

		//===----------------------------------------------------------------------===//
		// Index Data Recording
		//===----------------------------------------------------------------------===//

		/// Construct a \c UnitDetails for a translation unit with the provided root
		/// \c FileEntry or \c Module and with the provided sysroot path.
		static index::UnitDetails getUnitDetails(const CompilerInstance &CI,
		std::string OutputFile,
		ioeric (Done): This seems to be related to files. Maybe `FileIndexDataCollector`?
		const FileEntry *RootFile,
		Module *UnitMod,
		StringRef SysrootPath) {
		std::string ModuleName =
		UnitMod ? UnitMod->getFullModuleName() : std::string();
		bool IsSystemUnit = UnitMod ? UnitMod->IsSystem : false;
		bool IsModuleUnit = UnitMod != nullptr;
		ioeric (Done): `override`
		bool IsDebugCompilation = CI.getCodeGenOpts().OptimizationLevel == 0;

		// Ignore sysroot path if it points to root, otherwise every header will be
		// treated as system one.
		ioeric (Done): Simply `begin`, if the class is called `FileIndexDataCollector`. Similarly below, to match the iterator naming convention.
		if (llvm::sys::path::root_path(SysrootPath) == SysrootPath)
		SysrootPath = "";

		return {CI, UnitMod, ModuleName,
		RootFile, OutputFile, SysrootPath,
		IsSystemUnit, IsModuleUnit, IsDebugCompilation};
		}
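
Reviewer note: the check above discards a sysroot that is just the filesystem root, since otherwise every absolute header path would fall under it. The sketch below shows the idea with plain strings. It is a POSIX-only simplification and not equivalent to `llvm::sys::path::root_path`; how the patch actually classifies a file as a system file is not shown in this hunk, so `isUnderSysroot` and its prefix comparison are assumptions made for the illustration.

```cpp
#include <cassert>
#include <string>

// POSIX-only simplification: a root path is a non-empty run of '/'.
inline bool isFilesystemRoot(const std::string &Path) {
  return !Path.empty() && Path.find_first_not_of('/') == std::string::npos;
}

// Mirrors the hunk: a sysroot that is the root is treated as unset.
inline std::string effectiveSysroot(const std::string &Sysroot) {
  return isFilesystemRoot(Sysroot) ? std::string() : Sysroot;
}

// Hypothetical classification: a file is "system" when it lies below the
// effective sysroot. Assumes Sysroot has no trailing separator.
inline bool isUnderSysroot(const std::string &File,
                           const std::string &Sysroot) {
  const std::string Root = effectiveSysroot(Sysroot);
  if (Root.empty())
    return false;
  return File.size() > Root.size() &&
         File.compare(0, Root.size(), Root) == 0 &&
         File[Root.size()] == '/';
}
```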

		/// Construct a \c UnitDetails from the invocation associated with the provided
		/// \c CompilerInstance and the provided sysroot path.
		gribozavr (Not Done): Please don't duplicate type information from the signature in the comment.
		static index::UnitDetails getUnitDetails(const CompilerInstance &CI,
		ioeric (Done): I think this should be `public` as this is still implementing `IndexDataConsumer`.
		StringRef SysrootPath) {
		SourceManager &SM = CI.getASTContext().getSourceManager();

		std::string OutputFile = CI.getFrontendOpts().OutputFile;
		if (OutputFile.empty()) {
		OutputFile = CI.getFrontendOpts().Inputs[0].getFile();
		OutputFile += ".o";
		gribozavr (Not Done): I don't understand... this is not really the user-specified output file.
		}

		const FileEntry *RootFile = nullptr;
		Module *UnitMod = nullptr;
		bool IsModuleGeneration = CI.getLangOpts().isCompilingModule();
		if (!IsModuleGeneration &&
		CI.getFrontendOpts().ProgramAction != frontend::GeneratePCH)
		RootFile = SM.getFileEntryForID(SM.getMainFileID());

		if (IsModuleGeneration) {
		HeaderSearch &HS = CI.getPreprocessor().getHeaderSearchInfo();
		UnitMod = HS.lookupModule(CI.getLangOpts().CurrentModule,
		/*AllowSearch=*/false);
		assert(UnitMod && "only loaded modules should be indexed");
		}
		return getUnitDetails(CI, std::move(OutputFile), RootFile, UnitMod,
		SysrootPath);
		}

		/// Construct a \c UnitDetails for the given module file.
		gribozavr (not done): Please don't duplicate type information from the signature in the comment.
		static index::UnitDetails getUnitDetails(serialization::ModuleFile &Mod,
		const CompilerInstance &CI,
		StringRef SysrootPath) {
		HeaderSearch &HS = CI.getPreprocessor().getHeaderSearchInfo();
		Module *UnitMod = HS.lookupModule(Mod.ModuleName, /*AllowSearch=*/false);
		assert(UnitMod && "only loaded modules should be indexed");

		return getUnitDetails(CI, /*OutputFile=*/Mod.FileName, /*RootFile=*/nullptr,
		UnitMod, SysrootPath);
		}

		ioeric (done): Again, you don't need the full `IndexingContext` and `RecordOptions` here.
		namespace {

		/// Collects and groups consumed index data by \c FileID.
		class FileIndexDataCollector : public IndexDataConsumer {
		std::shared_ptr<Preprocessor> PP;
		typedef llvm::DenseMap<FileID, std::unique_ptr<FileIndexData>>
		IndexDataByFileTy;
		IndexDataByFileTy IndexDataByFile;

		public:
		ioeric (done): Note that `getDecomposedExpansionLoc` can also return an invalid decomposed loc.
		void setPreprocessor(std::shared_ptr<Preprocessor> PreProc) override {
		PP = PreProc;
		}

		IndexDataByFileTy::const_iterator begin() const {
		return IndexDataByFile.begin();
		}

		IndexDataByFileTy::const_iterator end() const {
		return IndexDataByFile.end();
		ioeric (done): I'd simply do: `if (FileIncludeFilter == UnitIndexingOptions::FileIncludeFilterKind::UserOnly) if (isSystem...) return;`
		}

		bool empty() const { return IndexDataByFile.empty(); }

		ioeric (done): Do we want better error handling here?
		bool handleDeclOccurence(const Decl *D, SymbolRoleSet Roles,
		ArrayRef<SymbolRelation> Relations,
		SourceLocation Loc, bool IsInSystemFile,
		ASTNodeInfo ASTNode) override {
		ASTContext &Ctx = D->getASTContext();
		SourceManager &SM = Ctx.getSourceManager();
		Loc = SM.getFileLoc(Loc);
		FileID FID = SM.getFileID(Loc);
		unsigned Offset = SM.getFileOffset(Loc);

		ioeric (done): Same here. This should be `public`.
		// Ignore occurrences in the predefines buffer
		if (FID == PP->getPredefinesFileID())
		return true;

		FileIndexData &FileData = getFileIndexData(FID, IsInSystemFile);
		FileData.addDeclOccurence(Roles, Offset, D, Relations);
		return true;
		}

		ioeric (done): Please provide documentation.
		private:
		FileIndexData &getFileIndexData(FileID FID, bool IsInSystemFile) {
		auto &Entry = IndexDataByFile[FID];
		if (!Entry) {
		Entry.reset(new FileIndexData(FID, IsInSystemFile));
		}
		return *Entry;
		}
		ioeric (done): The naming convention for the callback interfaces is `forEach*`, e.g. `forEachFileDependency`. s/visitor/Callback/ (same below).
		};

		struct IncludeLocation {
		const FileEntry *Source;
		ioeric (done): `forEachInclude`
		const FileEntry *Target;
		unsigned Line;
		};
		ioeric (done): `forEachModuleImport`

		/// Preprocessor callbacks to collect file to file inclusion information
		class IncludePPCallbacks : public PPCallbacks {
		SystemFileCache &SystemCache;
		UnitIndexingOptions::FileIncludeFilterKind FileIncludeFilter;
		std::vector<IncludeLocation> &Includes;
		SourceManager &SourceMgr;

		ioeric (done): This is two classes in one, which is difficult to understand. Could you split it into `FileIndexDependencyCollector` and `FileIndexDependencyProvider` and have `FileIndexDependencyCollector` return a provider on finish (e.g. `Provider consume();`; you might want to copy/move the collected data into the provider)? It would be easier to justify the behavior (e.g. what happens when you access the provider while the collector is still working?).
		public:
		IncludePPCallbacks(SystemFileCache &SystemCache,
		UnitIndexingOptions::FileIncludeFilterKind IncludeFilter,
		std::vector<IncludeLocation> &IncludesForFile,
		ioeric (done): What does `Entries` contain? What files are added?
		SourceManager &SourceMgr)
		: SystemCache(SystemCache), FileIncludeFilter(IncludeFilter),
		Includes(IncludesForFile), SourceMgr(SourceMgr) {}

		virtual void InclusionDirective(SourceLocation HashLoc,
		const Token &IncludeTok, StringRef FileName,
		bool IsAngled, CharSourceRange FilenameRange,
		const FileEntry *File, StringRef SearchPath,
		StringRef RelativePath,
		const Module *Imported,
		SrcMgr::CharacteristicKind FileTy) override {
		ioeric (done): `IsSystemFileCache &SysrootPath`? What is this parameter?
		if (HashLoc.isFileID() && File && File->isValid())
		addInclude(HashLoc, File);
		}

		private:
		void addInclude(SourceLocation From, const FileEntry *To) {
		assert(To);
		if (FileIncludeFilter == UnitIndexingOptions::FileIncludeFilterKind::None)
		return;

		std::pair<FileID, unsigned> LocInfo =
		SourceMgr.getDecomposedExpansionLoc(From);

		if (LocInfo.first.isInvalid())
		return; // Ignore invalid locations.

		if (FileIncludeFilter ==
		UnitIndexingOptions::FileIncludeFilterKind::UserOnly)
		if (SystemCache.isSystem(LocInfo.first, SourceMgr))
		return; // Ignore includes of system headers.

		if (auto *FE = SourceMgr.getFileEntryForID(LocInfo.first)) {
		auto lineNo = SourceMgr.getLineNumber(LocInfo.first, LocInfo.second);
		Includes.push_back({FE, To, lineNo});
		}
		}
		};

		/// Abstract interface for providing the file and module dependencies of a
		/// translation unit, as well as the set of file to file inclusions
		class IndexDependencyProvider {
		public:
		virtual ~IndexDependencyProvider() {}

		virtual void forEachFileDependency(
		const CompilerInstance &CI,
		llvm::function_ref<void(const FileEntry *FE, bool IsSystem)> Callback)
		const = 0;

		virtual void
		forEachInclude(llvm::function_ref<void(const FileEntry *Source, unsigned Line,
		const FileEntry *Target)>
		Callback) const = 0;
		virtual void forEachModuleImport(
		const CompilerInstance &CI,
		llvm::function_ref<void(serialization::ModuleFile &Mod, bool IsSystem)>
		Callback) const = 0;
		};

		/// An IndexDependencyProvider for the index data collected by
		/// \c FileIndexDependencyCollector.
		class FileIndexDependencyProvider : public IndexDependencyProvider {
		llvm::SetVector<const FileEntry *> Files;
		llvm::BitVector IsSystemByUID;
		std::vector<IncludeLocation> Includes;
		bool IncludeSysModules;

		public:
		FileIndexDependencyProvider(llvm::SetVector<const FileEntry *> Entries,
		llvm::BitVector IsSystemByUID,
		std::vector<IncludeLocation> Includes,
		bool IncludeSysDeps)
		: Files(std::move(Entries)), IsSystemByUID(std::move(IsSystemByUID)),
		Includes(std::move(Includes)), IncludeSysModules(IncludeSysDeps) {}

		void forEachFileDependency(
		const CompilerInstance &CI,
		llvm::function_ref<void(const FileEntry *FE, bool IsSystem)> Callback)
		const override {
		for (auto *FE : Files)
		Callback(FE, isSystemFile(FE));
		}

		void
		forEachInclude(llvm::function_ref<void(const FileEntry *Source, unsigned Line,
		const FileEntry *Target)>
		Callback) const override {
		for (auto &Include : Includes)
		Callback(Include.Source, Include.Line, Include.Target);
		}

		void forEachModuleImport(
		const CompilerInstance &CI,
		llvm::function_ref<void(serialization::ModuleFile &Mod, bool IsSystem)>
		Callback) const override {
		HeaderSearch &HS = CI.getPreprocessor().getHeaderSearchInfo();

		if (auto Reader = CI.getModuleManager()) {
		Reader->getModuleManager().visit(
		ioeric (done): Please document this class. This can be easily confused with `IndexActionBase`, which has a similar name. Same for `IndexAction`/`IndexRecordAction` and `WrappingIndexAction`/`WrappingIndexRecordAction`. I think these pairs (especially the wrapping actions) share some common logic and could probably be merged.
		[&](serialization::ModuleFile &Mod) -> bool {
		bool IsSystemMod = false;
		if (Mod.isModule()) {
		if (auto *M =
		HS.lookupModule(Mod.ModuleName, /*AllowSearch=*/false))
		IsSystemMod = M->IsSystem;
		}
		if (!IsSystemMod || IncludeSysModules)
		Callback(Mod, IsSystemMod);
		return true; // skip module dependencies.
		});
		}
		}

		private:
		bool isSystemFile(const FileEntry *FE) const {
		auto UID = FE->getUID();
		return IsSystemByUID.size() > UID && IsSystemByUID[UID];
		}
		};
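
The `isSystemFile` lookup above relies on a bit vector indexed by file UID, where a UID that was never recorded must read as "not system". A minimal sketch of that pattern, using `std::vector<bool>` in place of `llvm::BitVector`, is given here; `SystemFlagTable` and its method names are hypothetical and exist only for this illustration.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a UID-indexed bit vector (the patch itself uses
// llvm::BitVector). Unrecorded UIDs are reported as non-system.
class SystemFlagTable {
  std::vector<bool> Flags;

public:
  // Grow the table on demand so that UID is a valid index, then store the flag.
  void record(std::size_t UID, bool IsSystem) {
    if (Flags.size() < UID + 1)
      Flags.resize(UID + 1);
    Flags[UID] = IsSystem;
  }

  // Bounds-checked lookup: out-of-range UIDs are treated as "not system".
  bool isSystem(std::size_t UID) const {
    return UID < Flags.size() && Flags[UID];
  }
};
```

The bounds check in the lookup is what allows the table to stay sparse: only files that were actually seen cause it to grow.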

		/// Collects file and module dependency information for a translation unit,
		/// including file to file inclusions.
		class FileIndexDependencyCollector : public DependencyCollector {
		SystemFileCache &SystemCache;
		UnitIndexingOptions IndexOpts;
		ioeric (done): This does a lot of stuff... please document the behavior!
		llvm::SetVector<const FileEntry *> SeenFiles;
		llvm::BitVector IsSystemByUID;
		ioeric (done): Instead of passing `ParentUnitConsumer`, consider checking the `Mod` before calling the function.
		std::vector<IncludeLocation> Includes;
		SourceManager *SourceMgr = nullptr;

		ioeric (done): Non-factory static method is often a code smell. Any reason not to make these static methods private members? With that, you wouldn't need to pass along so many parameters. You could make them `const` if you don't want members to be modified.
		nathawes (not done): Sorry, there's missing context: they're used from another public API that's in the follow-up patch. I'll bring that over and make these top-level static functions, since they don't belong exclusively to IndexDataConsumerActionImpl.
		public:
		FileIndexDependencyCollector(SystemFileCache &SystemCache,
		UnitIndexingOptions IndexOpts)
		: SystemCache(SystemCache), IndexOpts(IndexOpts) {}

		void attachToPreprocessor(Preprocessor &PP) override {
		DependencyCollector::attachToPreprocessor(PP);
		ioeric (not done): Why is this overload public while others are private? Aren't they all used only in this class?
		nathawes (not done): Same as above: this is called from a public `index::` API in the follow-up patch.
		PP.addPPCallbacks(llvm::make_unique<IncludePPCallbacks>(
		SystemCache, IndexOpts.FileIncludeFilter, Includes,
		PP.getSourceManager()));
		}

		void setSourceManager(SourceManager *SourceMgr) {
		this->SourceMgr = SourceMgr;
		}

		FileIndexDependencyProvider consume() {
		return FileIndexDependencyProvider(
		std::move(SeenFiles), std::move(IsSystemByUID), std::move(Includes),
		ioeric (done): Can we get this state from the base class instead of maintaining another state, which seems to be identical?
		nathawes (done): I don't see this state in either base class (WrapperFrontendAction and IndexRecordActionBase). WrappingIndexAction and WrappingIndexRecordAction both have this, though. Were you thinking of a new intermediate common base class between them and WrapperFrontendAction?
		ioeric (done): I thought this could be a state in the `WrapperFrontendAction` since both derived classes maintain this state, but after a closer look, this seems to depend on both base classes. I'm not a big fan of maintaining states in multi-stage classes (e.g. `FrontendAction`), which could be confusing and hard to follow; I think `IndexRecordActionBase::finish(...)` should be able to handle the case where no index consumer is created (i.e. no record/dependency/... is collected). Also, `IndexRecordActionBase` (and the existing `IndexActionBase`) should really be a component instead of a base class since none of its methods is `virtual`.
		IndexOpts.IncludeSystemDependencies);
		}

		private:
		bool needSystemDependencies() override {
		return IndexOpts.IncludeSystemDependencies;
		}

		ioeric (done): Any reason to close the anonymous namespace here? Shouldn't outlined definitions of `UnitDataConsumerActionImpl`'s methods also be in the anonymous namespace?
		bool sawDependency(StringRef Filename, bool FromModule, bool IsSystem,
		bool IsModuleFile, bool IsMissing) override {
		bool SawIt = DependencyCollector::sawDependency(
		Filename, FromModule, IsSystem, IsModuleFile, IsMissing);
		if (auto *FE = SourceMgr->getFileManager().getFile(Filename)) {
		if (SawIt)
		SeenFiles.insert(FE);

		// Record system-ness for all files that we pass through.
		if (IsSystemByUID.size() < FE->getUID() + 1)
		IsSystemByUID.resize(FE->getUID() + 1);
		IsSystemByUID[FE->getUID()] = IsSystem || isInSysroot(Filename);
		ioeric (done): Just `StringRef BuildNumber = RepositoryPath;`
		}
		return SawIt;
		}

		bool isInSysroot(StringRef Filename) {
		StringRef SysrootPath = SystemCache.getSysrootPath();
		return !SysrootPath.empty() && Filename.startswith(SysrootPath);
		}
		};
		} // anonymous namespace
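
The collector/provider split requested in review (a collector that gathers state and then hands it off through `consume()`) can be sketched independently of Clang. In the sketch below, `DependencyAccumulator` and `DependencySnapshot` are hypothetical names with simplified `std::string` payloads; they only illustrate the hand-off and are not the classes from the patch.

```cpp
#include <functional>
#include <string>
#include <utility>
#include <vector>

// Hypothetical read-only view over data that has finished being collected.
class DependencySnapshot {
  std::vector<std::string> Files;

public:
  explicit DependencySnapshot(std::vector<std::string> Files)
      : Files(std::move(Files)) {}

  // Visit each collected file in insertion order.
  void forEachFile(
      const std::function<void(const std::string &)> &Callback) const {
    for (const std::string &F : Files)
      Callback(F);
  }
};

// Hypothetical mutable collector. consume() moves the gathered data into a
// snapshot, so readers never observe a collection that is still in progress.
class DependencyAccumulator {
  std::vector<std::string> Seen;

public:
  void saw(std::string File) { Seen.push_back(std::move(File)); }

  DependencySnapshot consume() {
    DependencySnapshot Result(std::move(Seen));
    Seen.clear(); // Leave the accumulator in a known, empty state.
    return Result;
  }
};
```

Because the snapshot owns its data, the question raised in review (what happens if the provider is read while the collector is still working) does not arise in this arrangement: later `saw` calls affect only a subsequent snapshot.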

		static void reportData(const CompilerInstance &CI,
		const FileIndexDataCollector &Collector,
		const IndexDependencyProvider &DepProvider,
		UnitDetails UnitInfo,
		const IndexUnitDataConsumerFactory &UnitConsumerFactory,
		const UnitIndexingOptions &IndexOpts) {

		std::unique_ptr<UnitIndexDataConsumer> Consumer =
		UnitConsumerFactory(UnitInfo);
		if (!Consumer)
		return;

		DepProvider.forEachFileDependency(
		CI, [&](const FileEntry *FE, bool IsSystemFile) {
		Consumer->handleFileDependency(FE, IsSystemFile);
		});
		DepProvider.forEachInclude(
		[&](const FileEntry *Source, unsigned Line, const FileEntry *Target) {
		Consumer->handleInclude(Source, Line, Target);
		});
		DepProvider.forEachModuleImport(
		CI, [&](serialization::ModuleFile &Mod, bool IsSystemMod) {
		Consumer->handleModuleImport(Mod, IsSystemMod);
		if (Mod.isModule() && Consumer->shouldIndexModuleDependency(Mod))
		indexModuleFile(Mod, CI, UnitConsumerFactory, IndexOpts);
		});

		malaperle (done): As a first attempt, I tried to use index::createIndexDataRecordingAction in combination with ASTUnit::LoadFromCompilerInvocationAction, but one problem is that right before it calls EndSourceFileAction in LoadFromCompilerInvocationAction, it calls transferASTDataFromCompilerInstance, which means that the SourceManager in CompilerInstance is nulled out as it gets "transferred" to the AST. So this line crashes in this case. To be fair, at this point I don't need the ASTUnit so I can look at executing the action differently, but I thought I'd point it out!
		for (auto I = Collector.begin(), E = Collector.end(); I != E; ++I) {
		FileID FID = I->first;
		const FileIndexData &FileData = *I->second;
		if (Consumer->handleFileOccurrences(
		FID, FileData.getDeclOccurrencesSortedByOffset(),
		FileData.isSystem()))
		return;
		}

		Consumer->finish();
		ioeric (done): nit: no need for braces. Same below.
		}

		namespace {

		/// An implementation for IndexAction or WrappingIndexAction that gathers decl
		/// occurrence, file inclusion and dependency information for the translation
		/// unit and, optionally, its module dependencies.
		class UnitDataConsumerActionImpl : public IndexActionImpl {
		UnitIndexingOptions IndexOpts;
		FileIndexDataCollector Collector;
		IndexingContext IndexCtx;
		FileIndexDependencyCollector DepCollector;
		ioeric (done): In the previous patch, `writeUnitData` does several things including handling modules, dependencies, includes and index records, as well as writing data. It might make sense to add an abstract class (`UnitDataCollector`?) that defines interfaces which make these behaviors more explicit. We can then have users pass in an implementation via `createIndexDataRecordingAction`, which would also decouple the data collection from data storage in the library.
		IndexUnitDataConsumerFactory UnitConsumerFactory;

		public:
		UnitDataConsumerActionImpl(UnitIndexingOptions UnitIndexOpts,
		IndexUnitDataConsumerFactory UnitConsumerFactory)
		: IndexOpts(UnitIndexOpts), IndexCtx(UnitIndexOpts, Collector),
		DepCollector(IndexCtx.getSystemCache(), IndexOpts),
		UnitConsumerFactory(std::move(UnitConsumerFactory)) {}

		std::unique_ptr<IndexASTConsumer>
		createIndexASTConsumer(CompilerInstance &CI) override {
		IndexCtx.setSysrootPath(CI.getHeaderSearchOpts().Sysroot);

		std::shared_ptr<Preprocessor> PP = CI.getPreprocessorPtr();
		Collector.setPreprocessor(PP);
		DepCollector.setSourceManager(&CI.getSourceManager());
		DepCollector.attachToPreprocessor(CI.getPreprocessor());

		return llvm::make_unique<IndexASTConsumer>(PP, IndexCtx);
		}

		/// Provides the collected indexing info to the \c IndexUnitDataConsumer
		void finish(CompilerInstance &CI) override {
		// The consumer may emit more diagnostics so do the begin/end source file
		// invocations on the diagnostic client.
		// FIXME: FrontendAction::EndSourceFile() should probably not call
		// 'CI.getDiagnosticClient().EndSourceFile()' until after it has called
		// 'EndSourceFileAction()', so that code executing during
		// EndSourceFileAction() can emit diagnostics. If this is fixed,
		// DiagClientBeginEndRAII can go away.
		struct DiagClientBeginEndRAII {
		CompilerInstance &CI;
		DiagClientBeginEndRAII(CompilerInstance &CI) : CI(CI) {
		CI.getDiagnosticClient().BeginSourceFile(CI.getLangOpts());
		}
		ioeric (done): I'm a bit nervous about propagating the entire `FrontendOptions` into the index library. I would simply expose `getIndexOptionsFromFrontendOptions` and have callers parse `FrontendOptions` and pass in only index-related options.
		~DiagClientBeginEndRAII() { CI.getDiagnosticClient().EndSourceFile(); }
		} diagClientBeginEndRAII(CI);

		Collector.finish();
		reportData(CI, Collector, DepCollector.consume(),
		getUnitDetails(CI, IndexCtx.getSysrootPath()),
		UnitConsumerFactory, IndexOpts);
		}
		};
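
The `DiagClientBeginEndRAII` helper in `finish` above is an instance of the scope-guard idiom: a "begin" notification is sent on construction and the matching "end" notification is sent on destruction, whichever way the scope is left. A minimal sketch of the idiom follows; `EventLog`, `BeginEndGuard` and `finishWithGuard` are hypothetical names, and a vector of strings stands in for the diagnostic client.

```cpp
#include <string>
#include <vector>

// Hypothetical stand-in for a client that must see paired begin/end calls.
struct EventLog {
  std::vector<std::string> Events;
};

// Scope guard: records "begin" on construction and "end" on destruction.
class BeginEndGuard {
  EventLog &Log;

public:
  explicit BeginEndGuard(EventLog &Log) : Log(Log) {
    Log.Events.push_back("begin");
  }
  ~BeginEndGuard() { Log.Events.push_back("end"); }

  // Non-copyable, so "end" is recorded exactly once per guard.
  BeginEndGuard(const BeginEndGuard &) = delete;
  BeginEndGuard &operator=(const BeginEndGuard &) = delete;
};

// Work performed inside the guarded scope is bracketed by begin/end.
inline void finishWithGuard(EventLog &Log) {
  BeginEndGuard Guard(Log);
  Log.Events.push_back("report");
}
```

The guard keeps the pairing correct on early returns as well, which is the property the FIXME in `finish` depends on until the ordering in `FrontendAction::EndSourceFile()` is changed.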

		/// Provides the file and module dependency information for a \c ModuleFile
		class ModuleFileIndexDependencyCollector : public IndexDependencyProvider {
		serialization::ModuleFile &ModFile;
		bool CollectSystemDependencies;

		public:
		ModuleFileIndexDependencyCollector(serialization::ModuleFile &Mod,
		bool CollectSystemDependencies)
		: ModFile(Mod), CollectSystemDependencies(CollectSystemDependencies) {}

		void forEachFileDependency(
		const CompilerInstance &CI,
		llvm::function_ref<void(const FileEntry *FE, bool IsSystem)> Callback)
		const override {
		auto Reader = CI.getModuleManager();
		Reader->visitInputFiles(
		ModFile, CollectSystemDependencies, /*Complain=*/false,
		[&](const serialization::InputFile &IF, bool IsSystem) {
		auto *FE = IF.getFile();
		if (!FE)
		return;
		// Ignore module map files, they are not as important to track as
		// source files and they may be auto-generated which would create an
		// undesirable dependency on an intermediate build byproduct.
		if (FE->getName().endswith("module.modulemap"))
		return;

		Callback(FE, IsSystem);
		});
		}

		void
		forEachInclude(llvm::function_ref<void(const FileEntry *Source, unsigned Line,
		const FileEntry *Target)>
		Callback) const override {
		// FIXME: Module files without a preprocessing record do not have info about
		// include locations. Serialize enough data to be able to retrieve such
		// info.
		}

		void forEachModuleImport(
		const CompilerInstance &CI,
		llvm::function_ref<void(serialization::ModuleFile &Mod, bool IsSystem)>
		Callback) const override {
		HeaderSearch &HS = CI.getPreprocessor().getHeaderSearchInfo();
		for (auto *Mod : ModFile.Imports) {
		bool IsSystemMod = false;
		if (auto *M = HS.lookupModule(Mod->ModuleName, /*AllowSearch=*/false))
		IsSystemMod = M->IsSystem;
		if (!IsSystemMod || CollectSystemDependencies)
		Callback(*Mod, IsSystemMod);
		}
		}
		};
		} // anonymous namespace.

		void index::indexModuleFile(serialization::ModuleFile &Mod,
		const CompilerInstance &CI,
		arphaman (done): We might want to start using a new diagnostic group for index-while-building errors instead of the custom ones.
		IndexUnitDataConsumerFactory UnitConsumerFactory,
		UnitIndexingOptions IndexOpts) {

		DiagnosticsEngine &Diag = CI.getDiagnostics();
		Diag.Report(Mod.ImportLoc, diag::remark_index_producing_module_file_data)
		<< Mod.FileName;

		FileIndexDataCollector Collector;
		IndexingContext ModIndexCtx(IndexOpts, Collector);

		auto &ASTCtx = CI.getASTContext();
		Collector.initialize(ASTCtx);
		Collector.setPreprocessor(CI.getPreprocessorPtr());
		ModIndexCtx.setASTContext(ASTCtx);
		ioeric (done): Please provide a brief documentation for this class.
		ModIndexCtx.setSysrootPath(CI.getHeaderSearchOpts().Sysroot);

		ioeric (done): Again, it doesn't seem necessary for this class to have information about all record options. It seems that you only need `RecordSystemDependencies` here.
		for (const Decl *D : CI.getModuleManager()->getModuleFileLevelDecls(Mod))
		ModIndexCtx.indexTopLevelDecl(D);

		Collector.finish();

		ModuleFileIndexDependencyCollector DepCollector(
		Mod, IndexOpts.IncludeSystemDependencies);

		reportData(CI, Collector, DepCollector,
		getUnitDetails(Mod, CI, ModIndexCtx.getSysrootPath()),
		ioeric (done): readability nit: avoid using `auto` if the return type is short to spell but hard to infer from the value expression. Same elsewhere.
		UnitConsumerFactory, IndexOpts);
		}

		std::unique_ptr<FrontendAction>
		index::createUnitIndexingAction(IndexUnitDataConsumerFactory ConsumerFactory,
		UnitIndexingOptions IndexOpts,
		std::unique_ptr<FrontendAction> WrappedAction) {
		auto ActionImpl = llvm::make_unique<UnitDataConsumerActionImpl>(
		std::move(IndexOpts), ConsumerFactory);
		if (WrappedAction)
		return llvm::make_unique<WrappingIndexAction>(std::move(WrappedAction),
		std::move(ActionImpl));
		return llvm::make_unique<IndexAction>(std::move(ActionImpl));
		};

		std::unique_ptr<FrontendAction> index::createIndexDataRecordingAction(
		RecordingOptions RecordOpts,
		std::unique_ptr<FrontendAction> WrappedAction) {

		auto ConsumerFactory =
		[RecordOpts](
		UnitDetails UnitInfo) -> std::unique_ptr<UnitIndexDataConsumer> {
		return llvm::make_unique<UnitIndexDataRecorder>(std::move(UnitInfo),
		RecordOpts);
		};
		return createUnitIndexingAction(ConsumerFactory, std::move(RecordOpts),
		ioeric (done): I think the inheritance of `IndexUnitDataConsumer` and the creation of the factory should be in user code (e.g. the implementation for on-disk persist-index-data should come from the compiler invocation code `ExecuteCompilerInvocation.cpp` or at least a separate file in the library that the compiler invocation can use), and the user should only use `createUnitIndexingAction` by providing a factory. Currently, `createUnitIndexingAction` and `createIndexDataRecordingAction` are mostly identical except for the code that implements `IndexUnitDataConsumer` and creates the factory. The current `createIndexDataRecordingAction` would probably only be used by the compiler invocation, and we can keep the generalized `createUnitIndexingAction` in the public APIs.
		nathawes (not done): `IndexUnitDataRecorder` here is just a stub I added when I split the patch up; the follow-up revision has it in a separate file. I'll move the separate files to this patch and stub out the method bodies with TODOs instead. I've made `createIndexDataRecordingAction` call `createUnitIndexingAction` to remove the duplication, and pulled it, `RecordingOptions` and `getRecordingOptionsFromFrontendOptions` to a new header (`RecordingAction.h`) that `ExecuteCompilerInvocation.cpp` uses. Does that sound ok?
		ioeric (not done): Sounds good. Thanks for the explanation!
		std::move(WrappedAction));
		};
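
The relationship between the two entry points above (a general action that takes a consumer factory, and a convenience wrapper that builds a recording factory from options) can be sketched without Clang types. All names in the sketch below (`SketchOptions`, `SketchUnitInfo`, `SketchConsumer`, `SketchRecorder`, `makeRecorderFactory`) are hypothetical and only illustrate a factory lambda that captures options by value; C++14 is assumed for `std::make_unique`.

```cpp
#include <functional>
#include <memory>
#include <string>
#include <utility>

// Hypothetical, simplified stand-ins for the option and unit-info structs.
struct SketchOptions {
  std::string DataDirPath;
};
struct SketchUnitInfo {
  std::string OutputFile;
};

// Hypothetical consumer interface that a general-purpose action would drive.
class SketchConsumer {
public:
  virtual ~SketchConsumer() = default;
  virtual std::string describe() const = 0;
};

// Hypothetical concrete consumer that remembers where it would write data.
class SketchRecorder : public SketchConsumer {
  SketchUnitInfo Unit;
  SketchOptions Opts;

public:
  SketchRecorder(SketchUnitInfo Unit, SketchOptions Opts)
      : Unit(std::move(Unit)), Opts(std::move(Opts)) {}

  std::string describe() const override {
    return Opts.DataDirPath + ":" + Unit.OutputFile;
  }
};

using SketchConsumerFactory =
    std::function<std::unique_ptr<SketchConsumer>(SketchUnitInfo)>;

// Builds a factory that captures the options by value, so each unit gets its
// own recorder configured identically.
inline SketchConsumerFactory makeRecorderFactory(SketchOptions Opts) {
  return [Opts](SketchUnitInfo Unit) -> std::unique_ptr<SketchConsumer> {
    return std::make_unique<SketchRecorder>(std::move(Unit), Opts);
  };
}
```

With this shape, the general entry point only needs to know about the factory type, which is the decoupling of data collection from data storage that the review discussion asks for.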

		RecordingOptions
		index::getRecordingOptionsFromFrontendOptions(const FrontendOptions &FEOpts) {
		RecordingOptions RecordOpts;
		RecordOpts.DataDirPath = FEOpts.IndexStorePath;
		ioeric (done): The `UnitInfo` is ignored? What do we actually need it for?
		nathawes (not done): It should be passed to IndexUnitDataRecorder to write out info about the unit itself. This was just me splitting the patch badly.
		if (FEOpts.IndexIgnoreSystemSymbols) {
		RecordOpts.SystemSymbolFilter =
		index::IndexingOptions::SystemSymbolFilterKind::None;
		}
		ioeric (done): `Base` doesn't seem to be a very meaningful name here.
		RecordOpts.RecordSymbolCodeGenName = FEOpts.IndexRecordCodegenName;
		return RecordOpts;
		}

		void index::recordIndexDataForModuleFile(serialization::ModuleFile *ModFile,
		RecordingOptions RecordOpts,
		const CompilerInstance &CI) {
		auto UnitConsumerFactory = [RecordOpts](UnitDetails UnitInfo) {
		return llvm::make_unique<UnitIndexDataRecorder>(std::move(UnitInfo),
		RecordOpts);
		};
		return indexModuleFile(*ModFile, CI, UnitConsumerFactory,
		ioeric (done): Could you add a comment explaining why we are not allowing searching?
		std::move(RecordOpts));
		}
		ioeric (done): Just `auto pair = getIndexOptionsFromFrontendOptions(FEOpts);` and then use `pair.first` and `pair.second`? Same below.
		ioeric (done): nit: redundant empty line
		ioeric (done): It's a bit worrying that `IndexDataRecorder` and `IndexContext` reference each other. If you only need some information from the `IndexingContext`, simply pass it into `Recorder`. In this case, I think you only need the `SourceManager` from the `ASTContext` in the recorder to calculate whether a file is a system header. I see you also cache the result of `IndexingContext::isSystemFile` in the indexing context, but I think it would be more sensible for the callers to handle caching for this call.
		nathawes (done): Good point. The IndexingContext was actually already calling IsSystemFile before it calls IndexDataRecorder's handleDeclOccurrence and handleModuleOccurrence anyway, so I'll change it to pass that through as an extra param and remove IndexDataRecorder's dependency on the IndexingContext.
		ioeric (done): nit: no braces around one liners.

lib/Index/IndexingContext.h

	//===- IndexingContext.h - Indexing context data ----------------*- C++ -*-===//			//===- IndexingContext.h - Indexing context data ----------------*- C++ -*-===//
	//			//
	// The LLVM Compiler Infrastructure			// The LLVM Compiler Infrastructure
	//			//
	// This file is distributed under the University of Illinois Open Source			// This file is distributed under the University of Illinois Open Source
	// License. See LICENSE.TXT for details.			// License. See LICENSE.TXT for details.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef LLVM_CLANG_LIB_INDEX_INDEXINGCONTEXT_H			#ifndef LLVM_CLANG_LIB_INDEX_INDEXINGCONTEXT_H
	#define LLVM_CLANG_LIB_INDEX_INDEXINGCONTEXT_H			#define LLVM_CLANG_LIB_INDEX_INDEXINGCONTEXT_H

	#include "clang/Basic/LLVM.h"			#include "clang/Basic/LLVM.h"
				#include "clang/Basic/SourceLocation.h"
	#include "clang/Index/IndexSymbol.h"			#include "clang/Index/IndexSymbol.h"
	#include "clang/Index/IndexingAction.h"			#include "clang/Index/IndexingAction.h"
	#include "llvm/ADT/ArrayRef.h"			#include "llvm/ADT/ArrayRef.h"
				#include "llvm/ADT/DenseMap.h"

	namespace clang {			namespace clang {
	class ASTContext;			class ASTContext;
	class Decl;			class Decl;
	class DeclGroupRef;			class DeclGroupRef;
	class ImportDecl;			class ImportDecl;
	class TagDecl;			class TagDecl;
	class TypeSourceInfo;			class TypeSourceInfo;
	class NamedDecl;			class NamedDecl;
	class ObjCMethodDecl;			class ObjCMethodDecl;
	class DeclContext;			class DeclContext;
	class NestedNameSpecifierLoc;			class NestedNameSpecifierLoc;
	class Stmt;			class Stmt;
	class Expr;			class Expr;
	class TypeLoc;			class TypeLoc;
	class SourceLocation;			class DirectoryEntry;

	namespace index {			namespace index {
	class IndexDataConsumer;			class IndexDataConsumer;

				/// Tracks the current system root path and computes and caches whether a
				/// file is considered a system file or not
				gribozavr (Not Done): Please add a period at the end of the comment.
				class SystemFileCache {
				ioeric (Done): This name is really confusing... `Is*` is usually used for booleans. Simply call this `SystemFileCache`.
				std::string SysrootPath;
				// Records whether a directory entry is system or not.
				llvm::DenseMap<const DirectoryEntry *, bool> DirEntries;
				gribozavr (Not Done): DirEntries => IsSystemDirEntry?
				// Keeps track of the last check for whether a FileID is system or
				// not. This is used to speed up isSystemFile() call.
				gribozavr (Not Done): Triple slashes for doc comments.
				gribozavr (Not Done): Unclear how a boolean can keep track of the last check. Did you mean "Whether the file is a system file or not. This value is a cache." If so, please rename the variable to something like IsSystemFileCache.
				std::pair<FileID, bool> LastFileCheck;

				public:
				SystemFileCache() = default;
				SystemFileCache(std::string SysrootPath);

				void setSysrootPath(StringRef path);
				ioeric (Done): How does this affect the existing cached results? Do you need to invalidate them?
				StringRef getSysrootPath() const { return SysrootPath; }
				bool isSystem(FileID FID, SourceManager &SM);
				};

				/// Generates and reports indexing data to the provided \c IndexDataConsumer
				/// for any AST nodes passed to its various \c index* methods.
	class IndexingContext {			class IndexingContext {
				ioeric (Done): Please define the scope of this class to avoid throwing random states into it, which usually happens to a "context" class.
	IndexingOptions IndexOpts;			IndexingOptions IndexOpts;
				SystemFileCache SystemCache;
	IndexDataConsumer &DataConsumer;			IndexDataConsumer &DataConsumer;
	ASTContext *Ctx = nullptr;			ASTContext *Ctx = nullptr;
				ioeric (Done): I think it would be more straightforward to have context own the cache. If `setSysrootPath` is the problem, it might make sense to propagate it via the context or, if necessary, create a new cache when a new `SysrootPath` is set.

	public:			public:
	IndexingContext(IndexingOptions IndexOpts, IndexDataConsumer &DataConsumer)			IndexingContext(IndexingOptions IndexOpts, IndexDataConsumer &DataConsumer)
	: IndexOpts(IndexOpts), DataConsumer(DataConsumer) {}			: IndexOpts(IndexOpts), DataConsumer(DataConsumer) {}

	const IndexingOptions &getIndexOpts() const { return IndexOpts; }			const IndexingOptions &getIndexOpts() const { return IndexOpts; }
				SystemFileCache &getSystemCache() { return SystemCache; }
	IndexDataConsumer &getDataConsumer() { return DataConsumer; }			IndexDataConsumer &getDataConsumer() { return DataConsumer; }

	void setASTContext(ASTContext &ctx) { Ctx = &ctx; }			void setASTContext(ASTContext &ctx) { Ctx = &ctx; }

				void setSysrootPath(StringRef path) { SystemCache.setSysrootPath(path); }
				StringRef getSysrootPath() const { return SystemCache.getSysrootPath(); }

	bool shouldIndex(const Decl *D);			bool shouldIndex(const Decl *D);

	const LangOptions &getLangOpts() const;			const LangOptions &getLangOpts() const;

	bool shouldSuppressRefs() const {			bool shouldSuppressRefs() const {
	return false;			return false;
	}			}

	[69 lines not shown]
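
The caching and invalidation questions raised in the comments above can be modelled in a few lines. The sketch below is a Clang-free approximation, not the patch's `SystemFileCache`: `ToySystemFileCache`, the `int` file IDs, the string directories, the hard-coded `"/"` root test, and the `computations()` counter are all assumptions made for illustration. It shows the two cache levels (last file checked, per-directory answers) and why both must be dropped when the sysroot changes.

```cpp
#include <string>
#include <unordered_map>
#include <utility>

// Hypothetical model of the cache: file IDs are ints (assumed non-negative,
// with -1 meaning "none") and directories are plain strings.
class ToySystemFileCache {
  std::string SysrootPath;
  std::unordered_map<std::string, bool> IsSystemDir; // per-directory answers
  std::pair<int, bool> LastFileCheck{-1, false};     // most recent lookup
  unsigned Computations = 0; // how often the prefix check actually ran

public:
  void setSysrootPath(const std::string &Path) {
    // Treat "/" as "no sysroot"; otherwise every file would count as system.
    SysrootPath = (Path == "/") ? std::string() : Path;
    // Cached answers were computed against the old sysroot, so drop them.
    IsSystemDir.clear();
    LastFileCheck = {-1, false};
  }

  bool isSystem(int FileID, const std::string &Dir) {
    // Fast path: same file as the previous query.
    if (LastFileCheck.first == FileID)
      return LastFileCheck.second;

    bool Result = false;
    if (!SysrootPath.empty()) {
      auto Inserted = IsSystemDir.emplace(Dir, false);
      if (Inserted.second) {
        // First time this directory is seen: compute and remember the answer.
        ++Computations;
        Inserted.first->second =
            Dir.compare(0, SysrootPath.size(), SysrootPath) == 0;
      }
      Result = Inserted.first->second;
    }
    LastFileCheck = {FileID, Result};
    return Result;
  }

  unsigned computations() const { return Computations; }
};
```

Two different files in the same directory trigger only one prefix computation, and a later `setSysrootPath` call forces the directory to be re-evaluated against the new path.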

lib/Index/IndexingContext.cpp

//===- IndexingContext.cpp - Indexing context data ------------------------===//		//===- IndexingContext.cpp - Indexing context data ------------------------===//
//		//
// The LLVM Compiler Infrastructure		// The LLVM Compiler Infrastructure
//		//
// This file is distributed under the University of Illinois Open Source		// This file is distributed under the University of Illinois Open Source
// License. See LICENSE.TXT for details.		// License. See LICENSE.TXT for details.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "IndexingContext.h"		#include "IndexingContext.h"
#include "clang/Index/IndexDataConsumer.h"
#include "clang/AST/ASTContext.h"		#include "clang/AST/ASTContext.h"
#include "clang/AST/DeclTemplate.h"
#include "clang/AST/DeclObjC.h"		#include "clang/AST/DeclObjC.h"
		#include "clang/AST/DeclTemplate.h"
#include "clang/Basic/SourceManager.h"		#include "clang/Basic/SourceManager.h"
		#include "clang/Index/IndexDataConsumer.h"
		#include "llvm/Support/Path.h"

using namespace clang;		using namespace clang;
using namespace index;		using namespace index;
		using namespace llvm;

static bool isGeneratedDecl(const Decl *D) {		static bool isGeneratedDecl(const Decl *D) {
if (auto *attr = D->getAttr<ExternalSourceSymbolAttr>()) {		if (auto *attr = D->getAttr<ExternalSourceSymbolAttr>()) {
return attr->getGeneratedDeclaration();		return attr->getGeneratedDeclaration();
}		}
return false;		return false;
}		}

		void SystemFileCache::setSysrootPath(llvm::StringRef Path) {
		// Ignore sysroot path if it points to root, otherwise every header will be
		// treated as system one.
		SysrootPath = sys::path::root_path(Path) == Path ? StringRef() : Path;

		// Invalidate existing results
		LastFileCheck = {FileID(), false};
		DirEntries.clear();
		}

		SystemFileCache::SystemFileCache(std::string Path) { setSysrootPath(Path); }

		bool SystemFileCache::isSystem(clang::FileID FID, clang::SourceManager &SM) {
		if (LastFileCheck.first == FID)
		return LastFileCheck.second;

		auto Result = [&](bool Res) -> bool {
		LastFileCheck = {FID, Res};
		return Res;
		};

		bool Invalid = false;
		const SrcMgr::SLocEntry &SEntry = SM.getSLocEntry(FID, &Invalid);
		if (Invalid || !SEntry.isFile())
		return Result(false);

		const SrcMgr::FileInfo &FI = SEntry.getFile();
		if (FI.getFileCharacteristic() != SrcMgr::C_User)
		return Result(true);

		auto *CC = FI.getContentCache();
		if (!CC)
		return Result(false);
		auto *FE = CC->OrigEntry;
		if (!FE)
		return Result(false);

		if (SysrootPath.empty())
		return Result(false);

		// Check if directory is in sysroot so that we can consider system headers
		// even the headers found via a user framework search path, pointing inside
		// sysroot.
		auto DirEntry = FE->getDir();
		auto Pair = DirEntries.insert(std::make_pair(DirEntry, false));
		bool &IsSystemDir = Pair.first->second;
		bool WasInserted = Pair.second;
		if (WasInserted) {
		IsSystemDir = StringRef(DirEntry->getName()).startswith(SysrootPath);
		}
		return Result(IsSystemDir);
		}

bool IndexingContext::shouldIndex(const Decl *D) {		bool IndexingContext::shouldIndex(const Decl *D) {
return !isGeneratedDecl(D);		return !isGeneratedDecl(D);
}		}

const LangOptions &IndexingContext::getLangOpts() const {		const LangOptions &IndexingContext::getLangOpts() const {
return Ctx->getLangOpts();		return Ctx->getLangOpts();
}		}

[48 lines not shown]	bool IndexingContext::importedModule(const ImportDecl *ImportD) {
else		else
Loc = ImportD->getLocation();		Loc = ImportD->getLocation();

SourceManager &SM = Ctx->getSourceManager();		SourceManager &SM = Ctx->getSourceManager();
FileID FID = SM.getFileID(SM.getFileLoc(Loc));		FileID FID = SM.getFileID(SM.getFileLoc(Loc));
if (FID.isInvalid())		if (FID.isInvalid())
return true;		return true;

bool Invalid = false;		bool IsInSystemFile = SystemCache.isSystem(FID, SM);
const SrcMgr::SLocEntry &SEntry = SM.getSLocEntry(FID, &Invalid);		if (IsInSystemFile) {
if (Invalid || !SEntry.isFile())
return true;

if (SEntry.getFile().getFileCharacteristic() != SrcMgr::C_User) {
switch (IndexOpts.SystemSymbolFilter) {		switch (IndexOpts.SystemSymbolFilter) {
case IndexingOptions::SystemSymbolFilterKind::None:		case IndexingOptions::SystemSymbolFilterKind::None:
return true;		return true;
case IndexingOptions::SystemSymbolFilterKind::DeclarationsOnly:		case IndexingOptions::SystemSymbolFilterKind::DeclarationsOnly:
case IndexingOptions::SystemSymbolFilterKind::All:		case IndexingOptions::SystemSymbolFilterKind::All:
break;		break;
}		}
}		}

SymbolRoleSet Roles = (unsigned)SymbolRole::Declaration;		SymbolRoleSet Roles = (unsigned)SymbolRole::Declaration;
if (ImportD->isImplicit())		if (ImportD->isImplicit())
Roles |= (unsigned)SymbolRole::Implicit;		Roles |= (unsigned)SymbolRole::Implicit;

return DataConsumer.handleModuleOccurence(ImportD, Roles, Loc);		return DataConsumer.handleModuleOccurence(ImportD, Roles, Loc,
		IsInSystemFile);
}		}

bool IndexingContext::isTemplateImplicitInstantiation(const Decl *D) {		bool IndexingContext::isTemplateImplicitInstantiation(const Decl *D) {
TemplateSpecializationKind TKind = TSK_Undeclared;		TemplateSpecializationKind TKind = TSK_Undeclared;
if (const ClassTemplateSpecializationDecl *		if (const ClassTemplateSpecializationDecl *
SD = dyn_cast<ClassTemplateSpecializationDecl>(D)) {		SD = dyn_cast<ClassTemplateSpecializationDecl>(D)) {
TKind = SD->getSpecializationKind();		TKind = SD->getSpecializationKind();
} else if (const FunctionDecl *FD = dyn_cast<FunctionDecl>(D)) {		} else if (const FunctionDecl *FD = dyn_cast<FunctionDecl>(D)) {
[35 lines not shown]	bool IndexingContext::shouldIgnoreIfImplicit(const Decl *D) {
if (isa<ImportDecl>(D))		if (isa<ImportDecl>(D))
return false;		return false;
return true;		return true;
}		}

static const CXXRecordDecl *		static const CXXRecordDecl *
getDeclContextForTemplateInstationPattern(const Decl *D) {		getDeclContextForTemplateInstationPattern(const Decl *D) {
if (const auto *CTSD =		if (const auto *CTSD =
dyn_cast<ClassTemplateSpecializationDecl>(D->getDeclContext()))		dyn_cast<ClassTemplateSpecializationDecl>(D->getDeclContext()))
		arphaman (Done): It might be worth investigating if you can use any of the LLVM's path APIs here instead of doing a UNIX-specific check.
return CTSD->getTemplateInstantiationPattern();		return CTSD->getTemplateInstantiationPattern();
else if (const auto *RD = dyn_cast<CXXRecordDecl>(D->getDeclContext()))		else if (const auto *RD = dyn_cast<CXXRecordDecl>(D->getDeclContext()))
return RD->getInstantiatedFromMemberClass();		return RD->getInstantiatedFromMemberClass();
return nullptr;		return nullptr;
}		}

static const Decl *adjustTemplateImplicitInstantiation(const Decl *D) {		static const Decl *adjustTemplateImplicitInstantiation(const Decl *D) {
if (const ClassTemplateSpecializationDecl *		if (const ClassTemplateSpecializationDecl *
[24 lines not shown]	if (const auto *ED = dyn_cast<EnumDecl>(ECD->getDeclContext())) {
for (const NamedDecl *BaseECD : Pattern->lookup(ECD->getDeclName()))		for (const NamedDecl *BaseECD : Pattern->lookup(ECD->getDeclName()))
return BaseECD;		return BaseECD;
}		}
}		}
}		}
return nullptr;		return nullptr;
}		}

static bool isDeclADefinition(const Decl *D, const DeclContext *ContainerDC, ASTContext &Ctx) {		static bool isDeclADefinition(const Decl *D, const DeclContext *ContainerDC,
		ASTContext &Ctx) {
if (auto VD = dyn_cast<VarDecl>(D))		if (auto VD = dyn_cast<VarDecl>(D))
return VD->isThisDeclarationADefinition(Ctx);		return VD->isThisDeclarationADefinition(Ctx);

if (auto FD = dyn_cast<FunctionDecl>(D))		if (auto FD = dyn_cast<FunctionDecl>(D))
return FD->isThisDeclarationADefinition();		return FD->isThisDeclarationADefinition();

if (auto TD = dyn_cast<TagDecl>(D))		if (auto TD = dyn_cast<TagDecl>(D))
return TD->isThisDeclarationADefinition();		return TD->isThisDeclarationADefinition();
[99 lines not shown]

bool IndexingContext::handleDeclOccurrence(const Decl *D, SourceLocation Loc,		bool IndexingContext::handleDeclOccurrence(const Decl *D, SourceLocation Loc,
bool IsRef, const Decl *Parent,		bool IsRef, const Decl *Parent,
SymbolRoleSet Roles,		SymbolRoleSet Roles,
ArrayRef<SymbolRelation> Relations,		ArrayRef<SymbolRelation> Relations,
const Expr *OrigE,		const Expr *OrigE,
const Decl *OrigD,		const Decl *OrigD,
const DeclContext *ContainerDC) {		const DeclContext *ContainerDC) {
if (D->isImplicit() && !isa<ObjCMethodDecl>(D))		if (D->isImplicit() && !(isa<ObjCMethodDecl>(D) || isa<ObjCIvarDecl>(D)))
return true;		return true;
if (!isa<NamedDecl>(D) || shouldSkipNamelessDecl(cast<NamedDecl>(D)))		if (!isa<NamedDecl>(D) || shouldSkipNamelessDecl(cast<NamedDecl>(D)))
return true;		return true;

SourceManager &SM = Ctx->getSourceManager();		SourceManager &SM = Ctx->getSourceManager();
FileID FID = SM.getFileID(SM.getFileLoc(Loc));		FileID FID = SM.getFileID(SM.getFileLoc(Loc));
if (FID.isInvalid())		if (FID.isInvalid())
return true;		return true;

bool Invalid = false;		bool IsInSystemFile = SystemCache.isSystem(FID, SM);
const SrcMgr::SLocEntry &SEntry = SM.getSLocEntry(FID, &Invalid);		if (IsInSystemFile) {
if (Invalid || !SEntry.isFile())
return true;

if (SEntry.getFile().getFileCharacteristic() != SrcMgr::C_User) {
switch (IndexOpts.SystemSymbolFilter) {		switch (IndexOpts.SystemSymbolFilter) {
case IndexingOptions::SystemSymbolFilterKind::None:		case IndexingOptions::SystemSymbolFilterKind::None:
return true;		return true;
case IndexingOptions::SystemSymbolFilterKind::DeclarationsOnly:		case IndexingOptions::SystemSymbolFilterKind::DeclarationsOnly:
if (!shouldReportOccurrenceForSystemDeclOnlyMode(IsRef, Roles, Relations))		if (!shouldReportOccurrenceForSystemDeclOnlyMode(IsRef, Roles, Relations))
return true;		return true;
break;		break;
case IndexingOptions::SystemSymbolFilterKind::All:		case IndexingOptions::SystemSymbolFilterKind::All:
[56 lines not shown]	bool IndexingContext::handleDeclOccurrence(const Decl *D, SourceLocation Loc,
}		}

for (auto &Rel : Relations) {		for (auto &Rel : Relations) {
addRelation(SymbolRelation(Rel.Roles,		addRelation(SymbolRelation(Rel.Roles,
Rel.RelatedSymbol->getCanonicalDecl()));		Rel.RelatedSymbol->getCanonicalDecl()));
}		}

IndexDataConsumer::ASTNodeInfo Node{OrigE, OrigD, Parent, ContainerDC};		IndexDataConsumer::ASTNodeInfo Node{OrigE, OrigD, Parent, ContainerDC};
return DataConsumer.handleDeclOccurence(D, Roles, FinalRelations, Loc, Node);		return DataConsumer.handleDeclOccurence(D, Roles, FinalRelations, Loc,
		IsInSystemFile, Node);
}		}
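
On the path-API point raised in the inline comment: a plain string-prefix test also accepts sibling directories that merely share leading characters (a sysroot of `/sdk` would match `/sdk2/usr`). The sketch below compares whole path components instead. It is only an illustration and swaps in C++17 `std::filesystem` rather than the `llvm::sys::path` helpers used in the patch; `isRootOnly` and `isInsideSysroot` are hypothetical names, and the behavior shown assumes POSIX-style paths.

```cpp
#include <filesystem>

namespace fs = std::filesystem;

// Hypothetical helper: true if Sysroot is empty or only a root such as "/".
bool isRootOnly(const fs::path &Sysroot) {
  return Sysroot == Sysroot.root_path();
}

// Hypothetical helper: true if Dir is Sysroot itself or lies below it,
// comparing whole path components rather than raw characters.
bool isInsideSysroot(const fs::path &Dir, const fs::path &Sysroot) {
  // A root-only sysroot is ignored; otherwise every header would be "system".
  if (Sysroot.empty() || isRootOnly(Sysroot))
    return false;

  fs::path D = Dir.lexically_normal();
  fs::path S = Sysroot.lexically_normal();
  // Drop the empty trailing component left by a trailing separator ("/sdk/").
  if (!S.has_filename())
    S = S.parent_path();

  auto DI = D.begin();
  for (auto SI = S.begin(); SI != S.end(); ++SI, ++DI)
    if (DI == D.end() || *SI != *DI)
      return false;
  return true;
}
```

This is purely lexical: it does not resolve symlinks or touch the file system, which matches the spirit of a cached per-directory check but may differ from what the real implementation should do.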

lib/Index/UnitIndexDataRecorder.h

This file was added.

				//===--- UnitIndexDataRecorder.h - Persist index data to the file system --===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//

				#ifndef LLVM_CLANG_LIB_INDEX_UNITINDEXDATARECORDER_H
				#define LLVM_CLANG_LIB_INDEX_UNITINDEXDATARECORDER_H

				#include "clang/Index/RecordingAction.h"
				#include "clang/Index/UnitIndexDataConsumer.h"

				namespace clang {
				class DiagnosticsEngine;
				class FrontendOptions;

				namespace index {

				/// Persists the provided index data for a single translation unit out to the
				/// file system.
				class UnitIndexDataRecorder : public UnitIndexDataConsumer {
				protected:
				UnitDetails UnitInfo;

				public:
				UnitIndexDataRecorder(UnitDetails UnitInfo, RecordingOptions RecordOpts);

				void handleFileDependency(const FileEntry *FE, bool IsSystem) override;

				void handleInclude(const FileEntry *Source, unsigned Line,
				const FileEntry *Target) override;

				void handleModuleImport(const serialization::ModuleFile &Mod,
				bool IsSystem) override;

				bool
				shouldIndexModuleDependency(const serialization::ModuleFile &Mod) override;

				bool handleFileOccurrences(FileID FID,
				ArrayRef<DeclOccurrence> OccurrencesSortedByOffset,
				bool IsSystem) override;

				void finish() override;
				};

				} // end namespace index
				} // end namespace clang

				#endif

lib/Index/UnitIndexDataRecorder.cpp

This file was added.

				//===--- UnitIndexDataRecorder.cpp - Persist index data to the file system ===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//

				#include "UnitIndexDataRecorder.h"
				#include "clang/Frontend/FrontendOptions.h"

				using namespace clang;
				using namespace clang::index;

				UnitIndexDataRecorder::UnitIndexDataRecorder(UnitDetails UnitDetails,
				RecordingOptions RecordOpts)
				: UnitInfo(UnitDetails) {
				// TODO
				}

				void UnitIndexDataRecorder::handleFileDependency(const FileEntry *FE,
				bool IsSystem) {
				// TODO
				}

				void UnitIndexDataRecorder::handleInclude(const FileEntry *Source,
				unsigned int Line,
				const FileEntry *Target) {
				// TODO
				}

				void UnitIndexDataRecorder::handleModuleImport(
				const serialization::ModuleFile &Mod, bool IsSystem) {
				// TODO
				}

				bool UnitIndexDataRecorder::shouldIndexModuleDependency(
				const serialization::ModuleFile &Mod) {
				// TODO
				return true;
				};

				bool UnitIndexDataRecorder::handleFileOccurrences(
				FileID FID, ArrayRef<DeclOccurrence> Occurs, bool IsSystem) {
				// TODO
				return false;
				};

				void UnitIndexDataRecorder::finish(){
				// TODO
				};

test/Index/Core/Inputs/module/ModDep.h

This file was added.

				#include "ModTop.h"

				void ModDep_func(ModTopStruct s);

test/Index/Core/Inputs/module/ModSystem.h

This file was added.


				typedef struct {} ModSystemStruct;

				void ModSystem_func(void);

test/Index/Core/Inputs/module/ModTop.h

This file was added.


				typedef struct {} ModTopStruct;

				void ModTop_func(void);

test/Index/Core/Inputs/module/ModTopSub1.h

This file was added.

void ModTopSub1_func(void);

test/Index/Core/Inputs/module/ModTopSub2.h

This file was added.

// This header has no symbols, intended to show up as file dependency.

test/Index/Core/Inputs/module/module.modulemap

	module ModA { header "ModA.h" export * }			module ModA { header "ModA.h" export * }
				module ModDep { header "ModDep.h" export * }
				module ModSystem [system] { header "ModSystem.h" export * }
				module ModTop {
				header "ModTop.h"
				export *
				module Sub1 {
				header "ModTopSub1.h"
				}
				module Sub2 {
				header "ModTopSub2.h"
				}
				}

test/Index/Core/Inputs/sys/system-head.h

				// UNIT: index-system.mm.o
				// UNIT: is-system: 0
				// UNIT: is-module: 0
				// UNIT: has-main: 1
				// UNIT: main-path: {{.*}}index-system.mm
				// UNIT: out-file: {{.*}}index-system.mm.o
				// UNIT: is-debug: 1

				// UNIT: DEPEND START
				// UNIT: File | user | {{.*}}index-system.mm
				// UNIT: File | system | {{.*}}system-head.h
				// UNIT: DEPEND END (2)

				// UNIT: INCLUDE START
				// UNIT: {{.*}}index-system.mm:4 -> {{.*}}system-head.h
				// UNIT: INCLUDE END (1)

	// CHECK: [[@LINE+1]]:12 | class/ObjC | Base | [[Base_USR:.*]] | {{.*}} | Decl | rel: 0			// CHECK: [[@LINE+1]]:12 | class/ObjC | Base | [[Base_USR:.*]] | {{.*}} | Decl | rel: 0
	@interface Base			@interface Base
	@end			@end

	// CHECK: [[@LINE+1]]:11 | protocol/ObjC | Prot1 | [[Prot1_USR:.*]] | {{.*}} | Decl | rel: 0			// CHECK: [[@LINE+1]]:11 | protocol/ObjC | Prot1 | [[Prot1_USR:.*]] | {{.*}} | Decl | rel: 0
	@protocol Prot1			@protocol Prot1
	@end			@end

	[28 lines not shown]

test/Index/Core/Inputs/transitive-include.h

This file was added.

				#include "system-head.h"

				struct Point {
				int x;
				int y;
				};

test/Index/Core/external-source-symbol-attr.m

	// RUN: c-index-test core -print-source-symbols -- %s -target x86_64-apple-macosx10.7 | FileCheck %s			// RUN: c-index-test core -print-source-symbols -- %s -target x86_64-apple-macosx10.7 | FileCheck %s
				// RUN: c-index-test core -print-source-unit -- %s -target x86_64-apple-macosx10.7 | FileCheck %s

	#define EXT_DECL(mod_name) __attribute__((external_source_symbol(language="Swift", defined_in=mod_name)))			#define EXT_DECL(mod_name) __attribute__((external_source_symbol(language="Swift", defined_in=mod_name)))
	#define GEN_DECL(mod_name) __attribute__((external_source_symbol(language="Swift", defined_in=mod_name, generated_declaration)))			#define GEN_DECL(mod_name) __attribute__((external_source_symbol(language="Swift", defined_in=mod_name, generated_declaration)))
	#define PUSH_GEN_DECL(mod_name) push(GEN_DECL(mod_name), apply_to=any(enum, objc_interface, objc_category, objc_protocol))			#define PUSH_GEN_DECL(mod_name) push(GEN_DECL(mod_name), apply_to=any(enum, objc_interface, objc_category, objc_protocol))

	// Forward declarations should not affect module namespacing below			// Forward declarations should not affect module namespacing below
	@class I1;			@class I1;
	@class I2;			@class I2;
	[95 lines not shown]

test/Index/Core/index-instantiated-source.cpp

	// RUN: c-index-test core -print-source-symbols -- %s -std=c++14 -target x86_64-apple-macosx10.7 | FileCheck %s			// RUN: c-index-test core -print-source-symbols -- %s -std=c++14 -target x86_64-apple-macosx10.7 | FileCheck %s
				// RUN: c-index-test core -print-source-unit -- %s -std=c++14 -target x86_64-apple-macosx10.7 | FileCheck %s
	// References to declarations in instantiations should be canonicalized:			// References to declarations in instantiations should be canonicalized:

	template<typename T>			template<typename T>
	class BaseTemplate {			class BaseTemplate {
	public:			public:
	T baseTemplateFunction();			T baseTemplateFunction();
	// CHECK: [[@LINE-1]]:5 | instance-method/C++ | baseTemplateFunction | c:@ST>1#T@BaseTemplate@F@baseTemplateFunction#			// CHECK: [[@LINE-1]]:5 | instance-method/C++ | baseTemplateFunction | c:@ST>1#T@BaseTemplate@F@baseTemplateFunction#

	[79 lines not shown]

test/Index/Core/index-source.mm

	// RUN: c-index-test core -print-source-symbols -- %s -target x86_64-apple-macosx10.7 | FileCheck %s			// RUN: c-index-test core -print-source-symbols -- %s -target x86_64-apple-macosx10.7 | FileCheck %s
				// RUN: c-index-test core -print-source-unit -- %s -target x86_64-apple-macosx10.7 | FileCheck -check-prefixes=CHECK %s
				gribozavr (Not Done): No need to specify check-prefixes=CHECK.

	@interface MyCls			@interface MyCls
	@end			@end

	@protocol P1,P2;			@protocol P1,P2;

	// CHECK: [[@LINE+1]]:6 | function/C | foo | c:@F@foo#*$objc(cs)MyCls# | __Z3fooP5MyCls | Decl | rel: 0			// CHECK: [[@LINE+1]]:6 | function/C | foo | c:@F@foo#*$objc(cs)MyCls# | __Z3fooP5MyCls | Decl | rel: 0
	void foo(MyCls *o);			void foo(MyCls *o);
	// CHECK: [[@LINE+1]]:6 | function/C | foo | c:@F@foo#*Qoobjc(pl)P1objc(pl)P2# | __Z3fooPU15objcproto2P12P211objc_object | Decl | rel: 0			// CHECK: [[@LINE+1]]:6 | function/C | foo | c:@F@foo#*Qoobjc(pl)P1objc(pl)P2# | __Z3fooPU15objcproto2P12P211objc_object | Decl | rel: 0
	void foo(id<P2, P1> o);			void foo(id<P2, P1> o);

test/Index/Core/index-subkinds.m

	// RUN: c-index-test core -print-source-symbols -- %s -target x86_64-apple-macosx10.7 | FileCheck %s			// RUN: c-index-test core -print-source-symbols -- %s -target x86_64-apple-macosx10.7 | FileCheck %s
				// RUN: c-index-test core -print-source-unit -- %s -target x86_64-apple-macosx10.7 | FileCheck %s

	// CHECK: [[@LINE+1]]:12 | class/ObjC | XCTestCase | c:objc(cs)XCTestCase | _OBJC_CLASS_$_XCTestCase | Decl | rel: 0			// CHECK: [[@LINE+1]]:12 | class/ObjC | XCTestCase | c:objc(cs)XCTestCase | _OBJC_CLASS_$_XCTestCase | Decl | rel: 0
	@interface XCTestCase			@interface XCTestCase
	@end			@end

	// CHECK: [[@LINE+1]]:12 | class(test)/ObjC | MyTestCase | c:objc(cs)MyTestCase | _OBJC_CLASS_$_MyTestCase | Decl | rel: 0			// CHECK: [[@LINE+1]]:12 | class(test)/ObjC | MyTestCase | c:objc(cs)MyTestCase | _OBJC_CLASS_$_MyTestCase | Decl | rel: 0
	@interface MyTestCase : XCTestCase			@interface MyTestCase : XCTestCase
	@end			@end
	[51 lines not shown]

test/Index/Core/index-system.mm

	// RUN: c-index-test core -print-source-symbols -- %s -isystem %S/Inputs/sys | FileCheck %S/Inputs/sys/system-head.h			// RUN: c-index-test core -print-source-symbols -- %s -isystem %S/Inputs/sys | FileCheck %S/Inputs/sys/system-head.h
				// RUN: c-index-test core -print-source-unit -- %s -isystem %S/Inputs/sys | FileCheck -check-prefixes=UNIT,CHECK %S/Inputs/sys/system-head.h

	#include "system-head.h"			#include "system-head.h"

test/Index/Core/index-unit.mm

This file was added.

				// RUN: rm -rf %t.mcp
				gribozavr (Not Done): This test is very difficult to read... it is just a dump of random internal data structures... what do you think about converting it to a unit test?
				// RUN: c-index-test core -print-source-unit -- -arch x86_64 -mmacosx-version-min=10.7 -c %s -o %t.o -isystem %S/Inputs/sys -fmodules -fmodules-cache-path=%t.mcp -Xclang -fdisable-module-hash -I %S/Inputs/module -I %S/Inputs | FileCheck %s

				@import ModDep;
				@import ModSystem;



				// CHECK: ModTop.pcm

				// CHECK: is-system: 0
				// CHECK: is-module: 1
				// CHECK: module-name: ModTop
				// CHECK: has-main: 0
				// CHECK: main-path: {{$}}
				// CHECK: out-file: {{.*}}/ModTop.pcm

				// CHECK: DEPEND START
				// CHECK: File | user | {{.*}}/Inputs/module/ModTopSub2.h
				// CHECK: File | user | {{.*}}/Inputs/module/ModTopSub1.h
				// CHECK: File | user | {{.*}}/Inputs/module/ModTop.h
				// CHECK: DEPEND END (3)

				// CHECK: INCLUDE START
				// CHECK: INCLUDE END (0)

				// CHECK: {{.*}}/Inputs/module/ModTop.h
				// CHECK: 2:9 | struct/C | <no-name> | c:{{.*}} | <no-cgname> | Def | rel: 0
				// CHECK: 2:19 | type-alias/C | ModTopStruct | [[ModTopStruct_USR:.*]] | <no-cgname> | Def | rel: 0
				// CHECK: 4:6 | function/C | ModTop_func | {{.*}} | __Z11ModTop_funcv | Decl | rel: 0

				// CHECK: {{.*}}/Inputs/module/ModTopSub1.h
				// CHECK: 1:6 | function/C | ModTopSub1_func | {{.*}} | __Z15ModTopSub1_funcv | Decl | rel: 0



				// CHECK: ModDep.pcm

				// CHECK: is-system: 0
				// CHECK: is-module: 1
				// CHECK: module-name: ModDep
				// CHECK: has-main: 0
				// CHECK: main-path: {{$}}
				// CHECK: out-file: {{.*}}/ModDep.pcm

				// CHECK: DEPEND START
				// CHECK: File | user | {{.*}}/Inputs/module/ModDep.h
				// CHECK: Module | user | {{.*}}/ModTop.pcm
				// CHECK: DEPEND END (2)

				// CHECK: INCLUDE START
				// CHECK: INCLUDE END (0)

				// CHECK: {{.*}}/Inputs/module/ModDep.h
				// CHECK: 3:6 | function/C | ModDep_func | [[ModDep_func_USR:.*]] | __Z11ModDep_func12ModTopStruct | Decl | rel: 0
				// CHECK: 3:18 | type-alias/C | ModTopStruct | [[ModTopStruct_USR]] | <no-cgname> | Ref,RelCont | rel: 1
				// CHECK-NEXT: RelCont | ModDep_func | [[ModDep_func_USR]]



				// CHECK: ModSystem.pcm

				// CHECK: is-system: 1
				// CHECK: is-module: 1
				// CHECK: module-name: ModSystem
				// CHECK: has-main: 0
				// CHECK: main-path: {{$}}
				// CHECK: out-file: {{.*}}/ModSystem.pcm

				// CHECK: DEPEND START
				// CHECK: File | system | {{.*}}/Inputs/module/ModSystem.h
				// CHECK: DEPEND END (1)

				// CHECK: INCLUDE START
				// CHECK: INCLUDE END (0)

				// CHECK: {{.*}}/Inputs/module/ModSystem.h
				// CHECK: 2:9 | struct/C | <no-name> | {{.*}} | <no-cgname> | Def | rel: 0
				// CHECK: 2:19 | type-alias/C | ModSystemStruct | {{.*}} | <no-cgname> | Def | rel: 0
				// CHECK: 4:6 | function/C | ModSystem_func | {{.*}} | __Z14ModSystem_funcv | Decl | rel: 0



				// CHECK: index-unit.mm.o

				// CHECK: is-system: 0
				// CHECK: is-module: 0
				// CHECK: module-name: {{$}}
				// CHECK: has-main: 1
				// CHECK: main-path: {{.*}}/index-unit.mm
				// CHECK: out-file: {{.*}}/index-unit.mm.o

				// CHECK: DEPEND START
				// CHECK: File | user | {{.*}}/index-unit.mm
				// CHECK: File | user | {{.*}}/Inputs/module/module.modulemap
				// CHECK: File | user | {{.*}}/Inputs/transitive-include.h
				// CHECK: File | system | {{.*}}/Inputs/sys/system-head.h
				// CHECK: Module | user | {{.*}}/ModDep.pcm
				// CHECK: Module | system | {{.*}}/ModSystem.pcm
				// CHECK: DEPEND END (6)

				// CHECK: INCLUDE START
				// CHECK: {{.*}}index-unit.mm:[[@LINE+1]] -> {{.*}}/Inputs/transitive-include.h
				#include "transitive-include.h"
				// CHECK: {{.*}}/Inputs/transitive-include.h:1 -> {{.*}}/Inputs/sys/system-head.h
				// CHECK: INCLUDE END (2)

				// CHECK: {{.*}}/Inputs/transitive-include.h
				// CHECK: 3:8 \| struct/C \| Point \| [[Point_USR:.*]] \| <no-cgname> \| Def \| rel: 0
				// CHECK: 4:7 \| field/C \| x \| {{.*}} \| <no-cgname> \| Def,RelChild \| rel: 1
				// CHECK-NEXT: RelChild \| Point \| [[Point_USR]]
				// CHECK: 5:7 \| field/C \| y \| {{.*}} \| <no-cgname> \| Def,RelChild \| rel: 1
				// CHECK-NEXT: RelChild \| Point \| [[Point_USR]]

				// CHECK: {{.*}}/Inputs/sys/system-head.h
				// CHECK: 19:12 \| class/ObjC \| Base \| [[Base_USR:.*]] \| _OBJC_CLASS_$_Base \| Decl \| rel: 0
				// CHECK: 23:11 \| protocol/ObjC \| Prot1 \| [[Prot1_USR:.*]] \| <no-cgname> \| Decl \| rel: 0
				// CHECK: 29:11 \| protocol/ObjC \| Prot2 \| [[Prot2_USR:.*]] \| <no-cgname> \| Decl \| rel: 0
				// CHECK: 29:17 \| protocol/ObjC \| Prot1 \| [[Prot1_USR]] \| <no-cgname> \| Ref,RelBase,RelCont \| rel: 1
				// CHECK-NEXT: RelBase,RelCont \| Prot2 \| [[Prot2_USR]]
				// CHECK: 39:12 \| class/ObjC \| Sub \| [[Sub_USR:.*]] \| _OBJC_CLASS_$_Sub \| Decl \| rel: 0
				// CHECK: 39:18 \| class/ObjC \| Base \| [[Base_USR]] \| _OBJC_CLASS_$_Base \| Ref,RelBase,RelCont \| rel: 1
				// CHECK-NEXT: RelBase,RelCont \| Sub \| [[Sub_USR]]
				// CHECK: 39:23 \| protocol/ObjC \| Prot2 \| [[Prot2_USR]] \| <no-cgname> \| Ref,RelBase,RelCont \| rel: 1
				// CHECK-NEXT: RelBase,RelCont \| Sub \| [[Sub_USR]]
				// CHECK: 39:30 \| protocol/ObjC \| Prot1 \| [[Prot1_USR]] \| <no-cgname> \| Ref,RelBase,RelCont \| rel: 1
				// CHECK-NEXT: RelBase,RelCont \| Sub \| [[Sub_USR]]
				// CHECK: 41:8 \| instance-method/ObjC \| getit \| {{.*}} \| -[Sub getit] \| Decl,Dyn,RelChild \| rel: 1
				// CHECK-NEXT: RelChild \| Sub \| [[Sub_USR]]
				// CHECK: 45:7 \| class/C++ \| Cls \| [[Cls_USR:.*]] \| <no-cgname> \| Def \| rel: 0
				// CHECK: 50:7 \| class/C++ \| SubCls1 \| [[SubCls1_USR:.*]] \| <no-cgname> \| Def \| rel: 0
				// CHECK: 50:24 \| class/C++ \| Cls \| [[Cls_USR]] \| <no-cgname> \| Ref,RelBase,RelCont \| rel: 1
				// CHECK-NEXT: RelBase,RelCont \| SubCls1 \| [[SubCls1_USR]]
				// CHECK: 52:12 \| field/C++ \| f \| {{.*}} \| <no-cgname> \| Def,RelChild \| rel: 1
				// CHECK-NEXT: RelChild \| SubCls1 \| [[SubCls1_USR]]
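
The occurrence lines checked above follow the layout that printDeclOccurrence in tools/c-index-test/core_main.cpp writes: line:col, symbol kind, name, USR, codegen name (or <no-cgname>), roles and a relation count, separated by " | " (rendered as "\|" in this archive), followed by one tab-indented line per relation. A minimal standalone sketch of that layout, assuming plain strings in place of Clang's Decl, SymbolRoleSet and CodegenNameGenerator. The Occurrence and Relation structs and formatOccurrence are hypothetical stand-ins, not part of the patch:

```cpp
#include <cassert>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical stand-in for clang::index::SymbolRelation.
struct Relation {
  std::string Roles; // e.g. "RelChild"
  std::string Name;  // name of the related symbol
  std::string USR;   // USR of the related symbol
};

// Hypothetical stand-in for one declaration occurrence.
struct Occurrence {
  unsigned Line;
  unsigned Col;
  std::string Kind;   // e.g. "field/C"
  std::string Name;
  std::string USR;
  std::string CGName; // empty when there is no codegen name
  std::string Roles;  // e.g. "Def,RelChild"
  std::vector<Relation> Relations;
};

// Formats one occurrence in the same field order as the CHECK lines.
std::string formatOccurrence(const Occurrence &O) {
  assert(O.Line > 0 && O.Col > 0 && "line and column are 1-based");
  std::ostringstream OS;
  OS << O.Line << ':' << O.Col << " | " << O.Kind << " | " << O.Name
     << " | " << O.USR << " | "
     << (O.CGName.empty() ? "<no-cgname>" : O.CGName) << " | " << O.Roles
     << " | rel: " << O.Relations.size() << '\n';
  // Each relation goes on its own tab-indented line.
  for (const Relation &R : O.Relations)
    OS << '\t' << R.Roles << " | " << R.Name << " | " << R.USR << '\n';
  return OS.str();
}
```

Because the relation lines directly follow their occurrence line, the tests can match them with CHECK-NEXT.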

test/Index/Store/assembly-invocation.c

This file was added.

				// Make sure it doesn't crash.
				// RUN: %clang -target x86_64-apple-macosx10.7 -S %s -o %t.s
				// RUN: %clang -target x86_64-apple-macosx10.7 -c %t.s -o %t.o -index-store-path %t.idx

tools/c-index-test/core_main.cpp

//===-- core_main.cpp - Core Index Tool testbed ---------------------------===//		//===-- core_main.cpp - Core Index Tool testbed ---------------------------===//
//		//
// The LLVM Compiler Infrastructure		// The LLVM Compiler Infrastructure
//		//
// This file is distributed under the University of Illinois Open Source		// This file is distributed under the University of Illinois Open Source
// License. See LICENSE.TXT for details.		// License. See LICENSE.TXT for details.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "clang/CodeGen/ObjectFilePCHContainerOperations.h"		#include "clang/CodeGen/ObjectFilePCHContainerOperations.h"
#include "clang/Frontend/ASTUnit.h"		#include "clang/Frontend/ASTUnit.h"
#include "clang/Frontend/CompilerInstance.h"		#include "clang/Frontend/CompilerInstance.h"
#include "clang/Frontend/CompilerInvocation.h"		#include "clang/Frontend/CompilerInvocation.h"
#include "clang/Frontend/FrontendAction.h"		#include "clang/Frontend/FrontendAction.h"
#include "clang/Index/IndexingAction.h"		#include "clang/Index/CodegenNameGenerator.h"
#include "clang/Index/IndexDataConsumer.h"		#include "clang/Index/IndexDataConsumer.h"
#include "clang/Index/USRGeneration.h"		#include "clang/Index/USRGeneration.h"
#include "clang/Index/CodegenNameGenerator.h"		#include "clang/Index/UnitIndexDataConsumer.h"
		#include "clang/Index/UnitIndexingAction.h"
#include "clang/Serialization/ASTReader.h"		#include "clang/Serialization/ASTReader.h"
#include "llvm/Support/CommandLine.h"		#include "llvm/Support/CommandLine.h"
		#include "llvm/Support/Path.h"
		#include "llvm/Support/PrettyStackTrace.h"
#include "llvm/Support/Signals.h"		#include "llvm/Support/Signals.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"
#include "llvm/Support/PrettyStackTrace.h"

using namespace clang;		using namespace clang;
using namespace clang::index;		using namespace clang::index;
using namespace llvm;		using namespace llvm;

extern "C" int indextest_core_main(int argc, const char **argv);		extern "C" int indextest_core_main(int argc, const char **argv);

namespace {		namespace {

enum class ActionType {		enum class ActionType {
None,		None,
PrintSourceSymbols,		PrintSourceSymbols,
		PrintSourceUnit,
};		};

namespace options {		namespace options {

static cl::OptionCategory IndexTestCoreCategory("index-test-core options");		static cl::OptionCategory IndexTestCoreCategory("index-test-core options");

static cl::opt<ActionType>		static cl::opt<ActionType> Action(
Action(cl::desc("Action:"), cl::init(ActionType::None),		cl::desc("Action:"), cl::init(ActionType::None),
cl::values(		cl::values(clEnumValN(ActionType::PrintSourceSymbols,
clEnumValN(ActionType::PrintSourceSymbols,		"print-source-symbols", "Print symbols from source"),
"print-source-symbols", "Print symbols from source")),		clEnumValN(ActionType::PrintSourceUnit, "print-source-unit",
		"Print unit info from source")),
cl::cat(IndexTestCoreCategory));		cl::cat(IndexTestCoreCategory));

static cl::extrahelp MoreHelp(		static cl::extrahelp MoreHelp(
"\nAdd \"-- <compiler arguments>\" at the end to setup the compiler "		"\nAdd \"-- <compiler arguments>\" at the end to setup the compiler "
"invocation\n"		"invocation\n"
);		);

static cl::opt<bool>		static cl::opt<bool>
DumpModuleImports("dump-imported-module-files",		DumpModuleImports("dump-imported-module-files",
[... 11 lines not shown ...]

}		}
} // anonymous namespace		} // anonymous namespace

static void printSymbolInfo(SymbolInfo SymInfo, raw_ostream &OS);		static void printSymbolInfo(SymbolInfo SymInfo, raw_ostream &OS);
static void printSymbolNameAndUSR(const Decl *D, ASTContext &Ctx,		static void printSymbolNameAndUSR(const Decl *D, ASTContext &Ctx,
raw_ostream &OS);		raw_ostream &OS);

namespace {		static void printDeclOccurrence(const Decl *D, SymbolRoleSet Roles,
		ArrayRef<SymbolRelation> Relations, FileID FID,
class PrintIndexDataConsumer : public IndexDataConsumer {		unsigned Offset, bool IsInSystemFile,
raw_ostream &OS;		CodegenNameGenerator &CGNameGen,
std::unique_ptr<CodegenNameGenerator> CGNameGen;		raw_ostream &OS) {

public:
PrintIndexDataConsumer(raw_ostream &OS) : OS(OS) {
}

void initialize(ASTContext &Ctx) override {
CGNameGen.reset(new CodegenNameGenerator(Ctx));
}

bool handleDeclOccurence(const Decl *D, SymbolRoleSet Roles,
ArrayRef<SymbolRelation> Relations,
SourceLocation Loc, ASTNodeInfo ASTNode) override {
ASTContext &Ctx = D->getASTContext();		ASTContext &Ctx = D->getASTContext();
SourceManager &SM = Ctx.getSourceManager();		SourceManager &SM = Ctx.getSourceManager();

Loc = SM.getFileLoc(Loc);		unsigned Line = SM.getLineNumber(FID, Offset);
FileID FID = SM.getFileID(Loc);		unsigned Col = SM.getColumnNumber(FID, Offset);
unsigned Line = SM.getLineNumber(FID, SM.getFileOffset(Loc));
unsigned Col = SM.getColumnNumber(FID, SM.getFileOffset(Loc));
OS << Line << ':' << Col << " \| ";		OS << Line << ':' << Col << " \| ";

printSymbolInfo(getSymbolInfo(D), OS);		printSymbolInfo(getSymbolInfo(D), OS);
OS << " \| ";		OS << " \| ";

printSymbolNameAndUSR(D, Ctx, OS);		printSymbolNameAndUSR(D, Ctx, OS);
OS << " \| ";		OS << " \| ";

if (CGNameGen->writeName(D, OS))		if (CGNameGen.writeName(D, OS))
OS << "<no-cgname>";		OS << "<no-cgname>";
OS << " \| ";		OS << " \| ";

printSymbolRoles(Roles, OS);		printSymbolRoles(Roles, OS);
OS << " \| ";		OS << " \| ";

OS << "rel: " << Relations.size() << '\n';		OS << "rel: " << Relations.size() << '\n';

for (auto &SymRel : Relations) {		for (auto &SymRel : Relations) {
OS << '\t';		OS << '\t';
printSymbolRoles(SymRel.Roles, OS);		printSymbolRoles(SymRel.Roles, OS);
OS << " \| ";		OS << " \| ";
printSymbolNameAndUSR(SymRel.RelatedSymbol, Ctx, OS);		printSymbolNameAndUSR(SymRel.RelatedSymbol, Ctx, OS);
OS << '\n';		OS << '\n';
}		}
		}

		namespace {

		class PrintIndexDataConsumer : public IndexDataConsumer {
		raw_ostream &OS;
		std::unique_ptr<CodegenNameGenerator> CGNameGen;

		public:
		PrintIndexDataConsumer(raw_ostream &OS) : OS(OS) {
		}

		void initialize(ASTContext &Ctx) override {
		CGNameGen.reset(new CodegenNameGenerator(Ctx));
		}

		bool handleDeclOccurence(const Decl *D, SymbolRoleSet Roles,
		ArrayRef<SymbolRelation> Relations,
		SourceLocation Loc, bool IsInSystemFile,
		ASTNodeInfo ASTNode) override {
		ASTContext &Ctx = D->getASTContext();
		SourceManager &SM = Ctx.getSourceManager();

		Loc = SM.getFileLoc(Loc);
		FileID FID = SM.getFileID(Loc);
		unsigned Offset = SM.getFileOffset(Loc);
		printDeclOccurrence(D, Roles, Relations, FID, Offset, IsInSystemFile,
		*CGNameGen, OS);
return true;		return true;
}		}

bool handleModuleOccurence(const ImportDecl *ImportD, SymbolRoleSet Roles,		bool handleModuleOccurence(const ImportDecl *ImportD, SymbolRoleSet Roles,
SourceLocation Loc) override {		SourceLocation Loc, bool IsInSystemFile) override {
ASTContext &Ctx = ImportD->getASTContext();		ASTContext &Ctx = ImportD->getASTContext();
SourceManager &SM = Ctx.getSourceManager();		SourceManager &SM = Ctx.getSourceManager();

Loc = SM.getFileLoc(Loc);		Loc = SM.getFileLoc(Loc);
FileID FID = SM.getFileID(Loc);		FileID FID = SM.getFileID(Loc);
unsigned Line = SM.getLineNumber(FID, SM.getFileOffset(Loc));		unsigned Line = SM.getLineNumber(FID, SM.getFileOffset(Loc));
unsigned Col = SM.getColumnNumber(FID, SM.getFileOffset(Loc));		unsigned Col = SM.getColumnNumber(FID, SM.getFileOffset(Loc));
OS << Line << ':' << Col << " \| ";		OS << Line << ':' << Col << " \| ";
[... 96 lines not shown ...]	static bool printSourceSymbolsFromModule(StringRef modulePath,

PrintIndexDataConsumer DataConsumer(outs());		PrintIndexDataConsumer DataConsumer(outs());
IndexingOptions IndexOpts;		IndexingOptions IndexOpts;
indexASTUnit(*AU, DataConsumer, IndexOpts);		indexASTUnit(*AU, DataConsumer, IndexOpts);

return false;		return false;
}		}

		class PrintUnitDataConsumer : public UnitIndexDataConsumer {
		struct Dependency {
		enum Kind { File, Module };

		Kind Kind;
		std::string Name;
		bool IsSystem;

		void print(raw_ostream &OS) const {
		switch (Kind) {
		case File:
		OS << "File";
		break;
		case Module:
		OS << "Module";
		break;
		}
		OS << " \| " << (IsSystem ? "system" : "user") << " \| " << Name << '\n';
		}
		};

		struct Include {
		std::string Source;
		std::string Target;
		unsigned Line;

		void print(raw_ostream &OS) const {
		OS << Source << ':' << Line << " -> " << Target << '\n';
		}
		};

		struct FileOccurrences {
		SourceManager &SM;
		FileID FID;
		bool IsSystem;
		std::vector<DeclOccurrence> Occurrences;

		void print(raw_ostream &OS, CodegenNameGenerator &CGNameGen) const {
		const FileEntry *FE = SM.getFileEntryForID(FID);
		OS << '\n' << FE->getName() << "\n----------\n";
		for (const DeclOccurrence &Occur : Occurrences) {
		printDeclOccurrence(Occur.Dcl, Occur.Roles, Occur.Relations, FID,
		Occur.Offset, IsSystem, CGNameGen, OS);
		}
		}
		};

		raw_ostream &OS;
		UnitDetails UnitInfo;
		const CompilerInstance &CI;
		CodegenNameGenerator CGNameGen;
		bool IndexModDependencies = true;
		std::vector<Dependency> Dependencies;
		std::vector<Include> Includes;
		std::vector<FileOccurrences> FileOccurInfos;

		public:
		PrintUnitDataConsumer(raw_ostream &OS, UnitDetails UnitInfo,
		bool IndexModDeps)
		: OS(OS), UnitInfo(UnitInfo), CI(UnitInfo.CI),
		CGNameGen(CI.getASTContext()), IndexModDependencies(IndexModDeps) {}

		void handleFileDependency(const FileEntry *FE, bool IsSystem) override {
		Dependencies.push_back({Dependency::File, FE->getName(), IsSystem});
		}

		void handleModuleImport(const serialization::ModuleFile &Mod,
		bool IsSystem) override {
		Dependencies.push_back({Dependency::Module, Mod.FileName, IsSystem});
		}

		void handleInclude(const FileEntry *Source, unsigned Line,
		const FileEntry *Target) override {
		Includes.push_back({Source->getName(), Target->getName(), Line});
		}

		bool
		shouldIndexModuleDependency(const serialization::ModuleFile &Mod) override {
		return IndexModDependencies;
		}

		bool handleFileOccurrences(FileID FID,
		ArrayRef<DeclOccurrence> OccurrencesSortedByOffset,
		bool IsSystem) override {
		SourceManager &SM = CI.getASTContext().getSourceManager();
		FileOccurInfos.push_back({SM, FID, IsSystem, OccurrencesSortedByOffset});
		return false;
		}

		void finish() override {
		OS << sys::path::filename(UnitInfo.OutputFile) << '\n'
		<< "----------\n"
		<< "is-system: " << UnitInfo.IsSystemUnit << '\n'
		<< "is-module: " << UnitInfo.IsModuleUnit << '\n'
		<< "module-name: " << UnitInfo.ModuleName << '\n'
		<< "has-main: " << !!UnitInfo.RootFile << '\n'
		<< "main-path: "
		<< (UnitInfo.RootFile ? UnitInfo.RootFile->getName() : "") << '\n'
		<< "out-file: " << UnitInfo.OutputFile << '\n'
		<< "target: " << UnitInfo.CI.getTargetOpts().Triple << '\n'
		<< "is-debug: " << UnitInfo.IsDebugCompilation << '\n';

		OS << "\nDEPEND START\n";
		for (const Dependency &Dep : Dependencies) {
		Dep.print(OS);
		}
		OS << "DEPEND END (" << Dependencies.size() << ")\n";
		OS << "\nINCLUDE START\n";
		for (const Include &Inc : Includes) {
		Inc.print(OS);
		}
		OS << "INCLUDE END (" << Includes.size() << ")\n";
		for (const FileOccurrences &FileInfo : FileOccurInfos) {
		FileInfo.print(OS, CGNameGen);
		}
		OS << '\n';
		}
		};

		static bool printSourceUnit(ArrayRef<const char *> Args, bool IndexLocals,
		bool IndexModDeps) {
		SmallVector<const char *, 4> ArgsWithProgName;
		ArgsWithProgName.push_back("clang");
		ArgsWithProgName.append(Args.begin(), Args.end());
		IntrusiveRefCntPtr<DiagnosticsEngine> Diags(
		CompilerInstance::createDiagnostics(new DiagnosticOptions));
		auto CInvok = createInvocationFromCommandLine(ArgsWithProgName, Diags);
		if (!CInvok)
		return true;

		raw_ostream &OS = outs();
		UnitIndexingOptions IndexOpts;
		IndexOpts.IndexFunctionLocals = IndexLocals;

		auto ConsumerFactory = [&OS, IndexModDeps](UnitDetails UnitInfo) {
		return llvm::make_unique<PrintUnitDataConsumer>(OS, std::move(UnitInfo),
		IndexModDeps);
		};

		std::unique_ptr<FrontendAction> IndexAction;
		IndexAction = createUnitIndexingAction(ConsumerFactory, IndexOpts,
		/*WrappedAction=*/nullptr);

		auto PCHContainerOps = std::make_shared<PCHContainerOperations>();
		std::unique_ptr<ASTUnit> Unit(ASTUnit::LoadFromCompilerInvocationAction(
		std::move(CInvok), PCHContainerOps, Diags, IndexAction.get()));

		return !Unit;
		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Helper Utils		// Helper Utils
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

static void printSymbolInfo(SymbolInfo SymInfo, raw_ostream &OS) {		static void printSymbolInfo(SymbolInfo SymInfo, raw_ostream &OS) {
OS << getSymbolKindString(SymInfo.Kind);		OS << getSymbolKindString(SymInfo.Kind);
if (SymInfo.SubKind != SymbolSubKind::None)		if (SymInfo.SubKind != SymbolSubKind::None)
OS << '/' << getSymbolSubKindString(SymInfo.SubKind);		OS << '/' << getSymbolSubKindString(SymInfo.SubKind);
[... 54 lines not shown ...]	if (options::Action == ActionType::PrintSourceSymbols) {
}		}
if (CompArgs.empty()) {		if (CompArgs.empty()) {
errs() << "error: missing compiler args; pass '-- <compiler arguments>'\n";		errs() << "error: missing compiler args; pass '-- <compiler arguments>'\n";
return 1;		return 1;
}		}
return printSourceSymbols(CompArgs, options::DumpModuleImports, options::IncludeLocals);		return printSourceSymbols(CompArgs, options::DumpModuleImports, options::IncludeLocals);
}		}

		if (options::Action == ActionType::PrintSourceUnit) {
		if (CompArgs.empty()) {
		errs()
		<< "error: missing compiler args; pass '-- <compiler arguments>'\n";
		return 1;
		}
		return printSourceUnit(CompArgs, options::IncludeLocals,
		/*IndexModDependencies=*/true);
		}

return 0;		return 0;
}		}
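
PrintUnitDataConsumer in the diff above buffers what the indexing callbacks report and only writes it out in finish(), bracketing each list with START/END markers and an element count that the FileCheck tests match (for example "DEPEND END (6)"). A minimal standalone sketch of that collect-then-print pattern, assuming plain strings instead of FileEntry and serialization::ModuleFile. UnitPrinter is a hypothetical class that only mirrors the shape of the one in the patch, and its finish() returns the text instead of writing to a raw_ostream:

```cpp
#include <cassert>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical, simplified analogue of PrintUnitDataConsumer.
class UnitPrinter {
  struct Dependency {
    bool IsModule;
    std::string Name;
    bool IsSystem;
  };
  struct Include {
    std::string Source;
    std::string Target;
    unsigned Line;
  };

  std::vector<Dependency> Dependencies;
  std::vector<Include> Includes;

public:
  // Callbacks only record data; nothing is printed yet.
  void handleFileDependency(const std::string &Path, bool IsSystem) {
    Dependencies.push_back({false, Path, IsSystem});
  }
  void handleModuleImport(const std::string &ModuleFile, bool IsSystem) {
    Dependencies.push_back({true, ModuleFile, IsSystem});
  }
  void handleInclude(const std::string &Source, unsigned Line,
                     const std::string &Target) {
    assert(Line > 0 && "include lines are 1-based");
    Includes.push_back({Source, Target, Line});
  }

  // Emits everything in one pass, in the order it was recorded.
  std::string finish() const {
    std::ostringstream OS;
    OS << "DEPEND START\n";
    for (const Dependency &Dep : Dependencies)
      OS << (Dep.IsModule ? "Module" : "File") << " | "
         << (Dep.IsSystem ? "system" : "user") << " | " << Dep.Name << '\n';
    OS << "DEPEND END (" << Dependencies.size() << ")\n";
    OS << "INCLUDE START\n";
    for (const Include &Inc : Includes)
      OS << Inc.Source << ':' << Inc.Line << " -> " << Inc.Target << '\n';
    OS << "INCLUDE END (" << Includes.size() << ")\n";
    return OS.str();
  }
};
```

Deferring output to finish() keeps each section contiguous even when the callbacks arrive interleaved, and the trailing count lets a test detect unexpected extra entries.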

tools/diagtool/DiagnosticNames.cpp

	[... 37 lines not shown ...]
	#include "clang/Basic/DiagnosticSerializationKinds.inc"			#include "clang/Basic/DiagnosticSerializationKinds.inc"
	#include "clang/Basic/DiagnosticLexKinds.inc"			#include "clang/Basic/DiagnosticLexKinds.inc"
	#include "clang/Basic/DiagnosticParseKinds.inc"			#include "clang/Basic/DiagnosticParseKinds.inc"
	#include "clang/Basic/DiagnosticASTKinds.inc"			#include "clang/Basic/DiagnosticASTKinds.inc"
	#include "clang/Basic/DiagnosticCommentKinds.inc"			#include "clang/Basic/DiagnosticCommentKinds.inc"
	#include "clang/Basic/DiagnosticSemaKinds.inc"			#include "clang/Basic/DiagnosticSemaKinds.inc"
	#include "clang/Basic/DiagnosticAnalysisKinds.inc"			#include "clang/Basic/DiagnosticAnalysisKinds.inc"
	#include "clang/Basic/DiagnosticRefactoringKinds.inc"			#include "clang/Basic/DiagnosticRefactoringKinds.inc"
				#include "clang/Basic/DiagnosticIndexKinds.inc"
	#undef DIAG			#undef DIAG
	};			};

	static bool orderByID(const DiagnosticRecord &Left,			static bool orderByID(const DiagnosticRecord &Left,
	const DiagnosticRecord &Right) {			const DiagnosticRecord &Right) {
	return Left.DiagID < Right.DiagID;			return Left.DiagID < Right.DiagID;
	}			}

	[... 56 lines not shown ...]

tools/libclang/CXIndexDataConsumer.h

[... 459 lines not shown ...]	public:
CXIdxClientEntity getClientEntity(const Decl *D) const;		CXIdxClientEntity getClientEntity(const Decl *D) const;
void setClientEntity(const Decl *D, CXIdxClientEntity client);		void setClientEntity(const Decl *D, CXIdxClientEntity client);

static bool isTemplateImplicitInstantiation(const Decl *D);		static bool isTemplateImplicitInstantiation(const Decl *D);

private:		private:
bool handleDeclOccurence(const Decl *D, index::SymbolRoleSet Roles,		bool handleDeclOccurence(const Decl *D, index::SymbolRoleSet Roles,
ArrayRef<index::SymbolRelation> Relations,		ArrayRef<index::SymbolRelation> Relations,
SourceLocation Loc, ASTNodeInfo ASTNode) override;		SourceLocation Loc, bool IsInSystemFile,
		ASTNodeInfo ASTNode) override;

bool handleModuleOccurence(const ImportDecl *ImportD,		bool handleModuleOccurence(const ImportDecl *ImportD,
index::SymbolRoleSet Roles,		index::SymbolRoleSet Roles, SourceLocation Loc,
SourceLocation Loc) override;		bool IsInSystemFile) override;

void finish() override;		void finish() override;

bool handleDecl(const NamedDecl *D,		bool handleDecl(const NamedDecl *D,
SourceLocation Loc, CXCursor Cursor,		SourceLocation Loc, CXCursor Cursor,
DeclInfo &DInfo,		DeclInfo &DInfo,
const DeclContext *LexicalDC = nullptr,		const DeclContext *LexicalDC = nullptr,
const DeclContext *SemaDC = nullptr);		const DeclContext *SemaDC = nullptr);
[... 53 lines not shown ...]

tools/libclang/CXIndexDataConsumer.cpp

[... 151 lines not shown ...]
CXSymbolRole getSymbolRole(SymbolRoleSet Role) {		CXSymbolRole getSymbolRole(SymbolRoleSet Role) {
// CXSymbolRole mirrors low 9 bits of clang::index::SymbolRole.		// CXSymbolRole mirrors low 9 bits of clang::index::SymbolRole.
return CXSymbolRole(static_cast<uint32_t>(Role) & ((1 << 9) - 1));		return CXSymbolRole(static_cast<uint32_t>(Role) & ((1 << 9) - 1));
}		}
}		}

bool CXIndexDataConsumer::handleDeclOccurence(		bool CXIndexDataConsumer::handleDeclOccurence(
const Decl *D, SymbolRoleSet Roles, ArrayRef<SymbolRelation> Relations,		const Decl *D, SymbolRoleSet Roles, ArrayRef<SymbolRelation> Relations,
SourceLocation Loc, ASTNodeInfo ASTNode) {		SourceLocation Loc, bool IsInSystemFile, ASTNodeInfo ASTNode) {
Loc = getASTContext().getSourceManager().getFileLoc(Loc);		Loc = getASTContext().getSourceManager().getFileLoc(Loc);

if (Roles & (unsigned)SymbolRole::Reference) {		if (Roles & (unsigned)SymbolRole::Reference) {
const NamedDecl *ND = dyn_cast<NamedDecl>(D);		const NamedDecl *ND = dyn_cast<NamedDecl>(D);
if (!ND)		if (!ND)
return true;		return true;

if (auto *ObjCID = dyn_cast_or_null<ObjCInterfaceDecl>(ASTNode.OrigD)) {		if (auto *ObjCID = dyn_cast_or_null<ObjCInterfaceDecl>(ASTNode.OrigD)) {
[... 49 lines not shown ...]	if (Roles & (unsigned)SymbolRole::Reference) {
IndexingDeclVisitor(*this, Loc, LexicalDC).Visit(ASTNode.OrigD);		IndexingDeclVisitor(*this, Loc, LexicalDC).Visit(ASTNode.OrigD);
}		}

return !shouldAbort();		return !shouldAbort();
}		}

bool CXIndexDataConsumer::handleModuleOccurence(const ImportDecl *ImportD,		bool CXIndexDataConsumer::handleModuleOccurence(const ImportDecl *ImportD,
SymbolRoleSet Roles,		SymbolRoleSet Roles,
SourceLocation Loc) {		SourceLocation Loc,
		bool IsInSystemFile) {
IndexingDeclVisitor(*this, SourceLocation(), nullptr).Visit(ImportD);		IndexingDeclVisitor(*this, SourceLocation(), nullptr).Visit(ImportD);
return !shouldAbort();		return !shouldAbort();
}		}

void CXIndexDataConsumer::finish() {		void CXIndexDataConsumer::finish() {
indexDiagnostics();		indexDiagnostics();
}		}

[... 1,095 lines not shown ...]
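
getSymbolRole in the diff above converts a SymbolRoleSet to CXSymbolRole by keeping only the low 9 bits, using the mask (1 << 9) - 1. A minimal standalone sketch of that masking on plain 32-bit integers. keepLowBits is a hypothetical helper, and the bit values in the usage note are illustrative rather than taken from clang::index::SymbolRole:

```cpp
#include <cassert>
#include <cstdint>

// Keeps the lowest `Bits` bits of Value and clears everything above them.
std::uint32_t keepLowBits(std::uint32_t Value, unsigned Bits) {
  assert(Bits < 32 && "shift would overflow a 32-bit value");
  // (1 << Bits) - 1 has exactly the lowest `Bits` bits set.
  return Value & ((std::uint32_t{1} << Bits) - 1);
}
```

For example, keepLowBits(0x3FF, 9) yields 0x1FF: bit 9 is dropped while bits 0 through 8 pass through unchanged.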


Diff 153190
