This is an archive of the discontinued LLVM Phabricator instance.

clang-tools-extra/clangd/IncludeCleaner.cpp
697	Oh yeah makes sense.
698	Ok.
706	Ok, got it. We only use the include conversion for matching, and matching seems not to use line numbers. I guess it's the reason nothing fails. I can't write a test for this function directly since it's in an anonymous namespace.
708	Ok I will skip unresolved includes. But I am not sure I fully understand. We do the following: Convert clangd includes to include-cleaner includes. Match include-cleaner includes with symbol providers. If match found, symbol reference is satisfied. How does it matter in this scenario if the include is resolved? AFAIU as long as the header is spelled in the main file + it's matched with a symbol provider, we should say that the symbol reference is satisfied. Otherwise, it seems like we'll say that the header is missing, although it's there in the main file and unresolved. I don't know if this is in any way a realistic scenario. I am just approaching it with general logic, and in this sense having more "satisfied" symbols seems better than having less => leads to less false positives. It can lead to false negatives, too, but AFAIU false negatives are much less of a risk for missing include management.
716	Oh this is cool. Didn't realize we can do straight from a resolved include to the ID. Thanks.
720	Sure. The new design does this, as well as skipping the header name.
734	Ok, sure. What's a PP-disabled region? Are you talking about #ifdef's and such?
737	Thank you for the great explanation!
742	Yes, storing per Symbol totally makes sense. Let's discuss specifics in the corresponding document.
748	Yes, this comment goes along the lines of the design discussion we are having at the moment. again in big files it might be impossible to see the first range the diagnostic is attached to and people have a tendency to only care about the parts of the code they've touched this is AFAIU in conflict with the suggestion that the diagnostic should only be attached to the first reference.
748	any reason for storing tokens? I was primarily avoiding `clang::Range` since it requires `llvm::Code` to build ranges, and didn't want `computeIncludeCleanerFindings` to depend on anything but the AST. But `syntax::FileRange` sounds good.
759	sure.
775	Ok, great. Didn't know it was Ok to test `std::optional` directly in an `if` clause.

Harbormaster completed remote builds in B214436: Diff 498401.Feb 17 2023, 10:04 AM

kadircet added inline comments.Feb 21 2023, 6:48 AM

clang-tools-extra/clangd/IncludeCleaner.cpp
370	unfortunately `getFile` returns an `llvm::Expected` which requires explicit error handling (or it'll trigger a crash). you can simply `elog` the issue: if (!FE) { elog("IncludeCleaner: Failed to get an entry for resolved path {0}: {1}", Inc.Resolved, FE.takeError()); continue; }
683	this should also be either static or put into anon namespace
704	`generateMissingIncludeDiagnostics` should also be either static or put into anon namespace
705	i think it's better to use an `llvm::ArrayRef` instead of `llvm::SmallVector` for diag infos here, we don't need to copy.
708	How does it matter in this scenario if the include is resolved? AFAIU as long as the header is spelled in the main file + it's matched with a symbol provider, we should say that the symbol reference is satisfied. It doesn't matter at a high-level view. But since the implementation recognizes headers based on the HeaderID and it's only defined for resolved includes, if we were to have any unresolved include matches somehow (e.g. it has spelling "foo/bar.h", but is unresolved, and we do a match based on spelling because some IWYU pragma pointed at this header), we would hit the assertion around HeaderID always having value.
709	nit: `Result.push_back(std::move(D))`, as `D` has lots of strings in it, it might be expensive to copy it around. Even better if you construct the Diag in place via `Diag &D = Result.emplace_back()`, you can achieve this by moving all the logic that might bail out (e.g. replacement generation) to the top of the loop, and start forming the diagnostic only after we're sure that there'll be one.
713	we've got `Config::Diagnostics::Includes::IgnoreHeader` that disables include-cleaner analysis on headers that match a pattern. we should be respecting those config options here too.
720	i feel like there's actually value in keeping the header name around, i.e. the user will have some idea about the action, without triggering an extra interaction. this helps especially in cases where the finding is wrong, they'll discover this sooner, hence we'll be more likely to receive bug reports. but don't really have a strong preference, so feel free to keep it that way.
732	could you add a comment here, as this is subtle, something like `We might suggest insertion of an existing include in edge cases, e.g. include is present in a PP-disabled region, or spelling of the header turns out to be the same as one of the unresolved includes in the main file`
734	What's a PP-disabled region Yes, I was trying to say "preprocessor disabled region". e.g. in a piece of code like: #if 0 #include "foo.h" #endif preprocessor won't actually trigger inclusion of "foo.h", but most of the heuristic parsers (most importantly the logic in `HeaderIncludes`) will treat this include as usual.
755	this also needs to be static or put into anon namespace
755–763	can you restore `std::move`?
756	no need to copy the vector by taking a `std::vector` here, you can take an `llvm::ArrayRef` instead.
764	s/AST.getSourceManager()/SM
766	nit: it might be worth re-writing the following section as: std::vector<Diag> Result = generateUnusedIncludeDiagnostics(AST.tuPath(), Cfg.Diagnostics.UnusedIncludes == Strict ? computeUnusedIncludes(AST) : Findings.UnusedIncludes, Code); llvm::move(generateMissingIncludeDiagnostics(AST, MissingIncludes, Code), std::back_inserter(Result)); and move the checks like `if (Cfg.Diagnostics.MissingIncludes == Config::IncludesPolicy::Strict && !Cfg.Diagnostics.Suppress.contains("missing-includes"))` into the specific function, e.g. `generateUnusedIncludeDiagnostics`, as they already do some of the diagnostic filtering logic.
771	can you introduce a `trace::Span` wrapping the call to `walkUsed` with name `IncludeCleanerAnalysis` so that we can collect some stats about latency here?
783	nit: drop either `*` or `&` (preferably `&`), having a reference vs a pointer doesn't make any differences performance wise, but creates a confusion (as we don't realy need a reference to a pointer here)
793	nit: we prefer `early exits` to extra `nesting`, e.g. rewriting this as: if (Satisfied \|\| Providers.empty() \|\| Ref.RT != Explicit) continue; const auto &TB = AST.getTokens(); auto SpelledTokens = TB.spelledForExpanded(...); if (!SpelledTokens) continue; ... increases readability by: reducing the nesting making it more explicit about under what assumptions the rest of the code is working
797	nit: `auto Range = syntax::Token::range(SM, SpelledForExpanded->front(), SpelledForExpanded->back());`
800–803	you don't need to explicitly copy `Providers` into `ProviderHeaders`, you can pass it directly to `DiagInfo` below.
812	we use llvm casts, specifically `llvm::dyn_cast<NamedDecl*>(&Ref.Target.declaration())->getQualifiedNameAsString()`
813	`getQualifiedNameAsString` is going to print names that are really ugly at certain times, but unfortunately that's a problem we don't have a great solution to. so no action needed ATM, but we might want to switch between qualified and unqualified name depending on the length at the very least (e.g. symbol is coming from a templated class, which has a nasty nested instantiation).
817	nit: `MissingIncludes.emplace_back(std::move(SymbolName), Range, Providers);`
823	nit: you'd want `std::move`s here, around both of them
clang-tools-extra/clangd/IncludeCleaner.h
40	it seems unfortunate that we're duplicating these strings for each diag we want to emit. it might be better to just store a Symbol here (similar to Header) and delay spelling until needed.
45	nit: `std::tie(SymbolName, SymRefRange, Providers) == std::tie(Other.SymbolName, ...);` I'd also put the SymbolName match to be the last, as it's a string match and might be more costly (if we can bail out early)
52	I don't think we've much to gain by using SmallVector here, instead of std::vector
clang-tools-extra/clangd/unittests/IncludeCleanerTests.cpp
451	i don't think there's much value in testing out analysis here, we should rather focus on diagnostics generation, which isn't part of `computeIncludeCleanerFindings`. existing tests were focused on analysis, because legacy implementation for include-cleaner was actually performing these analysis itself. so I'd rather suggest having trivial test cases (from include-cleaner analysis perspective, no need for complicated directory/file layouts) and rather test things out through calls to `generateMissingIncludeDiagnostics` to make sure diagnostics has the right ranges, text and fix contents. right now we're not testing: header spelling symbol name generation ranges these diagnostics correspond to and these are the main functionality we're adding on top of include-cleaner analysis. you can take a look at the tests in llvm/llvm-project/clang-tools-extra/clangd/unittests/DiagnosticsTests.cpp to see how we're testing out diagnostics ranges, messages, fixes and what kind of helpers/matchers we have for them.

Address review comments (apart from testing).

Thanks for the comments! I should have addressed everything apart from testing.

clang-tools-extra/clangd/IncludeCleaner.cpp
370	It returns `llvm::ErrorOr`, if I am not mistaken. There was explicit error handling already (`if (!FE) continue` below), just without the `elog`. Are you trying to say it will crash without the logging? Not sure that's feasible :) Added the logging.
683	sure, thanks.
704	Well then `generateUnusedIncludeDiagnostics` too, I guess.
708	Ah ok, makes sense.
709	Sure. Doing the in-place construction now.
713	Ok, added that. Please have a look. At the moment it's trying to run filters on the output of the `spellHeader` method.
720	Hm I'd expect we'll actually have a higher probability to receive a bug report if the user clicks on the "Quick fix" and gets a wrong header included, because that's annoying :) Having a full message on hover, seeing that it's wrong and just ignoring it might not be annoying enough to file a bug ;-) But this is a pure speculation ofc. I think the point discussed on the design document makes sense: without mentioning the header name it will be a bit easier to extend this to suggesting multiple fixes. So if that's the general direction, I'd prefer to keep it the current way.
732	Ok sure.
734	thanks.
755	Ah didn't realize that you also left a comment here when replying to the identical comment on `generateMissingIncludeDiagnostics`. Should be done.
755–763	I'd rather do the emplacing, so that it's the same as in the `generateMissingIncludeDiagnostics`.
756	Oh this isn't even my code, but as long as it's a small change, sure :)
766	nit: it might be worth re-writing the following section as This code seems to ignore the option `Config::IncludesPolicy::None`. It's saying to either return the old-style clangd results in case of `Strict` or `include-cleaner` results otherwise (incl. in case of `None`). Am I missing something? and move the checks like ... Ok, moved into `generateMissingIncludeDiagnostics`.
771	Sure.
813	Ok so for now no action, IIUC.
817	This does not compile. Seems like it needs a certain type of constructor to be present in `MissingIncludeDiagInfo`, whereas atm it's just a struct.
clang-tools-extra/clangd/IncludeCleaner.h
40	I'm not sure I can see your point re `delaying spelling until needed`. Each `MissingIncludeDiagInfo` corresponds to one diagnostic. Whether we resolve the `Symbol` to the `SymbolName` during analysis (i.e., `walkUsed`) or in diagnostic generation (i.e., `generateMissingIncludeDiagnostics`), does not change the fact that the same symbol will be resolved to its name multiple times. As discussed in the doc, this results in simpler code than the version that maps a single `Symbol` object to multiple `Range`s and `Provider`s. AFAICS, the implementation of `Symbol` to `SymbolName` resolution does not seem to be very expensive either.
45	Ok, sure. I can see your point with the string comparisons. But I don't fully see the point of using `std::tie` here. What is so great about creating an extra object, even if it only stores references? Is it a purely stylistic suggestion?
52	Sure. I'm not so clear on the preferences yet. AFAIU your point, stdlib is to be preferred unless the llvm alternative is clearly beneficial. Is that the case?

Harbormaster completed remote builds in B215294: Diff 499553.Feb 22 2023, 12:27 PM

kadircet added inline comments.Feb 23 2023, 12:59 AM

clang-tools-extra/clangd/IncludeCleaner.cpp
291	can you also change the logic here to use `isFilteredByConfig` (we need the `native` call inside `isFilteredByConfig` as well to make sure it works on windows)
370	It returns llvm::ErrorOr, if I am not mistaken. Ah you're right, I confused it with `getFileRef`. So in theory `ErrorOr` doesn't require explicit checking hence it won't trigger a crash if destroyed while containing an error. Are you trying to say it will crash without the logging? Not sure that's feasible :) Right, it isn't the logging that'll prevent the crash but rather a combination of the call to `takeError` and the way `elog` consumes `Error` objects. But it isn't relevant here, as `ErrorOr` doesn't require mandatory handling.
384	nit: you can directly define `SpelledHeader` at line 377
419	this shouldn't be spelling, it should be the resolved path of the include.
444	can you prefix this with `IncludeCleaner:` and rather say `not diagnosing missing include {0}, filtered by config` to add a bit more context about what specific interaction the log is coming from
472	nit: auto &F = D.Fixes.emplace_back(); F.Message = ...; F.Edits.push_back(replacementToEdit(...));
476	nit: it'd put this next to `D.File` above
720	I think the point discussed on the design document makes sense: without mentioning the header name it will be a bit easier to extend this to suggesting multiple fixes. So if that's the general direction, I'd prefer to keep it the current way. Well, it's unclear when we'll get there, and moreover my suggestion was actually to still mention the header name when there's only a single provider (which will be the case most of the time). But this is a pure speculation ofc. But yeah, my point of view is also mostly speculation. So feel free to keep it this way, I'll just be grumpy :P
749	`auto Range = syntax::Token::range(SM, SpelledForExpanded->front(), SpelledForExpanded->back());`
751	as mentioned elsewhere, i think we should delay this symbol name spelling to diagnostic generation. to make sure core analysis we perform don't do work that might not get re-used (e.g. if we're not going to diagnose missing includes, or in the future when we don't care about spelling of all the symbols)
756	well, this is `our` code in the end :D
766	nit: I think logically it makes more sense for us to return set of `Used` includes here, and let the interaction that issues unused include diagnostics to derive this information from the set of used includes, and change the the missingincludes to a `vector< tuple<Symbol, Ref, Providers> >` (not only the unsatisfied ones) would represent the analysis better and make it more usable in the future (i.e. when we want to augment Hover responses, we can't re-use all the logic in here, we really need to implement another call to `walkUsed` because the analysis we get out of this call won't contain information for `satisfied` symbols. no need to do it now though, we can perform that kind of refactoring as we're adding the features too (or maybe it'll actually look neater to just have another call in those features rather than try and re-use the logic here)
766	This code seems to ignore the option Config::IncludesPolicy::None. It's saying to either return the old-style clangd results in case of Strict or include-cleaner results otherwise (incl. in case of None). Am I missing something? Well that was to be addressed by second part of the comment `And move the checks like if (Cfg.Diagnostics.MissingIncludes == Config::IncludesPolicy::Strict && !Cfg.Diagnostics.Suppress.contains("missing-includes")) into the specific function, e.g. generateUnusedIncludeDiagnostics, as they already do some of the diagnostic filtering logic.` I was talking about both missing and unused include diagnostics generation (hence `e.g.`), similar to the early exit in `generateMissingIncludeDiagnostics`, we should have one that returns an empty set of diagnostics, when it's suppressed or not enabled.
823	oops, i forgot to put the surrounding `{}` it should've been `MissingIncludes.emplace_back({...});`
clang-tools-extra/clangd/IncludeCleaner.h
40	Whether we resolve the Symbol to the SymbolName during analysis (i.e., walkUsed) or in diagnostic generation (i.e., generateMissingIncludeDiagnostics), does not change the fact that the same symbol will be resolved to its name multiple times. we might not generate those diagnostics always, e.g. missing-includes is disabled, but unused-includes is on, or maybe we're going to use these results for something else like Hover responses. As discussed in the doc, this results in simpler code than the version that maps a single Symbol object to multiple Ranges and Providers. I wasn't trying to suggest having a map here, I was suggesting just storing a `Symbol S` instead of `string Name`. AFAICS, the implementation of Symbol to SymbolName resolution does not seem to be very expensive either. Well, generating strings are usually expensive (not asymptotically but in practice, as they tend to require lots of memory allocations).
45	right, the comments with `nit:` prefix are usually things that won't matter much in practice, but reflects my (well, in the general case the reviewer's) or codebase's preference.
52	stdlib is to be preferred unless the llvm alternative is clearly beneficial Right. `llvm::SmallVector` and `std::vector` have different use cases, we usually go for the former if we're sure that number of elements we want to store are going to be handful (literally less than 10) most of the time and the size of the objects themselves is not too big. As `SmallVector` chooses to store objects internally (until it grows too much), rather than allocating a bunch of memory elsewhere and just storing a pointer (as std::vector does). This has benefits when your vector isn't going to grow beyond smallvector's limits (you don't need to pay for memory allocations, which are expensive), but has other costs (e.g. moving a std::vector is trivial as it's just assignment of a pointer and size, but moving a smallvector might not be as trivial (it at least needs to move all the elements) also the initial memory cost for smallvector is higher than std::vector (as it takes up the same space independent of it's fullness). So in this example, we're unlikely to have a small number of `MissingIncludes`, `MissingIncludeDiagInfo` is a big enough struct (vector and filerange add up to more than 30 bytes). Hence std::vector feels like the better choice.

Address review comments.

Thanks for the comments!

clang-tools-extra/clangd/IncludeCleaner.cpp
370	thanks for the explanation.
419	Ok thanks.
756	Sorry, wrong wording. I meant to say that this is not the code that has been touched in this patch. It might sometimes get annoying when comments on the patch dig too deep into code that's not in the diff.
766	Ok, I've refactored more of the config checking logic inside `generate..` functions.
823	No this seems to be even more wrong. `stl_vector.h(1303, 2): Candidate template ignored: substitution failure: deduced incomplete pack <(no value)> for template parameter '_Args'`. This is for the version with braces. And this is for no braces: /usr/bin/../lib/gcc/x86_64-linux-gnu/12/../../../../include/c++/12/bits/new_allocator.h:175:23: error: no matching constructor for initialization of 'clang::clangd::MissingIncludeDiagInfo' { ::new((void *)__p) _Up(std::forward<_Args>(__args)...); } It seems that it just doesn't cooperate with structs.
clang-tools-extra/clangd/IncludeCleaner.h
52	thanks!
clang-tools-extra/clangd/unittests/IncludeCleanerTests.cpp
451	Thank you, this makes sense. However, I believe we need to use `issueIncludeCleanerDiagnostics`rather than `generateMissingIncludeDiagnostics`, since the latter is private.

VitaNuo added inline comments.Feb 23 2023, 8:46 AM

clang-tools-extra/clangd/IncludeCleaner.cpp
751	Ok should be done now.
766	Thanks. This might very well be the case, but this comment also seems to suggest some premature optimization (in a way). It totally makes sense to re-use what's re-usable, but this sort of refactoring really only makes sense once we have a clear use case (and get there :)
clang-tools-extra/clangd/IncludeCleaner.h
40	Ok, agreed. Storing symbols now. Having only unused include analysis on is a convincing use case.

Upload once again.

Merge upstream changes.

Harbormaster completed remote builds in B215545: Diff 499896.Feb 23 2023, 10:07 AM

thanks! looks amazing, we're missing a little bit of test coverage though

clang-tools-extra/clangd/IncludeCleaner.cpp
284	s/HeaderSpelling/HeaderPath
286	s/Path/NormalizedPath
418	what about just `resolvedPath`, if you'd rather keep the verb, i think `get` makes more sense than `find`. we're not really searching anything.
422	nit: you can directly `return SymProvider.physical()->tryGetRealPathName();` (same for other 2 cases) and have an `llvm_unreachable("Unknown symbol kind");` after the switch statement.
425	in this and the next case we need to trim `<>"`
434	same as above, either just `symbolName` or `get`
438	again you can just return here and below
438	`getName` is a StringRef, and unfortunately there are some platforms (like darwin) that don't support implicit conversion from stringrefs to std::string. so can you call `.str()` explicitly in the end?
760	i think for now this should be if (Cfg.Diagnostics.MissingIncludes == Config::IncludesPolicy::Strict \|\| Cfg.Diagnostics.UnusedIncludes == Config::IncludesPolicy::Experiment) { otherwise we'll run both legacy and new analysis for `UnusedIncludes == Strict`
clang-tools-extra/clangd/unittests/IncludeCleanerTests.cpp
450	this is pointing at the declaration inside `b.h` not to the reference inside the main file. are you sure this test passes?
470	can you also add a reference (and declaration) for std::vector, and have an IWYU private pragma in one of the headers to test code paths that spell verbatim and standard headers? also having some diagnostic suppressed via `IgnoreHeaders` is important to check
482	can you make one of these names qualified? e.g. `namespace ns { struct Bar { void f(); }; }`

hokein mentioned this in D144976: [clangd] Add provider info on symbol hover..Mar 2 2023, 1:33 AM

Improve test coverage.

Thank you for all the thoughtful comments!

clang-tools-extra/clangd/IncludeCleaner.cpp
418	Ok, let's call it `get`. I do prefer verbs for methods, that's correct.
clang-tools-extra/clangd/unittests/IncludeCleanerTests.cpp
450	Yes, all the tests pass. `D` is a `Decl` from the main file, otherwise it wouldn't have passed the safeguard `if (!SM.isWrittenInMainFile(SM.getExpansionLoc(D->getLocation()))) continue;` above.
470	Thank you for the great tips on improving test coverage! In fact, I had to also introduce support for private pragmas, as they were not taken care of. Hopefully, the solution will make sense to you.

Harbormaster completed remote builds in B217135: Diff 502084.Mar 3 2023, 3:20 AM

thanks, looks great!

clang-tools-extra/clangd/IncludeCleaner.cpp
453	we should respect the style configurations (sorry for missing this in previous iterations). you can get the relevant style with: `clang::format::getStyle`, which has an IncludeStyle. in case the `getStyle` fails, we should fallback to `clang::format::getLLVMStyle` as we do in other places. you can get at the relevant VFS instance through sourcemanager.
731	you can directly use `!Pragmas->isPrivate(Inc->Resolved)` here, instead of getpublic
731	this check seems to be new. what's the reason for rejecting private providers? I can see that we might want to be conservative by not inserting private providers, but treating symbols as unsatisfied when a private provider is already included doesn't feel right. e.g. the code being analyzed might be allowed to depend on this private header, because it's also part of the library, or it's the public header that's exposing this private header. in such a scenario we shouldn't try to insert the public header again. is there a more concrete issue this code is trying to address?
clang-tools-extra/clangd/unittests/IncludeCleanerTests.cpp
445	nit: braces
450	this is passing because `bool BDeclFound;` is uninitialized above, if you set it to `bool BDeclFound = false;` you should see the test fail. there's no declaration for `b` inside the main file, it's declared in `b.h` and referenced inside the main file. you still need to search for the decl (without the constraint of being written in main file), use it to build an include_cleaner::Symbol, and use a `clangd::Annotation` range for the range of the reference. it might be easer to write this as: const NamedDecl* B = nullptr; for (...) { ... B = D; } ASSERT_TRUE(B); // build expected diagnostic info based on B and check that it's equal to what we've produced
458	i think the example for `std::vector` is solid, and `IWYU pragma private` needs a little adjustment.
471	we should include private.h through some indirection (not public.h) to check `IWYU pragma private` spellings are respected.
477	name this range as `bar` instead of `d`?
481	could you add a comment here saying this shouldn't be diagnosed?

Address review comments.

Thanks for the comments!

clang-tools-extra/clangd/IncludeCleaner.cpp
731	Ok makes sense. No, I guess I was just confused, because I understood that you wanted a test that includes "private.h" with a diagnostic generated saying that "public.h" should be included instead, so I assumed that was expected behaviour. But that's not what you meant, so I misunderstood.
clang-tools-extra/clangd/unittests/IncludeCleanerTests.cpp
450	Didn't know there was a difference between uninitialized and `false`.. Thanks for the idea with `ASSERT_TRUE(Decl)`. Please check out the new version.

kadircet added inline comments.Mar 7 2023, 6:25 AM

clang-tools-extra/clangd/IncludeCleaner.cpp
456	creating a copy of LLVM style unnecessarily all the time is not really great, can you move this into the failure case instead? also you can drop the `clang::` here and elsewhere, as this code is already part of `clang::` namespace.
457	as mentioned above we also need to make sure we're passing the relevant VFS instance inside the source manager, rather than using the real file system (as some clients rely on the VFS).
458	s/MainFile->getName()/AST.tuPath()/ to be consistent with other places.
460	can you also `elog` this error? as it should be rare and when this goes wrong, having this mentioned in the logs are really useful for debugging (since the failure is actually outside of clangd, it usually means a malformed config file somewhere)
clang-tools-extra/clangd/unittests/IncludeCleanerTests.cpp
428	nit: instead of using a point, can you use a range here instead (i.e. `[[b]]`)? afterwards you can have a `FileRange` pointing at both offsets, rather than relying on the length of the identifier.
448	rest of the code here doesn't really belong to the for loop, can you take them out?

Address review comments.

Thanks for the comments.

thanks for bearing with me, let's ship it!

clang-tools-extra/clangd/IncludeCleaner.cpp
456	nit: this could be shorter with auto FileStyle = format::getStyle(..); if (!FileStyle) { elog("..."); FileStyle = format::getLLVMStyle(); } tooling::HeaderIncludes HeaderIncludes(AST.tuPath(), Code, FileStyle->IncludeStyle);
clang-tools-extra/clangd/unittests/IncludeCleanerTests.cpp
450	nit: size_t Start = llvm::cantFail(positionToOffset(MainFile.code(), Range.start)); size_t End = llvm::cantFail(positionToOffset(MainFile.code(), Range.end)); no need for `EXPECT_FALSE(..takeError())`s as `llvm::cantFail` will fail (no pun intended :P), `static_cast`s are also redundant
460	it'd be better to `ASSERT_TRUE(BDecl);` right after the `for loop`, as rest of the code will crash (and even trigger undefined behavior because we're dereferencing nullptr in failure case). difference between `ASSERT_X` and `EXPECT_X` macros are, the former will stop execution of the particular test (hence we'll never trigger a nullptr deref with ASSERT_TRUE), whereas the latter just prints the failure, but doesn't abort the execution of test (hence helps print multiple failures at once, when they're non fatal).

This revision is now accepted and ready to land.Mar 7 2023, 7:24 AM

Address review comments.

Rebase to main.

This revision was landed with ongoing or failed builds.Mar 7 2023, 8:07 AM

Closed by commit rG38b9fb5a129d: [clangd] Add support for missing includes analysis. (authored by VitaNuo). · Explain Why

This revision was automatically updated to reflect the committed changes.

VitaNuo added a commit: rG38b9fb5a129d: [clangd] Add support for missing includes analysis..

Harbormaster completed remote builds in B217883: Diff 503045.Mar 7 2023, 8:50 AM

This change broke regression, none-one read: "This revision was landed with ongoing or failed builds."
./ClangdTests.exe/IncludeCleaner/GenerateMissingHeaderDiags tests fails on windows

This breaks tests on windows: http://45.33.8.238/win/75486/step_9.txt

Please take a look and revert for now if it takes a while to fix.

thakis added a reverting change: rG2eb5ac99a76d: Revert "[clangd] Add support for missing includes analysis.".Mar 7 2023, 7:14 PM

Oh, that was reported a while ago already. Reverted in 2eb5ac99a76dbbf8ac68c538211fabeaa5ac0bfd for now.

hokein added a subscriber: hokein.Mar 8 2023, 2:52 AM

hokein added inline comments.

clang-tools-extra/clangd/unittests/IncludeCleanerTests.cpp
464	Looks like this filter doesn't work on windows (the `/` vs `\` path separator might be the root cause here), I think a fix can be change the check to `return Header.endsWith("buzz.h")` or `return Header == testPath("buzz.h", llvm::sys::path::Style::posix)`.

@VitaNuo Why did you recommit this again without any fix, breaking regression again.

VitaNuo reopened this revision.Mar 8 2023, 4:20 AM

This revision is now accepted and ready to land.Mar 8 2023, 4:20 AM

Try to fix windows build.

This revision was landed with ongoing or failed builds.Mar 8 2023, 4:31 AM

Closed by commit rG46447e0ba2e3: Revert "Revert "Re-land [clangd] Add support for missing includes analysis."" (authored by VitaNuo). · Explain Why

This revision was automatically updated to reflect the committed changes.

VitaNuo added a commit: rG46447e0ba2e3: Revert "Revert "Re-land [clangd] Add support for missing includes analysis."".

VitaNuo reopened this revision.Mar 8 2023, 4:43 AM

This revision is now accepted and ready to land.Mar 8 2023, 4:43 AM

Try another approach.

Harbormaster completed remote builds in B218068: Diff 503322.Mar 8 2023, 5:28 AM

Fix formatting.

Harbormaster completed remote builds in B218077: Diff 503335.Mar 8 2023, 6:34 AM

VitaNuo closed this revision.Mar 13 2023, 2:57 AM

Revision Contents

Path

Size

clang-tools-extra/

clangd/

12 lines

24 lines

16 lines

3 lines

27 lines

375 lines

10 lines

4 lines

unittests/

ConfigCompileTests.cpp

6 lines

DiagnosticsTests.cpp

2 lines

IncludeCleanerTests.cpp

178 lines

PreambleTests.cpp

2 lines

include-cleaner/

include/

clang-include-cleaner/

Analysis.h

2 lines

lib/

Analysis.cpp

4 lines

Diff 503335

clang-tools-extra/clangd/Config.h

Show First 20 Lines • Show All 82 Lines • ▼ Show 20 Lines	struct Config {
/// Controls index behavior.		/// Controls index behavior.
struct {		struct {
/// Whether this TU should be background-indexed.		/// Whether this TU should be background-indexed.
BackgroundPolicy Background = BackgroundPolicy::Build;		BackgroundPolicy Background = BackgroundPolicy::Build;
ExternalIndexSpec External;		ExternalIndexSpec External;
bool StandardLibrary = true;		bool StandardLibrary = true;
} Index;		} Index;

enum class UnusedIncludesPolicy {		enum class IncludesPolicy {
		kadircetUnsubmitted Done Reply Inline Actions rather than duplicating, what about renaming `UnusedIncludesPolicy` to `IncludesPolicy` and use it for both `UnusedIncludes` and `MissingIncludes` options below? kadircet: rather than duplicating, what about renaming `UnusedIncludesPolicy` to `IncludesPolicy` and use…
		VitaNuoAuthorUnsubmitted Done Reply Inline Actions Sure. VitaNuo: Sure.
/// Diagnose unused includes.		/// Diagnose missing and unused includes.
Strict,		Strict,
None,		None,
/// The same as Strict, but using the include-cleaner library.		/// The same as Strict, but using the include-cleaner library for
		/// unused includes.
Experiment,		Experiment,
};		};
/// Controls warnings and errors when parsing code.		/// Controls warnings and errors when parsing code.
struct {		struct {
bool SuppressAll = false;		bool SuppressAll = false;
llvm::StringSet<> Suppress;		llvm::StringSet<> Suppress;

/// Configures what clang-tidy checks to run and options to use with them.		/// Configures what clang-tidy checks to run and options to use with them.
struct {		struct {
// A comma-seperated list of globs specify which clang-tidy checks to run.		// A comma-seperated list of globs specify which clang-tidy checks to run.
std::string Checks;		std::string Checks;
llvm::StringMap<std::string> CheckOptions;		llvm::StringMap<std::string> CheckOptions;
} ClangTidy;		} ClangTidy;

UnusedIncludesPolicy UnusedIncludes = UnusedIncludesPolicy::None;

/// Enable emitting diagnostics using stale preambles.		/// Enable emitting diagnostics using stale preambles.
bool AllowStalePreamble = false;		bool AllowStalePreamble = false;

		IncludesPolicy UnusedIncludes = IncludesPolicy::None;
		IncludesPolicy MissingIncludes = IncludesPolicy::None;

/// IncludeCleaner will not diagnose usages of these headers matched by		/// IncludeCleaner will not diagnose usages of these headers matched by
/// these regexes.		/// these regexes.
struct {		struct {
std::vector<std::function<bool(llvm::StringRef)>> IgnoreHeader;		std::vector<std::function<bool(llvm::StringRef)>> IgnoreHeader;
} Includes;		} Includes;
} Diagnostics;		} Diagnostics;

/// Style of the codebase.		/// Style of the codebase.
▲ Show 20 Lines • Show All 55 Lines • Show Last 20 Lines

clang-tools-extra/clangd/ConfigCompile.cpp

Show First 20 Lines • Show All 425 Lines • ▼ Show 20 Lines	if (!Normalized.empty())
[Normalized(std::move(Normalized))](const Params &, Config &C) {		[Normalized(std::move(Normalized))](const Params &, Config &C) {
if (C.Diagnostics.SuppressAll)		if (C.Diagnostics.SuppressAll)
return;		return;
for (llvm::StringRef N : Normalized)		for (llvm::StringRef N : Normalized)
C.Diagnostics.Suppress.insert(N);		C.Diagnostics.Suppress.insert(N);
});		});

if (F.UnusedIncludes)		if (F.UnusedIncludes)
if (auto Val =		if (auto Val = compileEnum<Config::IncludesPolicy>("UnusedIncludes",
compileEnum<Config::UnusedIncludesPolicy>("UnusedIncludes",
**F.UnusedIncludes)		**F.UnusedIncludes)
.map("Strict", Config::UnusedIncludesPolicy::Strict)		.map("Strict", Config::IncludesPolicy::Strict)
.map("Experiment", Config::UnusedIncludesPolicy::Experiment)		.map("Experiment", Config::IncludesPolicy::Experiment)
.map("None", Config::UnusedIncludesPolicy::None)		.map("None", Config::IncludesPolicy::None)
.value())		.value())
Out.Apply.push_back([Val](const Params &, Config &C) {		Out.Apply.push_back([Val](const Params &, Config &C) {
C.Diagnostics.UnusedIncludes = *Val;		C.Diagnostics.UnusedIncludes = *Val;
});		});

if (F.AllowStalePreamble) {		if (F.AllowStalePreamble) {
if (auto Val = F.AllowStalePreamble)		if (auto Val = F.AllowStalePreamble)
Out.Apply.push_back([Val](const Params &, Config &C) {		Out.Apply.push_back([Val](const Params &, Config &C) {
C.Diagnostics.AllowStalePreamble = **Val;		C.Diagnostics.AllowStalePreamble = **Val;
});		});
}		}

		if (F.MissingIncludes)
		if (auto Val = compileEnum<Config::IncludesPolicy>("MissingIncludes",
		**F.MissingIncludes)
		.map("Strict", Config::IncludesPolicy::Strict)
		.map("None", Config::IncludesPolicy::None)
		.value())
		Out.Apply.push_back([Val](const Params &, Config &C) {
		C.Diagnostics.MissingIncludes = *Val;
		});

compile(std::move(F.Includes));		compile(std::move(F.Includes));
compile(std::move(F.ClangTidy));		compile(std::move(F.ClangTidy));
}		}

void compile(Fragment::StyleBlock &&F) {		void compile(Fragment::StyleBlock &&F) {
if (!F.FullyQualifiedNamespaces.empty()) {		if (!F.FullyQualifiedNamespaces.empty()) {
std::vector<std::string> FullyQualifiedNamespaces;		std::vector<std::string> FullyQualifiedNamespaces;
for (auto &N : F.FullyQualifiedNamespaces) {		for (auto &N : F.FullyQualifiedNamespaces) {
▲ Show 20 Lines • Show All 177 Lines • Show Last 20 Lines

clang-tools-extra/clangd/ConfigFragment.h

Show First 20 Lines • Show All 215 Lines • ▼ Show 20 Lines

struct DiagnosticsBlock {

/// - warning categories (e.g. unused-result)

/// - clang-tidy check names (e.g. bugprone-narrowing-conversions)

///

/// This is a simple filter. Diagnostics can be controlled in other ways

/// (e.g. by disabling a clang-tidy check, or the -Wunused compile flag).

/// This often has other advantages, such as skipping some analysis.

std::vector<Located<std::string>> Suppress;

/// Controls how clangd will correct "unnecessary #include directives.

/// Controls how clangd will correct "unnecessary" #include directives.

kadircetUnsubmitted

Done

nit: i'd rather drop the quotes completely to match your description of MissingIncludes below.

kadircet: nit: i'd rather drop the quotes completely to match your description of MissingIncludes below.

/// clangd can warn if a header is `#include`d but not used, and suggest

/// removing it.

/// Strict means a header is unused if it does not *directly* provide any

/// symbol used in the file. Removing it may still break compilation if it

/// transitively includes headers that are used. This should be fixed by

/// including those headers directly.

///

/// Valid values are:

/// - Strict

/// - Experiment

/// - None

std::optional<Located<std::string>> UnusedIncludes;

kadircetUnsubmitted

Done

std::optional<Located<std::string>> UnusedIncludes;

- /// Controls how clangd handles missing #include directives.

+ /// Controls whether clangd should analyze missing #include directives.

/// clangd can warn if a header for a symbol is not `#include`d (missing),

kadircet:

/// Enable emitting diagnostics using stale preambles.

std::optional<Located<bool>> AllowStalePreamble;

kadircetUnsubmitted

Done

/// Controls how clangd handles missing #include directives.

- /// clangd can warn if a header for a symbol is not `#include`d (missing),

+ /// clangd will warn if no header providing a symbol is not `#include`d (missing) directly,

/// and suggest adding it.

///

/// Strict means a header is missing if it is not *directly #include'd.

kadircet:

/// Controls if clangd should analyze missing #include directives.

kadircetUnsubmitted

Done

/// and suggest adding it.

///

- /// Strict means a header is missing if it is not *directly #include'd.

+ /// Strict means a header providing a symbol is missing if it is not directly #include'd.

/// The file might still compile if the header is included transitively.

kadircet:

/// clangd will warn if no header providing a symbol is `#include`d

/// (missing) directly, and suggest adding it.

///

/// Strict means a header providing a symbol is missing if it is not

/// *directly #include'd. The file might still compile if the header is

/// included transitively.

///

/// Valid values are:

/// - Strict

/// - None

std::optional<Located<std::string>> MissingIncludes;

/// Controls IncludeCleaner diagnostics.

struct IncludesBlock {

/// Regexes that will be used to avoid diagnosing certain includes as

/// unused or missing. These can match any suffix of the header file in

/// question.

std::vector<Located<std::string>> IgnoreHeader;

};

IncludesBlock Includes;

▲ Show 20 Lines • Show All 71 Lines • Show Last 20 Lines

clang-tools-extra/clangd/ConfigYAML.cpp

Show First 20 Lines • Show All 122 Lines • ▼ Show 20 Lines	void parse(Fragment::DiagnosticsBlock &F, Node &N) {
DictParser Dict("Diagnostics", this);		DictParser Dict("Diagnostics", this);
Dict.handle("Suppress", [&](Node &N) {		Dict.handle("Suppress", [&](Node &N) {
if (auto Values = scalarValues(N))		if (auto Values = scalarValues(N))
F.Suppress = std::move(*Values);		F.Suppress = std::move(*Values);
});		});
Dict.handle("UnusedIncludes", [&](Node &N) {		Dict.handle("UnusedIncludes", [&](Node &N) {
F.UnusedIncludes = scalarValue(N, "UnusedIncludes");		F.UnusedIncludes = scalarValue(N, "UnusedIncludes");
});		});
		Dict.handle("MissingIncludes", [&](Node &N) {
		F.MissingIncludes = scalarValue(N, "MissingIncludes");
		});
Dict.handle("Includes", [&](Node &N) { parse(F.Includes, N); });		Dict.handle("Includes", [&](Node &N) { parse(F.Includes, N); });
Dict.handle("ClangTidy", [&](Node &N) { parse(F.ClangTidy, N); });		Dict.handle("ClangTidy", [&](Node &N) { parse(F.ClangTidy, N); });
Dict.handle("AllowStalePreamble", [&](Node &N) {		Dict.handle("AllowStalePreamble", [&](Node &N) {
F.AllowStalePreamble = boolValue(N, "AllowStalePreamble");		F.AllowStalePreamble = boolValue(N, "AllowStalePreamble");
});		});
Dict.parse(N);		Dict.parse(N);
}		}

▲ Show 20 Lines • Show All 316 Lines • Show Last 20 Lines

clang-tools-extra/clangd/IncludeCleaner.h

	Show All 14 Lines
	///			///
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef LLVM_CLANG_TOOLS_EXTRA_CLANGD_INCLUDECLEANER_H			#ifndef LLVM_CLANG_TOOLS_EXTRA_CLANGD_INCLUDECLEANER_H
	#define LLVM_CLANG_TOOLS_EXTRA_CLANGD_INCLUDECLEANER_H			#define LLVM_CLANG_TOOLS_EXTRA_CLANGD_INCLUDECLEANER_H

	#include "Headers.h"			#include "Headers.h"
	#include "ParsedAST.h"			#include "ParsedAST.h"
				#include "clang-include-cleaner/Types.h"
	#include "index/CanonicalIncludes.h"			#include "index/CanonicalIncludes.h"
	#include "clang/Basic/SourceLocation.h"			#include "clang/Basic/SourceLocation.h"
	#include "clang/Tooling/Inclusions/StandardLibrary.h"			#include "clang/Tooling/Inclusions/StandardLibrary.h"
				#include "clang/Tooling/Syntax/Tokens.h"
	#include "llvm/ADT/DenseSet.h"			#include "llvm/ADT/DenseSet.h"
	#include "llvm/ADT/STLFunctionalExtras.h"			#include "llvm/ADT/STLFunctionalExtras.h"
	#include "llvm/ADT/StringSet.h"			#include "llvm/ADT/StringSet.h"
	#include <optional>			#include <optional>
				#include <tuple>
	#include <vector>			#include <vector>

	namespace clang {			namespace clang {
	namespace clangd {			namespace clangd {

				// Data needed for missing include diagnostics.
				struct MissingIncludeDiagInfo {
				include_cleaner::Symbol Symbol;
				kadircetUnsubmitted Done Reply Inline Actions it seems unfortunate that we're duplicating these strings for each diag we want to emit. it might be better to just store a Symbol here (similar to Header) and delay spelling until needed. kadircet: it seems unfortunate that we're duplicating these strings for each diag we want to emit. it…
				VitaNuoAuthorUnsubmitted Done Reply Inline Actions I'm not sure I can see your point re `delaying spelling until needed`. Each `MissingIncludeDiagInfo` corresponds to one diagnostic. Whether we resolve the `Symbol` to the `SymbolName` during analysis (i.e., `walkUsed`) or in diagnostic generation (i.e., `generateMissingIncludeDiagnostics`), does not change the fact that the same symbol will be resolved to its name multiple times. As discussed in the doc, this results in simpler code than the version that maps a single `Symbol` object to multiple `Range`s and `Provider`s. AFAICS, the implementation of `Symbol` to `SymbolName` resolution does not seem to be very expensive either. VitaNuo: I'm not sure I can see your point re `delaying spelling until needed`. Each…
				kadircetUnsubmitted Done Reply Inline Actions Whether we resolve the Symbol to the SymbolName during analysis (i.e., walkUsed) or in diagnostic generation (i.e., generateMissingIncludeDiagnostics), does not change the fact that the same symbol will be resolved to its name multiple times. we might not generate those diagnostics always, e.g. missing-includes is disabled, but unused-includes is on, or maybe we're going to use these results for something else like Hover responses. As discussed in the doc, this results in simpler code than the version that maps a single Symbol object to multiple Ranges and Providers. I wasn't trying to suggest having a map here, I was suggesting just storing a `Symbol S` instead of `string Name`. AFAICS, the implementation of Symbol to SymbolName resolution does not seem to be very expensive either. Well, generating strings are usually expensive (not asymptotically but in practice, as they tend to require lots of memory allocations). kadircet: > Whether we resolve the Symbol to the SymbolName during analysis (i.e., walkUsed) or in…
				VitaNuoAuthorUnsubmitted Done Reply Inline Actions Ok, agreed. Storing symbols now. Having only unused include analysis on is a convincing use case. VitaNuo: Ok, agreed. Storing symbols now. Having only unused include analysis on is a convincing use…
				syntax::FileRange SymRefRange;
				std::vector<include_cleaner::Header> Providers;

				bool operator==(const MissingIncludeDiagInfo &Other) const {
				return std::tie(SymRefRange, Providers, Symbol) ==
				kadircetUnsubmitted Done Reply Inline Actions nit: `std::tie(SymbolName, SymRefRange, Providers) == std::tie(Other.SymbolName, ...);` I'd also put the SymbolName match to be the last, as it's a string match and might be more costly (if we can bail out early) kadircet: nit: `std::tie(SymbolName, SymRefRange, Providers) == std::tie(Other.SymbolName, ...);` I'd…
				VitaNuoAuthorUnsubmitted Done Reply Inline Actions Ok, sure. I can see your point with the string comparisons. But I don't fully see the point of using `std::tie` here. What is so great about creating an extra object, even if it only stores references? Is it a purely stylistic suggestion? VitaNuo: Ok, sure. I can see your point with the string comparisons. But I don't fully see the point of…
				kadircetUnsubmitted Done Reply Inline Actions right, the comments with `nit:` prefix are usually things that won't matter much in practice, but reflects my (well, in the general case the reviewer's) or codebase's preference. kadircet: right, the comments with `nit:` prefix are usually things that won't matter much in practice…
				std::tie(Other.SymRefRange, Other.Providers, Other.Symbol);
				}
				};

				struct IncludeCleanerFindings {
				std::vector<const Inclusion *> UnusedIncludes;
				std::vector<MissingIncludeDiagInfo> MissingIncludes;
				kadircetUnsubmitted Done Reply Inline Actions I don't think we've much to gain by using SmallVector here, instead of std::vector kadircet: I don't think we've much to gain by using SmallVector here, instead of std::vector
				VitaNuoAuthorUnsubmitted Done Reply Inline Actions Sure. I'm not so clear on the preferences yet. AFAIU your point, stdlib is to be preferred unless the llvm alternative is clearly beneficial. Is that the case? VitaNuo: Sure. I'm not so clear on the preferences yet. AFAIU your point, stdlib is to be preferred…
				kadircetUnsubmitted Done Reply Inline Actions stdlib is to be preferred unless the llvm alternative is clearly beneficial Right. `llvm::SmallVector` and `std::vector` have different use cases, we usually go for the former if we're sure that number of elements we want to store are going to be handful (literally less than 10) most of the time and the size of the objects themselves is not too big. As `SmallVector` chooses to store objects internally (until it grows too much), rather than allocating a bunch of memory elsewhere and just storing a pointer (as std::vector does). This has benefits when your vector isn't going to grow beyond smallvector's limits (you don't need to pay for memory allocations, which are expensive), but has other costs (e.g. moving a std::vector is trivial as it's just assignment of a pointer and size, but moving a smallvector might not be as trivial (it at least needs to move all the elements) also the initial memory cost for smallvector is higher than std::vector (as it takes up the same space independent of it's fullness). So in this example, we're unlikely to have a small number of `MissingIncludes`, `MissingIncludeDiagInfo` is a big enough struct (vector and filerange add up to more than 30 bytes). Hence std::vector feels like the better choice. kadircet: > stdlib is to be preferred unless the llvm alternative is clearly beneficial Right. `llvm…
				VitaNuoAuthorUnsubmitted Done Reply Inline Actions thanks! VitaNuo: thanks!
				};

	struct ReferencedLocations {			struct ReferencedLocations {
	llvm::DenseSet<SourceLocation> User;			llvm::DenseSet<SourceLocation> User;
	llvm::DenseSet<tooling::stdlib::Symbol> Stdlib;			llvm::DenseSet<tooling::stdlib::Symbol> Stdlib;
	};			};

	/// Finds locations of all symbols used in the main file.			/// Finds locations of all symbols used in the main file.
	///			///
	/// - RecursiveASTVisitor finds references to symbols and records their			/// - RecursiveASTVisitor finds references to symbols and records their
	▲ Show 20 Lines • Show All 48 Lines • ▼ Show 20 Lines

	/// Retrieves headers that are referenced from the main file but not used.			/// Retrieves headers that are referenced from the main file but not used.
	/// In unclear cases, headers are not marked as unused.			/// In unclear cases, headers are not marked as unused.
	std::vector<const Inclusion *>			std::vector<const Inclusion *>
	getUnused(ParsedAST &AST,			getUnused(ParsedAST &AST,
	const llvm::DenseSet<IncludeStructure::HeaderID> &ReferencedFiles,			const llvm::DenseSet<IncludeStructure::HeaderID> &ReferencedFiles,
	const llvm::StringSet<> &ReferencedPublicHeaders);			const llvm::StringSet<> &ReferencedPublicHeaders);

				IncludeCleanerFindings computeIncludeCleanerFindings(ParsedAST &AST);
	std::vector<const Inclusion *> computeUnusedIncludes(ParsedAST &AST);			std::vector<const Inclusion *> computeUnusedIncludes(ParsedAST &AST);
	// The same as computeUnusedIncludes, but it is an experimental and
	// include-cleaner-lib-based implementation.
	std::vector<const Inclusion *>
	computeUnusedIncludesExperimental(ParsedAST &AST);

	std::vector<Diag> issueUnusedIncludesDiagnostics(ParsedAST &AST,			std::vector<Diag> issueIncludeCleanerDiagnostics(ParsedAST &AST,
	llvm::StringRef Code);			llvm::StringRef Code);

	/// Affects whether standard library includes should be considered for			/// Affects whether standard library includes should be considered for
	/// removal. This is off by default for now due to implementation limitations:			/// removal. This is off by default for now due to implementation limitations:
	/// - macros are not tracked			/// - macros are not tracked
	/// - symbol names without a unique associated header are not tracked			/// - symbol names without a unique associated header are not tracked
	/// - references to std-namespaced C types are not properly tracked:			/// - references to std-namespaced C types are not properly tracked:
	/// instead of std::size_t -> <cstddef> we see ::size_t -> <stddef.h>			/// instead of std::size_t -> <cstddef> we see ::size_t -> <stddef.h>
	/// FIXME: remove this hack once the implementation is good enough.			/// FIXME: remove this hack once the implementation is good enough.
	void setIncludeCleanerAnalyzesStdlib(bool B);			void setIncludeCleanerAnalyzesStdlib(bool B);

	} // namespace clangd			} // namespace clangd
	} // namespace clang			} // namespace clang

	#endif // LLVM_CLANG_TOOLS_EXTRA_CLANGD_INCLUDECLEANER_H			#endif // LLVM_CLANG_TOOLS_EXTRA_CLANGD_INCLUDECLEANER_H

clang-tools-extra/clangd/IncludeCleaner.cpp

//===--- IncludeCleaner.cpp - Unused/Missing Headers Analysis ---*- C++ -*-===// //===--- IncludeCleaner.cpp - Unused/Missing Headers Analysis ---*- C++ -*-===//

// //

// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions. // Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.

// See https://llvm.org/LICENSE.txt for license information. // See https://llvm.org/LICENSE.txt for license information.

// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception // SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception

// //

//===----------------------------------------------------------------------===// //===----------------------------------------------------------------------===//

#include "IncludeCleaner.h" #include "IncludeCleaner.h"

#include "Config.h" #include "Config.h"

#include "Diagnostics.h"

#include "Headers.h" #include "Headers.h"

#include "ParsedAST.h" #include "ParsedAST.h"

#include "Protocol.h" #include "Protocol.h"

#include "SourceCode.h" #include "SourceCode.h"

#include "URI.h"

#include "clang-include-cleaner/Analysis.h" #include "clang-include-cleaner/Analysis.h"

#include "clang-include-cleaner/Types.h" #include "clang-include-cleaner/Types.h"

#include "index/CanonicalIncludes.h" #include "index/CanonicalIncludes.h"

#include "support/Logger.h" #include "support/Logger.h"

#include "support/Path.h"

#include "support/Trace.h" #include "support/Trace.h"

#include "clang/AST/ASTContext.h" #include "clang/AST/ASTContext.h"

#include "clang/AST/DeclCXX.h"

#include "clang/AST/Expr.h"

#include "clang/AST/ExprCXX.h" #include "clang/AST/ExprCXX.h"

#include "clang/AST/RecursiveASTVisitor.h" #include "clang/AST/RecursiveASTVisitor.h"

#include "clang/AST/TemplateName.h"

#include "clang/AST/Type.h"

#include "clang/Basic/Diagnostic.h"

#include "clang/Basic/LLVM.h"

#include "clang/Basic/SourceLocation.h" #include "clang/Basic/SourceLocation.h"

#include "clang/Basic/SourceManager.h" #include "clang/Basic/SourceManager.h"

#include "clang/Format/Format.h"

#include "clang/Lex/HeaderSearch.h" #include "clang/Lex/HeaderSearch.h"

#include "clang/Lex/Preprocessor.h" #include "clang/Lex/Preprocessor.h"

#include "clang/Tooling/Core/Replacement.h"

#include "clang/Tooling/Inclusions/HeaderIncludes.h"

#include "clang/Tooling/Inclusions/IncludeStyle.h"

#include "clang/Tooling/Inclusions/StandardLibrary.h"

#include "clang/Tooling/Syntax/Tokens.h" #include "clang/Tooling/Syntax/Tokens.h"

#include "llvm/ADT/ArrayRef.h" #include "llvm/ADT/ArrayRef.h"

#include "llvm/ADT/DenseSet.h"

#include "llvm/ADT/STLExtras.h"

#include "llvm/ADT/STLFunctionalExtras.h" #include "llvm/ADT/STLFunctionalExtras.h"

#include "llvm/ADT/SmallString.h" #include "llvm/ADT/SmallString.h"

#include "llvm/ADT/SmallVector.h"

#include "llvm/ADT/StringRef.h"

#include "llvm/ADT/StringSet.h" #include "llvm/ADT/StringSet.h"

#include "llvm/Support/Casting.h"

#include "llvm/Support/Error.h"

#include "llvm/Support/ErrorHandling.h"

#include "llvm/Support/FormatVariadic.h" #include "llvm/Support/FormatVariadic.h"

#include "llvm/Support/Path.h" #include "llvm/Support/Path.h"

#include "llvm/Support/Regex.h" #include "llvm/Support/Regex.h"

#include <functional> #include <iterator>

#include <optional> #include <optional>

#include <string>

#include <vector>

namespace clang { namespace clang {

namespace clangd { namespace clangd {

static bool AnalyzeStdlib = false; static bool AnalyzeStdlib = false;

void setIncludeCleanerAnalyzesStdlib(bool B) { AnalyzeStdlib = B; } void setIncludeCleanerAnalyzesStdlib(bool B) { AnalyzeStdlib = B; }

namespace { namespace {

▲ Show 20 Lines • Show All 208 Lines • ▼ Show 20 Lines if (!Macro)

continue; continue;

auto Loc = Macro->Info->getDefinitionLoc(); auto Loc = Macro->Info->getDefinitionLoc();

if (Loc.isValid()) if (Loc.isValid())

Result.User.insert(Loc); Result.User.insert(Loc);

// FIXME: support stdlib macros // FIXME: support stdlib macros

} }

bool isFilteredByConfig(const Config &Cfg, llvm::StringRef HeaderPath) {

kadircetUnsubmitted

Done

s/HeaderSpelling/HeaderPath

kadircet: s/HeaderSpelling/HeaderPath

// Convert the path to Unix slashes and try to match against the filter.

llvm::SmallString<64> NormalizedPath(HeaderPath);

kadircetUnsubmitted

Done

s/Path/NormalizedPath

kadircet: s/Path/NormalizedPath

llvm::sys::path::native(NormalizedPath, llvm::sys::path::Style::posix);

for (auto &Filter : Cfg.Diagnostics.Includes.IgnoreHeader) {

if (Filter(NormalizedPath))

return true;

}

return false;

}

static bool mayConsiderUnused(const Inclusion &Inc, ParsedAST &AST, static bool mayConsiderUnused(const Inclusion &Inc, ParsedAST &AST,

const Config &Cfg) { const Config &Cfg) {

if (Inc.BehindPragmaKeep) if (Inc.BehindPragmaKeep)

return false; return false;

// FIXME(kirillbobyrev): We currently do not support the umbrella headers. // FIXME(kirillbobyrev): We currently do not support the umbrella headers.

// System headers are likely to be standard library headers. // System headers are likely to be standard library headers.

// Until we have good support for umbrella headers, don't warn about them. // Until we have good support for umbrella headers, don't warn about them.

Show All 14 Lines static bool mayConsiderUnused(const Inclusion &Inc, ParsedAST &AST,

// Headers without include guards have side effects and are not // Headers without include guards have side effects and are not

// self-contained, skip them. // self-contained, skip them.

if (!AST.getPreprocessor().getHeaderSearchInfo().isFileMultipleIncludeGuarded( if (!AST.getPreprocessor().getHeaderSearchInfo().isFileMultipleIncludeGuarded(

&FE->getFileEntry())) { &FE->getFileEntry())) {

dlog("{0} doesn't have header guard and will not be considered unused", dlog("{0} doesn't have header guard and will not be considered unused",

FE->getName()); FE->getName());

return false; return false;

} }

for (auto &Filter : Cfg.Diagnostics.Includes.IgnoreHeader) {

kadircetUnsubmitted

Done

can you also change the logic here to use isFilteredByConfig (we need the native call inside isFilteredByConfig as well to make sure it works on windows)

kadircet: can you also change the logic here to use `isFilteredByConfig` (we need the `native` call…

// Convert the path to Unix slashes and try to match against the filter. if (isFilteredByConfig(Cfg, Inc.Resolved)) {

llvm::SmallString<64> Path(Inc.Resolved);

llvm::sys::path::native(Path, llvm::sys::path::Style::posix);

if (Filter(Path)) {

dlog("{0} header is filtered out by the configuration", FE->getName()); dlog("{0} header is filtered out by the configuration", FE->getName());

return false; return false;

} }

}

return true; return true;

} }

// In case symbols are coming from non self-contained header, we need to find // In case symbols are coming from non self-contained header, we need to find

// its first includer that is self-contained. This is the header users can // its first includer that is self-contained. This is the header users can

// include, so it will be responsible for bringing the symbols from given // include, so it will be responsible for bringing the symbols from given

// header into the scope. // header into the scope.

FileID headerResponsible(FileID ID, const SourceManager &SM, FileID headerResponsible(FileID ID, const SourceManager &SM,

Show All 12 Lines if (Includes.isSelfContained(*HID))

break; break;

// The header is not self-contained: put the responsibility for its symbols // The header is not self-contained: put the responsibility for its symbols

// on its includer. // on its includer.

ID = SM.getFileID(SM.getIncludeLoc(ID)); ID = SM.getFileID(SM.getIncludeLoc(ID));

} }

return ID; return ID;

} }

include_cleaner::Includes

convertIncludes(const SourceManager &SM,

const llvm::ArrayRef<Inclusion> MainFileIncludes) {

include_cleaner::Includes Includes;

for (const Inclusion &Inc : MainFileIncludes) {

include_cleaner::Include TransformedInc;

llvm::StringRef WrittenRef = llvm::StringRef(Inc.Written);

TransformedInc.Spelled = WrittenRef.trim("\"<>");

TransformedInc.HashLocation =

SM.getComposedLoc(SM.getMainFileID(), Inc.HashOffset);

TransformedInc.Line = Inc.HashLine + 1;

TransformedInc.Angled = WrittenRef.starts_with("<");

auto FE = SM.getFileManager().getFile(Inc.Resolved);

kadircetUnsubmitted

Done

unfortunately getFile returns an llvm::Expected which requires explicit error handling (or it'll trigger a crash). you can simply elog the issue:

if (!FE) {
  elog("IncludeCleaner: Failed to get an entry for resolved path {0}: {1}", Inc.Resolved, FE.takeError());
  continue;
}

kadircet: unfortunately `getFile` returns an `llvm::Expected` which requires explicit error handling (or…

VitaNuoAuthorUnsubmitted

Done

It returns llvm::ErrorOr, if I am not mistaken. There was explicit error handling already (if (!FE) continue below), just without the elog. Are you trying to say it will crash without the logging? Not sure that's feasible :)
Added the logging.

VitaNuo: It returns `llvm::ErrorOr`, if I am not mistaken. There was explicit error handling already…

kadircetUnsubmitted

Done

It returns llvm::ErrorOr, if I am not mistaken.

Ah you're right, I confused it with getFileRef. So in theory ErrorOr doesn't require explicit checking hence it won't trigger a crash if destroyed while containing an error.

Are you trying to say it will crash without the logging? Not sure that's feasible :)

Right, it isn't the logging that'll prevent the crash but rather a combination of the call to takeError and the way elog consumes Error objects. But it isn't relevant here, as ErrorOr doesn't require mandatory handling.

kadircet: > It returns llvm::ErrorOr, if I am not mistaken. Ah you're right, I confused it with…

VitaNuoAuthorUnsubmitted

Done

thanks for the explanation.

VitaNuo: thanks for the explanation.

if (!FE) {

elog("IncludeCleaner: Failed to get an entry for resolved path {0}: {1}",

Inc.Resolved, FE.getError().message());

continue;

}

TransformedInc.Resolved = *FE;

Includes.add(std::move(TransformedInc));

}

return Includes;

}

std::string spellHeader(ParsedAST &AST, const FileEntry *MainFile,

include_cleaner::Header Provider) {

if (Provider.kind() == include_cleaner::Header::Physical) {

kadircetUnsubmitted

Done

nit: you can directly define SpelledHeader at line 377

kadircet: nit: you can directly define `SpelledHeader` at line 377

if (auto CanonicalPath =

getCanonicalPath(Provider.physical(), AST.getSourceManager())) {

std::string SpelledHeader =

llvm::cantFail(URI::includeSpelling(URI::create(*CanonicalPath)));

if (!SpelledHeader.empty())

return SpelledHeader;

}

return include_cleaner::spellHeader(

Provider, AST.getPreprocessor().getHeaderSearchInfo(), MainFile);

}

std::vector<include_cleaner::SymbolReference>

collectMacroReferences(ParsedAST &AST) {

const auto &SM = AST.getSourceManager();

// FIXME: !!this is a hacky way to collect macro references.

std::vector<include_cleaner::SymbolReference> Macros;

auto &PP = AST.getPreprocessor();

for (const syntax::Token &Tok :

AST.getTokens().spelledTokens(SM.getMainFileID())) {

auto Macro = locateMacroAt(Tok, PP);

if (!Macro)

continue;

if (auto DefLoc = Macro->Info->getDefinitionLoc(); DefLoc.isValid())

Macros.push_back(

{Tok.location(),

include_cleaner::Macro{/*Name=*/PP.getIdentifierInfo(Tok.text(SM)),

DefLoc},

include_cleaner::RefType::Explicit});

}

return Macros;

}

llvm::StringRef getResolvedPath(const include_cleaner::Header &SymProvider) {

kadircetUnsubmitted

Done

what about just resolvedPath, if you'd rather keep the verb, i think get makes more sense than find. we're not really searching anything.

kadircet: what about just `resolvedPath`, if you'd rather keep the verb, i think `get` makes more sense…

VitaNuoAuthorUnsubmitted

Done

Ok, let's call it get. I do prefer verbs for methods, that's correct.

VitaNuo: Ok, let's call it `get`. I do prefer verbs for methods, that's correct.

switch (SymProvider.kind()) {

kadircetUnsubmitted

Done

this shouldn't be spelling, it should be the resolved path of the include.

kadircet: this shouldn't be spelling, it should be the resolved path of the include.

VitaNuoAuthorUnsubmitted

Done

Ok thanks.

VitaNuo: Ok thanks.

case include_cleaner::Header::Physical:

return SymProvider.physical()->tryGetRealPathName();

case include_cleaner::Header::Standard:

kadircetUnsubmitted

Done

nit: you can directly return SymProvider.physical()->tryGetRealPathName(); (same for other 2 cases) and have an llvm_unreachable("Unknown symbol kind"); after the switch statement.

kadircet: nit: you can directly `return SymProvider.physical()->tryGetRealPathName();` (same for other 2…

return SymProvider.standard().name().trim("<>\"");

case include_cleaner::Header::Verbatim:

return SymProvider.verbatim().trim("<>\"");

kadircetUnsubmitted

Done

in this and the next case we need to trim <>"

kadircet: in this and the next case we need to trim `<>"`

}

llvm_unreachable("Unknown header kind");

}

std::string getSymbolName(const include_cleaner::Symbol &Sym) {

switch (Sym.kind()) {

case include_cleaner::Symbol::Macro:

return Sym.macro().Name->getName().str();

case include_cleaner::Symbol::Declaration:

kadircetUnsubmitted

Done

same as above, either just symbolName or get

kadircet: same as above, either just `symbolName` or `get`

return llvm::dyn_cast<NamedDecl>(&Sym.declaration())

->getQualifiedNameAsString();

}

llvm_unreachable("Unknown symbol kind");

kadircetUnsubmitted

Done

again you can just return here and below

kadircet: again you can just return here and below

kadircetUnsubmitted

Done

getName is a StringRef, and unfortunately there are some platforms (like darwin) that don't support implicit conversion from stringrefs to std::string. so can you call .str() explicitly in the end?

kadircet: `getName` is a StringRef, and unfortunately there are some platforms (like darwin) that don't…

}

std::vector<Diag> generateMissingIncludeDiagnostics(

ParsedAST &AST, llvm::ArrayRef<MissingIncludeDiagInfo> MissingIncludes,

llvm::StringRef Code) {

std::vector<Diag> Result;

kadircetUnsubmitted

Done

can you prefix this with IncludeCleaner: and rather say not diagnosing missing include {0}, filtered by config to add a bit more context about what specific interaction the log is coming from

kadircet: can you prefix this with `IncludeCleaner: ` and rather say `not diagnosing missing include {0}…

const Config &Cfg = Config::current();

if (Cfg.Diagnostics.MissingIncludes != Config::IncludesPolicy::Strict ||

Cfg.Diagnostics.SuppressAll ||

Cfg.Diagnostics.Suppress.contains("missing-includes")) {

return Result;

}

const SourceManager &SM = AST.getSourceManager();

const FileEntry *MainFile = SM.getFileEntryForID(SM.getMainFileID());

kadircetUnsubmitted

Done

we should respect the style configurations (sorry for missing this in previous iterations).

you can get the relevant style with: clang::format::getStyle, which has an IncludeStyle. in case the getStyle fails, we should fallback to clang::format::getLLVMStyle as we do in other places. you can get at the relevant VFS instance through sourcemanager.

kadircet: we should respect the style configurations (sorry for missing this in previous iterations).

auto FileStyle = format::getStyle(

format::DefaultFormatStyle, AST.tuPath(), format::DefaultFallbackStyle,

kadircetUnsubmitted

Done

creating a copy of LLVM style unnecessarily all the time is not really great, can you move this into the failure case instead?

also you can drop the clang:: here and elsewhere, as this code is already part of clang:: namespace.

kadircet: creating a copy of LLVM style unnecessarily all the time is not really great, can you move this…

kadircetUnsubmitted

Done

nit: this could be shorter with

auto FileStyle = format::getStyle(..);
if (!FileStyle) {
  elog("...");
  FileStyle = format::getLLVMStyle();
}
tooling::HeaderIncludes HeaderIncludes(AST.tuPath(), Code, FileStyle->IncludeStyle);

kadircet: nit: this could be shorter with ``` auto FileStyle = format::getStyle(..); if (!FileStyle) {…

Code, &SM.getFileManager().getVirtualFileSystem());

kadircetUnsubmitted

Done

as mentioned above we also need to make sure we're passing the relevant VFS instance inside the source manager, rather than using the real file system (as some clients rely on the VFS).

kadircet: as mentioned above we also need to make sure we're passing the relevant VFS instance inside the…

if (!FileStyle) {

kadircetUnsubmitted

Done

s/MainFile->getName()/AST.tuPath()/

to be consistent with other places.

kadircet: s/MainFile->getName()/AST.tuPath()/ to be consistent with other places.

elog("Couldn't infer style", FileStyle.takeError());

FileStyle = format::getLLVMStyle();

kadircetUnsubmitted

Done

can you also elog this error? as it should be rare and when this goes wrong, having this mentioned in the logs are really useful for debugging (since the failure is actually outside of clangd, it usually means a malformed config file somewhere)

kadircet: can you also `elog` this error? as it should be rare and when this goes wrong, having this…

}

tooling::HeaderIncludes HeaderIncludes(AST.tuPath(), Code,

FileStyle->IncludeStyle);

for (const auto &SymbolWithMissingInclude : MissingIncludes) {

llvm::StringRef ResolvedPath =

getResolvedPath(SymbolWithMissingInclude.Providers.front());

if (isFilteredByConfig(Cfg, ResolvedPath)) {

dlog("IncludeCleaner: not diagnosing missing include {0}, filtered by "

"config",

ResolvedPath);

continue;

kadircetUnsubmitted

Done

nit:

auto &F = D.Fixes.emplace_back();
F.Message = ...;
F.Edits.push_back(replacementToEdit(...));

kadircet: nit: ``` auto &F = D.Fixes.emplace_back(); F.Message = ...; F.Edits.push_back(replacementToEdit…

}

std::string Spelling =

spellHeader(AST, MainFile, SymbolWithMissingInclude.Providers.front());

kadircetUnsubmitted

Done

nit: it'd put this next to D.File above

kadircet: nit: it'd put this next to `D.File` above

llvm::StringRef HeaderRef{Spelling};

bool Angled = HeaderRef.starts_with("<");

// We might suggest insertion of an existing include in edge cases, e.g.,

// include is present in a PP-disabled region, or spelling of the header

// turns out to be the same as one of the unresolved includes in the

// main file.

std::optional<tooling::Replacement> Replacement = HeaderIncludes.insert(

HeaderRef.trim("\"<>"), Angled, tooling::IncludeDirective::Include);

if (!Replacement.has_value())

continue;

Diag &D = Result.emplace_back();

D.Message =

llvm::formatv("No header providing \"{0}\" is directly included",

getSymbolName(SymbolWithMissingInclude.Symbol));

D.Name = "missing-includes";

D.Source = Diag::DiagSource::Clangd;

D.File = AST.tuPath();

D.InsideMainFile = true;

D.Severity = DiagnosticsEngine::Warning;

D.Range = clangd::Range{

offsetToPosition(Code,

SymbolWithMissingInclude.SymRefRange.beginOffset()),

offsetToPosition(Code,

SymbolWithMissingInclude.SymRefRange.endOffset())};

auto &F = D.Fixes.emplace_back();

F.Message = "#include " + Spelling;

TextEdit Edit = replacementToEdit(Code, *Replacement);

F.Edits.emplace_back(std::move(Edit));

}

return Result;

}

std::vector<Diag> generateUnusedIncludeDiagnostics(

PathRef FileName, llvm::ArrayRef<const Inclusion *> UnusedIncludes,

llvm::StringRef Code) {

std::vector<Diag> Result;

const Config &Cfg = Config::current();

if (Cfg.Diagnostics.UnusedIncludes == Config::IncludesPolicy::None ||

Cfg.Diagnostics.SuppressAll ||

Cfg.Diagnostics.Suppress.contains("unused-includes")) {

return Result;

}

for (const auto *Inc : UnusedIncludes) {

Diag &D = Result.emplace_back();

D.Message =

llvm::formatv("included header {0} is not used directly",

llvm::sys::path::filename(

Inc->Written.substr(1, Inc->Written.size() - 2),

llvm::sys::path::Style::posix));

D.Name = "unused-includes";

D.Source = Diag::DiagSource::Clangd;

D.File = FileName;

D.InsideMainFile = true;

D.Severity = DiagnosticsEngine::Warning;

D.Tags.push_back(Unnecessary);

D.Range = getDiagnosticRange(Code, Inc->HashOffset);

// FIXME(kirillbobyrev): Removing inclusion might break the code if the

// used headers are only reachable transitively through this one. Suggest

// including them directly instead.

// FIXME(kirillbobyrev): Add fix suggestion for adding IWYU pragmas

// (keep/export) remove the warning once we support IWYU pragmas.

auto &F = D.Fixes.emplace_back();

F.Message = "remove #include directive";

F.Edits.emplace_back();

F.Edits.back().range.start.line = Inc->HashLine;

F.Edits.back().range.end.line = Inc->HashLine + 1;

}

return Result;

}

} // namespace } // namespace

ReferencedLocations findReferencedLocations(ASTContext &Ctx, Preprocessor &PP, ReferencedLocations findReferencedLocations(ASTContext &Ctx, Preprocessor &PP,

const syntax::TokenBuffer *Tokens) { const syntax::TokenBuffer *Tokens) {

trace::Span Tracer("IncludeCleaner::findReferencedLocations"); trace::Span Tracer("IncludeCleaner::findReferencedLocations");

ReferencedLocations Result; ReferencedLocations Result;

const auto &SM = Ctx.getSourceManager(); const auto &SM = Ctx.getSourceManager();

ReferencedLocationCrawler Crawler(Result, SM); ReferencedLocationCrawler Crawler(Result, SM);

▲ Show 20 Lines • Show All 120 Lines • ▼ Show 20 Lines translateToHeaderIDs(const ReferencedFiles &Files,

} }

for (tooling::stdlib::Header StdlibUsed : Files.Stdlib) for (tooling::stdlib::Header StdlibUsed : Files.Stdlib)

for (auto HID : Includes.StdlibHeaders.lookup(StdlibUsed)) for (auto HID : Includes.StdlibHeaders.lookup(StdlibUsed))

TranslatedHeaderIDs.insert(HID); TranslatedHeaderIDs.insert(HID);

return TranslatedHeaderIDs; return TranslatedHeaderIDs;

} }

// This is the original clangd-own implementation for computing unused // This is the original clangd-own implementation for computing unused

// #includes. Eventually it will be deprecated and replaced by the // #includes. Eventually it will be deprecated and replaced by the

kadircetUnsubmitted

Done

this should also be either static or put into anon namespace

kadircet: this should also be either static or put into anon namespace

VitaNuoAuthorUnsubmitted

Done

sure, thanks.

VitaNuo: sure, thanks.

// include-cleaner-lib-based implementation. // include-cleaner-lib-based implementation.

std::vector<const Inclusion *> computeUnusedIncludes(ParsedAST &AST) { std::vector<const Inclusion *> computeUnusedIncludes(ParsedAST &AST) {

const auto &SM = AST.getSourceManager(); const auto &SM = AST.getSourceManager();

auto Refs = findReferencedLocations(AST); auto Refs = findReferencedLocations(AST);

auto ReferencedFiles = auto ReferencedFiles =

findReferencedFiles(Refs, AST.getIncludeStructure(), findReferencedFiles(Refs, AST.getIncludeStructure(),

AST.getCanonicalIncludes(), AST.getSourceManager()); AST.getCanonicalIncludes(), AST.getSourceManager());

auto ReferencedHeaders = auto ReferencedHeaders =

translateToHeaderIDs(ReferencedFiles, AST.getIncludeStructure(), SM); translateToHeaderIDs(ReferencedFiles, AST.getIncludeStructure(), SM);

return getUnused(AST, ReferencedHeaders, ReferencedFiles.SpelledUmbrellas); return getUnused(AST, ReferencedHeaders, ReferencedFiles.SpelledUmbrellas);

} }

std::vector<const Inclusion *>

computeUnusedIncludesExperimental(ParsedAST &AST) { IncludeCleanerFindings computeIncludeCleanerFindings(ParsedAST &AST) {

kadircetUnsubmitted

Done

since this is a local symbol, either mark it as static or move it to anonymous namespace above so that it won't be visible to other translation units.

kadircet: since this is a local symbol, either mark it as `static` or move it to anonymous namespace…

VitaNuoAuthorUnsubmitted

Done

Oh yeah makes sense.

VitaNuo: Oh yeah makes sense.

const auto &SM = AST.getSourceManager(); const auto &SM = AST.getSourceManager();

kadircetUnsubmitted

Done

nit: you can use llvm::ArrayRef instead of a const std::vector &. ArrayRef is a trivially copyable/constructible type for providing views into a consecutive set of elements.

kadircet: nit: you can use `llvm::ArrayRef` instead of a `const std::vector &`. `ArrayRef` is a trivially…

VitaNuoAuthorUnsubmitted

Done

Ok.

VitaNuo: Ok.

const auto &Includes = AST.getIncludeStructure(); const auto &Includes = AST.getIncludeStructure();

include_cleaner::Includes ConvertedIncludes =

// FIXME: !!this is a hacky way to collect macro references. convertIncludes(SM, Includes.MainFileIncludes);

std::vector<include_cleaner::SymbolReference> Macros; const FileEntry *MainFile = SM.getFileEntryForID(SM.getMainFileID());

auto &PP = AST.getPreprocessor();

for (const syntax::Token &Tok : std::vector<include_cleaner::SymbolReference> Macros =

kadircetUnsubmitted

Done

generateMissingIncludeDiagnostics should also be either static or put into anon namespace

kadircet: `generateMissingIncludeDiagnostics` should also be either static or put into anon namespace

VitaNuoAuthorUnsubmitted

Done

Well then generateUnusedIncludeDiagnostics too, I guess.

VitaNuo: Well then `generateUnusedIncludeDiagnostics` too, I guess.

AST.getTokens().spelledTokens(SM.getMainFileID())) { collectMacroReferences(AST);

kadircetUnsubmitted

Done

i think it's better to use an llvm::ArrayRef instead of llvm::SmallVector for diag infos here, we don't need to copy.

kadircet: i think it's better to use an `llvm::ArrayRef` instead of `llvm::SmallVector` for diag infos…

auto Macro = locateMacroAt(Tok, PP); std::vector<MissingIncludeDiagInfo> MissingIncludes;

kadircetUnsubmitted

Not Done

static_cast<IncludeStructure::HeaderID>(*Inc.HeaderID));

}

- std::vector<include_cleaner::SymbolReference> Macros =

+ auto Macros =

collectMacroReferences(AST);

kadircet:

VitaNuoAuthorUnsubmitted

Done

Why would you use auto here? The return type is not obvious from the function call.

The style guide says: "types that you and your reviewer experience as unnecessary clutter will very often provide useful information to others. For example, you can assume that the return type of make_unique<Foo>() is obvious, but the return type of MyWidgetFactory() probably isn't."
(http://go/cstyle#Type_deduction)

VitaNuo: Why would you use `auto` here? The return type is not obvious from the function call. The…

kadircetUnsubmitted

Done

well, feel free to keep it. but at least, inside clangd codebase, we use auto frequently (maybe more than we should). especially in cases like this where a declaration would otherwise fit a single line and because the name of the initializer implies this will be a container of macro references and any extra details about the container is probably irrelevant.

kadircet: well, feel free to keep it. but at least, inside clangd codebase, we use auto frequently (maybe…

kadircetUnsubmitted

Done

include_cleaner::Includes::Line is 1-based, whereas clang::clangd::Inclusion::HashLine is 0-based. so we need to have a +1 on the RHS. we are probably missing some test coverage if this didn't fail

kadircet: include_cleaner::Includes::Line is 1-based, whereas clang::clangd::Inclusion::HashLine is 0…

VitaNuoAuthorUnsubmitted

Done

Ok, got it. We only use the include conversion for matching, and matching seems not to use line numbers. I guess it's the reason nothing fails.
I can't write a test for this function directly since it's in an anonymous namespace.

VitaNuo: Ok, got it. We only use the include conversion for matching, and matching seems not to use line…

if (!Macro)

continue;

if (auto DefLoc = Macro->Info->getDefinitionLoc(); DefLoc.isValid())

Macros.push_back(

{Tok.location(),

include_cleaner::Macro{/*Name=*/PP.getIdentifierInfo(Tok.text(SM)),

DefLoc},

include_cleaner::RefType::Explicit});

}

llvm::DenseSet<IncludeStructure::HeaderID> Used; llvm::DenseSet<IncludeStructure::HeaderID> Used;

trace::Span Tracer("include_cleaner::walkUsed");

kadircetUnsubmitted

Done

i don't think we should convert any unresolved includes. it usually means header wasn't found, so we can't perform any reliable analysis on them. anything i am missing?

kadircet: i don't think we should convert any unresolved includes. it usually means header wasn't found…

VitaNuoAuthorUnsubmitted

Done

Ok I will skip unresolved includes. But I am not sure I fully understand. We do the following:

Convert clangd includes to include-cleaner includes.
Match include-cleaner includes with symbol providers.
If match found, symbol reference is satisfied.

How does it matter in this scenario if the include is resolved? AFAIU as long as the header is spelled in the main file + it's matched with a symbol provider, we should say that the symbol reference is satisfied.

Otherwise, it seems like we'll say that the header is missing, although it's there in the main file and unresolved.

I don't know if this is in any way a realistic scenario. I am just approaching it with general logic, and in this sense having more "satisfied" symbols seems better than having less => leads to less false positives. It can lead to false negatives, too, but AFAIU false negatives are much less of a risk for missing include management.

VitaNuo: Ok I will skip unresolved includes. But I am not sure I fully understand. We do the following…

kadircetUnsubmitted

Done

How does it matter in this scenario if the include is resolved? AFAIU as long as the header is spelled in the main file + it's matched with a symbol provider, we should say that the symbol reference is satisfied.

It doesn't matter at a high-level view. But since the implementation recognizes headers based on the HeaderID and it's only defined for resolved includes, if we were to have any unresolved include matches somehow (e.g. it has spelling "foo/bar.h", but is unresolved, and we do a match based on spelling because some IWYU pragma pointed at this header), we would hit the assertion around HeaderID always having value.

kadircet: > How does it matter in this scenario if the include is resolved? AFAIU as long as the header…

VitaNuoAuthorUnsubmitted

Done

Ah ok, makes sense.

VitaNuo: Ah ok, makes sense.

include_cleaner::walkUsed( include_cleaner::walkUsed(

kadircetUnsubmitted

Done

nit: Result.push_back(std::move(D)), as D has lots of strings in it, it might be expensive to copy it around. Even better if you construct the Diag in place via Diag &D = Result.emplace_back(), you can achieve this by moving all the logic that might bail out (e.g. replacement generation) to the top of the loop, and start forming the diagnostic only after we're sure that there'll be one.

kadircet: nit: `Result.push_back(std::move(D))`, as `D` has lots of strings in it, it might be expensive…

VitaNuoAuthorUnsubmitted

Done

Sure. Doing the in-place construction now.

VitaNuo: Sure. Doing the in-place construction now.

AST.getLocalTopLevelDecls(), /*MacroRefs=*/Macros, AST.getLocalTopLevelDecls(), /*MacroRefs=*/Macros,

AST.getPragmaIncludes(), SM, AST.getPragmaIncludes(), SM,

[&](const include_cleaner::SymbolReference &Ref, [&](const include_cleaner::SymbolReference &Ref,

llvm::ArrayRef<include_cleaner::Header> Providers) { llvm::ArrayRef<include_cleaner::Header> Providers) {

kadircetUnsubmitted

Done

we've got Config::Diagnostics::Includes::IgnoreHeader that disables include-cleaner analysis on headers that match a pattern. we should be respecting those config options here too.

kadircet: we've got `Config::Diagnostics::Includes::IgnoreHeader` that disables include-cleaner analysis…

VitaNuoAuthorUnsubmitted

Done

Ok, added that. Please have a look. At the moment it's trying to run filters on the output of the spellHeader method.

VitaNuo: Ok, added that. Please have a look. At the moment it's trying to run filters on the output of…

bool Satisfied = false;

for (const auto &H : Providers) { for (const auto &H : Providers) {

switch (H.kind()) { if (H.kind() == include_cleaner::Header::Physical &&

kadircetUnsubmitted

Done

you can directly use IncludeCleanerIncludes now, without the need for matching the header against clangd's Includes. e.g:

bool Satisfied = false;
for (auto &H : Providers) {
   if (H.kind() == Physical && H.physical() == MainFile) {
      Satisfied = true;
      continue;
   }
   for (auto &Inc : IncludeCleanerIncludes.match(H)) {
      Satisfied = true;
      auto HeaderID = Includes.getID(Inc.Resolved);
      assert(HeaderID.has_value() && "IncludeCleanerIncludes only contain resolved includes.");
      Used.insert(*HeaderID);
   }
}

this way you can also get rid of the logic that generates BySpelling mapping above.

kadircet: you can directly use `IncludeCleanerIncludes` now, without the need for matching the header…

VitaNuoAuthorUnsubmitted

Done

Oh this is cool. Didn't realize we can do straight from a resolved include to the ID. Thanks.

VitaNuo: Oh this is cool. Didn't realize we can do straight from a resolved include to the ID. Thanks.

case include_cleaner::Header::Physical: H.physical() == MainFile) {

if (auto HeaderID = Includes.getID(H.physical())) Satisfied = true;

Used.insert(*HeaderID);

break;

case include_cleaner::Header::Standard:

for (auto HeaderID : Includes.StdlibHeaders.lookup(H.standard()))

Used.insert(HeaderID);

break;

case include_cleaner::Header::Verbatim:

for (auto *Inc :

Includes.mainFileIncludesWithSpelling(H.verbatim())) {

if (!Inc->HeaderID.has_value())

continue; continue;

IncludeStructure::HeaderID ID =

static_cast<IncludeStructure::HeaderID>(*Inc->HeaderID);

Used.insert(ID);

} }

kadircetUnsubmitted

Done

i think the symbol name should also be part of the diagnostic. as editors can show these diagnostics without context (e.g. you've got a big file open, there's a diagnostic panel displaying messages/fixes for all of the file. you should be able to reason about the diagnostics and fixes without jumping all over the file). so maybe something like:

formatv("{0} providing '{1}' is not directly included", Header, Symbol)

kadircet: i think the symbol name should also be part of the diagnostic. as editors can show these…

VitaNuoAuthorUnsubmitted

Done

Sure. The new design does this, as well as skipping the header name.

VitaNuo: Sure. The new design does this, as well as skipping the header name.

kadircetUnsubmitted

Done

i feel like there's actually value in keeping the header name around, i.e. the user will have some idea about the action, without triggering an extra interaction. this helps especially in cases where the finding is wrong, they'll discover this sooner, hence we'll be more likely to receive bug reports. but don't really have a strong preference, so feel free to keep it that way.

kadircet: i feel like there's actually value in keeping the header name around, i.e. the user will have…

VitaNuoAuthorUnsubmitted

Done

Hm I'd expect we'll actually have a higher probability to receive a bug report if the user clicks on the "Quick fix" and gets a wrong header included, because that's annoying :)
Having a full message on hover, seeing that it's wrong and just ignoring it might not be annoying enough to file a bug ;-)

But this is a pure speculation ofc.
I think the point discussed on the design document makes sense: without mentioning the header name it will be a bit easier to extend this to suggesting multiple fixes. So if that's the general direction, I'd prefer to keep it the current way.

VitaNuo: Hm I'd expect we'll actually have a higher probability to receive a bug report if the user…

kadircetUnsubmitted

Done

I think the point discussed on the design document makes sense: without mentioning the header name it will be a bit easier to extend this to suggesting multiple fixes. So if that's the general direction, I'd prefer to keep it the current way.

Well, it's unclear when we'll get there, and moreover my suggestion was actually to still mention the header name when there's only a single provider (which will be the case most of the time).

But this is a pure speculation ofc.

But yeah, my point of view is also mostly speculation. So feel free to keep it this way, I'll just be grumpy :P

kadircet: > I think the point discussed on the design document makes sense: without mentioning the header…

break; for (auto *Inc : ConvertedIncludes.match(H)) {

Satisfied = true;

auto HeaderID = Includes.getID(Inc->Resolved);

assert(HeaderID.has_value() &&

"ConvertedIncludes only contains resolved includes.");

Used.insert(*HeaderID);

} }

if (Satisfied || Providers.empty() ||

kadircetUnsubmitted

Done

this is actually an expensive object to create (it scans the file for existing includes). so we should create this once for the whole file. i think it's better to generate whole set of diagnostics in this function directly, rather than creating one by one.

kadircet: this is actually an expensive object to create (it scans the file for existing includes). so we…

Ref.RT != include_cleaner::RefType::Explicit)

kadircetUnsubmitted

Done

you can directly use !Pragmas->isPrivate(Inc->Resolved) here, instead of getpublic

kadircet: you can directly use `!Pragmas->isPrivate(Inc->Resolved)` here, instead of getpublic

VitaNuoAuthorUnsubmitted

Done

Ok makes sense. No, I guess I was just confused, because I understood that you wanted a test that includes "private.h" with a diagnostic generated saying that "public.h" should be included instead, so I assumed that was expected behaviour. But that's not what you meant, so I misunderstood.

VitaNuo: Ok makes sense. No, I guess I was just confused, because I understood that you wanted a test…

kadircetUnsubmitted

Done

this check seems to be new. what's the reason for rejecting private providers? I can see that we might want to be conservative by not inserting private providers, but treating symbols as unsatisfied when a private provider is already included doesn't feel right. e.g. the code being analyzed might be allowed to depend on this private header, because it's also part of the library, or it's the public header that's exposing this private header. in such a scenario we shouldn't try to insert the public header again. is there a more concrete issue this code is trying to address?

kadircet: this check seems to be new. what's the reason for rejecting private providers? I can see that…

return;

kadircetUnsubmitted

Done

could you add a comment here, as this is subtle, something like We might suggest insertion of an existing include in edge cases, e.g. include is present in a PP-disabled region, or spelling of the header turns out to be the same as one of the unresolved includes in the main file

kadircet: could you add a comment here, as this is subtle, something like `We might suggest insertion of…

VitaNuoAuthorUnsubmitted

Done

Ok sure.

VitaNuo: Ok sure.

auto &Tokens = AST.getTokens();

kadircetUnsubmitted

Done

this returns nullopt only if Header is already included in the main file. our analysis should never suggest such a header for include, unless the include is coming from a PP-disabled region.

so i think if Replacement generation fails, we should drop the diagnostic completely rather than just dropping the fix. WDYT?

kadircet: this returns nullopt only if Header is already included in the main file. our analysis should…

VitaNuoAuthorUnsubmitted

Done

Ok, sure. What's a PP-disabled region? Are you talking about #ifdef's and such?

VitaNuo: Ok, sure. What's a PP-disabled region? Are you talking about #ifdef's and such?

kadircetUnsubmitted

Done

What's a PP-disabled region

Yes, I was trying to say "preprocessor disabled region". e.g. in a piece of code like:

#if 0
#include "foo.h"
#endif

preprocessor won't actually trigger inclusion of "foo.h", but most of the heuristic parsers (most importantly the logic in HeaderIncludes) will treat this include as usual.

kadircet: > What's a PP-disabled region Yes, I was trying to say "preprocessor disabled region". e.g. in…

VitaNuoAuthorUnsubmitted

Done

thanks.

VitaNuo: thanks.

auto SpelledForExpanded =

Tokens.spelledForExpanded(Tokens.expandedTokens(Ref.RefLocation));

if (!SpelledForExpanded)

kadircetUnsubmitted

Done

you can just pass an llvm::ArrayRef<Inclusion> to prevent a copy

kadircet: you can just pass an `llvm::ArrayRef<Inclusion>` to prevent a copy

VitaNuoAuthorUnsubmitted

Done

By preventing a copy, do you mean that the construction of llvm::ArrayRef<Inclusion> will only copy a pointer to the data rather than the whole vector? AFAIU const std::vector<Inclusion>& should be even better then, no copies involved. CMIIW.

VitaNuo: By preventing a copy, do you mean that the construction of `llvm::ArrayRef<Inclusion>` will…

kadircetUnsubmitted

Done

well from performance-wise they're pretty close, passing by const ref doesn't mean you don't do any copies, it'll still require address of the entity to be signalled somehow, which is a pointer copy. passing an arrayref implies copying a pointer to data and the size (so it's slightly worse in that regard). but it can represent any chunk of contiguous memory, the data source doesn't need to be a vector. moreover you can easily pass a slice of a vector rather than the whole vector etc.

the performance implications rarely matters in practice, and the abstraction it provides on the interfaces is usually a benefit, e.g. if we were to change underlying type from vector to llvm::SmallVector, the APIs would still work and you don't need to think about any concrete types. hence we tend to prefer having ArrayRef on API boundaries whenever possible.

kadircet: well from performance-wise they're pretty close, passing by const ref doesn't mean you don't do…

VitaNuoAuthorUnsubmitted

Done

Thank you for the great explanation!

VitaNuo: Thank you for the great explanation!

return;

auto Range = syntax::Token::range(SM, SpelledForExpanded->front(),

kadircetUnsubmitted

Done

you can re-write this as:

include_cleaner::Include TransformedInc;
TransformedInc.Spelled = Inc.Written.trim("\"<>");
TransformedInc.HashLocation = SM.getComposedLoc(SM.getMainFileID(), Inc.HashOffset); // we should actually convert this from a SourceLocation to offset in include_cleaner::Include as well
TransformedInc.Line = Inc.HashLine;
TransformedInc.Angled = WrittenRef.starts_with("<");
if(auto FE = SM.getFileManager().getFile(Inc.Resolved))
  TransformedInc.Resolved = *FE;
Includes.add(std::move(TransformedInc));

kadircet: you can re-write this as: ``` include_cleaner::Include TransformedInc; TransformedInc.Spelled =…

VitaNuoAuthorUnsubmitted

Done

Thanks.
It seems like std::string does not have a trim or a starts_with method, so AFAIU I still need to call the llvm::StringRef constructor.

VitaNuo: Thanks. It seems like `std::string` does not have a `trim` or a `starts_with` method, so AFAIU…

SpelledForExpanded->back());

MissingIncludeDiagInfo DiagInfo{Ref.Target, Range, Providers};

kadircetUnsubmitted

Done

sorry wasn't thinking this through in the initial set of comments. but i think we should be grouping findings per Symbol rather than per Header spelling, because:

for reasons i pointed elsewhere, we should be including symbol name in the diagnostic messages
again as pointed elsewhere, we should have diagnostics attached to all references not just the first one.
in the future we'll likely extend this to cover providing fixes for alternative providers. we shouldn't be emitting multiple diags for each header but rather have multiple fixes on the same diagnostic.

so probably a mapping like:

llvm::DenseMap<Symbol, pair</*Providers*/vector<Header>, /*reference locations*/vector<Range>>; // instead of a pair, a dedicated struct would look better

we also should delay spelling of the Header, as it's expensive and we might not be diagnosing missing includes.

later on we can consume the whole map at once and convert them into a set of clangd::Diag. WDYT?

kadircet: sorry wasn't thinking this through in the initial set of comments. but i think we should be…

VitaNuoAuthorUnsubmitted

Done

Yes, storing per Symbol totally makes sense. Let's discuss specifics in the corresponding document.

VitaNuo: Yes, storing per Symbol totally makes sense. Let's discuss specifics in the corresponding…

MissingIncludes.push_back(std::move(DiagInfo));

}); });

return getUnused(AST, Used, /*ReferencedPublicHeaders*/ {}); std::vector<const Inclusion *> UnusedIncludes =

getUnused(AST, Used, /*ReferencedPublicHeaders*/ {});

kadircetUnsubmitted

Done

s/computeUnusedIncludesDiagnostic/generateUnusedIncludeDiagnostic/

kadircet: s/computeUnusedIncludesDiagnostic/generateUnusedIncludeDiagnostic/

return {std::move(UnusedIncludes), std::move(MissingIncludes)};

} }

kadircetUnsubmitted

Done

any reason for storing tokens? you can store a range directly here as a clangd::Range, or a syntax::FileRange if you don't want to pay for offsetToPosition calls if we're not going to emit.

kadircet: any reason for storing tokens? you can store a range directly here as a `clangd::Range`, or a…

VitaNuoAuthorUnsubmitted

Done

any reason for storing tokens?

I was primarily avoiding clang::Range since it requires llvm::Code to build ranges, and didn't want computeIncludeCleanerFindings to depend on anything but the AST. But syntax::FileRange sounds good.

VitaNuo: > any reason for storing tokens? I was primarily avoiding `clang::Range` since it requires…

kadircetUnsubmitted

Done

this is not a multimap, so we're only retaining the range for a missing header on the first range that referenced a symbol provided by it. i think we should be collecting all the reference ranges instead. e.g. if i have a file:

#include <all>

void foo() {
  std::string x;
  std::string y;
}

there should be missing include diagnostics attached to both mentions of std::string not only the first (again in big files it might be impossible to see the first range the diagnostic is attached to and people have a tendency to only care about the parts of the code they've touched). does that make sense?

kadircet: this is not a multimap, so we're only retaining the range for a missing header on the first…

VitaNuoAuthorUnsubmitted

Done

Yes, this comment goes along the lines of the design discussion we are having at the moment.

again in big files it might be impossible to see the first range the diagnostic is attached to and people have a tendency to only care about the parts of the code they've touched

this is AFAIU in conflict with the suggestion that the diagnostic should only be attached to the first reference.

VitaNuo: Yes, this comment goes along the lines of the design discussion we are having at the moment.

kadircetUnsubmitted

Done

auto Range = syntax::Token::range(SM, SpelledForExpanded->front(), SpelledForExpanded->back());

kadircet: `auto Range = syntax::Token::range(SM, SpelledForExpanded->front(), SpelledForExpanded->back())…

std::vector<Diag> issueUnusedIncludesDiagnostics(ParsedAST &AST, std::vector<Diag> issueIncludeCleanerDiagnostics(ParsedAST &AST,

llvm::StringRef Code) { llvm::StringRef Code) {

kadircetUnsubmitted

Done

as mentioned elsewhere, i think we should delay this symbol name spelling to diagnostic generation. to make sure core analysis we perform don't do work that might not get re-used (e.g. if we're not going to diagnose missing includes, or in the future when we don't care about spelling of all the symbols)

kadircet: as mentioned elsewhere, i think we should delay this symbol name spelling to diagnostic…

VitaNuoAuthorUnsubmitted

Done

Ok should be done now.

VitaNuo: Ok should be done now.

const Config &Cfg = Config::current();

if (Cfg.Diagnostics.UnusedIncludes == Config::UnusedIncludesPolicy::None ||

Cfg.Diagnostics.SuppressAll ||

Cfg.Diagnostics.Suppress.contains("unused-includes"))

return {};

// Interaction is only polished for C/CPP. // Interaction is only polished for C/CPP.

if (AST.getLangOpts().ObjC) if (AST.getLangOpts().ObjC)

return {}; return {};

trace::Span Tracer("IncludeCleaner::issueUnusedIncludesDiagnostics");

kadircetUnsubmitted

Done

nit: while here you can replace all uses of FileName to AST.tuPath()

kadircet: nit: while here you can replace all uses of `FileName` to `AST.tuPath()`

kadircetUnsubmitted

Done

this also needs to be static or put into anon namespace

kadircet: this also needs to be static or put into anon namespace

VitaNuoAuthorUnsubmitted

Done

Ah didn't realize that you also left a comment here when replying to the identical comment on generateMissingIncludeDiagnostics. Should be done.

VitaNuo: Ah didn't realize that you also left a comment here when replying to the identical comment on…

std::vector<Diag> Result; trace::Span Tracer("IncludeCleaner::issueIncludeCleanerDiagnostics");

kadircetUnsubmitted

Done

no need to copy the vector by taking a std::vector here, you can take an llvm::ArrayRef instead.

kadircet: no need to copy the vector by taking a `std::vector` here, you can take an `llvm::ArrayRef`…

VitaNuoAuthorUnsubmitted

Done

Oh this isn't even my code, but as long as it's a small change, sure :)

VitaNuo: Oh this isn't even my code, but as long as it's a small change, sure :)

kadircetUnsubmitted

Done

well, this is our code in the end :D

kadircet: well, this is `our` code in the end :D

VitaNuoAuthorUnsubmitted

Done

Sorry, wrong wording. I meant to say that this is not the code that has been touched in this patch. It might sometimes get annoying when comments on the patch dig too deep into code that's not in the diff.

VitaNuo: Sorry, wrong wording. I meant to say that this is not the code that has been touched in this…

std::string FileName =

AST.getSourceManager() const Config &Cfg = Config::current();

.getFileEntryRefForID(AST.getSourceManager().getMainFileID()) IncludeCleanerFindings Findings;

kadircetUnsubmitted

Done

convention is definitely to use auto in place of such detailed type names. e.g. auto &Entry : MissingIncludes

kadircet: convention is definitely to use `auto` in place of such detailed type names. e.g. `auto &Entry…

VitaNuoAuthorUnsubmitted

Done

sure.

VitaNuo: sure.

->getName() if (Cfg.Diagnostics.MissingIncludes == Config::IncludesPolicy::Strict ||

kadircetUnsubmitted

Done

nit: auto Macros = ..

kadircet: nit: `auto Macros = ..`

VitaNuoAuthorUnsubmitted

Done

Same as above. I am not sure about the benefit of auto here..

VitaNuo: Same as above. I am not sure about the benefit of `auto` here..

kadircetUnsubmitted

Done

i think for now this should be

if (Cfg.Diagnostics.MissingIncludes == Config::IncludesPolicy::Strict ||
  Cfg.Diagnostics.UnusedIncludes == Config::IncludesPolicy::Experiment) {

otherwise we'll run both legacy and new analysis for UnusedIncludes == Strict

kadircet: i think for now this should be ``` if (Cfg.Diagnostics.MissingIncludes == Config…

.str(); Cfg.Diagnostics.UnusedIncludes == Config::IncludesPolicy::Experiment) {

const auto &UnusedIncludes = // will need include-cleaner results, call it once

kadircetUnsubmitted

Done

no need for copying to vector here, const auto& MainFileIncludes = ...

kadircet: no need for copying to vector here, `const auto& MainFileIncludes = ...`

Cfg.Diagnostics.UnusedIncludes == Config::UnusedIncludesPolicy::Experiment Findings = computeIncludeCleanerFindings(AST);

kadircetUnsubmitted

Done

can you restore std::move?

kadircet: can you restore `std::move`?

VitaNuoAuthorUnsubmitted

Done

I'd rather do the emplacing, so that it's the same as in the generateMissingIncludeDiagnostics.

VitaNuo: I'd rather do the emplacing, so that it's the same as in the…

? computeUnusedIncludesExperimental(AST) }

kadircetUnsubmitted

Done

s/AST.getSourceManager()/SM

kadircet: s/AST.getSourceManager()/SM

: computeUnusedIncludes(AST);

for (const auto *Inc : UnusedIncludes) { std::vector<Diag> Result = generateUnusedIncludeDiagnostics(

kadircetUnsubmitted

Done

nit: it might be worth re-writing the following section as:

std::vector<Diag> Result = generateUnusedIncludeDiagnostics(AST.tuPath(),
                Cfg.Diagnostics.UnusedIncludes == Strict ? computeUnusedIncludes(AST) : Findings.UnusedIncludes, Code);
llvm::move(generateMissingIncludeDiagnostics(AST, MissingIncludes, Code),
             std::back_inserter(Result));

and move the checks like if (Cfg.Diagnostics.MissingIncludes == Config::IncludesPolicy::Strict && !Cfg.Diagnostics.Suppress.contains("missing-includes")) into the specific function, e.g. generateUnusedIncludeDiagnostics, as they already do some of the diagnostic filtering logic.

kadircet: nit: it might be worth re-writing the following section as: ``` std::vector<Diag> Result =…

VitaNuoAuthorUnsubmitted

Done

nit: it might be worth re-writing the following section as

This code seems to ignore the option Config::IncludesPolicy::None. It's saying to either return the old-style clangd results in case of Strict or include-cleaner results otherwise (incl. in case of None). Am I missing something?

and move the checks like ...

Ok, moved into generateMissingIncludeDiagnostics.

VitaNuo: > nit: it might be worth re-writing the following section as This code seems to ignore the…

kadircetUnsubmitted

Done

This code seems to ignore the option Config::IncludesPolicy::None. It's saying to either return the old-style clangd results in case of Strict or include-cleaner results otherwise (incl. in case of None). Am I missing something?

Well that was to be addressed by second part of the comment And move the checks like if (Cfg.Diagnostics.MissingIncludes == Config::IncludesPolicy::Strict && !Cfg.Diagnostics.Suppress.contains("missing-includes")) into the specific function, e.g. generateUnusedIncludeDiagnostics, as they already do some of the diagnostic filtering logic.
I was talking about both missing and unused include diagnostics generation (hence e.g.), similar to the early exit in generateMissingIncludeDiagnostics, we should have one that returns an empty set of diagnostics, when it's suppressed or not enabled.

kadircet: > This code seems to ignore the option Config::IncludesPolicy::None. It's saying to either…

VitaNuoAuthorUnsubmitted

Done

Ok, I've refactored more of the config checking logic inside generate.. functions.

VitaNuo: Ok, I've refactored more of the config checking logic inside `generate..` functions.

kadircetUnsubmitted

Done

nit: I think logically it makes more sense for us to return set of Used includes here, and let the interaction that issues unused include diagnostics to derive this information from the set of used includes, and change the the missingincludes to a vector< tuple<Symbol, Ref, Providers> > (not only the unsatisfied ones) would represent the analysis better and make it more usable in the future (i.e. when we want to augment Hover responses, we can't re-use all the logic in here, we really need to implement another call to walkUsed because the analysis we get out of this call won't contain information for satisfied symbols.

no need to do it now though, we can perform that kind of refactoring as we're adding the features too (or maybe it'll actually look neater to just have another call in those features rather than try and re-use the logic here)

kadircet: nit: I think logically it makes more sense for us to return set of `Used` includes here, and…

VitaNuoAuthorUnsubmitted

Done

Thanks. This might very well be the case, but this comment also seems to suggest some premature optimization (in a way). It totally makes sense to re-use what's re-usable, but this sort of refactoring really only makes sense once we have a clear use case (and get there :)

VitaNuo: Thanks. This might very well be the case, but this comment also seems to suggest some premature…

Diag D; AST.tuPath(),

kadircetUnsubmitted

Done

you can use AST.tuPath()

kadircet: you can use `AST.tuPath()`

VitaNuoAuthorUnsubmitted

Done

Thanks, but this actually seems unused. It was some debugging artefact too.. :(

VitaNuo: Thanks, but this actually seems unused. It was some debugging artefact too.. :(

D.Message = Cfg.Diagnostics.UnusedIncludes == Config::IncludesPolicy::Strict

llvm::formatv("included header {0} is not used directly", ? computeUnusedIncludes(AST)

llvm::sys::path::filename( : Findings.UnusedIncludes,

Inc->Written.substr(1, Inc->Written.size() - 2), Code);

kadircetUnsubmitted

Done

can you introduce a trace::Span wrapping the call to walkUsed with name IncludeCleanerAnalysis so that we can collect some stats about latency here?

kadircet: can you introduce a `trace::Span` wrapping the call to `walkUsed` with name…

VitaNuoAuthorUnsubmitted

Done

Sure.

VitaNuo: Sure.

llvm::sys::path::Style::posix)); llvm::move(

kadircetUnsubmitted

Done

looks like debugging artifact?

kadircet: looks like debugging artifact?

VitaNuoAuthorUnsubmitted

Done

sorry.

VitaNuo: sorry.

D.Name = "unused-includes"; generateMissingIncludeDiagnostics(AST, Findings.MissingIncludes, Code),

kadircetUnsubmitted

Done

s/resolveSpelledHeader/spellHeader/

again make this static or put into anon namespace above

kadircet: s/resolveSpelledHeader/spellHeader/ again make this static or put into anon namespace above

D.Source = Diag::DiagSource::Clangd; std::back_inserter(Result));

D.File = FileName;

D.Severity = DiagnosticsEngine::Warning;

D.Tags.push_back(Unnecessary);

D.Range = getDiagnosticRange(Code, Inc->HashOffset);

// FIXME(kirillbobyrev): Removing inclusion might break the code if the

// used headers are only reachable transitively through this one. Suggest

// including them directly instead.

// FIXME(kirillbobyrev): Add fix suggestion for adding IWYU pragmas

// (keep/export) remove the warning once we support IWYU pragmas.

D.Fixes.emplace_back();

D.Fixes.back().Message = "remove #include directive";

D.Fixes.back().Edits.emplace_back();

D.Fixes.back().Edits.back().range.start.line = Inc->HashLine;

D.Fixes.back().Edits.back().range.end.line = Inc->HashLine + 1;

D.InsideMainFile = true;

Result.push_back(std::move(D));

}

return Result; return Result;

kadircetUnsubmitted

Done

nit: you can rewrite this as:

// Give URI schemes a chance to customize header spellings
if(Provider.kind() == Physical) {
   if(auto CanPath = getCanonicalPath(Provider.physical(), AST.getSourceManager())) {
         // URI::includeSpelling only fails when URI scheme is unknown. Since we're creating URI ourselves here, we can't get an unknown scheme.
         std::string Header = llvm::cantFail(URI::includeSpelling(URI::create(*CanonicalPath));
         if (!Header.empty())
             return Header;
   }
}
return spellHeader(...);

kadircet: nit: you can rewrite this as: ``` // Give URI schemes a chance to customize header spellings if…

VitaNuoAuthorUnsubmitted

Done

Ok, great. Didn't know it was Ok to test std::optional directly in an if clause.

VitaNuo: Ok, great. Didn't know it was Ok to test `std::optional` directly in an `if` clause.

} }

} // namespace clangd } // namespace clangd

} // namespace clang } // namespace clang

kadircetUnsubmitted

Done

we've got some tooling library to generate these edits, see https://github.com/llvm/llvm-project/blob/main/clang/include/clang/Tooling/Inclusions/HeaderIncludes.h#L76. that way they'll be placed in "correct" position among the existing includes. you can then use replacementToEdit to convert into a clangd edit.

kadircet: we've got some tooling library to generate these edits, see https://github.com/llvm/llvm…

VitaNuoAuthorUnsubmitted

Done

Thanks for the tips!

VitaNuo: Thanks for the tips!

kadircetUnsubmitted

Done

clangd has some header spelling customizations. so we should actually be doing this through URI::includeSpelling(URI::create(getCanonicalPath(Providers.front().physical(), SM))) first, and fall back to spellHeader if it fails for physical header providers to make sure we're consistent.

this is used by $EMPLOYER$'s integration to always spell includes relative to depot root, rather than certain include search paths.

kadircet: clangd has some header spelling customizations. so we should actually be doing this through…

VitaNuoAuthorUnsubmitted

Done

Thank you. There seem to be a couple of indirection levels involved, so I hope I got it (somewhat) right with all the checking.

VitaNuo: Thank you. There seem to be a couple of indirection levels involved, so I hope I got it…

kadircetUnsubmitted

Done

RefLocation is not necessarily spelled token location, e.g. it might be pointing at a macro expansion.

you can make use of TokenBuffer inside ParsedAST to go from this expanded location to a token spelled inside the main file, e.g. Tokens.spelledForExpanded(Tokens.expandedTokens(Ref.RefLocation)); and use the full spelled token range afterwards for diagnostic location.

kadircet: `RefLocation` is not necessarily spelled token location, e.g. it might be pointing at a macro…

VitaNuoAuthorUnsubmitted

Done

Ok, thank you. Please have a look at the current version, hopefully it makes sense.

VitaNuo: Ok, thank you. Please have a look at the current version, hopefully it makes sense.

kadircetUnsubmitted

Done

nit: you can just break after satisfying the include (same below)

kadircet: nit: you can just `break` after satisfying the include (same below)

VitaNuoAuthorUnsubmitted

Done

Not if the unused and missing include analyses are merged together.

VitaNuo: Not if the unused and missing include analyses are merged together.

kadircetUnsubmitted

Done

nit: you can check whether Ref.RT is Explicit at the top, and bail out early.

kadircet: nit: you can check whether `Ref.RT` is `Explicit` at the top, and bail out early.

VitaNuoAuthorUnsubmitted

Done

This seems obsolete after merging the missing and unused includes analyses together. There is no obvious place to insert the check.

VitaNuo: This seems obsolete after merging the missing and unused includes analyses together. There is…

kadircetUnsubmitted

Done

s/IncludeCleanerIncludes/ConvertedIncludes/ ?

kadircet: s/IncludeCleanerIncludes/ConvertedIncludes/ ?

kadircetUnsubmitted

Done

nit: drop either * or & (preferably &), having a reference vs a pointer doesn't make any differences performance wise, but creates a confusion (as we don't realy need a reference to a pointer here)

kadircet: nit: drop either `*` or `&` (preferably `&`), having a reference vs a pointer doesn't make any…

kadircetUnsubmitted

Done

nit: we prefer early exits to extra nesting, e.g. rewriting this as:

if (Satisfied || Providers.empty() || Ref.RT != Explicit)
  continue;
const auto &TB = AST.getTokens();
auto SpelledTokens = TB.spelledForExpanded(...);
if (!SpelledTokens)
  continue;
...

increases readability by:

reducing the nesting
making it more explicit about under what assumptions the rest of the code is working

kadircet: nit: we prefer `early exits` to extra `nesting`, e.g. rewriting this as: ``` if (Satisfied ||…

kadircetUnsubmitted

Done

nit: auto Range = syntax::Token::range(SM, SpelledForExpanded->front(), SpelledForExpanded->back());

kadircet: nit: `auto Range = syntax::Token::range(SM, SpelledForExpanded->front(), SpelledForExpanded…

kadircetUnsubmitted

Done

you don't need to explicitly copy Providers into ProviderHeaders, you can pass it directly to DiagInfo below.

kadircet: you don't need to explicitly copy `Providers` into `ProviderHeaders`, you can pass it directly…

kadircetUnsubmitted

Done

we use llvm casts, specifically llvm::dyn_cast<NamedDecl*>(&Ref.Target.declaration())->getQualifiedNameAsString()

kadircet: we use llvm casts, specifically `llvm::dyn_cast<NamedDecl*>(&Ref.Target.declaration())…

kadircetUnsubmitted

Done

getQualifiedNameAsString is going to print names that are really ugly at certain times, but unfortunately that's a problem we don't have a great solution to. so no action needed ATM, but we might want to switch between qualified and unqualified name depending on the length at the very least (e.g. symbol is coming from a templated class, which has a nasty nested instantiation).

kadircet: `getQualifiedNameAsString` is going to print names that are really ugly at certain times, but…

VitaNuoAuthorUnsubmitted

Done

Ok so for now no action, IIUC.

VitaNuo: Ok so for now no action, IIUC.

kadircetUnsubmitted

Done

nit: MissingIncludes.emplace_back(std::move(SymbolName), Range, Providers);

kadircet: nit: `MissingIncludes.emplace_back(std::move(SymbolName), Range, Providers);`

VitaNuoAuthorUnsubmitted

Done

This does not compile. Seems like it needs a certain type of constructor to be present in MissingIncludeDiagInfo, whereas atm it's just a struct.

VitaNuo: This does not compile. Seems like it needs a certain type of constructor to be present in…

kadircetUnsubmitted

Done

nit: you'd want std::moves here, around both of them

kadircet: nit: you'd want `std::move`s here, around both of them

kadircetUnsubmitted

Done

oops, i forgot to put the surrounding {} it should've been MissingIncludes.emplace_back({...});

kadircet: oops, i forgot to put the surrounding `{}` it should've been `MissingIncludes.emplace_back({...

VitaNuoAuthorUnsubmitted

Done

No this seems to be even more wrong.
stl_vector.h(1303, 2): Candidate template ignored: substitution failure: deduced incomplete pack <(no value)> for template parameter '_Args'.

This is for the version with braces.

And this is for no braces:

/usr/bin/../lib/gcc/x86_64-linux-gnu/12/../../../../include/c++/12/bits/new_allocator.h:175:23: error: no matching constructor for initialization of 'clang::clangd::MissingIncludeDiagInfo'
        { ::new((void *)__p) _Up(std::forward<_Args>(__args)...); }

It seems that it just doesn't cooperate with structs.

VitaNuo: No this seems to be even more wrong. `stl_vector.h(1303, 2): Candidate template ignored…

clang-tools-extra/clangd/ParsedAST.cpp

Show First 20 Lines • Show All 680 Lines • ▼ Show 20 Lines	// Finally, add diagnostics coming from the AST.
std::vector<Diag> D = ASTDiags.take(&*CTContext);		std::vector<Diag> D = ASTDiags.take(&*CTContext);
Diags->insert(Diags->end(), D.begin(), D.end());		Diags->insert(Diags->end(), D.begin(), D.end());
}		}
}		}
ParsedAST Result(Filename, Inputs.Version, std::move(Preamble),		ParsedAST Result(Filename, Inputs.Version, std::move(Preamble),
std::move(Clang), std::move(Action), std::move(Tokens),		std::move(Clang), std::move(Action), std::move(Tokens),
std::move(Macros), std::move(Marks), std::move(ParsedDecls),		std::move(Macros), std::move(Marks), std::move(ParsedDecls),
std::move(Diags), std::move(Includes),		std::move(Diags), std::move(Includes),
std::move(CanonIncludes));		std::move(CanonIncludes));
if (Result.Diags) {		if (Result.Diags)
		kadircetUnsubmitted Done Reply Inline Actions nit: `llvm::move(issueIncludeCleanerDiagnostics(...), std::back_inserter(Result.Diags))` kadircet:* nit: `llvm::move(issueIncludeCleanerDiagnostics(...), std::back_inserter(*Result.Diags))`
auto UnusedHeadersDiags =		llvm::move(issueIncludeCleanerDiagnostics(Result, Inputs.Contents),
		kadircetUnsubmitted Done Reply Inline Actions instead of introducing a new function here and duplicating logic what about something like: auto IncludeDiags = issueIncludeDiagnostics(...); in IncludeCleaner.cpp: struct IncludeCleanerFindings { std::vector<Inclusion> Unused; llvm::StringMap<std::vector<Range>> MissingIncludes; // Map from missing header spelling to ranges of references. }; IncludeCleanerFindings computeIncludeCleanerFindings(AST) { auto Incs = convertIncludes(); walkUsed(...., { // Update Missing includes, mark relevant main file includes as used. }); // Create the unused includes set return Findings; } std::vector<Diag> issueIncludeDiagnostics(...) { std::vector<Inclusion> UnusedIncludes; llvm::StringMap<std::vector</Location of reference/clangd::Range>> MissingIncludes; if (UnusedIncludePolicy is Strict) { UnusedIncludes = computeUnusedIncludes(...); } else if(UnusedIncludesPolicy is Experiment \|\| MissingIncludesPolicy is Strict) { auto Findings = computeIncludeCleanerFindings(AST); if(UnusedIncludesPolicy is Experiment) { UnusedIncludes = std::move(Findings.Unused); } if(MissingIncludesPolicy is Strict) { MissingIncludes = std::move(Findings.Missing); } } for(auto &Inc: UnusedIncludes) { ... // create unused include diag. } for(auto &Missing: MissingIncludes) { // create missing include diag. } return Diags; } kadircet: instead of introducing a new function here and duplicating logic what about something like: ```…
		VitaNuoAuthorUnsubmitted Done Reply Inline Actions All right, it's probably a good idea to turn these two analyses into one. The resulting code is somewhat tangled, though, and I am not sure how to make it simpler. Let's hope you have more ideas or are happy with the current state. VitaNuo: All right, it's probably a good idea to turn these two analyses into one. The resulting code is…
issueUnusedIncludesDiagnostics(Result, Inputs.Contents);		std::back_inserter(*Result.Diags));
Result.Diags->insert(Result.Diags->end(),
make_move_iterator(UnusedHeadersDiags.begin()),
make_move_iterator(UnusedHeadersDiags.end()));
}
return std::move(Result);		return std::move(Result);
}		}

ParsedAST::ParsedAST(ParsedAST &&Other) = default;		ParsedAST::ParsedAST(ParsedAST &&Other) = default;

ParsedAST &ParsedAST::operator=(ParsedAST &&Other) = default;		ParsedAST &ParsedAST::operator=(ParsedAST &&Other) = default;

ParsedAST::~ParsedAST() {		ParsedAST::~ParsedAST() {
▲ Show 20 Lines • Show All 113 Lines • Show Last 20 Lines

clang-tools-extra/clangd/Preamble.cpp

Show First 20 Lines • Show All 122 Lines • ▼ Show 20 Lines	public:
}		}

void BeforeExecute(CompilerInstance &CI) override {		void BeforeExecute(CompilerInstance &CI) override {
CanonIncludes.addSystemHeadersMapping(CI.getLangOpts());		CanonIncludes.addSystemHeadersMapping(CI.getLangOpts());
LangOpts = &CI.getLangOpts();		LangOpts = &CI.getLangOpts();
SourceMgr = &CI.getSourceManager();		SourceMgr = &CI.getSourceManager();
Includes.collect(CI);		Includes.collect(CI);
if (Config::current().Diagnostics.UnusedIncludes ==		if (Config::current().Diagnostics.UnusedIncludes ==
Config::UnusedIncludesPolicy::Experiment)		Config::IncludesPolicy::Experiment \|\|
		Config::current().Diagnostics.MissingIncludes ==
		Config::IncludesPolicy::Strict)
Pragmas.record(CI);		Pragmas.record(CI);
if (BeforeExecuteCallback)		if (BeforeExecuteCallback)
BeforeExecuteCallback(CI);		BeforeExecuteCallback(CI);
}		}

std::unique_ptr<PPCallbacks> createPPCallbacks() override {		std::unique_ptr<PPCallbacks> createPPCallbacks() override {
assert(SourceMgr && LangOpts &&		assert(SourceMgr && LangOpts &&
"SourceMgr and LangOpts must be set at this point");		"SourceMgr and LangOpts must be set at this point");
▲ Show 20 Lines • Show All 759 Lines • Show Last 20 Lines

clang-tools-extra/clangd/unittests/ConfigCompileTests.cpp

Show First 20 Lines • Show All 243 Lines • ▼ Show 20 Lines	for (const auto &Case : Cases) {
ASSERT_THAT(Diags.Diagnostics, IsEmpty());		ASSERT_THAT(Diags.Diagnostics, IsEmpty());
}		}
}		}

TEST_F(ConfigCompileTests, DiagnosticsIncludeCleaner) {		TEST_F(ConfigCompileTests, DiagnosticsIncludeCleaner) {
// Defaults to None.		// Defaults to None.
EXPECT_TRUE(compileAndApply());		EXPECT_TRUE(compileAndApply());
EXPECT_EQ(Conf.Diagnostics.UnusedIncludes,		EXPECT_EQ(Conf.Diagnostics.UnusedIncludes,
Config::UnusedIncludesPolicy::None);		Config::IncludesPolicy::None);

Frag = {};		Frag = {};
Frag.Diagnostics.UnusedIncludes.emplace("None");		Frag.Diagnostics.UnusedIncludes.emplace("None");
EXPECT_TRUE(compileAndApply());		EXPECT_TRUE(compileAndApply());
EXPECT_EQ(Conf.Diagnostics.UnusedIncludes,		EXPECT_EQ(Conf.Diagnostics.UnusedIncludes,
Config::UnusedIncludesPolicy::None);		Config::IncludesPolicy::None);

Frag = {};		Frag = {};
Frag.Diagnostics.UnusedIncludes.emplace("Strict");		Frag.Diagnostics.UnusedIncludes.emplace("Strict");
EXPECT_TRUE(compileAndApply());		EXPECT_TRUE(compileAndApply());
EXPECT_EQ(Conf.Diagnostics.UnusedIncludes,		EXPECT_EQ(Conf.Diagnostics.UnusedIncludes,
Config::UnusedIncludesPolicy::Strict);		Config::IncludesPolicy::Strict);

Frag = {};		Frag = {};
EXPECT_TRUE(Conf.Diagnostics.Includes.IgnoreHeader.empty())		EXPECT_TRUE(Conf.Diagnostics.Includes.IgnoreHeader.empty())
<< Conf.Diagnostics.Includes.IgnoreHeader.size();		<< Conf.Diagnostics.Includes.IgnoreHeader.size();
Frag.Diagnostics.Includes.IgnoreHeader.push_back(		Frag.Diagnostics.Includes.IgnoreHeader.push_back(
Located<std::string>("foo.h"));		Located<std::string>("foo.h"));
Frag.Diagnostics.Includes.IgnoreHeader.push_back(		Frag.Diagnostics.Includes.IgnoreHeader.push_back(
Located<std::string>(".*inc"));		Located<std::string>(".*inc"));
▲ Show 20 Lines • Show All 292 Lines • Show Last 20 Lines

clang-tools-extra/clangd/unittests/DiagnosticsTests.cpp

Show First 20 Lines • Show All 1,894 Lines • ▼ Show 20 Lines	TU.AdditionalFiles["ignore.h"] = R"cpp(
#pragma once		#pragma once
void ignore() {}		void ignore() {}
)cpp";		)cpp";
TU.AdditionalFiles["system/system_header.h"] = "";		TU.AdditionalFiles["system/system_header.h"] = "";
TU.ExtraArgs = {"-isystem" + testPath("system")};		TU.ExtraArgs = {"-isystem" + testPath("system")};
// Off by default.		// Off by default.
EXPECT_THAT(*TU.build().getDiagnostics(), IsEmpty());		EXPECT_THAT(*TU.build().getDiagnostics(), IsEmpty());
Config Cfg;		Config Cfg;
Cfg.Diagnostics.UnusedIncludes = Config::UnusedIncludesPolicy::Strict;		Cfg.Diagnostics.UnusedIncludes = Config::IncludesPolicy::Strict;
// Set filtering.		// Set filtering.
Cfg.Diagnostics.Includes.IgnoreHeader.emplace_back(		Cfg.Diagnostics.Includes.IgnoreHeader.emplace_back(
[](llvm::StringRef Header) { return Header.endswith("ignore.h"); });		[](llvm::StringRef Header) { return Header.endswith("ignore.h"); });
WithContextValue WithCfg(Config::Key, std::move(Cfg));		WithContextValue WithCfg(Config::Key, std::move(Cfg));
auto AST = TU.build();		auto AST = TU.build();
EXPECT_THAT(		EXPECT_THAT(
*AST.getDiagnostics(),		*AST.getDiagnostics(),
UnorderedElementsAre(AllOf(		UnorderedElementsAre(AllOf(
▲ Show 20 Lines • Show All 55 Lines • Show Last 20 Lines

clang-tools-extra/clangd/unittests/IncludeCleanerTests.cpp

//===--- IncludeCleanerTests.cpp --------------------------------- C++ --===//		//===--- IncludeCleanerTests.cpp --------------------------------- C++ --===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "Annotations.h"		#include "Annotations.h"
#include "Config.h"		#include "Config.h"
		#include "Diagnostics.h"
#include "IncludeCleaner.h"		#include "IncludeCleaner.h"
		#include "ParsedAST.h"
#include "SourceCode.h"		#include "SourceCode.h"
#include "TestFS.h"		#include "TestFS.h"
#include "TestTU.h"		#include "TestTU.h"
		#include "clang-include-cleaner/Analysis.h"
		#include "clang-include-cleaner/Types.h"
#include "support/Context.h"		#include "support/Context.h"
		#include "clang/AST/DeclBase.h"
		#include "clang/Basic/SourceManager.h"
		#include "clang/Tooling/Syntax/Tokens.h"
#include "llvm/ADT/ScopeExit.h"		#include "llvm/ADT/ScopeExit.h"
		#include "llvm/ADT/StringRef.h"
		#include "llvm/Support/Casting.h"
		#include "llvm/Support/Error.h"
		#include "llvm/Support/ScopedPrinter.h"
#include "llvm/Testing/Support/SupportHelpers.h"		#include "llvm/Testing/Support/SupportHelpers.h"
#include "gmock/gmock.h"		#include "gmock/gmock.h"
#include "gtest/gtest.h"		#include "gtest/gtest.h"
		#include <cstddef>
		#include <string>
		#include <vector>

namespace clang {		namespace clang {
namespace clangd {		namespace clangd {
namespace {		namespace {

		using ::testing::AllOf;
using ::testing::ElementsAre;		using ::testing::ElementsAre;
using ::testing::ElementsAreArray;		using ::testing::ElementsAreArray;
using ::testing::IsEmpty;		using ::testing::IsEmpty;
		using ::testing::Matcher;
using ::testing::Pointee;		using ::testing::Pointee;
using ::testing::UnorderedElementsAre;		using ::testing::UnorderedElementsAre;

		Matcher<const Diag &> withFix(::testing::Matcher<Fix> FixMatcher) {
		return Field(&Diag::Fixes, ElementsAre(FixMatcher));
		}

		MATCHER_P2(Diag, Range, Message,
		"Diag at " + llvm::to_string(Range) + " = [" + Message + "]") {
		return arg.Range == Range && arg.Message == Message;
		}

		MATCHER_P3(Fix, Range, Replacement, Message,
		"Fix " + llvm::to_string(Range) + " => " +
		::testing::PrintToString(Replacement) + " = [" + Message + "]") {
		return arg.Message == Message && arg.Edits.size() == 1 &&
		arg.Edits[0].range == Range && arg.Edits[0].newText == Replacement;
		}

std::string guard(llvm::StringRef Code) {		std::string guard(llvm::StringRef Code) {
return "#pragma once\n" + Code.str();		return "#pragma once\n" + Code.str();
}		}

TEST(IncludeCleaner, ReferencedLocations) {		TEST(IncludeCleaner, ReferencedLocations) {
struct TestCase {		struct TestCase {
std::string HeaderCode;		std::string HeaderCode;
std::string MainCode;		std::string MainCode;
▲ Show 20 Lines • Show All 298 Lines • ▼ Show 20 Lines	TU.AdditionalFiles["bits"] = R"cpp(
}		}
)cpp";		)cpp";
TU.AdditionalFiles["list"] = "#include <bits>";		TU.AdditionalFiles["list"] = "#include <bits>";
TU.AdditionalFiles["queue"] = "#include <bits>";		TU.AdditionalFiles["queue"] = "#include <bits>";
TU.ExtraArgs = {"-isystem", testRoot()};		TU.ExtraArgs = {"-isystem", testRoot()};
auto AST = TU.build();		auto AST = TU.build();
EXPECT_THAT(computeUnusedIncludes(AST),		EXPECT_THAT(computeUnusedIncludes(AST),
ElementsAre(Pointee(writtenInclusion("<queue>"))));		ElementsAre(Pointee(writtenInclusion("<queue>"))));
EXPECT_THAT(computeUnusedIncludesExperimental(AST),		IncludeCleanerFindings Findings = computeIncludeCleanerFindings(AST);
		EXPECT_THAT(Findings.UnusedIncludes,
ElementsAre(Pointee(writtenInclusion("<queue>"))));		ElementsAre(Pointee(writtenInclusion("<queue>"))));
}		}

TEST(IncludeCleaner, GetUnusedHeaders) {		TEST(IncludeCleaner, GetUnusedHeaders) {
llvm::StringLiteral MainFile = R"cpp(		llvm::StringLiteral MainFile = R"cpp(
#include "a.h"		#include "a.h"
#include "b.h"		#include "b.h"
#include "dir/c.h"		#include "dir/c.h"
Show All 20 Lines	TEST(IncludeCleaner, GetUnusedHeaders) {
TU.ExtraArgs.push_back("-I" + testPath("dir"));		TU.ExtraArgs.push_back("-I" + testPath("dir"));
TU.ExtraArgs.push_back("-isystem" + testPath("system"));		TU.ExtraArgs.push_back("-isystem" + testPath("system"));
TU.Code = MainFile.str();		TU.Code = MainFile.str();
ParsedAST AST = TU.build();		ParsedAST AST = TU.build();
EXPECT_THAT(		EXPECT_THAT(
computeUnusedIncludes(AST),		computeUnusedIncludes(AST),
UnorderedElementsAre(Pointee(writtenInclusion("\"unused.h\"")),		UnorderedElementsAre(Pointee(writtenInclusion("\"unused.h\"")),
Pointee(writtenInclusion("\"dir/unused.h\""))));		Pointee(writtenInclusion("\"dir/unused.h\""))));

		IncludeCleanerFindings Findings = computeIncludeCleanerFindings(AST);
EXPECT_THAT(		EXPECT_THAT(
computeUnusedIncludesExperimental(AST),		Findings.UnusedIncludes,
UnorderedElementsAre(Pointee(writtenInclusion("\"unused.h\"")),		UnorderedElementsAre(Pointee(writtenInclusion("\"unused.h\"")),
Pointee(writtenInclusion("\"dir/unused.h\""))));		Pointee(writtenInclusion("\"dir/unused.h\""))));
}		}

		TEST(IncludeCleaner, ComputeMissingHeaders) {
		Annotations MainFile(R"cpp(
		#include "a.h"

		void foo() {
		$b[[b]]();
		kadircetUnsubmitted Done Reply Inline Actions nit: instead of using a point, can you use a range here instead (i.e. `[[b]]`)? afterwards you can have a `FileRange` pointing at both offsets, rather than relying on the length of the identifier. kadircet: nit: instead of using a point, can you use a range here instead (i.e. `[[b]]`)? afterwards you…
		})cpp");
		TestTU TU;
		TU.Filename = "foo.cpp";
		TU.AdditionalFiles["a.h"] = guard("#include \"b.h\"");
		TU.AdditionalFiles["b.h"] = guard("void b();");

		TU.Code = MainFile.code();
		ParsedAST AST = TU.build();

		IncludeCleanerFindings Findings = computeIncludeCleanerFindings(AST);
		const SourceManager &SM = AST.getSourceManager();
		const NamedDecl *BDecl = nullptr;
		for (Decl *D : AST.getASTContext().getTranslationUnitDecl()->decls()) {
		const NamedDecl *CandidateDecl = llvm::dyn_cast<NamedDecl>(D);
		std::string Name = CandidateDecl->getQualifiedNameAsString();
		if (Name != "b")
		continue;
		kadircetUnsubmitted Done Reply Inline Actions nit: braces kadircet: nit: braces
		BDecl = CandidateDecl;
		}
		ASSERT_TRUE(BDecl);
		kadircetUnsubmitted Done Reply Inline Actions rest of the code here doesn't really belong to the for loop, can you take them out? kadircet: rest of the code here doesn't really belong to the for loop, can you take them out?
		include_cleaner::Symbol B{*BDecl};
		auto Range = MainFile.range("b");
		kadircetUnsubmitted Done Reply Inline Actions this is pointing at the declaration inside `b.h` not to the reference inside the main file. are you sure this test passes? kadircet: this is pointing at the declaration inside `b.h` not to the reference inside the main file. are…
		VitaNuoAuthorUnsubmitted Done Reply Inline Actions Yes, all the tests pass. `D` is a `Decl` from the main file, otherwise it wouldn't have passed the safeguard `if (!SM.isWrittenInMainFile(SM.getExpansionLoc(D->getLocation()))) continue;` above. VitaNuo: Yes, all the tests pass. `D` is a `Decl` from the main file, otherwise it wouldn't have passed…
		kadircetUnsubmitted Done Reply Inline Actions this is passing because `bool BDeclFound;` is uninitialized above, if you set it to `bool BDeclFound = false;` you should see the test fail. there's no declaration for `b` inside the main file, it's declared in `b.h` and referenced inside the main file. you still need to search for the decl (without the constraint of being written in main file), use it to build an include_cleaner::Symbol, and use a `clangd::Annotation` range for the range of the reference. it might be easer to write this as: const NamedDecl* B = nullptr; for (...) { ... B = D; } ASSERT_TRUE(B); // build expected diagnostic info based on B and check that it's equal to what we've produced kadircet: this is passing because `bool BDeclFound;` is uninitialized above, if you set it to `bool…
		VitaNuoAuthorUnsubmitted Done Reply Inline Actions Didn't know there was a difference between uninitialized and `false`.. Thanks for the idea with `ASSERT_TRUE(Decl)`. Please check out the new version. VitaNuo: Didn't know there was a difference between uninitialized and `false`.. Thanks for the idea…
		kadircetUnsubmitted Done Reply Inline Actions nit: size_t Start = llvm::cantFail(positionToOffset(MainFile.code(), Range.start)); size_t End = llvm::cantFail(positionToOffset(MainFile.code(), Range.end)); no need for `EXPECT_FALSE(..takeError())`s as `llvm::cantFail` will fail (no pun intended :P), `static_cast`s are also redundant kadircet: nit: ``` size_t Start = llvm::cantFail(positionToOffset(MainFile.code(), Range.start)); size_t…
		size_t Start = llvm::cantFail(positionToOffset(MainFile.code(), Range.start));
		kadircetUnsubmitted Done Reply Inline Actions i don't think there's much value in testing out analysis here, we should rather focus on diagnostics generation, which isn't part of `computeIncludeCleanerFindings`. existing tests were focused on analysis, because legacy implementation for include-cleaner was actually performing these analysis itself. so I'd rather suggest having trivial test cases (from include-cleaner analysis perspective, no need for complicated directory/file layouts) and rather test things out through calls to `generateMissingIncludeDiagnostics` to make sure diagnostics has the right ranges, text and fix contents. right now we're not testing: header spelling symbol name generation ranges these diagnostics correspond to and these are the main functionality we're adding on top of include-cleaner analysis. you can take a look at the tests in llvm/llvm-project/clang-tools-extra/clangd/unittests/DiagnosticsTests.cpp to see how we're testing out diagnostics ranges, messages, fixes and what kind of helpers/matchers we have for them. kadircet: i don't think there's much value in testing out analysis here, we should rather focus on…
		VitaNuoAuthorUnsubmitted Done Reply Inline Actions Thank you, this makes sense. However, I believe we need to use `issueIncludeCleanerDiagnostics`rather than `generateMissingIncludeDiagnostics`, since the latter is private. VitaNuo: Thank you, this makes sense. However, I believe we need to use…
		size_t End = llvm::cantFail(positionToOffset(MainFile.code(), Range.end));
		syntax::FileRange BRange{SM.getMainFileID(), static_cast<unsigned int>(Start),
		static_cast<unsigned int>(End)};
		include_cleaner::Header Header{*SM.getFileManager().getFile("b.h")};
		MissingIncludeDiagInfo BInfo{B, BRange, {Header}};
		EXPECT_THAT(Findings.MissingIncludes, ElementsAre(BInfo));
		}
		kadircetUnsubmitted Done Reply Inline Actions i think the example for `std::vector` is solid, and `IWYU pragma private` needs a little adjustment. kadircet: i think the example for `std::vector` is solid, and `IWYU pragma private` needs a little…

		TEST(IncludeCleaner, GenerateMissingHeaderDiags) {
		kadircetUnsubmitted Done Reply Inline Actions it'd be better to `ASSERT_TRUE(BDecl);` right after the `for loop`, as rest of the code will crash (and even trigger undefined behavior because we're dereferencing nullptr in failure case). difference between `ASSERT_X` and `EXPECT_X` macros are, the former will stop execution of the particular test (hence we'll never trigger a nullptr deref with ASSERT_TRUE), whereas the latter just prints the failure, but doesn't abort the execution of test (hence helps print multiple failures at once, when they're non fatal). kadircet: it'd be better to `ASSERT_TRUE(BDecl);` right after the `for loop`, as rest of the code will…
		Config Cfg;
		Cfg.Diagnostics.MissingIncludes = Config::IncludesPolicy::Strict;
		Cfg.Diagnostics.Includes.IgnoreHeader = {
		[](llvm::StringRef Header) { return Header.ends_with("buzz.h"); }};
		hokeinUnsubmitted Not Done Reply Inline Actions Looks like this filter doesn't work on windows (the `/` vs `\` path separator might be the root cause here), I think a fix can be change the check to `return Header.endsWith("buzz.h")` or `return Header == testPath("buzz.h", llvm::sys::path::Style::posix)`. hokein: Looks like this filter doesn't work on windows (the `/` vs `\` path separator might be the root…
		WithContextValue Ctx(Config::Key, std::move(Cfg));
		Annotations MainFile(R"cpp(
		#include "a.h"
		$insert_b[[]]#include "baz.h"
		#include "dir/c.h"
		$insert_d[[]]#include "fuzz.h"
		kadircetUnsubmitted Done Reply Inline Actions can you also add a reference (and declaration) for std::vector, and have an IWYU private pragma in one of the headers to test code paths that spell verbatim and standard headers? also having some diagnostic suppressed via `IgnoreHeaders` is important to check kadircet: can you also add a reference (and declaration) for std::vector, and have an IWYU private pragma…
		VitaNuoAuthorUnsubmitted Done Reply Inline Actions Thank you for the great tips on improving test coverage! In fact, I had to also introduce support for private pragmas, as they were not taken care of. Hopefully, the solution will make sense to you. VitaNuo: Thank you for the great tips on improving test coverage! In fact, I had to also introduce…
		#include "header.h"
		kadircetUnsubmitted Done Reply Inline Actions we should include private.h through some indirection (not public.h) to check `IWYU pragma private` spellings are respected. kadircet: we should include private.h through some indirection (not public.h) to check `IWYU pragma…
		$insert_foobar[[]]#include <e.h>
		$insert_f[[]]$insert_vector[[]]

		void foo() {
		$b[[b]]();

		kadircetUnsubmitted Done Reply Inline Actions name this range as `bar` instead of `d`? kadircet: name this range as `bar` instead of `d`?
		ns::$bar[[Bar]] bar;
		bar.d();
		$f[[f]]();

		kadircetUnsubmitted Done Reply Inline Actions could you add a comment here saying this shouldn't be diagnosed? kadircet: could you add a comment here saying this shouldn't be diagnosed?
		// this should not be diagnosed, because it's ignored in the config
		kadircetUnsubmitted Done Reply Inline Actions can you make one of these names qualified? e.g. `namespace ns { struct Bar { void f(); }; }` kadircet: can you make one of these names qualified? e.g. `namespace ns { struct Bar { void f(); }; }`
		buzz();

		$foobar[[foobar]]();

		std::$vector[[vector]] v;
		})cpp");

		TestTU TU;
		TU.Filename = "foo.cpp";
		TU.AdditionalFiles["a.h"] = guard("#include \"b.h\"");
		TU.AdditionalFiles["b.h"] = guard("void b();");

		TU.AdditionalFiles["dir/c.h"] = guard("#include \"d.h\"");
		TU.AdditionalFiles["dir/d.h"] =
		guard("namespace ns { struct Bar { void d(); }; }");

		TU.AdditionalFiles["system/e.h"] = guard("#include <f.h>");
		TU.AdditionalFiles["system/f.h"] = guard("void f();");
		TU.ExtraArgs.push_back("-isystem" + testPath("system"));

		TU.AdditionalFiles["fuzz.h"] = guard("#include \"buzz.h\"");
		TU.AdditionalFiles["buzz.h"] = guard("void buzz();");

		TU.AdditionalFiles["baz.h"] = guard("#include \"private.h\"");
		TU.AdditionalFiles["private.h"] = guard(R"cpp(
		// IWYU pragma: private, include "public.h"
		void foobar();
		)cpp");
		TU.AdditionalFiles["header.h"] = guard(R"cpp(
		namespace std { class vector {}; }
		)cpp");

		TU.Code = MainFile.code();
		ParsedAST AST = TU.build();

		std::vector<clangd::Diag> Diags =
		issueIncludeCleanerDiagnostics(AST, TU.Code);
		EXPECT_THAT(
		Diags,
		UnorderedElementsAre(
		AllOf(Diag(MainFile.range("b"),
		"No header providing \"b\" is directly included"),
		withFix(Fix(MainFile.range("insert_b"), "#include \"b.h\"\n",
		"#include \"b.h\""))),
		AllOf(Diag(MainFile.range("bar"),
		"No header providing \"ns::Bar\" is directly included"),
		withFix(Fix(MainFile.range("insert_d"),
		"#include \"dir/d.h\"\n", "#include \"dir/d.h\""))),
		AllOf(Diag(MainFile.range("f"),
		"No header providing \"f\" is directly included"),
		withFix(Fix(MainFile.range("insert_f"), "#include <f.h>\n",
		"#include <f.h>"))),
		AllOf(
		Diag(MainFile.range("foobar"),
		"No header providing \"foobar\" is directly included"),
		withFix(Fix(MainFile.range("insert_foobar"),
		"#include \"public.h\"\n", "#include \"public.h\""))),
		AllOf(
		Diag(MainFile.range("vector"),
		"No header providing \"std::vector\" is directly included"),
		withFix(Fix(MainFile.range("insert_vector"),
		"#include <vector>\n", "#include <vector>")))));
		}

TEST(IncludeCleaner, VirtualBuffers) {		TEST(IncludeCleaner, VirtualBuffers) {
TestTU TU;		TestTU TU;
TU.Code = R"cpp(		TU.Code = R"cpp(
#include "macros.h"		#include "macros.h"

using flags::FLAGS_FOO;		using flags::FLAGS_FOO;

// CLI will come from a define, __cplusplus is a built-in. In both cases, they		// CLI will come from a define, __cplusplus is a built-in. In both cases, they
▲ Show 20 Lines • Show All 137 Lines • ▼ Show 20 Lines	TEST(IncludeCleaner, IWYUPragmas) {
TU.AdditionalFiles["behind_keep.h"] = guard("");		TU.AdditionalFiles["behind_keep.h"] = guard("");
TU.AdditionalFiles["exported.h"] = guard("");		TU.AdditionalFiles["exported.h"] = guard("");
TU.AdditionalFiles["public.h"] = guard("#include \"private.h\"");		TU.AdditionalFiles["public.h"] = guard("#include \"private.h\"");
TU.AdditionalFiles["private.h"] = guard(R"cpp(		TU.AdditionalFiles["private.h"] = guard(R"cpp(
// IWYU pragma: private, include "public.h"		// IWYU pragma: private, include "public.h"
void foo() {}		void foo() {}
)cpp");		)cpp");
Config Cfg;		Config Cfg;
Cfg.Diagnostics.UnusedIncludes = Config::UnusedIncludesPolicy::Experiment;		Cfg.Diagnostics.UnusedIncludes = Config::IncludesPolicy::Experiment;
WithContextValue Ctx(Config::Key, std::move(Cfg));		WithContextValue Ctx(Config::Key, std::move(Cfg));
ParsedAST AST = TU.build();		ParsedAST AST = TU.build();

auto ReferencedFiles = findReferencedFiles(		auto ReferencedFiles = findReferencedFiles(
findReferencedLocations(AST), AST.getIncludeStructure(),		findReferencedLocations(AST), AST.getIncludeStructure(),
AST.getCanonicalIncludes(), AST.getSourceManager());		AST.getCanonicalIncludes(), AST.getSourceManager());
EXPECT_EQ(ReferencedFiles.SpelledUmbrellas.size(), 1u);		EXPECT_EQ(ReferencedFiles.SpelledUmbrellas.size(), 1u);
EXPECT_EQ(ReferencedFiles.SpelledUmbrellas.begin()->getKey(), "\"public.h\"");		EXPECT_EQ(ReferencedFiles.SpelledUmbrellas.begin()->getKey(), "\"public.h\"");
EXPECT_EQ(ReferencedFiles.User.size(), 2u);		EXPECT_EQ(ReferencedFiles.User.size(), 2u);
EXPECT_TRUE(		EXPECT_TRUE(
ReferencedFiles.User.contains(AST.getSourceManager().getMainFileID()));		ReferencedFiles.User.contains(AST.getSourceManager().getMainFileID()));
EXPECT_TRUE(		EXPECT_TRUE(
ReferencedFiles.User.contains(AST.getSourceManager().getMainFileID()));		ReferencedFiles.User.contains(AST.getSourceManager().getMainFileID()));
EXPECT_THAT(AST.getDiagnostics(), llvm::ValueIs(IsEmpty()));		EXPECT_THAT(AST.getDiagnostics(), llvm::ValueIs(IsEmpty()));
EXPECT_THAT(computeUnusedIncludes(AST), IsEmpty());		EXPECT_THAT(computeUnusedIncludes(AST), IsEmpty());
EXPECT_THAT(computeUnusedIncludesExperimental(AST), IsEmpty());		IncludeCleanerFindings Findings = computeIncludeCleanerFindings(AST);
		EXPECT_THAT(Findings.UnusedIncludes, IsEmpty());
}		}

TEST(IncludeCleaner, RecursiveInclusion) {		TEST(IncludeCleaner, RecursiveInclusion) {
TestTU TU;		TestTU TU;
TU.Code = R"cpp(		TU.Code = R"cpp(
#include "foo.h"		#include "foo.h"

void baz() {		void baz() {
Show All 12 Lines	TEST(IncludeCleaner, RecursiveInclusion) {
)cpp";		)cpp";
TU.AdditionalFiles["bar.h"] = guard(R"cpp(		TU.AdditionalFiles["bar.h"] = guard(R"cpp(
#include "foo.h"		#include "foo.h"
)cpp");		)cpp");
ParsedAST AST = TU.build();		ParsedAST AST = TU.build();

EXPECT_THAT(AST.getDiagnostics(), llvm::ValueIs(IsEmpty()));		EXPECT_THAT(AST.getDiagnostics(), llvm::ValueIs(IsEmpty()));
EXPECT_THAT(computeUnusedIncludes(AST), IsEmpty());		EXPECT_THAT(computeUnusedIncludes(AST), IsEmpty());
EXPECT_THAT(computeUnusedIncludesExperimental(AST), IsEmpty());		IncludeCleanerFindings Findings = computeIncludeCleanerFindings(AST);
		EXPECT_THAT(Findings.UnusedIncludes, IsEmpty());
}		}

TEST(IncludeCleaner, IWYUPragmaExport) {		TEST(IncludeCleaner, IWYUPragmaExport) {
TestTU TU;		TestTU TU;
TU.Code = R"cpp(		TU.Code = R"cpp(
#include "foo.h"		#include "foo.h"
)cpp";		)cpp";
TU.AdditionalFiles["foo.h"] = R"cpp(		TU.AdditionalFiles["foo.h"] = R"cpp(
#ifndef FOO_H		#ifndef FOO_H
#define FOO_H		#define FOO_H

#include "bar.h" // IWYU pragma: export		#include "bar.h" // IWYU pragma: export

#endif		#endif
)cpp";		)cpp";
TU.AdditionalFiles["bar.h"] = guard(R"cpp(		TU.AdditionalFiles["bar.h"] = guard(R"cpp(
void bar() {}		void bar() {}
)cpp");		)cpp");
ParsedAST AST = TU.build();		ParsedAST AST = TU.build();

EXPECT_THAT(AST.getDiagnostics(), llvm::ValueIs(IsEmpty()));		EXPECT_THAT(AST.getDiagnostics(), llvm::ValueIs(IsEmpty()));
// FIXME: This is not correct: foo.h is unused but is not diagnosed as such		// FIXME: This is not correct: foo.h is unused but is not diagnosed as such
// because we ignore headers with IWYU export pragmas for now.		// because we ignore headers with IWYU export pragmas for now.
EXPECT_THAT(computeUnusedIncludes(AST), IsEmpty());		EXPECT_THAT(computeUnusedIncludes(AST), IsEmpty());
EXPECT_THAT(computeUnusedIncludesExperimental(AST), IsEmpty());		IncludeCleanerFindings Findings = computeIncludeCleanerFindings(AST);
		EXPECT_THAT(Findings.UnusedIncludes, IsEmpty());
}		}

TEST(IncludeCleaner, NoDiagsForObjC) {		TEST(IncludeCleaner, NoDiagsForObjC) {
TestTU TU;		TestTU TU;
TU.Code = R"cpp(		TU.Code = R"cpp(
#include "foo.h"		#include "foo.h"

void bar() {}		void bar() {}
)cpp";		)cpp";
TU.AdditionalFiles["foo.h"] = R"cpp(		TU.AdditionalFiles["foo.h"] = R"cpp(
#ifndef FOO_H		#ifndef FOO_H
#define FOO_H		#define FOO_H

#endif		#endif
)cpp";		)cpp";
TU.ExtraArgs.emplace_back("-xobjective-c");		TU.ExtraArgs.emplace_back("-xobjective-c");

Config Cfg;		Config Cfg;
Cfg.Diagnostics.UnusedIncludes = Config::UnusedIncludesPolicy::Strict;
		Cfg.Diagnostics.UnusedIncludes = Config::IncludesPolicy::Strict;
		Cfg.Diagnostics.MissingIncludes = Config::IncludesPolicy::Strict;
WithContextValue Ctx(Config::Key, std::move(Cfg));		WithContextValue Ctx(Config::Key, std::move(Cfg));
ParsedAST AST = TU.build();		ParsedAST AST = TU.build();
EXPECT_THAT(AST.getDiagnostics(), llvm::ValueIs(IsEmpty()));		EXPECT_THAT(AST.getDiagnostics(), llvm::ValueIs(IsEmpty()));
}		}
} // namespace		} // namespace
} // namespace clangd		} // namespace clangd
} // namespace clang		} // namespace clang

clang-tools-extra/clangd/unittests/PreambleTests.cpp

Show First 20 Lines • Show All 659 Lines • ▼ Show 20 Lines	[[x]];/* error-ok */)");
EXPECT_THAT(*AST->getDiagnostics(),		EXPECT_THAT(*AST->getDiagnostics(),
ElementsAre(Diag(NewCode.range(), "missing_type_specifier")));		ElementsAre(Diag(NewCode.range(), "missing_type_specifier")));
}		}
}		}

TEST(PreamblePatch, DiagnosticsToPreamble) {		TEST(PreamblePatch, DiagnosticsToPreamble) {
Config Cfg;		Config Cfg;
Cfg.Diagnostics.AllowStalePreamble = true;		Cfg.Diagnostics.AllowStalePreamble = true;
Cfg.Diagnostics.UnusedIncludes = Config::UnusedIncludesPolicy::Strict;		Cfg.Diagnostics.UnusedIncludes = Config::IncludesPolicy::Strict;
WithContextValue WithCfg(Config::Key, std::move(Cfg));		WithContextValue WithCfg(Config::Key, std::move(Cfg));

llvm::StringMap<std::string> AdditionalFiles;		llvm::StringMap<std::string> AdditionalFiles;
AdditionalFiles["foo.h"] = "#pragma once";		AdditionalFiles["foo.h"] = "#pragma once";
AdditionalFiles["bar.h"] = "#pragma once";		AdditionalFiles["bar.h"] = "#pragma once";
{		{
Annotations Code(R"(		Annotations Code(R"(
// Test comment		// Test comment
▲ Show 20 Lines • Show All 156 Lines • Show Last 20 Lines

clang-tools-extra/include-cleaner/include/clang-include-cleaner/Analysis.h

Show First 20 Lines • Show All 67 Lines • ▼ Show 20 Lines	AnalysisResults analyze(llvm::ArrayRef<Decl *> ASTRoots,
const SourceManager &SM, HeaderSearch &HS);		const SourceManager &SM, HeaderSearch &HS);

/// Removes unused includes and inserts missing ones in the main file.		/// Removes unused includes and inserts missing ones in the main file.
/// Returns the modified main-file code.		/// Returns the modified main-file code.
/// The FormatStyle must be C++ or ObjC (to support include ordering).		/// The FormatStyle must be C++ or ObjC (to support include ordering).
std::string fixIncludes(const AnalysisResults &Results, llvm::StringRef Code,		std::string fixIncludes(const AnalysisResults &Results, llvm::StringRef Code,
const format::FormatStyle &IncludeStyle);		const format::FormatStyle &IncludeStyle);

		std::string spellHeader(const Header &H, HeaderSearch &HS,
		const FileEntry *Main);
} // namespace include_cleaner		} // namespace include_cleaner
} // namespace clang		} // namespace clang

#endif		#endif

clang-tools-extra/include-cleaner/lib/Analysis.cpp

Show First 20 Lines • Show All 43 Lines • ▼ Show 20 Lines	void walkUsed(llvm::ArrayRef<Decl *> ASTRoots,
for (const SymbolReference &MacroRef : MacroRefs) {		for (const SymbolReference &MacroRef : MacroRefs) {
assert(MacroRef.Target.kind() == Symbol::Macro);		assert(MacroRef.Target.kind() == Symbol::Macro);
if (!SM.isWrittenInMainFile(SM.getSpellingLoc(MacroRef.RefLocation)))		if (!SM.isWrittenInMainFile(SM.getSpellingLoc(MacroRef.RefLocation)))
continue;		continue;
CB(MacroRef, headersForSymbol(MacroRef.Target, SM, PI));		CB(MacroRef, headersForSymbol(MacroRef.Target, SM, PI));
}		}
}		}

static std::string spellHeader(const Header &H, HeaderSearch &HS,		std::string spellHeader(const Header &H, HeaderSearch &HS,
const FileEntry *Main) {		const FileEntry *Main) {
switch (H.kind()) {		switch (H.kind()) {
case Header::Physical: {		case Header::Physical: {
bool IsSystem = false;		bool IsSystem = false;
std::string Path = HS.suggestPathToFileForDiagnostics(		std::string Path = HS.suggestPathToFileForDiagnostics(
H.physical(), Main->tryGetRealPathName(), &IsSystem);		H.physical(), Main->tryGetRealPathName(), &IsSystem);
return IsSystem ? "<" + Path + ">" : "\"" + Path + "\"";		return IsSystem ? "<" + Path + ">" : "\"" + Path + "\"";
}		}
case Header::Standard:		case Header::Standard:
▲ Show 20 Lines • Show All 56 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[clangd] Add support for missing includes analysis.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 503335

clang-tools-extra/clangd/Config.h

clang-tools-extra/clangd/ConfigCompile.cpp

clang-tools-extra/clangd/ConfigFragment.h

clang-tools-extra/clangd/ConfigYAML.cpp

clang-tools-extra/clangd/IncludeCleaner.h

clang-tools-extra/clangd/IncludeCleaner.cpp

clang-tools-extra/clangd/ParsedAST.cpp

clang-tools-extra/clangd/Preamble.cpp

clang-tools-extra/clangd/unittests/ConfigCompileTests.cpp

clang-tools-extra/clangd/unittests/DiagnosticsTests.cpp

clang-tools-extra/clangd/unittests/IncludeCleanerTests.cpp

clang-tools-extra/clangd/unittests/PreambleTests.cpp

clang-tools-extra/include-cleaner/include/clang-include-cleaner/Analysis.h

clang-tools-extra/include-cleaner/lib/Analysis.cpp

[clangd] Add support for missing includes analysis.
ClosedPublic