This is an archive of the discontinued LLVM Phabricator instance.

Differential D102669

[analyzer][ctu] Fix wrong 'multiple definitions' errors caused by space characters in lookup names when parsing the ctu index file
ClosedPublic

Authored by OikawaKirie on May 17 2021, 11:22 PM.

Details

Reviewers

gamesh411
martong
balazske
steakhal
a.sidorin
shafik
xazax.hun
teemperor
keith

Commits

rG9f90254286dc: [analyzer][ctu] Fix wrong 'multiple definitions' errors caused by space…
rG333d66b09494: [analyzer][ctu] Fix wrong 'multiple definitions' errors caused by space…

Summary

This error was found when analyzing MySQL with CTU enabled.

When a lookup name contains space characters, the current strategy of splitting each index line at the first space parses the file path incorrectly.
And when two lookup names share the same prefix before their first space character, a 'multiple definitions' error is wrongly reported.

e.g. The lookup names for the two lambda exprs in the test case are c:@S@G@F@G#@Sa@F@operator int (*)(char)#1 and c:@S@G@F@G#@Sa@F@operator bool (*)(char)#1 respectively. Their prefixes are both c:@S@G@F@G#@Sa@F@operator when the first space character is used as the delimiter.

This patch solves the problem by prefixing the lookup name with its length, so that index items have the format <USR-Length>:<USR> <File-Path>.
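
The collision described above can be reproduced with a small standalone model. This is only a sketch, not the clang implementation: `splitAtFirstSpace` and `addToIndex` are hypothetical helpers, and the file name used in the usage note is made up for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <string>
#include <utility>

// Simplified model of the old parsing rule: everything before the first
// space is the lookup name, the remainder is the file path.
inline std::pair<std::string, std::string>
splitAtFirstSpace(const std::string &Line) {
  const std::size_t Pos = Line.find(' ');
  assert(Pos != std::string::npos && "index line must contain a space");
  return std::make_pair(Line.substr(0, Pos), Line.substr(Pos + 1));
}

// Returns false if the lookup name is already present in the index, which
// is the situation that would surface as a 'multiple definitions' error.
inline bool addToIndex(std::map<std::string, std::string> &Index,
                       const std::string &Line) {
  std::pair<std::string, std::string> Entry = splitAtFirstSpace(Line);
  return Index.emplace(std::move(Entry.first), std::move(Entry.second)).second;
}
```

Feeding it the two lambda lookup names from the summary, each followed by a hypothetical path such as importee.cpp.ast, makes the second `addToIndex` call return false: both lines are keyed by c:@S@G@F@G#@Sa@F@operator, and the rest of the lookup name ends up in the path.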

In the test case of this patch, we found that analyzing the source file with clang -cc1 and CTU using the on-demand-parsing strategy triggers a "triple mismatch" warning on Darwin systems. This problem is also encountered in D75665, which is the patch introducing the on-demand-parsing strategy.
We temporarily bypass this problem by using the loading-ast-file strategy.

Refer to the discourse topic for more details.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

OikawaKirie created this revision. · May 17 2021, 11:22 PM

Herald added subscribers: steakhal, ASDenysPetrov, dkrupp and 9 others. · May 17 2021, 11:22 PM

OikawaKirie requested review of this revision. · May 17 2021, 11:22 PM

Herald added a subscriber: cfe-commits. · May 17 2021, 11:22 PM

OikawaKirie mentioned this in D102159: [index][analyzer][ctu] Eliminate white spaces in the CTU lookup name. · May 17 2021, 11:30 PM

Harbormaster completed remote builds in B104941: Diff 346050. · May 18 2021, 12:36 AM

I don't really like having multiple files with the same name.
And the importer TU should be simple enough to be cat-ed into a temporary file.
At that point, you could put the importee's content into this file. It would result in a single, self-contained test case.

I'm not really familiar with the extdefmap part, but I'm surprised that we are using spaces as separators.
Shouldn't we consider using a different character?

clang/test/Analysis/ctu-lookup-name-with-space.cpp
8	Probably splitting this up into multiple lines would result in a more readable solution. Something along these lines should work:
	cat >%t/compile_commands.json <<EOL
	line 1
	line 2
	...
	EOL
12–24	Why do you need two separate invocations? Couldn't you just merge these? I've seen cases where `-verify` was used in conjunction with `FileCheck`.

In D102669#2765233, @steakhal wrote:

I'm not really familiar with the extdefmap part, but I'm surprised that we are using spaces as separators.
Shouldn't we consider using a different character?

I prefer the idea of changing the delimiter character, but it may lead to modifying a lot of test cases.
I think we'd better make this change in another revision in the future if we do want to change it.

clang/test/Analysis/ctu-lookup-name-with-space.cpp
8	The suggestion is great; however, I cannot find a way to write the `RUN` commands that way. Could you please tell me how to write them? It would also help me merge the test case into one file.
12–24	I forgot the `--allow-empty` argument when writing this test case. I will merge them in an update.

First of all, thank you for the patch!

We had a meeting with my colleagues (@steakhal, @gamesh411) and we agreed on the following. The issue you are trying to solve here is indeed a serious problem, but we'd like to suggest an alternative and perhaps more durable solution. In CrossTranslationUnitContext::getLookupName(const NamedDecl *ND) it would be possible to extend the returned string with a prefix that encodes the length of the USR string.
So, instead of

c:@F@g#I# ctu-other.cpp.ast

we'd get

9:c:@F@g#I# ctu-other.cpp.ast

This way, we could handle even file names with spaces in them.
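
As a rough illustration of the proposed format, the writer side only has to prepend the USR length and a colon. The helper below is a hypothetical sketch using the standard library; `makeIndexLine` is an assumed name and not a function from clang.

```cpp
#include <cassert>
#include <string>

// Formats one index entry as "<USR-Length>:<USR> <File-Path>".
// Hypothetical helper for illustration only.
inline std::string makeIndexLine(const std::string &USR,
                                 const std::string &FilePath) {
  assert(!USR.empty() && "a lookup name is expected to be non-empty");
  std::string Line = std::to_string(USR.size());
  Line += ':';
  Line += USR;
  Line += ' ';
  Line += FilePath;
  return Line;
}
```

Called with c:@F@g#I# and ctu-other.cpp.ast, it yields the line 9:c:@F@g#I# ctu-other.cpp.ast from the example above. Because the reader knows how many characters belong to the USR, spaces in either the USR or the path no longer matter.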

There are quite a few places where the extdef mappings should be updated.
To discover them, I suggest asserting the new file format (only for detecting them!). This way, if you miss one, it won't silently 'work' somehow, but will catch your attention.

There are a few references to the format of this mapping in the clang/docs/analyzer/user-docs/CrossTranslationUnit.rst and probably in other files.
Those should be updated to match the new format.

I'm sorry for burdening you with all of this, but I think this is the way to make this parsing more robust. I really appreciate your work.

It has been a long time since the last discussion; I hope you still remember this bug. I apologize for the delay.

Updated as requested: the lookup name generator CrossTranslationUnitContext::getLookupName and the parser parseCrossTUIndex are modified.
Assertions are also added before accessing the CTU index mapping to check the lookup names being searched.

Corresponding test cases and documents are also updated.

Please let me know if there are other files to be updated.

Herald added a subscriber: manas. · Nov 30 2021, 3:33 AM

Harbormaster completed remote builds in B136654: Diff 390653. · Nov 30 2021, 4:03 AM

Looks good.
Please get rid of the macro stuff, consider something along the lines I proposed for the parsing stuff.
Also clang-format the code you touch.
I haven't checked the docs and the comments of the codebase, but I'll assume you grepped and fixed all occurrences.
I look forward to this, thank you for working on this @OikawaKirie.

clang/docs/analyzer/user-docs/CrossTranslationUnit.rst
84
clang/lib/CrossTU/CrossTranslationUnit.cpp
154–172
175–180	Please do something about this macro. Encapsulate the logic in some other way.
460
clang/test/Analysis/func-mapping-test.cpp
50–62	I think you could add your lambda stuff to this file. This is really the place for testing this. The test you created actually demonstrating the CTU issue is also valuable IMO, so you can leave it, but have a copy here.

msteinberg added a subscriber: msteinberg. · Nov 30 2021, 6:02 AM

msteinberg removed a subscriber: msteinberg. · Dec 1 2021, 7:11 AM

msteinberg added a subscriber: msteinberg. · Dec 1 2021, 7:11 AM

Fix formatting bugs
Update lookup name format as <USR-Length>:<USR> <File-Path> in all comments and documents
Add the new test case as a part of clang/test/Analysis/func-mapping-test.cpp to verify the lookup name
Change the macros for asserting the lookup name format to an NDEBUG wrapped function

clang/lib/CrossTU/CrossTranslationUnit.cpp
154–172	The source of lookup name of the function being imported is function `CrossTranslationUnitContext::getLookupName`. Keeping the length in the mapping can avoid parsing the lookup name during importing.

Harbormaster completed remote builds in B137595: Diff 391968. · Dec 5 2021, 11:11 PM

steakhal added inline comments. · Dec 6 2021, 12:54 AM

clang/lib/CrossTU/CrossTranslationUnit.cpp
154–172	Okay; you can copy the original StringRef to have that. But consuming it on one path makes the code much more readable.
183	charactor -> character
184–188	You should probably use more descriptive names; I wouldn't know what this does if I hadn't reviewed this patch.
450	The assertion speaks for itself. It rarely needs additional documentation.
clang/test/Analysis/Inputs/ctu-lookup-name-with-space.cpp
8–9	I would rather put these into the `importee()`
14	Why do you need to have a div by zero warning?

OikawaKirie added inline comments. · Dec 6 2021, 2:47 AM

clang/lib/CrossTU/CrossTranslationUnit.cpp
154–172	The `getAsInteger` call can also check whether the content before the first colon is an integer. Therefore, a sub-string operation is required here.
clang/test/Analysis/Inputs/ctu-lookup-name-with-space.cpp
8–9	The lambda exprs will not be included in the CTU index file if they are declared in a normal function.
14	I am not sure whether we should test if an external function can be correctly imported. Hence, I wrote a div-by-zero bug here. A call to function `clang_analyzer_warnIfReached` is also OK here. As the imported lambda expr will not be called, I think I can only test whether CTU works via another normal function.

Please mark comments 'done' if they are done.

clang/lib/CrossTU/CrossTranslationUnit.cpp
154–172	I don't doubt that your proposed way of doing this works and is efficient. What I say is that I think there is room for improvement in the non-functional aspects, in the readability. However, it's not really a blocking issue, more of a personal taste.
clang/test/Analysis/Inputs/ctu-lookup-name-with-space.cpp
8–9	I see.
14	AFAIK importing a function and import-related stuff are orthogonal to actually emitting bug reports produced by the analyzer. That being said, if the `importee()` would have an empty body, the observable behavior would remain the same. And this is what I propose now.

OikawaKirie added inline comments. · Dec 6 2021, 5:09 AM

clang/lib/CrossTU/CrossTranslationUnit.cpp
154–172	I see what you have in mind: consuming the length first and then the USR is clearer and more readable. However, to correctly separate the USR and the file path, the length of the `USR-Length` field itself is also required, which makes it impossible to just consume the length at the beginning. Another way of solving this problem is to re-create the string with the USR-Length and the USR after parsing, but I do not think that is a good solution. BTW, is it necessary to assert the `USR-Length` to be greater than zero? I think it is more reasonable to report an invalid format rather than assert the value, as it can be provided by the user.
clang/test/Analysis/Inputs/ctu-lookup-name-with-space.cpp
14	Sorry, but I am not quite clear about your suggestions on this function.

steakhal added inline comments. · Dec 6 2021, 5:57 AM

clang/lib/CrossTU/CrossTranslationUnit.cpp
154–172	I think what causes the misunderstanding is the definition of consume in the context of `StringRef`.
	const StringRef Copy = Line;
	Line.consumeInteger(...);
	// Line advances forward by the number of characters that were parsed as an integral value.
	// Copy still refers to the unmodified, original characters.
	// I can use it however I want.
	// `Line` is a suffix of `Copy`, and the `.end()` should be the same, only `.begin()` should differ.
	I hope that is what caused the miscommunication.
	"BTW, is it necessary to assert the USR-Length to be greater than zero? I think it is more reasonable to report invalid format rather than assert the value, as it can be provided by the user."
	Yeah, sure!
clang/test/Analysis/Inputs/ctu-lookup-name-with-space.cpp
13–15	Also fixup the return type in the declaration within the main TU. Also add the `// expected-no-diagnostics` comment to the primary TU.
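
The consume semantics discussed for lines 154–172 can be imitated with `std::string_view` from C++17. The sketch below is an analogy only: `consumeUnsigned` is a hypothetical helper, it returns true on success (a convention chosen for this sketch), and it does not guard against overflow.

```cpp
#include <cassert>
#include <cctype>
#include <cstddef>
#include <string_view>

// Parses a leading decimal number from Line and advances Line past it.
// Returns false and leaves Line untouched if Line does not start with a
// digit. No overflow checking is done in this sketch.
inline bool consumeUnsigned(std::string_view &Line, std::size_t &Value) {
  std::size_t Digits = 0, Result = 0;
  while (Digits < Line.size() &&
         std::isdigit(static_cast<unsigned char>(Line[Digits]))) {
    Result = Result * 10 + static_cast<std::size_t>(Line[Digits] - '0');
    ++Digits;
  }
  if (Digits == 0)
    return false;
  assert(Digits <= Line.size());
  Line.remove_prefix(Digits); // only the start moves; the end stays put
  Value = Result;
  return true;
}
```

If a copy of the view is taken before the call, the copy still covers the whole original line afterwards, and the difference between the two sizes is the number of digits consumed. That difference is the width of the length field that was said to be missing when only the advanced view is kept.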

OikawaKirie added inline comments. · Dec 9 2021, 12:51 AM

clang/lib/CrossTU/CrossTranslationUnit.cpp
154–172	I think I have figured out what has been misunderstood. In the current patch, I just modify function `CrossTranslationUnitContext::getLookupName` by adding a length at the beginning. Therefore, the lookup name for the CTU query will have the length part. And for the sake of simplicity and efficiency, the length together with the USR is stored in the mapping as the key. To correctly parse the `<USR-Length>:<USR>` part, I cannot just consume the `<USR-Length>` at the beginning. Otherwise, I would not know the length of the `<USR-Length>:` part, which makes it impossible to extract the entire `<USR-Length>:<USR>` part, even though the original `Line` is copied. I will change the approach of adding the length part. Since the length is only used while parsing the CTU index, I will modify function `createCrossTUIndexString` to add the length and revert the changes to function `CrossTranslationUnitContext::getLookupName` to keep the lookup name for CTU queries unchanged.
clang/test/Analysis/Inputs/ctu-lookup-name-with-space.cpp
13–15	Yes, you are right. I was misled by myself.

Revert function CrossTranslationUnitContext::getLookupName
Add length when dumping the CTU index mapping via function createCrossTUIndexString
Remove the assertions during CTU map query process
Make function parseCrossTUIndexItem more readable
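
A reader for this final layout might look like the sketch below: the length is used only to cut the line, the key is the plain USR, and malformed input is reported rather than asserted. It is an assumption-laden illustration written against the standard library; `IndexEntry` and `parseIndexLine` are hypothetical names, and the actual parseCrossTUIndexItem in the patch may be structured differently.

```cpp
#include <cassert>
#include <cctype>
#include <cstddef>
#include <string>

struct IndexEntry {
  std::string USR;      // lookup name, stored without the length prefix
  std::string FilePath; // may itself contain spaces
};

// Parses "<USR-Length>:<USR> <File-Path>". Returns false on malformed input
// instead of asserting, since the index file can be supplied by the user.
inline bool parseIndexLine(const std::string &Line, IndexEntry &Entry) {
  std::size_t Pos = 0, Length = 0;
  while (Pos < Line.size() &&
         std::isdigit(static_cast<unsigned char>(Line[Pos]))) {
    Length = Length * 10 + static_cast<std::size_t>(Line[Pos] - '0');
    if (Length > Line.size()) // cannot fit; also keeps Length from wrapping
      return false;
    ++Pos;
  }
  // Need at least one digit, a non-zero length, and a ':' after the digits.
  if (Pos == 0 || Length == 0 || Pos >= Line.size() || Line[Pos] != ':')
    return false;
  const std::size_t USRBegin = Pos + 1;
  const std::size_t Remaining = Line.size() - USRBegin;
  // The USR must be followed by one space and a non-empty path.
  if (Remaining < 2 || Length > Remaining - 2)
    return false;
  if (Line[USRBegin + Length] != ' ')
    return false;
  Entry.USR = Line.substr(USRBegin, Length);
  Entry.FilePath = Line.substr(USRBegin + Length + 1);
  assert(Entry.USR.size() == Length);
  return true;
}
```

With this shape, the two lambda lookup names from the summary map to two distinct keys even though they share a prefix up to the first space, and a path containing spaces comes through intact.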

Harbormaster completed remote builds in B138384: Diff 393064. · Dec 9 2021, 2:19 AM

I think it looks great. Thanks.

This revision is now accepted and ready to land. · Dec 13 2021, 3:30 AM

When running my test case ctu-lookup-name-with-space.cpp on Windows, llvm-lit reports 'cp': command not found, which is why the test fails on Windows.
And when I remove the cp commands and use the original file names instead, clang reports YAML:1:293: error: Unrecognized escape code; it seems that the static analyzer only reads the compilation database in YAML format on Windows.
Should I disable this test case on Windows? Or are there any other approaches to make it work on Windows?

In D102669#3194405, @OikawaKirie wrote:

Should I disable this test case on Windows? Or are there any other approaches to make it work on Windows?

I'm fine with disabling this test on Windows.

Fix the YAML:1:293: error: Unrecognized escape code error by replacing the lit substitution pattern %S with %/S.
Fix cp problems by removing the file copy operations.

Harbormaster completed remote builds in B139579: Diff 394748. · Dec 16 2021, 12:11 AM

steakhal accepted this revision. · Dec 16 2021, 1:37 AM

It seems this patch has nothing to do with the failure in the Linux build. I think it is now ready to land.
Thanks a lot for your suggestions during the revision.

Could you please commit this patch on my behalf? Thanks.
Ella Ma <alansnape3058@gmail.com>

This revision was landed with ongoing or failed builds. · Dec 16 2021, 8:48 AM

Closed by commit rG333d66b09494: [analyzer][ctu] Fix wrong 'multiple definitions' errors caused by space… (authored by OikawaKirie, committed by steakhal).

This revision was automatically updated to reflect the committed changes.

steakhal added a commit: rG333d66b09494: [analyzer][ctu] Fix wrong 'multiple definitions' errors caused by space….

alanphipps added a subscriber: alanphipps. · Dec 16 2021, 2:13 PM

This commit seems to have caused a test to fail: https://green.lab.llvm.org/green/job/clang-stage1-cmake-RA-incremental/26118/testReport/

Can you fix the failure or revert the patch?

I do not know how this error happens. Maybe we can revert this patch for now and have another try in the future.

This breaks tests on macOS: http://45.33.8.238/macm1/23920/step_7.txt

Please take a look and revert for now if it takes a while to fix.

Could you please revert this on my behalf? I currently have no idea how to fix this problem.

thakis added a reverting change: rG770ef94097c0: Revert "[analyzer][ctu] Fix wrong 'multiple definitions' errors caused by space…. · Dec 16 2021, 5:51 PM

reverted in 770ef94097c02205b3ec9e902f1d6a9c99b5189c. thanks!

It seems that it is not this patch that causes the problem, which is similar to the one in D75665.
IMO it is a problem of on-demand-parsing, but I do not have a Mac M1 device to reproduce this bug.
Maybe we can just land this patch by restricting the test case to run only on Linux, just as D75665 does (rG5cc18516c483 vs rG97e07d0c352c), and leave the problem for future fixes.

Could you please apply the update provided below and land this patch again? @steakhal or other reviewers?

clang/test/Analysis/ctu-lookup-name-with-space.cpp
14	Adding this line here.

uabelho added a subscriber: uabelho. · Dec 16 2021, 10:06 PM

We shouldn't skip mac targets. I CC ASTImporter folks, they probably have an M1.

arichardson added a subscriber: arichardson. · Dec 17 2021, 1:51 AM

arichardson added inline comments.

clang/test/Analysis/ctu-lookup-name-with-space.cpp
14	Disabling the test on non-Linux is not a good idea IMO since it means we lose coverage on other platforms. My guess is that you just need to specify an explicit triple in the clang invocations.

In D102669#3199270, @steakhal wrote:

We shouldn't skip mac targets. I CC ASTImporter folks, they probably have an M1.

I do not intend to ignore this problem triggered on M1. However, I think it is not this patch that causes the problem; it just triggers it.
I mean we can disable the test case temporarily on M1, and then, in another patch, fix this problem and re-enable both this test case and the one for on-demand-parsing.
I think they trigger the same problem for the same reason on M1.

Besides, it seems to be the problem of ASTUnit::LoadFromCommandLine, rather than the ASTImporter.

clang/test/Analysis/ctu-lookup-name-with-space.cpp
14	AFAIK, we cannot do that. If this test case is executed on different platforms, we cannot determine the triple ahead of time and specify it in the invocation list.

In D102669#3205889, @OikawaKirie wrote:

In D102669#3199270, @steakhal wrote:

We shouldn't skip mac targets. I CC ASTImporter folks, they probably have an M1.

I do not intend to ignore this problem triggered on M1. However, I think it is not this patch that causes the problem; it just triggers it.
I mean we can disable the test case temporarily on M1, and then, in another patch, fix this problem and re-enable both this test case and the one for on-demand-parsing.
I think they trigger the same problem for the same reason on M1.

Besides, it seems to be the problem of ASTUnit::LoadFromCommandLine, rather than the ASTImporter.

Prior to this patch, it worked on M1; after landing it broke something, so we clearly shouldn't land this.
We should add a test-case demonstrating the problem with M1 with a given configuration.
Then we need to track down and fix the underlying issue causing it. That should be done probably in a separate patch and add it as a parent patch to this one.

If all of these are done, we can probably land both of them.

clang/test/Analysis/ctu-lookup-name-with-space.cpp
14	If we were to pin the triple, then each platform would emit the correct AST dumps according to that platform, roughly like cross-compilation.

This revision is now accepted and ready to land. · Dec 22 2021, 1:02 AM

steakhal requested changes to this revision. · Dec 22 2021, 1:02 AM

This revision now requires changes to proceed. · Dec 22 2021, 1:02 AM

In D102669#3206089, @steakhal wrote:

Prior to this patch, it worked on M1; after landing it broke something, so we clearly shouldn't land this.

I do not think it is this patch that breaks the functionality on M1, as it depends on the *on-demand-parsing* feature that is not tested on M1 currently.

We should add a test-case demonstrating the problem with M1 with a given configuration.

If I understand correctly (it is on-demand-parsing that triggers the problem), this problem can be triggered by enabling the test case of D75665 on M1.

Then we need to track down and fix the underlying issue causing it. That should be done probably in a separate patch and add it as a parent patch to this one.

If all of these are done, we can probably land both of them.

Maybe a simpler way for now is to load the external TU to be imported from an AST dump rather than via on-demand-parsing, which lets us fix this failure with the test case still enabled on M1.

I will run a series of tests on my concerns later, and I will reply with my results if I find something.

I have confirmed that this problem is not due to this patch.
Besides, on Mac, both M1 and Intel, on-demand-parsing as well as loading an AST file generated via the driver argument -emit-ast will also trigger this problem.
However, when loading an AST file generated via the cc1 argument -emit-pch, the problem is not triggered.

See the example below, which is executed on an Intel Mac laptop with clang 13.0.0.

/tmp/test/test.c:

void f();
void g() { f(); }

/tmp/test/importee.c:

void f() { }

/tmp/test/odp/externalDefMap.txt:

c:@F@f /tmp/test/importee.c

/tmp/test/odp/invocations.yaml:

"/tmp/test/importee.c": ["gcc", "-c", "/tmp/test/importee.c"]

/tmp/test/ast/externalDefMap.txt:

c:@F@f /tmp/test/ast/importee.c.ast

When executing the analyzer with CTU analysis via on-demand-parsing:

/tmp/test$ clang -cc1 -analyze -analyzer-checker=core -analyzer-config experimental-enable-naive-ctu-analysis=true,ctu-dir=odp,ctu-invocation-list=invocations.yaml test.c

Or loading AST file generated via driver argument -emit-ast:

/tmp/test$ clang -emit-ast importee.c -o ast/importee.c.ast
/tmp/test$ clang -cc1 -analyze -analyzer-checker=core -analyzer-config experimental-enable-naive-ctu-analysis=true,ctu-dir=ast test.c

The same diagnostic message is generated, though the triples are different from the ones for m1.

warning: imported AST from '/tmp/test/importee.c' had been generated for a different target, current: x86_64-apple-darwin21.2.0, imported: x86_64-apple-macosx12.0.0 [-Wctu]

However, the problem will not be triggered if triple is given:
(On demand parsing: setting the triple of the entry file to the one of imported ASTUnit)

/tmp/test$ clang -cc1 -analyze -analyzer-checker=core -analyzer-config experimental-enable-naive-ctu-analysis=true,ctu-dir=odp,ctu-invocation-list=invocations.yaml test.c -triple x86_64-apple-macosx12.0.0

(AST)

/tmp/test$ clang -target arm-apple-macosx -emit-ast importee.c -o ast/importee.c.ast
/tmp/test$ clang -cc1 -analyze -analyzer-checker=core -analyzer-config experimental-enable-naive-ctu-analysis=true,ctu-dir=ast test.c -triple arm-apple-macosx

Or the AST file is generated via cc1 argument -emit-pch:

/tmp/test$ clang -cc1 -emit-pch importee.c -o ast/importee.c.ast
/tmp/test$ clang -cc1 -analyze -analyzer-checker=core -analyzer-config experimental-enable-naive-ctu-analysis=true,ctu-dir=ast test.c

I think we can bypass the problem temporarily by loading the AST file generated by cc1 argument -emit-pch, just as shown in the last code snippet above.

Replace on-demand-parsing with loading AST file for the new test case.
Tested on Linux and MacOS(x86).
If it can also pass the CI test on Windows, I think we can have another try on landing this patch.

Besides, as mentioned above, to trigger the problem of target triple on MacOS, we can simply remove the requirement of Linux system for the two test cases of on-demand-parsing, i.e. clang/test/Analysis/ctu-on-demand-parsing.c and clang/test/Analysis/ctu-on-demand-parsing.cpp.

Harbormaster completed remote builds in B141490: Diff 397289. · Jan 4 2022, 7:59 AM

To make it work on Windows, Linux, and Mac OS, the test now uses echo to create the external function map and an AST file for the CTU analysis.
Tested on Windows, Linux, and Mac OS under x64.

Harbormaster completed remote builds in B141819: Diff 397767. · Jan 5 2022, 7:22 PM

I think I have found the reason for the problem, and it confirms my guesses.

When executing the test cases of the static analyzer, we usually use %clang_analyze_cc1 as the entry, which is %clang_cc1 -analyze. And we usually do not set a target triple, as it is not required by the analyzer. However, things are more complicated on Darwin.

In Darwin, the default target triple is ARCH-apple-darwinXX.XX.XX, where ARCH is the architecture (e.g. x86_64) and XX.XX.XX is the version of the Darwin system. And the default target triple will be then adjusted to ARCH-apple-SYSTEMXX.XX.XX by the Driver::Darwin::ComputeEffectiveClangTriple in driver related code, where SYSTEM can be watchos, tvos, ios and macosx.

In the clang driver, the adjusted target triple will be passed to the new cc1 process; whereas in tooling and ASTUnit::LoadFromCommandLine, the adjusted target triple will be used to generate cc1 arguments to create CompilerInvocation. But when executing bare clang -cc1, if the target triple argument is not provided, it remains the default ARCH-apple-darwinXX.XX.XX. And this is the reason for the conflict.

The CTU on-demand-parsing mechanism uses ASTUnit::LoadFromCommandLine to load external ASTs. The tool clang-check uses clang tooling to parse the entry file. Therefore, both target triples are the adjusted ones, which match. The same holds for the driver (clang --analyze ...). The bare %clang_cc1 is the exception: its target triple is the default one.

Let's have a look at a simple example, supposing externalDefMap.txt and invocations.yaml are generated correctly.

input.cc:

void bar();
void foo() { bar(); }

importee.cc:

void bar() { }

Using driver:

clang -v --analyze input.cc -Xanalyzer -analyzer-config -Xanalyzer experimental-enable-naive-ctu-analysis=true,ctu-dir=.

Output: OK, adjusted to adjusted

(in-process)
/path/to/clang-15 -cc1 -triple x86_64-apple-macosx10.15.0 ...

Using clang-check:

clang-check -analyze input.cc -- -v -Xanalyzer -analyzer-config -Xanalyzer experimental-enable-naive-ctu-analysis=true,ctu-dir=.

Output: OK, adjusted to adjusted

clang Invocation:
 "/path/to/clang-tool" "-cc1" "-triple" "x86_64-apple-macosx10.15.0"

Using cc1:

clang -cc1 -v -analyze input.cc  -analyzer-checker=core -analyzer-config experimental-enable-naive-ctu-analysis=true,ctu-dir=.

Output: ERROR, default to adjusted

warning: imported AST from 'importee.cc' had been generated for a different target, current: x86_64-apple-darwin19.6.0, imported: x86_64-apple-macosx10.15.0 [-Wctu]

What do you think is a better way to fix this problem? @gamesh411 @steakhal @martong
Using clang-check to run the test case seems to be a good way to overcome the problem, but the problem still exists.
However, IMO it is not a good idea to make clang cc1 adjust the default target triple manually.

Thank you @OikawaKirie for the thorough investigation and explanation, even annotations and examples. I really appreciate it.
However, you already surpassed my knowledge regarding the frontend, cc1, and other driver magic transformations.

I think it would be great to invite some Apple folks and experts in the driver flag stuff. That being said this interesting behavior would be a nice candidate for a post on the clang discourse forum.
@NoQ might also have something to say about this target triple magic.

Hi @keith, I've seen you commented on a clang driver-related issue: "Why does march=native not work on Apple M1?"
You might also have some valuable insight about this weird behavior about the detected target triple in a regular mode and using the -cc1 mode.
You don't need to go through the whole conversation, the last few comments should be enough.
Feel free to invite more people if you think.

In D102669#3325467, @steakhal wrote:

Hi @keith, I've seen you commented on a clang driver-related issue: "Why does march=native not work on Apple M1?"
You might also have some valuable insight about this weird behavior about the detected target triple in a regular mode and using the -cc1 mode.
You don't need to go through the whole conversation, the last few comments should be enough.
Feel free to invite more people if you think.

I'm not very familiar with the logic or history here, but I did look into this a bit, and I don't see a clear solution. I think the logic that lives in the Darwin driver code for creating the triple would have to be called from the default triple logic that's used in the cc1 case. I imagine to make this test pass you'd be best off making sure you always call clang or cc1, and not intermix them, or provide a default triple in all cases that's stable if possible. I do think it would be nice to unify the triple logic between these 2 places, but I imagine folks who know this code better than I do have opinions there.

Thanks, @keith.

I agree with @keith to commit this patch without using on-demand-parsing through cc1, as this patch has nothing to do with the target triple issue we found.

In the current version, I use the PCH file to load the external TU.
And it seems to work fine on my system and on the Windows CI.

IMO, maybe we can just leave a FIXME or something else in the test case and commit this patch to fix the original problem we want to fix.
(of course, re-submit to rerun the test case on Linux)
What do you think? @steakhal

Herald added a project: Restricted Project. · Mar 20 2022, 8:39 PM

In D102669#3395364, @OikawaKirie wrote:

IMO, maybe we can just leave a FIXME or something else in the test case and commit this patch to fix the original problem we want to fix.
(of course, re-submit to rerun the test case on Linux)
What do you think? @steakhal

I agree.
Please update the summary to have a reference to the discourse question you posted about this.

Add FIXME in test case.
Add discourse topic link in summary.

Harbormaster completed remote builds in B155329: Diff 416861. · Mar 21 2022, 4:13 AM

Okay, let's give it another shot. Please monitor any buildbot failures and revert promptly if needed.

This revision is now accepted and ready to land. · Mar 21 2022, 6:08 AM

Closed by commit rG9f90254286dc: [analyzer][ctu] Fix wrong 'multiple definitions' errors caused by space… (authored by OikawaKirie). · Mar 21 2022, 7:45 PM

This revision was automatically updated to reflect the committed changes.

OikawaKirie added a commit: rG9f90254286dc: [analyzer][ctu] Fix wrong 'multiple definitions' errors caused by space….

Revision Contents

Path	Size
clang/docs/analyzer/user-docs/CrossTranslationUnit.rst	8 lines
clang/include/clang/Basic/DiagnosticCrossTUKinds.td	4 lines
clang/lib/CrossTU/CrossTranslationUnit.cpp	65 lines
clang/test/Analysis/Inputs/ctu-import.c.externalDefMap.ast-dump.txt	2 lines
clang/test/Analysis/Inputs/ctu-lookup-name-with-space.cpp	17 lines
clang/test/Analysis/Inputs/ctu-other.c.externalDefMap.ast-dump.txt	14 lines
clang/test/Analysis/Inputs/ctu-other.cpp.externalDefMap.ast-dump.txt	60 lines
clang/test/Analysis/Inputs/plist-macros-with-expansion-ctu.c.externalDefMap.txt	8 lines
clang/test/Analysis/ctu-inherited-default-ctor.cpp	2 lines
clang/test/Analysis/ctu-lookup-name-with-space.cpp	41 lines
clang/test/Analysis/func-mapping-test.cpp	26 lines
clang/unittests/CrossTU/CrossTranslationUnitTest.cpp	2 lines

Diff 417168

clang/docs/analyzer/user-docs/CrossTranslationUnit.rst

Show First 20 Lines • Show All 75 Lines • ▼ Show 20 Lines

.. code-block:: bash

$ pwd $ /path/to/your/project

$ clang++ -emit-ast -o foo.cpp.ast foo.cpp

$ # Check that the .ast file is generated:

$ ls

compile_commands.json foo.cpp.ast foo.cpp main.cpp

The next step is to create a CTU index file which holds the `USR` name and location of external definitions in the

source files:

source files in format `<USR-Length>:<USR> <File-Path>`:

steakhalUnsubmitted

Not Done

The next step is to create a CTU index file which holds the `USR` name and location of external definitions in the

- source files in format `USR-Length:USR File-Path`:

+ source files in format `<USR-Length>:<USR> <File-Path>`:

.. code-block:: bash

steakhal:

.. code-block:: bash

$ clang-extdef-mapping -p . foo.cpp

c:@F@foo# /path/to/your/project/foo.cpp

9:c:@F@foo# /path/to/your/project/foo.cpp

$ clang-extdef-mapping -p . foo.cpp > externalDefMap.txt

We have to modify `externalDefMap.txt` to contain the name of the `.ast` files instead of the source files:

.. code-block:: bash

$ sed -i -e "s/.cpp/.cpp.ast/g" externalDefMap.txt

"/path/to/your/project/main.cpp":

- "clang++"

- "-c"

- "/path/to/your/project/main.cpp"

- "-o"

- "/path/to/your/project/main.o"

We'd like to analyze `main.cpp` and discover the division by zero bug.

As we are using On-demand mode, we only need to create a CTU index file which holds the `USR` name and location of

external definitions in the source files:

external definitions in the source files in format `<USR-Length>:<USR> <File-Path>`:

.. code-block:: bash

$ clang-extdef-mapping -p . foo.cpp

c:@F@foo# /path/to/your/project/foo.cpp

9:c:@F@foo# /path/to/your/project/foo.cpp

$ clang-extdef-mapping -p . foo.cpp > externalDefMap.txt

Now everything is available for the CTU analysis.

We have to feed Clang with CTU specific extra arguments:

.. code-block:: bash

$ pwd

clang/include/clang/Basic/DiagnosticCrossTUKinds.td

	//==--- DiagnosticCrossTUKinds.td - Cross Translation Unit diagnostics ----===//			//==--- DiagnosticCrossTUKinds.td - Cross Translation Unit diagnostics ----===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	let Component = "CrossTU" in {			let Component = "CrossTU" in {

	def err_ctu_error_opening : Error<			def err_ctu_error_opening : Error<
	"error opening '%0': required by the CrossTU functionality">;			"error opening '%0': required by the CrossTU functionality">;

	def err_extdefmap_parsing : Error<			def err_extdefmap_parsing : Error<
	"error parsing index file: '%0' line: %1 'UniqueID filename' format "			"error parsing index file: '%0' line: %1 '<USR-Length>:<USR> <File-Path>' "
	"expected">;			"format expected">;

	def err_multiple_def_index : Error<			def err_multiple_def_index : Error<
	"multiple definitions are found for the same key in index ">;			"multiple definitions are found for the same key in index ">;

	def warn_ctu_incompat_triple : Warning<			def warn_ctu_incompat_triple : Warning<
	"imported AST from '%0' had been generated for a different target, "			"imported AST from '%0' had been generated for a different target, "
	"current: %1, imported: %2">, InGroup<CrossTU>;			"current: %1, imported: %2">, InGroup<CrossTU>;
	}			}

clang/lib/CrossTU/CrossTranslationUnit.cpp

void IndexError::log(raw_ostream &OS) const { void IndexError::log(raw_ostream &OS) const {

OS << Category->message(static_cast<int>(Code)) << '\n'; OS << Category->message(static_cast<int>(Code)) << '\n';

} }

std::error_code IndexError::convertToErrorCode() const { std::error_code IndexError::convertToErrorCode() const {

return std::error_code(static_cast<int>(Code), *Category); return std::error_code(static_cast<int>(Code), *Category);

} }

/// Parse one line of the input CTU index file.

///

/// @param[in] LineRef The input CTU index item in format

/// "<USR-Length>:<USR> <File-Path>".

/// @param[out] LookupName The lookup name in format "<USR-Length>:<USR>".

/// @param[out] FilePath The file path "<File-Path>".

static bool parseCrossTUIndexItem(StringRef LineRef, StringRef &LookupName,

StringRef &FilePath) {

// `LineRef` is "<USR-Length>:<USR> <File-Path>" now.

size_t USRLength = 0;

if (LineRef.consumeInteger(10, USRLength))

return false;

assert(USRLength && "USRLength should be greater than zero.");

if (!LineRef.consume_front(":"))

return false;

// `LineRef` is now just "<USR> <File-Path>".

// Check LookupName length out of bound and incorrect delimiter.

steakhalUnsubmitted

Not Done

StringRef &FilePath) {

- // Find the length delimiter.

- const size_t LengthDelimiter = LineRef.find(':');

- if (StringRef::npos == LengthDelimiter)

+ // "<USR-Length>:<USR> <File-Path>"

+ std::size_t USRLength;

+ if (Line.consumeInteger(10, USRLength))

return false;

+ assert(USRLength && "must be greater than zero");

- // Parse the length of LookupName as USRLength.

- size_t USRLength = 0;

- if (LineRef.substr(0, LengthDelimiter).consumeInteger(10, USRLength))

+ if (!Line.consume_front(":"))

return false;

- // Check LookupName length out of bound and incorrect delimiter.

- const size_t Delimiter = LengthDelimiter + USRLength + 1;

- if (USRLength <= 0 || Delimiter >= LineRef.size() ||

- ' ' != LineRef[Delimiter])

+ // `Line` is now just "<USR> <File-Path>".

+ if (Line.size() <= USRLength || Line[USRLength] != ' ')

return false;

- LookupName = LineRef.substr(0, Delimiter);

- FilePath = LineRef.substr(Delimiter + 1);

+ LookupName = Line.substr(0, USRLength);

+ FilePath = Line.substr(USRLength + 1);

return true;

}

#define IS_CTU_INDEX_KEY_VALID(FUNCTION_NAME) \

OikawaKirieAuthorUnsubmitted

Done

The lookup name of the function being imported comes from CrossTranslationUnitContext::getLookupName. Keeping the length in the mapping avoids parsing the lookup name again during import.

steakhalUnsubmitted

Not Done

Okay; you can copy the original StringRef to have that. But consuming it on one path makes the code much more readable.

OikawaKirieAuthorUnsubmitted

Not Done

The getAsInteger call can also check whether the content before the first colon is an integer. Therefore, a sub-string operation is required here.

steakhalUnsubmitted

Not Done

I don't doubt that your proposed way of doing this works and is efficient.
What I say is that I think there is room for improvement in the non-functional aspects, in the readability. However, it's not really a blocking issue, more of a personal taste.

OikawaKirieAuthorUnsubmitted

Not Done

I know what you are considering: it is clearer and more readable to consume the length first and then the USR. However, to correctly separate the USR and the file path, the length of USR-Length itself is also required, which makes it impossible to just *consume* the length at the beginning.

Another way of solving this problem is to re-create the string with the USR-Length and the USR after parsing, but I think it is not a good solution.

BTW, is it necessary to assert the USR-Length to be greater than zero? I think it is more reasonable to report *invalid format* rather than assert the value, as it can be provided by the user.

steakhalUnsubmitted

Not Done

I think what causes the misunderstanding is the definition of consume in the context of StringRef.

const StringRef Copy = Line;
Line.consumeInteger(...); // Line advances forward by the number of characters that were parsed as an integral value.
// Copy still refers to the unmodified, original characters.
// I can use it however I want.

// `Line` is a suffix of `Copy`, and the `.end()` should be the same, only `.begin()` should differ.

I hope that caused the miscommunication.

BTW, is it necessary to assert the USR-Length to be greater than zero? I think it is more reasonable to report *invalid format* rather than assert the value, as it can be provided by the user.

Yeah, sure!
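The consume semantics described in this comment can be sketched outside LLVM. The block below is only an illustration on `std::string_view`: `consumeInteger` and `copySurvivesConsume` are hypothetical stand-ins written for this note, not the `llvm::StringRef` member functions. The stand-in follows the same convention of returning true on failure.

```cpp
#include <cassert>
#include <charconv>
#include <cstddef>
#include <string_view>
#include <system_error>

// Hypothetical stand-in for llvm::StringRef::consumeInteger, written against
// std::string_view. Returns true on failure, false on success.
inline bool consumeInteger(std::string_view &S, std::size_t &Result) {
  const char *Begin = S.data();
  const char *End = Begin + S.size();
  auto [Ptr, Ec] = std::from_chars(Begin, End, Result, 10);
  if (Ec != std::errc() || Ptr == Begin)
    return true; // No leading decimal integer: S is left untouched.
  // Advance S past the digits that were parsed.
  S.remove_prefix(static_cast<std::size_t>(Ptr - Begin));
  return false;
}

// Shows that a copy taken before consuming still refers to the whole line,
// while the consumed view is a suffix of it.
inline bool copySurvivesConsume() {
  std::string_view Line = "9:c:@F@foo# /path/to/your/project/foo.cpp";
  const std::string_view Copy = Line;
  std::size_t USRLength = 0;
  if (consumeInteger(Line, USRLength))
    return false;
  // Same end pointer, later begin pointer: only the "9" was consumed.
  return USRLength == 9 && Copy.size() == Line.size() + 1 &&
         Copy.data() + Copy.size() == Line.data() + Line.size();
}
```

Calling `copySurvivesConsume()` returns true under these assumptions, which matches the point made above: the copy keeps the length prefix available even after the working view has advanced.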

OikawaKirieAuthorUnsubmitted

Done

I think I have figured out what has been misunderstood.

In the current patch, I just modify function CrossTranslationUnitContext::getLookupName by adding a length at the beginning. Therefore, the lookup name for the CTU query will have the length part. And for the sake of simplicity and efficiency, the length together with the USR is stored in the mapping as the key.

To correctly parse the <USR-Length>:<USR> part, I cannot just consume the <USR-Length> at the beginning. Otherwise, I cannot know the length of <USR-Length>: part, which makes it impossible to parse the entire <USR-Length>:<USR> part, even though the original Line is copied.

I will update the approach of adding the length part. Since the length is only used during parsing the CTU index, I will modify function createCrossTUIndexString to add the length and revert the changes to function CrossTranslationUnitContext::getLookupName to keep the lookup name for CTU query unchanged.
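The approach described here (the writer adds the length, the parser strips it, and the lookup key stays the bare USR) can be sketched as a round trip. This is a minimal illustration on standard-library types, not the code from the patch: `formatIndexLine` and `parseIndexLine` are hypothetical names, and a zero length is reported as an invalid line rather than asserted, as discussed earlier in this thread.

```cpp
#include <cassert>
#include <charconv>
#include <cstddef>
#include <string>
#include <string_view>
#include <system_error>

// Hypothetical helper: writes one index line in the format
// "<USR-Length>:<USR> <File-Path>".
inline std::string formatIndexLine(std::string_view USR,
                                   std::string_view Path) {
  std::string Line = std::to_string(USR.size());
  Line += ':';
  Line.append(USR);
  Line += ' ';
  Line.append(Path);
  return Line;
}

// Hypothetical helper: splits one index line into the bare USR and the file
// path. Returns false for any malformed line, including a zero length.
inline bool parseIndexLine(std::string_view Line, std::string_view &USR,
                           std::string_view &Path) {
  std::size_t USRLength = 0;
  const char *Begin = Line.data();
  const char *End = Begin + Line.size();
  auto [Ptr, Ec] = std::from_chars(Begin, End, USRLength, 10);
  if (Ec != std::errc() || USRLength == 0)
    return false;
  Line.remove_prefix(static_cast<std::size_t>(Ptr - Begin));
  if (Line.empty() || Line.front() != ':')
    return false;
  Line.remove_prefix(1);
  // Line is now "<USR> <File-Path>"; the byte right after the USR must be
  // the single space delimiter.
  if (USRLength >= Line.size() || Line[USRLength] != ' ')
    return false;
  USR = Line.substr(0, USRLength);
  Path = Line.substr(USRLength + 1);
  return true;
}
```

Because the length only exists in the serialized line, a USR that itself contains spaces survives the round trip unchanged, and the in-memory key does not carry the prefix.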

if (USRLength >= LineRef.size() || ' ' != LineRef[USRLength])

return false;

LookupName = LineRef.substr(0, USRLength);

FilePath = LineRef.substr(USRLength + 1);

return true;

}

steakhalUnsubmitted

Not Done

Please do something about this macro.
Encapsulate the logic in some other way.

llvm::Expected<llvm::StringMap<std::string>> llvm::Expected<llvm::StringMap<std::string>>

parseCrossTUIndex(StringRef IndexPath) { parseCrossTUIndex(StringRef IndexPath) {

std::ifstream ExternalMapFile{std::string(IndexPath)}; std::ifstream ExternalMapFile{std::string(IndexPath)};

steakhalUnsubmitted

Not Done

charactor -> character

if (!ExternalMapFile) if (!ExternalMapFile)

return llvm::make_error<IndexError>(index_error_code::missing_index_file, return llvm::make_error<IndexError>(index_error_code::missing_index_file,

IndexPath.str()); IndexPath.str());

llvm::StringMap<std::string> Result; llvm::StringMap<std::string> Result;

steakhalUnsubmitted

Not Done

You should probably use more descriptive names; I wouldn't know what this does if I hadn't reviewed this patch.

std::string Line; std::string Line;

unsigned LineNo = 1; unsigned LineNo = 1;

while (std::getline(ExternalMapFile, Line)) { while (std::getline(ExternalMapFile, Line)) {

StringRef LineRef{Line}; // Split lookup name and file path

const size_t Delimiter = LineRef.find(' '); StringRef LookupName, FilePathInIndex;

if (Delimiter > 0 && Delimiter != std::string::npos) { if (!parseCrossTUIndexItem(Line, LookupName, FilePathInIndex))

StringRef LookupName = LineRef.substr(0, Delimiter); return llvm::make_error<IndexError>(

index_error_code::invalid_index_format, IndexPath.str(), LineNo);

// Store paths with posix-style directory separator. // Store paths with posix-style directory separator.

SmallString<32> FilePath(LineRef.substr(Delimiter + 1)); SmallString<32> FilePath(FilePathInIndex);

llvm::sys::path::native(FilePath, llvm::sys::path::Style::posix); llvm::sys::path::native(FilePath, llvm::sys::path::Style::posix);

bool InsertionOccured; bool InsertionOccured;

std::tie(std::ignore, InsertionOccured) = std::tie(std::ignore, InsertionOccured) =

Result.try_emplace(LookupName, FilePath.begin(), FilePath.end()); Result.try_emplace(LookupName, FilePath.begin(), FilePath.end());

if (!InsertionOccured) if (!InsertionOccured)

return llvm::make_error<IndexError>( return llvm::make_error<IndexError>(

index_error_code::multiple_definitions, IndexPath.str(), LineNo); index_error_code::multiple_definitions, IndexPath.str(), LineNo);

} else

return llvm::make_error<IndexError>(

index_error_code::invalid_index_format, IndexPath.str(), LineNo);

++LineNo; ++LineNo;

} }

return Result; return Result;

} }

std::string std::string

createCrossTUIndexString(const llvm::StringMap<std::string> &Index) { createCrossTUIndexString(const llvm::StringMap<std::string> &Index) {

std::ostringstream Result; std::ostringstream Result;

for (const auto &E : Index) for (const auto &E : Index)

Result << E.getKey().str() << " " << E.getValue() << '\n'; Result << E.getKey().size() << ':' << E.getKey().str() << ' '

<< E.getValue() << '\n';

return Result.str(); return Result.str();

} }

bool containsConst(const VarDecl *VD, const ASTContext &ACtx) { bool containsConst(const VarDecl *VD, const ASTContext &ACtx) {

CanQualType CT = ACtx.getCanonicalType(VD->getType()); CanQualType CT = ACtx.getCanonicalType(VD->getType());

if (!CT.isConstQualified()) { if (!CT.isConstQualified()) {

const RecordType *RTy = CT->getAs<RecordType>(); const RecordType *RTy = CT->getAs<RecordType>();

if (!RTy || !RTy->hasConstFields()) if (!RTy || !RTy->hasConstFields())

if (ASTCacheEntry == FileASTUnitMap.end()) {

return ASTCacheEntry->second.get(); return ASTCacheEntry->second.get();

} }

llvm::Expected<ASTUnit *> llvm::Expected<ASTUnit *>

CrossTranslationUnitContext::ASTUnitStorage::getASTUnitForFunction( CrossTranslationUnitContext::ASTUnitStorage::getASTUnitForFunction(

StringRef FunctionName, StringRef CrossTUDir, StringRef IndexName, StringRef FunctionName, StringRef CrossTUDir, StringRef IndexName,

bool DisplayCTUProgress) { bool DisplayCTUProgress) {

// Try the cache first. // Try the cache first.

steakhalUnsubmitted

Not Done

The assertion speaks for itself. It rarely needs additional documentation.

auto ASTCacheEntry = NameASTUnitMap.find(FunctionName); auto ASTCacheEntry = NameASTUnitMap.find(FunctionName);

if (ASTCacheEntry == NameASTUnitMap.end()) { if (ASTCacheEntry == NameASTUnitMap.end()) {

// Load the ASTUnit from the pre-dumped AST file specified by ASTFileName. // Load the ASTUnit from the pre-dumped AST file specified by ASTFileName.

// Ensure that the Index is loaded, as we need to search in it. // Ensure that the Index is loaded, as we need to search in it.

if (llvm::Error IndexLoadError = if (llvm::Error IndexLoadError =

ensureCTUIndexLoaded(CrossTUDir, IndexName)) ensureCTUIndexLoaded(CrossTUDir, IndexName))

return std::move(IndexLoadError); return std::move(IndexLoadError);

// Check if there is and entry in the index for the function. // Check if there is an entry in the index for the function.

steakhalUnsubmitted

Not Done

return std::move(IndexLoadError);

- // Check if there is and entry in the index for the function.

+ // Check if there is an entry in the index for the function.

assert(IS_CTU_INDEX_KEY_VALID(FunctionName) &&

if (!NameFileMap.count(FunctionName)) { if (!NameFileMap.count(FunctionName)) {

++NumNotInOtherTU; ++NumNotInOtherTU;

return llvm::make_error<IndexError>(index_error_code::missing_definition); return llvm::make_error<IndexError>(index_error_code::missing_definition);

} }

// Search in the index for the filename where the definition of FuncitonName // Search in the index for the filename where the definition of FuncitonName

// resides. // resides.

if (llvm::Expected<ASTUnit *> FoundForFile = if (llvm::Expected<ASTUnit *> FoundForFile =

clang/test/Analysis/Inputs/ctu-import.c.externalDefMap.ast-dump.txt

c:@F@testStaticImplicit ctu-import.c.ast

23:c:@F@testStaticImplicit ctu-import.c.ast

clang/test/Analysis/Inputs/ctu-lookup-name-with-space.cpp

This file was added.

void f(void (*)());

void f(void (*)(int));

struct G {

G() {

// multiple definitions are found for the same key in index

f([]() -> void {}); // USR: c:@S@G@F@G#@Sa@F@operator void (*)()#1

f([](int) -> void {}); // USR: c:@S@G@F@G#@Sa@F@operator void (*)(int)#1

steakhalUnsubmitted

Not Done

I would rather put these into the importee()

OikawaKirieAuthorUnsubmitted

Not Done

The lambda exprs will not be included in the CTU index file if they are declared in a normal function.

steakhalUnsubmitted

Not Done

I see.

// As both lambda exprs have the same prefix, if the CTU index parser uses

// the first space character as the delimiter between USR and file path, a

// "multiple definitions are found for the same key in index" error will

// be reported.

}

steakhalUnsubmitted

Not Done

Why do you need to have a div by zero warning?

OikawaKirieAuthorUnsubmitted

Not Done

I am not sure whether we should test if an external function can be correctly imported. Hence, I wrote a div-by-zero bug here. A call to function clang_analyzer_warnIfReached is also OK here.

As the imported lambda expr will not be called, I think I can only test whether CTU works via another normal function.

steakhalUnsubmitted

Not Done

AFAIK importing a function and import-related stuff are orthogonal to actually emitting bug reports produced by the analyzer. That being said, if the importee() would have an empty body, the observable behavior would remain the same. And this is what I propose now.

OikawaKirieAuthorUnsubmitted

Not Done

Sorry, but I am not quite clear about your suggestions on this function.

};

steakhalUnsubmitted

Not Done

}

};

- int importee(int X) {

- return 1 / X;

- }

+ void importee() {}

Also fixup the return type in the declaration within the main TU. Also add the // expected-no-diagnostics comment to the primary TU.

OikawaKirieAuthorUnsubmitted

Done

Yes, you are right.
I was misled by myself.

void importee() {}
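The comment in this input file about the shared prefix can be modeled in a few lines. The sketch below is an assumption-laden illustration, not the pre-patch parser: `keyByFirstSpace` and `hasDuplicateKey` are hypothetical names, and the lines use the old index format without the length prefix.

```cpp
#include <cassert>
#include <set>
#include <string>
#include <string_view>
#include <vector>

// Hypothetical model of the old splitting strategy: the key is everything
// before the first space (the whole line if there is no space at all).
inline std::string_view keyByFirstSpace(std::string_view Line) {
  return Line.substr(0, Line.find(' '));
}

// Returns true if two lines of an old-format index map to the same key,
// which is the situation reported as "multiple definitions".
inline bool hasDuplicateKey(const std::vector<std::string> &Lines) {
  std::set<std::string> Seen;
  for (const std::string &Line : Lines)
    if (!Seen.insert(std::string(keyByFirstSpace(Line))).second)
      return true;
  return false;
}
```

With the two lambda USRs from this file, both keys collapse to `c:@S@G@F@G#@Sa@F@operator`, so the second insertion is rejected even though the full USRs differ.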

clang/test/Analysis/Inputs/ctu-other.c.externalDefMap.ast-dump.txt

	c:@F@inlineAsm ctu-other.c.ast			14:c:@F@inlineAsm ctu-other.c.ast
	c:@F@g ctu-other.c.ast			6:c:@F@g ctu-other.c.ast
	c:@F@f ctu-other.c.ast			6:c:@F@f ctu-other.c.ast
	c:@F@enumCheck ctu-other.c.ast			14:c:@F@enumCheck ctu-other.c.ast
	c:@F@identImplicit ctu-other.c.ast			18:c:@F@identImplicit ctu-other.c.ast
	c:@F@structInProto ctu-other.c.ast			18:c:@F@structInProto ctu-other.c.ast
	c:@F@switchWithoutCases ctu-other.c.ast			23:c:@F@switchWithoutCases ctu-other.c.ast

clang/test/Analysis/Inputs/ctu-other.cpp.externalDefMap.ast-dump.txt

	c:@N@chns@F@chf1#I# ctu-other.cpp.ast			19:c:@N@chns@F@chf1#I# ctu-other.cpp.ast
	c:@N@myns@N@embed_ns@F@fens#I# ctu-other.cpp.ast			30:c:@N@myns@N@embed_ns@F@fens#I# ctu-other.cpp.ast
	c:@F@g#I# ctu-other.cpp.ast			9:c:@F@g#I# ctu-other.cpp.ast
	c:@S@mycls@F@fscl#I#S ctu-other.cpp.ast			21:c:@S@mycls@F@fscl#I#S ctu-other.cpp.ast
	c:@S@mycls@F@fcl#I# ctu-other.cpp.ast			19:c:@S@mycls@F@fcl#I# ctu-other.cpp.ast
	c:@S@mycls@F@fvcl#I# ctu-other.cpp.ast			20:c:@S@mycls@F@fvcl#I# ctu-other.cpp.ast
	c:@N@myns@S@embed_cls@F@fecl#I# ctu-other.cpp.ast			31:c:@N@myns@S@embed_cls@F@fecl#I# ctu-other.cpp.ast
	c:@S@mycls@S@embed_cls2@F@fecl2#I# ctu-other.cpp.ast			34:c:@S@mycls@S@embed_cls2@F@fecl2#I# ctu-other.cpp.ast
	c:@S@derived@F@fvcl#I# ctu-other.cpp.ast			22:c:@S@derived@F@fvcl#I# ctu-other.cpp.ast
	c:@F@f#I# ctu-other.cpp.ast			9:c:@F@f#I# ctu-other.cpp.ast
	c:@N@myns@F@fns#I# ctu-other.cpp.ast			18:c:@N@myns@F@fns#I# ctu-other.cpp.ast
	c:@F@h#I# ctu-other.cpp.ast			9:c:@F@h#I# ctu-other.cpp.ast
	c:@F@h_chain#I# ctu-chain.cpp.ast			15:c:@F@h_chain#I# ctu-chain.cpp.ast
	c:@N@chns@S@chcls@F@chf4#I# ctu-chain.cpp.ast			27:c:@N@chns@S@chcls@F@chf4#I# ctu-chain.cpp.ast
	c:@N@chns@F@chf2#I# ctu-chain.cpp.ast			19:c:@N@chns@F@chf2#I# ctu-chain.cpp.ast
	c:@F@fun_using_anon_struct#I# ctu-other.cpp.ast			29:c:@F@fun_using_anon_struct#I# ctu-other.cpp.ast
	c:@F@other_macro_diag#I# ctu-other.cpp.ast			24:c:@F@other_macro_diag#I# ctu-other.cpp.ast
	c:@extInt ctu-other.cpp.ast			9:c:@extInt ctu-other.cpp.ast
	c:@N@intns@extInt ctu-other.cpp.ast			17:c:@N@intns@extInt ctu-other.cpp.ast
	c:@extS ctu-other.cpp.ast			7:c:@extS ctu-other.cpp.ast
	c:@S@A@a ctu-other.cpp.ast			8:c:@S@A@a ctu-other.cpp.ast
	c:@extSC ctu-other.cpp.ast			8:c:@extSC ctu-other.cpp.ast
	c:@S@ST@sc ctu-other.cpp.ast			10:c:@S@ST@sc ctu-other.cpp.ast
	c:@extSCN ctu-other.cpp.ast			9:c:@extSCN ctu-other.cpp.ast
	c:@extSubSCN ctu-other.cpp.ast			12:c:@extSubSCN ctu-other.cpp.ast
	c:@extSCC ctu-other.cpp.ast			9:c:@extSCC ctu-other.cpp.ast
	c:@extU ctu-other.cpp.ast			7:c:@extU ctu-other.cpp.ast
	c:@S@TestAnonUnionUSR@Test ctu-other.cpp.ast			26:c:@S@TestAnonUnionUSR@Test ctu-other.cpp.ast
	c:@F@testImportOfIncompleteDefaultParmDuringImport#I# ctu-other.cpp.ast			53:c:@F@testImportOfIncompleteDefaultParmDuringImport#I# ctu-other.cpp.ast
	c:@F@testImportOfDelegateConstructor#I# ctu-other.cpp.ast			39:c:@F@testImportOfDelegateConstructor#I# ctu-other.cpp.ast
	No newline at end of file

clang/test/Analysis/Inputs/plist-macros-with-expansion-ctu.c.externalDefMap.txt

	c:@F@F1 plist-macros-ctu.c.ast			7:c:@F@F1 plist-macros-ctu.c.ast
	c:@F@F2 plist-macros-ctu.c.ast			7:c:@F@F2 plist-macros-ctu.c.ast
	c:@F@F3 plist-macros-ctu.c.ast			7:c:@F@F3 plist-macros-ctu.c.ast
	c:@F@F_H plist-macros-ctu.c.ast			8:c:@F@F_H plist-macros-ctu.c.ast

clang/test/Analysis/ctu-inherited-default-ctor.cpp

	// Should not crash with '-analyzer-opt-analyze-headers' option during CTU analysis.			// Should not crash with '-analyzer-opt-analyze-headers' option during CTU analysis.
	//			//
	// RUN: rm -rf %t && mkdir -p %t/ctudir			// RUN: rm -rf %t && mkdir -p %t/ctudir
	// RUN: %clang_cc1 -std=c++14 -triple x86_64-pc-linux-gnu \			// RUN: %clang_cc1 -std=c++14 -triple x86_64-pc-linux-gnu \
	// RUN: -emit-pch -o %t/ctudir/ctu-inherited-default-ctor-other.cpp.ast \			// RUN: -emit-pch -o %t/ctudir/ctu-inherited-default-ctor-other.cpp.ast \
	// RUN: %S/Inputs/ctu-inherited-default-ctor-other.cpp			// RUN: %S/Inputs/ctu-inherited-default-ctor-other.cpp
	// RUN: echo "c:@N@clang@S@DeclContextLookupResult@SingleElementDummyList ctu-inherited-default-ctor-other.cpp.ast" \			// RUN: echo "59:c:@N@clang@S@DeclContextLookupResult@SingleElementDummyList ctu-inherited-default-ctor-other.cpp.ast" \
	// RUN: > %t/ctudir/externalDefMap.txt			// RUN: > %t/ctudir/externalDefMap.txt
	//			//
	// RUN: %clang_analyze_cc1 -std=c++14 -triple x86_64-pc-linux-gnu \			// RUN: %clang_analyze_cc1 -std=c++14 -triple x86_64-pc-linux-gnu \
	// RUN: -analyzer-opt-analyze-headers \			// RUN: -analyzer-opt-analyze-headers \
	// RUN: -analyzer-checker=core \			// RUN: -analyzer-checker=core \
	// RUN: -analyzer-config experimental-enable-naive-ctu-analysis=true \			// RUN: -analyzer-config experimental-enable-naive-ctu-analysis=true \
	// RUN: -analyzer-config ctu-dir=%t/ctudir \			// RUN: -analyzer-config ctu-dir=%t/ctudir \
	// RUN: -analyzer-config display-ctu-progress=true \			// RUN: -analyzer-config display-ctu-progress=true \
clang/test/Analysis/ctu-lookup-name-with-space.cpp

This file was added.

// RUN: rm -rf %t

// RUN: mkdir %t

// RUN: echo '41:c:@S@G@F@G#@Sa@F@operator void (*)(int)#1 %/t/importee.ast' >> %t/externalDefMap.txt

// RUN: echo '38:c:@S@G@F@G#@Sa@F@operator void (*)()#1 %/t/importee.ast' >> %t/externalDefMap.txt

// RUN: echo '14:c:@F@importee# %/t/importee.ast' >> %t/externalDefMap.txt

// RUN: %clang_cc1 -emit-pch %/S/Inputs/ctu-lookup-name-with-space.cpp -o %t/importee.ast

steakhalUnsubmitted

Not Done

Probably splitting this up into multiple lines would result in a more readable solution.

Something along these lines should work:

cat >%t/compile_commands.json <<EOL
line 1
line 2
...
EOL

OikawaKirieAuthorUnsubmitted

Not Done

The suggestion is great; however, I cannot find a way to write the RUN commands.
Could you please tell me how to write the commands in this way? It would also help me merge the test case into one file.

// RUN: cd %t

// RUN: %clang_cc1 -fsyntax-only -analyze \

// RUN: -analyzer-checker=core \

// RUN: -analyzer-config experimental-enable-naive-ctu-analysis=true \

// RUN: -analyzer-config ctu-dir=. \

// RUN: -analyzer-config display-ctu-progress=true \

OikawaKirieAuthorUnsubmitted

Done

// RUN: -verify %s

+ // REQUIRES: system-linux

void importee();

Adding this line here.

arichardsonUnsubmitted

Not Done

Disabling the test on non-Linux is not a good idea IMO since it means we lose coverage on other platforms. My guess is that you just need to specify an explicit triple in the clang invocations.

OikawaKirieAuthorUnsubmitted

Done

AFAIK, we cannot do that. If this test case is executed on different platforms, we cannot determine the triple ahead of time and specify it in the invocation list.

steakhalUnsubmitted

Not Done

If we were to pin the triple, then each platform would emit the correct AST dumps according to that platform, which is roughly cross-compilation.

// RUN: -verify %s 2>&1 | FileCheck %s

// CHECK: CTU loaded AST file

// FIXME: In this test case, we cannot use the on-demand-parsing approach to

// load the external TU.

// In the Darwin system, the target triple is determined by the driver,

// rather than using the default one like other systems. However, when

// using bare `clang -cc1`, the adjustment is not done, which cannot

steakhalUnsubmitted

Done

Why do you need two separate invocations? Couldn't you just merge these?
I've seen cases where -verify was used in conjunction with FileCheck.

OikawaKirieAuthorUnsubmitted

Done

I forgot the --allow-empty argument while writing this test case. I will merge them in an update.

// match the one loaded with on-demand-parsing (adjusted triple).

// We bypass this problem by loading AST files, whose target triple is

// also unadjusted when generated via `clang -cc1 -emit-pch`.

// Refer to: https://discourse.llvm.org/t/60762

// This is also the reason why the test case of D75665 (introducing

// the on-demand-parsing feature) is enabled only on Linux.

void importee();

void trigger() {

// Call an external function to trigger the parsing process of CTU index.

// Refer to file Inputs/ctu-lookup-name-with-space.cpp for more details.

importee(); // expected-no-diagnostics

}

clang/test/Analysis/func-mapping-test.cpp

	// RUN: %clang_extdef_map %s -- \| FileCheck --implicit-check-not "c:@y" --implicit-check-not "c:@z" %s			// RUN: %clang_extdef_map %s -- \| FileCheck --implicit-check-not "c:@y" --implicit-check-not "c:@z" %s

	int f(int) {			int f(int) {
	return 0;			return 0;
	}			}
	// CHECK-DAG: c:@F@f#I#			// CHECK-DAG: 9:c:@F@f#I#

	extern const int x = 5;			extern const int x = 5;
	// CHECK-DAG: c:@x			// CHECK-DAG: 4:c:@x

	// Non-const variables should not be collected.			// Non-const variables should not be collected.
	int y = 5;			int y = 5;

	// In C++, const implies internal linkage, so not collected.			// In C++, const implies internal linkage, so not collected.
	const int z = 5;			const int z = 5;

	struct S {			struct S {
	int a;			int a;
	};			};
	extern S const s = {.a = 2};			extern S const s = {.a = 2};
	// CHECK-DAG: c:@s			// CHECK-DAG: 4:c:@s

	struct SF {			struct SF {
	const int a;			const int a;
	};			};
	SF sf = {.a = 2};			SF sf = {.a = 2};
	// CHECK-DAG: c:@sf			// CHECK-DAG: 5:c:@sf

	struct SStatic {			struct SStatic {
	static const int a = 4;			static const int a = 4;
	};			};
	const int SStatic::a;			const int SStatic::a;
	// CHECK-DAG: c:@S@SStatic@a			// CHECK-DAG: 14:c:@S@SStatic@a

	extern int const arr[5] = { 0, 1 };			extern int const arr[5] = { 0, 1 };
	// CHECK-DAG: c:@arr			// CHECK-DAG: 6:c:@arr

	union U {			union U {
	const int a;			const int a;
	const unsigned int b;			const unsigned int b;
	};			};
	U u = {.a = 6};			U u = {.a = 6};
	// CHECK-DAG: c:@u			// CHECK-DAG: 4:c:@u

	// No USR can be generated for this.			// No USR can be generated for this.
	// Check for no crash in this case.			// Check for no crash in this case.
	static union {			static union {
	float uf;			float uf;
	const int ui;			const int ui;
	};			};

				void f(int (*)(char));
				void f(bool (*)(char));

				struct G {
				G() {
				f([](char) -> int { return 42; });
				// CHECK-DAG: 41:c:@S@G@F@G#@Sa@F@operator int (*)(char)#1
				f([](char) -> bool { return true; });
				// CHECK-DAG: 42:c:@S@G@F@G#@Sa@F@operator bool (*)(char)#1
				}
				};
steakhalUnsubmitted

Not Done

I think you could add your lambda stuff to this file. This is really the place for testing this. The test you created actually demonstrating the CTU issue is also valuable IMO, so you can leave it, but have a copy here.

clang/unittests/CrossTU/CrossTranslationUnitTest.cpp

ASSERT_FALSE(
llvm::sys::fs::createTemporaryFile("f_ast", "ast", ASTFD, ASTFileName));		llvm::sys::fs::createTemporaryFile("f_ast", "ast", ASTFD, ASTFileName));
llvm::ToolOutputFile ASTFile(ASTFileName, ASTFD);		llvm::ToolOutputFile ASTFile(ASTFileName, ASTFD);

int IndexFD;		int IndexFD;
llvm::SmallString<256> IndexFileName;		llvm::SmallString<256> IndexFileName;
ASSERT_FALSE(llvm::sys::fs::createTemporaryFile("index", "txt", IndexFD,		ASSERT_FALSE(llvm::sys::fs::createTemporaryFile("index", "txt", IndexFD,
IndexFileName));		IndexFileName));
llvm::ToolOutputFile IndexFile(IndexFileName, IndexFD);		llvm::ToolOutputFile IndexFile(IndexFileName, IndexFD);
IndexFile.os() << "c:@F@f#I# " << ASTFileName << "\n";		IndexFile.os() << "9:c:@F@f#I# " << ASTFileName << "\n";
IndexFile.os().flush();		IndexFile.os().flush();
EXPECT_TRUE(llvm::sys::fs::exists(IndexFileName));		EXPECT_TRUE(llvm::sys::fs::exists(IndexFileName));

StringRef SourceText = "int f(int) { return 0; }\n";		StringRef SourceText = "int f(int) { return 0; }\n";
// This file must exist since the saved ASTFile will reference it.		// This file must exist since the saved ASTFile will reference it.
int SourceFD;		int SourceFD;
llvm::SmallString<256> SourceFileName;		llvm::SmallString<256> SourceFileName;
ASSERT_FALSE(llvm::sys::fs::createTemporaryFile("input", "cpp", SourceFD,		ASSERT_FALSE(llvm::sys::fs::createTemporaryFile("input", "cpp", SourceFD,