In some cases system includes extractions is not enough, we also need target specific defines.
The problems appears when clang default target is not the same as toolchain's one (GCC cross-compiler, MinGW on Windows).
After this patch query-driver also extracts target and adds --target=<extracted target> compile option.
Details
- Reviewers
sammccall kadircet - Commits
- rG1ca174b6420a: [clangd][query-driver] Extract target
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Thanks, this seems really useful!
clang-tools-extra/clangd/QueryDriverDatabase.cpp | ||
---|---|---|
62 | define a little struct holding these two, for clarity? | |
92 | This part seems pretty confusing :-) If we want to be strict though, it might be time to rewrite to be more stateful e.g. a loop consuming chunks from an ArrayRef<StringRef>, or a state machine, or something. | |
131 | this changes the error behavior: intentional? | |
251 | clang -target foo test.cc seems to be a hard error in the driver if the target is unknown. so this could regress some scenarios. Can we mitigate that? | |
343 | nit: while here, add std::move? |
clang-tools-extra/clangd/QueryDriverDatabase.cpp | ||
---|---|---|
251 | what if target already exists in Cmd? also it would be nice to use --target=X format to be consistent with target inference from invocation name as in https://github.com/llvm/llvm-project/blob/master/clang/lib/Tooling/Tooling.cpp#L278. |
Add structure to hold system includes and target.
Use state machine to go though lines of the driver output.
Handle invalid target
Use '--target=X' instead of '-target X'
clang-tools-extra/clangd/QueryDriverDatabase.cpp | ||
---|---|---|
134 | Unsure about this. | |
251 | I failed to find options to process in case of invalid target, so added target validation. | |
251 | We could specify several --target options, the last one will be used. But I am not sure should we override existing or not. |
clang-tools-extra/clangd/QueryDriverDatabase.cpp | ||
---|---|---|
67 | i think you can just do TargetRegistry::lookupTarget | |
230 | nit: I would fold these two into a single log statement. | |
251 |
Having multiple targets sounds confusing. I would prefer keeping a target specifically mentioned by the user but it is also possible for the target to be inferred by clang (and that might be wrong). This is similar to our stand against predefined macros though, i think we should fix clang's inference logic if it has any false positives. So i am slightly leaning towards not overriding the target if it exists. | |
337 | nit: if(auto Info = ...) setTarget(addSystemIncludes(...), ...); return std::move(Cmd); |
Thanks! just nits, apart from the "-target already explicitly set" issue Kadir raised.
clang-tools-extra/clangd/QueryDriverDatabase.cpp | ||
---|---|---|
99 | I'm not sure these early returns are worth the complexity | |
102 | also not sure whether initial vs includesextracted is worth distinguishing - this doesn't scale well if we add a third thing we're tracking (do we need a state for each combination that we could have seen already?) Maybe just a bool for SeenIncludes or something to detect duplicate/missing sections would be simpler | |
106 | this mostly echoes the code: "Only detect targets that clang understands"? | |
108 | I'd prefix with "System include extraction: " and bump up to elog - this is fairly likely to break things (and we'll only log it ~once thanks to caching) | |
112 | also add a prefix here so it's clear what the context is in the log | |
227 | we want to log this even if the set is empty, so the reader can tell the extraction worked | |
231 | The target extractor isn't a separate component from the system includes extractor, so the log message seems a bit misleading. | |
251 | I think the question of "target already explicitly set" should probably be answered. |
Sorry, code reviews are racy :-)
clang-tools-extra/clangd/QueryDriverDatabase.cpp | ||
---|---|---|
134 | Yup, at least for this patch (this is the existing behavior). | |
251 | I think we should definitely not override the target if it already exists. The scenario is: compile_commands.json has /my/driver --target=bar foo.cc. So foo is the default target, but it's being overridden on the command-line as part of the build. Now if we add the extracted target to the end of the compile command, to produce /my/driver --target=bar foo.cc --target=foo, we're effectively reversing the precedence: now the default target is used even if the compile command overrode it. |
clang-tools-extra/clangd/QueryDriverDatabase.cpp | ||
---|---|---|
67 | Seems we can't. TargetRegistry::lookupTarget returns nullptr for arm-linux-gnueabihf with error message: No available targets are compatible with triple "arm-linux-gnueabihf" | |
337 | I used your advice except std::move(Cmd), seems we do not need std::move here |
clang-tools-extra/clangd/QueryDriverDatabase.cpp | ||
---|---|---|
349 | ofc we don't need it. but the thing is we are returning an Optional<tooling::Command> hence the return statement needs to invoke a constructor for Optional and without std::move it would invoke a copy constructor rather than a move-based one. And even though rest of the calculations are cached per driver&language name, this copy is going to happen for every translation unit. So in theory it would be nice to prevent that. |
clang-tools-extra/clangd/QueryDriverDatabase.cpp | ||
---|---|---|
349 | oh actually never mind, Cmd itself is also an Optional<tooling::Command>, i thought it was a naked tooling::Command |
Still LG, thanks!
clang-tools-extra/clangd/QueryDriverDatabase.cpp | ||
---|---|---|
349 | It's moot here, but this isn't the case anymore, return x can call a converting move constructor: http://www.open-std.org/jtc1/sc22/wg21/docs/cwg_defects.html#1579 This is part of C++14, and so we should be able to rely on it in LLVM. It wasn't true in the original C++11 spec, but since it was a defect report, reasonably-modern compilers allow the optimization even in C++11 mode. Buggy compilers certainly exist, but I don't think we need to work around them for small performance issues. (If the code fails to compile on old-gcc without std::move, that's another story...) |
define a little struct holding these two, for clarity?