Most of CGExprConstant.cpp is using the CharUnits abstraction
and is using getCharWidth() (directly of indirectly) when converting
between size of a char and size in bits. This patch is making that
abstraction more consistent by adding CharTy to the CodeGenTypeCache
(honoring getCharWidth() when mapping from char to LLVM IR types,
instead of using Int8Ty directly).
Details
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
As far as i recall, all RFC's about LLVM support for non-8-bit char targets didn't end in favor of the proposal
I'd say that until there's general consensus/buy-in (including testing path), cleaning stuff up will only confuse things.
CC @jfb
Well, I haven't even added reviewers. But if it hurts so much maybe I should abandon it ;-)
But from my point of view this patch isn't really adding non-8-bit char support. But there is an abstraction in clang (such as ASTContext) with getCharWidth(), toCharUnitsFromBits() etc. And it is confusing when things don't add up (such as calculating the number of chars using toCharUnitsFromBits(), indirectly using getCharWidth(), and then later ignoring it and assuming that the char width is 8. I don't see how being more consistent is more confusing.
I don't mind generally abstracting Clang to handle this better.
clang/lib/CodeGen/CodeGenModule.h | ||
---|---|---|
1229 | Please just add a CharTy to CodeGenTypeCache. |
@lebedev.ri : I think I need your blessing as well on this, considering your earlier concerns. Is it still just confusing? (I doubt that we want/can replace all uses of getCharWidth/getIntWidth etc in clang with integers)
I don't particularly want/like to straight-up derail this,
but my *personal* opinion still hasn't changed, and it is not unsubstantiated,
see https://lists.llvm.org/pipermail/llvm-dev/2019-May/132080.html,
https://lists.llvm.org/pipermail/llvm-dev/2019-October/136115.html, etc,
I think the fact that last *two* recent-ish RFC's on this didn't succeed
suggests that a third one might be in order, perhaps with better arguments
this time, or maybe the previous opponents will just not block this..
TBN, i don't dislike the feature per-se, at least without seeing the full impact of the cleanup.
Clang has generally made a point of trying to be as neutral about CHAR_BITS as it can be, even though LLVM has not. As the code owner of this code, I view this as a fix consistent with that, and I consider it acceptable to land.
Of course, if you are pursuing these patches out of an expectation that you'll eventually get Clang to support compiling targets with non-8-bit-addressing, you need to understand that (at least for most cases) that depends on LLVM supporting such targets, and so far the LLVM community has been pretty strongly opposed to that idea. But Clang has *internally* already accepted the abstraction costs of CharUnits, and we should to be as consistent as possible about that in our own code.
clang-format not found in user's PATH; not linting file.