This is the funcref counterpart to 104bad5a. We introduce a new attribute
that marks a function pointer as a funcref. It also implements builtin
__builtin_wasm_ref_null_func(), that returns a null funcref value.
Details
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Rebase on top of main. Add missing code in Sema* to ensure the new funcref builtin is checked for.
Thanks to @asb for helping fix this.
Update the patch as there were some missing calls from SemaChecking.
Thanks to @asb for spotting it.
It would be great to get this landed. Can someone pls take a look at it? It's quite independent from the remainder of the work and self-contained.
maybe @tlively ping?
It would be good to add tests for more error conditions like in the externref patch and also additional errors like trying to apply __funcref to types that aren't function pointers.
clang/include/clang/Basic/Attr.td | ||
---|---|---|
4140 | It would be good to document this! | |
clang/lib/Sema/SemaChecking.cpp | ||
4729 | Are the extra parentheses meaningful here? | |
6761–6764 | How do these parts correspond to the original source code? |
Same question here about an RFC for adding this new type to the type system and a design document for how the type behaves as in https://reviews.llvm.org/D122215.
clang/include/clang/Basic/Attr.td | ||
---|---|---|
4140 | It's a requirement that this be documented. Also, this should probably have a Subjects line that expects a function of some sort? (It doesn't impact the semantics of anything, but it self-documents the expectations for people reading Attr.td.) | |
clang/include/clang/Basic/DiagnosticSemaKinds.td | ||
7389 |
| |
clang/include/clang/Basic/TokenKinds.def | ||
682 | Why is this a keyword in all language modes? | |
clang/include/clang/Parse/Parser.h | ||
2932 | ||
clang/include/clang/Sema/Sema.h | ||
13499–13500 | Formatting is off here. | |
13570 | I think there's a verb missing from this function identifier -- also, I don't think we need to continue the terrible trend of prepending Sema onto the function name which lives in the Sema object. | |
clang/lib/AST/ASTContext.cpp | ||
954 ↗ | (On Diff #446726) | Where did this value come from? |
clang/lib/Parse/ParseDecl.cpp | ||
843–844 | ||
849–852 | ||
5919 | Why that specific ordering as opposed to AR_VendorAttributesParsed? | |
clang/lib/Sema/SemaChecking.cpp | ||
6766–6770 | Why would we need to do this?? And what do we do when there is no FunctionDecl to get back to (because it's a call expression through a function pointer, for example)? | |
6773 | Why are you guaranteed this function returns void? | |
clang/lib/Sema/SemaType.cpp | ||
7359 | ||
7372 | This seems like it's not used. | |
7392–7393 | What happens when the user's function already has an explicitly specified address space? |
clang/lib/AST/ASTContext.cpp | ||
---|---|---|
954 ↗ | (On Diff #446726) | Unsure what you mean here. This is the address space number we also use on the LLVM side. |
clang/lib/AST/ASTContext.cpp | ||
---|---|---|
954 ↗ | (On Diff #446726) |
That's exactly what I was looking for -- I wanted to make sure this agreed with the LLVM side of things and any public documentation about address spaces for WASM. |
clang/lib/AST/ASTContext.cpp | ||
---|---|---|
954 ↗ | (On Diff #446726) |
Sure! Marking as done. |
clang/include/clang/Basic/DiagnosticSemaKinds.td | ||
---|---|---|
7389 |
Yes, I meant __funcref. Thanks, will add test coverage for the diagnostic. | |
clang/include/clang/Basic/TokenKinds.def | ||
682 | I might have misread the docs but this keyword should be available for both C and C++. Maybe I want KEYC99 | KEYCXX ? | |
clang/include/clang/Sema/Sema.h | ||
13499–13500 | Don't understand why. It's the same as CheckRISCVBuiltinFunctionCall ? |
Ready for review.
RFC: https://discourse.llvm.org/t/rfc-webassembly-reference-types-in-clang/66939
@aaron.ballman Hi Aaron, since you requested an RFC for this. I wonder if you have any comments for this or the other related patches: D122215, D139010
I appreciate your patience with the extreme delay on getting you feedback after I asked you to post the RFC; I wanted to give the RFC a chance to bake and then ran into the holiday season as a result. Sorry for that!
Adding @erichkeane as new attributes code owner as well, in case he's got additional thoughts.
clang/include/clang/Basic/TokenKinds.def | ||
---|---|---|
682 | I was thinking this keyword would only work for a web assembly target, so we'd likely want to add KEYWEBASM or something to the list like we have for KEYALTIVEC. But now I'm second-guessing that instinct and am wondering what behavior we want here.
My original thinking was that we wanted #3, but on reflection both #1 and #2 seem reasonable to me. Do you have a preference? I think I prefer either 2 or 3 over 1 because this involves the type system (errors are usually a better approach in that case). | |
clang/lib/AST/TypePrinter.cpp | ||
1652–1653 | I'm not certain that the assert adds much here given that the predicate tests exactly the same thing, perhaps remove it (and drop the curly braces)? | |
clang/lib/Basic/Targets/WebAssembly.h | ||
44–45 | I'm not super familiar with the address space maps, but this sets of my spidey senses. All of the other address space maps end with: ptr64, hlsl_groupshared, wasm_funcref but this one is ending with: ptr64, wasm_externref, wasm_funcref which makes me think something's amiss here. Thoughts? | |
clang/lib/Parse/ParseDecl.cpp | ||
844 | I think we should assert this condition; we already know what the token is before we call this function anyway. | |
5919–5922 | I don't think we need to care about attribute requirements because this is a keyword attribute (this feature is used to prohibit attributes in certain grammars and I don't think that applies here). | |
clang/lib/Sema/SemaChecking.cpp | ||
6775–6781 | These comments no longer seem to match the code -- nothing creates a FunctionDecl here. | |
6782 | I'm still wondering why this code is needed and what it's trying to do. It worries me that we're changing the type of the call expression like this. | |
clang/lib/Sema/SemaDecl.cpp | ||
14838 | ||
clang/lib/Sema/SemaType.cpp | ||
7369 | We're missing Sema test coverage for all the various diagnostics you've added; can you add that coverage? | |
7372 | Still wondering about this | |
7392–7393 | Still wondering about this as well. |
clang/include/clang/Basic/Attr.td | ||
---|---|---|
4141 | Note that it seems this is likely not going to work correctly on template type aliases! You probably want to make sure that is acceptable to you. | |
clang/include/clang/Basic/AttrDocs.td | ||
6933 | This says 'function pointer declaration', but you have it's subject as Typedef? | |
clang/include/clang/Basic/TokenKinds.def | ||
682 | I don't think #1 is acceptable for the keyword, we shouldn't ignore keywords like we do attributes. Doing an ignored warning in SemaDeclAttr.td for the ATTRIBUTE is fine, but keyword form needs to be an error. My preference is #2, an error if we encounter it during parsing and we're not in WebAsm target. | |
clang/lib/Basic/Targets/DirectX.h | ||
45 | Also apply to the rest of the targets. Also, why is there only 1 added here? I thought we had 2 address spaces for Wasm? | |
clang/lib/Basic/Targets/WebAssembly.h | ||
44–45 | That DEFINITELY looks suspicious to me as well, I noted above too, but either we need to specify we're re-using the hlsl_groupshared for wasm_externref address space (at which point, i wonder: why not re-use a different address space?), or correct this. | |
clang/lib/Parse/ParseDecl.cpp | ||
847 | ||
clang/lib/Sema/SemaChecking.cpp | ||
6767 | Does this diagnose? Should we be emitting an error here? | |
6779 | All those comments don't really seem relevant here, and are not really correct. The only thing SemaChecking should do is either modify its arguments (though you don't have any), or set its return type. | |
6782 | This appears to be a 'custom type checking' builtin, where the code in Sema has to check the arguments (rather than the automatic behavior working), and set its return type to the 'correct' one. It appears that this accepts no arguments, then changes the result of the builtin. So this doesn't seem particularly odd, other than the strange comments above. | |
clang/lib/Sema/SemaType.cpp | ||
7358 | This name seems misleading... should you be desugaring 'Type' to store it in 'Desugared'? Also, I see that this value is never used, so why do the copy init? ALSO, why is it called Desugared, when it never seems to get a Desugared type? |
clang/lib/Sema/SemaChecking.cpp | ||
---|---|---|
6782 | Oh yeah that's right, that is how the custom checking code works, thank you for the reminder! |
@aaron.ballman Added support for externref mangling in Microsoft. This follows the pattern implemented by OpenCL types.
clang/lib/AST/MicrosoftMangle.cpp | ||
---|---|---|
2481 ↗ | (On Diff #488645) | I'm not sure what the PA is supposed to mean? I see some uses of mangleArtificialTagType don't do anything with it, so I fear it is something OCL specific. |
clang/include/clang/Basic/Attr.td | ||
---|---|---|
4141 | Documentation says about Subject: // The things to which an attribute can appertain SubjectList Subjects; Given this can be attached to function pointer types, is the subject then just Type ? | |
clang/include/clang/Basic/AttrDocs.td | ||
6933 | That's a mistake due to how things worked in an earlier revision. Thanks for spotting. | |
clang/include/clang/Basic/TokenKinds.def | ||
682 | Thanks for the comments, I have now updated this with an implementation of option 2 and added a diagnostic test. | |
clang/lib/Basic/Targets/DirectX.h | ||
45 | Externref is a type in clang, doesn't need its own address space in clang, only when it's lowered to a opaque type in llvm it gets its own address space. |
clang/include/clang/Basic/Attr.td | ||
---|---|---|
4141 | IIRC, the list of subjects are the Decls and Stmt types that are possibly allowed. I don't thnk it would be just 'Type' there. As you have it, it is only allowed on TypedefNameDecl. See the SubsetSubject's at the top of this file for some examples on how you could constrain a special matcher to only work on function pointer decls. | |
clang/lib/Basic/Targets/DirectX.h | ||
45 | Please fix the comment in my suggestion. Also, your explanation doesn't make it clear to me why you added TWO entries to this list for other targets, but only 1 here. |
clang/include/clang/Basic/Attr.td | ||
---|---|---|
4141 | Good point @erichkeane . What do you think of FunctionPointer implementation here? |
clang/include/clang/Basic/Attr.td | ||
---|---|---|
4141 | That seems acceptable to me. |
Some more comments, but there are unaddressed comments from earlier reviews as well.
clang/include/clang/Basic/Attr.td | ||
---|---|---|
4141 | That makes sense to me as well, but FWIW, type attributes don't currently have much of any tablegen automation (unlike decl and stmt attributes), so the subject list isn't strictly necessary here. But it's still appreciated because 1) it's documentation, and 2) we want to tablegen more and this helps us with that goal. | |
clang/include/clang/Basic/DiagnosticSemaKinds.td | ||
11806 | ||
clang/lib/AST/DeclBase.cpp | ||
1060–1063 | ||
clang/lib/Parse/ParseDecl.cpp | ||
854–855 | Be sure to run clang-format over the patch to fix this sort of thing. |
clang/include/clang/Basic/Attr.td | ||
---|---|---|
4141 | @erichkeane Yes, you're right. Something like using IntIntFuncref = int(*)(int) __funcref; doesn't work. Why is this not accepted? Is it due to the Subjects listed? |
clang/lib/Sema/SemaChecking.cpp | ||
---|---|---|
6761–6764 | If I understand your question correctly, this is just the semantic checks for each of the builtins. | |
6773 | I think this question is not relevant anymore given the current code, right? The wasm ref null builtin will always return the null funcref. | |
6775–6781 | Right - fixed. | |
6779 | Right, these comments can be removed. I fixed these. | |
clang/lib/Sema/SemaType.cpp | ||
7358 | Bad naming due to earlier code. I have simplified this code now. |
clang/lib/Sema/SemaType.cpp | ||
---|---|---|
7372 | That's correct - removed. |
@aaron.ballman I have finished addressing all the concerns on this patch. Do you have any further comments?
clang/lib/Sema/SemaType.cpp | ||
---|---|---|
7392–7393 | We'll just be overriding the address space in this case. Although, I am not sure something like this can happen in Wasm given there's only one linear memory. |
Generally LGTM with a few comments sprinkled around.
clang/include/clang/Basic/Attr.td | ||
---|---|---|
4141 | Not due to the subjects -- keyword attributes require manual parsing and type attributes require manual processing (we don't tablegen as much as I'd like for type attributes). But that code should be accepted (you should add a test case for it as well). | |
clang/include/clang/Basic/AttrDocs.td | ||
6930 | DocCatType, right? | |
6933 | "Function pointer declaration" reads a bit odd to me, I think you mean "function pointer type", right? e.g., | |
clang/lib/Sema/SemaType.cpp | ||
7350 | Just to avoid confusing editors that can't tell Type is a variable and not clang::Type the type. | |
clang/test/Sema/wasm-refs.c | ||
50 ↗ | (On Diff #497632) | Should we also test two-level application of the qualifier, as in? using uh_oh = funcref_t __funcref; |
Thanks @aaron.ballman . I think I have now address all outstanding comments.
clang/include/clang/Basic/Attr.td | ||
---|---|---|
4141 | Seems to be fine now. Test added. | |
clang/include/clang/Basic/AttrDocs.td | ||
6930 | Yes. | |
clang/test/Sema/wasm-refs.c | ||
50 ↗ | (On Diff #497632) | Double qualification shouldn't be an issue - adding this to SemaCXX/wasm-funcref.cpp. |
First, we've been dealing with the 16.0 release process/regressions, which unfortunately got us pretty far behind on reviews. Sorry about that!
I have only 1 concern here, otherwise, this LGTM. I'll leave it to Aaron to make the final 'value judgement' of if we want to keep this, so I'll leave it to him to Approve.
clang/lib/AST/DeclBase.cpp | ||
---|---|---|
1056 | probably want to canonicalize here (and above). Else typedefs-to-typedefs/etc might getcha. |
LGTM aside from some small nits that you can fix when landing. Thank you for your patience on this review!
clang/lib/AST/DeclBase.cpp | ||
---|---|---|
1056 | Yeah, I'd do that below on the return, as in: return Ty.getCanonicalType()->isFunctionPointerType(); | |
clang/test/SemaCXX/wasm-funcref.cpp | ||
14 ↗ | (On Diff #501081) | Please add a newline to the end of the file. |
clang/lib/Format/FormatToken.h | ||
---|---|---|
606 | Is there a unit test in clang format for this change? |
It would be good to document this!