This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/
-
include/clang/Basic/
-
clang/
-
Basic/
22
Attr.td
6/38
AttrDocs.td
6/9
DiagnosticSemaKinds.td
-
lib/Sema/
-
Sema/
4
SemaDeclAttr.cpp
-
test/SemaSYCL/
-
SemaSYCL/
-
kernel-attribute-on-non-sycl.cpp
5/6
kernel-attribute.cpp

Differential D60455

[SYCL] Add sycl_kernel attribute for accelerated code outlining
ClosedPublic

Authored by bader on Apr 9 2019, 5:36 AM.

Download Raw Diff

Details

Reviewers

jlebar
keryell
Naghasan
ABataev
aaron.ballman
rjmccall
rsmith
Fznamznon
arphaman
Anastasia

Commits

rGc094e7dc4b3f: [SYCL] Add sycl_kernel attribute for accelerated code outlining

Summary

SYCL is single source offload programming model relying on compiler to
separate device code (i.e. offloaded to an accelerator) from the code
executed on the host.

Here is code example of the SYCL program to demonstrate compiler
outlining work:

int foo(int x) { return ++x; }
int bar(int x) { throw std::exception("CPU code only!"); }
...
using namespace cl::sycl;
queue Q;
buffer<int, 1> a(range<1>{1024});
Q.submit([&](handler& cgh) {
  auto A = a.get_access<access::mode::write>(cgh);
  cgh.parallel_for<init_a>(range<1>{1024}, [=](id<1> index) {
    A[index] = index[0] * 2 + index[1] + foo(42);
  });
}
...

SYCL device compiler must compile lambda expression passed to
cl::sycl::handler::parallel_for method and function foo called from this
lambda expression for an "accelerator". SYCL device compiler also must
ignore bar function as it's not required for offloaded code execution.

This patch adds the sycl_kernel attribute, which is used to mark code
passed to cl::sycl::handler::parallel_for as "accelerated code".

Attribute must be applied to function templates which parameters include
at least "kernel name" and "kernel function object". These parameters
will be used to establish an ABI between the host application and
offloaded part.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

There are a very large number of changes, so older changes are hidden. Show Older Changes

bader added inline comments.May 27 2019, 7:16 AM

clang/lib/Sema/SemaSYCL.cpp
23 ↗	(On Diff #200658)	I think this is also preventing traditional linking of translation units. Could you elaborate more on this topic, please? What do you mean by "traditional linking of translation units" and what exactly "is preventing" it? Do you compare with the linking of regular C++ code (i.e. which do not split into host and device code)? If so, SYCL is different from this model and more similar to CUDA/OpenMP models, which also skip "linking" of irrelevant part (e.g. host code is not linked by the device compiler). Mariya added Justin (@jlebar) and Alexey (@ABataev), who work on single-source programming models to make them aware and provide feedback if any.

In D60455#1518178, @bader wrote:

In D60455#1513499, @Anastasia wrote:

Design question: since you are not aware what functions are to be run on a device while parsing them, at what point do you plan to diagnose the invalid behavior from the standard C++ i.e. using function pointers in kernel code is likely to cause issues?

We are going to use DeviceDiagBuilder and related infrastructure implemented in Clang to diagnose device side code errors in OpenMP/CUDA modes.
More details are in the comments here:
https://clang.llvm.org/doxygen/classclang_1_1Sema_1_1DeviceDiagBuilder.html#details

Just a thought, if you parse host code first and provide the device outlining information to the device compilation phase would you then be able to reuse more parsing functionality from OpenCL?

Also do you need to outline the data structures too? For example classes used on device are not allowed to have virtual function.

Yes. This restriction is already implemented in our code base on GitHub.

Cool, is it implemented in SemaSYCL.cpp too?

clang/include/clang/Basic/Attr.td
1076	Sema part is mostly not relevant for SYCL mode because SYCL API do not allow the cases currently detected by clang (e.g. constant address space variable declaration in OpenCL kernel scope, naming OpenCL kernel main, etc). Would you mind pointing me to your impl of those? A couple of check that might be useful are: void return type for kernel functions kernel can't be static function and some of the checks are harmful for proposed implementation (e.g. kernels can't be template functions). @Anastasia, @keryell, @agozillon and @aaron.ballman need to agree if this sufficient to justify the re-use of OpenCL kernel attribute. Let me know if you need any additional information to make a decision. Ok, if from ~20 occurrences in the source code you will be able to reuse only just 2 it doesn't seem like it's worth to share `__kernel` attribute.
clang/lib/Sema/SemaSYCL.cpp
23 ↗	(On Diff #200658)	Yes indeed, I mean linking of modules in C/C++ even though it doesn't necessarily mean linking of object files. So you don't plan to support `SYCL_EXTERNAL` in clang? In CUDA the functions executed on device are annotated manually using `__device__` hence separate translation units can specify external device function... although I don't know if CUDA implementation in clang support this. I guess OpenMP is allowed to fall back to run on host?

Fznamznon added inline comments.May 28 2019, 4:35 AM

clang/test/SemaSYCL/device-attributes-on-non-sycl.cpp
1 ↗	(On Diff #200658)	Sorry for confusion. The C++ features used in SYCL are a subset of the C++11 standard features. I will add -std=c++11 key to run line to avoid such confusion in future.

Applied comments from @Anastasia

Added documentation for sycl_kernel function
Added comments to Sema.h
Added -std=c++11 to test run lines

Harbormaster completed remote builds in B32558: Diff 201641.May 28 2019, 5:08 AM

Anastasia added inline comments.May 30 2019, 10:53 AM

clang/include/clang/Basic/Attr.td
1076	Undocumented -> SYCLKernelDocs
clang/include/clang/Basic/AttrDocs.td
269	The example doesn't demonstrate the use of the attribute. It explains how it is used by the toolchain only! May be @aaron.ballman can help here as I am not sure what the format should be.
clang/lib/Parse/ParseAST.cpp
171 ↗	(On Diff #201641)	Do you also need to prevent generation of non-device functions somehow?
clang/lib/Sema/SemaSYCL.cpp
23 ↗	(On Diff #200658)	Ping! I would suggest to document it a bit more including any current limitations/assumption that you can mark under FIXME i.e. does your code handle lambdas yet, what if lambdas are used in function parameters, etc...
clang/lib/Sema/SemaTemplateInstantiateDecl.cpp
5520 ↗	(On Diff #201641)	May be this should go into a helper function as it seems to be now a bigger chunk of code that is repeated? Although, I am not very familiar with this code. You can try to get someone to review who has contributed to this more recently.
clang/test/CodeGenSYCL/device-functions.cpp
24 ↗	(On Diff #201641)	I can't see where the SPIR calling convention is currently set for SYCL?
clang/test/SemaSYCL/device-attributes-on-non-sycl.cpp
3 ↗	(On Diff #201641)	I don't think this comment is necessary.

Fznamznon added inline comments.May 31 2019, 5:52 AM

clang/include/clang/Basic/Attr.td
1076	Oh, Thank you for that!
clang/lib/Parse/ParseAST.cpp
171 ↗	(On Diff #201641)	I think It's already prevented by change to CodeGenModule.cpp in this patch. CodeGen just ignores declarations without SYCL device attribute now.
clang/lib/Sema/SemaSYCL.cpp
23 ↗	(On Diff #200658)	Oh, sorry, I missed this comment when I updated patch last time. Could you please advise in which form I can document it?
clang/lib/Sema/SemaTemplateInstantiateDecl.cpp
5520 ↗	(On Diff #201641)	I think this chunk of code seems big because of big repeated comment.
clang/test/CodeGenSYCL/device-functions.cpp
24 ↗	(On Diff #201641)	If I understand correct it's set automatically on AST level because we use SPIR-based triple for device code. Only in case of C++ methods clang doesn't set SPIR calling convention. We did a modification in our codebase to get SPIR calling convention for C++ methods too (available here )

Anastasia added inline comments.Jun 3 2019, 3:00 AM

clang/test/CodeGenSYCL/device-functions.cpp
24 ↗	(On Diff #201641)	Ok and what happens if some other target is used - not SPIR?

Fznamznon added inline comments.Jun 3 2019, 3:28 AM

clang/test/CodeGenSYCL/device-functions.cpp
24 ↗	(On Diff #201641)	There will be no SPIR calling convention for device functions.

Anastasia added inline comments.Jun 3 2019, 7:54 AM

clang/test/CodeGenSYCL/device-functions.cpp
24 ↗	(On Diff #201641)	Just FYI at some point we generalized SPIR calling convention to be used for kernels irrespective from target by default (see `TargetCodeGenInfo::getOpenCLKernelCallingConv`). Not sure if it might make sense to do for SYCL device functions too. I am not saying it belongs to this patch though.

Applied comments from @Anastasia

Added link to documentation for sycl_device attribute
Removed redundant comment from test

@Anastasia, do you have additional comments?

Harbormaster completed remote builds in B33126: Diff 203785.Jun 10 2019, 2:25 AM

@aaron.ballman , please let me know if you have additional comments/suggestions. If not, could you please accept this revision?

bader added reviewers: rjmccall, rsmith.Jun 11 2019, 9:17 AM

Ping.

Most of the comments are about minor nits like grammar and coding conventions, but I did have some questions regarding what kinds of functions the sycl_kernel attribute gets applied to. Also, I'd like to see some additional tests that demonstrate the sycl device attribute is being implicitly created on the proper declarations as expected (can probably do that using -ast-dump and checking to see if the right functions have the attribute attached).

clang/include/clang/Basic/AttrDocs.td
259	is SYCL "kernel function" -> is a SYCL "kernel function"
260	SYCL -> A SYCL
261	Kernel is a -> A kernel is a
263–264	This doesn't really demonstrate the need for the attribute -- the attribute is never shown in the code example. I'd prefer an example that shows when and how a user would write this attribute.
278	called SYLC -> called a SYLC
280	use sycl_kernel -> use the sycl_kernel
281	as SYCL -> as a SYCL Compiler is supposed to -> The compiler will
284	In this code example compiler is supposed to add "foo" function -> In this code example, the compiler will add the "foo" function
clang/include/clang/Sema/Sema.h
11182 ↗	(On Diff #203785)	Function -> function
11183 ↗	(On Diff #203785)	In SYCL, when we generate device code, we don't know
11184 ↗	(On Diff #203785)	we emit sycl kernels, so we add device
11189 ↗	(On Diff #203785)	adds the function declaration
11190 ↗	(On Diff #203785)	Should be named `addSyclDeviceFunc()` per coding standards. Similar for the other new functions.
11194 ↗	(On Diff #203785)	Don't repeat the function name in the comments, please. Also, rather than returning a concrete `SmallVector<>`, I think it would be more natural to return a `SmallVectorImpl` so that callers don't have to contend with the explicit size. There should also be a `const` overload for this function.
11197 ↗	(On Diff #203785)	Constructs a SYCL kernel that is compatible with OpenCL from the SYCL "kernel
11200–11201 ↗	(On Diff #203785)	Marks all functions accessible from SYCL kernels with the SYCL device attribute
clang/lib/CodeGen/CodeGenModule.cpp
2410 ↗	(On Diff #203785)	with the SYCL device attribute
2412–2415 ↗	(On Diff #203785)	These `if` statements can be combined.
2533 ↗	(On Diff #203785)	Missing a full stop at the end of the comment.
clang/lib/Sema/SemaSYCL.cpp
14 ↗	(On Diff #203785)	This include doesn't seem to be necessary?
23 ↗	(On Diff #203785)	`e` does not use our usual naming conventions.
41 ↗	(On Diff #203785)	Spurious whitespace can be removed.
44 ↗	(On Diff #203785)	Elide braces.
48 ↗	(On Diff #203785)	Don't use `auto` as the type is not spelled out in the initialization.
52 ↗	(On Diff #203785)	Elide braces.
68 ↗	(On Diff #203785)	with the SYCL device attribute
70 ↗	(On Diff #203785)	`elt` -> `Elt` per naming conventions
71 ↗	(On Diff #203785)	`auto *` since the type is spelled out in the initialization.
73 ↗	(On Diff #203785)	Elide braces
clang/lib/Sema/SemaTemplateInstantiateDecl.cpp
5523 ↗	(On Diff #203785)	for the SYCL kernel attribute
5525 ↗	(On Diff #203785)	Elide braces
5537 ↗	(On Diff #203785)	for the SYCL kernel attribute
5539 ↗	(On Diff #203785)	Elide braces
clang/test/SemaSYCL/device-attributes-on-non-sycl.cpp
4 ↗	(On Diff #203785)	`#ifndef` ?
11 ↗	(On Diff #203785)	I'd prefer to spell this with `__attribute__`, same in the other test
clang/test/SemaSYCL/device-attributes.cpp
3 ↗	(On Diff #203785)	I'd like to see some more tests covering less obvious scenarios. Can I add this attribute to a lambda? What about a member function? How does it work with virtual functions? That sort of thing.

Fznamznon added inline comments.Jun 18 2019, 8:01 AM

clang/test/SemaSYCL/device-attributes.cpp
3 ↗	(On Diff #203785)	Actually there is no restrictions for adding this attribute to any function to outline device code so I just checked the simplest variant. But I'm working on new patch which will put some requirements on function which is marked with `sycl_kernel` attribute. This new patch will add generation of OpenCL kernel from function marked with `sycl_kernel` attribute. The main idea of this approach is described in this document (in this document generated kernel is called "kernel wrapper"). And to be able to generate OpenCL kernel using function marked with `sycl_kernel` attribute we put some requirements on this function, for example it must be a template function. You can find these requirements and example of proper function which can be marked with `sycl_kernel` in this comment .

aaron.ballman added inline comments.Jun 18 2019, 3:15 PM

clang/test/SemaSYCL/device-attributes.cpp

3 ↗

(On Diff #203785)

Actually there is no restrictions for adding this attribute to any function to outline device code so I just checked the simplest variant.

So there are no concerns about code like:

struct Base {
  __attribute__((sycl_kernel)) virtual void foo();
  virtual void bar();
};

struct Derived : Base {
  void foo() override;
  __attribute__((sycl_kernel)) void bar() override;
};

void f(Base *B, Derived *D) {
  // Will all of these "do the right thing"?
  B->foo();
  B->bar();

  D->foo();
  D->bar();
}

Appled part of comments from @aaron.ballman:

Fixed grammar and code style in all places except sycl_kernel docs
Added a lit test which checks that sycl_device attribute implicitly added to proper declarations

Harbormaster completed remote builds in B33642: Diff 205663.Jun 19 2019, 1:42 PM

Fznamznon added inline comments.Jun 19 2019, 1:45 PM

clang/include/clang/Basic/AttrDocs.td
263–264	I see. I will update documentation in the next version.

bader added inline comments.Jun 19 2019, 2:00 PM

clang/test/SemaSYCL/device-attributes.cpp
3 ↗	(On Diff #203785)	Actually there is no restrictions for adding this attribute to any function to outline device code so I just checked the simplest variant. But I'm working on new patch which will put some requirements on function which is marked with sycl_kernel attribute. @aaron.ballman, sorry for confusing. The usage scenarios should have been articulated more accurately. We have only four uses of this attribute in our implementation: https://github.com/intel/llvm/blob/sycl/sycl/include/CL/sycl/handler.hpp#L538 (lines 538-605). All four uses are applied to member functions of `cl::sycl::handler` class and all of them have similar prototype (which is mentioned by Mariya in the previous comment: namespace cl { namespace sycl { class handler { template <typename KernelName, typename KernelType/, .../> __attribute__((sycl_kernel)) void sycl_kernel_function(KernelType KernelFuncObj) { KernelFuncObj(); } }; }} Here is the list of SYCL device compiler expectations with regard to the function marked with `sycl_kernel` attribute. Function template with at least one parameter is expected. The compiler generates OpenCL kernel and uses first template parameter as unique name to the generated OpenCL kernel. Host application uses this unique name to invoke the OpenCL kernel generated for the `sycl_kernel_function` specialized by this name and KernelType (which might be a lambda type). Function must have at least one parameter. First parameter expected to be a function object type (named or unnamed i.e. lambda). Compiler uses function object type field to generate OpenCL kernel parameters. Aaron, I hope it makes more sense now. We don't plan in any use cases other than in SYCL standard library implementation mentioned above. If I understand you concerns correctly, you want to be sure that clang prohibits other uses of this attribute, which are not intended. Right? What is the best way to do this? Add more negative tests cases and make sure that clang generate error diagnostic messages?

keryell added inline comments.Jun 19 2019, 6:55 PM

clang/test/SemaSYCL/device-attributes.cpp
3 ↗	(On Diff #203785)	If I understand you concerns correctly, you want to be sure that clang prohibits other uses of this attribute, which are not intended. Right? But since it is an attribute to be used by SYCL run-time writers, I am not sure there is a lot of value in over-engineering the restrictions of its use. It diverts brain power from the real implementation & review and might even prevent innovation due to some creative use cases.

Updated sycl_kernel attribute documentation.

Harbormaster completed remote builds in B33686: Diff 205813.Jun 20 2019, 7:54 AM

Fixed a couple coding style issues, renamed markDevice function with markSYCLDevice.

Harbormaster completed remote builds in B33690: Diff 205831.Jun 20 2019, 8:59 AM

aaron.ballman added inline comments.Jun 24 2019, 1:28 PM

clang/test/SemaSYCL/device-attributes.cpp
3 ↗	(On Diff #203785)	If I understand you concerns correctly, you want to be sure that clang prohibits other uses of this attribute, which are not intended. Right? Effectively, yes. I'd like to ensure that situations where the attribute does not do what the user expects are diagnosed. A good rule of thumb that I use is to diagnose (as a warning) situations where the attribute will be silently ignored, and diagnose (as an error) situations where applying the attribute would cause really bad results (like miscompiles, security concerns, etc). What is the best way to do this? Add more negative tests cases and make sure that clang generate error diagnostic messages? That's a good approach, yes. Though for the situations you describe, I'd probably just warn rather than err because it seems like it's harmless to ignore the attribute so long as the user knows it's being ignored.
3 ↗	(On Diff #203785)	But since it is an attribute to be used by SYCL run-time writers, I am not sure there is a lot of value in over-engineering the restrictions of its use. It diverts brain power from the real implementation & review and might even prevent innovation due to some creative use cases. I disagree. Part of the real implementation is ensuring the attribute is not accidentally misused. It's frustrating for users to have an attribute silently ignored because it's easy to mistake that situation for the attribute behaving as expected.

Added warning diagnostic for sycl_kernel attribute.

Now if the sycl_kernel attribute applied to a function which doesn't meet requirements for OpenCL kernel generation, attribute will be ignored and diagnostic will be emitted.

Harbormaster completed remote builds in B34003: Diff 206861.Jun 27 2019, 7:18 AM

Minor fix

Harbormaster completed remote builds in B34007: Diff 206873.Jun 27 2019, 8:52 AM

aaron.ballman added inline comments.Jul 1 2019, 8:13 AM

clang/include/clang/Basic/AttrDocs.td
260	generate an OpenCL kernel
261	demonstrates the compiler's
278	defines the entry point
279	The compiler will
281	the compiler will add the "foo" function
282	More details about the compilation of functions for the device part can be found
284	of the code, the SYCL runtime
308	generate an OpenCL kernel
313	The function must be a template with at least two type template parameters.
314	generates an OpenCL kernel and uses the first template parameter as a unique name
315	The host application uses
318	The function must The first parameter is required to be a

aaron.ballman added inline comments.Jul 1 2019, 8:13 AM

clang/include/clang/Basic/Attr.td
1074	Shouldn't this be `FunctionTemplate` instead?
clang/include/clang/Basic/AttrDocs.td
319	The compiler uses the function object type
321	The function must return void. The compiler reuses the body of marked functions to generate the OpenCL kernel body, and the OpenCL kernel must return `void`. I'd move the "The sycl_kernel_function" sentence to its own paragraph rather than as part of the final bullet.
clang/include/clang/Basic/DiagnosticSemaKinds.td
10108–10109	I think this diagnostic should be split out into a few diagnostics that explicitly cover the requirements. Something like: `'sycl_kernel' attribute only applies to a %select{templated function\|function returning 'void'\|etc}0`. It's best to avoid trying to send users to documentation if we can just tell them explicitly what they did wrong with their code.
clang/include/clang/Sema/Sema.h
11210 ↗	(On Diff #206873)	Can you add a `const` overload that returns a `const` container reference? Also, why return the container rather than returning an iterator range from the container?
clang/lib/Sema/SemaDeclAttr.cpp
6417	Spurious newline above and missing a full stop at the end of the comment. Comments below are also missing full stops.
6418–6422	You can replace all this with a `cast<FunctionDecl>(D)` because the common attribute handler already verifies the subject is correct.
6424	I'd appreciate this being declared as a `const` pointer (same for the other nodes obtained through `FT`).
6427–6430	If you switch the subject to `FunctionTemplate`, then I believe this predicate can also go away.
clang/lib/Sema/SemaSYCL.cpp
65–66 ↗	(On Diff #206873)	I think there's some type confusion happening here. I would expect `Elt` to either be `auto ` or `Func` to be a `const auto &`. I suspect `Elt` should be declared as `auto `.

Hi @aaron.ballman,

Thanks a lot for the comments and sorry for the long delay. We've been working on complete implementation of the SYCL 1.2.1 specification.
Now I have more time to work on contributing the implementation to LLVM project.

I re-based the patch and started applying your suggestions.
In addition to that I'd like to investigate slightly different approach to outlining suggested by @ABataev at LLVM Dev. Meeting conference and utilize the infrastructure OpenMP compiler uses in CodeGen library to emit "device part" of the single source.

Thanks,
Alexey

Applied comments from Aaron.

Two comments left unresolved:

Split diagnostic message for sycl_kernel attribute into multiple messages. Will do tomorrow.
Change attribute "subject" in TableGen file from "Function" to "FunctionTemplate". I need guidance here as I'm not sure how to make it work.

Refactored patch to re-use CodeGen infrastructure for emitting SYCL device code.
It turned out to be quite simple change - just two one-liner changes in ASTContext to say that only SYCL kernels must be emitted when we compile for SYCL device + similar change in the CodeGen to mark symbols which must be emitted.

Removed sycl_device attribute, which was required by previous implementation for device code outlining. I think we still might need this attribute to mark "non-kernel" symbols as "device code", so the compiler will emit even though they are not used by SYCL kernels. Anyway it's not required for device code outlining and shouldn't be part of this patch.

Enhanced CodeGen test to check that host part of the code is not emitted.

Harbormaster completed remote builds in B40636: Diff 228286.Nov 7 2019, 11:53 AM

bader added inline comments.Nov 7 2019, 11:55 AM

clang/include/clang/Basic/Attr.td
1074	@aaron.ballman, I'm not sure. I tried to use FunctionTemplate instead of Function, but I get following warning: warning: 'sycl_kernel' attribute only applies to redeclarable templates I investigated this a little and Sema passes Function declaration instead of FunctionTemplate to the function validating the attribute appertains to the right subject. I think it's because attributes are handled before FunctionTemplateDecl node is created. Do we have an infrastructure to handle "FunctionTemplate" attributes? I can't find any other attribute with FunctionTemplate subject to learn from...

bader commandeered this revision.Nov 7 2019, 12:01 PM

bader edited reviewers, added: Fznamznon; removed: bader.

Applied two remaining comments from Aaron.

Split diagnostics for sycl_kernel attribute to provide more informative message.
Moved attribute target check to TableGen file. I stole a workaround for a function template subject emulation from @hfinkel C++ JIT compiler prototype (https://github.com/hfinkel/llvm-project-cxxjit/blob/cxxjit/clang/include/clang/Basic/Attr.td#L121).

Harbormaster completed remote builds in B40808: Diff 228868.Nov 12 2019, 4:37 AM

bader marked an inline comment as done.Nov 12 2019, 4:38 AM

bader added inline comments.

clang/test/SemaSYCL/device-attributes.cpp
35 ↗	(On Diff #228286)	Do we have to check each diagnostic message for both attribute spellings?

@aaron.ballman, @Anastasia, could you take a look at new version of the patch, please?

aaron.ballman added inline comments.Nov 20 2019, 6:37 AM

clang/include/clang/Basic/DiagnosticSemaKinds.td
10118	Do you mean template function or function template? A function template is a template used to generate functions and a template function is a function produced by a template. I think you probably mean "function template" here.
10121–10122	This diagnostic reads a bit like you cannot do this: `template <class N>` when I think the actual restriction is that you cannot do this: `template <int N>`. Is that correct? If so, I think this could be worded as `template parameter of a function template with the 'sycl_kernel' attribute must be a template type parameter`. Just double-checking, but you also intend to prohibit template template parameters? e.g., you can't do `template <template <typename> typename C>`
10124	Probably "function template" here as well.
10127	Same here.

Applied code review comments.

bader added a subscriber: erichkeane.Nov 20 2019, 9:23 AM

bader added inline comments.

clang/include/clang/Basic/DiagnosticSemaKinds.td
10121–10122	This diagnostic reads a bit like you cannot do this: template <class N> when I think the actual restriction is that you cannot do this: template <int N>. Is that correct? Yes. That is correct. If so, I think this could be worded as template parameter of a function template with the 'sycl_kernel' attribute must be a template type parameter. Thanks! Applied your wording. Just double-checking, but you also intend to prohibit template template parameters? e.g., you can't do template <template <typename> typename C> Currently we allow following use case: https://github.com/intel/llvm/blob/sycl/clang/test/SemaSYCL/mangle-kernel.cpp. I assume it qualifies as "template type" and not as "template template" parameter. Right? Quoting SYCL specification $6.2 Naming of kernels (https://www.khronos.org/registry/SYCL/specs/sycl-1.2.1.pdf#page=250). SYCL kernels are extracted from C++ source files and stored in an implementation- defined format. In the case of the shared-source compilation model, the kernels have to be uniquely identified by both host and device compiler. This is required in order for the host runtime to be able to load the kernel by using the OpenCL host runtime interface. From this requirement the following rules apply for naming the kernels: • The kernel name is a C++ typename. • The kernel needs to have a globally-visible name. In the case of a named function object type, the name can be the typename of the function object, as long as it is globally-visible. In the case where it isn’t, a globally visible name has to be provided, as template parameter to the kernel invoking interface, as described in 4.8.5. In C++11, lambdas do not have a globally-visible name, so a globally-visible typename has to be provided in the kernel invoking interface, as described in 4.8.5. • The kernel name has to be a unique identifier in the program. We also have an extension, which lifts these restrictions/requirements when clang is used as host and device compiler. @erichkeane implemented built-in function (https://github.com/intel/llvm/pull/250) providing "unique identifier", which we use as a kernel name for lambda objects. But this is going to be a separate patch.

Harbormaster completed remote builds in B41251: Diff 230281.Nov 20 2019, 9:26 AM

bader marked an inline comment as done.Nov 20 2019, 9:54 AM

bader added inline comments.

clang/test/Misc/pragma-attribute-supported-attributes-list.test
134 ↗	(On Diff #230281)	It looks like this change is not needed anymore. This check fails on my machine with the latest version of the patch. @aaron.ballman, I'm not sure if this is a problem of the implementation or test issue. Do I understand correctly that this test validates the list of the attributes which can be applied using `#pragma clang`? If so, removing this check seems to be okay. We need only `[[clang::sycl_kernel]]` or `__attribute__((sycl_kernel))` to work.

aaron.ballman added a reviewer: arphaman.Nov 20 2019, 10:59 AM

aaron.ballman added a subscriber: arphaman.

aaron.ballman added inline comments.

clang/include/clang/Basic/DiagnosticSemaKinds.td
10121–10122	Currently we allow following use case: https://github.com/intel/llvm/blob/sycl/clang/test/SemaSYCL/mangle-kernel.cpp. I assume it qualifies as "template type" and not as "template template" parameter. Right? Yeah, those are template types. A template template parameter would be: https://godbolt.org/z/9kwbW9 In that example, `C` is a template template parameter and `Ty` is a template type parameter. The part I'm a bit unclear on is why a template template parameter should be disallowed (I believe it names a type, as opposed to a non-type template parameter which names a value)?
clang/test/Misc/pragma-attribute-supported-attributes-list.test
134 ↗	(On Diff #230281)	Your understanding is correct, and I think it's a bug if you don't need to add an entry here for `SYCLKernel`. @arphaman, WDYT?

Herald added a subscriber: dexonsmith. · View Herald TranscriptNov 20 2019, 10:59 AM

Applied code review comments from Aaron.

Allow template template parameters for function templates marked with sycl_kernel attribute.

Harbormaster completed remote builds in B41259: Diff 230310.Nov 20 2019, 12:20 PM

bader marked 3 inline comments as done.Nov 20 2019, 12:33 PM

bader added inline comments.

clang/include/clang/Basic/DiagnosticSemaKinds.td
10121–10122	I think Mariya implemented this restriction to avoid usages not required for SYCL kernel support implementation in run-time library. As it was mentioned before, this attribute is intended to be used by SYCL run-time library only and current implantation do not require `template template parameter` support. I think that this might be useful for alternative implementations, so I updated the patch to restrict non-type template parameters only.
clang/test/Misc/pragma-attribute-supported-attributes-list.test
134 ↗	(On Diff #230281)	I turned out that the workaround I added to allow only function templates affected this test (described in this comment https://reviews.llvm.org/D60455#1742083). I.e. def FunctionTmpl : SubsetSubject<Function, [{S->getTemplatedKind() == FunctionDecl::TK_FunctionTemplate}], "function templates">; I also noted that there is no check for `artificial` attribute which uses the same approach to limit the subject to "inline functions". https://github.com/llvm/llvm-project/blob/master/clang/include/clang/Basic/Attr.td#L652 https://github.com/llvm/llvm-project/blob/master/clang/include/clang/Basic/Attr.td#L122

Ping.

ABataev added inline comments.Nov 27 2019, 6:23 AM

clang/lib/CodeGen/CodeGenModule.cpp
2477 ↗	(On Diff #230310)	Need to check if the decl must be emitted at all.

bader marked an inline comment as done.Nov 27 2019, 7:25 AM

bader added inline comments.

clang/lib/CodeGen/CodeGenModule.cpp
2477 ↗	(On Diff #230310)	Let me check that I get it right. You suggest adding `if (MustBeEmitted(Global))`, right? if (LangOpts.SYCLIsDevice && Global->hasAttr<SYCLKernelAttr>() && MustBeEmitted(Global)) { ... addDeferredDeclToEmit(GD); return; }

ABataev added inline comments.Nov 27 2019, 7:32 AM

clang/lib/CodeGen/CodeGenModule.cpp
2477 ↗	(On Diff #230310)	Yes

bader marked an inline comment as done.Nov 27 2019, 9:21 AM

bader added inline comments.

clang/lib/CodeGen/CodeGenModule.cpp
2477 ↗	(On Diff #230310)	Okay. Making this change requires additional adjustments in the patch and I have a few options. In this patch we do not add any logic forcing compiler to emit SYCL kernel. This logic is supposed to be added by follow-up patch (currently under SYCL working group review here https://github.com/intel/llvm/pull/249), which add code emitting "externally visible" OpenCL kernel calling function object passed to SYCL kernel function. I can: Temporally remove CodeGen test and add updated version back with the follow-up patch Do change making SYCL kernels "externally visible" and revert this change with the follow-up patch (this is kind of current logic which emits SYCL kernels unconditionally) Merge two patches and submit them together, but I assume it will significantly increase the size of the patch.

ABataev added inline comments.Nov 27 2019, 11:00 AM

clang/lib/CodeGen/CodeGenModule.cpp
2477 ↗	(On Diff #230310)	Probably, better would be to split the patch

The attribute bits LGTM aside from a wording nit with the diagnostic; I have no opinion on the CodeGen question.

clang/include/clang/Basic/DiagnosticSemaKinds.td
10122	`can't` -> `cannot`

Applied code review suggestions.

Split the patch into two parts. This patch contains only Sema part and LLVM IR generation part will be added separately. Updated commit message.
Replace can't with cannot.

bader retitled this revision from [SYCL] Implement SYCL device code outlining to [SYCL] Add sycl_kernel attribute for accelerated code outlining.Nov 28 2019, 3:04 AM

bader edited the summary of this revision. (Show Details)

Fixed typo in the commit message: complier -> compiler.

Sorry, I don't have capacity currently to review this and I don't want to be blocking it either.

This revision is now accepted and ready to land.Nov 28 2019, 3:23 AM

In D60455#1762804, @Anastasia wrote:

Sorry, I don't have capacity currently to review this and I don't want to be blocking it either.

@Anastasia, thanks for finding time for reviewing previous revisions of the patch. I really appreciate your comments.

Minor update adjusting to the recent changes.

Updated comment "The 'sycl_kernel' attribute applies only to functions" -> "The 'sycl_kernel' attribute applies only to function templates".
Renamed tests from "device-attributes*" to "kernel-attribute*".

A couple of minor comments.

clang/include/clang/Basic/AttrDocs.td
313	@bader , could you please apply this too?
314	I'm not an expert in English, so you can ignore it if I'm wrong, but a phrase like "uses parameter as a name to the kernel" seems strange. Maybe "for kernel"?
317	(which might be a lambda or a function object type).

Applied comments from @Fznamznon.

LGTM with some testing requests.

clang/test/SemaSYCL/kernel-attribute.cpp
5–6	Still missing a test that the attribute is ignored when SYCL is not enabled.
7	This test should be on a templated function (we already demonstrated it only applies to templated functions, so the check for the argument is not what is failing).
8	Same here.

bader marked 5 inline comments as done.Dec 2 2019, 12:21 AM

bader added inline comments.

clang/test/SemaSYCL/kernel-attribute.cpp
5–6	Still missing a test that the attribute is ignored when SYCL is not enabled. I think clang/test/SemaSYCL/kernel-attribute-on-non-sycl.cpp should check that. Please, let me know if you mean something else. This test should be on a templated function (we already demonstrated it only applies to templated functions, so the check for the argument is not what is failing). Nice catch. Thanks!

Applied @aaron.ballman suggestions to kernel-attribute.cpp test

LGTM with a couple of minor comments.

clang/include/clang/Basic/AttrDocs.td
273	Sorry for late catch, but there is a little bug in this SYCL code: `index` is one-dimensional `id`, so calling subscript operator with any value other than `0` is a bug.
319	There are two spaces between "." and "The" at the end of line 319.

aaron.ballman added inline comments.Dec 2 2019, 4:44 AM

clang/test/SemaSYCL/kernel-attribute.cpp
5–6	I think clang/test/SemaSYCL/kernel-attribute-on-non-sycl.cpp should check that. Please, let me know if you mean something else. Oh, you're correct, that was the test I was hoping for!

Fixed SYCL code example for sycl_kernel attribute documentation and commit message.

I hope all comments from are @Fznamznon and @aaron.ballman are applied.
@ABataev, do you have any other comments?

Closed by commit rGc094e7dc4b3f: [SYCL] Add sycl_kernel attribute for accelerated code outlining (authored by Fznamznon, committed by bader). · Explain WhyDec 3 2019, 8:23 AM

This revision was automatically updated to reflect the committed changes.

• Quuxplusone mentioned this in D50119: P1144 "Trivially relocatable" (0/3): Compiler support for `__is_trivially_relocatable(T)`.Jan 7 2020, 1:38 PM

tschuett mentioned this in D140226: [NVPTX] Introduce attribute to mark kernels without a language mode.Dec 18 2022, 8:56 AM

Revision Contents

Path

Size

clang/

include/

clang/

Basic/

Attr.td

13 lines

AttrDocs.td

73 lines

DiagnosticSemaKinds.td

15 lines

lib/

Sema/

SemaDeclAttr.cpp

42 lines

test/

SemaSYCL/

kernel-attribute-on-non-sycl.cpp

14 lines

kernel-attribute.cpp

44 lines

Diff 231918

clang/include/clang/Basic/Attr.td

Show First 20 Lines • Show All 115 Lines • ▼ Show 20 Lines	def SharedVar : SubsetSubject<Var,
"global variables">;		"global variables">;

def GlobalVar : SubsetSubject<Var,		def GlobalVar : SubsetSubject<Var,
[{S->hasGlobalStorage()}], "global variables">;		[{S->hasGlobalStorage()}], "global variables">;

def InlineFunction : SubsetSubject<Function,		def InlineFunction : SubsetSubject<Function,
[{S->isInlineSpecified()}], "inline functions">;		[{S->isInlineSpecified()}], "inline functions">;

		def FunctionTmpl
		: SubsetSubject<Function, [{S->getTemplatedKind() ==
		FunctionDecl::TK_FunctionTemplate}],
		"function templates">;

// FIXME: this hack is needed because DeclNodes.td defines the base Decl node		// FIXME: this hack is needed because DeclNodes.td defines the base Decl node
// type to be a class, not a definition. This makes it impossible to create an		// type to be a class, not a definition. This makes it impossible to create an
// attribute subject which accepts a Decl. Normally, this is not a problem,		// attribute subject which accepts a Decl. Normally, this is not a problem,
// because the attribute can have no Subjects clause to accomplish this. But in		// because the attribute can have no Subjects clause to accomplish this. But in
// the case of a SubsetSubject, there's no way to express it without this hack.		// the case of a SubsetSubject, there's no way to express it without this hack.
def DeclBase : AttrSubject;		def DeclBase : AttrSubject;
def FunctionLike : SubsetSubject<DeclBase,		def FunctionLike : SubsetSubject<DeclBase,
[{S->getFunctionType(false) != nullptr}],		[{S->getFunctionType(false) != nullptr}],
▲ Show 20 Lines • Show All 159 Lines • ▼ Show 20 Lines	class LangOpt<string name, code customCode = [{}]> {
// A custom predicate, written as an expression evaluated in a context with		// A custom predicate, written as an expression evaluated in a context with
// "LangOpts" bound.		// "LangOpts" bound.
code CustomCode = customCode;		code CustomCode = customCode;
}		}
def MicrosoftExt : LangOpt<"MicrosoftExt">;		def MicrosoftExt : LangOpt<"MicrosoftExt">;
def Borland : LangOpt<"Borland">;		def Borland : LangOpt<"Borland">;
def CUDA : LangOpt<"CUDA">;		def CUDA : LangOpt<"CUDA">;
def HIP : LangOpt<"HIP">;		def HIP : LangOpt<"HIP">;
		def SYCL : LangOpt<"SYCLIsDevice">;
def COnly : LangOpt<"COnly", "!LangOpts.CPlusPlus">;		def COnly : LangOpt<"COnly", "!LangOpts.CPlusPlus">;
def CPlusPlus : LangOpt<"CPlusPlus">;		def CPlusPlus : LangOpt<"CPlusPlus">;
def OpenCL : LangOpt<"OpenCL">;		def OpenCL : LangOpt<"OpenCL">;
def RenderScript : LangOpt<"RenderScript">;		def RenderScript : LangOpt<"RenderScript">;
def ObjC : LangOpt<"ObjC">;		def ObjC : LangOpt<"ObjC">;
def BlocksSupported : LangOpt<"Blocks">;		def BlocksSupported : LangOpt<"Blocks">;
def ObjCAutoRefCount : LangOpt<"ObjCAutoRefCount">;		def ObjCAutoRefCount : LangOpt<"ObjCAutoRefCount">;
def ObjCNonFragileRuntime : LangOpt<"ObjCNonFragileRuntime",		def ObjCNonFragileRuntime : LangOpt<"ObjCNonFragileRuntime",
▲ Show 20 Lines • Show All 744 Lines • ▼ Show 20 Lines

def CUDAShared : InheritableAttr {		def CUDAShared : InheritableAttr {
let Spellings = [GNU<"shared">, Declspec<"__shared__">];		let Spellings = [GNU<"shared">, Declspec<"__shared__">];
let Subjects = SubjectList<[Var]>;		let Subjects = SubjectList<[Var]>;
let LangOpts = [CUDA];		let LangOpts = [CUDA];
let Documentation = [Undocumented];		let Documentation = [Undocumented];
}		}

		def SYCLKernel : InheritableAttr {
		let Spellings = [Clang<"sycl_kernel">];
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions Is there a reason to not also introduce a C++11 and C2x style spelling in the `clang` namespace? e.g., `[[clang::sycl_device]]` aaron.ballman: Is there a reason to not also introduce a C++11 and C2x style spelling in the `clang` namespace?
		FznamznonUnsubmitted Not Done Reply Inline Actions I don't think that it makes sense because these attributes not for public consumption. These attributes is needed to separate code which is supposed to be offloaded from regular host code. I think SYCLDevice attribute actually doesn't need a spelling because it will be added only implicitly by compiler. Fznamznon: I don't think that it makes sense because these attributes not for public consumption. These…
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions If these are only being added implicitly by the compiler, then they should not be given any `Spelling`. See `AlignMac68k` for an example. aaron.ballman: If these are only being added implicitly by the compiler, then they should not be given any…
		keryellUnsubmitted Not Done Reply Inline Actions If we go towards this direction, `[[clang::sycl::device]]` or `[[clang::sycl::kernel]]` look more compatible with the concept of name space. While not a public interface, if we have a kind of "standard" outlining in Clang/LLVM, some people might want to use it in some other contexts too. keryell: If we go towards this direction, `[[clang::sycl::device]]` or `[[clang::sycl::kernel]]` look…
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions I'm still confused -- are these created implicitly or are they spelled out by the user explicitly? Right now, it looks like they're spelled out explicitly, but I was under the impression they are only intended to be created implicitly by the compiler. If they are expected to be explicitly specified by the user, the spelling should be using `Clang<>` instead of using `GNU<>`, `C2x<>`, and `CXX11<>` explicitly. If we go towards this direction, [[clang::sycl::device]] or [[clang::sycl::kernel]] look more compatible with the concept of name space. Attribute namespaces do not work that way. There is the vendor namespace and then the attribute name. aaron.ballman: I'm still confused -- are these created implicitly or are they spelled out by the user…
		keryellUnsubmitted Not Done Reply Inline Actions Attribute namespaces do not work that way. There is the vendor namespace and then the attribute name. After diving into "9.11.1 Attribute syntax and semantics [dcl.attr.grammar]" of the latest C++ draft standard, it looks you are right... There is only 1 level of `::` allowed. :-( keryell: > Attribute namespaces do not work that way. There is the vendor namespace and then the…
		let Subjects = SubjectList<[FunctionTmpl]>;
		let LangOpts = [SYCL];
		let Documentation = [SYCLKernelDocs];
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions No new, undocumented attributes, please. aaron.ballman: No new, undocumented attributes, please.
		FznamznonUnsubmitted Not Done Reply Inline Actions As I said, these attributes are not for public consumption. Should I add documentation in this case too? Fznamznon: As I said, these attributes are not for public consumption. Should I add documentation in this…
		keryellUnsubmitted Not Done Reply Inline Actions Yes, documentation and comments are always appreciated. keryell: Yes, documentation and comments are always appreciated.
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions If the attribute is only ever added implicitly and the user cannot spell it in any way, then I think it's reasonable to elide the documentation. It's an implementation detail at that point. aaron.ballman: If the attribute is only ever added implicitly and the user cannot spell it in any way, then I…
		}

def C11NoReturn : InheritableAttr {		def C11NoReturn : InheritableAttr {
let Spellings = [Keyword<"_Noreturn">];		let Spellings = [Keyword<"_Noreturn">];
let Subjects = SubjectList<[Function], ErrorDiag>;		let Subjects = SubjectList<[Function], ErrorDiag>;
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions Shouldn't this be `FunctionTemplate` instead? aaron.ballman: Shouldn't this be `FunctionTemplate` instead?
		baderAuthorUnsubmitted Not Done Reply Inline Actions @aaron.ballman, I'm not sure. I tried to use FunctionTemplate instead of Function, but I get following warning: warning: 'sycl_kernel' attribute only applies to redeclarable templates I investigated this a little and Sema passes Function declaration instead of FunctionTemplate to the function validating the attribute appertains to the right subject. I think it's because attributes are handled before FunctionTemplateDecl node is created. Do we have an infrastructure to handle "FunctionTemplate" attributes? I can't find any other attribute with FunctionTemplate subject to learn from... bader: @aaron.ballman, I'm not sure. I tried to use FunctionTemplate instead of Function, but I get…
let SemaHandler = 0;		let SemaHandler = 0;
let Documentation = [C11NoReturnDocs];		let Documentation = [C11NoReturnDocs];
		keryellUnsubmitted Not Done Reply Inline Actions Cf supra. keryell: Cf supra.
		AnastasiaUnsubmitted Not Done Reply Inline Actions Ok, I thought the earlier request was not to add undocumented attributes with the spelling? Also did `__kernel` attribute not work at the end? I can't quite get where the current disconnect comes from but I find it extremely unhelpful. Anastasia: Ok, I thought the earlier request was not to add undocumented attributes with the spelling?
		baderAuthorUnsubmitted Not Done Reply Inline Actions Hi @Anastasia, let me try to help. Ok, I thought the earlier request was not to add undocumented attributes with the spelling? Right. @Fznamznon, could you document `sycl_kernel` attribute, please? Also did __kernel attribute not work at the end? Maria left a comment with the summary of our experiment: https://reviews.llvm.org/D60455#1472705. There is a link to pull request, where @keryell and @agozillon expressed preference to have separate SYCL attributes. Let me copy their feedback here: @keryell : Thank you for the experiment. That looks like a straight forward change. The interesting part is that it does not expose any advantage from reusing OpenCL __kernel marker.... So I am not more convinced that it is the way to go, because we would use any other keyword or attribute and it would be the same... @agozillon : Just my two cents, I think a separation of concerns and having separate attributes will simplify things long-term. While possibly similar just now, the SYCL specification is evolving and may end up targeting more than just OpenCL. So the semantics of the attributes may end up being quite different, even if at the moment the SYCL attribute is there mostly just to mark something for outlining. If it doesn't then the case for refactoring and merging them in a future patch could be brought up again. To summarize: we don't have good arguments to justify re-use of OpenCL `__kernel` keyword for SYCL mode requested by @aaron.ballman here https://reviews.llvm.org/D60455#1469150. I can't quite get where the current disconnect comes from but I find it extremely unhelpful. Let me know how I can help here. Additional note. I've submitted initial version of SYCL compiler design document to the GItHub: https://github.com/intel/llvm/blob/sycl/sycl/doc/SYCL_compiler_and_runtime_design.md. Please, take a look and let me know if you have questions. bader: Hi @Anastasia, let me try to help. > Ok, I thought the earlier request was not to add…
		FznamznonUnsubmitted Not Done Reply Inline Actions Ok, I thought the earlier request was not to add undocumented attributes with the spelling? Right. @Fznamznon, could you document sycl_kernel attribute, please? Do we really need add documentation for attribute which is not presented in SYCL spec and used for internal implementation details only because it has spelling? Fznamznon: >> Ok, I thought the earlier request was not to add undocumented attributes with the spelling?
		AnastasiaUnsubmitted Not Done Reply Inline Actions Ok, I thought the earlier request was not to add undocumented attributes with the spelling? Right. @Fznamznon, could you document sycl_kernel attribute, please? Do we really need add documentation for attribute which is not presented in SYCL spec and used for internal implementation details only because it has spelling? You are adding an attribute that is exposed to the programmers that use clang to compile their code, so unless you come up with some way to reject it in the non-toolchain mode it has to be documented. And for clang it will become "hidden" SYCL dialect so absolutely not different to `__kernel`. Another aspect to consider is that clang used `TypePrinter` in diagnostics and even though printing entire function signature is rare it might appear in diagnostics and the programmer should have a way to understand what the "alien" construct is. This is where clang documentation will help. Anastasia: > Ok, I thought the earlier request was not to add undocumented attributes with the…
		AnastasiaUnsubmitted Not Done Reply Inline Actions @keryell : Thank you for the experiment. That looks like a straight forward change. The interesting part is that it does not expose any advantage from reusing OpenCL __kernel marker.... So I am not more convinced that it is the way to go, because we would use any other keyword or attribute and it would be the same... I don't understand how this conclusions are made on incomplete implementation or even just an initial patch. The kind of analysis I am missing at the moment is whether you would need to add similar logic for `sycl_kernel` as we have now for `__kernel` i.e. did anyone look at the occurrences of kernel handling in the code base to see if it's going to need the same logic or not: include/clang/Basic/Attr.td: : SubsetSubject<Function, [{S->hasAttr<OpenCLKernelAttr>()}], include/clang/Parse/Parser.h: void ParseOpenCLKernelAttributes(ParsedAttributes &attrs); lib/AST/Decl.cpp: if (hasAttr<OpenCLKernelAttr>()) lib/AST/Decl.cpp: if (hasAttr<OpenCLKernelAttr>()) lib/AST/Decl.cpp: if (hasAttr<OpenCLKernelAttr>()) lib/CodeGen/CGCall.cpp: if (TargetDecl && TargetDecl->hasAttr<OpenCLKernelAttr>()) { lib/CodeGen/CodeGenFunction.cpp: if (!FD->hasAttr<OpenCLKernelAttr>()) lib/CodeGen/TargetInfo.cpp: if (FD->hasAttr<OpenCLKernelAttr>()) { lib/CodeGen/TargetInfo.cpp: if (FD->hasAttr<OpenCLKernelAttr>()) { lib/CodeGen/TargetInfo.cpp: return D->hasAttr<OpenCLKernelAttr>() \|\| lib/CodeGen/TargetInfo.cpp: if (M.getLangOpts().OpenCL && FD->hasAttr<OpenCLKernelAttr>() && lib/Parse/ParseDecl.cpp:void Parser::ParseOpenCLKernelAttributes(ParsedAttributes &attrs) { lib/Parse/ParseDecl.cpp: ParseOpenCLKernelAttributes(DS.getAttributes()); lib/Sema/SemaDecl.cpp: if (FD && !FD->hasAttr<OpenCLKernelAttr>()) { lib/Sema/SemaDecl.cpp: if (FD && FD->hasAttr<OpenCLKernelAttr>()) { lib/Sema/SemaDecl.cpp: if (getLangOpts().OpenCL && NewFD->hasAttr<OpenCLKernelAttr>()) { lib/Sema/SemaDecl.cpp: << FD->hasAttr<OpenCLKernelAttr>(); lib/Sema/SemaDecl.cpp: if (FD->hasAttr<OpenCLKernelAttr>()) lib/Sema/SemaDeclAttr.cpp: handleSimpleAttribute<OpenCLKernelAttr>(S, D, AL); lib/Sema/SemaDeclAttr.cpp: if (!D->hasAttr<OpenCLKernelAttr>()) { I don't mind either way but I would like the decision to be based on the analysis of clang code base please! @agozillon : Just my two cents, I think a separation of concerns and having separate attributes will simplify things long-term. This can potentially be a fair point! While possibly similar just now, the SYCL specification is evolving and may end up targeting more than just OpenCL. So the semantics of the attributes may end up being quite different, even if at the moment the SYCL attribute is there mostly just to mark something for outlining. This is really great! But unless you provide concrete information what the evolution is and what exactly you are trying to achieve and how it affect compiler design there is no way to review your patches. Let me know how I can help here. Additional note. I've submitted initial version of SYCL compiler design document to the GItHub: https://github.com/intel/llvm/blob/sycl/sycl/doc/SYCL_compiler_and_runtime_design.md. Please, take a look and let me know if you have questions. Thanks for sharing! I will try to find time to look into this and provide my feedback if any. Anastasia: > @keryell : > > Thank you for the experiment. > That looks like a straight forward…
		baderAuthorUnsubmitted Not Done Reply Inline Actions @Anastasia, I've looked at the occurrences of OpenCLKernelAttr attribute and it looks like the only part useful for SYCL is lib/CodeGen/CodeGenFunction.cpp, which emits OpenCL specific metadata required for SPIR-V translation. Sema part is mostly not relevant for SYCL mode because SYCL API do not allow the cases currently detected by clang (e.g. constant address space variable declaration in OpenCL kernel scope, naming OpenCL kernel `main`, etc). A couple of check that might be useful are: `void` return type for kernel functions kernel can't be static function and some of the checks are harmful for proposed implementation (e.g. kernels can't be template functions). @Anastasia, @keryell, @agozillon and @aaron.ballman need to agree if this sufficient to justify the re-use of OpenCL kernel attribute. Let me know if you need any additional information to make a decision. bader: @Anastasia, I've looked at the occurrences of OpenCLKernelAttr attribute and it looks like the…
		AnastasiaUnsubmitted Not Done Reply Inline Actions Sema part is mostly not relevant for SYCL mode because SYCL API do not allow the cases currently detected by clang (e.g. constant address space variable declaration in OpenCL kernel scope, naming OpenCL kernel main, etc). Would you mind pointing me to your impl of those? A couple of check that might be useful are: void return type for kernel functions kernel can't be static function and some of the checks are harmful for proposed implementation (e.g. kernels can't be template functions). @Anastasia, @keryell, @agozillon and @aaron.ballman need to agree if this sufficient to justify the re-use of OpenCL kernel attribute. Let me know if you need any additional information to make a decision. Ok, if from ~20 occurrences in the source code you will be able to reuse only just 2 it doesn't seem like it's worth to share `__kernel` attribute. Anastasia: > Sema part is mostly not relevant for SYCL mode because SYCL API do not allow the cases…
		AnastasiaUnsubmitted Not Done Reply Inline Actions Undocumented -> SYCLKernelDocs Anastasia: Undocumented -> SYCLKernelDocs
		FznamznonUnsubmitted Not Done Reply Inline Actions Oh, Thank you for that! Fznamznon: Oh, Thank you for that!
}		}

def CXX11NoReturn : InheritableAttr {		def CXX11NoReturn : InheritableAttr {
let Spellings = [CXX11<"", "noreturn", 200809>];		let Spellings = [CXX11<"", "noreturn", 200809>];
let Subjects = SubjectList<[Function], ErrorDiag>;		let Subjects = SubjectList<[Function], ErrorDiag>;
let Documentation = [CXX11NoReturnDocs];		let Documentation = [CXX11NoReturnDocs];
}		}

▲ Show 20 Lines • Show All 2,391 Lines • Show Last 20 Lines

clang/include/clang/Basic/AttrDocs.td

	Show First 20 Lines • Show All 247 Lines • ▼ Show 20 Lines

	It is also possible to specify a CPU name of ``generic`` which will be resolved			It is also possible to specify a CPU name of ``generic`` which will be resolved
	if the executing processor doesn't satisfy the features required in the CPU			if the executing processor doesn't satisfy the features required in the CPU
	name. The behavior of a program executing on a processor that doesn't satisfy			name. The behavior of a program executing on a processor that doesn't satisfy
	any option of a multiversioned function is undefined.			any option of a multiversioned function is undefined.
	}];			}];
	}			}

				def SYCLKernelDocs : Documentation {
				let Category = DocCatFunction;
				let Content = [{
				The ``sycl_kernel`` attribute specifies that a function template will be used
				aaron.ballmanUnsubmitted Not Done Reply Inline Actions specifies function which is -> specifies that a function is Also, please put backticks around `sycl_device`. aaron.ballman: specifies function which is -> specifies that a function is Also, please put backticks around…
				aaron.ballmanUnsubmitted Not Done Reply Inline Actions is SYCL "kernel function" -> is a SYCL "kernel function" aaron.ballman: is SYCL "kernel function" -> is a SYCL "kernel function"
				to outline device code and to generate an OpenCL kernel.
				aaron.ballmanUnsubmitted Not Done Reply Inline Actions is code example -> is a code example aaron.ballman: is code example -> is a code example
				aaron.ballmanUnsubmitted Not Done Reply Inline Actions SYCL -> A SYCL aaron.ballman: SYCL -> A SYCL
				aaron.ballmanUnsubmitted Not Done Reply Inline Actions generate an OpenCL kernel aaron.ballman: generate an OpenCL kernel
				Here is a code example of the SYCL program, which demonstrates the compiler's
				aaron.ballmanUnsubmitted Not Done Reply Inline Actions the SYCL program, which -> a SYLC program that (Note, I also removed the comma.) aaron.ballman: the SYCL program, which -> a SYLC program that (Note, I also removed the comma.)
				aaron.ballmanUnsubmitted Not Done Reply Inline Actions Kernel is a -> A kernel is a aaron.ballman: Kernel is a -> A kernel is a
				aaron.ballmanUnsubmitted Not Done Reply Inline Actions demonstrates the compiler's aaron.ballman: demonstrates the compiler's
				outlining job:
				.. code-block:: c++

				aaron.ballmanUnsubmitted Not Done Reply Inline Actions This doesn't really demonstrate the need for the attribute -- the attribute is never shown in the code example. I'd prefer an example that shows when and how a user would write this attribute. aaron.ballman: This doesn't really demonstrate the need for the attribute -- the attribute is never shown in…
				FznamznonUnsubmitted Not Done Reply Inline Actions I see. I will update documentation in the next version. Fznamznon: I see. I will update documentation in the next version.
				int foo(int x) { return ++x; }

				using namespace cl::sycl;
				queue Q;
				buffer<int, 1> a(range<1>{1024});
				AnastasiaUnsubmitted Not Done Reply Inline Actions The example doesn't demonstrate the use of the attribute. It explains how it is used by the toolchain only! May be @aaron.ballman can help here as I am not sure what the format should be. Anastasia: The example doesn't demonstrate the use of the attribute. It explains how it is used by the…
				Q.submit([&](handler& cgh) {
				auto A = a.get_access<access::mode::write>(cgh);
				cgh.parallel_for<init_a>(range<1>{1024}, [=](id<1> index) {
				A[index] = index[0] + foo(42);
				FznamznonUnsubmitted Done Reply Inline Actions Sorry for late catch, but there is a little bug in this SYCL code: `index` is one-dimensional `id`, so calling subscript operator with any value other than `0` is a bug. Fznamznon: Sorry for late catch, but there is a little bug in this SYCL code: `index` is one-dimensional…
				});
				}

				A C++ function object passed to the ``parallel_for`` is called a "SYCL kernel".
				aaron.ballmanUnsubmitted Not Done Reply Inline Actions Code -> Do you mean the lambda? If so, perhaps "The lambda that is passed to the `parallel_for`" (add backticks around parallel_for too, please). is called "kernel function" -> is called a "kernel function" aaron.ballman: Code -> Do you mean the lambda? If so, perhaps "The lambda that is passed to the…
				A SYCL kernel defines the entry point to the "device part" of the code. The
				aaron.ballmanUnsubmitted Not Done Reply Inline Actions called SYLC -> called a SYLC aaron.ballman: called SYLC -> called a SYLC
				aaron.ballmanUnsubmitted Not Done Reply Inline Actions defines the entry point aaron.ballman: defines the entry point
				compiler will emit all symbols accessible from a "kernel". In this code
				aaron.ballmanUnsubmitted Not Done Reply Inline Actions The compiler will aaron.ballman: The compiler will
				example, the compiler will emit "foo" function. More details about the
				aaron.ballmanUnsubmitted Not Done Reply Inline Actions use sycl_kernel -> use the sycl_kernel aaron.ballman: use sycl_kernel -> use the sycl_kernel
				compilation of functions for the device part can be found in the SYCL 1.2.1
				aaron.ballmanUnsubmitted Not Done Reply Inline Actions as SYCL -> as a SYCL Compiler is supposed to -> The compiler will aaron.ballman: as SYCL -> as a SYCL Compiler is supposed to -> The compiler will
				aaron.ballmanUnsubmitted Not Done Reply Inline Actions the compiler will add the "foo" function aaron.ballman: the compiler will add the "foo" function
				specification Section 6.4.
				aaron.ballmanUnsubmitted Not Done Reply Inline Actions More details about the compilation of functions for the device part can be found aaron.ballman: More details about the compilation of functions for the device part can be found
				To show to the compiler entry point to the "device part" of the code, the SYCL
				runtime can use the ``sycl_kernel`` attribute in the following way:
				aaron.ballmanUnsubmitted Not Done Reply Inline Actions In this code example compiler is supposed to add "foo" function -> In this code example, the compiler will add the "foo" function aaron.ballman: In this code example compiler is supposed to add "foo" function -> In this code example, the…
				aaron.ballmanUnsubmitted Not Done Reply Inline Actions of the code, the SYCL runtime aaron.ballman: of the code, the SYCL runtime
				.. code-block:: c++
				namespace cl {
				aaron.ballmanUnsubmitted Not Done Reply Inline Actions I'm still not entirely certain how I would know what to mark and how. From the description, it sounds like whoever authors `parallel_for` needs to do this marking, or it somehow happens automatically? (I'll do another editorial pass once I understand the intended behavior a bit better -- I expect there will be a few more wording issues to address.) aaron.ballman: I'm still not entirely certain how I would know what to mark and how. From the description, it…
				keryellUnsubmitted Not Done Reply Inline Actions In normal SYCL it happens automatically. In the compiler unit-tests it is done manually to exercise the framework. I am the one who suggested that in some other contexts, it could be used manually for some special purpose like using some weird hardware, but I do not want to derail the main review with this. keryell: In normal SYCL it happens automatically. In the compiler unit-tests it is done manually to…
				AnastasiaUnsubmitted Not Done Reply Inline Actions In normal SYCL it happens automatically. In the compiler unit-tests it is done manually to exercise the framework. I think if they are not to be exposed to the user they should have no spelling. There are plenty of other ways to test this. For example AST dump. I am the one who suggested that in some other contexts, it could be used manually for some special purpose like using some weird hardware, but I do not want to derail the main review with this. If you are suggesting to expose this feature then you are starting some sort of a language extensions and its use should be documented in some way. I am not sure about this but I think we will end up with some sort of a language extension for SYCL anyways because as it stands now it's not aligned with the general concept of C/C++ language design. So perhaps it's not entirely unreasonable to expose this. Anastasia: > In normal SYCL it happens automatically. > In the compiler unit-tests it is done manually to…
				FznamznonUnsubmitted Not Done Reply Inline Actions Generally the `sycl_device` attribute will be added automatically by the compiler. But as @bader mentioned before: we might need to use sycl_device attribute to mark functions, which are called from the different translation units, i.e. compiler can't identify it w/o user's help. SYCL specification proposes to use special macro as "device function marker", but I guess we can have additional "spellings" in the clang. I think It would be better to re-use this attribute in implementation of "device function marker" macro from SYCL spec than implement additional logic in the compiler to handle this macro. So I saved possibility to add this attribute in code. Fznamznon: Generally the `sycl_device` attribute will be added automatically by the compiler. But as…
				aaron.ballmanUnsubmitted Not Done Reply Inline Actions I'm not opposed to adding the attribute, but @bader also said that SYCL is not supposed to expose non-standard extensions to users and "we might need this" didn't seem like a strong case for needing the attribute. If the SYCL spec has a macro that is used to mark the user's code, then 1) that seems like a nonstandard extension to the language, but 2) it does make a strong case for having a named attribute. aaron.ballman: I'm not opposed to adding the attribute, but @bader also said that SYCL is not supposed to…
				namespace sycl {
				class handler {
				template <typename KernelName, typename KernelType/, .../>
				__attribute__((sycl_kernel)) void sycl_kernel_function(KernelType KernelFuncObj) {
				// ...
				KernelFuncObj();
				}

				template <typename KernelName, typename KernelType, int Dims>
				void parallel_for(range<Dims> NumWorkItems, KernelType KernelFunc) {
				#ifdef __SYCL_DEVICE_ONLY__
				sycl_kernel_function<KernelName, KernelType, Dims>(KernelFunc);
				#else
				// Host implementation
				#endif
				}
				};
				} // namespace sycl
				} // namespace cl

				The compiler will also generate an OpenCL kernel using the function marked with
				the ``sycl_kernel`` attribute.
				aaron.ballmanUnsubmitted Not Done Reply Inline Actions generate an OpenCL kernel aaron.ballman: generate an OpenCL kernel
				Here is the list of SYCL device compiler expectations with regard to the
				function marked with the ``sycl_kernel`` attribute:

				- The function must be a template with at least two type template parameters.
				The compiler generates an OpenCL kernel and uses the first template parameter
				aaron.ballmanUnsubmitted Done Reply Inline Actions The function must be a template with at least two type template parameters. aaron.ballman: The function must be a template with at least two type template parameters.
				FznamznonUnsubmitted Done Reply Inline Actions @bader , could you please apply this too? Fznamznon: @bader , could you please apply this too?
				as a unique name for the generated OpenCL kernel. The host application uses
				aaron.ballmanUnsubmitted Done Reply Inline Actions generates an OpenCL kernel and uses the first template parameter as a unique name aaron.ballman: generates an OpenCL kernel and uses the first template parameter as a unique name
				FznamznonUnsubmitted Done Reply Inline Actions I'm not an expert in English, so you can ignore it if I'm wrong, but a phrase like "uses parameter as a name to the kernel" seems strange. Maybe "for kernel"? Fznamznon: I'm not an expert in English, so you can ignore it if I'm wrong, but a phrase like "uses…
				this unique name to invoke the OpenCL kernel generated for the SYCL kernel
				aaron.ballmanUnsubmitted Not Done Reply Inline Actions The host application uses aaron.ballman: The host application uses
				specialized by this name and second template parameter ``KernelType`` (which
				might be an unnamed function object type).
				FznamznonUnsubmitted Not Done Reply Inline Actions (which might be a lambda or a function object type). Fznamznon: (which might be a lambda or a function object type).
				- The function must have at least one parameter. The first parameter is
				aaron.ballmanUnsubmitted Not Done Reply Inline Actions The function must The first parameter is required to be a aaron.ballman: The function must The first parameter is required to be a
				required to be a function object type (named or unnamed i.e. lambda). The
				aaron.ballmanUnsubmitted Not Done Reply Inline Actions The compiler uses the function object type aaron.ballman: The compiler uses the function object type
				FznamznonUnsubmitted Done Reply Inline Actions There are two spaces between "." and "The" at the end of line 319. Fznamznon: There are two spaces between "." and "The" at the end of line 319.
				compiler uses function object type fields to generate OpenCL kernel
				parameters.
				aaron.ballmanUnsubmitted Not Done Reply Inline Actions The function must return void. The compiler reuses the body of marked functions to generate the OpenCL kernel body, and the OpenCL kernel must return `void`. I'd move the "The sycl_kernel_function" sentence to its own paragraph rather than as part of the final bullet. aaron.ballman: The function must return void. The compiler reuses the body of marked functions to generate the…
				- The function must return void. The compiler reuses the body of marked functions to
				generate the OpenCL kernel body, and the OpenCL kernel must return `void`.

				The SYCL kernel in the previous code sample meets these expectations.
				}];
				}

	def C11NoReturnDocs : Documentation {			def C11NoReturnDocs : Documentation {
	let Category = DocCatFunction;			let Category = DocCatFunction;
	let Content = [{			let Content = [{
	A function declared as ``_Noreturn`` shall not return to its caller. The			A function declared as ``_Noreturn`` shall not return to its caller. The
	compiler will generate a diagnostic for a function declared as ``_Noreturn``			compiler will generate a diagnostic for a function declared as ``_Noreturn``
	that appears to be capable of returning to its caller. Despite being a type			that appears to be capable of returning to its caller. Despite being a type
	specifier, the ``_Noreturn`` attribute cannot be specified on a function			specifier, the ``_Noreturn`` attribute cannot be specified on a function
	pointer type.			pointer type.
	▲ Show 20 Lines • Show All 4,304 Lines • Show Last 20 Lines

clang/include/clang/Basic/DiagnosticSemaKinds.td

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 10,099 Lines • ▼ Show 20 Lines	def warn_noderef_to_dereferenceable_pointer : Warning<
"casting to dereferenceable pointer removes 'noderef' attribute">, InGroup<NoDeref>;		"casting to dereferenceable pointer removes 'noderef' attribute">, InGroup<NoDeref>;

def err_builtin_launder_invalid_arg : Error<		def err_builtin_launder_invalid_arg : Error<
"%select{non-pointer\|function pointer\|void pointer}0 argument to "		"%select{non-pointer\|function pointer\|void pointer}0 argument to "
"'__builtin_launder' is not allowed">;		"'__builtin_launder' is not allowed">;

def err_preserve_field_info_not_field : Error<		def err_preserve_field_info_not_field : Error<
"__builtin_preserve_field_info argument %0 not a field access">;		"__builtin_preserve_field_info argument %0 not a field access">;
def err_preserve_field_info_not_const: Error<		def err_preserve_field_info_not_const: Error<
"__builtin_preserve_field_info argument %0 not a constant">;		"__builtin_preserve_field_info argument %0 not a constant">;
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions I think this diagnostic should be split out into a few diagnostics that explicitly cover the requirements. Something like: `'sycl_kernel' attribute only applies to a %select{templated function\|function returning 'void'\|etc}0`. It's best to avoid trying to send users to documentation if we can just tell them explicitly what they did wrong with their code. aaron.ballman: I think this diagnostic should be split out into a few diagnostics that explicitly cover the…

def err_bit_cast_non_trivially_copyable : Error<		def err_bit_cast_non_trivially_copyable : Error<
"__builtin_bit_cast %select{source\|destination}0 type must be trivially copyable">;		"__builtin_bit_cast %select{source\|destination}0 type must be trivially copyable">;
def err_bit_cast_type_size_mismatch : Error<		def err_bit_cast_type_size_mismatch : Error<
"__builtin_bit_cast source size does not equal destination size (%0 vs %1)">;		"__builtin_bit_cast source size does not equal destination size (%0 vs %1)">;

		// SYCL-specific diagnostics
		def warn_sycl_kernel_num_of_template_params : Warning<
		"'sycl_kernel' attribute only applies to a function template with at least"
		aaron.ballmanUnsubmitted Done Reply Inline Actions Do you mean template function or function template? A function template is a template used to generate functions and a template function is a function produced by a template. I think you probably mean "function template" here. aaron.ballman: Do you mean template function or function template? A function template is a template used to…
		" two template parameters">, InGroup<IgnoredAttributes>;
		def warn_sycl_kernel_invalid_template_param_type : Warning<
		"template parameter of a function template with the 'sycl_kernel' attribute"
		" cannot be a non-type template parameter">, InGroup<IgnoredAttributes>;
		aaron.ballmanUnsubmitted Done Reply Inline Actions This diagnostic reads a bit like you cannot do this: `template <class N>` when I think the actual restriction is that you cannot do this: `template <int N>`. Is that correct? If so, I think this could be worded as `template parameter of a function template with the 'sycl_kernel' attribute must be a template type parameter`. Just double-checking, but you also intend to prohibit template template parameters? e.g., you can't do `template <template <typename> typename C>` aaron.ballman: This diagnostic reads a bit like you cannot do this: `template <class N>` when I think the…
		baderAuthorUnsubmitted Done Reply Inline Actions This diagnostic reads a bit like you cannot do this: template <class N> when I think the actual restriction is that you cannot do this: template <int N>. Is that correct? Yes. That is correct. If so, I think this could be worded as template parameter of a function template with the 'sycl_kernel' attribute must be a template type parameter. Thanks! Applied your wording. Just double-checking, but you also intend to prohibit template template parameters? e.g., you can't do template <template <typename> typename C> Currently we allow following use case: https://github.com/intel/llvm/blob/sycl/clang/test/SemaSYCL/mangle-kernel.cpp. I assume it qualifies as "template type" and not as "template template" parameter. Right? Quoting SYCL specification $6.2 Naming of kernels (https://www.khronos.org/registry/SYCL/specs/sycl-1.2.1.pdf#page=250). SYCL kernels are extracted from C++ source files and stored in an implementation- defined format. In the case of the shared-source compilation model, the kernels have to be uniquely identified by both host and device compiler. This is required in order for the host runtime to be able to load the kernel by using the OpenCL host runtime interface. From this requirement the following rules apply for naming the kernels: • The kernel name is a C++ typename. • The kernel needs to have a globally-visible name. In the case of a named function object type, the name can be the typename of the function object, as long as it is globally-visible. In the case where it isn’t, a globally visible name has to be provided, as template parameter to the kernel invoking interface, as described in 4.8.5. In C++11, lambdas do not have a globally-visible name, so a globally-visible typename has to be provided in the kernel invoking interface, as described in 4.8.5. • The kernel name has to be a unique identifier in the program. We also have an extension, which lifts these restrictions/requirements when clang is used as host and device compiler. @erichkeane implemented built-in function (https://github.com/intel/llvm/pull/250) providing "unique identifier", which we use as a kernel name for lambda objects. But this is going to be a separate patch. bader: > This diagnostic reads a bit like you cannot do this: template <class N> when I think the…
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions Currently we allow following use case: https://github.com/intel/llvm/blob/sycl/clang/test/SemaSYCL/mangle-kernel.cpp. I assume it qualifies as "template type" and not as "template template" parameter. Right? Yeah, those are template types. A template template parameter would be: https://godbolt.org/z/9kwbW9 In that example, `C` is a template template parameter and `Ty` is a template type parameter. The part I'm a bit unclear on is why a template template parameter should be disallowed (I believe it names a type, as opposed to a non-type template parameter which names a value)? aaron.ballman: > Currently we allow following use case: https://github.
		baderAuthorUnsubmitted Done Reply Inline Actions I think Mariya implemented this restriction to avoid usages not required for SYCL kernel support implementation in run-time library. As it was mentioned before, this attribute is intended to be used by SYCL run-time library only and current implantation do not require `template template parameter` support. I think that this might be useful for alternative implementations, so I updated the patch to restrict non-type template parameters only. bader: I think Mariya implemented this restriction to avoid usages not required for SYCL kernel…
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions `can't` -> `cannot` aaron.ballman: `can't` -> `cannot`
		def warn_sycl_kernel_num_of_function_params : Warning<
		"function template with 'sycl_kernel' attribute must have a single parameter">,
		aaron.ballmanUnsubmitted Done Reply Inline Actions Probably "function template" here as well. aaron.ballman: Probably "function template" here as well.
		InGroup<IgnoredAttributes>;
		def warn_sycl_kernel_return_type : Warning<
		"function template with 'sycl_kernel' attribute must have a 'void' return type">,
		aaron.ballmanUnsubmitted Done Reply Inline Actions Same here. aaron.ballman: Same here.
		InGroup<IgnoredAttributes>;

} // end of sema component.		} // end of sema component.

clang/lib/Sema/SemaDeclAttr.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 6,406 Lines • ▼ Show 20 Lines	if (AL.getAttrName()->getName().find("read_write") != StringRef::npos) {
return;		return;
}		}
}		}
}		}

D->addAttr(::new (S.Context) OpenCLAccessAttr(S.Context, AL));		D->addAttr(::new (S.Context) OpenCLAccessAttr(S.Context, AL));
}		}

		static void handleSYCLKernelAttr(Sema &S, Decl *D, const ParsedAttr &AL) {
		// The 'sycl_kernel' attribute applies only to function templates.
		const auto *FD = cast<FunctionDecl>(D);
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions Spurious newline above and missing a full stop at the end of the comment. Comments below are also missing full stops. aaron.ballman: Spurious newline above and missing a full stop at the end of the comment. Comments below are…
		const FunctionTemplateDecl *FT = FD->getDescribedFunctionTemplate();
		assert(FT && "Function template is expected");

		// Function template must have at least two template parameters.
		const TemplateParameterList *TL = FT->getTemplateParameters();
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions You can replace all this with a `cast<FunctionDecl>(D)` because the common attribute handler already verifies the subject is correct. aaron.ballman: You can replace all this with a `cast<FunctionDecl>(D)` because the common attribute handler…
		if (TL->size() < 2) {
		S.Diag(FT->getLocation(), diag::warn_sycl_kernel_num_of_template_params);
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions I'd appreciate this being declared as a `const` pointer (same for the other nodes obtained through `FT`). aaron.ballman: I'd appreciate this being declared as a `const` pointer (same for the other nodes obtained…
		return;
		}

		// Template parameters must be typenames.
		for (unsigned I = 0; I < 2; ++I) {
		const NamedDecl *TParam = TL->getParam(I);
		aaron.ballmanUnsubmitted Not Done Reply Inline Actions If you switch the subject to `FunctionTemplate`, then I believe this predicate can also go away. aaron.ballman: If you switch the subject to `FunctionTemplate`, then I believe this predicate can also go away.
		if (isa<NonTypeTemplateParmDecl>(TParam)) {
		S.Diag(FT->getLocation(),
		diag::warn_sycl_kernel_invalid_template_param_type);
		return;
		}
		}

		// Function must have at least one argument.
		if (getFunctionOrMethodNumParams(D) != 1) {
		S.Diag(FT->getLocation(), diag::warn_sycl_kernel_num_of_function_params);
		return;
		}

		// Function must return void.
		QualType RetTy = getFunctionOrMethodResultType(D);
		if (!RetTy->isVoidType()) {
		S.Diag(FT->getLocation(), diag::warn_sycl_kernel_return_type);
		return;
		}

		handleSimpleAttribute<SYCLKernelAttr>(S, D, AL);
		}

static void handleDestroyAttr(Sema &S, Decl *D, const ParsedAttr &A) {		static void handleDestroyAttr(Sema &S, Decl *D, const ParsedAttr &A) {
if (!cast<VarDecl>(D)->hasGlobalStorage()) {		if (!cast<VarDecl>(D)->hasGlobalStorage()) {
S.Diag(D->getLocation(), diag::err_destroy_attr_on_non_static_var)		S.Diag(D->getLocation(), diag::err_destroy_attr_on_non_static_var)
<< (A.getKind() == ParsedAttr::AT_AlwaysDestroy);		<< (A.getKind() == ParsedAttr::AT_AlwaysDestroy);
return;		return;
}		}

if (A.getKind() == ParsedAttr::AT_AlwaysDestroy)		if (A.getKind() == ParsedAttr::AT_AlwaysDestroy)
▲ Show 20 Lines • Show All 311 Lines • ▼ Show 20 Lines	case ParsedAttr::AT_FlagEnum:
handleSimpleAttribute<FlagEnumAttr>(S, D, AL);		handleSimpleAttribute<FlagEnumAttr>(S, D, AL);
break;		break;
case ParsedAttr::AT_EnumExtensibility:		case ParsedAttr::AT_EnumExtensibility:
handleEnumExtensibilityAttr(S, D, AL);		handleEnumExtensibilityAttr(S, D, AL);
break;		break;
case ParsedAttr::AT_Flatten:		case ParsedAttr::AT_Flatten:
handleSimpleAttribute<FlattenAttr>(S, D, AL);		handleSimpleAttribute<FlattenAttr>(S, D, AL);
break;		break;
		case ParsedAttr::AT_SYCLKernel:
		handleSYCLKernelAttr(S, D, AL);
		break;
case ParsedAttr::AT_Format:		case ParsedAttr::AT_Format:
handleFormatAttr(S, D, AL);		handleFormatAttr(S, D, AL);
break;		break;
case ParsedAttr::AT_FormatArg:		case ParsedAttr::AT_FormatArg:
handleFormatArgAttr(S, D, AL);		handleFormatArgAttr(S, D, AL);
break;		break;
case ParsedAttr::AT_Callback:		case ParsedAttr::AT_Callback:
handleCallbackAttr(S, D, AL);		handleCallbackAttr(S, D, AL);
▲ Show 20 Lines • Show All 1,888 Lines • Show Last 20 Lines

clang/test/SemaSYCL/kernel-attribute-on-non-sycl.cpp

This file was added.

				// RUN: %clang_cc1 -std=c++11 -fsyntax-only -fsycl-is-device -verify %s
				// RUN: %clang_cc1 -std=c++11 -fsyntax-only -verify -x c++ %s

				#ifndef __SYCL_DEVICE_ONLY__
				// expected-warning@+7 {{'sycl_kernel' attribute ignored}}
				// expected-warning@+8 {{'sycl_kernel' attribute ignored}}
				#else
				// expected-no-diagnostics
				#endif

				template <typename T, typename A, int B>
				__attribute__((sycl_kernel)) void foo(T P);
				template <typename T, typename A, int B>
				[[clang::sycl_kernel]] void foo1(T P);

clang/test/SemaSYCL/kernel-attribute.cpp

This file was added.

				// RUN: %clang_cc1 -std=c++11 -fsyntax-only -fsycl-is-device -verify %s

				// Only function templates
				[[clang::sycl_kernel]] int gv2 = 0; // expected-warning {{'sycl_kernel' attribute only applies to function templates}}
				__attribute__((sycl_kernel)) int gv3 = 0; // expected-warning {{'sycl_kernel' attribute only applies to function templates}}

				aaron.ballmanUnsubmitted Done Reply Inline Actions Missing some tests: test that both attributes can be applied to whatever subjects they appertain to test that neither attribute can be applied to an incorrect subject test that the attributes do not accept arguments test that the attribute is ignored when SYCL is not enabled Are there situations where the attribute does not make sense, such as member functions, virtual functions, etc? If so, those are good test cases (and diagnostics) to add as well. aaron.ballman: Missing some tests: * test that both attributes can be applied to whatever subjects they…
				aaron.ballmanUnsubmitted Done Reply Inline Actions Still missing a test that the attribute is ignored when SYCL is not enabled. aaron.ballman: Still missing a test that the attribute is ignored when SYCL is not enabled.
				baderAuthorUnsubmitted Done Reply Inline Actions Still missing a test that the attribute is ignored when SYCL is not enabled. I think clang/test/SemaSYCL/kernel-attribute-on-non-sycl.cpp should check that. Please, let me know if you mean something else. This test should be on a templated function (we already demonstrated it only applies to templated functions, so the check for the argument is not what is failing). Nice catch. Thanks! bader: > Still missing a test that the attribute is ignored when SYCL is not enabled. I think…
				aaron.ballmanUnsubmitted Not Done Reply Inline Actions I think clang/test/SemaSYCL/kernel-attribute-on-non-sycl.cpp should check that. Please, let me know if you mean something else. Oh, you're correct, that was the test I was hoping for! aaron.ballman: > I think clang/test/SemaSYCL/kernel-attribute-on-non-sycl.cpp should check that. Please, let…
				__attribute__((sycl_kernel)) void foo(); // expected-warning {{'sycl_kernel' attribute only applies to function templates}}
				aaron.ballmanUnsubmitted Done Reply Inline Actions This test should be on a templated function (we already demonstrated it only applies to templated functions, so the check for the argument is not what is failing). aaron.ballman: This test should be on a templated function (we already demonstrated it only applies to…
				[[clang::sycl_kernel]] void foo1(); // expected-warning {{'sycl_kernel' attribute only applies to function templates}}
				aaron.ballmanUnsubmitted Done Reply Inline Actions Same here. aaron.ballman: Same here.

				// Attribute takes no arguments
				template <typename T, typename A>
				__attribute__((sycl_kernel(1))) void foo(T P); // expected-error {{'sycl_kernel' attribute takes no arguments}}
				template <typename T, typename A, int I>
				[[clang::sycl_kernel(1)]] void foo1(T P);// expected-error {{'sycl_kernel' attribute takes no arguments}}

				// At least two template parameters
				template <typename T>
				__attribute__((sycl_kernel)) void foo(T P); // expected-warning {{'sycl_kernel' attribute only applies to a function template with at least two template parameters}}
				template <typename T>
				[[clang::sycl_kernel]] void foo1(T P); // expected-warning {{'sycl_kernel' attribute only applies to a function template with at least two template parameters}}

				// First two template parameters cannot be non-type template parameters
				template <typename T, int A>
				__attribute__((sycl_kernel)) void foo(T P); // expected-warning {{template parameter of a function template with the 'sycl_kernel' attribute cannot be a non-type template parameter}}
				template <int A, typename T>
				[[clang::sycl_kernel]] void foo1(T P); // expected-warning {{template parameter of a function template with the 'sycl_kernel' attribute cannot be a non-type template parameter}}

				// Must return void
				template <typename T, typename A>
				__attribute__((sycl_kernel)) int foo(T P); // expected-warning {{function template with 'sycl_kernel' attribute must have a 'void' return type}}
				template <typename T, typename A>
				[[clang::sycl_kernel]] int foo1(T P); // expected-warning {{function template with 'sycl_kernel' attribute must have a 'void' return type}}

				// Must take at least one argument
				template <typename T, typename A>
				__attribute__((sycl_kernel)) void foo(); // expected-warning {{function template with 'sycl_kernel' attribute must have a single parameter}}
				template <typename T, typename A>
				[[clang::sycl_kernel]] void foo1(T t, A a); // expected-warning {{function template with 'sycl_kernel' attribute must have a single parameter}}

				// No diagnostics
				template <typename T, typename A>
				__attribute__((sycl_kernel)) void foo(T P);
				template <typename T, typename A, int I>
				[[clang::sycl_kernel]] void foo1(T P);

This is an archive of the discontinued LLVM Phabricator instance.

[SYCL] Add sycl_kernel attribute for accelerated code outliningClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 231918

clang/include/clang/Basic/Attr.td

clang/include/clang/Basic/AttrDocs.td

clang/include/clang/Basic/DiagnosticSemaKinds.td

clang/lib/Sema/SemaDeclAttr.cpp

clang/test/SemaSYCL/kernel-attribute-on-non-sycl.cpp

clang/test/SemaSYCL/kernel-attribute.cpp

[SYCL] Add sycl_kernel attribute for accelerated code outlining
ClosedPublic