Added support for CUDA launch_bounds attribute to CodeGen.
Sema does have a CUDALaunchBoundsAttr, but CodeGen was doing nothing with it.
This change translates CUDALaunchBoundsAttr to maxntidx and minctasm
metadata, which NVPTX then translates to the correct PTX directives.
Details
Details
- Reviewers
jholewinski eliben rnk
Diff Detail
Diff Detail
Event Timeline
| lib/CodeGen/TargetInfo.cpp | ||
|---|---|---|
| 4769 | Couple of notes:
| |
| 4858 | Put this comment above the add* call above too (OpenCL case)? | |
| 4863 | Is it semantically correct to just populate the "x" here, rather than for all dimensions? | |
| 4865 | no need for const? | |
| 4866 | Explain the default=0 case in a comment here | |
Couple of notes: