Added support for CUDA launch_bounds attribute to CodeGen.
Sema already has a CUDALaunchBoundsAttr, but CodeGen did nothing with it.
This change translates CUDALaunchBoundsAttr into maxntidx and minctasm
metadata, which NVPTX then translates into the corresponding PTX directives.
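
As a rough illustration of the mapping described above, the sketch below models it in plain C++. It is not the code from this diff: the function name `launchBoundsAnnotations` and the `Annotation` alias are hypothetical, and the rule that a zero min-blocks value produces no minctasm entry is an assumption based on the "default=0" case raised in the inline comments.

```cpp
#include <string>
#include <utility>
#include <vector>

// Hypothetical (name, value) pair standing in for one metadata entry
// attached to a kernel function.
using Annotation = std::pair<std::string, int>;

// Sketch of the translation: maxThreads and minBlocks stand in for the two
// arguments of __launch_bounds__(maxThreads, minBlocks). minBlocks is
// optional in CUDA source; this sketch assumes it reads as 0 when omitted.
std::vector<Annotation> launchBoundsAnnotations(int maxThreads,
                                                int minBlocks = 0) {
  std::vector<Annotation> out;
  // Only the x dimension is populated, matching the names in the summary.
  out.emplace_back("maxntidx", maxThreads);
  // Assumption: a value of 0 means "not specified", so nothing is emitted.
  if (minBlocks > 0)
    out.emplace_back("minctasm", minBlocks);
  return out;
}
```

Under these assumptions, a kernel declared with `__launch_bounds__(256)` would carry only a maxntidx entry, while `__launch_bounds__(128, 4)` would carry both maxntidx and minctasm. On the PTX side these are expected to surface as the `.maxntid` and `.minnctapersm` directives.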
Details
- Reviewers
jholewinski eliben rnk
Diff Detail
Event Timeline
lib/CodeGen/TargetInfo.cpp | |
---|---|
4767–4770 | Couple of notes: |
4842 | Put this comment above the add* call above too (OpenCL case)? |
4847 | Is it semantically correct to just populate the "x" here, rather than for all dimensions? |
4849 | no need for const? |
4850 | Explain the default=0 case in a comment here |
Couple of notes: