- Remove use of the opencl and amdopencl environment member of the target triple for the AMDGPU target.
- Use a function attribute to communicate to the AMDGPU backend.
Details
- Reviewers
yaxunl kzhuravl arsenm - Commits
- rG1a3f3a2d1494: [AMDGPU] Remove use of OpenCL triple environment and replace with function…
rC328347: [AMDGPU] Remove use of OpenCL triple environment and replace with function…
rL328347: [AMDGPU] Remove use of OpenCL triple environment and replace with function…
Diff Detail
Event Timeline
- Changed to use function attribute that specifies the number of bytes of implicit arguments for OpenCL kernels when using the AMDHSA OS.
lib/CodeGen/TargetInfo.cpp | ||
---|---|---|
7661 | Now we support enqueue_kernel, there are two extra hidden arguments. Totally 6 hidden arguments. The implicitarg-num-bytes should be 48 now. |
lib/CodeGen/TargetInfo.cpp | ||
---|---|---|
7661 | Can you give the ABI for these implicit arguments so can update the AMDGUUsage document? |
lib/CodeGen/TargetInfo.cpp | ||
---|---|---|
7661 | The first 3 implicit arguments are x, y, z components of global offset. |
lib/CodeGen/TargetInfo.cpp | ||
---|---|---|
7661 | Added description to AMDGPUUsage for current 4 implicit arguments. Will address change for device enqueue as a separate patch. |
-mcpu=gfx900