If implicitarg_ptr intrinsic is not used, set implicit kernarg size to 0, otherwise
set it to 256 bytes for code object version 5 (and beyond).
Details
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
This can be tested in the existing llvm/test/CodeGen/AMDGPU/llvm.amdgcn.implicitarg.ptr.ll
This will make things much complicated since version 5 will generates quite different code from version 2, also
the dumped variable names are different. Maybe we should only check the kernarg size for version 5.
llvm/test/CodeGen/AMDGPU/llvm.amdgcn.implicitarg.ptr.ll | ||
---|---|---|
5 | This is broken. You need to add back the GCN check prefix |
llvm/test/CodeGen/AMDGPU/llvm.amdgcn.implicitarg.ptr.ll | ||
---|---|---|
5 | Can you tell how this is broken? It passed the test for my check. |
For the LIT test, add back the GCN check prefix. For the GCN checks that are common to code object version 2 and MESA but different from code object version 5, we split the checks to HSA (version 2) and MESA.
This is broken. You need to add back the GCN check prefix