If implicitarg_ptr intrinsic is not used, set implicit kernarg size to 0, otherwise
set it to 256 bytes for code object version 5 (and beyond).
Details
Diff Detail
Time | Test | |
---|---|---|
60,030 ms | x64 debian > libFuzzer.libFuzzer::large.test |
Event Timeline
This can be tested in the existing llvm/test/CodeGen/AMDGPU/llvm.amdgcn.implicitarg.ptr.ll
This will make things much complicated since version 5 will generates quite different code from version 2, also
the dumped variable names are different. Maybe we should only check the kernarg size for version 5.
llvm/test/CodeGen/AMDGPU/llvm.amdgcn.implicitarg.ptr.ll | ||
---|---|---|
5 ↗ | (On Diff #421061) | This is broken. You need to add back the GCN check prefix |
llvm/test/CodeGen/AMDGPU/llvm.amdgcn.implicitarg.ptr.ll | ||
---|---|---|
5 ↗ | (On Diff #421061) | Can you tell how this is broken? It passed the test for my check. |
For the LIT test, add back the GCN check prefix. For the GCN checks that are common to code object version 2 and MESA but different from code object version 5, we split the checks to HSA (version 2) and MESA.