In some cases, the wrong amount of registers was reserved.
Also enable more v3f16 tests.
@rdomingu, does the generated code look ok?
Paths
| Differential D90847
[AMDGPU] Fix v3f16 interaction with image store workaround ClosedPublic Authored by sebastian-ne on Nov 5 2020, 6:12 AM.
Details Summary In some cases, the wrong amount of registers was reserved. Also enable more v3f16 tests. @rdomingu, does the generated code look ok?
Diff Detail
Event Timeline
sebastian-ne retitled this revision from [AMDGPU] Enable more v3f16 tests, NFC to [AMDGPU] Fix v3f16 interaction with image store workaround.Nov 10 2020, 8:58 AM
This revision is now accepted and ready to land.Nov 18 2020, 9:14 AM This revision was landed with ongoing or failed builds.Nov 18 2020, 9:21 AM Closed by commit rG72ccec1bbc98: [AMDGPU] Fix v3f16 interaction with image store workaround (authored by sebastian-ne). · Explain Why This revision was automatically updated to reflect the committed changes.
Revision Contents
Diff 306137 llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.cpp
llvm/lib/Target/AMDGPU/SIISelLowering.cpp
llvm/test/CodeGen/AMDGPU/GlobalISel/legalize-llvm.amdgcn.image.store.2d.d16.ll
llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.image.load.1d.d16.ll
llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.image.store.2d.d16.ll
llvm/test/CodeGen/AMDGPU/llvm.amdgcn.image.d16.dim.ll
|
AFAIK, gfx810 can't pack v3f16 when d16 is set because of a hardware bug. So this instruction should use 3 vgpr (not 2 vgpr).