TFE bit effectively increases size of VData by 32bit so we need _V* variants in
tablegen to account for that.
Note that this change for cmpswap also renames _V1_V*_ into _V2_V*_ and _V2_V*_
into _V4_V*_ for VReg_64 and VReg_128 respectively.
Paths
| Differential D130763
[AMDGPU] Account for VData size increase from tfe bit for image instructions AbandonedPublic Authored by mbrkusanin on Jul 29 2022, 5:30 AM.
Details
Diff Detail
Unit TestsFailed Event Timelinembrkusanin added a child revision: D130764: [AMDGPU] Enable image_gather4h instruction for gfx10 and gfx11.Jul 29 2022, 5:31 AM Comment Actions Does TFE actually work for atomics? Do we not have any IR tests for TFE?
This revision is now accepted and ready to land.Jul 29 2022, 5:55 AM Comment Actions I could not find the clear answer in the docs. But looking at .ll tests it does not seems like it is used. So tfe is probably not supported for atomics or gather4. mbrkusanin removed a child revision: D130764: [AMDGPU] Enable image_gather4h instruction for gfx10 and gfx11.Jul 29 2022, 6:13 AM Comment Actions
If I were to guess it's something that works by default but isn't actually tested
Revision Contents
Diff 448596 llvm/lib/Target/AMDGPU/MIMGInstructions.td
llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.image.atomic.dim.mir
llvm/test/CodeGen/AMDGPU/release-vgprs.mir
llvm/test/MC/AMDGPU/gfx10_asm_mimg.s
llvm/test/MC/AMDGPU/gfx11_asm_mimg.s
llvm/test/MC/Disassembler/AMDGPU/gfx10_mimg.txt
llvm/test/MC/Disassembler/AMDGPU/gfx11_mimg.txt
llvm/test/MC/Disassembler/AMDGPU/mimg_vi.txt
|
Why did this get an additional output register if it's not using TFE?