Update intrinsics to use n x f16 and n x i16 instead
of 32-bit types. This may avoid the need for a bitcast
and is probably less confusing.
Depends on making v16f16 and v16i16 types legal.
Paths
| Differential D128951
[AMDGPU] Update WMMA intrinsics with explicit f16 types ClosedPublic Authored by piotr on Jun 30 2022, 3:01 PM.
Details Summary Update intrinsics to use n x f16 and n x i16 instead Depends on making v16f16 and v16i16 types legal.
Diff Detail
Event TimelineThis revision is now accepted and ready to land.Jun 30 2022, 3:04 PM This revision was landed with ongoing or failed builds.Jun 30 2022, 11:56 PM Closed by commit rGb6ef36a1c427: [AMDGPU] Update WMMA intrinsics with explicit f16 types (authored by piotr). · Explain Why This revision was automatically updated to reflect the committed changes.
Revision Contents
Diff 441607 llvm/include/llvm/IR/IntrinsicsAMDGPU.td
llvm/lib/Target/AMDGPU/SIInstrInfo.td
llvm/lib/Target/AMDGPU/VOP3PInstructions.td
llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.wmma_32.ll
llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.wmma_64.ll
llvm/test/CodeGen/AMDGPU/llvm.amdgcn.wmma_32.ll
llvm/test/CodeGen/AMDGPU/llvm.amdgcn.wmma_64.ll
llvm/test/CodeGen/AMDGPU/wmma_multiple_32.ll
llvm/test/CodeGen/AMDGPU/wmma_multiple_64.ll
|