Some VOP3P operands are s32 values, so a 32-bit fneg can feed them
directly with no intervening cast. Make sure the fneg is really a
packed fneg before folding it.
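The check described above can be sketched with a small stand-alone model. This is an illustrative sketch only, not the code from the patch: `Ty`, `Opcode`, `Value`, `Selected`, and the modifier bit values are hypothetical stand-ins for the GlobalISel machinery, and `selectVOP3PMods` here only mirrors the idea of folding an fneg into the neg/neg_hi source modifiers when the negated value is a packed two-element 16-bit vector.

```cpp
#include <cassert>

// Hypothetical stand-ins for GlobalISel types; none of these are LLVM APIs.
enum class Ty { S32, V2S16 };
enum class Opcode { Other, FNeg };

struct Value {
  Opcode Op;
  Ty Type;
  const Value *Src; // operand of an FNeg, otherwise nullptr
};

// Source modifier bits; the numeric values are illustrative assumptions.
constexpr unsigned NEG = 1u << 0;    // negate the low 16-bit half
constexpr unsigned NEG_HI = 1u << 1; // negate the high 16-bit half

struct Selected {
  const Value *Src; // value to use as the instruction operand
  unsigned Mods;    // modifiers folded into the operand
};

// Fold an fneg into the packed modifiers only when it is a packed fneg.
// A 32-bit fneg flips just bit 31, while NEG | NEG_HI flips the sign of
// each 16-bit half, so folding an s32 fneg this way would be wrong.
Selected selectVOP3PMods(const Value &V) {
  const Value *Src = &V;
  unsigned Mods = 0;
  if (Src->Op == Opcode::FNeg && Src->Type == Ty::V2S16) {
    Mods ^= NEG | NEG_HI;
    Src = Src->Src; // look through the fneg
  }
  return {Src, Mods};
}
```

In this model, a v2s16 fneg is looked through and replaced by both modifier bits, while an s32 fneg is left in place as an ordinary operand with no modifiers set.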
Differential D74849
AMDGPU/GlobalISel: Fix incorrect VOP3P fneg folding
Closed, Public
Authored by arsenm on Feb 19 2020, 9:37 AM.
Event Timeline
arsenm added a parent revision: D74848: AMDGPU: Move dot intrinsic patterns to instruction def. Feb 19 2020, 9:44 AM
This revision is now accepted and ready to land. Feb 24 2020, 11:24 AM
Revision Contents
Diff 245444
llvm/lib/Target/AMDGPU/AMDGPUInstructionSelector.cpp
llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.fdot2.ll
llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.sdot2.ll
llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.sdot4.ll
llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.sdot8.ll
llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.udot2.ll
llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.udot4.ll
llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.udot8.ll