This is cheap to implement, means less work for future passes like
MachineDCE, and slightly improves the folding in some cases.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Paths
| Differential D100117
[AMDGPU] SIFoldOperands: eagerly delete dead copies ClosedPublic Authored by foad on Apr 8 2021, 8:42 AM.
Details Summary This is cheap to implement, means less work for future passes like
Diff Detail
Event TimelineHerald added subscribers: kerbowa, hiraditya, t-tye and 6 others. · View Herald TranscriptApr 8 2021, 8:42 AM
This revision is now accepted and ready to land.Apr 9 2021, 5:28 AM This revision was landed with ongoing or failed builds.Apr 9 2021, 5:57 AM Closed by commit rGa4ced03d3425: [AMDGPU] SIFoldOperands: eagerly delete dead copies (authored by foad). · Explain Why This revision was automatically updated to reflect the committed changes.
Revision Contents
Diff 336421 llvm/lib/Target/AMDGPU/SIFoldOperands.cpp
llvm/test/CodeGen/AMDGPU/GlobalISel/cvt_f32_ubyte.ll
llvm/test/CodeGen/AMDGPU/flat-scratch-fold-fi.mir
llvm/test/CodeGen/AMDGPU/fold-cndmask-wave32.mir
llvm/test/CodeGen/AMDGPU/fold-cndmask.mir
llvm/test/CodeGen/AMDGPU/fold-imm-f16-f32.mir
llvm/test/CodeGen/AMDGPU/fold-operands-order.mir
llvm/test/CodeGen/AMDGPU/fold-readlane.mir
llvm/test/CodeGen/AMDGPU/huge-number-operand-folds.mir
llvm/test/CodeGen/AMDGPU/promote-constOffset-to-imm.ll
llvm/test/CodeGen/AMDGPU/udiv64.ll
llvm/test/CodeGen/AMDGPU/urem64.ll
|
Slightly worried about the case where the only remaining copy user is a debug instruction