Using REG_SEQUENCE produces better code than INSERT_SUBREG,
we can omit one move instruction in many cases.
Fixes: SWDEV-298028
Paths
| Differential D107602
[AMDGPU] Improve v2i32/v2f32 insertelt patterns ClosedPublic Authored by rampitec on Aug 5 2021, 2:41 PM.
Details
Summary Using REG_SEQUENCE produces better code than INSERT_SUBREG, Fixes: SWDEV-298028
Diff Detail
Unit TestsFailed
Event TimelineHerald added subscribers: foad, kerbowa, hiraditya and 7 others. · View Herald TranscriptAug 5 2021, 2:41 PM This revision is now accepted and ready to land.Aug 5 2021, 4:05 PM This revision was landed with ongoing or failed builds.Aug 5 2021, 4:14 PM Closed by commit rGd71924fbfef2: [AMDGPU] Improve v2i32/v2f32 insertelt patterns (authored by rampitec). · Explain Why This revision was automatically updated to reflect the committed changes.
Revision Contents
Diff 364621 llvm/lib/Target/AMDGPU/SIInstructions.td
llvm/test/CodeGen/AMDGPU/insert_vector_elt.ll
|