When it is AReg_1024 this results in unnecessary copying into
AGPRs of a 32 element vectors even though they are not intended
for an mfma instruction.
Details
Details
Diff Detail
Diff Detail
Paths
| Differential D64815
[AMDGPU] Change register type for v32 vectors ClosedPublic Authored by rampitec on Jul 16 2019, 12:55 PM.
Details Summary When it is AReg_1024 this results in unnecessary copying into
Diff Detail Event TimelineHerald added subscribers: t-tye, tpr, dstuttard and 4 others. · View Herald TranscriptJul 16 2019, 12:55 PM This revision is now accepted and ready to land.Jul 16 2019, 12:57 PM
Closed by commit rL366252: [AMDGPU] Change register type for v32 vectors (authored by rampitec). · Explain WhyJul 16 2019, 1:05 PM This revision was automatically updated to reflect the committed changes.
Revision Contents
Diff 210154 lib/Target/AMDGPU/SIISelLowering.cpp
test/CodeGen/AMDGPU/v1024.ll
|
Can you add a comment explaining what this tests