This is an archive of the discontinued LLVM Phabricator instance.

[AMDGPU] Change register type for v32 vectors
ClosedPublic

Authored by rampitec on Jul 16 2019, 12:55 PM.

Details

Summary

When it is AReg_1024 this results in unnecessary copying into
AGPRs of a 32 element vectors even though they are not intended
for an mfma instruction.

Diff Detail

Repository
rL LLVM

Event Timeline

rampitec created this revision.Jul 16 2019, 12:55 PM
arsenm accepted this revision.Jul 16 2019, 12:57 PM
arsenm added inline comments.
test/CodeGen/AMDGPU/v1024.ll
8 ↗(On Diff #210152)

You can uses GCN-COUNT-<number>

This revision is now accepted and ready to land.Jul 16 2019, 12:57 PM
arsenm added inline comments.Jul 16 2019, 12:57 PM
test/CodeGen/AMDGPU/v1024.ll
2 ↗(On Diff #210152)

Can you add a comment explaining what this tests

rampitec updated this revision to Diff 210154.Jul 16 2019, 1:04 PM
rampitec marked 2 inline comments as done.

Updated test.

This revision was automatically updated to reflect the committed changes.
Herald added a project: Restricted Project. · View Herald TranscriptJul 16 2019, 1:05 PM