This is an archive of the discontinued LLVM Phabricator instance.

[AMDGPU] Cast sub-dword elements to i32 in concat_vectors
ClosedPublic

Authored by rampitec on Jan 9 2023, 3:05 PM.

Details

Summary

This produces better code by avoiding repacking in some cases.

Fixes: SWDEV-373436

Diff Detail

Event Timeline

rampitec created this revision.Jan 9 2023, 3:05 PM
Herald added a project: Restricted Project. · View Herald TranscriptJan 9 2023, 3:05 PM
rampitec requested review of this revision.Jan 9 2023, 3:05 PM
Herald added a project: Restricted Project. · View Herald TranscriptJan 9 2023, 3:05 PM
Herald added a subscriber: wdng. · View Herald Transcript
arsenm accepted this revision.Jan 9 2023, 3:15 PM

Commit message? Can?

llvm/test/CodeGen/AMDGPU/vector_shuffle.packed.ll
2315

Not sure we'll reach here with i8 vectors

This revision is now accepted and ready to land.Jan 9 2023, 3:15 PM
rampitec retitled this revision from [AMDGPU] Can sub-dword elements to i32 in concat_vectors to [AMDGPU] Cast sub-dword elements to i32 in concat_vectors.Jan 9 2023, 3:17 PM
rampitec marked an inline comment as done.Jan 9 2023, 3:21 PM
rampitec added inline comments.
llvm/test/CodeGen/AMDGPU/vector_shuffle.packed.ll
2315

Not now, but in case we make it legal it is better to have a test.

This revision was landed with ongoing or failed builds.Jan 9 2023, 3:36 PM
This revision was automatically updated to reflect the committed changes.
rampitec marked an inline comment as done.