This is an archive of the discontinued LLVM Phabricator instance.

[AMDGPU] Allow narrowing muti-dword loads
ClosedPublic

Authored by rampitec on Jan 21 2020, 12:54 PM.

Details

Summary

Currently BE allows only a little load narrowing because
of the fear it will produce sub-dword ext loads. However,
we can always allow narrowing if we are shrinking one
multi-dword load to another multi-dword load.

In particular we were unable to reduce s_load_dwordx8 into
s_load_dwordx4 if identity shuffle was used to extract
low 4 dwords.

Diff Detail

Event Timeline

rampitec created this revision.Jan 21 2020, 12:54 PM
Herald added a project: Restricted Project. · View Herald TranscriptJan 21 2020, 12:54 PM
arsenm accepted this revision.Jan 21 2020, 1:04 PM

LGTM

This revision is now accepted and ready to land.Jan 21 2020, 1:04 PM
This revision was automatically updated to reflect the committed changes.