This is an archive of the discontinued LLVM Phabricator instance.

[AMDGPU] Make v4i64/v4f64/v8i64/v8f64 legal
ClosedPublic

Authored by rampitec on May 12 2020, 2:40 PM.

Details

Summary

We can produce such vectors in the Promote Alloca pass,
but we are unable to use movrel to operate it and lower
via scratch. Making it legal makes SI_INDIRECT patterns
work.

There is more work to do in subsequent changes:

  1. We initialize m0 twice to access each dword. It shall

be possible to only do it once and increment base register
number instead.

  1. We also need v16i64/v16f64 but these first need to be

added to tablegen.

Diff Detail

Event Timeline

rampitec created this revision.May 12 2020, 2:40 PM
Herald added a project: Restricted Project. · View Herald TranscriptMay 12 2020, 2:40 PM
arsenm accepted this revision.May 12 2020, 3:13 PM
This revision is now accepted and ready to land.May 12 2020, 3:13 PM
This revision was automatically updated to reflect the committed changes.