This is an archive of the discontinued LLVM Phabricator instance.

AMDGPU: Partially directly select llvm.amdgcn.interp.p1.f16
Closed · Public

Authored by arsenm on Jan 2 2020, 6:55 AM.

Details

Summary

The 16-bank LDS case is more complicated because it requires multiple
instructions. When I attempt to write a pattern for it, the generated
selector incorrectly places the copy to m0 after the first
instruction, so that case needs to be addressed separately.

Also fix the copy to m0 not being glued to the second operation in the
second half of the 16-bank lowering.

Event Timeline

arsenm created this revision. Jan 2 2020, 6:55 AM
kerbowa accepted this revision. Jan 14 2020, 2:20 PM
kerbowa added a subscriber: kerbowa.

LGTM

This revision is now accepted and ready to land. Jan 14 2020, 2:20 PM
arsenm closed this revision. Jan 15 2020, 6:03 AM