This is an archive of the discontinued LLVM Phabricator instance.

AMDGPU: Custom lower v4i16/v4f16 vector operations
ClosedPublic

Authored by arsenm on May 14 2018, 4:08 AM.

Details

Summary

Custom lowering avoids stack access for these vector operations.

Also handle the extract-high-element pattern from truncate + shift
to avoid a couple of test regressions.
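
As a rough illustration of the bit-level idea only (this is not code from the patch), the sketch below models a v4i16 value as a single 64-bit integer and extracts a lane with shifts and truncation rather than a round trip through memory. The helper names `packV4I16` and `extractLane` are hypothetical, and the lane order (lane 0 in the low bits) is an assumption made for the example.

```cpp
#include <array>
#include <cstdint>

// Hypothetical helper: pack four 16-bit lanes into one 64-bit value.
// Assumption for this sketch: lane 0 occupies the lowest 16 bits.
constexpr std::uint64_t packV4I16(const std::array<std::uint16_t, 4> &v) {
  return static_cast<std::uint64_t>(v[0]) |
         (static_cast<std::uint64_t>(v[1]) << 16) |
         (static_cast<std::uint64_t>(v[2]) << 32) |
         (static_cast<std::uint64_t>(v[3]) << 48);
}

// Hypothetical helper: extract lane idx (0..3) without touching memory.
// Step 1: pick the 32-bit half that holds the lane (a v2i16-sized piece).
// Step 2: the low lane is a plain truncate; the high lane is a
//         logical shift right by 16 followed by a truncate.
constexpr std::uint16_t extractLane(std::uint64_t packed, unsigned idx) {
  const std::uint32_t half =
      static_cast<std::uint32_t>(packed >> (32 * (idx / 2)));
  return (idx % 2 != 0) ? static_cast<std::uint16_t>(half >> 16)
                        : static_cast<std::uint16_t>(half);
}
```

The high-lane branch is the "truncate + shift" shape mentioned in the summary: a truncate of a value shifted right by 16 reads the same bits as extracting the high element of a packed pair of 16-bit lanes.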

Event Timeline

arsenm created this revision. May 14 2018, 4:08 AM
rampitec added inline comments. May 14 2018, 10:08 AM
lib/Target/AMDGPU/SIISelLowering.cpp, line 4178

Need to bail if the vector size is not the expected 64 bits.

arsenm updated this revision to Diff 146905. May 15 2018, 1:24 PM

Add more asserts

This revision is now accepted and ready to land. May 15 2018, 1:27 PM
arsenm closed this revision. May 16 2018, 4:51 AM

r332453