This is an archive of the discontinued LLVM Phabricator instance.

AMDGPU: Split wide vectors of i16/f16 into 32-bit regs on calls
ClosedPublic

Authored by arsenm on Jul 12 2018, 10:37 AM.

Details

Reviewers
rampitec
Summary

This improves the generated code for the same reasons as scalarizing
vectors of 32-bit elements.
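
As a rough illustration of the register arithmetic the title implies, the sketch below counts how many 32-bit registers a vector argument would occupy. It is not taken from the patch: the helper name `regsForVectorArg` is hypothetical, and the handling of an odd element count (rounding up to a whole register) is an assumption.

```cpp
// Hypothetical helper, not code from this revision: number of 32-bit
// registers needed to pass a vector of `numElts` elements, each
// `eltBits` bits wide, as a call argument.
constexpr unsigned regsForVectorArg(unsigned eltBits, unsigned numElts) {
  // Two 16-bit elements (i16 or f16) share one 32-bit register.
  // Assumption: an odd trailing element still takes a whole register.
  if (eltBits == 16)
    return (numElts + 1) / 2;
  // 32-bit elements are scalarized: one register per element.
  if (eltBits == 32)
    return numElts;
  // Other element widths are outside the scope of this sketch.
  return 0;
}
```

Under these assumptions, a `<4 x half>` argument would occupy two 32-bit registers, and a `<3 x i16>` argument would also occupy two.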

Event Timeline

arsenm created this revision. Jul 12 2018, 10:37 AM
This revision is now accepted and ready to land. Jul 12 2018, 10:48 AM
arsenm closed this revision. Jul 31 2018, 12:18 PM

r338418