This is an archive of the discontinued LLVM Phabricator instance.

AMDGPU: Use ds_{read|write}_b128
ClosedPublic

Authored by arsenm on Feb 9 2017, 6:21 PM.

Details

Reviewers
kzhuravl
Summary

It's not clear to me if this is always better than
doing ds_write2_b64 This adds the constraint of
a 128-bit register input instead of a pair of
64-bit.

Diff Detail

Event Timeline

arsenm created this revision.Feb 9 2017, 6:21 PM
arsenm updated this revision to Diff 88996.Feb 17 2017, 5:03 PM

Split use into separate patch until performance implications can be investigated

kzhuravl accepted this revision.Feb 28 2017, 8:51 AM

LGTM. Title should also reflect that this is just a definition of instructions, but not a use yet.

This revision is now accepted and ready to land.Feb 28 2017, 8:51 AM
arsenm closed this revision.Feb 28 2017, 12:27 PM

r296512