This is an archive of the discontinued LLVM Phabricator instance.

R600: Fix inconsistency in rsq instructions.
ClosedPublic

Authored by arsenm on Jun 18 2014, 9:23 PM.

Details

Reviewers
arsenm
Summary

R600 was using a clamped version of rsq, but SI was not. Add a
new rsq_clamped intrinsic and use the two intrinsics consistently.

The documentation does not make clear to me what behavior
the R600 instructions have, so I assume they have the legacy behavior
described by the SI documents. For R600, use RECIPSQRT_IEEE
for both llvm.AMDGPU.rsq.legacy and llvm.AMDGPU.rsq. R600 also
has RECIPSQRT_FF, and I'm not sure how it fits in here.
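
For readers unfamiliar with the clamped/unclamped distinction in the summary, here is a minimal scalar sketch. It assumes "clamped" means the result is limited to +/-MAX_FLOAT rather than being allowed to reach infinity. That is an assumption about the hardware semantics, not something this revision states. The function names `rsq` and `rsq_clamped_sketch` are hypothetical and are not the intrinsics or instruction definitions from the patch.

```cpp
#include <algorithm>
#include <cmath>
#include <limits>

// Plain reciprocal square root: 1 / sqrt(x).
// On IEEE-754 hardware, +0.0f yields +inf and -0.0f yields -inf.
float rsq(float x) {
    return 1.0f / std::sqrt(x);
}

// ASSUMPTION: the clamped variant limits the result to the largest
// finite float magnitude instead of returning an infinity.
// NaN results (for example from negative inputs) are passed through.
float rsq_clamped_sketch(float x) {
    const float max_float = std::numeric_limits<float>::max();
    const float r = rsq(x);
    if (std::isnan(r))
        return r;
    return std::min(std::max(r, -max_float), max_float);
}
```

Under that assumption the two functions agree for ordinary inputs and differ only where the unclamped result would be infinite, such as an input of zero. This is why selecting one variant on R600 and the other on SI for the same intrinsic would give inconsistent results.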

Event Timeline

arsenm updated this revision to Diff 10602. Jun 18 2014, 9:23 PM
arsenm retitled this revision to R600: Fix inconsistency in rsq instructions.
arsenm updated this object.
arsenm edited the test plan for this revision.
arsenm added a subscriber: Unknown Object (MLST).
arsenm accepted this revision. Jun 24 2014, 3:22 PM
arsenm added a reviewer: arsenm.

r211637

This revision is now accepted and ready to land. Jun 24 2014, 3:22 PM
arsenm closed this revision. Jun 24 2014, 3:22 PM