This is an archive of the discontinued LLVM Phabricator instance.

AMDGPU/GFX9: Do not use v_pack_b32_f16 when packing
ClosedPublic

Authored by kzhuravl on Apr 7 2017, 11:08 AM.

Details

Summary

v_pack_b32_f16 always flushes denormals (regardless of the f16/f64 denorm bits in pgm rsrc) and treats its inputs as FP16, so it does not preserve arbitrary 16-bit values when packing them.
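
To illustrate why flushing matters for a pack operation, here is a minimal sketch in Python that models the two behaviors on raw 16-bit patterns. It is not taken from the patch: the helper names (`flush_f16_denorm`, `pack_bitwise`, `pack_with_flush`) are hypothetical, and the assumption that a flushed denormal becomes a zero with its sign bit kept is a modeling choice, not a statement about the hardware.

```python
# Illustrative model only. All names are hypothetical; none are LLVM or ISA APIs.

F16_SIGN_MASK = 0x8000  # bit 15
F16_EXP_MASK = 0x7C00   # bits 14..10
F16_MANT_MASK = 0x03FF  # bits 9..0


def flush_f16_denorm(bits: int) -> int:
    """Replace an FP16 denormal bit pattern with a zero of the same sign.

    Assumption: the sign bit is kept on flush. Other patterns pass through.
    """
    bits &= 0xFFFF
    is_denorm = (bits & F16_EXP_MASK) == 0 and (bits & F16_MANT_MASK) != 0
    return bits & F16_SIGN_MASK if is_denorm else bits


def pack_bitwise(lo: int, hi: int) -> int:
    """Pack two 16-bit values into 32 bits without interpreting them."""
    return ((hi & 0xFFFF) << 16) | (lo & 0xFFFF)


def pack_with_flush(lo: int, hi: int) -> int:
    """Pack two 16-bit values after treating each as FP16 and flushing denormals."""
    return pack_bitwise(flush_f16_denorm(lo), flush_f16_denorm(hi))


if __name__ == "__main__":
    lo, hi = 0x0001, 0x3C00  # 0x0001 is an FP16 denormal; 0x3C00 encodes 1.0
    print(hex(pack_bitwise(lo, hi)))
    print(hex(pack_with_flush(lo, hi)))
```

In this model the two pack functions agree whenever neither input has the bit pattern of an FP16 denormal, and differ otherwise. That is the case that matters when the 16-bit halves are integers or other non-FP16 data.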

Diff Detail

Repository
rL LLVM

Event Timeline

kzhuravl created this revision. Apr 7 2017, 11:08 AM
kzhuravl edited the summary of this revision. Apr 7 2017, 7:30 PM
arsenm accepted this revision. Apr 10 2017, 10:46 AM

LGTM

This revision is now accepted and ready to land. Apr 10 2017, 10:46 AM
This revision was automatically updated to reflect the committed changes.