v_pack_b32_f16 always flushes denorms (regardless of f16f64 denorm bits in pgm rsrc) and treats inputs as FP16.
Details
Details
Diff Detail
Diff Detail
Paths
| Differential D31819
AMDGPU/GFX9: Do not use v_pack_b32_f16 when packing ClosedPublic Authored by kzhuravl on Apr 7 2017, 11:08 AM.
Details Summary v_pack_b32_f16 always flushes denorms (regardless of f16f64 denorm bits in pgm rsrc) and treats inputs as FP16.
Diff Detail Event TimelineHerald added subscribers: t-tye, tpr, dstuttard and 3 others. · View Herald TranscriptApr 7 2017, 11:08 AM This revision is now accepted and ready to land.Apr 10 2017, 10:46 AM Closed by commit rL300275: AMDGPU/GFX9: Do not use v_pack_b32_f16 when packing (authored by kzhuravl). · Explain WhyApr 13 2017, 4:29 PM This revision was automatically updated to reflect the committed changes.
Revision Contents
Diff 94545 lib/Target/AMDGPU/SIInstrInfo.cpp
test/CodeGen/AMDGPU/fptrunc.f16.ll
test/CodeGen/AMDGPU/llvm.rint.f16.ll
test/CodeGen/AMDGPU/llvm.round.ll
test/CodeGen/AMDGPU/pack.v2f16.ll
test/CodeGen/AMDGPU/pack.v2i16.ll
|