Fast FMAF is not a sufficient condition to enable denormals. Before VI, enabling denormals caused F32 instructions to run at F64 speeds.
This approach seems fine to me.
LGTM
r339278