A raw v_rcp_f64 isn't accurate enough on its own, so start applying a correction.
That sequence will give a very accurate result as long as overflow and underflow are avoided. LGTM.
llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.cpp, line 3115:
Can you add a comment here and/or in the sdag equivalent showing what the code you're building here will look like, and preferably where it came from and what kind of accuracy you expect from it?
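As a rough illustration of the kind of correction being discussed, here is a minimal C++ sketch of refining a low-precision reciprocal estimate with fused multiply-adds. It is not the sequence from this patch, which is not shown in the thread. It assumes a Newton-Raphson style refinement, and `approx_rcp` is a hypothetical stand-in for the hardware estimate (here simply a single-precision divide), not a model of v_rcp_f64 itself.

```cpp
#include <cmath>

// Hypothetical stand-in for a low-precision hardware reciprocal estimate:
// compute 1/x in single precision, then widen back to double.
// Only meaningful for x within the normal float range.
static double approx_rcp(double x) {
  return static_cast<double>(1.0f / static_cast<float>(x));
}

// Assumed refinement: two Newton-Raphson steps, y' = y + y * (1 - x * y).
// Each step roughly squares the relative error of the estimate.
static double refined_rcp(double x) {
  double y = approx_rcp(x);
  for (int i = 0; i < 2; ++i) {
    // Residual 1 - x*y, computed with a single rounding via fma.
    double e = std::fma(-x, y, 1.0);
    // Apply the correction: y + y*e.
    y = std::fma(y, e, y);
  }
  return y;
}
```

For a moderate input such as `refined_rcp(3.0)`, the result should agree with `1.0 / 3.0` to within a few units in the last place, while `approx_rcp(3.0)` is only good to about single precision. The sketch does no input scaling, so the residual `1 - x*y` is only trustworthy when `x`, `y`, and their product stay away from overflow and underflow, which is the caveat raised in the review.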