This gives shorter f64 code and perhaps better accuracy.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Comment Actions
If it doesn’t actually help accuracy, maybe it should check for fast FMA
llvm/lib/Target/AMDGPU/AMDGPUISelLowering.cpp | ||
---|---|---|
2091–2092 | Could rename floor to trunc |
Comment Actions
If it doesn’t actually help accuracy, maybe it should check for fast FMA
I don't have any evidence that it improves accuracy but I'm not sure what the conditions should be for using FMA (I looked at SITargetLowering::isFMAFasterThanFMulAndFAdd but it is complicated) so I left it as a TODO.
Could rename floor to trunc