For AMDGPU this depends on whether denormals are enabled in the
default FP mode for the function. Currently this is treated as a
subtarget feature, so FMAD is selectively legal based on that. I want
to move this out of the subtarget features so this can be controlled
with a denormal mode attribute. Additionally, this will allow folding
based on a future ftz fast math flag.
Details
Details
- Reviewers
bogner cameron.mcinally nhaehnle
Diff Detail
Diff Detail
Event Timeline
Comment Actions
This looks good. Is it possible to write tests for the AMDGPU specific change?
Side note: I've been meaning to get around to pushing the FTZ conversation on llvm-dev, but haven't found time yet. I'll try to coordinate with @andrew.w.kaylor towards the end of this week...