We would always select global FP atomics from atomicrmw fadd, although
they have a hardcoded FP mode.
Event Timeline
llvm/lib/Target/AMDGPU/SIISelLowering.cpp | |
---|---|
11802 | Does it depend on the address space? What about LDS atomics, don't you need to move this out of the if statement? |
11802 | I'm not sure. |
11802 | Looks like they all behave the same: they support denorm modes and always round nearest to zero. |
11802 | RNE (round to nearest even), of course. |
I think you need to drop denorm checks and move the check outside of the address space check.
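The suggested restructuring could look roughly like the sketch below. It is a self-contained toy model, not the code in SIISelLowering.cpp: every type and function name here (`AddrSpace`, `FPMode`, `fpModeAllowsNativeAtomic`, `hasNativeFAdd`, `chooseFAddLowering`) is hypothetical, and which address spaces have a native instruction is an arbitrary choice for illustration. It assumes, following the inline comments, that the hardware atomics honor the denormal mode and always round to nearest even, so only the rounding mode is checked, and the check runs before any address-space test.

```cpp
#include <cassert>

// Hypothetical stand-ins for illustration; none of these names come from LLVM.
enum class AddrSpace { Global, LDS, Flat };
enum class Rounding { NearestEven, TowardZero };
enum class Lowering { NativeAtomic, CmpXchgLoop };

struct FPMode {
  Rounding rounding;
  bool denormalsEnabled;
};

// Assumption taken from the review thread: the atomics respect the denormal
// mode and always use RNE, so the denormal setting is deliberately not checked.
bool fpModeAllowsNativeAtomic(const FPMode &mode) {
  return mode.rounding == Rounding::NearestEven;
}

// Illustrative only: pretend global and LDS have a native FP add atomic.
bool hasNativeFAdd(AddrSpace as) {
  return as == AddrSpace::Global || as == AddrSpace::LDS;
}

Lowering chooseFAddLowering(AddrSpace as, const FPMode &mode) {
  // The FP-mode check is hoisted out of the address-space check, so it
  // applies to LDS atomics as well as global ones.
  if (!fpModeAllowsNativeAtomic(mode))
    return Lowering::CmpXchgLoop;

  return hasNativeFAdd(as) ? Lowering::NativeAtomic : Lowering::CmpXchgLoop;
}
```

With this ordering, an LDS `fadd` in a function that needs a non-RNE rounding mode is expanded just like a global one, and a disabled denormal mode on its own no longer forces expansion.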