Update SIFoldOperands pass to recognize v_add_f64 and v_mul_f64 instructions for folding output modifiers.
Details
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
This should also have some negative tests where the denormal mode, IEEE mode, or signed-zero handling doesn't match.
| llvm/lib/Target/AMDGPU/SIFoldOperands.cpp | ||
|---|---|---|
| 1777 | This is a separate change | |
| llvm/lib/Target/AMDGPU/SIFoldOperands.cpp | ||
|---|---|---|
| 1777 | This would also need a comment explaining that IEEE mode only changes sNaN behavior, so nnan lets us ignore it. | |
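
To make the conditions raised in these comments concrete, here is a minimal, self-contained sketch of the legality check they describe. It is not the code in SIFoldOperands.cpp: `FoldContext` and `canFoldOutputModifier` are hypothetical names, and the exact set of conditions (denormals flushed, no signed zeros required, IEEE mode tolerated only with nnan) is an assumption inferred from the review comments above.

```cpp
// Hypothetical, simplified model of the checks discussed in the review.
// None of these names come from LLVM.
struct FoldContext {
  bool DenormalsFlushed; // assumed: output denormals are flushed for this type
  bool IEEEMode;         // assumed: the function runs with IEEE mode enabled
  bool NoSignedZeros;    // assumed: nsz holds for the instruction or function
  bool NoNaNs;           // assumed: the instruction carries the nnan flag
};

// Returns true when, under the assumptions above, folding an output
// modifier into the defining instruction would be acceptable.
inline bool canFoldOutputModifier(const FoldContext &Ctx) {
  // Negative case 1: the denormal mode does not match.
  if (!Ctx.DenormalsFlushed)
    return false;
  // Negative case 2: signed zeros must be preserved.
  if (!Ctx.NoSignedZeros)
    return false;
  // Negative case 3: IEEE mode only changes sNaN behavior, so it blocks
  // the fold only when NaNs are possible (no nnan).
  if (Ctx.IEEEMode && !Ctx.NoNaNs)
    return false;
  return true;
}
```

Each early return corresponds to one of the negative tests requested above, which is why the model keeps them as separate checks rather than one combined expression.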