Update SIFoldOperands pass to recognize v_add_f64 and v_mul_f64 instructions for folding output modifiers.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Comment Actions
Should also have some negative tests where the denormal mode, ieee mode, or signed zeros don't match
llvm/lib/Target/AMDGPU/SIFoldOperands.cpp | ||
---|---|---|
1777 | This is a separate change |
llvm/lib/Target/AMDGPU/SIFoldOperands.cpp | ||
---|---|---|
1777 | Also would need a comment explaining that ieee mode only changes snan behavior, nnan lets us ignore it. |
This is a separate change