Update SIFoldOperands pass to recognize v_add_f64 and v_mul_f64 instructions for folding output modifiers.
Details
Details
Diff Detail
Diff Detail
Event Timeline
Comment Actions
Should also have some negative tests where the denormal mode, ieee mode, or signed zeros don't match
llvm/lib/Target/AMDGPU/SIFoldOperands.cpp | ||
---|---|---|
1776 | This is a separate change |
llvm/lib/Target/AMDGPU/SIFoldOperands.cpp | ||
---|---|---|
1776 | Also would need a comment explaining that ieee mode only changes snan behavior, nnan lets us ignore it. |
This is a separate change