This is an archive of the discontinued LLVM Phabricator instance.

[AArch64][SVE] Extend predicated fma patterns to negative zero
ClosedPublic

Authored by dmgreen on Apr 6 2023, 9:46 AM.

Details

Summary

This extends the patterns added in D130564 for fma to also handle negative 0.0. -0.0 is the identity element for fadd so comes up in vectorized loops.

The same basic idea applies to D130564, but nsz should no longer be needed for the fadd case, and is for fsub (which is really only added for completeness).

Diff Detail