A ReadAdvance was incorrectly added to the list of SchedReadWrite for the following opcodes:
sqrtss sqrtsd rsqrtss rcpss
As a consequence, a wrong operand latency was computed for the register operand used by the folded load.
This patch removes the wrong ReadAdvance, and updates the llvm-mca test cases. Now the llvm-mca timeline report shows correct timings for those unary fp SSE1/SSE2 instructions.
Please let me know if okay to commit.
Thanks,
-Andrea