As pointed out in D125755 the operand of a call to getCastInstrCost had the Src and Dst the wrong way around.
This fixes that, but alters one of the tests in a way that may not be profitable, possibly due to inaccuracies in the costmodel for X86 fptosi.sat instructions that are scalarized? https://godbolt.org/z/cnKzzTKcr