This would show up if we fix horizontal reductions to narrow as they go along, but it's an improvement for size and/or Jaguar (fast-hops) independent of that.
We need to do this late to not interfere with other pattern matching of larger horizontal sequences.
I'm guessing we would extend this to integer ops too, but I figured I better stop here and get feedback on the initial FP part in case I've missed something.
We already have shouldCombineToHorizontalOp