If this pattern doesn't fully remove the mul we'll end up with
a MUL and MULW with the same inputs.
On the assumption that multipliers are a limited resource and have
higher latency than an add, it is likely better to use a single MUL
and a sext.w instruction.
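For anyone checking the claim, here is a minimal sketch of why a single MUL followed by sext.w can stand in for MULW. It models the RV64 semantics in plain Python. The helper names (`to_signed64`, `mul`, `sext_w`, `mulw`) are made up for illustration and are not anything in the backend.

```python
MASK32 = (1 << 32) - 1
MASK64 = (1 << 64) - 1


def to_signed64(x):
    """Interpret the low 64 bits of x as a signed 64-bit value."""
    x &= MASK64
    return x - (1 << 64) if x >> 63 else x


def mul(rs1, rs2):
    """Model of RV64 MUL: low 64 bits of the product."""
    return to_signed64(rs1 * rs2)


def sext_w(rs1):
    """Model of sext.w (addiw rd, rs1, 0): sign-extend the low 32 bits."""
    low = rs1 & MASK32
    return low - (1 << 32) if low >> 31 else low


def mulw(rs1, rs2):
    """Model of RV64 MULW: multiply the low 32 bits, sign-extend the 32-bit result."""
    return sext_w((rs1 & MASK32) * (rs2 & MASK32))


# The low 32 bits of a product depend only on the low 32 bits of the inputs,
# so sext.w applied to the MUL result matches MULW on the same inputs.
for a, b in [(3, 5), (-3, 5), (0x7FFFFFFF, 2), (0x123456789, -7)]:
    assert mulw(a, b) == sext_w(mul(a, b))
```

So when the full-width MUL has to stay around for another user anyway, the 32-bit result costs one extra sext.w rather than a second multiply.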
Super nitty, but when I see su my mind doesn't jump to "single use"; I think of "signed/unsigned" instead. Maybe something like mul_oneuse?