If the constant needs a left shift anyway, we may be able to let
LUI's implicit left shift of its immediate by 12 cover part of that
shift. This lets more bits of the constant go into the LUI immediate,
which avoids an ADDI.
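
As a rough illustration only (this is not the code in the patch), the
Python sketch below models the idea for a value of the form x << shift.
The helper names materialize_shifted and evaluate are hypothetical, and
the model assumes x fits in a signed 32-bit value and that the target is
a 64-bit register.

```python
def sext(value, bits):
    """Interpret the low `bits` bits of value as a signed integer."""
    value &= (1 << bits) - 1
    return value - (1 << bits) if value >> (bits - 1) else value


def fits_signed(value, bits):
    return -(1 << (bits - 1)) <= value < (1 << (bits - 1))


def materialize_shifted(x, shift):
    """Return (mnemonic, immediate) pairs that build x << shift.

    Hypothetical helper: assumes x fits in a signed 32-bit value.
    """
    if fits_signed(x, 12):
        # Small enough for a single ADDI from x0.
        seq, remaining = [("addi", x)], shift
    elif shift >= 12 and fits_signed(x, 20):
        # LUI already shifts its immediate left by 12, so all of x can
        # go in the LUI immediate and only shift - 12 is left for SLLI.
        seq, remaining = [("lui", x & 0xFFFFF)], shift - 12
    elif fits_signed(x, 32):
        # Baseline: split x into an upper 20-bit and a lower 12-bit part.
        hi20 = (x + 0x800) >> 12
        lo12 = x - (hi20 << 12)
        seq, remaining = [("lui", hi20 & 0xFFFFF)], shift
        if lo12:
            seq.append(("addiw", lo12))
    else:
        raise ValueError("x does not fit in 32 bits in this sketch")
    if remaining:
        seq.append(("slli", remaining))
    return seq


def evaluate(seq):
    """Simulate the sequence on one 64-bit register that starts at zero."""
    reg = 0
    for op, imm in seq:
        if op == "lui":
            reg = sext(imm << 12, 32)
        elif op == "addi":
            reg = sext(reg + imm, 64)
        elif op == "addiw":
            reg = sext(reg + imm, 32)
        elif op == "slli":
            reg = sext(reg << imm, 64)
    return reg


# Shift of 16: LUI absorbs 12 bits of it, so no ADDI is needed.
print(materialize_shifted(0x12345, 16))
# Shift of 4: too small for LUI to absorb, so LUI+ADDIW+SLLI remains.
print(materialize_shifted(0x12345, 4))
```

In this model, 0x12345 << 16 takes LUI plus SLLI by 4 instead of LUI,
ADDI and SLLI by 16, and evaluate can be used to check that both
sequences produce the intended value.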
I believe this optimization is the same as, or similar to, one of the
optimizations from D79492.
I added new RVC RUN lines so we can see that the code with compression enabled isn't affected.