For AMDGPU, the shift amount is never 64-bit, so
this needs to use a 32-bit shift.
An equivalent of getShiftAmountTy is necessary for the combiner, but
I'm not sure where to put it.
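As a rough illustration of what such an equivalent could look like, here is a minimal standalone sketch. Every name in it (Target, ScalarTy, shiftAmountTyFor) is hypothetical and not existing LLVM API. The 32-bit and 8-bit widths are taken from the AMDGPU and x86 cases described in this message.

```cpp
// Hypothetical stand-ins for illustration only; none of these are LLVM types.
enum class Target { AMDGPU, X86 };

struct ScalarTy {
  unsigned SizeInBits;
};

// Sketch of a per-target query the combiner could use instead of assuming
// the shift amount has the same type as the shifted value. The value type is
// accepted so a target could vary the answer by it, but is unused here.
constexpr ScalarTy shiftAmountTyFor(Target T, ScalarTy /*ValueTy*/) {
  switch (T) {
  case Target::AMDGPU:
    return {32}; // Shift amount is never 64-bit on AMDGPU.
  case Target::X86:
    return {8}; // SelectionDAG currently uses i8 on x86.
  }
  return {32}; // Assumed fallback for the sketch.
}
```

With something along these lines, a combine that narrows a 64-bit shift would ask for the amount type rather than reusing the 64-bit value type.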
I'm sort of guessing on the x86 changes. Currently
in SelectionDAG the shift amount is i8. The GlobalISel
selector seems to have been using some kind of
hack to get the 8-bit subregister, which breaks with
this change. The copy it was inserting was invalid
because it ended up being a copy from a super-register
to its own subregister.
Use the SrcTy here?