The shift amount of a funnel shift is modulo the scalar bitwidth:
http://llvm.org/docs/LangRef.html#llvm-fshl-intrinsic
...so we can use demanded bits analysis on that operand to simplify it.
This is another step towards canonicalizing {shift/shift/or} to the intrinsic in IR.
Need to check whether BitWidth is a power of 2 here, this is incorrect for non-pow2. Take the BitWidth=33 case in the tests below.
A mod-33 funnel shift by say 65 should shift by 65-33=32 not 65 % 64 = 1.