- Most CPUs have dedicated adder & shifter to compute base address of
loads and stores, hence they are always free to use
- Older CPUs incur extra 1 cycle when doing load with left shift by 2, don't fold LSL to base address in these cases, add new feature for this
Subtarget->hasLSLFast() || FoldToBaseAddr