This patch:
1: generates more reverse memory load operations, because the transformation is now based only on result number 0 of the load instruction.
2: fixes a bug, exposed by the change above, in where the reverse memory load is placed. Without the load location fix, the output for the fixed case would be:
; CHECK: # %bb.0:
; CHECK-NEXT: stxv 34, 0(3)
; CHECK-NEXT: lxvd2x 34, 0, 3
; CHECK-NEXT: blr