Similar to the FP case in AArch64TargetLowering::LowerBR_CC.
Instead of emitting the csets + a tbnz, just emit a compare + bcc (or two bccs, depending on the condition code)
This improves cases like this: https://godbolt.org/z/v8hebx
This is a 0.1% geomean code size improvement for CTMark at -O3.
What else could the opcode be?