Hi all,
This is a patch very much along the lines of the recent ARM & AArch64 refactoring of atomics: it moves the complex control-flow (& naive instruction selection) logic out of the MachineInstr level and into IR proper. It has many of the same benefits:
+ More efficient code since the ISel phase can see when constants are used (for example).
+ Neater code
That said, the primary purpose is to support 128-bit atomic operations on x86_64 (hence the added test). These benefits mostly come about because it's the easiest way to do so.
It would have been good to completely reuse the existing LoadLinked pass, instead of copy/paste/adapt, but unfortunately the constraints and optimal forms don't *quite* match up between x86 and LL/SC architectures, so I've added it as an x86-specific pass.
The other detail that may need explaining is the removal of NOCMOV checks in atomic-minmax-i6432.ll: it turns out that any CPU with cmpxchg8b will have cmov, and they can't be controlled independently in LLVM. Since this test is fundamentally about cmpxchg8b, I decided to simply remove those tests.
Do the changes look OK, and are people who more frequently fiddle around with x86 happy with the direction?
Cheers.
Tim.
I'd like some comment about why only AtomicRMW and Store need to be handled here...