GCC doesn't CSE inline asm when 'p' is used on inputs, neither should
clang. In order to do so, we must not set memory(none) modref on such
inline asm.
If you're passing the address of a variable into inline asm as an
operand, you're going to be reading memory!
Link: https://lore.kernel.org/lkml/20230306120106.GE1267364@hirez.programming.kicks-ass.net/
I wonder if we should make this change for outputs, too?