nvvm_prmt doesn't seem to be commutative. nvvm also sets IntrSpeculatable for it.
Here is the doc https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#data-movement-and-conversion-instructions-prmt
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo