This patch enables AtomicExpandPass for NVPTX.
Depends on D125652.
Differential D125639
[NVPTX] Enable AtomicExpandPass for NVPTX
Authored by tianshilei1992 on May 15 2022, 11:23 AM.
Event Timeline
SM20 is the current minimum. Note that NVIDIA has already stopped supporting pre-SM35 GPUs and we should probably start considering removing support for sm_2x in LLVM, too.
According to https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#parallel-synchronization-and-communication-instructions-atom,
64-bit atom.{and,or,xor,min,max} require sm_32 or higher.
This must be conditional on the GPU variant we're compiling for, similar to how we handle f64 above.
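A minimal, standalone sketch of the kind of check meant here. It deliberately does not use LLVM's real classes: `AtomicOp`, `Lowering`, `isBitwiseOrMinMax`, and `chooseLowering` are hypothetical names made up for illustration, and the SM version is passed as a plain integer (20 for sm_20, 32 for sm_32). It also assumes that every case other than the 64-bit and/or/xor/min/max restriction quoted above can be emitted natively, which the real target code would need to verify.

```cpp
// Hypothetical stand-ins for illustration; these are not LLVM's actual types.
enum class AtomicOp { Add, Exch, And, Or, Xor, Min, Max };

enum class Lowering { Native, ExpandToCmpXchg };

// True for the operations whose 64-bit forms require sm_32 or higher
// according to the PTX documentation cited above.
constexpr bool isBitwiseOrMinMax(AtomicOp Op) {
  return Op == AtomicOp::And || Op == AtomicOp::Or || Op == AtomicOp::Xor ||
         Op == AtomicOp::Min || Op == AtomicOp::Max;
}

// SmVersion uses the two-digit encoding, e.g. 20 for sm_20 and 32 for sm_32.
// Assumption: anything not covered by the sm_32 restriction is treated as
// natively supported in this sketch.
constexpr Lowering chooseLowering(AtomicOp Op, unsigned BitWidth,
                                  unsigned SmVersion) {
  return (BitWidth == 64 && isBitwiseOrMinMax(Op) && SmVersion < 32)
             ? Lowering::ExpandToCmpXchg
             : Lowering::Native;
}
```

With this shape, a 64-bit `And` on sm_20 falls back to a compare-and-swap expansion, while the same operation on sm_32 or a 32-bit `And` on sm_20 stays native. In the actual patch the equivalent decision would presumably be keyed off the subtarget, in the same way the f64 handling is.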