This patch enables AtomicExpandPass for NVPTX.
Depend on D125652.
Differential D125639
[NVPTX] Enable AtomicExpandPass for NVPTX tianshilei1992 on May 15 2022, 11:23 AM. Authored by
Details
This patch enables AtomicExpandPass for NVPTX. Depend on D125652.
Diff Detail
Event Timeline
Comment Actions
SM20 is the current minimum. Note that NVIDIA has already stopped supporting pre-SM35 GPUs and we should probably start considering removing support for sm_2x in LLVM, too.
|
According to https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#parallel-synchronization-and-communication-instructions-atom
64-bit atom.{and,or,xor,min,max} require sm_32 or higher.
This must be conditional on the GPU variant we're compiling for, similar to how we handle f64 above.