[CUDA] Implemented __nvvm_atom_*_gen_* builtins.

Authored by tra on Jun 25 2015, 11:29 AM.

Description

Integer variants are implemented as atomicrmw or cmpxchg instructions.
Atomic add for floating point (__nvvm_atom_add_gen_f()) is implemented
as a call to an overloaded @llvm.nvvm.atomic.load.add.f32.* LLVM
intrinsic.
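
As a rough illustration of how these builtins might be called from device
code, here is a minimal sketch. It assumes compilation with clang in CUDA
mode (so that __device__ is available), and it assumes the builtin signatures
follow clang's NVPTX builtin definitions: pointer first, operands after, with
the previous value returned. The helper names bump_counter, try_lock and
accumulate are hypothetical and not part of this change, and the exact IR
emitted may differ between clang versions.

```cuda
// Sketch only: hypothetical device helpers built on the generic-address-space
// atomic builtins. Signatures are assumed, not taken from this commit.

// Integer add. Per the description above, this is expected to lower to an
// atomicrmw add instruction. Returns the value stored before the add.
__device__ int bump_counter(int *counter, int delta) {
  return __nvvm_atom_add_gen_i(counter, delta);
}

// Integer compare-and-swap. Expected to lower to a cmpxchg instruction.
// The builtin returns the old value, so the swap succeeded if it was 0.
__device__ bool try_lock(int *lock) {
  return __nvvm_atom_cas_gen_i(lock, /*compare=*/0, /*value=*/1) == 0;
}

// Floating-point add. Per the description above, this is expected to become
// a call to an @llvm.nvvm.atomic.load.add.f32.* intrinsic rather than an
// atomicrmw instruction. Returns the value stored before the add.
__device__ float accumulate(float *sum, float x) {
  return __nvvm_atom_add_gen_f(sum, x);
}
```

One way to check the lowering is to emit LLVM IR for the device side and look
for atomicrmw, cmpxchg, and the intrinsic call in the output.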

Differential Revision: http://reviews.llvm.org/D10666

llvm-svn: 240669

Details

Committed
tra on Jun 25 2015, 11:29 AM
Differential Revision
D10666: [CUDA] Implemented __nvvm_atom_*_gen_* builtins.
Parents
rG4b4a58ada4c6: [sanitizer] Disable signal_segv_handler test.