Memory fences are not handled by the NVPTX backend. We need to replace
them with a memory barrier intrinsic function. This doesn't include the
ordering, but should perform the necessary functionality, albeit slower.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Comment Actions
It should be sys as far as I understand because this is intended to be used on the Nvidia USM to implement RPC. Also I believe __atomic_thread_fence defaults to system scope on AMDPGU as well.