This is an archive of the discontinued LLVM Phabricator instance.

Differential D146725

[libc] Implement memory fences on NVPTX
Closed, Public

Authored by jhuber6 on Mar 23 2023, 7:52 AM.

Details

Reviewers

JonChesterfield
jdoerfert
tianshilei1992
sivachandra
tra

Commits

rG9c8bdbcbc502: [libc] Implement memory fences on NVPTX

Summary

The NVPTX backend does not handle memory fences, so we need to replace
them with a memory barrier intrinsic. The intrinsic ignores the requested
ordering, but it should provide the necessary functionality, albeit more slowly.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

jhuber6 created this revision. Mar 23 2023, 7:52 AM

Herald added projects: Restricted Project, Restricted Project. Mar 23 2023, 7:52 AM

Herald added subscribers: libc-commits, mattd, gchakrabarti and 3 others.

jhuber6 requested review of this revision. Mar 23 2023, 7:52 AM

Harbormaster completed remote builds in B221314: Diff 507741. Mar 23 2023, 8:07 AM

@tianshilei1992 wrote (D146725#4216704):

Does it have to be sys? Does gl (kernel level) work?

@jhuber6 wrote (D146725#4216706):

It should be sys, as far as I understand, because this is intended to be used with Nvidia USM to implement RPC. I also believe __atomic_thread_fence defaults to system scope on AMDGPU.

In reply to @jhuber6:

Oh I see. That's for shared memory. LG.
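To make the scope question above concrete, here is a minimal sketch, not taken from the patch. It assumes Clang's NVPTX builtins `__nvvm_membar_cta`, `__nvvm_membar_gl`, and `__nvvm_membar_sys`, which correspond to block, device, and system scope. The names `FenceScope`, `scoped_fence`, and `visible_to_host` are hypothetical, and the host fallback exists only so the sketch compiles off-GPU.

```cpp
// Hypothetical helper, not part of D146725: picks a fence by scope.
enum class FenceScope { Block, Device, System };

// Only a system-scope fence orders writes with respect to the host, which
// is why memory shared with the CPU (as in the RPC case) needs sys, not gl.
constexpr bool visible_to_host(FenceScope scope) {
  return scope == FenceScope::System;
}

inline void scoped_fence(FenceScope scope) {
#if defined(__NVPTX__)
  // Clang NVPTX builtins; wider scopes are generally more expensive.
  switch (scope) {
  case FenceScope::Block:
    __nvvm_membar_cta(); // ordered for threads in the same thread block
    break;
  case FenceScope::Device:
    __nvvm_membar_gl(); // ordered for all threads on the device
    break;
  case FenceScope::System:
    __nvvm_membar_sys(); // ordered for the host and peer devices as well
    break;
  }
#else
  // Assumption for illustration: off-GPU, every scope degrades to a
  // sequentially consistent thread fence (GCC/Clang builtin).
  (void)scope;
  __atomic_thread_fence(__ATOMIC_SEQ_CST);
#endif
}
```

Under these assumptions, a mailbox that the host polls would call `scoped_fence(FenceScope::System)`, while data exchanged only between threads on one device could use the narrower scopes.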

This revision is now accepted and ready to land. Mar 23 2023, 8:17 AM

Closed by commit rG9c8bdbcbc502: [libc] Implement memory fences on NVPTX (authored by jhuber6). Mar 23 2023, 9:26 AM

This revision was automatically updated to reflect the committed changes.

jhuber6 added a commit: rG9c8bdbcbc502: [libc] Implement memory fences on NVPTX.

Revision Contents

Path: libc/src/__support/CPP/atomic.h
Size: 8 lines

Diff 507766

libc/src/__support/CPP/atomic.h

 //===-- A simple equivalent of std::atomic ----------------------- C++ --===//
 //
 // Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
 // See https://llvm.org/LICENSE.txt for license information.
 // SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
 //
 //===----------------------------------------------------------------------===//

 #ifndef LLVM_LIBC_SRC_SUPPORT_CPP_ATOMIC_H
 #define LLVM_LIBC_SRC_SUPPORT_CPP_ATOMIC_H

 #include "src/__support/macros/attributes.h"
+#include "src/__support/macros/properties/architectures.h"

 #include "type_traits.h"

 namespace __llvm_libc {
 namespace cpp {

 enum class MemoryOrder : int {
   RELAXED = __ATOMIC_RELAXED,
 [70 lines not shown]
 public:

   // Set the value without using an atomic operation. This is useful
   // in initializing atomic values without a constructor.
   void set(T rhs) { val = rhs; }
 };

 // Issue a thread fence with the given memory ordering.
 LIBC_INLINE void atomic_thread_fence(MemoryOrder mem_ord) {
+// The NVPTX backend currently does not support atomic thread fences so we use a
+// full system fence instead.
+#ifdef LIBC_TARGET_ARCH_IS_NVPTX
+  (void)mem_ord;
+  __nvvm_membar_sys();
+#else
   __atomic_thread_fence(int(mem_ord));
+#endif
 }

 } // namespace cpp
 } // namespace __llvm_libc

 #endif // LLVM_LIBC_SRC_SUPPORT_CPP_ATOMIC_H
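
As an illustration of how a fence like the one in this diff is typically used, the sketch below publishes a value through a single-slot mailbox, loosely in the spirit of the RPC use case mentioned in the review. It is a sketch under stated assumptions, not code from the patch: `thread_fence` is a stand-in for `cpp::atomic_thread_fence`, `Mailbox`, `publish`, and `try_consume` are hypothetical names, and `__NVPTX__` is assumed to be the compiler macro that identifies the NVPTX target.

```cpp
#include <atomic>

// Stand-in for cpp::atomic_thread_fence, reduced to the two paths the
// patch selects between.
inline void thread_fence(int mem_ord) {
#if defined(__NVPTX__)
  (void)mem_ord;       // ordering is ignored; a full system fence is issued
  __nvvm_membar_sys();
#else
  __atomic_thread_fence(mem_ord); // GCC/Clang builtin
#endif
}

// Hypothetical single-slot mailbox shared between a producer and a consumer.
struct Mailbox {
  int payload = 0;
  std::atomic<bool> ready{false};
};

// Producer: write the payload, then fence, then raise the flag. The release
// fence keeps the payload write from being reordered after the flag store.
inline void publish(Mailbox &box, int value) {
  box.payload = value;
  thread_fence(__ATOMIC_RELEASE);
  box.ready.store(true, std::memory_order_relaxed);
}

// Consumer: check the flag, then fence, then read the payload. The acquire
// fence keeps the payload read from being reordered before the flag load.
inline bool try_consume(Mailbox &box, int &out) {
  if (!box.ready.load(std::memory_order_relaxed))
    return false;
  thread_fence(__ATOMIC_ACQUIRE);
  out = box.payload;
  box.ready.store(false, std::memory_order_relaxed);
  return true;
}
```

On NVPTX both fences would collapse to the same full system barrier, which matches the summary's note that the replacement is correct but slower than an ordering-aware fence.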