This is an archive of the discontinued LLVM Phabricator instance.

Differential D54879

Introduce `LocalAddressSpaceView::LoadWritable(...)` and make the `Load(...)` method return a const pointer.
ClosedPublic

Authored by delcypher on Nov 25 2018, 12:05 PM.

Details

Reviewers

kcc
cryptoad
eugenis
kubamracek
george.karpenkov
vitalybuka

Commits

rG8c11fb3ed419: Introduce `LocalAddressSpaceView::LoadWritable(...)` and make the `Load(...)`…
rCRT350136: Introduce `LocalAddressSpaceView::LoadWritable(...)` and make the `Load(...)`…
rL350136: Introduce `LocalAddressSpaceView::LoadWritable(...)` and make the `Load(...)`…

Summary

This is a follow-up to r346956 (https://reviews.llvm.org/D53975).

The purpose of this change is to allow implementers of the
AddressSpaceView to distinguish between when a caller wants
read-only memory and when a caller wants writable memory. Being able to
distinguish these cases allows implementations to optimize for the
different cases and also provides a way to work around possible platform
restrictions (e.g. the low level platform interface for reading
out-of-process memory may place memory in read-only pages).

For allocator enumeration in almost all cases read-only is sufficient so
we make Load(...) take on this new requirement and introduce the
LoadWritable(...) variants for cases where memory needs to be
writable.

The behaviour of LoadWritable(...) documented in comments is
deliberately very restrictive so that it will be possible in the future
to implement a simple write-cache (i.e. just a map from target address
to a writable region of memory). These restrictions can be loosened in
the future if necessary by implementing a more sophisticated
write-cache.
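
To make the write-cache idea concrete, here is a minimal sketch of what such an implementation might look like. It is not part of this patch: `CachingAddressSpaceView`, `CopyIn`, `WriteCache`, `Snapshots` and `WriteCacheDemo` are hypothetical names, and "remote" memory is simulated with a plain `memcpy` from the local address space instead of a real platform API.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <list>
#include <map>
#include <type_traits>
#include <vector>

using uptr = std::uintptr_t;
using Buffer = std::vector<unsigned char>;

// Hypothetical out-of-process view (assumption, not from the patch).
// The sketch ignores num_elements mismatches between calls and never
// writes cached data back to the target.
struct CachingAddressSpaceView {
  // Read-only load. Returns the writable copy if one exists for exactly this
  // target address, otherwise a fresh snapshot of the target memory.
  template <typename T>
  static const T *Load(const T *target_address, uptr num_elements = 1) {
    static_assert(std::is_trivially_copyable<T>::value, "raw byte copy");
    auto it = WriteCache().find(reinterpret_cast<uptr>(target_address));
    if (it != WriteCache().end())
      return reinterpret_cast<const T *>(it->second.data());
    Snapshots().push_back(CopyIn(target_address, sizeof(T) * num_elements));
    return reinterpret_cast<const T *>(Snapshots().back().data());
  }

  // Writable load. The cache is just a map from target address to a
  // writable region of memory, created on first use.
  template <typename T>
  static T *LoadWritable(T *target_address, uptr num_elements = 1) {
    static_assert(std::is_trivially_copyable<T>::value, "raw byte copy");
    uptr key = reinterpret_cast<uptr>(target_address);
    auto it = WriteCache().find(key);
    if (it == WriteCache().end())
      it = WriteCache()
               .emplace(key, CopyIn(target_address, sizeof(T) * num_elements))
               .first;
    return reinterpret_cast<T *>(it->second.data());
  }

 private:
  // Stand-in for a platform call that reads memory from another process.
  static Buffer CopyIn(const void *src, std::size_t size) {
    Buffer buf(size);
    std::memcpy(buf.data(), src, size);
    return buf;
  }
  static std::map<uptr, Buffer> &WriteCache() {
    static std::map<uptr, Buffer> cache;
    return cache;
  }
  static std::list<Buffer> &Snapshots() {
    static std::list<Buffer> snapshots;
    return snapshots;
  }
};

// Exercises the visibility rules described in the LoadWritable() comment.
bool WriteCacheDemo() {
  int remote_value = 1;  // stands in for memory owned by another process
  const int *before = CachingAddressSpaceView::Load(&remote_value);
  int *writable = CachingAddressSpaceView::LoadWritable(&remote_value);
  *writable = 42;
  const int *after = CachingAddressSpaceView::Load(&remote_value);
  // A later Load() of the same address sees the write; the earlier one
  // does not, and the pointers differ.
  return *after == 42 && *before == 1 && before != after;
}
```

In this sketch an earlier `Load()` result keeps its old contents after a write, which is exactly the case the restrictive documentation leaves unguaranteed. Overlapping `LoadWritable()` regions would land in separate cache entries, which is why the comments make that case implementation defined.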

rdar://problem/45284065

Diff Detail

Repository: rL LLVM

Event Timeline

delcypher created this revision. Nov 25 2018, 12:05 PM

Herald added a subscriber: Restricted Project. Nov 25 2018, 12:05 PM

Harbormaster completed remote builds in B25303: Diff 175179. Nov 25 2018, 12:05 PM

@kcc ping.

vitalybuka added a subscriber: vitalybuka. Dec 10 2018, 1:20 AM

vitalybuka added inline comments.

lib/sanitizer_common/sanitizer_local_address_space_view.h
43 ↗	(On Diff #175179)	`const T *target_address`? Then maybe you can remove `Writable` from the other name.

vitalybuka added a reviewer: vitalybuka. Dec 10 2018, 1:20 AM

delcypher marked an inline comment as done. Dec 10 2018, 8:10 AM

delcypher added inline comments.

lib/sanitizer_common/sanitizer_local_address_space_view.h
43 ↗	(On Diff #175179)	@vitalybuka I considered this but this ends up putting the burden of giving `Load(...)` a const pointer on all callers. This isn't good because it involves having to write a lot of `reinterpret_cast<const Something*>(...)` at the call sites. Given that getting a `const` pointer for enumeration of the allocator is the common case it makes more sense to force the caller to write the "longer" thing when the call site needs a non-const pointer (the uncommon case). So in that case the call-site uses `LoadWritable(...)` rather than `Load(...)`. Does this make sense?
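
The trade-off in this reply can be illustrated with a small sketch. It assumes one reading of the suggestion, namely a single `Load` name overloaded on the constness of its argument. `OverloadedView`, `SplitView` and `OverloadDemo` are hypothetical names, and `Header` is reduced to a single field.

```cpp
#include <cassert>
#include <cstdint>
#include <type_traits>
#include <utility>

using uptr = std::uintptr_t;

struct Header {
  uptr chunk_idx;
};

// Hypothetical alternative: one name, overloaded on argument constness.
struct OverloadedView {
  template <typename T>
  static const T *Load(const T *target_address, uptr num_elements = 1) {
    return target_address;
  }
  template <typename T>
  static T *Load(T *target_address, uptr num_elements = 1) {
    return target_address;
  }
};

// Reduced form of the interface this patch introduces.
struct SplitView {
  template <typename T>
  static const T *Load(const T *target_address, uptr num_elements = 1) {
    return target_address;
  }
  template <typename T>
  static T *LoadWritable(T *target_address, uptr num_elements = 1) {
    return target_address;
  }
};

// With overloading, a non-const argument silently selects the writable
// variant, so read-only callers must add const themselves.
static_assert(
    std::is_same<decltype(OverloadedView::Load(std::declval<Header *>())),
                 Header *>::value,
    "non-const argument picks the writable overload");
static_assert(
    std::is_same<decltype(OverloadedView::Load(
                     std::declval<const Header *>())),
                 const Header *>::value,
    "read-only access requires a const argument");

// With separate names, Load() is read-only regardless of the argument.
static_assert(std::is_same<decltype(SplitView::Load(std::declval<Header *>())),
                           const Header *>::value,
              "Load() always returns a const pointer");
static_assert(
    std::is_same<decltype(SplitView::LoadWritable(std::declval<Header *>())),
                 Header *>::value,
    "writable access is spelled out at the call site");

bool OverloadDemo() {
  Header h = {0};
  SplitView::LoadWritable(&h)->chunk_idx = 7;  // uncommon case, longer name
  return SplitView::Load(&h)->chunk_idx == 7;  // common case, no cast needed
}
```

Since most allocator state is held through non-const pointers, the overloaded form would hand out writable memory by default, and every read-only call site would need a cast to opt out.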

vitalybuka added inline comments. Dec 10 2018, 1:48 PM

lib/sanitizer_common/sanitizer_local_address_space_view.h
43 ↗	(On Diff #175179)	I am not sure what the implementation of these is going to be. Are there going to be any disadvantages of the non-const vs. const implementations?

delcypher marked an inline comment as not done. Dec 10 2018, 3:19 PM

vitalybuka added inline comments. Dec 13 2018, 5:39 PM

lib/sanitizer_common/sanitizer_local_address_space_view.h
43 ↗	(On Diff #175179)	Could you please respond to the one above?
43 ↗	(On Diff #175179)	Should the input arg be const? `static const T *Load(const T *target_address, uptr num_elements = 1)`

@vitalybuka Sorry for the delay in responding to this. The reason for the delay is that I was testing whether this patch is actually needed. The original motivation was that in some cases the memory copied from another process on Darwin will be copied into read-only pages. Further testing actually suggests that this isn't the case when copying portions of memory from another process that were allocated by ASan's runtime (i.e. the memory copied is writable).

However this patch might be needed for optimization later on. When memory from another process (that was copied in as COW pages) gets written to this will trigger a copy of the page. On some Darwin platforms these pages are large (IIRC 16k) which is very wasteful given that writes are made to a very small portion of the page.

Given that I'm not sure if we need this patch yet, let's put landing this on hold and concentrate on other patches that definitely need to land.

lib/sanitizer_common/sanitizer_local_address_space_view.h
43 ↗	(On Diff #175179)	> I am not sure what is going to be implementation of these. Is there going to be any disadvantages of non-const vs const implementations?

Sorry for the delay on this. For the implementations, the only disadvantage I can think of right now is that two different code paths need to be maintained. For callers there are some disadvantages:

- Callers of `Load(...)` must be more careful about const-correctness, otherwise the code won't compile.
- When a `Load(...)` call is made and later a `LoadWritable(...)` call is made to the same `target_address`, the returned pointers might be different.
- Callers need to avoid calling `LoadWritable(...)` on overlapping objects.
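
The const-correctness cost for callers can be sketched as follows. This is a reduced, self-contained illustration loosely modelled on `EnsureSortedChunks()` and `ForEachChunk()` from this revision. The free functions `SortAndIndexChunks`, `TotalSize` and `ConstCorrectnessDemo` are hypothetical, and `Header` is cut down to two fields.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <functional>

using uptr = std::uintptr_t;

// Reduced copy of the local view: both loads just return the pointer.
struct LocalAddressSpaceView {
  template <typename T>
  static const T *Load(const T *target_address, uptr num_elements = 1) {
    return target_address;
  }
  template <typename T>
  static T *LoadWritable(T *target_address, uptr num_elements = 1) {
    return target_address;
  }
};

struct Header {
  uptr size;
  uptr chunk_idx;
};

// Mutating path: both the chunk array and each header are modified,
// so both are obtained through LoadWritable().
void SortAndIndexChunks(Header **chunks_, uptr n_chunks) {
  Header **chunks = LocalAddressSpaceView::LoadWritable(chunks_, n_chunks);
  std::sort(chunks, chunks + n_chunks, std::less<Header *>());
  for (uptr i = 0; i < n_chunks; i++)
    LocalAddressSpaceView::LoadWritable(chunks[i])->chunk_idx = i;
}

// Enumeration path: read-only, so every local must be const-qualified.
uptr TotalSize(Header *const *chunks_, uptr n_chunks) {
  const Header *const *chunks = LocalAddressSpaceView::Load(chunks_, n_chunks);
  uptr total = 0;
  for (uptr i = 0; i < n_chunks; i++) {
    const Header *t = chunks[i];
    total += LocalAddressSpaceView::Load(t)->size;
    // LocalAddressSpaceView::Load(t)->chunk_idx = i;  // would not compile
  }
  return total;
}

bool ConstCorrectnessDemo() {
  Header headers[3] = {{10, 0}, {20, 0}, {30, 0}};
  Header *chunks[3] = {&headers[2], &headers[0], &headers[1]};
  SortAndIndexChunks(chunks, 3);
  return chunks[0] == &headers[0] && headers[2].chunk_idx == 2 &&
         TotalSize(chunks, 3) == 60;
}
```

Dropping the `const` from `chunks` or `t` in `TotalSize` turns into a compile error, which is the extra care the first point refers to.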

Make Load's target_address parameter be const.

Harbormaster completed remote builds in B26022: Diff 178202. Dec 14 2018, 1:34 AM

delcypher marked an inline comment as done. Dec 14 2018, 1:34 AM

vitalybuka accepted this revision. Dec 19 2018, 1:27 AM

This revision is now accepted and ready to land. Dec 19 2018, 1:27 AM

Closed by commit rL350136: Introduce `LocalAddressSpaceView::LoadWritable(...)` and make the `Load(...)`… (authored by delcypher). Dec 28 2018, 11:34 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path	Size
compiler-rt/trunk/lib/sanitizer_common/sanitizer_allocator_secondary.h	10 lines
compiler-rt/trunk/lib/sanitizer_common/sanitizer_local_address_space_view.h	40 lines

Diff 179650

compiler-rt/trunk/lib/sanitizer_common/sanitizer_allocator_secondary.h

(198 lines elided)	void *GetBlockBegin(const void *ptr) {
CHECK_LE(nearest_chunk, p);		CHECK_LE(nearest_chunk, p);
if (h->map_beg + h->map_size <= p)		if (h->map_beg + h->map_size <= p)
return nullptr;		return nullptr;
return GetUser(h);		return GetUser(h);
}		}

void EnsureSortedChunks() {		void EnsureSortedChunks() {
if (chunks_sorted_) return;		if (chunks_sorted_) return;
Header **chunks = AddressSpaceView::Load(chunks_, n_chunks_);		Header **chunks = AddressSpaceView::LoadWritable(chunks_, n_chunks_);
Sort(reinterpret_cast<uptr *>(chunks), n_chunks_);		Sort(reinterpret_cast<uptr *>(chunks), n_chunks_);
for (uptr i = 0; i < n_chunks_; i++)		for (uptr i = 0; i < n_chunks_; i++)
AddressSpaceView::Load(chunks[i])->chunk_idx = i;		AddressSpaceView::LoadWritable(chunks[i])->chunk_idx = i;
chunks_sorted_ = true;		chunks_sorted_ = true;
}		}

// This function does the same as GetBlockBegin, but is much faster.		// This function does the same as GetBlockBegin, but is much faster.
// Must be called with the allocator locked.		// Must be called with the allocator locked.
void *GetBlockBeginFastLocked(void *ptr) {		void *GetBlockBeginFastLocked(void *ptr) {
mutex_.CheckLocked();		mutex_.CheckLocked();
uptr p = reinterpret_cast<uptr>(ptr);		uptr p = reinterpret_cast<uptr>(ptr);
(51 lines elided)	public:
void ForceUnlock() {		void ForceUnlock() {
mutex_.Unlock();		mutex_.Unlock();
}		}

// Iterate over all existing chunks.		// Iterate over all existing chunks.
// The allocator must be locked when calling this function.		// The allocator must be locked when calling this function.
void ForEachChunk(ForEachChunkCallback callback, void *arg) {		void ForEachChunk(ForEachChunkCallback callback, void *arg) {
EnsureSortedChunks(); // Avoid doing the sort while iterating.		EnsureSortedChunks(); // Avoid doing the sort while iterating.
Header **chunks = AddressSpaceView::Load(chunks_, n_chunks_);		const Header *const *chunks = AddressSpaceView::Load(chunks_, n_chunks_);
for (uptr i = 0; i < n_chunks_; i++) {		for (uptr i = 0; i < n_chunks_; i++) {
Header *t = chunks[i];		const Header *t = chunks[i];
callback(reinterpret_cast<uptr>(GetUser(t)), arg);		callback(reinterpret_cast<uptr>(GetUser(t)), arg);
// Consistency check: verify that the array did not change.		// Consistency check: verify that the array did not change.
CHECK_EQ(chunks[i], t);		CHECK_EQ(chunks[i], t);
CHECK_EQ(AddressSpaceView::Load(chunks[i])->chunk_idx, i);		CHECK_EQ(AddressSpaceView::Load(chunks[i])->chunk_idx, i);
}		}
}		}

private:		private:
struct Header {		struct Header {
uptr map_beg;		uptr map_beg;
uptr map_size;		uptr map_size;
uptr size;		uptr size;
uptr chunk_idx;		uptr chunk_idx;
};		};

Header *GetHeader(uptr p) {		Header *GetHeader(uptr p) {
CHECK(IsAligned(p, page_size_));		CHECK(IsAligned(p, page_size_));
return reinterpret_cast<Header*>(p - page_size_);		return reinterpret_cast<Header*>(p - page_size_);
}		}
Header *GetHeader(const void *p) {		Header *GetHeader(const void *p) {
return GetHeader(reinterpret_cast<uptr>(p));		return GetHeader(reinterpret_cast<uptr>(p));
}		}

void *GetUser(Header *h) {		void *GetUser(const Header *h) {
CHECK(IsAligned((uptr)h, page_size_));		CHECK(IsAligned((uptr)h, page_size_));
return reinterpret_cast<void*>(reinterpret_cast<uptr>(h) + page_size_);		return reinterpret_cast<void*>(reinterpret_cast<uptr>(h) + page_size_);
}		}

uptr RoundUpMapSize(uptr size) {		uptr RoundUpMapSize(uptr size) {
return RoundUpTo(size, page_size_) + page_size_;		return RoundUpTo(size, page_size_) + page_size_;
}		}

(10 lines elided)

compiler-rt/trunk/lib/sanitizer_common/sanitizer_local_address_space_view.h

	(25 lines elided)
	// code duplication.			// code duplication.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	#ifndef SANITIZER_LOCAL_ADDRES_SPACE_VIEW_H			#ifndef SANITIZER_LOCAL_ADDRES_SPACE_VIEW_H
	#define SANITIZER_LOCAL_ADDRES_SPACE_VIEW_H			#define SANITIZER_LOCAL_ADDRES_SPACE_VIEW_H

	namespace __sanitizer {			namespace __sanitizer {
	struct LocalAddressSpaceView {			struct LocalAddressSpaceView {
	// Load memory `sizeof(T) * num_elements` bytes of memory			// Load memory `sizeof(T) * num_elements` bytes of memory from the target
	// from the target process (always local for this implementation)			// process (always local for this implementation) starting at address
	// starting at address `target_address`. The local copy of			// `target_address`. The local copy of this memory is returned as a pointer.
	// this memory is returned as a pointer. It is guaranteed that			// The caller should not write to this memory. The behaviour when doing so is
	//			// undefined. Callers should use `LoadWritable()` to get access to memory
	// * That the function will always return the same value			// that is writable.
	// for a given set of arguments.
	// * That the memory returned is writable and that writes will persist.
	//			//
	// The lifetime of loaded memory is implementation defined.			// The lifetime of loaded memory is implementation defined.
	template <typename T>			template <typename T>
	static T *Load(T *target_address, uptr num_elements = 1) {			static const T *Load(const T *target_address, uptr num_elements = 1) {
				// The target address space is the local address space so
				// nothing needs to be copied. Just return the pointer.
				return target_address;
				}

				// Load memory `sizeof(T) * num_elements` bytes of memory from the target
				// process (always local for this implementation) starting at address
				// `target_address`. The local copy of this memory is returned as a pointer.
				// The memory returned may be written to.
				//
				// Writes made to the returned memory will be visible in the memory returned
				// by subsequent `Load()` or `LoadWritable()` calls provided the
				// `target_address` parameter is the same. It is not guaranteed that the
				// memory returned by previous calls to `Load()` will contain any performed
				// writes. If two or more overlapping regions of memory are loaded via
				// separate calls to `LoadWritable()`, it is implementation defined whether
				// writes made to the region returned by one call are visible in the regions
				// returned by other calls.
				//
				// Given the above it is recommended to load the largest possible object
				// that requires modification (e.g. a class) rather than individual fields
				// from a class to avoid issues with overlapping writable regions.
				//
				// The lifetime of loaded memory is implementation defined.
				template <typename T>
				static T *LoadWritable(T *target_address, uptr num_elements = 1) {
	// The target address space is the local address space so			// The target address space is the local address space so
	// nothing needs to be copied. Just return the pointer.			// nothing needs to be copied. Just return the pointer.
	return target_address;			return target_address;
	}			}
	};			};
	} // namespace __sanitizer			} // namespace __sanitizer

	#endif			#endif