This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
compiler-rt/lib/gwp_asan/
-
lib/
-
gwp_asan/
-
guarded_pool_allocator.cpp

Differential D92415

[GWP-ASan] Fix flaky test on Fuchsia
ClosedPublic

Authored by cryptoad on Dec 1 2020, 11:46 AM.

Download Raw Diff

Details

Reviewers

mcgrathr
hctim
eugenis

Commits

rGc904c32b9c92: [GWP-ASan] Fix flaky test on Fuchsia

Summary

The LateInit test might be reusing some already initialized thread
specific data if run within the main thread. This means that there
is a chance that the current value will not be enough for the 100
iterations, hence the test flaking.

Fix this by making the test run in its own thread.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

cryptoad created this revision.Dec 1 2020, 11:46 AM

Herald added a project: Restricted Project. · View Herald TranscriptDec 1 2020, 11:46 AM

Herald added a subscriber: Restricted Project. · View Herald Transcript

cryptoad requested review of this revision.Dec 1 2020, 11:46 AM

Harbormaster completed remote builds in B80699: Diff 308726.Dec 1 2020, 12:14 PM

I'm troubled in two ways.

If this test is polluted by global (TLS) state from prior tests, that seems like a non-hermeticity problem for all these tests. Should we maybe be using a test fixture with SetUp/TearDown to wipe the global state between tests?
It seems like the code under test actually has semantics that when uninitialized, it will sometimes sample (just rarely). That seems like a problem to me. If the initialized state as configured by the environment will be to never do a gwp allocation, then there should never be a window where one is randomly allowed to happen.

In D92415#2426397, @mcgrathr wrote:

I'm troubled in two ways.

If this test is polluted by global (TLS) state from prior tests, that seems like a non-hermeticity problem for all these tests. Should we maybe be using a test fixture with SetUp/TearDown to wipe the global state between tests?

In other environments (both for upstream testing and Android) we run this test in isolation to avoid this problem.

I'm happy with Kostya's solution to spawn a thread, or resetting the TLS in the start of this test (*gwp_asan::getThreadLocals() = ThreadLocalPackedVariables();)

It seems like the code under test actually has semantics that when uninitialized, it will sometimes sample (just rarely). That seems like a problem to me. If the initialized state as configured by the environment will be to never do a gwp allocation, then there should never be a window where one is randomly allowed to happen.

There's an uninitialized check in the slow path. We might enter GPA::allocate() after 1 << 31 allocations, but we never actually provide a GWP-ASan allocation (it just returns nullptr) and we reset the counter to wait another 1 << 31 times before entering the slow path again.

In D92415#2426531, @hctim wrote:

In D92415#2426397, @mcgrathr wrote:

I'm troubled in two ways.

If this test is polluted by global (TLS) state from prior tests, that seems like a non-hermeticity problem for all these tests. Should we maybe be using a test fixture with SetUp/TearDown to wipe the global state between tests?

In other environments (both for upstream testing and Android) we run this test in isolation to avoid this problem.

What kind of isolation? In the upstream tests/CMakeLIsts.txt I don't see anything segregating this test from others. I see that Android build file has a comment and "isolated: true", but I have no idea what that means because I am not familiar with the Android build system or test-running facilities.

I'm happy with Kostya's solution to spawn a thread, or resetting the TLS in the start of this test (*gwp_asan::getThreadLocals() = ThreadLocalPackedVariables();)

The latter seems mildly preferable to me. That is, it makes clear that this particular TLS state is not doing something useful across tests, it just needs to be wiped.

It seems like the code under test actually has semantics that when uninitialized, it will sometimes sample (just rarely). That seems like a problem to me. If the initialized state as configured by the environment will be to never do a gwp allocation, then there should never be a window where one is randomly allowed to happen.

There's an uninitialized check in the slow path. We might enter GPA::allocate() after 1 << 31 allocations, but we never actually provide a GWP-ASan allocation (it just returns nullptr) and we reset the counter to wait another 1 << 31 times before entering the slow path again.

I see. Thanks for the explanation. That eliminates my only concern that wasn't purely about how the test works, and I'm much more sanguine deferring to you on the test scenario.

This revision is now accepted and ready to land.Dec 1 2020, 1:28 PM

Updating the CL with another proposed fix: reseting the TLS data in
the uninit function called by the tests.

lgtm

LGTM

Harbormaster completed remote builds in B80727: Diff 308777.Dec 1 2020, 3:11 PM

Closed by commit rGc904c32b9c92: [GWP-ASan] Fix flaky test on Fuchsia (authored by cryptoad). · Explain WhyDec 2 2020, 9:01 AM

This revision was automatically updated to reflect the committed changes.

cryptoad added a commit: rGc904c32b9c92: [GWP-ASan] Fix flaky test on Fuchsia.

Revision Contents

Path

Size

compiler-rt/

lib/

gwp_asan/

guarded_pool_allocator.cpp

1 line

Diff 308982

compiler-rt/lib/gwp_asan/guarded_pool_allocator.cpp

Show First 20 Lines • Show All 142 Lines • ▼ Show 20 Lines	if (Metadata) {
Metadata = nullptr;		Metadata = nullptr;
}		}
if (FreeSlots) {		if (FreeSlots) {
unmap(FreeSlots,		unmap(FreeSlots,
roundUpTo(State.MaxSimultaneousAllocations * sizeof(*FreeSlots),		roundUpTo(State.MaxSimultaneousAllocations * sizeof(*FreeSlots),
State.PageSize));		State.PageSize));
FreeSlots = nullptr;		FreeSlots = nullptr;
}		}
		*getThreadLocals() = ThreadLocalPackedVariables();
}		}

void *GuardedPoolAllocator::allocate(size_t Size) {		void *GuardedPoolAllocator::allocate(size_t Size) {
// GuardedPagePoolEnd == 0 when GWP-ASan is disabled. If we are disabled, fall		// GuardedPagePoolEnd == 0 when GWP-ASan is disabled. If we are disabled, fall
// back to the supporting allocator.		// back to the supporting allocator.
if (State.GuardedPagePoolEnd == 0) {		if (State.GuardedPagePoolEnd == 0) {
getThreadLocals()->NextSampleCounter =		getThreadLocals()->NextSampleCounter =
(AdjustedSampleRatePlusOne - 1) &		(AdjustedSampleRatePlusOne - 1) &
▲ Show 20 Lines • Show All 142 Lines • Show Last 20 Lines