This is an archive of the discontinued LLVM Phabricator instance.

[DirectoryWatcher] Increase timeout to make test less flaky
ClosedPublic

Authored by smeenai on Mar 3 2021, 11:59 AM.

Download Raw Diff

Details

Reviewers

jkorous
arphaman
akyrtzi
gribozavr
plotfi

Commits

rG9a2a167b6ca7: [DirectoryWatcher] Increase timeout to make test less flaky

Summary

We've observed this test being significantly flaky on our Mac CI
machines when we're running the full check-clang suite. It fails because
the wait_for condition isn't met within 3 seconds. We believe it's
because our CI machines are somewhat underpowered and pretty heavily
loaded when we're running the full check-clang suite.

I ran some experiments on increasing the timeout. I ran the full
check-clang suite 100 times with each timeout value and recorded how
many flaky failures we encountered in these tests. The results are:

3 second timeout (baseline): 20 failures
10 second timeout: 14 failures
20 second timeout: 4 failures
30 second timeout: 2 failures
40 second timeout: 1 failure
50 second timeout: 0 failures
60 second timeout: 0 failures

I ran another set of 100 tests for the 50 second timeout and observed
one flaky failure. By contrast, I ended up running check-clang 500 times
for the 60 second timeout and didn't observe a single flaky failure.
That's how the 60 second timeout value used in this patch was derived.

While a 60 second timeout might seem high, keep in mind that:

This is a timeout, not a sleep; the test should require much less time the vast majority of instances, especially on more powerful machines.
The long timeout is most likely to occur when other tests are also running at the same time, so the latency of the timeout will also be masked by the latency of the other tests.

See https://reviews.llvm.org/D58418?id=200123#inline-554211 for where
this timeout was originally introduced and the possibility of raising it
if it wasn't enough was discussed.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

smeenai requested review of this revision.Mar 3 2021, 11:59 AM

smeenai created this revision.

Herald added a project: Restricted Project. · View Herald TranscriptMar 3 2021, 11:59 AM

Herald added a subscriber: cfe-commits. · View Herald Transcript

smeenai added a reviewer: plotfi.Mar 3 2021, 4:59 PM

This makes sense to me. I approve. Can we move the 3/60 seconds number to a const int value set somewhere higher up in the file as a global with a comment explaining this as well?

This revision is now accepted and ready to land.Mar 3 2021, 7:17 PM

@jkorous thoughts?

Harbormaster completed remote builds in B91870: Diff 327874.Mar 3 2021, 8:31 PM

LGTM.

Adding the comment would be great.

Address review comments

In D97878#2602077, @plotfi wrote:

This makes sense to me. I approve. Can we move the 3/60 seconds number to a const int value set somewhere higher up in the file as a global with a comment explaining this as well?

Updated the diff; lemme know if this is what you had in mind.

Harbormaster completed remote builds in B92144: Diff 328283.Mar 5 2021, 5:18 AM

Closed by commit rG9a2a167b6ca7: [DirectoryWatcher] Increase timeout to make test less flaky (authored by smeenai). · Explain WhyMar 5 2021, 5:49 PM

This revision was automatically updated to reflect the committed changes.

smeenai added a commit: rG9a2a167b6ca7: [DirectoryWatcher] Increase timeout to make test less flaky.

Revision Contents

Path

Size

clang/

unittests/

DirectoryWatcher/

DirectoryWatcherTest.cpp

13 lines

Diff 328707

clang/unittests/DirectoryWatcher/DirectoryWatcherTest.cpp

Show All 28 Lines	return lhs.Filename == rhs.Filename &&
static_cast<int>(lhs.Kind) == static_cast<int>(rhs.Kind);		static_cast<int>(lhs.Kind) == static_cast<int>(rhs.Kind);
}		}
} // namespace clang		} // namespace clang

namespace {		namespace {

typedef DirectoryWatcher::Event::EventKind EventKind;		typedef DirectoryWatcher::Event::EventKind EventKind;

		// We've observed this test being significantly flaky when running on a heavily
		// loaded machine (e.g. when it's being run as part of the full check-clang
		// suite). Set a high timeout value to avoid this flakiness. The 60s timeout
		// value was determined empirically. It's a timeout value, not a sleep value,
		// and the test should require much less time in practice the vast majority of
		// instances. The cases where we do come close to (or still end up hitting) the
		// longer timeout are most likely to occur when other tests are also running at
		// the same time (e.g. as part of the full check-clang suite), in which case the
		// latency of the timeout will be masked by the latency of the other tests.
		constexpr std::chrono::seconds EventualResultTimeout(60);

struct DirectoryWatcherTestFixture {		struct DirectoryWatcherTestFixture {
std::string TestRootDir;		std::string TestRootDir;
std::string TestWatchedDir;		std::string TestWatchedDir;

DirectoryWatcherTestFixture() {		DirectoryWatcherTestFixture() {
SmallString<128> pathBuf;		SmallString<128> pathBuf;
#ifndef NDEBUG		#ifndef NDEBUG
std::error_code UniqDirRes =		std::error_code UniqDirRes =
▲ Show 20 Lines • Show All 193 Lines • ▼ Show 20 Lines

void checkEventualResultWithTimeout(VerifyingConsumer &TestConsumer) {		void checkEventualResultWithTimeout(VerifyingConsumer &TestConsumer) {
std::packaged_task<int(void)> task(		std::packaged_task<int(void)> task(
[&TestConsumer]() { return TestConsumer.blockUntilResult(); });		[&TestConsumer]() { return TestConsumer.blockUntilResult(); });
std::future<int> WaitForExpectedStateResult = task.get_future();		std::future<int> WaitForExpectedStateResult = task.get_future();
std::thread worker(std::move(task));		std::thread worker(std::move(task));
worker.detach();		worker.detach();

EXPECT_TRUE(WaitForExpectedStateResult.wait_for(std::chrono::seconds(3)) ==		EXPECT_TRUE(WaitForExpectedStateResult.wait_for(EventualResultTimeout) ==
std::future_status::ready)		std::future_status::ready)
<< "The expected result state wasn't reached before the time-out.";		<< "The expected result state wasn't reached before the time-out.";
std::unique_lock<std::mutex> L(TestConsumer.Mtx);		std::unique_lock<std::mutex> L(TestConsumer.Mtx);
EXPECT_TRUE(TestConsumer.result().hasValue());		EXPECT_TRUE(TestConsumer.result().hasValue());
if (TestConsumer.result().hasValue()) {		if (TestConsumer.result().hasValue()) {
EXPECT_TRUE(*TestConsumer.result());		EXPECT_TRUE(*TestConsumer.result());
}		}
if ((TestConsumer.result().hasValue() && !TestConsumer.result().getValue()) \|\|		if ((TestConsumer.result().hasValue() && !TestConsumer.result().getValue()) \|\|
▲ Show 20 Lines • Show All 236 Lines • Show Last 20 Lines