Details

Reviewers

None

Group Reviewers

Restricted Project

Commits

rG8b5e4c038ed7: [runtimes][CI] Add a 20 minutes individual test time out

Summary

If a single test has been running for more than 10 minutes on a CI node,
something is wrong and it should time-out instead of running until the
node potentially times out itself.
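As a rough illustration of the idea in the summary, the sketch below builds a lit argument string with a per-test limit. It relies on lit's `--timeout` option, which takes a value in seconds. The helper name `lit_timeout_flag` and the baseline `-sv` flags are assumptions made for this example; the actual contents of `run-buildbot` are not shown on this page.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not from the patch): convert a per-test limit given
# in minutes into lit's --timeout flag, which expects seconds.
lit_timeout_flag() {
    local minutes="$1"
    echo "--timeout=$(( minutes * 60 ))"
}

# Assumed baseline flags; the real script may pass different ones.
LIT_ARGS="-sv $(lit_timeout_flag 20)"
echo "${LIT_ARGS}"   # prints: -sv --timeout=1200
```

Keeping the limit in one place makes bumps like the ones discussed later in this review (10, 15, then 20 minutes) a one-line change.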

Diff Detail

Repository: rG LLVM Github Monorepo

Unit Tests: Failed

	Time	Test
	600,010 ms	libcxx CI MSAN > llvm-libc++-shared-cfg-in.std/containers/sequences/deque/deque_modifiers::insert_iter_iter.pass.cpp

Event Timeline

ldionne created this revision. · Dec 1 2021, 11:29 AM

Herald added a subscriber: arichardson. · Dec 1 2021, 11:29 AM

ldionne requested review of this revision. · Dec 1 2021, 11:29 AM

Herald added a project: Restricted Project. · Dec 1 2021, 11:29 AM

Herald added a reviewer: Restricted Project.

Herald added a subscriber: libcxx-commits.

Harbormaster completed remote builds in B136976: Diff 391097. · Dec 1 2021, 11:38 AM

Could this be put into the lit config itself, so it applies to even local/developer invocations rather than only CI invocations?

In D114896#3165444, @dblaikie wrote:

Could this be put into the lit config itself, so it applies to even local/developer invocations rather than only CI invocations?

Sorry for dropping this on the floor -- I've actually been wondering about what's the best answer to this. I believe I've convinced myself that no, the timeout should really only be on the CI nodes. Indeed, the test suite could be run on e.g. a simulator device that is extremely slow, and we wouldn't, in that case, have such a timeout. So I think the correct thing to do is tie this timeout to our CI fleet only, where we know we must have reasonable timing guarantees.
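One way to picture "tie this timeout to our CI fleet only" is a gate in the CI driver script rather than in the lit config. The sketch below is only an illustration: `CI_NODE` and `ci_lit_args` are hypothetical names invented for this example, and the `-sv` baseline is assumed.

```shell
#!/usr/bin/env bash
# Hypothetical gate (not from the patch): add a per-test timeout only when
# the script knows it is running on a CI node. CI_NODE is an illustrative
# variable name, not one the real script is known to use.
ci_lit_args() {
    local args="-sv"
    if [ "${CI_NODE:-0}" = "1" ]; then
        # 1200 seconds = 20 minutes per individual test.
        args="${args} --timeout=1200"
    fi
    echo "${args}"
}

echo "local run: $(CI_NODE=0 ci_lit_args)"
echo "CI run:    $(CI_NODE=1 ci_lit_args)"
```

A developer running the suite on a slow simulator would get the first form, with no per-test limit, which matches the reasoning in the comment above.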

Herald added a project: Restricted Project. · Mar 17 2022, 2:15 PM

Rebase onto main. I suspect this will still fail due to missing psutil on our CI nodes -- I'll add it separately.
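lit needs the Python `psutil` module to enforce a per-test timeout, which is why its absence on the CI nodes matters here. A minimal pre-flight check might look like the sketch below; the function name `has_psutil` and the default interpreter `python3` are assumptions for this example.

```shell
#!/usr/bin/env bash
# Hypothetical pre-flight check (not from the patch): succeeds only if the
# given Python interpreter can import psutil.
has_psutil() {
    local python="${1:-python3}"
    "${python}" -c 'import psutil' >/dev/null 2>&1
}

if has_psutil; then
    echo "psutil found: lit --timeout can be used"
else
    echo "psutil missing: install it (for example, python3 -m pip install psutil) before passing --timeout"
fi
```

Running a check like this when provisioning a node gives a clearer failure than discovering the missing module in the middle of a CI run.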

Harbormaster completed remote builds in B154919: Diff 416316. · Mar 17 2022, 3:16 PM

Poke CI

Harbormaster completed remote builds in B156975: Diff 419164. · Mar 30 2022, 12:32 PM

Increase timeout to 10 minutes to account for long TSan tests. We should also try to reduce tests that take too long to run in the test suite as a separate effort.

Harbormaster completed remote builds in B157226: Diff 419512. · Mar 31 2022, 1:01 PM

15 minutes. Geez how long is that test?

Harbormaster completed remote builds in B158774: Diff 421622. · Apr 8 2022, 5:17 PM

Bump to 20 minutes. We'll really want to investigate the deque test that takes that long.

Harbormaster completed remote builds in B159051: Diff 421977. · Apr 11 2022, 1:01 PM

ldionne accepted this revision as: Restricted Project. · Apr 11 2022, 2:46 PM

This revision was not accepted when it landed; it landed in state Needs Review. · Apr 11 2022, 2:47 PM

Closed by commit rG8b5e4c038ed7: [runtimes][CI] Add a 20 minutes individual test time out (authored by ldionne).

This revision was automatically updated to reflect the committed changes.

ldionne added a commit: rG8b5e4c038ed7: [runtimes][CI] Add a 20 minutes individual test time out.

Generally if something's sent for review it shouldn't be committed until it's approved. If you're looking to run the CI without review, you can use arc's --draft argument to produce a phab 'review' that isn't sent for review; this makes it clear whether something's intended for human review or not.

In D114896#3454353, @dblaikie wrote:

Generally if something's sent for review it shouldn't be committed until it's approved. If you're looking to run the CI without review, you can use arc's --draft argument to produce a phab 'review' that isn't sent for review; this makes it clear whether something's intended for human review or not.

I think I disagree with you here. When people submit things for review even though they plan to just land them once CI passes, it is a lot easier to see what people are up to and what changes (no matter how big) are made to the code base. libc++ is relatively small, so changes easily affect other people's patches.

In D114896#3454476, @philnik wrote:

In D114896#3454353, @dblaikie wrote:

Generally if something's sent for review it shouldn't be committed until it's approved. If you're looking to run the CI without review, you can use arc's --draft argument to produce a phab 'review' that isn't sent for review; this makes it clear whether something's intended for human review or not.

I think I disagree with you here. When people submit things for review even though they plan to just land them once CI passes, it is a lot easier to see what people are up to and what changes (no matter how big) are made to the code base. libc++ is relatively small, so changes easily affect other people's patches.

The issue is that if someone sends something for review, then commits it without approval - it's possible they felt it needed review, but then committed without it because they weren't willing to wait for review or other reasons. Basically - sending it for review is generally a statement of "this needs a second set of eyes" & then committing without approval seems like walking that back.

If folks want to (& they should, for sure) keep an eye on changes in their area - the place/tools for that are the commits list (filters/searches if you need to pare it down a bit, understandably) and Phabricator's Herald rules ( https://secure.phabricator.com/book/phabricator/article/herald/ )

vitalybuka added a subscriber: vitalybuka. · Sep 15 2022, 11:04 AM

vitalybuka added inline comments.

libcxx/utils/ci/run-buildbot
87	Tests which run for >1000 sec are problematic for buildbot: it times out if there is no output for 1200 sec (https://lab.llvm.org/buildbot/#/builders/237/builds/135). I understand that this shell script is not for buildbot, but the existence of such tests is a problem.
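The concern above can be checked with simple arithmetic: the landed per-test limit of 20 minutes is 1200 seconds, which equals the 1200 sec no-output window quoted for buildbot. The sketch below compares the two; `timeout_fits_window` is a hypothetical helper, and it assumes no output is produced while a single test is still running.

```shell
#!/usr/bin/env bash
# Hypothetical check (not from the patch): succeeds only if the per-test
# timeout is strictly below the runner's no-output window, so that a hung
# test is stopped by the per-test limit first.
timeout_fits_window() {
    local per_test_timeout="$1" no_output_window="$2"
    [ "${per_test_timeout}" -lt "${no_output_window}" ]
}

PER_TEST_TIMEOUT=$(( 20 * 60 ))   # 20 minutes, from the commit title
NO_OUTPUT_WINDOW=1200             # seconds, as quoted in the comment above

if timeout_fits_window "${PER_TEST_TIMEOUT}" "${NO_OUTPUT_WINDOW}"; then
    echo "per-test timeout fires before the no-output window"
else
    echo "per-test timeout leaves no margin below the no-output window"
fi
```

With these numbers the second branch is taken, which is one way to read the comment: a test that runs close to the limit would already be a problem on such a builder.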

This is an archive of the discontinued LLVM Phabricator instance.

[runtimes][CI] Add a 10 minutes individual test time out
ClosedPublic
