Details
- Reviewers: ldionne
- Group Reviewers: Restricted Project
- Commits: rG6e5342a6b0f4: [libcxx] Move Linaro AArch64 buildbots to buildkite

Diff Detail
- Repository: rG LLVM Github Monorepo

Event Timeline
Turns out that yes, I can use tags other than "queue" for the agents, which answers my own question.
Thanks for moving the bots over! FWIW, either using two tags (as you do here) or using a descriptive queue name that includes the architecture is fine by me.
When you update this review, could you please rebase onto main? It will get some additional changes that should reduce the load on our macOS CI.
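For reference, a minimal sketch of the two options in the BuildKite pipeline file; the step labels, commands, and tag values below are illustrative, not the ones from this patch:

```yaml
steps:
  # Option 1: generic queue name plus a separate architecture tag.
  - label: "AArch64"
    command: "libcxx/utils/ci/run-buildbot aarch64"   # placeholder command
    agents:
      queue: "libcxx-builders"
      arch: "aarch64"

  # Option 2: a single descriptive queue name that encodes the architecture.
  - label: "AArch64 -fno-exceptions"
    command: "libcxx/utils/ci/run-buildbot aarch64-noexceptions"   # placeholder command
    agents:
      queue: "libcxx-builders-linaro-aarch64"
```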
This is looking pretty good. The failures you're seeing appear to be due to flaky tests; I just checked in the following to help:

commit 642048eea041ff79aa9e8a934edff2415ab16447
Author: Louis Dionne <ldionne.2@gmail.com>
Date:   Wed Feb 17 11:19:37 2021 -0500

    [libc++] Allow retries in a few more flaky tests
Can you please rebase on top of main and clean up the patch? I think we'll be good to go.
Also, what's your CI capacity like? From the build history, it looks like there's a single builder that takes roughly 1h30 per build. I expect that will be insufficient and will stall the CI queue. Is there any way you could get an additional builder or a faster one? Otherwise, if you were able to check the Dockerfile you use into the libcxx repository, we could look into using the capacity we have on GCE to run those jobs. Our GCE instances are huge and work like a charm.
We split the build bots over a few big machines, so timing sensitive tests are often an issue. Thanks for the patch.
> Also, what's your CI capacity like? From the build history, it looks like there's a single builder that takes roughly 1h30 per build. I expect that will be insufficient and will stall the CI queue. Is there any way you could get an additional builder or a faster one? Otherwise, if you were able to check the Dockerfile you use into the libcxx repository, we could look into using the capacity we have on GCE to run those jobs. Our GCE instances are huge and work like a charm.
It's actually worse than that, since I've only got one bot running at the moment, so the total is about 3 hours. I just wanted a baseline for how slow it would be if we treated it like the existing post-commit bot. I'm sure I can bring that down a lot.
Do you happen to know how much extra capacity moving to pre-commit required for other bots? I assume there are maybe 2–3x more pre-commit runs than post-commit.
(Side note: don't take the ~4hr runtime of our post-commit bots seriously. They're actually using make -j1, which I only discovered while doing this. Good thing they'll be obsolete soon anyway.)
Got it. One thing we can do to reduce contention is put your jobs after the - wait step in the BuildKite pipeline. That way, your jobs will only run if all the jobs above the wait succeeded. I use that to reduce the load on the macOS testers and it helps a lot.
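A rough sketch of that ordering, with placeholder labels and commands; in BuildKite, steps after a `wait` only run once every step before it has finished successfully:

```yaml
steps:
  - label: "Generic config (fast, runs first)"
    command: "libcxx/utils/ci/run-buildbot generic-cxx20"   # placeholder job
    agents:
      queue: "libcxx-builders"

  # Everything below runs only if all of the steps above succeeded.
  - wait

  - label: "AArch64 (gated behind the wait)"
    command: "libcxx/utils/ci/run-buildbot aarch64"          # placeholder job
    agents:
      queue: "libcxx-builders-linaro-aarch64"
```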
> Do you happen to know how much extra capacity moving to pre-commit required for other bots? I assume there are maybe 2–3x more pre-commit runs than post-commit.
To be honest, I don't know because the GCE instances that we use for all of our main jobs are so beefy they finish in a few minutes. In fact, they are scaled up/down automatically based on the number of jobs. To give you a general guideline based on what I've been seeing since the start of pre-commit CI, I think if you can have machines that run the tests in 30-45 minutes, just one or two of those should be sufficient since you only have two build jobs to dispatch. We can also add just one of the two jobs (say the one with exceptions enabled) with your current capacity and see how things go over the next few days/weeks. We'll adjust then.
Now with 2 agents, hopefully with enough cores for decent build times.
I'm thinking that I'll leave them after the "wait" until I'm confident we can deliver a decent turnaround time.
@ldionne Getting roughly half an hour for each config: https://buildkite.com/llvm-project/libcxx-ci/builds/1685#_. This is with 2 agents, one per config, so most of the time they'll run in parallel. OK to get this reviewed as-is and see how it goes?
In the meantime I'll do the prep to move the other 4 and remove the buildbot instances.
Excellent, this LGTM! Thanks a lot!
Do you have commit access?
Inline comment on libcxx/cmake/caches/AArch64.cmake, lines 3–4:
Non-blocking question: Is it possible to use something like --target instead? Could we set LIBCXX_TARGET_TRIPLE instead?
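For illustration, a hypothetical version of those lines using the variable named in the question; the triple value is just an example and would need to match the builder's toolchain:

```cmake
# Hypothetical sketch of the suggestion: set the triple libc++ should target
# directly via its own cache variable, rather than through compiler flags.
set(LIBCXX_TARGET_TRIPLE "aarch64-linux-gnu" CACHE STRING "")
```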
Yes, this looks perfect. Thanks a lot! You can disregard the back-deployment CI, it's been failing due to some artifacts not being available.