This is an archive of the discontinued LLVM Phabricator instance.

Add support for collating profiles for use with code coverage
Closed, Public

Authored by vsk on Jun 3 2016, 4:59 PM.

Details

Summary
  • Add a cmake option to build with clang code coverage.
  • Specify a profile data directory along with -fprofile-instr-generate: this lets us gather profiles from instrumented binaries in a dedicated location.
  • Add a utility script that generates code coverage artifacts.

This patch uses the new in-process raw profile merging functionality in compiler-rt. We no longer index raw profiles after every test invocation, so no lit changes are required.

Running ninja check-llvm-unit generates 682M of raw profile data using a merge pool size of 4. In practice, this turns out to be a lot faster (and also much more compact) than doing llvm-profdata merge -sparse after each test run.
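
For reference, here is a minimal sketch of the collation flow described above, with hypothetical build and profile directories (the actual patch drives this through cmake and the utility script):

  import glob
  import os
  import subprocess

  build_dir = '/path/to/instrumented-build'   # hypothetical path
  profile_dir = os.path.join(build_dir, 'profiles')
  if not os.path.isdir(profile_dir):
      os.makedirs(profile_dir)

  # A merge pool size of 4 corresponds to a '%4m' pattern in LLVM_PROFILE_FILE:
  # the profile runtime round-robins over 4 on-disk raw profiles and merges
  # into them in-process instead of writing one file per process.
  env = dict(os.environ,
             LLVM_PROFILE_FILE=os.path.join(profile_dir, 'check-%4m.profraw'))
  subprocess.check_call(['ninja', 'check-llvm-unit'], cwd=build_dir, env=env)

  # Index the handful of raw profiles once, after the whole test run.
  raw = glob.glob(os.path.join(profile_dir, '*.profraw'))
  subprocess.check_call(['llvm-profdata', 'merge', '-sparse'] + raw +
                        ['-o', os.path.join(profile_dir, 'merged.profdata')])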

Diff Detail

Repository
rL LLVM

Event Timeline

vsk updated this revision to Diff 59641.Jun 3 2016, 4:59 PM
vsk retitled this revision to [lit] Add support for PGO profile and code coverage collection.
vsk updated this object.
vsk added reviewers: beanz, cmatthews, ddunbar.
vsk added a subscriber: llvm-commits.
MatzeB added a subscriber: MatzeB.Jun 3 2016, 8:30 PM

This adds a fairly large chunk of code to lit, and I wonder if it is necessary:

  • I assume the logic inside lit is only needed because the profile data files overwrite each other after running a command. Is setting LLVM_PROFILE_FILE with a '%p' placeholder not enough to avoid this? The final profile data merging can always be done outside of llvm-lit afterwards. Or are there other reasons why you need to intercept the opt/llc/etc. calls?
  • The lit tests don't strike me as the most typical LLVM usage (they are often designed to test corner cases of the compiler); wouldn't it be better to gather profiles by compiling the llvm test-suite or something similar?
davidxl added a subscriber: davidxl.Jun 3 2016, 9:24 PM

How many profile data files are generated in a full run? That may take a huge amount of space.

I have not had time to submit my patch adding online/in-process merge support to the runtime yet, but I will try to get it done soon. Once that is ready, we can greatly reduce the overhead.

vsk added a comment.Jun 3 2016, 9:24 PM

This adds a fairly large chunk of code to lit, and I wonder if it is necessary:

  • I assume the logic inside lit is only needed because the profile data files overwrite each other after running a command. Is setting LLVM_PROFILE_FILE with a '%p' placeholder not enough to avoid this?

Using PID substitution gets close to solving the problem of overwritten profiles, but not all the way: on 32-bit systems, PID wraparound would pose a real problem. In this patch I include a hash of the test command to minimize the loss of profiles.
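
To make the collision concern concrete, here is a conceptual sketch (not the patch's actual code) of the two naming schemes, using a hypothetical test command:

  import hashlib
  import os

  cmd = 'opt -O2 foo.ll -o /dev/null'   # hypothetical test command

  # '%p' expands to the PID at runtime; after PID wraparound, two different
  # test invocations can end up with the same name and clobber each other.
  pid_name = 'raw-%d.profraw' % os.getpid()

  # Mixing in a hash of the test command makes collisions far less likely,
  # at the cost of having lit intercept every opt/llc/... invocation.
  cmd_hash = hashlib.sha1(cmd.encode('utf-8')).hexdigest()[:16]
  hashed_name = 'raw-%s-%d.profraw' % (cmd_hash, os.getpid())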

The final profile data merging can always be done outside of llvm-lit afterwards.

On a practical level, I don't think this is possible: raw profiles are simply too large. Turning off the cleanup step and running check-llvm produces over half a terabyte of data. There are a few ways to address this without touching lit:

  1. Reduce the size of raw profiles. This would make the compiler runtime larger and more complex.
  2. Run a monitor process that does the merging/cleanup. I don't think that approach has any advantages compared to modifying lit.

Alternatively, we could do in-place raw profile merging in the compiler runtime. I don't think that's preferable, because we'd have to introduce a lot of complexity into compiler-rt (e.g., portable mandatory file locking).
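
For completeness, a rough sketch of the monitor-process idea in option (2), with hypothetical paths; the race against profiles that are still being written is one reason it is no better than modifying lit:

  import glob
  import os
  import subprocess
  import time

  profile_dir = '/path/to/profiles'   # hypothetical path
  merged = os.path.join(profile_dir, 'running-merge.profdata')

  while True:   # a real monitor would need a termination condition
      raw = glob.glob(os.path.join(profile_dir, '*.profraw'))
      if raw:
          # Note the race: a .profraw may still be in the middle of being
          # written by a test process when we pick it up here.
          inputs = raw + ([merged] if os.path.exists(merged) else [])
          subprocess.check_call(['llvm-profdata', 'merge', '-sparse'] + inputs +
                                ['-o', merged + '.tmp'])
          os.rename(merged + '.tmp', merged)
          for f in raw:
              os.remove(f)
      time.sleep(5)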

Or are there other reasons why you need to intercept the opt/llc/etc. calls?

Intercepting the calls makes it possible to generate 'unique' hashes for the profiles; see my comment above.

  • The lit tests don't strike me as the most typical LLVM usage (they are often designed to test corner cases of the compiler); wouldn't it be better to gather profiles by compiling the llvm test-suite or something similar?

It's true that the profiles gathered from check-llvm aren't ideal for PGO, but they are perfect for measuring code coverage of the LLVM codebase. Moreover, because lit is portable, other projects can use these changes to gather coverage reports from their tests.

(IIRC the test-suite also uses lit?)

MatzeB added a comment.Jun 3 2016, 9:32 PM
In D20993#449116, @vsk wrote:

This adds a fairly large chunk of code to lit, and I wonder if it is necessary:

  • I assume the logic inside lit is only needed because the profile data files overwrite each other after running a command. Is setting LLVM_PROFILE_FILE with a '%p' placeholder not enough to avoid this?

Using PID substitution gets close to solving the problem of overwritten profiles, but not all the way: on 32-bit systems, PID wraparound would pose a real problem. In this patch I include a hash of the test command to minimize the loss of profiles.

This is a really annoying problem to have... Maybe we should find a solution within the profile infrastructure itself (can we add another flag that appends a unique suffix to the filename if it already exists?) so that not every user of the profiling infrastructure has to jump through the same hoops (I've done a similar dance in the test-suite profile support).
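
The flag does not exist, but the behaviour it would need is roughly this (a sketch outside the profile runtime, in Python for brevity):

  import errno
  import os

  def open_unique(path):
      """Create and open path, or path.1, path.2, ... -- whichever does not exist yet."""
      candidate, suffix = path, 0
      while True:
          try:
              # O_EXCL makes the existence check and the create a single atomic step.
              return os.open(candidate, os.O_WRONLY | os.O_CREAT | os.O_EXCL), candidate
          except OSError as e:
              if e.errno != errno.EEXIST:
                  raise
              suffix += 1
              candidate = '%s.%d' % (path, suffix)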

The final profile data merging can always be done outside of llvm-lit afterwards.

On a practical level, I don't think this is possible: raw profiles are simply too large. Turning off the cleanup step and running check-llvm produces over half a terabyte of data. There are a few ways to address this without touching lit:

  1. Reduce the size of raw profiles. This would make the compiler runtime larger and more complex.
  2. Run a monitor process that does the merging/cleanup. I don't think that approach has any advantages compared to modifying lit.

Alternatively, we could do in-place raw profile merging in the compiler runtime. I don't think that's preferable, because we'd have to introduce a lot of complexity into compiler-rt (e.g., portable mandatory file locking).

Ah, so that seems harder to solve without an external driver merging the profile data. I'm still not a big fan of injecting this stuff into the core of lit, but we may have no other choice today.

Or are there other reasons why you need to intercept the opt/llc/etc. calls?

Intercepting the calls makes it possible to generate 'unique' hashes for the profiles; see my comment above.

  • The lit tests don't strike me as the most typical LLVM usage (they are often designed to test corner cases of the compiler); wouldn't it be better to gather profiles by compiling the llvm test-suite or something similar?

It's true that the profiles gathered from check-llvm aren't ideal for PGO, but they are perfect for measuring code coverage of the LLVM codebase. Moreover, because lit is portable, other projects can use these changes to gather coverage reports from their tests.

Oh indeed I did not think about code coverage.

(IIRC the test-suite also uses lit?)

The test-suite uses lit to execute the benchmarks, and I already added profiling support for that (see the TEST_SUITE_PROFILE_GENERATE/TEST_SUITE_PROFILE_USE variants). That is, however, about running the benchmarks: it does not help when you want to generate profile data from the clang executable that compiled the benchmarks, since the compilation is driven by make (or ninja), not by lit.

vsk added a comment.Jun 6 2016, 10:55 AM

Some of the discussion moved to the mailing list, and Phab couldn't pick up the comments.

The TL;DR: The plan is to add in-process raw profile merging support to compiler-rt. In fact, David has already added the basic mandatory file-locking constructs we'll need (see r271864). Apart from bringing general usability improvements, this will let us trim this patch down significantly.
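
As a purely conceptual sketch of what merge-on-exit under a file lock looks like (the real implementation lives in compiler-rt, in C, on top of the locking primitives from r271864; the toy counter file and POSIX advisory flock below are only illustrative):

  import fcntl
  import os
  import struct

  def merge_counters(path, counters):
      fd = os.open(path, os.O_RDWR | os.O_CREAT)
      try:
          fcntl.flock(fd, fcntl.LOCK_EX)   # serialize processes exiting at once
          existing = os.read(fd, 8 * len(counters))
          if len(existing) == 8 * len(counters):
              old = struct.unpack('<%dQ' % len(counters), existing)
              counters = [a + b for a, b in zip(old, counters)]
          os.lseek(fd, 0, os.SEEK_SET)
          os.write(fd, struct.pack('<%dQ' % len(counters), *counters))
      finally:
          fcntl.flock(fd, fcntl.LOCK_UN)
          os.close(fd)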

vsk updated this revision to Diff 60287.Jun 9 2016, 5:33 PM
vsk retitled this revision from [lit] Add support for PGO profile and code coverage collection to Add support for collating profiles for use with code coverage.
vsk updated this object.
vsk edited reviewers, added: davidxl, MatzeB; removed: ddunbar, cmatthews.
vsk removed subscribers: davidxl, MatzeB.
  • Reworked the patch to drop all lit changes. See the new title/summary.
vsk added a reviewer: ributzka.Jun 9 2016, 5:42 PM
davidxl edited edge metadata.Jun 9 2016, 10:34 PM

Why choose a default size of 9? Do you have build-time comparison numbers, i.e., for pool sizes from 1 to 9?

silvas added a subscriber: silvas.Jun 10 2016, 12:23 AM

Nit about the shell script.

utils/prepare-code-coverage-artifact.sh
Line 1 (On Diff #60287)

I mentioned this in one of the reviews of some stuff that Chris Bieneman was adding to clang for PGO (http://reviews.llvm.org/D15462), but these kinds of utility scripts are generally much more readable, robust, and portable when written in Python.

For example, the standard library operations will all throw exceptions nicely on errors (which stops the program dead and gives you a complete stack trace that is, e.g., easy to read in a bot log). Also, you can easily write assert <cond>, "reason" for quick sanity checks / "error handling" (again, you get a full stack trace if it fails, so for this kind of script more elaborate error handling isn't really needed).

It's really hard to get that kind of robustness from a shell script. (It also avoids general shell-script pitfalls like what happens when paths contain spaces, which leads to tremendous head-scratching.)

Also, writing it in Python makes it pretty portable for free and is generally more readable.
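
To illustrate the point, a tiny sketch (hypothetical paths, not the actual artifact script) of what the error behaviour looks like in Python:

  import os
  import subprocess
  import sys

  def merge_profiles(profile_dir, output):
      # os.listdir raises a clear exception (with a traceback) if the
      # directory is missing; no silent continuation as in shell.
      raw = [os.path.join(profile_dir, f) for f in os.listdir(profile_dir)
             if f.endswith('.profraw')]
      assert raw, 'no .profraw files found in %r' % profile_dir
      # check_call raises CalledProcessError if llvm-profdata fails, and the
      # argument list never goes through a shell, so spaces in paths are fine.
      subprocess.check_call(['llvm-profdata', 'merge', '-sparse'] + raw +
                            ['-o', output])

  if __name__ == '__main__':
      merge_profiles(sys.argv[1], sys.argv[2])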

vsk added a comment.Jun 10 2016, 4:35 PM

Why choose a default size of 9? Do you have build-time comparison numbers, i.e., for pool sizes from 1 to 9?

I chose 9 because it's the highest option: a larger pool of raw profiles should theoretically lead to less lock contention. Ideally this would be some factor of the number of threads on the system. However, I don't think it's worth optimizing this default.
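
In other words, the pool size just parameterizes the '%Nm' pattern; a hypothetical heuristic like the following could tie it to the host, though the patch simply picks a constant:

  import multiprocessing
  import os

  # '%Nm' only accepts a single digit, so clamp to the 1-9 range.
  pool_size = min(9, max(1, multiprocessing.cpu_count() // 2))   # hypothetical heuristic
  os.environ['LLVM_PROFILE_FILE'] = os.path.join(
      '/path/to/profiles', 'check-%' + str(pool_size) + 'm.profraw')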

Also, writing it in Python makes it pretty portable for free and is generally more readable.

Sure, I'll upload a new diff with this change.

vsk updated this revision to Diff 60430.Jun 10 2016, 6:21 PM
vsk updated this object.
vsk edited edge metadata.
  • Pythonify the artifact prep script.
  • Use a default merge pool size of 4 (see the mailing list discussion for some justification for this).
vsk updated this object.Jun 10 2016, 6:25 PM
davidxl added inline comments.Jun 13 2016, 11:30 AM
utils/prepare-code-coverage-artifact.py
Line 20 (On Diff #60430)

Is the profiles.manifest file needed here? Does it simplify the process?

vsk added inline comments.Jun 13 2016, 11:32 AM
utils/prepare-code-coverage-artifact.py
Line 20 (On Diff #60430)

It's a bit of future-proofing. It was needed with the previous patch because of the sheer number of raw profiles. Now, it helps avoid failures during the check_call step due to too many arguments.
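
Concretely, the manifest approach looks roughly like this (hypothetical paths; assumes llvm-profdata's list-of-inputs option, -input-files/-f):

  import glob
  import os
  import subprocess

  profile_dir = '/path/to/profiles'   # hypothetical path
  manifest = os.path.join(profile_dir, 'profiles.manifest')

  # Writing the raw profile paths into a manifest keeps the llvm-profdata
  # command line short, no matter how many profiles there are.
  with open(manifest, 'w') as f:
      f.write('\n'.join(glob.glob(os.path.join(profile_dir, '*.profraw'))))

  subprocess.check_call(['llvm-profdata', 'merge', '-sparse',
                         '-input-files', manifest,
                         '-o', os.path.join(profile_dir, 'merged.profdata')])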

davidxl accepted this revision.Jun 13 2016, 11:33 AM
davidxl edited edge metadata.

lgtm

This revision is now accepted and ready to land.Jun 13 2016, 11:33 AM
silvas accepted this revision.Jun 13 2016, 3:37 PM
silvas added a reviewer: silvas.

Thanks for converting the script to python!

One advisory comment on the script (not a big deal), but this LGTM.

utils/prepare-code-coverage-artifact.py
Line 20 (On Diff #60430)

'%m' only supports pool sizes up to 9, which should be fine even on Windows. It would make things slightly simpler to pass the profiles directly on the command line.

vsk added a comment.Jun 13 2016, 3:42 PM

Thanks for the feedback and reviews. I'm planning on committing after double-checking that this doesn't break any of our PGO bots.

utils/prepare-code-coverage-artifact.py
Line 20 (On Diff #60430)

Fair point, though that does get multiplied by the number of distinct binaries invoked.

This revision was automatically updated to reflect the committed changes.