This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
docs/
-
CommandGuide/
-
lit.rst
2
TestingGuide.rst
-
utils/lit/lit/
-
lit/
-
lit/
-
TestRunner.py

Differential D35396

[lit] Remove %T
AcceptedPublic

Authored by kubamracek on Jul 13 2017, 5:18 PM.

Download Raw Diff

Details

Reviewers

• ddunbar
george.karpenkov
kcc
rnk
zturner
beanz
modocache
MatzeB
• espindola

Summary

I've recently found a weird non-deterministic failure in the case-insensitive-include-ms.c, case-insensitive-include.c and case-insensitive-system-include.c testcases. The reason was that they actually share the same %T directory and when they're run in parallel, they can overwrite other test's data. Pretty terrible, right?

This patch changes lit so it generates a per-test temporary directory for %T and not a per-test-suite temporary dir. Another option would of course be to fix the above-mentioned tests, but do we really need to require test writers to remember that %T is not unique to the test?

Diff Detail

Event Timeline

kubamracek created this revision.Jul 13 2017, 5:18 PM

Herald added a reviewer: modocache. · View Herald TranscriptJul 13 2017, 5:18 PM

Tests that have this bug simply did not read the documentation for %T: it is the parent directory of %t, which is a non-existent path unique for the test. Any test that fails this way should do RUN: mkdir %t and use %t instead of %T.

That would be inconsistent with TestingGuide.rst, if this change goes through, the documentation needs to be updated.
In any case, currently TestingGuide.rst directly contradicts lit.rst on the meaning of %T.

chapuni added a subscriber: chapuni.Jul 13 2017, 5:21 PM

@rnk There are two documentation files. The lit.rst says it's unique, TestingGuide.rst says it's not.
So at least that should be fixed.
In any case, I disagree with your comment: what if the user still wants to have %t to signify a temporary file? Why an extra mkdir?

In D35396#809062, @george.karpenkov wrote:

@rnk There are two documentation files. The lit.rst says it's unique, TestingGuide.rst says it's not.
So at least that should be fixed.
In any case, I disagree with your comment: what if the user still wants to have %t to signify a temporary file? Why an extra mkdir?

Somebody has to do the mkdir, and it might as well be the test, since those execute in parallel. Right now lit makes Output directories for every suite, not for every test. Running mkdir for every llvm test seems like it would slow down startup.

@rnk Actually it would be quite easy to do the mkdir iff %T is present in the file, if performance is a concern.
I would argue that test runs should be isolated by the test runner, and test authors should not be trying to judge whether it's safe to share a directory.

In any case, inconsistent documentation should be updated one way or the other.

In D35396#809086, @george.karpenkov wrote:

@rnk Actually it would be quite easy to do the mkdir iff %T is present in the file, if performance is a concern.
I would argue that test runs should be isolated by the test runner, and test authors should not be trying to judge whether it's safe to share a directory.

In any case, inconsistent documentation should be updated one way or the other.

That sounds like a pretty good idea. :) FWIW, I am concerned about the performance. You can instrument lit to check out log it takes to make the Output directories.

I'm not suggesting to mkdir a new directory for each test. %T will point to a non-existent directory, just like %t points to a non-existent file. If the tests wants to use %T, it will have to mkdir it.

kubamracek updated this revision to Diff 106642.Jul 14 2017, 8:24 AM

In D35396#809645, @kubamracek wrote:

I'm not suggesting to mkdir a new directory for each test. %T will point to a non-existent directory, just like %t points to a non-existent file. If the tests wants to use %T, it will have to mkdir it.

lit already ensures that tmpDir exists, so that's what this change will do. Look for callers of getTempPaths.

lit already ensures that tmpDir exists, so that's what this change will do. Look for callers of getTempPaths.

Oh. That's not what I meant then. I'll update the patch.

kubamracek updated this revision to Diff 106658.Jul 14 2017, 10:07 AM

ping

In D35396#809760, @kubamracek wrote:

lit already ensures that tmpDir exists, so that's what this change will do. Look for callers of getTempPaths.

Oh. That's not what I meant then. I'll update the patch.

This doesn't seem like it was addressed? As is, surely this will regress lit startup time.

In D35396#822242, @rnk wrote:

In D35396#809760, @kubamracek wrote:

lit already ensures that tmpDir exists, so that's what this change will do. Look for callers of getTempPaths.

Oh. That's not what I meant then. I'll update the patch.

This doesn't seem like it was addressed? As is, surely this will regress lit startup time.

This was addressed, I changed the patch. dirname(tmpBase) is later being mkdir'd, but tmpDir isn't.

In D35396#822271, @kubamracek wrote:

This was addressed, I changed the patch. dirname(tmpBase) is later being mkdir'd, but tmpDir isn't.

Sounds good!

This revision is now accepted and ready to land.Jul 26 2017, 5:05 PM

Cool. Some tests actually need fixing before this can land.

The fact that %T has the same values for all tests inside a directory is indeed terrible.

One thing to think about however: With this patch it seems %T and %t both point to a unique name that doesn't exist. So why have %T at all, you could just as well do mkdir %t and I think there already are a bunch of tests that do just that.

Works for me. @rnk, do you think that's better?

In D35396#822278, @MatzeB wrote:

The fact that %T has the same values for all tests inside a directory is indeed terrible.

One thing to think about however: With this patch it seems %T and %t both point to a unique name that doesn't exist. So why have %T at all, you could just as well do mkdir %t and I think there already are a bunch of tests that do just that.

Should have read the discussion before answering and realize you already discussed all that :)
(Guess there is just the fact that I lean towards the more aggressive solution that drops the concept of %T entirely. But that is easy to say if you're not the one having to fixup all the tests. In any way I'm fine with this approach too.

In D35396#822282, @kubamracek wrote:

Works for me. @rnk, do you think that's better?

+1 for removing %T. It's just a trap.

I also think that removing %T entirely in this case is a good idea.

kubamracek updated this revision to Diff 109391.Aug 2 2017, 11:33 AM

@kubamracek Wouldn't you need now to change ALL the tests using %T? :P

Yes, of course, this change is not ready to land.

Patch to remove %T from compiler-rt: https://reviews.llvm.org/D36434

Clang: https://reviews.llvm.org/D36437

LLVM: https://reviews.llvm.org/D36495

delcypher added a subscriber: delcypher.Aug 11 2017, 12:26 PM

delcypher added inline comments.Aug 11 2017, 12:34 PM

docs/TestingGuide.rst
459	What are we supposed to do to have a test written that way to work on Windows? There is a `mkdir` command but I'm not familiar enough with it to know if doing `mkdir "%t"` will work. It certainly doesn't take a `-p` argument.

MatzeB added inline comments.Aug 11 2017, 12:42 PM

docs/TestingGuide.rst
459	I have no experience with how we do things on windows, but I'm sure I've seen things like `grep`, `sed` etc. which aren't available on windows by default. I always assumed we expect people to install some posix compatible tools to run the tests on windows...

@delcypher I think LIT already does some magic to work with Windows, right? Many tests use mkdir, otherwise none of them would run.

@rnk will probably know. What is the story with mkdir -p %t on Windows?

And this LGTM.

And some heads up: Removing the tmpDir parameter from getDefaultSubstitutions will need some changes to the llvm test-suite lit integration which calls this function. But that's no reason to stop this patch, I'll update the test-suite when this is committed.

In D35396#839504, @george.karpenkov wrote:

@delcypher I think LIT already does some magic to work with Windows, right? Many tests use mkdir, otherwise none of them would run.

The lit shell tests have two modes. Internal and external. External just invokes bash and the internal shell implements a shell command parser in python.

The executeScriptInternal() function in TestRunner.py handles running commands using the internal shell. This eventually calls _executeShCmd(). If you look at the implementation is has special support of a few shell built-ins (namely cd, echo, export and env). I don't see any special support for mkdir though. If the command doesn't start with . then lit.util.which() will be used to find the executable. So AFAICT mkdir will just run the mkdir command when using the internal shell.

The approach I've used in the past to handle different system tools is to add a substitution that expands to the command appropriate for the platform.

E.g.

https://github.com/symbooglix/symbooglix/blob/master/test_programs/lit.site.cfg#L116

In D35396#839505, @kubamracek wrote:

@rnk will probably know. What is the story with mkdir -p %t on Windows?

It works because LLVM requires Unix core utilities to be available. We have some advice on where to get them tucked away here: https://llvm.org/docs/GettingStartedVS.html#software It should probably be more prominent.

kubamracek retitled this revision from [lit] Make %T return a per-test temporary directory to [lit] Remove %T.Aug 20 2017, 10:45 AM

In D35396#840758, @rnk wrote:

In D35396#839505, @kubamracek wrote:

@rnk will probably know. What is the story with mkdir -p %t on Windows?

It works because LLVM requires Unix core utilities to be available. We have some advice on where to get them tucked away here: https://llvm.org/docs/GettingStartedVS.html#software It should probably be more prominent.

One thing to note here is that even if the GnuWin32 tools are available in PATH, if you type mkdir in a console window you seem to get the Windows version.

Is this ready?

Almost, I'm still working on removing "%T" throughout all the repos. Clang is clean, LLVM has 1 instance left, which I wasn't able to remove, and I asked the responsible person for help. There's also some downstream projects that need to be cleaned first (Swift).

jordan_rose mentioned this in D38010: lit.py: Allow configs and local configs to have a setup_script entry.Sep 18 2017, 5:01 PM

While at it, could the documentation also be clarified whether %t refers to a prefix (or note that it is created in a subdirectory such that one can know for sure that it is cleaned up)? A lot of tests use something like %clang -o %t.o.

As for whether mkdir -p can be used or not, based on a look in utils/lit/, there does not seem to be a special case for that. Very few builtins (export, echo, cd) are handled. Perhaps it would be worth adding more builtins and/or document whether it is safe to use those? Looking at a Windows builder (http://lab.llvm.org:8011/buildslaves/windows7-buildbot) though, the Profile/gcc-flag-compatibility.c test does get executed properly (it uses rm -rf and mkdir -p ../..).

@rnk Are the GnuWin32 tools just needed for the test suite or is it also needed for the build process? (Linux user here, no idea how LLVM is usually developed on Windows)

In D35396#875872, @Lekensteyn wrote:

While at it, could the documentation also be clarified whether %t refers to a prefix (or note that it is created in a subdirectory such that one can know for sure that it is cleaned up)? A lot of tests use something like %clang -o %t.o.

%t is just a name/string that is based on the tests name and is therefore guaranteed to be unique for each test. There is no automatic cleanup, in fact I believe you don't want automatic cleanup as you want to inspect the intermediate results in case of errors.
However I believe the %t path is always in a subdirectory called Output so if you wanted to cleanup intermediate results between test-suite runs you could do something like find build/test -d -name "Output" -exec rm -rf {} ';'

As for whether mkdir -p can be used or not, based on a look in utils/lit/, there does not seem to be a special case for that. Very few builtins (export, echo, cd) are handled. Perhaps it would be worth adding more builtins and/or document whether it is safe to use those? Looking at a Windows builder (http://lab.llvm.org:8011/buildslaves/windows7-buildbot) though, the Profile/gcc-flag-compatibility.c test does get executed properly (it uses rm -rf and mkdir -p ../..).

The tests assume this is possible even on windows. So the expectation is that whoever runs the tests has those GnuWin32 tools installed.

@rnk Are the GnuWin32 tools just needed for the test suite or is it also needed for the build process? (Linux user here, no idea how LLVM is usually developed on Windows)

docs/GettingStartedVS.rst sounds like GnuWin32 is only needed for the tests.

Lekensteyn mentioned this in D37954: Try to shorten system header paths when using -MD depfiles.Sep 19 2017, 6:21 PM

• espindola edited reviewers, added: • espindola; removed: • rafael.Mar 14 2018, 4:57 PM

Hi, what is the status with this patch? It was referred to by https://reviews.llvm.org/D36495 (which is committed), and there still seem to be a lot of users of %T.

In D35396#1132499, @Lekensteyn wrote:

Hi, what is the status with this patch? It was referred to by https://reviews.llvm.org/D36495 (which is committed), and there still seem to be a lot of users of %T.

Yes, that's why this one didn't land yet. We need to get rid of all the %T users first.

In D35396#1132500, @kubamracek wrote:

In D35396#1132499, @Lekensteyn wrote:

Hi, what is the status with this patch? It was referred to by https://reviews.llvm.org/D36495 (which is committed), and there still seem to be a lot of users of %T.

Yes, that's why this one didn't land yet. We need to get rid of all the %T users first.

Perhaps you could already adjust the documentation and mark %T as deprecated. I have honestly forgotten about this patch and used %T again in some new tests because it was documented (without any caveats).

Good idea. https://reviews.llvm.org/D48189.

filcab mentioned this in D36434: [compiler-rt] Get rid of "%T" expansions.Jun 15 2018, 3:46 AM

probinson mentioned this in D78245: [LIT] Make `%T` unique for every test.Apr 29 2020, 1:28 PM

Revision Contents

Path

Size

docs/

CommandGuide/

lit.rst

4 lines

TestingGuide.rst

8 lines

utils/

lit/

TestRunner.py

13 lines

Diff 109391

docs/CommandGuide/lit.rst

	Show First 20 Lines • Show All 396 Lines • ▼ Show 20 Lines

	========== ==============			========== ==============
	Macro Substitution			Macro Substitution
	========== ==============			========== ==============
	%s source path (path to the file currently being run)			%s source path (path to the file currently being run)
	%S source dir (directory of the file currently being run)			%S source dir (directory of the file currently being run)
	%p same as %S			%p same as %S
	%{pathsep} path separator			%{pathsep} path separator
	%t temporary file name unique to the test			%t temporary file name unique to the test, if you need a directory
	%T temporary directory unique to the test			use `mkdir -p %t`
	%% %			%% %
	========== ==============			========== ==============

	Other substitutions are provided that are variations on this base set and			Other substitutions are provided that are variations on this base set and
	further substitution patterns can be defined by each test module. See the			further substitution patterns can be defined by each test module. See the
	modules :ref:`local-configuration-files`.			modules :ref:`local-configuration-files`.

	More detailed information on substitutions can be found in the			More detailed information on substitutions can be found in the
	▲ Show 20 Lines • Show All 64 Lines • Show Last 20 Lines

docs/TestingGuide.rst

Show First 20 Lines • Show All 449 Lines • ▼ Show 20 Lines	``%S``
Directory path to the test case's source.		Directory path to the test case's source.

Example: ``/home/user/llvm/test/MC/ELF``		Example: ``/home/user/llvm/test/MC/ELF``

``%t``		``%t``
File path to a temporary file name that could be used for this test case.		File path to a temporary file name that could be used for this test case.
The file name won't conflict with other test cases. You can append to it		The file name won't conflict with other test cases. You can append to it
if you need multiple temporaries. This is useful as the destination of		if you need multiple temporaries. This is useful as the destination of
some redirected output.		some redirected output. If your test needs a temporary directory, you can
		use ``mkdir -p %t``.
		delcypherUnsubmitted Not Done Reply Inline Actions What are we supposed to do to have a test written that way to work on Windows? There is a `mkdir` command but I'm not familiar enough with it to know if doing `mkdir "%t"` will work. It certainly doesn't take a `-p` argument. delcypher: What are we supposed to do to have a test written that way to work on Windows? There is a…
		MatzeBUnsubmitted Not Done Reply Inline Actions I have no experience with how we do things on windows, but I'm sure I've seen things like `grep`, `sed` etc. which aren't available on windows by default. I always assumed we expect people to install some posix compatible tools to run the tests on windows... MatzeB: I have no experience with how we do things on windows, but I'm sure I've seen things like…

Example: ``/home/user/llvm.build/test/MC/ELF/Output/foo_test.s.tmp``		Example: ``/home/user/llvm.build/test/MC/ELF/Output/foo_test.s.tmp``

``%T``
Directory of ``%t``.

Example: ``/home/user/llvm.build/test/MC/ELF/Output``

``%{pathsep}``		``%{pathsep}``

Expands to the path separator, i.e. ``:`` (or ``;`` on Windows).		Expands to the path separator, i.e. ``:`` (or ``;`` on Windows).

``%/s, %/S, %/t, %/T:``		``%/s, %/S, %/t, %/T:``

Act like the corresponding substitution above but replace any ``\``		Act like the corresponding substitution above but replace any ``\``
character with a ``/``. This is useful to normalize path separators.		character with a ``/``. This is useful to normalize path separators.
▲ Show 20 Lines • Show All 172 Lines • Show Last 20 Lines

utils/lit/lit/TestRunner.py

Show First 20 Lines • Show All 794 Lines • ▼ Show 20 Lines

def getTempPaths(test):		def getTempPaths(test):
"""Get the temporary location, this is always relative to the test suite		"""Get the temporary location, this is always relative to the test suite
root, not test source root."""		root, not test source root."""
execpath = test.getExecPath()		execpath = test.getExecPath()
execdir,execbase = os.path.split(execpath)		execdir,execbase = os.path.split(execpath)
tmpDir = os.path.join(execdir, 'Output')		tmpDir = os.path.join(execdir, 'Output')
tmpBase = os.path.join(tmpDir, execbase)		tmpBase = os.path.join(tmpDir, execbase)
return tmpDir, tmpBase		return tmpBase

def getDefaultSubstitutions(test, tmpDir, tmpBase, normalize_slashes=False):		def getDefaultSubstitutions(test, tmpBase, normalize_slashes=False):
sourcepath = test.getSourcePath()		sourcepath = test.getSourcePath()
sourcedir = os.path.dirname(sourcepath)		sourcedir = os.path.dirname(sourcepath)

# Normalize slashes, if requested.		# Normalize slashes, if requested.
if normalize_slashes:		if normalize_slashes:
sourcepath = sourcepath.replace('\\', '/')		sourcepath = sourcepath.replace('\\', '/')
sourcedir = sourcedir.replace('\\', '/')		sourcedir = sourcedir.replace('\\', '/')
tmpDir = tmpDir.replace('\\', '/')
tmpBase = tmpBase.replace('\\', '/')		tmpBase = tmpBase.replace('\\', '/')

# We use #_MARKER_# to hide %% while we do the other substitutions.		# We use #_MARKER_# to hide %% while we do the other substitutions.
substitutions = []		substitutions = []
substitutions.extend([('%%', '#_MARKER_#')])		substitutions.extend([('%%', '#_MARKER_#')])
substitutions.extend(test.config.substitutions)		substitutions.extend(test.config.substitutions)
tmpName = tmpBase + '.tmp'		tmpName = tmpBase + '.tmp'
baseName = os.path.basename(tmpBase)		baseName = os.path.basename(tmpBase)
substitutions.extend([('%s', sourcepath),		substitutions.extend([('%s', sourcepath),
('%S', sourcedir),		('%S', sourcedir),
('%p', sourcedir),		('%p', sourcedir),
('%{pathsep}', os.pathsep),		('%{pathsep}', os.pathsep),
('%t', tmpName),		('%t', tmpName),
('%basename_t', baseName),		('%basename_t', baseName),
('%T', tmpDir),
('#_MARKER_#', '%')])		('#_MARKER_#', '%')])

# "%/[STpst]" should be normalized.		# "%/[STpst]" should be normalized.
substitutions.extend([		substitutions.extend([
('%/s', sourcepath.replace('\\', '/')),		('%/s', sourcepath.replace('\\', '/')),
('%/S', sourcedir.replace('\\', '/')),		('%/S', sourcedir.replace('\\', '/')),
('%/p', sourcedir.replace('\\', '/')),		('%/p', sourcedir.replace('\\', '/')),
('%/t', tmpBase.replace('\\', '/') + '.tmp'),		('%/t', tmpBase.replace('\\', '/') + '.tmp'),
('%/T', tmpDir.replace('\\', '/')),
])		])

# "%:[STpst]" are paths without colons.		# "%:[STpst]" are paths without colons.
if kIsWindows:		if kIsWindows:
substitutions.extend([		substitutions.extend([
('%:s', re.sub(r'^(.):', r'\1', sourcepath)),		('%:s', re.sub(r'^(.):', r'\1', sourcepath)),
('%:S', re.sub(r'^(.):', r'\1', sourcedir)),		('%:S', re.sub(r'^(.):', r'\1', sourcedir)),
('%:p', re.sub(r'^(.):', r'\1', sourcedir)),		('%:p', re.sub(r'^(.):', r'\1', sourcedir)),
('%:t', re.sub(r'^(.):', r'\1', tmpBase) + '.tmp'),		('%:t', re.sub(r'^(.):', r'\1', tmpBase) + '.tmp'),
('%:T', re.sub(r'^(.):', r'\1', tmpDir)),
])		])
else:		else:
substitutions.extend([		substitutions.extend([
('%:s', sourcepath),		('%:s', sourcepath),
('%:S', sourcedir),		('%:S', sourcedir),
('%:p', sourcedir),		('%:p', sourcedir),
('%:t', tmpBase + '.tmp'),		('%:t', tmpBase + '.tmp'),
('%:T', tmpDir),
])		])
return substitutions		return substitutions

def applySubstitutions(script, substitutions):		def applySubstitutions(script, substitutions):
"""Apply substitutions to the script. Allow full regular expression syntax.		"""Apply substitutions to the script. Allow full regular expression syntax.
Replace each matching occurrence of regular expression pattern a with		Replace each matching occurrence of regular expression pattern a with
substitution b in line ln."""		substitution b in line ln."""
def processLine(ln):		def processLine(ln):
▲ Show 20 Lines • Show All 298 Lines • ▼ Show 20 Lines	if test.config.unsupported:
return lit.Test.Result(Test.UNSUPPORTED, 'Test is unsupported')		return lit.Test.Result(Test.UNSUPPORTED, 'Test is unsupported')

script = parseIntegratedTestScript(test)		script = parseIntegratedTestScript(test)
if isinstance(script, lit.Test.Result):		if isinstance(script, lit.Test.Result):
return script		return script
if litConfig.noExecute:		if litConfig.noExecute:
return lit.Test.Result(Test.PASS)		return lit.Test.Result(Test.PASS)

tmpDir, tmpBase = getTempPaths(test)		tmpBase = getTempPaths(test)
substitutions = list(extra_substitutions)		substitutions = list(extra_substitutions)
substitutions += getDefaultSubstitutions(test, tmpDir, tmpBase,		substitutions += getDefaultSubstitutions(test, tmpBase,
normalize_slashes=useExternalSh)		normalize_slashes=useExternalSh)
script = applySubstitutions(script, substitutions)		script = applySubstitutions(script, substitutions)

# Re-run failed tests up to test_retry_attempts times.		# Re-run failed tests up to test_retry_attempts times.
attempts = 1		attempts = 1
if hasattr(test.config, 'test_retry_attempts'):		if hasattr(test.config, 'test_retry_attempts'):
attempts += test.config.test_retry_attempts		attempts += test.config.test_retry_attempts
for i in range(attempts):		for i in range(attempts):
res = _runShTest(test, litConfig, useExternalSh, script, tmpBase)		res = _runShTest(test, litConfig, useExternalSh, script, tmpBase)
if res.code != Test.FAIL:		if res.code != Test.FAIL:
break		break
# If we had to run the test more than once, count it as a flaky pass. These		# If we had to run the test more than once, count it as a flaky pass. These
# will be printed separately in the test summary.		# will be printed separately in the test summary.
if i > 0 and res.code == Test.PASS:		if i > 0 and res.code == Test.PASS:
res.code = Test.FLAKYPASS		res.code = Test.FLAKYPASS
return res		return res