This is an archive of the discontinued LLVM Phabricator instance.

Differential D104572

[lit] Add the ability to parse regexes in Lit boolean expressions
Closed, Public

Authored by ldionne on Jun 18 2021, 2:59 PM.

Details

Reviewers

yln
jdenny
mstorsjo

Commits

rGfec521a7b206: [lit] Add the ability to parse regexes in Lit boolean expressions

Summary

This patch augments Lit with the ability to parse regular expressions
in boolean expressions. This includes REQUIRES:, XFAIL:, UNSUPPORTED:,
and all other special Lit markup that evaluates to a boolean expression.

Regular expressions can be specified by enclosing them in {{...}},
similarly to how FileCheck handles such regular expressions. The regular
expression can either be on its own, or it can be part of an identifier.
For example, a match expression like {{.+}}-apple-darwin{{.+}} would match
the following variables:

x86_64-apple-darwin20.0
arm64-apple-darwin20.0
arm64-apple-darwin22.0
etc...

In the long term, this could be used to remove the need to handle the
target triple specially when parsing boolean expressions.
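
A minimal sketch of how such a match expression could be turned into a Python regular expression, loosely following the `parseMATCH` method in this diff. The helper name `match_expression_to_regex` and the feature set are illustrative assumptions, not part of Lit.

```python
import re

def match_expression_to_regex(expr):
    """Compile a Lit-style match expression such as 'a{{b+}}c' to a regex.

    Text inside {{...}} is kept as a regular expression; everything else
    is escaped so that it only matches literally.
    """
    pattern = ''
    # The capturing group makes re.split keep the {{...}} parts in the result.
    for part in filter(None, re.split(r'(\{\{.+?\}\})', expr)):
        if part.startswith('{{'):
            pattern += '(?:{})'.format(part[2:-2])
        else:
            pattern += re.escape(part)
    return re.compile(pattern)

# Hypothetical feature set; real values come from the test suite's configuration.
features = {
    'x86_64-apple-darwin20.0',
    'arm64-apple-darwin22.0',
    'x86_64-unknown-linux-gnu',
}

regex = match_expression_to_regex('{{.+}}-apple-darwin{{.+}}')
# The whole feature must match, not just a prefix or substring.
matching = sorted(f for f in features if regex.fullmatch(f))
print(matching)
```

With this feature set, only the two darwin entries match; the Linux entry does not, because the literal `-apple-darwin` part must appear in the feature.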

Diff Detail

Repository: rG LLVM Github Monorepo

Unit Tests: Failed

Time	Test
36,940 ms	x64 debian > libFuzzer.libFuzzer::entropic-scale-per-exec-time.test

Event Timeline

ldionne created this revision. Jun 18 2021, 2:59 PM

Herald added subscribers: pengfei, delcypher, kristof.beyls. Jun 18 2021, 2:59 PM

ldionne requested review of this revision. Jun 18 2021, 2:59 PM

Herald added a project: Restricted Project. Jun 18 2021, 2:59 PM

Herald added a subscriber: llvm-commits.

ldionne added inline comments. Jun 18 2021, 3:03 PM

llvm/utils/lit/lit/BooleanExpression.py
7–13	I'm not an EBNF expert, but the intent here was to describe that one can alternate `identifier` and `{{regex}}` inside what used to be just an identifier. This is to allow things like `abc{{regex1}}def{{regex2}}ghi`.

Harbormaster completed remote builds in B110001: Diff 353106. Jun 19 2021, 12:43 PM

https://reviews.llvm.org/D104747 is an example of how libc++, libc++abi and libunwind can take advantage of this in their test suites.

Fix issue with grouping, add more tests (in particular, for interaction with --show-used-features)

Harbormaster completed remote builds in B110634: Diff 353976. Jun 23 2021, 8:46 AM

Gentle ping to reviewers. Does anyone see an issue with adding this? Can anyone think of a subtle (and bad) interaction with existing Lit behavior?

This seems like a worthwhile feature. Thanks for working on it. It's not a part of lit I have much experience with, so hopefully someone else will comment too.

I've commented some on the implementation. I haven't looked at the tests much yet.

llvm/utils/lit/lit/BooleanExpression.py
8	I think the following renaming would make this easier to understand: `regex` -> `braced_regex`: It's not just a plain regular expression. `any-regex` -> `python_regex`: It's written in python's regular expression language.
113–114	`isIdentifier` seems misnamed if it accepts `{{`. The logic should probably be something like either `not isIdentifier and not isRegexOpen` or `not isIdentifierOrRegexOpen`.
114–115	Shouldn't this mention `{{` as a possibility?

ldionne marked 3 inline comments as done. Jun 28 2021, 9:49 AM

ldionne added inline comments.

llvm/utils/lit/lit/BooleanExpression.py
113–114	I agree that `isIdentifier` is misnamed, I'll fix that. I also agree that it should be something like `not isIdentifier and not isRegexOpen`, however this implementation does not track the fact that we're parsing the content of a regular expression -- it doesn't know that it's inside a `{{`. Instead, the tokenization pattern was augmented to treat anything with `{{<whatever>}}` as a token of its own - that was by far the simplest way I could find to implement it. So instead, I believe the fix is to simply rename `isIdentifier` to `isMatchExpression`, and to acknowledge that we now have a new leaf in the grammar, and that `isMatchExpression` basically allows us to detect that.
114–115	I'll say `expected '!' or '(' or match-expression` instead, LMK if that's not satisfying.
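
The tokenization approach described in these comments can be sketched as follows. The pattern is the one added in this diff, written as it would appear in Python source; the `tokenize` function is a simplified stand-in (it returns a list and has no end-of-expression marker), not Lit's actual generator.

```python
import re

# One token is a parenthesis, '&&', '||', '!', or a run of identifier
# characters and {{...}} groups in any order.
PATTERN = re.compile(
    r'\A\s*([()]|&&|\|\||!|(?:[-+=._a-zA-Z0-9]+|\{\{.+?\}\})+)\s*(.*)\Z')

def tokenize(string):
    """Split a boolean expression into tokens (simplified sketch)."""
    tokens = []
    while string:
        m = PATTERN.match(string)
        if m is None:
            raise ValueError("couldn't parse text: %r" % string)
        tokens.append(m.group(1))   # the token itself
        string = m.group(2)         # whatever is left to tokenize
    return tokens

print(tokenize('linux && (target={{aarch64-.+}} || target={{x86_64-.+}})'))
```

For this expression the tokens are `linux`, `&&`, `(`, `target={{aarch64-.+}}`, `||`, `target={{x86_64-.+}}` and `)`. Each `identifier{{regex}}` run comes out as a single token, so the parser never needs to know it is inside a `{{`. An unterminated group such as `abc{{def` fails at `{{def`, because nothing in the pattern matches a lone `{{`.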

Address review comments. Thanks for reviewing!

Harbormaster completed remote builds in B111316: Diff 354940. Jun 28 2021, 11:06 AM

There should be some user-level documentation about this new feature. It looks like the appropriate place is:

https://llvm.org/docs/TestingGuide.html#constraining-test-execution

llvm/utils/lit/lit/BooleanExpression.py
113–114	That seems reasonable. Thanks for addressing it.
114–115	While that does reflect the implementation, I think `expected '!', '(', '{{', or identifier` would be more meaningful to a user who isn't familiar with the internal grammar symbol names. As far as they know, `match-expression` could be, for example, the start symbol for the whole grammar. In contrast, `{{` and `identifier` are probably clear enough to any user.
184	Should there be some minimal test somewhere (maybe here) ensuring that a regex is handled as a literal string when matching triples? I think that means it cannot ever match (due to `{{` and `}}`), so maybe it's not a very interesting behavior, but we still might want to be aware if we accidentally change it.

Address review comments

There should be some user-level documentation about this new feature. It looks like the appropriate place is:

Thanks! Indeed, I wanted to write some documentation but I couldn't find where to add it. I'll do that.

llvm/utils/lit/lit/BooleanExpression.py
114–115	Ok, I agree, that seems better. Changed.
184	Hmm, I agree, I had not considered that. Added two tests: one that a triple doesn't match even if the regex would match, and one where the triple does match the regex when it is treated literally. The second test is not something we can even encounter in real life, as you say, because triples can't contain special characters like `{{`, but I think the test still has value since it pins down the behavior. Note: In the future, I would love to remove any notion of a triple in Lit, since I think the special substring handling was its only purpose. If we ever do that, this whole test will become moot.
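
The difference between the two checks can be shown in isolation. This sketch assumes, as in the diff's `parseMATCH`, that the triple is tested with a plain substring check on the raw token, while features are tested with a full regex match; the regex below is written out by hand for illustration.

```python
import re

triple = 'arch-vendor-os'
token = 'arch-{{vendor}}-os'

# Triple handling: the raw token text, braces included, must be a substring.
substring_match = token in triple

# What a regex interpretation of the same token would give instead.
regex_match = re.fullmatch(r'arch-(?:vendor)-os', triple) is not None

# A triple that literally contains the braces does satisfy the substring check.
literal_match = token in 'arch-{{vendor}}-os'

print(substring_match, regex_match, literal_match)
```

The first value is `False` even though the second is `True`, which is the behavior the first new test pins down; the third corresponds to the second new test.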

Rephrase error message containing too many or's

LGTM.

I apologize if this is frustrating, but can you wait a couple of days to commit? I'd like to give other reviewers just a bit more time to comment in case we overlooked something.

llvm/docs/TestingGuide.rst
465–468	Thanks for adding this. Tiny nit: The second bullet is a different animal than the other two. I'd put the other two next to each other or integrate them. But it's no big deal if you prefer it as is.
llvm/utils/lit/lit/BooleanExpression.py
184	That all makes sense to me. Thanks.

This revision is now accepted and ready to land. Jun 29 2021, 8:42 AM

Harbormaster completed remote builds in B111541: Diff 355253. Jun 29 2021, 9:26 AM

In D104572#2847432, @jdenny wrote:

LGTM.

I apologize if this is frustrating, but can you wait a couple of days to commit? I'd like to give other reviewers just a bit more time to comment in case we overlooked something.

Thanks a lot for the review! Waiting for a bit is not frustrating - I fully understand how everybody's busy and I myself often let reviews go under my radar for a while. I'll wait until the end of the week to commit this.

llvm/docs/TestingGuide.rst
465–468	Hmm, yes, I agree with you. I integrated both into one paragraph, and I documented the fact that string comparison is case sensitive while I was at it. Please let me know if the new formulation doesn't suit you.

Reformulate documentation.

Also, as a side note - should we perhaps create a Lit review group that people can add themselves to, and that is added as a blocking reviewer on any review that touches llvm/utils/lit? I think that might be useful, as I'm never certain of who should be included on those reviews. We have that for libc++ and it's working out pretty well.

In D104572#2848130, @ldionne wrote:

Thanks a lot for the review! Waiting for a bit is not frustrating - I fully understand how everybody's busy and I myself often let reviews go under my radar for a while. I'll wait until the end of the week to commit this.

Thanks for understanding, and for the patch!

In D104572#2848143, @ldionne wrote:

Also, as a side note - should we perhaps create a Lit review group that people can add themselves to, and that is added as a blocking reviewer on any review that touches llvm/utils/lit? I think that might be useful, as I'm never certain of who should be included on those reviews. We have that for libc++ and it's working out pretty well.

I've seen these groups but haven't investigated how they work. Is the idea that someone from that group must accept or the patch is marked as blocked?

llvm/docs/TestingGuide.rst
465–468	LGTM. Thanks.

Harbormaster completed remote builds in B111596: Diff 355329. Jun 29 2021, 1:18 PM

In D104572#2848282, @jdenny wrote:

I've seen these groups but haven't investigated how they work. Is the idea that someone from that group must accept or the patch is marked as blocked?

Yes, basically the review is marked as blocked until anyone from the group gives a green check-mark. It's also possible to make the group a non-blocking reviewer (or even just a subscriber). But having that helps make sure that everybody in the group will be pinged when a review is posted.

In D104572#2848461, @ldionne wrote:

In D104572#2848282, @jdenny wrote:

I've seen these groups but haven't investigated how they work. Is the idea that someone from that group must accept or the patch is marked as blocked?

Yes, basically the review is marked as blocked until anyone from the group gives a green check-mark. It's also possible to make the group a non-blocking reviewer (or even just a subscriber). But having that helps make sure that everybody in the group will be pinged when a review is posted.

Makes sense to me. FileCheck could probably use this too. In either case, whether we need a blocking group isn't clear to me yet, so I'll have to defer to others on that point.

Pretty cool, looks useful! :)
Thanks to you both @ldionne and @jdenny!

I'll commit this now since it seems it has gotten the required attention, after all. If anyone gets here and thinks there's an issue, please ping the review and I can make changes in another commit or revert the patch, depending on what the issue is.

Closed by commit rGfec521a7b206: [lit] Add the ability to parse regexes in Lit boolean expressions (authored by ldionne). Jun 30 2021, 7:52 AM

This revision was automatically updated to reflect the committed changes.

ldionne added a commit: rGfec521a7b206: [lit] Add the ability to parse regexes in Lit boolean expressions.

probinson mentioned this in D107162: [lit] Have REQUIRES support the target triple. Jul 30 2021, 7:17 AM

Revision Contents

Path	Size
llvm/docs/TestingGuide.rst	4 lines
llvm/utils/lit/lit/BooleanExpression.py	84 lines
llvm/utils/lit/lit/Test.py	4 lines
llvm/utils/lit/tests/Inputs/show-used-features/mixed.txt	6 lines
llvm/utils/lit/tests/show-used-features.py	3 lines

Diff 355253

llvm/docs/TestingGuide.rst

Show First 20 Lines • Show All 456 Lines • ▼ Show 20 Lines	.. code-block:: llvm
; XFAIL: powerpc		; XFAIL: powerpc

``REQUIRES`` and ``UNSUPPORTED`` and ``XFAIL`` all accept a comma-separated		``REQUIRES`` and ``UNSUPPORTED`` and ``XFAIL`` all accept a comma-separated
list of boolean expressions. The values in each expression may be:		list of boolean expressions. The values in each expression may be:

- Features added to ``config.available_features`` by		- Features added to ``config.available_features`` by
configuration files such as ``lit.cfg``.		configuration files such as ``lit.cfg``.
- Substrings of the target triple (``UNSUPPORTED`` and ``XFAIL`` only).		- Substrings of the target triple (``UNSUPPORTED`` and ``XFAIL`` only).
		- Any Python regular expression enclosed in ``{{ }}``, in which case the boolean
		expression is satisfied if any feature matches the regular expression. Also note
		that regular expressions can appear inside an identifier, so for example
		``he{{l+}}o`` would match ``helo``, ``hello``, ``helllo``, and so on.

| ``REQUIRES`` enables the test if all expressions are true.		| ``REQUIRES`` enables the test if all expressions are true.
| ``UNSUPPORTED`` disables the test if any expression is true.		| ``UNSUPPORTED`` disables the test if any expression is true.
| ``XFAIL`` expects the test to fail if any expression is true.		| ``XFAIL`` expects the test to fail if any expression is true.

As a special case, ``XFAIL: *`` is expected to fail everywhere.		As a special case, ``XFAIL: *`` is expected to fail everywhere.

.. code-block:: llvm		.. code-block:: llvm

llvm/utils/lit/lit/BooleanExpression.py

import re		import re

class BooleanExpression:		class BooleanExpression:
# A simple evaluator of boolean expressions.		# A simple evaluator of boolean expressions.
#		#
# Grammar:		# Grammar:
# expr :: or_expr		# expr :: or_expr
# or_expr :: and_expr ('||' and_expr)*		# or_expr :: and_expr ('||' and_expr)*
# and_expr :: not_expr ('&&' not_expr)*		# and_expr :: not_expr ('&&' not_expr)*
# not_expr :: '!' not_expr		# not_expr :: '!' not_expr
# '(' or_expr ')'		# '(' or_expr ')'
		# match_expr
		# match_expr :: braced_regex
# identifier		# identifier
		# braced_regex match_expr
		# identifier match_expr
# identifier :: [-+=._a-zA-Z0-9]+		# identifier :: [-+=._a-zA-Z0-9]+
		# braced_regex :: '{{' python_regex '}}'

# Evaluates `string` as a boolean expression.		# Evaluates `string` as a boolean expression.
# Returns True or False. Throws a ValueError on syntax error.		# Returns True or False. Throws a ValueError on syntax error.
#		#
# Variables in `variables` are true.		# Variables in `variables` are true.
		# Regexes that match any variable in `variables` are true.
# Substrings of `triple` are true.		# Substrings of `triple` are true.
# 'true' is true.		# 'true' is true.
# All other identifiers are false.		# All other identifiers are false.
@staticmethod		@staticmethod
def evaluate(string, variables, triple=""):		def evaluate(string, variables, triple=""):
try:		try:
parser = BooleanExpression(string, set(variables), triple)		parser = BooleanExpression(string, set(variables), triple)
return parser.parseAll()		return parser.parseAll()
Show All 9 Lines	def __init__(self, string, variables, triple=""):
self.triple = triple		self.triple = triple
self.value = None		self.value = None
self.token = None		self.token = None

# Singleton end-of-expression marker.		# Singleton end-of-expression marker.
END = object()		END = object()

# Tokenization pattern.		# Tokenization pattern.
Pattern = re.compile(r'\A\s*([()]|[-+=._a-zA-Z0-9]+|&&|\|\||!)\s*(.*)\Z')		Pattern = re.compile(r'\A\s*([()]|&&|\|\||!|(?:[-+=._a-zA-Z0-9]+|\{\{.+?\}\})+)\s*(.*)\Z')

@staticmethod		@staticmethod
def tokenize(string):		def tokenize(string):
while True:		while True:
m = re.match(BooleanExpression.Pattern, string)		m = re.match(BooleanExpression.Pattern, string)
if m is None:		if m is None:
if string == "":		if string == "":
yield BooleanExpression.END;		yield BooleanExpression.END;
Show All 22 Lines	def expect(self, t):
if self.token == t:		if self.token == t:
if self.token != BooleanExpression.END:		if self.token != BooleanExpression.END:
self.token = next(self.tokens)		self.token = next(self.tokens)
else:		else:
raise ValueError("expected: %s\nhave: %s" %		raise ValueError("expected: %s\nhave: %s" %
(self.quote(t), self.quote(self.token)))		(self.quote(t), self.quote(self.token)))

@staticmethod		@staticmethod
def isIdentifier(token):		def isMatchExpression(token):
if (token is BooleanExpression.END or token == '&&' or token == '||' or		if (token is BooleanExpression.END or token == '&&' or token == '||' or
token == '!' or token == '(' or token == ')'):		token == '!' or token == '(' or token == ')'):
return False		return False
return True		return True

		def parseMATCH(self):
		regex = ''
		for part in filter(None, re.split(r'(\{\{.+?\}\})', self.token)):
		if part.startswith('{{'):
		assert part.endswith('}}')
		regex += '(?:{})'.format(part[2:-2])
		else:
		regex += re.escape(part)
		regex = re.compile(regex)
		self.value = self.token in self.triple or any(regex.fullmatch(var) for var in self.variables)
		self.token = next(self.tokens)

def parseNOT(self):		def parseNOT(self):
if self.accept('!'):		if self.accept('!'):
self.parseNOT()		self.parseNOT()
self.value = not self.value		self.value = not self.value
elif self.accept('('):		elif self.accept('('):
self.parseOR()		self.parseOR()
self.expect(')')		self.expect(')')
elif not BooleanExpression.isIdentifier(self.token):		elif not BooleanExpression.isMatchExpression(self.token):
raise ValueError("expected: '!' or '(' or identifier\nhave: %s" %		raise ValueError("expected: '!', '(', '{{', or identifier\nhave: %s" %
self.quote(self.token))		self.quote(self.token))
else:		else:
self.value = (self.token in self.variables or		self.parseMATCH()
self.token in self.triple)
self.token = next(self.tokens)

def parseAND(self):		def parseAND(self):
self.parseNOT()		self.parseNOT()
while self.accept('&&'):		while self.accept('&&'):
left = self.value		left = self.value
self.parseNOT()		self.parseNOT()
right = self.value		right = self.value
# this is technically the wrong associativity, but it		# this is technically the wrong associativity, but it
Show All 27 Lines	def test_variables(self):
variables = {'its-true', 'false-lol-true', 'under_score',		variables = {'its-true', 'false-lol-true', 'under_score',
'e=quals', 'd1g1ts'}		'e=quals', 'd1g1ts'}
self.assertTrue(BooleanExpression.evaluate('true', variables))		self.assertTrue(BooleanExpression.evaluate('true', variables))
self.assertTrue(BooleanExpression.evaluate('its-true', variables))		self.assertTrue(BooleanExpression.evaluate('its-true', variables))
self.assertTrue(BooleanExpression.evaluate('false-lol-true', variables))		self.assertTrue(BooleanExpression.evaluate('false-lol-true', variables))
self.assertTrue(BooleanExpression.evaluate('under_score', variables))		self.assertTrue(BooleanExpression.evaluate('under_score', variables))
self.assertTrue(BooleanExpression.evaluate('e=quals', variables))		self.assertTrue(BooleanExpression.evaluate('e=quals', variables))
self.assertTrue(BooleanExpression.evaluate('d1g1ts', variables))		self.assertTrue(BooleanExpression.evaluate('d1g1ts', variables))
		self.assertTrue(BooleanExpression.evaluate('{{its.+}}', variables))
		self.assertTrue(BooleanExpression.evaluate('{{false-[lo]+-true}}', variables))
		self.assertTrue(BooleanExpression.evaluate('{{(true|false)-lol-(true|false)}}', variables))
		self.assertTrue(BooleanExpression.evaluate('d1g{{[0-9]}}ts', variables))
		self.assertTrue(BooleanExpression.evaluate('d1g{{[0-9]}}t{{[a-z]}}', variables))
		self.assertTrue(BooleanExpression.evaluate('{{d}}1g{{[0-9]}}t{{[a-z]}}', variables))
		self.assertTrue(BooleanExpression.evaluate('d1{{(g|1)+}}ts', variables))

self.assertFalse(BooleanExpression.evaluate('false', variables))		self.assertFalse(BooleanExpression.evaluate('false', variables))
self.assertFalse(BooleanExpression.evaluate('True', variables))		self.assertFalse(BooleanExpression.evaluate('True', variables))
self.assertFalse(BooleanExpression.evaluate('true-ish', variables))		self.assertFalse(BooleanExpression.evaluate('true-ish', variables))
self.assertFalse(BooleanExpression.evaluate('not_true', variables))		self.assertFalse(BooleanExpression.evaluate('not_true', variables))
self.assertFalse(BooleanExpression.evaluate('tru', variables))		self.assertFalse(BooleanExpression.evaluate('tru', variables))
		self.assertFalse(BooleanExpression.evaluate('{{its-true.+}}', variables))

def test_triple(self):		def test_triple(self):
triple = 'arch-vendor-os'		triple = 'arch-vendor-os'
self.assertTrue(BooleanExpression.evaluate('arch-', {}, triple))		self.assertTrue(BooleanExpression.evaluate('arch-', {}, triple))
self.assertTrue(BooleanExpression.evaluate('ar', {}, triple))		self.assertTrue(BooleanExpression.evaluate('ar', {}, triple))
self.assertTrue(BooleanExpression.evaluate('ch-vend', {}, triple))		self.assertTrue(BooleanExpression.evaluate('ch-vend', {}, triple))
self.assertTrue(BooleanExpression.evaluate('-vendor-', {}, triple))		self.assertTrue(BooleanExpression.evaluate('-vendor-', {}, triple))
self.assertTrue(BooleanExpression.evaluate('-os', {}, triple))		self.assertTrue(BooleanExpression.evaluate('-os', {}, triple))
self.assertFalse(BooleanExpression.evaluate('arch-os', {}, triple))		self.assertFalse(BooleanExpression.evaluate('arch-os', {}, triple))

		# When matching against the triple, a regex is treated as an identifier and checked
		# for a literal match. This preserves existing behavior before regexes were introduced.
		self.assertFalse(BooleanExpression.evaluate('arch-{{vendor}}-os', {}, triple))
		self.assertTrue(BooleanExpression.evaluate('arch-{{vendor}}-os', {}, 'arch-{{vendor}}-os'))

		def test_matching(self):
		expr1 = 'linux && (target={{aarch64-.+}} || target={{x86_64-.+}})'
		self.assertTrue(BooleanExpression.evaluate(expr1, {'linux', 'target=x86_64-unknown-linux-gnu'}))
		self.assertFalse(BooleanExpression.evaluate(expr1, {'linux', 'target=i386-unknown-linux-gnu'}))

		expr2 = 'use_system_cxx_lib && target={{.+}}-apple-macosx10.{{9|10|11|12}} && !no-exceptions'
		self.assertTrue(BooleanExpression.evaluate(expr2, {'use_system_cxx_lib', 'target=arm64-apple-macosx10.12'}))
		self.assertFalse(BooleanExpression.evaluate(expr2, {'use_system_cxx_lib', 'target=arm64-apple-macosx10.12', 'no-exceptions'}))
		self.assertFalse(BooleanExpression.evaluate(expr2, {'use_system_cxx_lib', 'target=arm64-apple-macosx10.15'}))

def test_operators(self):		def test_operators(self):
self.assertTrue(BooleanExpression.evaluate('true || true', {}))		self.assertTrue(BooleanExpression.evaluate('true || true', {}))
self.assertTrue(BooleanExpression.evaluate('true || false', {}))		self.assertTrue(BooleanExpression.evaluate('true || false', {}))
self.assertTrue(BooleanExpression.evaluate('false || true', {}))		self.assertTrue(BooleanExpression.evaluate('false || true', {}))
self.assertFalse(BooleanExpression.evaluate('false || false', {}))		self.assertFalse(BooleanExpression.evaluate('false || false', {}))

self.assertTrue(BooleanExpression.evaluate('true && true', {}))		self.assertTrue(BooleanExpression.evaluate('true && true', {}))
self.assertFalse(BooleanExpression.evaluate('true && false', {}))		self.assertFalse(BooleanExpression.evaluate('true && false', {}))
Show All 31 Lines	def test_errors(self):
"in expression: 'ba#d'")		"in expression: 'ba#d'")

self.checkException("true and true",		self.checkException("true and true",
"expected: <end of expression>\n" +		"expected: <end of expression>\n" +
"have: 'and'\n" +		"have: 'and'\n" +
"in expression: 'true and true'")		"in expression: 'true and true'")

self.checkException("|| true",		self.checkException("|| true",
"expected: '!' or '(' or identifier\n" +		"expected: '!', '(', '{{', or identifier\n" +
"have: '||'\n" +		"have: '||'\n" +
"in expression: '|| true'")		"in expression: '|| true'")

self.checkException("true &&",		self.checkException("true &&",
"expected: '!' or '(' or identifier\n" +		"expected: '!', '(', '{{', or identifier\n" +
"have: <end of expression>\n" +		"have: <end of expression>\n" +
"in expression: 'true &&'")		"in expression: 'true &&'")

self.checkException("",		self.checkException("",
"expected: '!' or '(' or identifier\n" +		"expected: '!', '(', '{{', or identifier\n" +
"have: <end of expression>\n" +		"have: <end of expression>\n" +
"in expression: ''")		"in expression: ''")

self.checkException("*",		self.checkException("*",
"couldn't parse text: '*'\n" +		"couldn't parse text: '*'\n" +
"in expression: '*'")		"in expression: '*'")

self.checkException("no wait stop",		self.checkException("no wait stop",
Show All 11 Lines	def test_errors(self):
"in expression: '(((true && true) || true)'")		"in expression: '(((true && true) || true)'")

self.checkException("true (true)",		self.checkException("true (true)",
"expected: <end of expression>\n" +		"expected: <end of expression>\n" +
"have: '('\n" +		"have: '('\n" +
"in expression: 'true (true)'")		"in expression: 'true (true)'")

self.checkException("( )",		self.checkException("( )",
"expected: '!' or '(' or identifier\n" +		"expected: '!', '(', '{{', or identifier\n" +
"have: ')'\n" +		"have: ')'\n" +
"in expression: '( )'")		"in expression: '( )'")

		self.checkException("abc{{def",
		"couldn't parse text: '{{def'\n" +
		"in expression: 'abc{{def'")

		self.checkException("{{}}",
		"couldn't parse text: '{{}}'\n" +
		"in expression: '{{}}'")


if __name__ == '__main__':		if __name__ == '__main__':
unittest.main()		unittest.main()

llvm/utils/lit/lit/Test.py

Show First 20 Lines • Show All 402 Lines • ▼ Show 20 Lines	def getUsedFeatures(self):
feature_keywords = ('UNSUPPORTED:', 'REQUIRES:', 'XFAIL:')		feature_keywords = ('UNSUPPORTED:', 'REQUIRES:', 'XFAIL:')
boolean_expressions = itertools.chain.from_iterable(		boolean_expressions = itertools.chain.from_iterable(
parsed[k] or [] for k in feature_keywords		parsed[k] or [] for k in feature_keywords
)		)
tokens = itertools.chain.from_iterable(		tokens = itertools.chain.from_iterable(
BooleanExpression.tokenize(expr) for expr in		BooleanExpression.tokenize(expr) for expr in
boolean_expressions if expr != '*'		boolean_expressions if expr != '*'
)		)
identifiers = set(filter(BooleanExpression.isIdentifier, tokens))		matchExpressions = set(filter(BooleanExpression.isMatchExpression, tokens))
return identifiers		return matchExpressions

llvm/utils/lit/tests/Inputs/show-used-features/mixed.txt


	// REQUIRES: my-require-feature-2 || my-require-feature-3			// REQUIRES: my-require-feature-2 || my-require-feature-3, my-{{[require]*}}-feature-4
	// UNSUPPORTED: my-unsupported-feature-2, my-unsupported-feature-3			// UNSUPPORTED: my-unsupported-feature-2, my-unsupported-feature-3 && !my-{{[unsupported]*}}-feature-4
	// XFAIL: my-xfail-feature-2, my-xfail-feature-3			// XFAIL: my-xfail-feature-2, my-xfail-feature-3, my-{{[xfail]*}}-feature-4

llvm/utils/lit/tests/show-used-features.py

	# Check that --show-used-features works correctly.			# Check that --show-used-features works correctly.
	#			#
	# RUN: %{lit} %{inputs}/show-used-features --show-used-features | FileCheck %s			# RUN: %{lit} %{inputs}/show-used-features --show-used-features | FileCheck %s
	# CHECK: my-require-feature-1 my-require-feature-2 my-require-feature-3			# CHECK: my-require-feature-1 my-require-feature-2 my-require-feature-3
	# CHECK: my-unsupported-feature-1 my-unsupported-feature-2 my-unsupported-feature-3			# CHECK: my-unsupported-feature-1 my-unsupported-feature-2 my-unsupported-feature-3
	# CHECK: my-xfail-feature-1 my-xfail-feature-2 my-xfail-feature-3			# CHECK: my-xfail-feature-1 my-xfail-feature-2 my-xfail-feature-3
				# CHECK: {{my-[{][{]\[require\]\*[}][}]-feature-4}}
				# CHECK: {{my-[{][{]\[unsupported\]\*[}][}]-feature-4}}
				# CHECK: {{my-[{][{]\[xfail\]\*[}][}]-feature-4}}