This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
docs/CommandGuide/
-
CommandGuide/
3/17
FileCheck.rst
-
lib/Support/
-
Support/
7/22
FileCheck.cpp
1
FileCheckImpl.h
-
test/FileCheck/
-
FileCheck/
6
numeric-expression.txt
-
unittests/Support/
-
Support/
2
FileCheckTest.cpp

Differential D79936

[FileCheck] Add function call support to numerical expressions.
ClosedPublic

Authored by paulwalker-arm on May 14 2020, 3:48 AM.

Download Raw Diff

Details

Reviewers

thopre
arichardson
jhenderson

Commits

rG8fd227037024: [FileCheck] Add function call support to numerical expressions.

Summary

This patch extends numerical expressions to allow calls to
predefined functions. These calls can be combined with the
existing numerical operators, which includes nesting calls.

The call syntax is:

  <func>(<args>)

Where <func> is a predefined string literal, currently limited to
one of add, max, min and sub. <arg> is a comma seperated list of
numerical expressions.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

paulwalker-arm created this revision.May 14 2020, 3:48 AM

Herald added a project: Restricted Project. · View Herald TranscriptMay 14 2020, 3:48 AM

Herald added subscribers: llvm-commits, thopre, hiraditya, arichardson. · View Herald Transcript

This is very much work in progress but I welcome early feedback.

I don't know if a function name prefix is necessary but at this stage is allows me to ignore some corner cases. I'd also like to know whether I can get away with only supporting arbitrary argument counts at the parsing layer since currently I only need support for the usual two operand math operations.

paulwalker-arm mentioned this in D79882: [FileCheck] Add saturation support to numerical expressions..May 14 2020, 3:54 AM

paulwalker-arm mentioned this in D79885: [FileCheck] Add multiplication support to numerical expressions..

Harbormaster failed remote builds in B56719: Diff 263966!May 14 2020, 5:20 AM

Only a minor code change so still WIP, but with basic tests and documentation update.

I like this approach. Starting functions with ! seems reasonable since it is similar tablegen and if we decide that we don't need the prefix, we can always drop it.
The advantage of requiring the prefix is that we can give functions names that might also be commonly used capture names.

There should probably be some tests for error messages in llvm/unittests/Support/FileCheckTest.cpp.

Harbormaster failed remote builds in B56728: Diff 263984!May 14 2020, 6:58 AM

I believe the patch is now ready for review. There's an open question as to
whether the precedence operator (i.e. !()) is required, but since it's
implementation came largely for free I ran with it.

Harbormaster failed remote builds in B56890: Diff 264280!May 15 2020, 11:25 AM

Baseline update and fixed naming issue reported by clang-tidy.

Harbormaster failed remote builds in B57055: Diff 264591!May 18 2020, 6:25 AM

paulwalker-arm added reviewers: thopre, arichardson.May 18 2020, 7:06 AM

Added udiv to complement mul.

paulwalker-arm edited the summary of this revision. (Show Details)May 19 2020, 3:18 AM

Harbormaster completed remote builds in B57182: Diff 264835.May 19 2020, 5:22 AM

I don't know if there's an official mechanism beyond adding people but can I request code review please.

Please note that the patch to add support for signed values (https://reviews.llvm.org/D60390) is at an advanced stage of review.

llvm/docs/CommandGuide/FileCheck.rst
688	Why change this sentence? Recursion only happens on one of the operand only.
720–721	I don't think the exclamation mark should be required here. A parenthesis pair should be enough to force precedence. Note that there was a patch to adds support for that and you might want to rebase your patch on top of it.
llvm/lib/Support/FileCheck.cpp
625–628	Please group the two tests together, together they test whether it's an exit condition.

In D79936#2054735, @paulwalker-arm wrote:

I don't know if there's an official mechanism beyond adding people but can I request code review please.

Adding people as reviewer is the official way. The review policy [1] also says you can ping after 1 week if you didn't have any reply.

[1] https://llvm.org/docs/CodeReview.html

paulwalker-arm added inline comments.May 26 2020, 8:37 AM

llvm/docs/CommandGuide/FileCheck.rst
688	I don't understand the distinction. I changed the sentence as I didn't what to say: an expression followed by an operator and either a numerical operand or a function call. Are you saying the above is more correct? Also with functions you can have !(mul(VAR+(umin(VAR2,4)) + !(udivl(VAR3+(umax(VAR3,4))
720–721	Sure, it's more that this implementation comes for free, whereas supporting arbitrary parenthesis pairs along side function calls requires more complex parsing. Is it worth the extra effort? If so I'll happy prevent this usage and error out for unnamed functions. I'd rather not wait for the parenthesis work because I've got other work at review whose tests are much improved when I can make use of function calls.

thopre added inline comments.May 26 2020, 9:01 AM

llvm/docs/CommandGuide/FileCheck.rst
688	Oh right, I forgot about function altogether. That said, I think an expression is a kind of numeric operand so we should just expand the definition below saying it can also be a function call.

thopre added inline comments.May 26 2020, 9:01 AM

llvm/docs/CommandGuide/FileCheck.rst
720–721	I think the exclamation mark should be reserved for function call. I find it a bit confusing otherwise but let's wait to see what other reviewers might think. Do the tests you need this feature for require some way to force precedence or can this be dealt later?

arichardson added inline comments.May 26 2020, 9:12 AM

llvm/docs/CommandGuide/FileCheck.rst
720–721	Apologies for the delay, I've now rebased the parentheses revision (D77383).

arichardson added a reviewer: jhenderson.May 26 2020, 9:12 AM

paulwalker-arm marked 2 inline comments as done.May 26 2020, 9:27 AM

paulwalker-arm added inline comments.

llvm/docs/CommandGuide/FileCheck.rst
688	So something like: A numeric operand is a previously defined numeric variable, an integer literal or the result of a function call.
720–721	I currently have no use for the precedence operator, the function calls are why I started down this path, so am happy either way. Part of me just assumed that in the future the parser might be reworked to remove the need for the ! prefix, but I suppose providing the power of precedence early might prevent that work from happening :)

paulwalker-arm marked 2 inline comments as not done.May 26 2020, 9:37 AM

thopre added inline comments.May 26 2020, 9:51 AM

llvm/docs/CommandGuide/FileCheck.rst
688	I'm nitpicking but technically the operand is the function call itself, but evaluates to the return value of the function call.
720–721	I wouldn't hold my breath for a parser rewrite. While I'd love to make the code nicer and more flexible I have little free time to work on it myself. Anyway, if you don't require operator precedence just error on empty function name for now and we can extend it to be used for operator precedence later if needed.

rebase and post code review fixes

paulwalker-arm marked 3 inline comments as done.May 27 2020, 5:40 AM

paulwalker-arm added inline comments.

llvm/docs/CommandGuide/FileCheck.rst
720–721	Done. Perhaps it's a good idea to mandate that all symbolic operators have a named function counterpart. That way in the short term if somebody does want to force precedence they just need to write a slightly more verbose check line.

paulwalker-arm marked 2 inline comments as done.May 27 2020, 5:41 AM

thopre added inline comments.May 27 2020, 6:36 AM

llvm/docs/CommandGuide/FileCheck.rst
720–721	No objection to that, should be a 2 line changes, right?

paulwalker-arm added inline comments.May 27 2020, 7:05 AM

llvm/docs/CommandGuide/FileCheck.rst
720–721	Yep. I'll add entries for add and sub to this patch and resubmit later today.

Harbormaster completed remote builds in B58036: Diff 266502.May 27 2020, 7:33 AM

Added functions for add and sub.

Harbormaster failed remote builds in B58062: Diff 266552!May 27 2020, 9:11 AM

rebase

Harbormaster completed remote builds in B58076: Diff 266587.May 27 2020, 12:29 PM

LGTM if @thopre and @jhenderson are happy with this change too.

llvm/unittests/Support/FileCheckTest.cpp
1076	Might make sense to add case with missing operators such as `2!mul(FOO,2)` or `FOO !mul(FOO,2)` or `!mul(FOO(!mul(3,2)))`

This revision is now accepted and ready to land.May 28 2020, 5:38 AM

Some of my testing suggestions might better belong in the unit tests. Also, you're probably going to need to rebase and expand the behaviour somewhat now that signed values support has landed in D60390.

llvm/lib/Support/FileCheck.cpp
461	I'm not strongly opposed to the use of `!` to indicate a function call, but is it actually necessary? It seems like a function call could just be identified by `<sequence of identifier chars>(<some chars>)`. The code would look something like the following semi-pseudo code: size_t Parenthesis = Expr.find_first_of('('); if (Parenthesis != npos) { if (all_of(Expr.take_front(Parenthesis), [](char C) { return isValidIdentifierChar(C); }) { if (AO != AllowedOperand::Any) return ErrorDiagnostic::get(SM, Expr, "unexpected function call"); return parseCallExpr(Expr, LineNumber, Context, SM); } } Assuming I've not missed something, that would allow us to simplify all the usages of function calls.
595–596	I believe you could change this case to an asseertion - the parseExpr function treats a leading '(' as a different kind of expression.
667	In conjunction with my suggestion above about not having a function specifier, you could change this code to bail out without error in some cases, perhaps by starting with looking for the `%` followed by some specific characters, followed by a `,`.
llvm/lib/Support/FileCheckImpl.h
728–734	I think you need to delete "both" here, since there are now three different things it accepts.
llvm/test/FileCheck/numeric-expression.txt
120	Perhaps change the inner `UNSI` to `UNSI+1` or something to show that the argument of a function is any kind of expression? Up to you.
385	I would prefer these to be interleaved with their corresponding CHECK and input text: RUN: ... --check-prefix CALL-MISSING-OPENING-BRACKET ... CALL MISSING OPENING BRACKET 30 CALL-MISSING-OPENING-BRACKET-LABEL: ... ... RUN: ... --check-prefix CALL-MISSING-CLOSING-BRACKET ... CALL MISSING CLOSING BRACKET etc. It helps reduce the distance I have to look to find the thing being checked for.
410	There might want to be some interaction testing with plain parentheses. Something like `[[#!mul(NUMVAR,(NUMVAR+3))]]` and `[[#!mul(NUMVAR,(NUMVAR+3)]]` (the first should work, but not the second).
418	Nit: it would probably be best to make this call take two arguments.
426	I think you also want the following: `[[#!mul(,NUMVAR)]]` Possibly also `[[#!mul(NUMVAR,,NUMVAR)]]`
442	Nit: it would probably be best to make this call take two arguments.
llvm/unittests/Support/FileCheckTest.cpp
1076	+1

thopre added inline comments.May 29 2020, 4:07 AM

llvm/lib/Support/FileCheck.cpp
461	Regardless of the ease of implementation, I like the ! prefix since these are builtin functions/operators, not something the user can define. YMMV of course

jhenderson added inline comments.May 29 2020, 4:48 AM

llvm/lib/Support/FileCheck.cpp
461	I'm not sure why it matters that they are builtin? Even if we do provide the ability for users to define their own functions, surely their behaviour should be identical to built-in functions from the majority of the code's point-of-view? I'd actually think that including the `!` would make it harder to parse, since we'd have to support function calls both with and without the `!`.

arichardson added inline comments.May 29 2020, 5:27 AM

llvm/lib/Support/FileCheck.cpp
461	If we can make it work without the `!` without making the implementation much more complicated, I'd prefer that. But I don't feel strongly either way. Since the name needs to be followed by an open paren, even variables that have the same name as builtin functions should work: `[[#mul(mul, 2)]]`. The first one is a function name, the second must be a variable since there is no open paren.

Prefix aside I'm just doing another rebase to bring in the signedness work. I'd like to know if it's going to be a requirement to support the reporting of overflow/underflow for the builtins in my patch. Originally I had gone down the llvm route of making the operations signed rather than the data but I see the signedness patch implements the opposite. Ultimately I need to know if there's light at the end of the tunnel or whether to give up and just write ugly tests.

Rebased but still need to remove the function prefix and tighen up the mul/div operators.

Harbormaster failed remote builds in B58431: Diff 267249!May 29 2020, 9:13 AM

Added the suggested tests to the FileCheck unitest plus fleshed out overflow reporting.

Harbormaster failed remote builds in B58488: Diff 267350!May 29 2020, 2:47 PM

I have a clang-format query. I'm getting failures because clang-format is suggesting to use "-<space><digit>" to format a negative number. This doesn't seem correct to me and is not the style I see for existing code in FileCheckTests.cpp. Is this something I can ignore?

In D79936#2062802, @paulwalker-arm wrote:

I'd like to know if it's going to be a requirement to support the reporting of overflow/underflow for the builtins in my patch. Originally I had gone down the llvm route of making the operations signed rather than the data but I see the signedness patch implements the opposite. Ultimately I need to know if there's light at the end of the tunnel or whether to give up and just write ugly tests.

(Wrote this comment before I saw you added overflow/underflow support, but leaving it because it might give an idea of my thought process on why): Not quite sure I fully followed this comment. I think my preference would be to error out for overflows/underflows, rather than silently allowing them. If things are going to be significantly more complex adding them but you are also going to address them immediately, I'm okay with it being deferred to a future patch. What I don't want long-term is for people to be able to write unintentionally broken test cases because they happen to be triggering underflow/overflow behaviour. Broken test cases are bad!

In D79936#2064448, @paulwalker-arm wrote:

I have a clang-format query. I'm getting failures because clang-format is suggesting to use "-<space><digit>" to format a negative number. This doesn't seem correct to me and is not the style I see for existing code in FileCheckTests.cpp. Is this something I can ignore?

@thopre ran into this recently too. I consider it a bug in clang-format personally, so you can ignore it, but if @thopre hasn't already, you should file a clang-format bug so that it can get fixed.

llvm/docs/CommandGuide/FileCheck.rst
690	The "and have a 64-bit precision" bit seems a bit out of place here. It should probably be its own sentence.
691	Given the new stuff about functions, I might be tempted to pull out the sentence about supported operators into a list, a bit like that used for the accepted values, especially since it needs updating as things stand!
llvm/lib/Support/FileCheck.cpp
233	I think adding `operator*` etc makes sense, but it should be a separate patch to adding function support (probably a prerequisite). We don't want to cloud the intent of this patch by adding in other useful functionality, and it will make it easier to focus the reviewing. I'll save reviewing them for that patch.
317–318	Did you consider writing `min` in terms of `max` (or vice versa)? Not sure if it is a good thing to do or not, but I believe it would lead to less duplicated and more concise code. Something like: if (max(LeftOperand, RightOperand) == LeftOperand) return RightOperand; return LeftOperand;

paulwalker-arm marked 2 inline comments as done.Jun 1 2020, 3:32 AM

paulwalker-arm added inline comments.

llvm/lib/Support/FileCheck.cpp
233	I've created D80915 to add the new operator functions.

paulwalker-arm mentioned this in D80915: [FileCheck] Implement * and / operators for ExpressionValue..Jun 1 2020, 5:25 AM

In D79936#2065837, @jhenderson wrote:

In D79936#2062802, @paulwalker-arm wrote:

I'd like to know if it's going to be a requirement to support the reporting of overflow/underflow for the builtins in my patch. Originally I had gone down the llvm route of making the operations signed rather than the data but I see the signedness patch implements the opposite. Ultimately I need to know if there's light at the end of the tunnel or whether to give up and just write ugly tests.

(Wrote this comment before I saw you added overflow/underflow support, but leaving it because it might give an idea of my thought process on why): Not quite sure I fully followed this comment. I think my preference would be to error out for overflows/underflows, rather than silently allowing them. If things are going to be significantly more complex adding them but you are also going to address them immediately, I'm okay with it being deferred to a future patch. What I don't want long-term is for people to be able to write unintentionally broken test cases because they happen to be triggering underflow/overflow behaviour. Broken test cases are bad!

In D79936#2064448, @paulwalker-arm wrote:

I have a clang-format query. I'm getting failures because clang-format is suggesting to use "-<space><digit>" to format a negative number. This doesn't seem correct to me and is not the style I see for existing code in FileCheckTests.cpp. Is this something I can ignore?

@thopre ran into this recently too. I consider it a bug in clang-format personally, so you can ignore it, but if @thopre hasn't already, you should file a clang-format bug so that it can get fixed.

I'm planning to take a look at this clang-format bug today.

In D79936#2062802, @paulwalker-arm wrote:

Prefix aside I'm just doing another rebase to bring in the signedness work. I'd like to know if it's going to be a requirement to support the reporting of overflow/underflow for the builtins in my patch. Originally I had gone down the llvm route of making the operations signed rather than the data but I see the signedness patch implements the opposite. Ultimately I need to know if there's light at the end of the tunnel or whether to give up and just write ugly tests.

Using signed operation would mean throwing overflow when the result could be represented in uint64_t in some cases which felt weird, especially since we support unsigned values (e.g. addresses).

In D79936#2066160, @thopre wrote:

In D79936#2065837, @jhenderson wrote:

In D79936#2062802, @paulwalker-arm wrote:

I'd like to know if it's going to be a requirement to support the reporting of overflow/underflow for the builtins in my patch. Originally I had gone down the llvm route of making the operations signed rather than the data but I see the signedness patch implements the opposite. Ultimately I need to know if there's light at the end of the tunnel or whether to give up and just write ugly tests.

(Wrote this comment before I saw you added overflow/underflow support, but leaving it because it might give an idea of my thought process on why): Not quite sure I fully followed this comment. I think my preference would be to error out for overflows/underflows, rather than silently allowing them. If things are going to be significantly more complex adding them but you are also going to address them immediately, I'm okay with it being deferred to a future patch. What I don't want long-term is for people to be able to write unintentionally broken test cases because they happen to be triggering underflow/overflow behaviour. Broken test cases are bad!

In D79936#2064448, @paulwalker-arm wrote:

I have a clang-format query. I'm getting failures because clang-format is suggesting to use "-<space><digit>" to format a negative number. This doesn't seem correct to me and is not the style I see for existing code in FileCheckTests.cpp. Is this something I can ignore?

@thopre ran into this recently too. I consider it a bug in clang-format personally, so you can ignore it, but if @thopre hasn't already, you should file a clang-format bug so that it can get fixed.

I'm planning to take a look at this clang-format bug today.

Seems to be related to the use of operator. I've created PR46157

In D79936#2066379, @thopre wrote:

In D79936#2066160, @thopre wrote:

I'm planning to take a look at this clang-format bug today.

Seems to be related to the use of operator. I've created PR46157

Someone already posted a patch for it: https://reviews.llvm.org/D80933. It works for 23ac16cf9bd4cc0bb434efcf6385baf083a2ff7b.

paulwalker-arm marked an inline comment as done.Jun 1 2020, 11:03 AM

paulwalker-arm added inline comments.

llvm/lib/Support/FileCheck.cpp
461	For what it's worth I took a run at an implementation that doesn't require a call prefix and whilst almost as simple as suggested above there are a couple of downsides. (1) Calls without a prefix require look ahead parsing, which means redundant work continually looking for functions that might never be there. For example, take the parsing of var_a + var_b + var_c + var_d - (var_e - var_f) where parseNumericOperand is likely to perform many failed parse attempts. (2) Some diagnostics become harder or impossible. For example, is "mu(la+b)" a call to an unknown function, a missing operator or a bracket typo. I know the same scenario is true if the user forgets the prefix but when they don't, we can emit a more useful message. You can see another example in this patch where it's easier to spot and report a missing operator before a function call. (3) A prefix allows the use of symbols that might otherwise be confusing. Tenuous I know but consider "a + !operator+(b+c)". (4) Are there any plans for VAR1(VAR2+VAR3) as short hand for mul(VAR1, VAR2+VAR3)? A downside of the prefix is that we cannot easily use "!" to mean "not". That said "!(var)" support is only a minor modification. I don't know if these are strong reasons to go with a prefix and ultimately either approach works for me, so just let me know the preference and I'll make the necessary changes.

Taking conversation out-of-line to make it easier. My personal beliefs are as follows:

(1) Calls without a prefix require look ahead parsing, which means redundant work continually looking for functions that might never be there. For example, take the parsing of var_a + var_b + var_c + var_d - (var_e - var_f) where parseNumericOperand is likely to perform many failed parse attempts.

By look ahead parsing are you saying something like to identify whether var_a is actually a function, we have to parse the +? I've not given too much thought to this, but I think this extra work can be avoided by delaying handling of a token until the next token is identified. Thus an identifier token is left unprocessed until the next token has been read in, at which point it is either processed as a function or a variable. However, I accept that might need some rewriting of the existing parsing code. Trying to parse something as a numeric operand as a first attempt before trying to read it as something else seems like the wrong approach long-term as we add more power to these expressions.

(2) Some diagnostics become harder or impossible. For example, is "mu(la+b)" a call to an unknown function, a missing operator or a bracket typo. I know the same scenario is true if the user forgets the prefix but when they don't, we can emit a more useful message. You can see another example in this patch where it's easier to spot and report a missing operator before a function call.

I think it's okay in that case to treat that as an attempt to call function mu, which probably doesn't exist. I'm not sure having the prefix helps in this case: !mu(la+b) is just as unknown a function. The function name is delimited by the end of the previous token and the opening parenthesis in the unprefixed case. I'm not sure I see the case where it's easier to spot a missing operand. The examples in the patch are (I think):

(1)(2) - this is unaffected - no identifier means these are treated as parenthesised expressions.
2(X) - 2 will be parsed as a numeric literal, so this is still a missing operator.
2!mul(FOO,2) - without the !, either the 2 is treated as a separate token, because it can't be the first character of an identifier (and therefore again a missing operator diagnostic is still possible), or 2mul becomes an invalid identifier, with corresponding message. I think either is an acceptable error.
FOO !mul(FOO,2) - without the !, FOO is still a separate token because of the whitespace, so the missing operand is easily identifiable.

(3) A prefix allows the use of symbols that might otherwise be confusing. Tenuous I know but consider "a + !operator+(b+c)".

I think we either a) don't need to support such expressions or b) special-case + following the term operator or equivalent sequences. I think we just want to keep our function names to valid identifiers like we already have for variable names.

(4) Are there any plans for VAR1(VAR2+VAR3) as short hand for mul(VAR1, VAR2+VAR3)?

I'm not aware of any plans, and I don't think they're really needed (especially if we add support for * as a binary operator for multiplication).

A downside of the prefix is that we cannot easily use "!" to mean "not". That said "!(var)" support is only a minor modification.

I don't think we have any plans or need for boolean support (tools don't generally print "true" or "false" and other things like 1 or 0 can be supported using regular numeric expressions). We will probably want to support != as a comparator though at some point, so we need to allow for that.

I don't know if these are strong reasons to go with a prefix and ultimately either approach works for me, so just let me know the preference and I'll make the necessary changes.

My personal preference is still no prefix. Related aside is this comic: https://xkcd.com/1306/ (especially the alt text).

Thanks for the discussion @jhenderson.

I've removed the function prefix and updated the tests accordingly. I've also removed div and mul support so that it's no longer dependent on D80915, which I'll progress separately.

arichardson added inline comments.Jun 4 2020, 11:58 AM

llvm/lib/Support/FileCheck.cpp
608	Should this be ltrim?

Harbormaster completed remote builds in B59105: Diff 268509.Jun 4 2020, 12:09 PM

paulwalker-arm marked an inline comment as done.Jun 4 2020, 1:33 PM

paulwalker-arm added inline comments.

llvm/lib/Support/FileCheck.cpp
608	It looks like redundant code as space is already trimmed before loop entry and exit. I'll remove it.

Removed redundant rtrim.

Harbormaster completed remote builds in B59141: Diff 268580.Jun 4 2020, 3:30 PM

A couple of test cases that might want adding:

Trying to use a variable as a function (e.g. VAR1(1, 2))
Trying to use a function as a variable (e.g. max + min)
Maybe even defining a variable explicitly as a recognised function name (e.g. max + max(1, 2) or even max + max(max, max)).

I reckon the first should be treated as an unrecognised variable, and the others allowed (although the second one probably would be using undefined variables).

llvm/docs/CommandGuide/FileCheck.rst
702	Perhaps "Accepted" rather than "Acceptable"
llvm/lib/Support/FileCheck.cpp
233	Should this be an `Expected` if it can't fail? Same for `min`.
255	This `cantFail` call suggests to me that this shouldn't be an `Expected` return.
369–375	We should probably allow for optional whitespace between the end of the function name and the `(`.

paulwalker-arm edited the summary of this revision. (Show Details)Jun 5 2020, 3:20 AM

llvm/lib/Support/FileCheck.cpp
233	This is to match the binop_eval_t typedef required by BinaryOperation.
255	As explained above. There did not seem much value in creating a duplicate set of functions (i.e. with and without an Expected result) given this is the only other use.
369–375	In LLVM's c/c++ world clang-format will remove such whitespace (presumably to aid readability) so do we really want to allow it in FileCheck code?

jhenderson added inline comments.Jun 5 2020, 4:00 AM

llvm/lib/Support/FileCheck.cpp
233	Thanks, makes sense.
369–375	We've allowed arbitrary whitespace everywhere else in the expressions, so I think we should. Not all environments will necessarily follow LLVM's coding standards.

I initially misread the "max + max" related comments and went down the wrong path. Happily it was not in vain as it prompted me to reuse parseVariable because the function relates to identifiers rather than just variables. I've also simplied parseVariable a little but stopped short of renaming things.

Harbormaster failed remote builds in B59364: Diff 269003!Jun 6 2020, 3:42 AM

Tip to help reviewers: click the "Done" box on inline comments before uploading a patch to indicate you have addressed a specific comment. The comment will then be marked as Done when you upload the next diff.

LGTM, thanks, but wait for others.

llvm/lib/Support/FileCheck.cpp
350	I know this was here before, but since you are modifying this line, you can fix on the way (or in a separate commit before if you prefer): use `size_t` rather than `unsigned` to match the return type of `Str.size()`. Optionally also update `I` to `size_t` for the same reason.

arichardson accepted this revision.Jun 9 2020, 4:35 AM

Closed by commit rG8fd227037024: [FileCheck] Add function call support to numerical expressions. (authored by paulwalker-arm). · Explain WhyJun 10 2020, 3:14 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

llvm/

docs/

CommandGuide/

FileCheck.rst

27 lines

lib/

Support/

FileCheck.cpp

131 lines

FileCheckImpl.h

24 lines

test/

FileCheck/

numeric-expression.txt

81 lines

unittests/

Support/

FileCheckTest.cpp

96 lines

Diff 269776

llvm/docs/CommandGuide/FileCheck.rst

Show First 20 Lines • Show All 679 Lines • ▼ Show 20 Lines	* ``%<fmtspec>`` is the same matching format specifier as for defining numeric
is used. In case of conflict between matching formats of several numeric		is used. In case of conflict between matching formats of several numeric
variables the format specifier is mandatory.		variables the format specifier is mandatory.

* ``<expr>`` is an expression. An expression is in turn recursively defined		* ``<expr>`` is an expression. An expression is in turn recursively defined
as:		as:

* a numeric operand, or		* a numeric operand, or
* an expression followed by an operator and a numeric operand.		* an expression followed by an operator and a numeric operand.

		thopreUnsubmitted Not Done Reply Inline Actions Why change this sentence? Recursion only happens on one of the operand only. thopre: Why change this sentence? Recursion only happens on one of the operand only.
		paulwalker-armAuthorUnsubmitted Not Done Reply Inline Actions I don't understand the distinction. I changed the sentence as I didn't what to say: an expression followed by an operator and either a numerical operand or a function call. Are you saying the above is more correct? Also with functions you can have !(mul(VAR+(umin(VAR2,4)) + !(udivl(VAR3+(umax(VAR3,4)) paulwalker-arm: I don't understand the distinction. I changed the sentence as I didn't what to say: ``` an…
		thopreUnsubmitted Not Done Reply Inline Actions Oh right, I forgot about function altogether. That said, I think an expression is a kind of numeric operand so we should just expand the definition below saying it can also be a function call. thopre: Oh right, I forgot about function altogether. That said, I think an expression is a kind of…
		paulwalker-armAuthorUnsubmitted Not Done Reply Inline Actions So something like: A numeric operand is a previously defined numeric variable, an integer literal or the result of a function call. paulwalker-arm: So something like: ``` A numeric operand is a previously defined numeric variable, an integer…
		thopreUnsubmitted Done Reply Inline Actions I'm nitpicking but technically the operand is the function call itself, but evaluates to the return value of the function call. thopre: I'm nitpicking but technically the operand is the function call itself, but evaluates to the…
A numeric operand is a previously defined numeric variable, or an integer		A numeric operand is a previously defined numeric variable, an integer
literal and have a 64-bit precision. The supported operators are ``+`` and		literal, or a function. Spaces are accepted before, after and between any of
		jhendersonUnsubmitted Not Done Reply Inline Actions The "and have a 64-bit precision" bit seems a bit out of place here. It should probably be its own sentence. jhenderson: The "and have a 64-bit precision" bit seems a bit out of place here. It should probably be its…
``-``. Spaces are accepted before, after and between any of these elements.		these elements. Numeric operands have 64-bit precision. Overflow and underflow
		jhendersonUnsubmitted Not Done Reply Inline Actions Given the new stuff about functions, I might be tempted to pull out the sentence about supported operators into a list, a bit like that used for the accepted values, especially since it needs updating as things stand! jhenderson: Given the new stuff about functions, I might be tempted to pull out the sentence about…
Overflow and underflow are rejected. There is currently no support for		are rejected. There is no support for operator precendence, but parentheses
operator precendence, but parentheses can be used to change the evaluation		can be used to change the evaluation order.
order.
		The supported operators are:

		* ``+`` - Returns the sum of its two operands.
		* ``-`` - Returns the difference of its two operands.

		The syntax of a function call is ``<name>(<arguments>)`` where:

		* ``name`` is a predefined string literal. Accepted values are:
		jhendersonUnsubmitted Not Done Reply Inline Actions Perhaps "Accepted" rather than "Acceptable" jhenderson: Perhaps "Accepted" rather than "Acceptable"

		* add - Returns the sum of its two operands.
		* max - Returns the largest of its two operands.
		* min - Returns the smallest of its two operands.
		* sub - Returns the difference of its two operands.

		* ``<arguments>`` is a comma seperated list of expressions.

For example:		For example:

.. code-block:: llvm		.. code-block:: llvm

; CHECK: load r[[#REG:]], [r0]		; CHECK: load r[[#REG:]], [r0]
; CHECK: load r[[#REG+1]], [r1]		; CHECK: load r[[#REG+1]], [r1]
; CHECK: Loading from 0x[[#%x,ADDR:]]		; CHECK: Loading from 0x[[#%x,ADDR:]]
; CHECK-SAME: to 0x[[#ADDR + 7]]		; CHECK-SAME: to 0x[[#ADDR + 7]]

The above example would match the text:		The above example would match the text:

		thopreUnsubmitted Not Done Reply Inline Actions I don't think the exclamation mark should be required here. A parenthesis pair should be enough to force precedence. Note that there was a patch to adds support for that and you might want to rebase your patch on top of it. thopre: I don't think the exclamation mark should be required here. A parenthesis pair should be enough…
		paulwalker-armAuthorUnsubmitted Not Done Reply Inline Actions Sure, it's more that this implementation comes for free, whereas supporting arbitrary parenthesis pairs along side function calls requires more complex parsing. Is it worth the extra effort? If so I'll happy prevent this usage and error out for unnamed functions. I'd rather not wait for the parenthesis work because I've got other work at review whose tests are much improved when I can make use of function calls. paulwalker-arm: Sure, it's more that this implementation comes for free, whereas supporting arbitrary…
		thopreUnsubmitted Not Done Reply Inline Actions I think the exclamation mark should be reserved for function call. I find it a bit confusing otherwise but let's wait to see what other reviewers might think. Do the tests you need this feature for require some way to force precedence or can this be dealt later? thopre: I think the exclamation mark should be reserved for function call. I find it a bit confusing…
		paulwalker-armAuthorUnsubmitted Not Done Reply Inline Actions I currently have no use for the precedence operator, the function calls are why I started down this path, so am happy either way. Part of me just assumed that in the future the parser might be reworked to remove the need for the ! prefix, but I suppose providing the power of precedence early might prevent that work from happening :) paulwalker-arm: I currently have no use for the precedence operator, the function calls are why I started down…
		thopreUnsubmitted Done Reply Inline Actions I wouldn't hold my breath for a parser rewrite. While I'd love to make the code nicer and more flexible I have little free time to work on it myself. Anyway, if you don't require operator precedence just error on empty function name for now and we can extend it to be used for operator precedence later if needed. thopre: I wouldn't hold my breath for a parser rewrite. While I'd love to make the code nicer and more…
		paulwalker-armAuthorUnsubmitted Done Reply Inline Actions Done. Perhaps it's a good idea to mandate that all symbolic operators have a named function counterpart. That way in the short term if somebody does want to force precedence they just need to write a slightly more verbose check line. paulwalker-arm: Done. Perhaps it's a good idea to mandate that all symbolic operators have a named function…
		thopreUnsubmitted Not Done Reply Inline Actions No objection to that, should be a 2 line changes, right? thopre: No objection to that, should be a 2 line changes, right?
		paulwalker-armAuthorUnsubmitted Not Done Reply Inline Actions Yep. I'll add entries for add and sub to this patch and resubmit later today. paulwalker-arm: Yep. I'll add entries for add and sub to this patch and resubmit later today.
		arichardsonUnsubmitted Not Done Reply Inline Actions Apologies for the delay, I've now rebased the parentheses revision (D77383). arichardson: Apologies for the delay, I've now rebased the parentheses revision (D77383).
.. code-block:: gas		.. code-block:: gas

load r5, [r0]		load r5, [r0]
load r6, [r1]		load r6, [r1]
Loading from 0xa0463440 to 0xa0463447		Loading from 0xa0463440 to 0xa0463447

but would not match the text:		but would not match the text:

▲ Show 20 Lines • Show All 93 Lines • Show Last 20 Lines

llvm/lib/Support/FileCheck.cpp

Show First 20 Lines • Show All 224 Lines • ▼ Show 20 Lines	if (AbsoluteDifference > MaxInt64) {
Result -= static_cast<int64_t>(AbsoluteDifference);		Result -= static_cast<int64_t>(AbsoluteDifference);
return ExpressionValue(Result);		return ExpressionValue(Result);
}		}

return ExpressionValue(-static_cast<int64_t>(AbsoluteDifference));		return ExpressionValue(-static_cast<int64_t>(AbsoluteDifference));
}		}
}		}

		Expected<ExpressionValue> llvm::max(const ExpressionValue &LeftOperand,
		jhendersonUnsubmitted Done Reply Inline Actions I think adding `operator` etc makes sense, but it should be a separate patch to adding function support (probably a prerequisite). We don't want to cloud the intent of this patch by adding in other useful functionality, and it will make it easier to focus the reviewing. I'll save reviewing them for that patch. jhenderson:* I think adding `operator*` etc makes sense, but it should be a separate patch to adding…
		paulwalker-armAuthorUnsubmitted Done Reply Inline Actions I've created D80915 to add the new operator functions. paulwalker-arm: I've created D80915 to add the new operator functions.
		jhendersonUnsubmitted Not Done Reply Inline Actions Should this be an `Expected` if it can't fail? Same for `min`. jhenderson: Should this be an `Expected` if it can't fail? Same for `min`.
		paulwalker-armAuthorUnsubmitted Done Reply Inline Actions This is to match the binop_eval_t typedef required by BinaryOperation. paulwalker-arm: This is to match the binop_eval_t typedef required by BinaryOperation.
		jhendersonUnsubmitted Not Done Reply Inline Actions Thanks, makes sense. jhenderson: Thanks, makes sense.
		const ExpressionValue &RightOperand) {
		if (LeftOperand.isNegative() && RightOperand.isNegative()) {
		int64_t LeftValue = cantFail(LeftOperand.getSignedValue());
		int64_t RightValue = cantFail(RightOperand.getSignedValue());
		return ExpressionValue(std::max(LeftValue, RightValue));
		}

		if (!LeftOperand.isNegative() && !RightOperand.isNegative()) {
		uint64_t LeftValue = cantFail(LeftOperand.getUnsignedValue());
		uint64_t RightValue = cantFail(RightOperand.getUnsignedValue());
		return ExpressionValue(std::max(LeftValue, RightValue));
		}

		if (LeftOperand.isNegative())
		return RightOperand;

		return LeftOperand;
		}

		Expected<ExpressionValue> llvm::min(const ExpressionValue &LeftOperand,
		const ExpressionValue &RightOperand) {
		if (cantFail(max(LeftOperand, RightOperand)) == LeftOperand)
		jhendersonUnsubmitted Not Done Reply Inline Actions This `cantFail` call suggests to me that this shouldn't be an `Expected` return. jhenderson: This `cantFail` call suggests to me that this shouldn't be an `Expected` return.
		paulwalker-armAuthorUnsubmitted Done Reply Inline Actions As explained above. There did not seem much value in creating a duplicate set of functions (i.e. with and without an Expected result) given this is the only other use. paulwalker-arm: As explained above. There did not seem much value in creating a duplicate set of functions (i.e.
		return RightOperand;

		return LeftOperand;
		}

Expected<ExpressionValue> NumericVariableUse::eval() const {		Expected<ExpressionValue> NumericVariableUse::eval() const {
Optional<ExpressionValue> Value = Variable->getValue();		Optional<ExpressionValue> Value = Variable->getValue();
if (Value)		if (Value)
return *Value;		return *Value;

return make_error<UndefVarError>(getExpressionStr());		return make_error<UndefVarError>(getExpressionStr());
}		}

Show All 40 Lines	BinaryOperation::getImplicitFormat(const SourceMgr &SM) const {

return LeftFormat != ExpressionFormat::Kind::NoFormat ? LeftFormat		return LeftFormat != ExpressionFormat::Kind::NoFormat ? LeftFormat
: *RightFormat;		: *RightFormat;
}		}

Expected<std::string> NumericSubstitution::getResult() const {		Expected<std::string> NumericSubstitution::getResult() const {
assert(ExpressionPointer->getAST() != nullptr &&		assert(ExpressionPointer->getAST() != nullptr &&
"Substituting empty expression");		"Substituting empty expression");
Expected<ExpressionValue> EvaluatedValue =		Expected<ExpressionValue> EvaluatedValue =
ExpressionPointer->getAST()->eval();		ExpressionPointer->getAST()->eval();
		jhendersonUnsubmitted Not Done Reply Inline Actions Did you consider writing `min` in terms of `max` (or vice versa)? Not sure if it is a good thing to do or not, but I believe it would lead to less duplicated and more concise code. Something like: if (max(LeftOperand, RightOperand) == LeftOperand) return RightOperand; return LeftOperand; jhenderson: Did you consider writing `min` in terms of `max` (or vice versa)? Not sure if it is a good…
if (!EvaluatedValue)		if (!EvaluatedValue)
return EvaluatedValue.takeError();		return EvaluatedValue.takeError();
ExpressionFormat Format = ExpressionPointer->getFormat();		ExpressionFormat Format = ExpressionPointer->getFormat();
return Format.getMatchingString(*EvaluatedValue);		return Format.getMatchingString(*EvaluatedValue);
}		}

Expected<std::string> StringSubstitution::getResult() const {		Expected<std::string> StringSubstitution::getResult() const {
// Look up the value and escape it so that we can put it into the regex.		// Look up the value and escape it so that we can put it into the regex.
Expected<StringRef> VarVal = Context->getPatternVarValue(FromStr);		Expected<StringRef> VarVal = Context->getPatternVarValue(FromStr);
if (!VarVal)		if (!VarVal)
return VarVal.takeError();		return VarVal.takeError();
return Regex::escape(*VarVal);		return Regex::escape(*VarVal);
}		}

bool Pattern::isValidVarNameStart(char C) { return C == '_' \|\| isAlpha(C); }		bool Pattern::isValidVarNameStart(char C) { return C == '_' \|\| isAlpha(C); }

Expected<Pattern::VariableProperties>		Expected<Pattern::VariableProperties>
Pattern::parseVariable(StringRef &Str, const SourceMgr &SM) {		Pattern::parseVariable(StringRef &Str, const SourceMgr &SM) {
if (Str.empty())		if (Str.empty())
return ErrorDiagnostic::get(SM, Str, "empty variable name");		return ErrorDiagnostic::get(SM, Str, "empty variable name");

bool ParsedOneChar = false;		size_t I = 0;
unsigned I = 0;
bool IsPseudo = Str[0] == '@';		bool IsPseudo = Str[0] == '@';

// Global vars start with '$'.		// Global vars start with '$'.
if (Str[0] == '$' \|\| IsPseudo)		if (Str[0] == '$' \|\| IsPseudo)
++I;		++I;

for (unsigned E = Str.size(); I != E; ++I) {		if (!isValidVarNameStart(Str[I++]))
if (!ParsedOneChar && !isValidVarNameStart(Str[I]))
return ErrorDiagnostic::get(SM, Str, "invalid variable name");		return ErrorDiagnostic::get(SM, Str, "invalid variable name");

		for (size_t E = Str.size(); I != E; ++I)
		jhendersonUnsubmitted Not Done Reply Inline Actions I know this was here before, but since you are modifying this line, you can fix on the way (or in a separate commit before if you prefer): use `size_t` rather than `unsigned` to match the return type of `Str.size()`. Optionally also update `I` to `size_t` for the same reason. jhenderson: I know this was here before, but since you are modifying this line, you can fix on the way (or…
// Variable names are composed of alphanumeric characters and underscores.		// Variable names are composed of alphanumeric characters and underscores.
if (Str[I] != '_' && !isAlnum(Str[I]))		if (Str[I] != '_' && !isAlnum(Str[I]))
break;		break;
ParsedOneChar = true;
}

StringRef Name = Str.take_front(I);		StringRef Name = Str.take_front(I);
Str = Str.substr(I);		Str = Str.substr(I);
return VariableProperties {Name, IsPseudo};		return VariableProperties {Name, IsPseudo};
}		}

// StringRef holding all characters considered as horizontal whitespaces by		// StringRef holding all characters considered as horizontal whitespaces by
// FileCheck input canonicalization.		// FileCheck input canonicalization.
constexpr StringLiteral SpaceChars = " \t";		constexpr StringLiteral SpaceChars = " \t";

// Parsing helper function that strips the first character in S and returns it.		// Parsing helper function that strips the first character in S and returns it.
static char popFront(StringRef &S) {		static char popFront(StringRef &S) {
char C = S.front();		char C = S.front();
S = S.drop_front();		S = S.drop_front();
return C;		return C;
}		}

char OverflowError::ID = 0;		char OverflowError::ID = 0;
char UndefVarError::ID = 0;		char UndefVarError::ID = 0;
char ErrorDiagnostic::ID = 0;		char ErrorDiagnostic::ID = 0;
char NotFoundError::ID = 0;		char NotFoundError::ID = 0;

		jhendersonUnsubmitted Not Done Reply Inline Actions We should probably allow for optional whitespace between the end of the function name and the `(`. jhenderson: We should probably allow for optional whitespace between the end of the function name and the `…
		paulwalker-armAuthorUnsubmitted Done Reply Inline Actions In LLVM's c/c++ world clang-format will remove such whitespace (presumably to aid readability) so do we really want to allow it in FileCheck code? paulwalker-arm: In LLVM's c/c++ world clang-format will remove such whitespace (presumably to aid readability)…
		jhendersonUnsubmitted Not Done Reply Inline Actions We've allowed arbitrary whitespace everywhere else in the expressions, so I think we should. Not all environments will necessarily follow LLVM's coding standards. jhenderson: We've allowed arbitrary whitespace everywhere else in the expressions, so I think we should.
Expected<NumericVariable *> Pattern::parseNumericVariableDefinition(		Expected<NumericVariable *> Pattern::parseNumericVariableDefinition(
StringRef &Expr, FileCheckPatternContext *Context,		StringRef &Expr, FileCheckPatternContext *Context,
Optional<size_t> LineNumber, ExpressionFormat ImplicitFormat,		Optional<size_t> LineNumber, ExpressionFormat ImplicitFormat,
const SourceMgr &SM) {		const SourceMgr &SM) {
Expected<VariableProperties> ParseVarResult = parseVariable(Expr, SM);		Expected<VariableProperties> ParseVarResult = parseVariable(Expr, SM);
if (!ParseVarResult)		if (!ParseVarResult)
return ParseVarResult.takeError();		return ParseVarResult.takeError();
StringRef Name = ParseVarResult->Name;		StringRef Name = ParseVarResult->Name;
▲ Show 20 Lines • Show All 69 Lines • ▼ Show 20 Lines	Expected<std::unique_ptr<ExpressionAST>> Pattern::parseNumericOperand(
if (Expr.startswith("(")) {		if (Expr.startswith("(")) {
if (AO != AllowedOperand::Any)		if (AO != AllowedOperand::Any)
return ErrorDiagnostic::get(		return ErrorDiagnostic::get(
SM, Expr, "parenthesized expression not permitted here");		SM, Expr, "parenthesized expression not permitted here");
return parseParenExpr(Expr, LineNumber, Context, SM);		return parseParenExpr(Expr, LineNumber, Context, SM);
}		}

if (AO == AllowedOperand::LineVar \|\| AO == AllowedOperand::Any) {		if (AO == AllowedOperand::LineVar \|\| AO == AllowedOperand::Any) {
// Try to parse as a numeric variable use.		// Try to parse as a numeric variable use.
		jhendersonUnsubmitted Not Done Reply Inline Actions I'm not strongly opposed to the use of `!` to indicate a function call, but is it actually necessary? It seems like a function call could just be identified by `<sequence of identifier chars>(<some chars>)`. The code would look something like the following semi-pseudo code: size_t Parenthesis = Expr.find_first_of('('); if (Parenthesis != npos) { if (all_of(Expr.take_front(Parenthesis), [](char C) { return isValidIdentifierChar(C); }) { if (AO != AllowedOperand::Any) return ErrorDiagnostic::get(SM, Expr, "unexpected function call"); return parseCallExpr(Expr, LineNumber, Context, SM); } } Assuming I've not missed something, that would allow us to simplify all the usages of function calls. jhenderson: I'm not strongly opposed to the use of `!` to indicate a function call, but is it actually…
		thopreUnsubmitted Not Done Reply Inline Actions Regardless of the ease of implementation, I like the ! prefix since these are builtin functions/operators, not something the user can define. YMMV of course thopre: Regardless of the ease of implementation, I like the ! prefix since these are builtin…
		jhendersonUnsubmitted Not Done Reply Inline Actions I'm not sure why it matters that they are builtin? Even if we do provide the ability for users to define their own functions, surely their behaviour should be identical to built-in functions from the majority of the code's point-of-view? I'd actually think that including the `!` would make it harder to parse, since we'd have to support function calls both with and without the `!`. jhenderson: I'm not sure why it matters that they are builtin? Even if we do provide the ability for users…
		arichardsonUnsubmitted Not Done Reply Inline Actions If we can make it work without the `!` without making the implementation much more complicated, I'd prefer that. But I don't feel strongly either way. Since the name needs to be followed by an open paren, even variables that have the same name as builtin functions should work: `[[#mul(mul, 2)]]`. The first one is a function name, the second must be a variable since there is no open paren. arichardson: If we can make it work without the `!` without making the implementation much more complicated…
		paulwalker-armAuthorUnsubmitted Not Done Reply Inline Actions For what it's worth I took a run at an implementation that doesn't require a call prefix and whilst almost as simple as suggested above there are a couple of downsides. (1) Calls without a prefix require look ahead parsing, which means redundant work continually looking for functions that might never be there. For example, take the parsing of var_a + var_b + var_c + var_d - (var_e - var_f) where parseNumericOperand is likely to perform many failed parse attempts. (2) Some diagnostics become harder or impossible. For example, is "mu(la+b)" a call to an unknown function, a missing operator or a bracket typo. I know the same scenario is true if the user forgets the prefix but when they don't, we can emit a more useful message. You can see another example in this patch where it's easier to spot and report a missing operator before a function call. (3) A prefix allows the use of symbols that might otherwise be confusing. Tenuous I know but consider "a + !operator+(b+c)". (4) Are there any plans for VAR1(VAR2+VAR3) as short hand for mul(VAR1, VAR2+VAR3)? A downside of the prefix is that we cannot easily use "!" to mean "not". That said "!(var)" support is only a minor modification. I don't know if these are strong reasons to go with a prefix and ultimately either approach works for me, so just let me know the preference and I'll make the necessary changes. paulwalker-arm: For what it's worth I took a run at an implementation that doesn't require a call prefix and…
Expected<Pattern::VariableProperties> ParseVarResult =		Expected<Pattern::VariableProperties> ParseVarResult =
parseVariable(Expr, SM);		parseVariable(Expr, SM);
if (ParseVarResult)		if (ParseVarResult) {
		// Try to parse a function call.
		if (Expr.ltrim(SpaceChars).startswith("(")) {
		if (AO != AllowedOperand::Any)
		return ErrorDiagnostic::get(SM, ParseVarResult->Name,
		"unexpected function call");

		return parseCallExpr(Expr, ParseVarResult->Name, LineNumber, Context,
		SM);
		}

return parseNumericVariableUse(ParseVarResult->Name,		return parseNumericVariableUse(ParseVarResult->Name,
ParseVarResult->IsPseudo, LineNumber,		ParseVarResult->IsPseudo, LineNumber,
Context, SM);		Context, SM);
		}

if (AO == AllowedOperand::LineVar)		if (AO == AllowedOperand::LineVar)
return ParseVarResult.takeError();		return ParseVarResult.takeError();
// Ignore the error and retry parsing as a literal.		// Ignore the error and retry parsing as a literal.
consumeError(ParseVarResult.takeError());		consumeError(ParseVarResult.takeError());
}		}

// Otherwise, parse it as a literal.		// Otherwise, parse it as a literal.
int64_t SignedLiteralValue;		int64_t SignedLiteralValue;
▲ Show 20 Lines • Show All 84 Lines • ▼ Show 20 Lines	Pattern::parseBinop(StringRef Expr, StringRef &RemainingExpr,
if (!RightOpResult)		if (!RightOpResult)
return RightOpResult;		return RightOpResult;

Expr = Expr.drop_back(RemainingExpr.size());		Expr = Expr.drop_back(RemainingExpr.size());
return std::make_unique<BinaryOperation>(Expr, EvalBinop, std::move(LeftOp),		return std::make_unique<BinaryOperation>(Expr, EvalBinop, std::move(LeftOp),
std::move(*RightOpResult));		std::move(*RightOpResult));
}		}

		Expected<std::unique_ptr<ExpressionAST>>
		Pattern::parseCallExpr(StringRef &Expr, StringRef FuncName,
		Optional<size_t> LineNumber,
		FileCheckPatternContext *Context, const SourceMgr &SM) {
		Expr = Expr.ltrim(SpaceChars);
		assert(Expr.startswith("("));

		auto OptFunc = StringSwitch<Optional<binop_eval_t>>(FuncName)
		.Case("add", operator+)
		.Case("max", max)
		.Case("min", min)
		.Case("sub", operator-)
		.Default(None);

		if (!OptFunc)
		return ErrorDiagnostic::get(
		SM, FuncName, Twine("call to undefined function '") + FuncName + "'");
		jhendersonUnsubmitted Not Done Reply Inline Actions I believe you could change this case to an asseertion - the parseExpr function treats a leading '(' as a different kind of expression. jhenderson: I believe you could change this case to an asseertion - the parseExpr function treats a leading…

		Expr.consume_front("(");
		Expr = Expr.ltrim(SpaceChars);

		// Parse call arguments, which are comma separated.
		SmallVector<std::unique_ptr<ExpressionAST>, 4> Args;
		while (!Expr.empty() && !Expr.startswith(")")) {
		if (Expr.startswith(","))
		return ErrorDiagnostic::get(SM, Expr, "missing argument");

		// Parse the argument, which is an arbitary expression.
		StringRef OuterBinOpExpr = Expr;
		arichardsonUnsubmitted Not Done Reply Inline Actions Should this be ltrim? arichardson: Should this be ltrim?
		paulwalker-armAuthorUnsubmitted Done Reply Inline Actions It looks like redundant code as space is already trimmed before loop entry and exit. I'll remove it. paulwalker-arm: It looks like redundant code as space is already trimmed before loop entry and exit. I'll…
		Expected<std::unique_ptr<ExpressionAST>> Arg =
		parseNumericOperand(Expr, AllowedOperand::Any, LineNumber, Context, SM);
		while (Arg && !Expr.empty()) {
		Expr = Expr.ltrim(SpaceChars);
		// Have we reached an argument terminator?
		if (Expr.startswith(",") \|\| Expr.startswith(")"))
		break;

		// Arg = Arg <op> <expr>
		Arg = parseBinop(OuterBinOpExpr, Expr, std::move(*Arg), false, LineNumber,
		Context, SM);
		}

		// Prefer an expression error over a generic invalid argument message.
		if (!Arg)
		return Arg.takeError();
		Args.push_back(std::move(*Arg));

		// Have we parsed all available arguments?
		Expr = Expr.ltrim(SpaceChars);
		thopreUnsubmitted Done Reply Inline Actions Please group the two tests together, together they test whether it's an exit condition. thopre: Please group the two tests together, together they test whether it's an exit condition.
		if (!Expr.consume_front(","))
		break;

		Expr = Expr.ltrim(SpaceChars);
		if (Expr.startswith(")"))
		return ErrorDiagnostic::get(SM, Expr, "missing argument");
		}

		if (!Expr.consume_front(")"))
		return ErrorDiagnostic::get(SM, Expr,
		"missing ')' at end of call expression");

		const unsigned NumArgs = Args.size();
		if (NumArgs == 2)
		return std::make_unique<BinaryOperation>(Expr, *OptFunc, std::move(Args[0]),
		std::move(Args[1]));

		// TODO: Support more than binop_eval_t.
		return ErrorDiagnostic::get(SM, FuncName,
		Twine("function '") + FuncName +
		Twine("' takes 2 arguments but ") +
		Twine(NumArgs) + " given");
		}

Expected<std::unique_ptr<Expression>> Pattern::parseNumericSubstitutionBlock(		Expected<std::unique_ptr<Expression>> Pattern::parseNumericSubstitutionBlock(
StringRef Expr, Optional<NumericVariable *> &DefinedNumericVariable,		StringRef Expr, Optional<NumericVariable *> &DefinedNumericVariable,
bool IsLegacyLineExpr, Optional<size_t> LineNumber,		bool IsLegacyLineExpr, Optional<size_t> LineNumber,
FileCheckPatternContext *Context, const SourceMgr &SM) {		FileCheckPatternContext *Context, const SourceMgr &SM) {
std::unique_ptr<ExpressionAST> ExpressionASTPointer = nullptr;		std::unique_ptr<ExpressionAST> ExpressionASTPointer = nullptr;
StringRef DefExpr = StringRef();		StringRef DefExpr = StringRef();
DefinedNumericVariable = None;		DefinedNumericVariable = None;
ExpressionFormat ExplicitFormat = ExpressionFormat();		ExpressionFormat ExplicitFormat = ExpressionFormat();

// Parse format specifier.		// Parse format specifier (NOTE: ',' is also an argument seperator).
size_t FormatSpecEnd = Expr.find(',');		size_t FormatSpecEnd = Expr.find(',');
if (FormatSpecEnd != StringRef::npos) {		size_t FunctionStart = Expr.find('(');
		if (FormatSpecEnd != StringRef::npos && FormatSpecEnd < FunctionStart) {
Expr = Expr.ltrim(SpaceChars);		Expr = Expr.ltrim(SpaceChars);
if (!Expr.consume_front("%"))		if (!Expr.consume_front("%"))
		jhendersonUnsubmitted Not Done Reply Inline Actions In conjunction with my suggestion above about not having a function specifier, you could change this code to bail out without error in some cases, perhaps by starting with looking for the `%` followed by some specific characters, followed by a `,`. jhenderson: In conjunction with my suggestion above about not having a function specifier, you could change…
return ErrorDiagnostic::get(		return ErrorDiagnostic::get(
SM, Expr, "invalid matching format specification in expression");		SM, Expr, "invalid matching format specification in expression");

// Check for unknown matching format specifier and set matching format in		// Check for unknown matching format specifier and set matching format in
// class instance representing this expression.		// class instance representing this expression.
SMLoc fmtloc = SMLoc::getFromPointer(Expr.data());		SMLoc fmtloc = SMLoc::getFromPointer(Expr.data());
switch (popFront(Expr)) {		switch (popFront(Expr)) {
case 'u':		case 'u':
▲ Show 20 Lines • Show All 1,834 Lines • Show Last 20 Lines

llvm/lib/Support/FileCheckImpl.h

Show First 20 Lines • Show All 146 Lines • ▼ Show 20 Lines
};		};

/// Performs operation and \returns its result or an error in case of failure,		/// Performs operation and \returns its result or an error in case of failure,
/// such as if an overflow occurs.		/// such as if an overflow occurs.
Expected<ExpressionValue> operator+(const ExpressionValue &Lhs,		Expected<ExpressionValue> operator+(const ExpressionValue &Lhs,
const ExpressionValue &Rhs);		const ExpressionValue &Rhs);
Expected<ExpressionValue> operator-(const ExpressionValue &Lhs,		Expected<ExpressionValue> operator-(const ExpressionValue &Lhs,
const ExpressionValue &Rhs);		const ExpressionValue &Rhs);
		Expected<ExpressionValue> max(const ExpressionValue &Lhs,
		const ExpressionValue &Rhs);
		Expected<ExpressionValue> min(const ExpressionValue &Lhs,
		const ExpressionValue &Rhs);

/// Base class representing the AST of a given expression.		/// Base class representing the AST of a given expression.
class ExpressionAST {		class ExpressionAST {
private:		private:
StringRef ExpressionStr;		StringRef ExpressionStr;

public:		public:
ExpressionAST(StringRef ExpressionStr) : ExpressionStr(ExpressionStr) {}		ExpressionAST(StringRef ExpressionStr) : ExpressionStr(ExpressionStr) {}
▲ Show 20 Lines • Show All 553 Lines • ▼ Show 20 Lines	private:
/// None. Parameter \p Context points to the class instance holding the live		/// None. Parameter \p Context points to the class instance holding the live
/// string and numeric variables. \returns the pointer to the class instance		/// string and numeric variables. \returns the pointer to the class instance
/// representing that variable if successful, or an error holding a		/// representing that variable if successful, or an error holding a
/// diagnostic against \p SM otherwise.		/// diagnostic against \p SM otherwise.
static Expected<std::unique_ptr<NumericVariableUse>> parseNumericVariableUse(		static Expected<std::unique_ptr<NumericVariableUse>> parseNumericVariableUse(
StringRef Name, bool IsPseudo, Optional<size_t> LineNumber,		StringRef Name, bool IsPseudo, Optional<size_t> LineNumber,
FileCheckPatternContext *Context, const SourceMgr &SM);		FileCheckPatternContext *Context, const SourceMgr &SM);
enum class AllowedOperand { LineVar, LegacyLiteral, Any };		enum class AllowedOperand { LineVar, LegacyLiteral, Any };
/// Parses \p Expr for use of a numeric operand at line \p LineNumber, or		/// Parses \p Expr for use of a numeric operand at line \p LineNumber, or
/// before input is parsed if \p LineNumber is None. Accepts both literal		/// before input is parsed if \p LineNumber is None. Accepts literal values,
/// values and numeric variables, depending on the value of \p AO. Parameter		/// numeric variables and function calls, depending on the value of \p AO.
/// \p Context points to the class instance holding the live string and		/// Parameter \p Context points to the class instance holding the live string
/// numeric variables. \returns the class representing that operand in the		/// and numeric variables. \returns the class representing that operand in the
/// AST of the expression or an error holding a diagnostic against \p SM		/// AST of the expression or an error holding a diagnostic against \p SM
/// otherwise. If \p Expr starts with a "(" this function will attempt to		/// otherwise. If \p Expr starts with a "(" this function will attempt to
		jhendersonUnsubmitted Not Done Reply Inline Actions I think you need to delete "both" here, since there are now three different things it accepts. jhenderson: I think you need to delete "both" here, since there are now three different things it accepts.
/// parse a parenthesized expression.		/// parse a parenthesized expression.
static Expected<std::unique_ptr<ExpressionAST>>		static Expected<std::unique_ptr<ExpressionAST>>
parseNumericOperand(StringRef &Expr, AllowedOperand AO,		parseNumericOperand(StringRef &Expr, AllowedOperand AO,
Optional<size_t> LineNumber,		Optional<size_t> LineNumber,
FileCheckPatternContext *Context, const SourceMgr &SM);		FileCheckPatternContext *Context, const SourceMgr &SM);
/// Parses and updates \p RemainingExpr for a binary operation at line		/// Parses and updates \p RemainingExpr for a binary operation at line
/// \p LineNumber, or before input is parsed if \p LineNumber is None. The		/// \p LineNumber, or before input is parsed if \p LineNumber is None. The
/// left operand of this binary operation is given in \p LeftOp and \p Expr		/// left operand of this binary operation is given in \p LeftOp and \p Expr
Show All 13 Lines	private:
/// before input is parsed if \p LineNumber is None. \p Expr must start with		/// before input is parsed if \p LineNumber is None. \p Expr must start with
/// a '('. Accepts both literal values and numeric variables. Parameter \p		/// a '('. Accepts both literal values and numeric variables. Parameter \p
/// Context points to the class instance holding the live string and numeric		/// Context points to the class instance holding the live string and numeric
/// variables. \returns the class representing that operand in the AST of the		/// variables. \returns the class representing that operand in the AST of the
/// expression or an error holding a diagnostic against \p SM otherwise.		/// expression or an error holding a diagnostic against \p SM otherwise.
static Expected<std::unique_ptr<ExpressionAST>>		static Expected<std::unique_ptr<ExpressionAST>>
parseParenExpr(StringRef &Expr, Optional<size_t> LineNumber,		parseParenExpr(StringRef &Expr, Optional<size_t> LineNumber,
FileCheckPatternContext *Context, const SourceMgr &SM);		FileCheckPatternContext *Context, const SourceMgr &SM);

		/// Parses \p Expr for an argument list belonging to a call to function \p
		/// FuncName at line \p LineNumber, or before input is parsed if \p LineNumber
		/// is None. Parameter \p FuncLoc is the source location used for diagnostics.
		/// Parameter \p Context points to the class instance holding the live string
		/// and numeric variables. \returns the class representing that call in the
		/// AST of the expression or an error holding a diagnostic against \p SM
		/// otherwise.
		static Expected<std::unique_ptr<ExpressionAST>>
		parseCallExpr(StringRef &Expr, StringRef FuncName,
		Optional<size_t> LineNumber, FileCheckPatternContext *Context,
		const SourceMgr &SM);
};		};

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Check Strings.		// Check Strings.
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

/// A check that we found in the input file.		/// A check that we found in the input file.
struct FileCheckString {		struct FileCheckString {
▲ Show 20 Lines • Show All 44 Lines • Show Last 20 Lines

llvm/test/FileCheck/numeric-expression.txt

	Show First 20 Lines • Show All 49 Lines • ▼ Show 20 Lines
	INVALID-FMT-SPEC-MSG2-NEXT: {{^}} ^{{$}}			INVALID-FMT-SPEC-MSG2-NEXT: {{^}} ^{{$}}

	; Numeric expressions in explicit matching format and default matching rule using			; Numeric expressions in explicit matching format and default matching rule using
	; variables defined on other lines without spaces.			; variables defined on other lines without spaces.
	USE EXPL FMT IMPL MATCH // CHECK-LABEL: USE EXPL FMT IMPL MATCH			USE EXPL FMT IMPL MATCH // CHECK-LABEL: USE EXPL FMT IMPL MATCH
	11 // CHECK-NEXT: {{^}}[[#%u,UNSI]]			11 // CHECK-NEXT: {{^}}[[#%u,UNSI]]
	12 // CHECK-NEXT: {{^}}[[#%u,UNSI+1]]			12 // CHECK-NEXT: {{^}}[[#%u,UNSI+1]]
	10 // CHECK-NEXT: {{^}}[[#%u,UNSI-1]]			10 // CHECK-NEXT: {{^}}[[#%u,UNSI-1]]
				15 // CHECK-NEXT: {{^}}[[#%u,add(UNSI,4)]]
				11 // CHECK-NEXT: {{^}}[[#%u,max(UNSI,7)]]
				99 // CHECK-NEXT: {{^}}[[#%u,max(UNSI,99)]]
				7 // CHECK-NEXT: {{^}}[[#%u,min(UNSI,7)]]
				11 // CHECK-NEXT: {{^}}[[#%u,min(UNSI,99)]]
				8 // CHECK-NEXT: {{^}}[[#%u,sub(UNSI,3)]]
	c // CHECK-NEXT: {{^}}[[#%x,LHEX]]			c // CHECK-NEXT: {{^}}[[#%x,LHEX]]
	d // CHECK-NEXT: {{^}}[[#%x,LHEX+1]]			d // CHECK-NEXT: {{^}}[[#%x,LHEX+1]]
	b // CHECK-NEXT: {{^}}[[#%x,LHEX-1]]			b // CHECK-NEXT: {{^}}[[#%x,LHEX-1]]
	1a // CHECK-NEXT: {{^}}[[#%x,LHEX+0xe]]			1a // CHECK-NEXT: {{^}}[[#%x,LHEX+0xe]]
	1a // CHECK-NEXT: {{^}}[[#%x,LHEX+0xE]]			1a // CHECK-NEXT: {{^}}[[#%x,LHEX+0xE]]
				e // CHECK-NEXT: {{^}}[[#%x,add(LHEX,2)]]
				ff // CHECK-NEXT: {{^}}[[#%x,max(LHEX,0xff)]]
				a // CHECK-NEXT: {{^}}[[#%x,min(LHEX,0xa)]]
				a // CHECK-NEXT: {{^}}[[#%x,sub(LHEX,2)]]
	D // CHECK-NEXT: {{^}}[[#%X,UHEX]]			D // CHECK-NEXT: {{^}}[[#%X,UHEX]]
	E // CHECK-NEXT: {{^}}[[#%X,UHEX+1]]			E // CHECK-NEXT: {{^}}[[#%X,UHEX+1]]
	C // CHECK-NEXT: {{^}}[[#%X,UHEX-1]]			C // CHECK-NEXT: {{^}}[[#%X,UHEX-1]]
	1B // CHECK-NEXT: {{^}}[[#%X,UHEX+0xe]]			1B // CHECK-NEXT: {{^}}[[#%X,UHEX+0xe]]
	1B // CHECK-NEXT: {{^}}[[#%X,UHEX+0xE]]			1B // CHECK-NEXT: {{^}}[[#%X,UHEX+0xE]]
				F // CHECK-NEXT: {{^}}[[#%X,add(UHEX,2)]]
				FF // CHECK-NEXT: {{^}}[[#%X,max(UHEX,0xff)]]
				A // CHECK-NEXT: {{^}}[[#%X,min(UHEX,0xa)]]
				B // CHECK-NEXT: {{^}}[[#%X,sub(UHEX,2)]]
	-30 // CHECK-NEXT: {{^}}[[#%d,SIGN]]			-30 // CHECK-NEXT: {{^}}[[#%d,SIGN]]
	-29 // CHECK-NEXT: {{^}}[[#%d,SIGN+1]]			-29 // CHECK-NEXT: {{^}}[[#%d,SIGN+1]]
	-31 // CHECK-NEXT: {{^}}[[#%d,SIGN-1]]			-31 // CHECK-NEXT: {{^}}[[#%d,SIGN-1]]
	42 // CHECK-NEXT: {{^}}[[#%d,SIGN+72]]			42 // CHECK-NEXT: {{^}}[[#%d,SIGN+72]]
				-29 // CHECK-NEXT: {{^}}[[#%d,add(SIGN,1)]]
				-17 // CHECK-NEXT: {{^}}[[#%d,max(SIGN,-17)]]
				-30 // CHECK-NEXT: {{^}}[[#%d,min(SIGN,-17)]]
				-31 // CHECK-NEXT: {{^}}[[#%d,sub(SIGN,1)]]
	11 // CHECK-NEXT: {{^}}[[#%u,UNSIa]]			11 // CHECK-NEXT: {{^}}[[#%u,UNSIa]]
	11 // CHECK-NEXT: {{^}}[[#%u,UNSIb]]			11 // CHECK-NEXT: {{^}}[[#%u,UNSIb]]
	11 // CHECK-NEXT: {{^}}[[#%u,UNSIc]]			11 // CHECK-NEXT: {{^}}[[#%u,UNSIc]]
	c // CHECK-NEXT: {{^}}[[#%x,LHEXa]]			c // CHECK-NEXT: {{^}}[[#%x,LHEXa]]

	; Numeric expressions in explicit matching format and default matching rule using			; Numeric expressions in explicit matching format and default matching rule using
	; variables defined on other lines with different spacing.			; variables defined on other lines with different spacing.
	USE EXPL FMT IMPL MATCH SPC // CHECK-LABEL: USE EXPL FMT IMPL MATCH SPC			USE EXPL FMT IMPL MATCH SPC // CHECK-LABEL: USE EXPL FMT IMPL MATCH SPC
	11 // CHECK-NEXT: {{^}}[[#%u, UNSI]]			11 // CHECK-NEXT: {{^}}[[#%u, UNSI]]
	11 // CHECK-NEXT: {{^}}[[# %u, UNSI]]			11 // CHECK-NEXT: {{^}}[[# %u, UNSI]]
	11 // CHECK-NEXT: {{^}}[[# %u, UNSI ]]			11 // CHECK-NEXT: {{^}}[[# %u, UNSI ]]
	12 // CHECK-NEXT: {{^}}[[#%u, UNSI+1]]			12 // CHECK-NEXT: {{^}}[[#%u, UNSI+1]]
	12 // CHECK-NEXT: {{^}}[[# %u, UNSI+1]]			12 // CHECK-NEXT: {{^}}[[# %u, UNSI+1]]
	12 // CHECK-NEXT: {{^}}[[# %u , UNSI+1]]			12 // CHECK-NEXT: {{^}}[[# %u , UNSI+1]]
	12 // CHECK-NEXT: {{^}}[[# %u , UNSI +1]]			12 // CHECK-NEXT: {{^}}[[# %u , UNSI +1]]
	12 // CHECK-NEXT: {{^}}[[# %u , UNSI + 1]]			12 // CHECK-NEXT: {{^}}[[# %u , UNSI + 1]]
	12 // CHECK-NEXT: {{^}}[[# %u , UNSI + 1 ]]			12 // CHECK-NEXT: {{^}}[[# %u , UNSI + 1 ]]
	10 // CHECK-NEXT: {{^}}[[#%u, UNSI-1]]			10 // CHECK-NEXT: {{^}}[[#%u, UNSI-1]]
	10 // CHECK-NEXT: {{^}}[[# %u, UNSI-1]]			10 // CHECK-NEXT: {{^}}[[# %u, UNSI-1]]
	10 // CHECK-NEXT: {{^}}[[# %u , UNSI-1]]			10 // CHECK-NEXT: {{^}}[[# %u , UNSI-1]]
	10 // CHECK-NEXT: {{^}}[[# %u , UNSI -1]]			10 // CHECK-NEXT: {{^}}[[# %u , UNSI -1]]
	10 // CHECK-NEXT: {{^}}[[# %u , UNSI - 1]]			10 // CHECK-NEXT: {{^}}[[# %u , UNSI - 1]]
	10 // CHECK-NEXT: {{^}}[[# %u , UNSI - 1 ]]			10 // CHECK-NEXT: {{^}}[[# %u , UNSI - 1 ]]
				13 // CHECK-NEXT: {{^}}[[#%u, add(UNSI,2)]]
				13 // CHECK-NEXT: {{^}}[[# %u, add(UNSI,2)]]
				13 // CHECK-NEXT: {{^}}[[# %u , add(UNSI,2)]]
				13 // CHECK-NEXT: {{^}}[[# %u , add(UNSI, 2)]]
				13 // CHECK-NEXT: {{^}}[[# %u , add( UNSI, 2)]]
				13 // CHECK-NEXT: {{^}}[[# %u , add( UNSI,2)]]
				13 // CHECK-NEXT: {{^}}[[# %u , add(UNSI,2) ]]
				13 // CHECK-NEXT: {{^}}[[# %u , add (UNSI,2)]]
				jhendersonUnsubmitted Not Done Reply Inline Actions Perhaps change the inner `UNSI` to `UNSI+1` or something to show that the argument of a function is any kind of expression? Up to you. jhenderson: Perhaps change the inner `UNSI` to `UNSI+1` or something to show that the argument of a…
				104 // CHECK-NEXT: {{^}}[[# %u , UNSI + sub( add (100 , UNSI+ 1 ), 20) +1 ]]

	; Numeric expressions in implicit matching format and default matching rule using			; Numeric expressions in implicit matching format and default matching rule using
	; variables defined on other lines.			; variables defined on other lines.
	USE IMPL FMT IMPL MATCH // CHECK-LABEL: USE IMPL FMT IMPL MATCH			USE IMPL FMT IMPL MATCH // CHECK-LABEL: USE IMPL FMT IMPL MATCH
	11 // CHECK-NEXT: {{^}}[[#UNSI]]			11 // CHECK-NEXT: {{^}}[[#UNSI]]
	12 // CHECK-NEXT: {{^}}[[#UNSI+1]]			12 // CHECK-NEXT: {{^}}[[#UNSI+1]]
	10 // CHECK-NEXT: {{^}}[[#UNSI-1]]			10 // CHECK-NEXT: {{^}}[[#UNSI-1]]
				99 // CHECK-NEXT: {{^}}[[#max(UNSI,99)]]
				7 // CHECK-NEXT: {{^}}[[#min(UNSI,7)]]
	c // CHECK-NEXT: {{^}}[[#LHEX]]			c // CHECK-NEXT: {{^}}[[#LHEX]]
	d // CHECK-NEXT: {{^}}[[#LHEX+1]]			d // CHECK-NEXT: {{^}}[[#LHEX+1]]
	b // CHECK-NEXT: {{^}}[[#LHEX-1]]			b // CHECK-NEXT: {{^}}[[#LHEX-1]]
	1a // CHECK-NEXT: {{^}}[[#LHEX+0xe]]			1a // CHECK-NEXT: {{^}}[[#LHEX+0xe]]
	1a // CHECK-NEXT: {{^}}[[#LHEX+0xE]]			1a // CHECK-NEXT: {{^}}[[#LHEX+0xE]]
				ff // CHECK-NEXT: {{^}}[[#max(LHEX,255)]]
				a // CHECK-NEXT: {{^}}[[#min(LHEX,10)]]
	D // CHECK-NEXT: {{^}}[[#UHEX]]			D // CHECK-NEXT: {{^}}[[#UHEX]]
	E // CHECK-NEXT: {{^}}[[#UHEX+1]]			E // CHECK-NEXT: {{^}}[[#UHEX+1]]
	C // CHECK-NEXT: {{^}}[[#UHEX-1]]			C // CHECK-NEXT: {{^}}[[#UHEX-1]]
	1B // CHECK-NEXT: {{^}}[[#UHEX+0xe]]			1B // CHECK-NEXT: {{^}}[[#UHEX+0xe]]
	1B // CHECK-NEXT: {{^}}[[#UHEX+0xE]]			1B // CHECK-NEXT: {{^}}[[#UHEX+0xE]]
				FF // CHECK-NEXT: {{^}}[[#max(UHEX,255)]]
				A // CHECK-NEXT: {{^}}[[#min(UHEX,10)]]
	-30 // CHECK-NEXT: {{^}}[[#SIGN]]			-30 // CHECK-NEXT: {{^}}[[#SIGN]]
	-29 // CHECK-NEXT: {{^}}[[#SIGN+1]]			-29 // CHECK-NEXT: {{^}}[[#SIGN+1]]
	-31 // CHECK-NEXT: {{^}}[[#SIGN-1]]			-31 // CHECK-NEXT: {{^}}[[#SIGN-1]]

	; Numeric expressions using variables defined on other lines and an immediate			; Numeric expressions using variables defined on other lines and an immediate
	; interpreted as an unsigned value.			; interpreted as an unsigned value.
	; Note: 9223372036854775819 = 0x8000000000000000 + 11			; Note: 9223372036854775819 = 0x8000000000000000 + 11
	USE IMPL FMT IMPL MATCH UNSIGNED IMM			USE IMPL FMT IMPL MATCH UNSIGNED IMM
	▲ Show 20 Lines • Show All 224 Lines • ▼ Show 20 Lines
	REDEF-NEW-FMT-MSG-NEXT: {{R}}EDEF-NEW-FMT-NEXT: {{\[\[#%X,UNSI:\]\]}}			REDEF-NEW-FMT-MSG-NEXT: {{R}}EDEF-NEW-FMT-NEXT: {{\[\[#%X,UNSI:\]\]}}
	REDEF-NEW-FMT-MSG-NEXT: {{^}} ^{{$}}			REDEF-NEW-FMT-MSG-NEXT: {{^}} ^{{$}}

	; Numeric expression with overflow.			; Numeric expression with overflow.
	RUN: not FileCheck --check-prefix OVERFLOW --input-file %s %s 2>&1 \			RUN: not FileCheck --check-prefix OVERFLOW --input-file %s %s 2>&1 \
	RUN: \| FileCheck --check-prefix OVERFLOW-MSG --strict-whitespace %s			RUN: \| FileCheck --check-prefix OVERFLOW-MSG --strict-whitespace %s

	OVERFLOW			OVERFLOW
	BIGVAR=10000000000000000			BIGVAR=10000000000000000
				jhendersonUnsubmitted Not Done Reply Inline Actions I would prefer these to be interleaved with their corresponding CHECK and input text: RUN: ... --check-prefix CALL-MISSING-OPENING-BRACKET ... CALL MISSING OPENING BRACKET 30 CALL-MISSING-OPENING-BRACKET-LABEL: ... ... RUN: ... --check-prefix CALL-MISSING-CLOSING-BRACKET ... CALL MISSING CLOSING BRACKET etc. It helps reduce the distance I have to look to find the thing being checked for. jhenderson: I would prefer these to be interleaved with their corresponding CHECK and input text: ``` RUN…
	OVERFLOW-LABEL: OVERFLOW			OVERFLOW-LABEL: OVERFLOW
	OVERFLOW-NEXT: BIGVAR: [[#BIGVAR:0x8000000000000000+0x8000000000000000]]			OVERFLOW-NEXT: BIGVAR: [[#BIGVAR:0x8000000000000000+0x8000000000000000]]
	OVERFLOW-MSG: numeric-expression.txt:[[#@LINE-1]]:27: error: unable to substitute variable or numeric expression			OVERFLOW-MSG: numeric-expression.txt:[[#@LINE-1]]:27: error: unable to substitute variable or numeric expression
	OVERFLOW-MSG-NEXT: {{O}}VERFLOW-NEXT: BIGVAR: {{\[\[#BIGVAR:0x8000000000000000\+0x8000000000000000\]\]}}			OVERFLOW-MSG-NEXT: {{O}}VERFLOW-NEXT: BIGVAR: {{\[\[#BIGVAR:0x8000000000000000\+0x8000000000000000\]\]}}
	OVERFLOW-MSG-NEXT: {{^}} ^{{$}}			OVERFLOW-MSG-NEXT: {{^}} ^{{$}}

	; Numeric expression with underflow.			; Numeric expression with underflow.
	RUN: not FileCheck --check-prefix UNDERFLOW --input-file %s %s 2>&1 \			RUN: not FileCheck --check-prefix UNDERFLOW --input-file %s %s 2>&1 \
	RUN: \| FileCheck --check-prefix UNDERFLOW-MSG --strict-whitespace %s			RUN: \| FileCheck --check-prefix UNDERFLOW-MSG --strict-whitespace %s

	UNDERFLOW			UNDERFLOW
	TINYVAR=-10000000000000000			TINYVAR=-10000000000000000
	UNDERFLOW-LABEL: UNDERFLOW			UNDERFLOW-LABEL: UNDERFLOW
	UNDERFLOW-NEXT: TINYVAR: [[#%d,TINYVAR:-0x8000000000000000-0x8000000000000000]]			UNDERFLOW-NEXT: TINYVAR: [[#%d,TINYVAR:-0x8000000000000000-0x8000000000000000]]
	UNDERFLOW-MSG: numeric-expression.txt:[[#@LINE-1]]:29: error: unable to substitute variable or numeric expression			UNDERFLOW-MSG: numeric-expression.txt:[[#@LINE-1]]:29: error: unable to substitute variable or numeric expression
	UNDERFLOW-MSG-NEXT: {{U}}NDERFLOW-NEXT: TINYVAR: {{\[\[#%d,TINYVAR:-0x8000000000000000-0x8000000000000000\]\]}}			UNDERFLOW-MSG-NEXT: {{U}}NDERFLOW-NEXT: TINYVAR: {{\[\[#%d,TINYVAR:-0x8000000000000000-0x8000000000000000\]\]}}
	UNDERFLOW-MSG-NEXT: {{^}} ^{{$}}			UNDERFLOW-MSG-NEXT: {{^}} ^{{$}}

				RUN: %ProtectFileCheckOutput \
				RUN: not FileCheck -D#NUMVAR=10 --check-prefix CALL-MISSING-CLOSING-BRACKET --input-file %s %s 2>&1 \
				RUN: \| FileCheck --strict-whitespace --check-prefix CALL-MISSING-CLOSING-BRACKET-MSG %s

				CALL MISSING CLOSING BRACKET
				30
				CALL-MISSING-CLOSING-BRACKET-LABEL: CALL MISSING CLOSING BRACKET
				jhendersonUnsubmitted Not Done Reply Inline Actions There might want to be some interaction testing with plain parentheses. Something like `[[#!mul(NUMVAR,(NUMVAR+3))]]` and `[[#!mul(NUMVAR,(NUMVAR+3)]]` (the first should work, but not the second). jhenderson: There might want to be some interaction testing with plain parentheses. Something like `[[#!mul…
				CALL-MISSING-CLOSING-BRACKET-NEXT: [[#add(NUMVAR,3]]
				CALL-MISSING-CLOSING-BRACKET-MSG: numeric-expression.txt:[[#@LINE-1]]:51: error: missing ')' at end of call expression
				CALL-MISSING-CLOSING-BRACKET-MSG-NEXT: {{C}}ALL-MISSING-CLOSING-BRACKET-NEXT: {{\[\[#add\(NUMVAR,3\]\]}}
				CALL-MISSING-CLOSING-BRACKET-MSG-NEXT: {{^}} ^{{$}}

				RUN: %ProtectFileCheckOutput \
				RUN: not FileCheck -D#NUMVAR=10 --check-prefix CALL-MISSING-ARGUMENT --input-file %s %s 2>&1 \
				RUN: \| FileCheck --strict-whitespace --check-prefix CALL-MISSING-ARGUMENT-MSG %s
				jhendersonUnsubmitted Not Done Reply Inline Actions Nit: it would probably be best to make this call take two arguments. jhenderson: Nit: it would probably be best to make this call take two arguments.

				CALL MISSING ARGUMENT
				30
				CALL-MISSING-ARGUMENT-LABEL: CALL MISSING ARGUMENT
				CALL-MISSING-ARGUMENT-NEXT: [[#add(NUMVAR,)]]
				CALL-MISSING-ARGUMENT-MSG: numeric-expression.txt:[[#@LINE-1]]:43: error: missing argument
				CALL-MISSING-ARGUMENT-MSG-NEXT: {{C}}ALL-MISSING-ARGUMENT-NEXT: {{\[\[#add$NUMVAR,$\]\]}}
				CALL-MISSING-ARGUMENT-MSG-NEXT: {{^}} ^{{$}}
				jhendersonUnsubmitted Not Done Reply Inline Actions I think you also want the following: `[[#!mul(,NUMVAR)]]` Possibly also `[[#!mul(NUMVAR,,NUMVAR)]]` jhenderson: I think you also want the following: `[[#!mul(,NUMVAR)]]` Possibly also `[[#!mul(NUMVAR…

				RUN: %ProtectFileCheckOutput \
				RUN: not FileCheck -D#NUMVAR=10 --check-prefix CALL-WRONG-ARGUMENT-COUNT --input-file %s %s 2>&1 \
				RUN: \| FileCheck --strict-whitespace --check-prefix CALL-WRONG-ARGUMENT-COUNT-MSG %s

				CALL WRONG ARGUMENT COUNT
				30
				CALL-WRONG-ARGUMENT-COUNT-LABEL: CALL WRONG ARGUMENT COUNT
				CALL-WRONG-ARGUMENT-COUNT-NEXT: [[#add(NUMVAR)]]
				CALL-WRONG-ARGUMENT-COUNT-MSG: numeric-expression.txt:[[#@LINE-1]]:36: error: function 'add' takes 2 arguments but 1 given
				CALL-WRONG-ARGUMENT-COUNT-MSG-NEXT: {{C}}ALL-WRONG-ARGUMENT-COUNT-NEXT: {{\[\[#add$NUMVAR$\]\]}}
				CALL-WRONG-ARGUMENT-COUNT-MSG-NEXT: {{^}} ^{{$}}

				RUN: %ProtectFileCheckOutput \
				RUN: not FileCheck -D#NUMVAR=10 --check-prefix CALL-UNDEFINED-FUNCTION --input-file %s %s 2>&1 \
				RUN: \| FileCheck --strict-whitespace --check-prefix CALL-UNDEFINED-FUNCTION-MSG %s
				jhendersonUnsubmitted Not Done Reply Inline Actions Nit: it would probably be best to make this call take two arguments. jhenderson: Nit: it would probably be best to make this call take two arguments.

				CALL UNDEFINED FUNCTION
				30
				CALL-UNDEFINED-FUNCTION-LABEL: CALL UNDEFINED FUNCTION
				CALL-UNDEFINED-FUNCTION-NEXT: [[#bogus_function(NUMVAR)]]
				CALL-UNDEFINED-FUNCTION-MSG: numeric-expression.txt:[[#@LINE-1]]:34: error: call to undefined function 'bogus_function'
				CALL-UNDEFINED-FUNCTION-MSG-NEXT: {{C}}ALL-UNDEFINED-FUNCTION-NEXT: {{\[\[#bogus_function$NUMVAR$\]\]}}
				CALL-UNDEFINED-FUNCTION-MSG-NEXT: {{^}} ^{{$}}

llvm/unittests/Support/FileCheckTest.cpp

Show First 20 Lines • Show All 806 Lines • ▼ Show 20 Lines	private:
size_t LineNumber = 1;		size_t LineNumber = 1;
SourceMgr SM;		SourceMgr SM;
FileCheckRequest Req;		FileCheckRequest Req;
FileCheckPatternContext Context;		FileCheckPatternContext Context;
Pattern P{Check::CheckPlain, &Context, LineNumber};		Pattern P{Check::CheckPlain, &Context, LineNumber};

public:		public:
PatternTester() {		PatternTester() {
std::vector<StringRef> GlobalDefines = {"#FOO=42", "BAR=BAZ"};		std::vector<StringRef> GlobalDefines = {"#FOO=42", "BAR=BAZ", "#add=7"};
// An ASSERT_FALSE would make more sense but cannot be used in a		// An ASSERT_FALSE would make more sense but cannot be used in a
// constructor.		// constructor.
EXPECT_THAT_ERROR(Context.defineCmdlineVariables(GlobalDefines, SM),		EXPECT_THAT_ERROR(Context.defineCmdlineVariables(GlobalDefines, SM),
Succeeded());		Succeeded());
Context.createLineVariable();		Context.createLineVariable();
// Call parsePattern to have @LINE defined.		// Call parsePattern to have @LINE defined.
P.parsePattern("N/A", "CHECK", SM, Req);		P.parsePattern("N/A", "CHECK", SM, Req);
// parsePattern does not expect to be called twice for the same line and		// parsePattern does not expect to be called twice for the same line and
▲ Show 20 Lines • Show All 227 Lines • ▼ Show 20 Lines	TEST_F(FileCheckTest, ParseNumericSubstitutionBlock) {
expectDiagnosticError("unsupported operation ')'",		expectDiagnosticError("unsupported operation ')'",
Tester.parseSubst("1)").takeError());		Tester.parseSubst("1)").takeError());
expectDiagnosticError("unsupported operation ')'",		expectDiagnosticError("unsupported operation ')'",
Tester.parseSubst("(1+2))").takeError());		Tester.parseSubst("(1+2))").takeError());
expectDiagnosticError("unsupported operation ')'",		expectDiagnosticError("unsupported operation ')'",
Tester.parseSubst("(2))").takeError());		Tester.parseSubst("(2))").takeError());
expectDiagnosticError("unsupported operation ')'",		expectDiagnosticError("unsupported operation ')'",
Tester.parseSubst("(1))(").takeError());		Tester.parseSubst("(1))(").takeError());

		// Valid expression with function call.
		EXPECT_THAT_EXPECTED(Tester.parseSubst("add(FOO,3)"), Succeeded());
		EXPECT_THAT_EXPECTED(Tester.parseSubst("add (FOO,3)"), Succeeded());
		// Valid expression with nested function call.
		EXPECT_THAT_EXPECTED(Tester.parseSubst("add(FOO, min(BAR,10))"), Succeeded());
		// Valid expression with function call taking expression as argument.
		EXPECT_THAT_EXPECTED(Tester.parseSubst("add(FOO, (BAR+10) + 3)"),
		Succeeded());
		EXPECT_THAT_EXPECTED(Tester.parseSubst("add(FOO, min (BAR,10) + 3)"),
		Succeeded());
		// Valid expression with variable named the same as a function.
		EXPECT_THAT_EXPECTED(Tester.parseSubst("add"), Succeeded());
		EXPECT_THAT_EXPECTED(Tester.parseSubst("add+FOO"), Succeeded());
		EXPECT_THAT_EXPECTED(Tester.parseSubst("FOO+add"), Succeeded());
		EXPECT_THAT_EXPECTED(Tester.parseSubst("add(add,add)+add"), Succeeded());

		// Malformed call syntax.
		arichardsonUnsubmitted Not Done Reply Inline Actions Might make sense to add case with missing operators such as `2!mul(FOO,2)` or `FOO !mul(FOO,2)` or `!mul(FOO(!mul(3,2)))` arichardson: Might make sense to add case with missing operators such as `2!mul(FOO,2)` or `FOO !mul(FOO,2)`…
		jhendersonUnsubmitted Not Done Reply Inline Actions +1 jhenderson: +1
		expectDiagnosticError("missing ')' at end of call expression",
		Tester.parseSubst("add(FOO,(BAR+7)").takeError());
		expectDiagnosticError("missing ')' at end of call expression",
		Tester.parseSubst("add(FOO,min(BAR,7)").takeError());
		expectDiagnosticError("missing argument",
		Tester.parseSubst("add(FOO,)").takeError());
		expectDiagnosticError("missing argument",
		Tester.parseSubst("add(,FOO)").takeError());
		expectDiagnosticError("missing argument",
		Tester.parseSubst("add(FOO,,3)").takeError());

		// Valid call, but to an unknown function.
		expectDiagnosticError("call to undefined function 'bogus_function'",
		Tester.parseSubst("bogus_function(FOO,3)").takeError());
		expectDiagnosticError("call to undefined function '@add'",
		Tester.parseSubst("@add(2,3)").takeError());
		expectDiagnosticError("call to undefined function '$add'",
		Tester.parseSubst("$add(2,3)").takeError());
		expectDiagnosticError("call to undefined function 'FOO'",
		Tester.parseSubst("FOO(2,3)").takeError());
		expectDiagnosticError("call to undefined function 'FOO'",
		Tester.parseSubst("FOO (2,3)").takeError());

		// Valid call, but with incorrect argument count.
		expectDiagnosticError("function 'add' takes 2 arguments but 1 given",
		Tester.parseSubst("add(FOO)").takeError());
		expectDiagnosticError("function 'add' takes 2 arguments but 3 given",
		Tester.parseSubst("add(FOO,3,4)").takeError());

		// Valid call, but not part of a valid expression.
		expectDiagnosticError("unsupported operation 'a'",
		Tester.parseSubst("2add(FOO,2)").takeError());
		expectDiagnosticError("unsupported operation 'a'",
		Tester.parseSubst("FOO add(FOO,2)").takeError());
		expectDiagnosticError("unsupported operation 'a'",
		Tester.parseSubst("add(FOO,2)add(FOO,2)").takeError());
}		}

TEST_F(FileCheckTest, ParsePattern) {		TEST_F(FileCheckTest, ParsePattern) {
PatternTester Tester;		PatternTester Tester;

// Invalid space in string substitution.		// Invalid space in string substitution.
EXPECT_TRUE(Tester.parsePattern("[[ BAR]]"));		EXPECT_TRUE(Tester.parsePattern("[[ BAR]]"));

▲ Show 20 Lines • Show All 157 Lines • ▼ Show 20 Lines	TEST_F(FileCheckTest, MatchParen) {
Tester.initNextPattern();		Tester.initNextPattern();
ASSERT_FALSE(Tester.parsePattern("[[#(NUMVAR)]]"));		ASSERT_FALSE(Tester.parsePattern("[[#(NUMVAR)]]"));
EXPECT_THAT_EXPECTED(Tester.match("18"), Succeeded());		EXPECT_THAT_EXPECTED(Tester.match("18"), Succeeded());
Tester.initNextPattern();		Tester.initNextPattern();
ASSERT_FALSE(Tester.parsePattern("[[#(NUMVAR+2)]]"));		ASSERT_FALSE(Tester.parsePattern("[[#(NUMVAR+2)]]"));
EXPECT_THAT_EXPECTED(Tester.match("20"), Succeeded());		EXPECT_THAT_EXPECTED(Tester.match("20"), Succeeded());
}		}

		TEST_F(FileCheckTest, MatchBuiltinFunctions) {
		PatternTester Tester;
		// Esnure #NUMVAR has the expected value.
		Tester.initNextPattern();
		ASSERT_FALSE(Tester.parsePattern("[[#NUMVAR:]]"));
		expectNotFoundError(Tester.match("FAIL").takeError());
		expectNotFoundError(Tester.match("").takeError());
		EXPECT_THAT_EXPECTED(Tester.match("18"), Succeeded());

		// Check each builtin function generates the expected result.
		Tester.initNextPattern();
		ASSERT_FALSE(Tester.parsePattern("[[#add(NUMVAR,13)]]"));
		EXPECT_THAT_EXPECTED(Tester.match("31"), Succeeded());
		Tester.initNextPattern();
		ASSERT_FALSE(Tester.parsePattern("[[#sub(NUMVAR,7)]]"));
		EXPECT_THAT_EXPECTED(Tester.match("11"), Succeeded());
		Tester.initNextPattern();
		ASSERT_FALSE(Tester.parsePattern("[[#max(NUMVAR,5)]]"));
		EXPECT_THAT_EXPECTED(Tester.match("18"), Succeeded());
		Tester.initNextPattern();
		ASSERT_FALSE(Tester.parsePattern("[[#max(NUMVAR,99)]]"));
		EXPECT_THAT_EXPECTED(Tester.match("99"), Succeeded());
		Tester.initNextPattern();
		ASSERT_FALSE(Tester.parsePattern("[[#min(NUMVAR,5)]]"));
		EXPECT_THAT_EXPECTED(Tester.match("5"), Succeeded());
		Tester.initNextPattern();
		ASSERT_FALSE(Tester.parsePattern("[[#min(NUMVAR,99)]]"));
		EXPECT_THAT_EXPECTED(Tester.match("18"), Succeeded());

		// Check nested function calls.
		Tester.initNextPattern();
		ASSERT_FALSE(Tester.parsePattern("[[#add(min(7,2),max(4,10))]]"));
		EXPECT_THAT_EXPECTED(Tester.match("12"), Succeeded());

		// Check function call that uses a variable of the same name.
		Tester.initNextPattern();
		ASSERT_FALSE(Tester.parsePattern("[[#add(add,add)+min (add,3)+add]]"));
		EXPECT_THAT_EXPECTED(Tester.match("24"), Succeeded());
		}

TEST_F(FileCheckTest, Substitution) {		TEST_F(FileCheckTest, Substitution) {
SourceMgr SM;		SourceMgr SM;
FileCheckPatternContext Context;		FileCheckPatternContext Context;
EXPECT_THAT_ERROR(Context.defineCmdlineVariables({"FOO=BAR"}, SM),		EXPECT_THAT_ERROR(Context.defineCmdlineVariables({"FOO=BAR"}, SM),
Succeeded());		Succeeded());

// Substitution of an undefined string variable fails and error holds that		// Substitution of an undefined string variable fails and error holds that
// variable's name.		// variable's name.
▲ Show 20 Lines • Show All 200 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[FileCheck] Add function call support to numerical expressions.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 269776

llvm/docs/CommandGuide/FileCheck.rst

llvm/lib/Support/FileCheck.cpp

llvm/lib/Support/FileCheckImpl.h

llvm/test/FileCheck/numeric-expression.txt

llvm/unittests/Support/FileCheckTest.cpp

[FileCheck] Add function call support to numerical expressions.
ClosedPublic