This is an archive of the discontinued LLVM Phabricator instance.

[FileCheck] Add check modifier capability
AbandonedPublic

Authored by jpienaar on Nov 18 2020, 9:22 PM.

Details

Reviewers
jhenderson
thopre
Summary

Introduce CHECK modifiers that change the behavior of the CHECK directive. Also add a LITERAL modifier for cases where matching could end requiring escaping strings interpreted as regex where only literal/fixed string matching is desired (where LITERAL would make the CHECK's easier to write, less fragile/difficult to interpret).

Diff Detail

Event Timeline

jpienaar created this revision.Nov 18 2020, 9:22 PM
Herald added a project: Restricted Project. · View Herald TranscriptNov 18 2020, 9:22 PM
jpienaar requested review of this revision.Nov 18 2020, 9:22 PM

You should add examples for the other directives as well (CHECK-LITERAL-DAG, CHECK-LITERAL-NOT, CHECK-LITERAL-COUNT)

llvm/docs/CommandGuide/FileCheck.rst
663–669

Proposed rewording. If not convinced by it you should at least fix the syntax for mentioning a directive and the typo on "escaping".

llvm/include/llvm/FileCheck/FileCheck.h
79

I know that's what the above line did but I'd simply phrase it as per suggestion.

llvm/lib/FileCheck/FileCheck.cpp
1597–1600

Given how simple is this I would put it in the class declaration.

On a more general note, I wonder if having new directives is the right way to go. Why not introduce a concept of directive modifier? It could allow multiple modifier to cumulate. We could even introduce modifier in the middle of a directive at some point (though obviously not for a directive that was marked literal).

How about: CHECK{LITERAL}? Or perhaps CHECK{\L}. Other directives would follow naturally and CHECK-EMPTY{\L} would be a nop.

On a more general note, I wonder if having new directives is the right way to go. Why not introduce a concept of directive modifier? It could allow multiple modifier to cumulate. We could even introduce modifier in the middle of a directive at some point (though obviously not for a directive that was marked literal).

How about: CHECK{LITERAL}? Or perhaps CHECK{\L}. Other directives would follow naturally and CHECK-EMPTY{\L} would be a nop.

Good point, indirectly this is what this is doing, LITERAL is more a modifier than a directive (and yes CHECK-EMPTY is a nop - I was in between asserting that CHECK-LITERAL-EMPTY is not used, or just allowing it as it means the same). This pushes the modifier parsing to the end. Let me give that a go.

jpienaar updated this revision to Diff 306546.Nov 19 2020, 3:19 PM

Swap it to be a directive & incorporate doc suggestions.

I have to say, I'm not super enthused about this feature. Typically we've added directives when there's no other way to achieve the desired effect; the LITERAL feature doesn't meet that bar. It's a way to simplify something that you could do already. Not to say I'm going to insist on killing it, but I would like to hear more justification.

Does this excessive-escaping situation come up very often in practice? I saw a relatively simple case recently, {{{.*}}} which is, a pair of braces with anything in between. (Which FileCheck doesn't parse in the intuitive way, but it does work as intended.) But that's about the most complicated thing I've come across lately.

I have to say, I'm not super enthused about this feature. Typically we've added directives when there's no other way to achieve the desired effect

SAME could be achieved without needing SAME, COUNT too, and NEXT could be folded in too by extending to multi-line regex and verifying no newlines so too DAG. So I believe there is a goal with directives to make checks readable for users & simple to interpret rather than just possible. Escaping is possible but obscures what is being tested and easy to get wrong.

Does this excessive-escaping situation come up very often in practice?

Very often indeed in MLIR as dense constants use [] pairs for values, so if you have a multi-dimensional tensor value you end up with a lot of escaping.

I have to say, I'm not super enthused about this feature. Typically we've added directives when there's no other way to achieve the desired effect

SAME could be achieved without needing SAME, COUNT too, and NEXT could be folded in too by extending to multi-line regex and verifying no newlines so too DAG. So I believe there is a goal with directives to make checks readable for users & simple to interpret rather than just possible. Escaping is possible but obscures what is being tested and easy to get wrong.

Fair. I'd forgotten about COUNT and that multi-line regexes are possible. (Although I will point out that SAME does not always do the same thing as pasting the directives together with .* separators; see my "FileCheck Follies" lightning talk from the Fall 2016 Dev Meeting.)

"Easy to get wrong" is a worthwhile point too.

Does this excessive-escaping situation come up very often in practice?

Very often indeed in MLIR as dense constants use [] pairs for values, so if you have a multi-dimensional tensor value you end up with a lot of escaping.

Okay.

Fair. I'd forgotten about COUNT and that multi-line regexes are possible. (Although I will point out that SAME does not always do the same thing as pasting the directives together with .* separators; see my "FileCheck Follies" lightning talk from the Fall 2016 Dev Meeting.)

Nice talk, I had not seen it before.

"Easy to get wrong" is a worthwhile point too.

Yes, I had a couple of O(10) chat messages exchange with different folks that were frustrated and thought "mmm, could we make this better/easier"

jpienaar retitled this revision from [FileCheck] Add a literal check directive to [FileCheck] Add a literal check modifier.Nov 20 2020, 10:50 AM
jpienaar edited the summary of this revision. (Show Details)
jpienaar updated this revision to Diff 306745.Nov 20 2020, 11:09 AM

Fix missing :

jpienaar retitled this revision from [FileCheck] Add a literal check modifier to [FileCheck] Add check modifier capability.Dec 1 2020, 8:57 AM
jpienaar edited the summary of this revision. (Show Details)
jpienaar edited reviewers, added: jhenderson, thopre; removed: mtrofin.
jhenderson added inline comments.Dec 2 2020, 12:16 AM
llvm/docs/CommandGuide/FileCheck.rst
664

Fix this line to have the right number of ~ or the doc build will break. If you don't have it setup, I recommend you enable the docs build on your machine and build the html pages, to flush out any problems.

It would be nice to have an example of how to use the LITERAL modifier, both in conjunction with a normal check and a more complicated directive. I think this was there before, looking at the diff history, but it's disappeared again.

llvm/include/llvm/FileCheck/FileCheck.h
96

literal -> Literal, as per clang-tidy warning.

Same issue in various other cases in the new code (variables in the core LLVM project use UpperCase)

llvm/lib/FileCheck/FileCheck.cpp
1611
1612

Would it be simpler to just do Prefix + str + getModifiersDescription()? Not sure formatv really gets us anything here.

llvm/test/FileCheck/check-literal.txt
2

It would be nice to have a brief comment at the start of the test explaining what this test is actually testing (specifically, it's testing the literal modifier, and not something else that might be considered "literal").

You should probably have a case with a suffix following {LITERAL} to show the behaviour when the : doesn't immediately follow it. Also, where there's no closing } after LITERAL.

Is it worth checking the use with -NOT too? Not sure on that one, but maybe.

jpienaar updated this revision to Diff 309126.Dec 2 2020, 7:34 PM
jpienaar marked 5 inline comments as done.

Fix naming conventions & expand test

jpienaar updated this revision to Diff 309127.Dec 2 2020, 7:39 PM

Accidentally included unrelated change in previous diff

jpienaar updated this revision to Diff 309129.Dec 2 2020, 7:45 PM

Fix missing empty line in rst

jhenderson added inline comments.Dec 3 2020, 2:20 AM
llvm/test/FileCheck/check-literal.txt
7

"but"?

23

I was thinking more that you need another FileCheck run which will fail because it is doing A-NOT{LITERAL} but found a match.

jpienaar updated this revision to Diff 309358.Dec 3 2020, 1:42 PM
jpienaar marked 2 inline comments as done.

Fixing redundant word in test case and adding negative test.

jhenderson added inline comments.Dec 7 2020, 1:32 AM
llvm/test/FileCheck/check-literal.txt
4

Here and in other comments, I suggest using ;; for the comment marker - the double marker helps distinguish it from an actual FileCheck or lit directive.

We have tended to do that in some newer tests in the LLVM binutils at least, and I think others I've been involved in reviewing.

28–29

I think this needs %ProtectFileCheckOutput, but am not 100% sure if that applies for "not" cases (I think it does).

Also, I have a personal preference for continuation lines to be formatted as suggested in the inline edit. The idea is that it clearly shows from looking at the first line that you've got to the end of that command, whilst the second line is indented shwoing that it is a continuation.

jpienaar updated this revision to Diff 309962.Dec 7 2020, 11:11 AM
jpienaar marked 3 inline comments as done.

Updated test comments to be ;; , using ProtectFileCheckOutput for piped case, switch | location for continuation

The implementation of the feature looks fine to me, but I'd like to defer to @thopre and @probinson on whether the feature actually gets accepted or not, so not marking as accepted myself.

thopre added a comment.Dec 8 2020, 2:57 AM

Could you add some more tests:

  • negative tests for the modifier syntax (e.g. "{LITERAL,}", "{}", "{LITTERAL}", "{LITERAL"
  • positive tests with several modifiers (i.e. "{LITERAL,LITERAL}")

It's looking good otherwise. Thanks for making the code already prepared for multiple modifiers. I think we might want to have a shorter version of the modifier but better keep it for later when the need is felt.

llvm/docs/CommandGuide/FileCheck.rst
667

I suggest to match the formalism used to introduced numeric expression, as per edit

669

When there'll be more than one I'll suggest using a list but since there's only one supported value I propose folding it inside the text above (as per edit)

llvm/include/llvm/FileCheck/FileCheck.h
73

I'd remove the value, it'll automatically take the last enumerator + 1 and won't require changing for every new modifier.

llvm/lib/FileCheck/FileCheck.cpp
1668–1673

Perhaps allow whitespaces?

1679–1680

This comment needs updating

llvm/test/FileCheck/check-literal.txt
32

It took me a while to parse that sentence but I'm not sure how to improve it. Any suggestion @jhenderson ?

Perhaps "This ensures an error is correctly reported when a NOT directive with a LITERAL modifier matches"?

jpienaar updated this revision to Diff 310241.Dec 8 2020, 9:12 AM
jpienaar marked 8 inline comments as done.

Adding more negative tests, updating doc & comments, allow whitespace in modifier list.

jhenderson added inline comments.Dec 9 2020, 12:25 AM
llvm/lib/FileCheck/FileCheck.cpp
1683–1684
1685–1686

Looking at this again, I'm not sure these two should be folded together. I think it would make more sense to have the : check separate to the { check, and then not to call ConsumeModifiers in the event of the :, since there are none.

llvm/test/FileCheck/check-literal.txt
21

Also A{,LITERAL}?

31

Also add a space before the comma, so that the before and after aspects of the whitespace trimming are tested.

32

The latest version seems reasonable to me. I might suggest "an error" -> "a failure" since error could be implied to mean a problem (e.g. undefined variable) as opposed to a failed CHECK directive.

45

I'd make this more obvious, e.g.INVALID.

jpienaar updated this revision to Diff 310512.Dec 9 2020, 6:16 AM
jpienaar marked 6 inline comments as done.

Add additional test & split checking for plain CHECK.

jhenderson added inline comments.Dec 10 2020, 1:04 AM
llvm/lib/FileCheck/FileCheck.cpp
1681–1682

Can you get rid of this check now?

llvm/test/FileCheck/check-literal.txt
46

Sorry, I was referring to the {LITTERAL} bit actually :)

The single character difference to LITERAL makes it look like there's a typo, and not a deliberate decision.

jpienaar updated this revision to Diff 310858.Dec 10 2020, 5:26 AM
jpienaar marked an inline comment as done.

Change from LITTERAL to BADMODIFIER to make it clearer

jpienaar marked an inline comment as done.Dec 10 2020, 5:28 AM
jpienaar added inline comments.
llvm/lib/FileCheck/FileCheck.cpp
1681–1682

No because these are used by CHECK-NEXT and the rest, so I would need to repeat this for each of them there.

The test is currently failing on the pre-merge bot :-)

llvm/test/FileCheck/check-literal.txt
47

Should this be 'INVALID'?

jpienaar updated this revision to Diff 311213.Dec 11 2020, 6:25 AM
jpienaar marked an inline comment as done.

D'oh fix incorrect typo in test testing typo :)

jhenderson added inline comments.Dec 11 2020, 6:33 AM
llvm/test/FileCheck/check-literal.txt
47

Any reason, you're not testing the closing '?

jpienaar added inline comments.Dec 11 2020, 6:47 AM
llvm/test/FileCheck/check-literal.txt
47

Yes, as it ends with :' which I feel could be improved (either by just dropping the : or also adding a regex for modifiers as optional there, preference towards the former at the moment). So wanted to just rest the essential part here and have the update of error message in follow up as it could affect more tests (e.g., make this and it more directed changes).

No more comments from me at this time, but as before, I'd prefer others to confirm they're happy with the concept.

llvm/test/FileCheck/check-literal.txt
47

Thanks for the explanation.

No more comments from me at this time, but as before, I'd prefer others to confirm they're happy with the concept.

I suggested making it a modifier rather than a new directive so I'm of course happy with it. Will let others speak up if they want but you're good to go from me

No more comments from me at this time, but as before, I'd prefer others to confirm they're happy with the concept.

I suggested making it a modifier rather than a new directive so I'm of course happy with it. Will let others speak up if they want but you're good to go from me

SG, I'll wait a couple of days to see if any other feedback and then proceed. Thanks

jpienaar updated this revision to Diff 312892.Dec 18 2020, 3:47 PM

Rebase to trigger presubmit again

jpienaar abandoned this revision.Dec 22 2020, 8:13 AM

Closed in https://github.com/llvm/llvm-project/commit/44f399ccc12e27d20bae1ea7e712ef7f71e2ff3a (unfortunately omitted from commit description :-/)