This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/
-
lib/Format/
-
Format/
-
UnwrappedLineParser.h
11/11
UnwrappedLineParser.cpp
-
unittests/Format/
-
Format/
-
TokenAnnotatorTest.cpp

Differential D119138

[clang-format] Further improve support for requires expressions
ClosedPublic

Authored by HazardyKnusperkeks on Feb 7 2022, 7:01 AM.

Download Raw Diff

Details

Reviewers

MyDeveloperDay
curdeius
owenpan

Commits

rGbcd1e4612f4f: [clang-format] Further improve support for requires expressions

Summary

Detect requires expressions in more unusable contexts. This is far from
perfect, but currently we have no good metric to decide between a
requires expression and a trailing requires clause.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

HazardyKnusperkeks requested review of this revision.Feb 7 2022, 7:01 AM

HazardyKnusperkeks created this revision.

Herald added a project: Restricted Project. · View Herald TranscriptFeb 7 2022, 7:01 AM

Herald added a subscriber: cfe-commits. · View Herald Transcript

HazardyKnusperkeks added a parent revision: D113319: [clang-format] Improve require and concept handling.Feb 7 2022, 7:01 AM

HazardyKnusperkeks added a child revision: D113369: [clang-format] Extend SpaceBeforeParens for requires.Feb 7 2022, 7:08 AM

Some nits.

clang/lib/Format/UnwrappedLineParser.cpp
1564–1565
2817
2844
2852
2880
2888

Harbormaster completed remote builds in B147974: Diff 406446.Feb 7 2022, 8:16 AM

Rebased and updated

Harbormaster completed remote builds in B149089: Diff 408028.Feb 11 2022, 1:17 PM

• Quuxplusone added a subscriber: • Quuxplusone.Feb 11 2022, 1:39 PM

• Quuxplusone added inline comments.

clang/lib/Format/UnwrappedLineParser.cpp
2824–2828	s/doesn't/don't/ IIUC you're talking about, like, `void member() && requires (` — is that right? It might help the reader to give an example snippet right here. (OTOH, it might be "obvious," I don't know. I'm not in the target audience for this code.) ...Ah, I see you give a snippet on line 2837 that's basically what I mean; I just first felt the need for that snippet all the way up here.
2861–2867	I think it's weird that your heuristic parses backward rather than forward. I would think that the next token after the `requires` keyword tells you what it is with pretty high probability: `requires requires` — it's a clause `requires identifier` — it's a clause `requires {` — it's an expression `requires (` — unclear, apply further heuristics Or are those heuristics already present in trunk, and this PR is just dealing with the "unclear" case?
2908

HazardyKnusperkeks planned changes to this revision.Feb 12 2022, 12:55 AM

HazardyKnusperkeks added inline comments.

clang/lib/Format/UnwrappedLineParser.cpp
2861–2867	That would be so much better, but I can't easily look forward. `Next` is still `nullptr`, until I call `nextToken()`, but then I'm already moved along. But this got me thinking, at least for the easy stuff I can just go forward and don't start on the keyword in `parseRequiresClause()` and `parseRequiresExpression()`. The paren case is more tricky, but I will try something.

New approach for identifying the expressions.

HazardyKnusperkeks marked 3 inline comments as done.Feb 12 2022, 4:28 PM

HazardyKnusperkeks added inline comments.

clang/lib/Format/UnwrappedLineParser.cpp
2861–2867	Present in main everything is a clause, except for requires expressions in a constraint expression. So the stuff where you use the requires expression in a "normal" boolean expression are misparsed and thus most likely misformatted. There is actually a `peekToken()`, let's see if this is better.

Harbormaster completed remote builds in B149245: Diff 408221.Feb 12 2022, 4:54 PM

LGTM!

This revision is now accepted and ready to land.Feb 14 2022, 12:59 AM

Closed by commit rGbcd1e4612f4f: [clang-format] Further improve support for requires expressions (authored by HazardyKnusperkeks). · Explain WhyFeb 15 2022, 12:38 PM

This revision was automatically updated to reflect the committed changes.

HazardyKnusperkeks added a commit: rGbcd1e4612f4f: [clang-format] Further improve support for requires expressions.

Hey ho, sorry for the late comment here, but adding peekNextToken(n) is problematic, as this gets in the way of future changes we want to do to handle macros better.
Usually we want to use X = Tokens->getPosition() and FormatTok = Tokens->setPosition(X) pairs when doing look-ahead.
I did a quick attempt at fixing this, but ran into infinite loops later in the annotator :(

Herald added a project: Restricted Project. · View Herald TranscriptNov 25 2022, 7:30 AM

Generally, why do we need to have that much information? I.e. why do we need to know the exact type of the "requires" keyword?
I do understand we need to know the brace type, but that seems like it would be easier to figure out in the TokenAnnotator (where we already parsed UnwrappedLines).
Do we ever parse UnwrappedLines differently depending on requires clauses/expressions?
If not, we should really do the annotation in TokenAnnotator, where we already have nice parsing bounds from the parsed UnwrappedLines.

In D119138#3951749, @klimek wrote:

Generally, why do we need to have that much information? I.e. why do we need to know the exact type of the "requires" keyword?
I do understand we need to know the brace type, but that seems like it would be easier to figure out in the TokenAnnotator (where we already parsed UnwrappedLines).
Do we ever parse UnwrappedLines differently depending on requires clauses/expressions?
If not, we should really do the annotation in TokenAnnotator, where we already have nice parsing bounds from the parsed UnwrappedLines.

Who is we, I'm not part of that we and haven't heard of some macro improvements. And I don't see how that feature is harming you, but be my guest in changing that. If you look into the history of this change I had a heuristic approach which would only look behind to differentiate.

I don't know if that can be solved in the TokenAnnotator, but you and I have different opinions about that. I'd put more annotating in the UnwrappedLineParser, annotate it as soon as we can.

I'll happily review any changes proposed, but I will not rework this piece of code, unless I can see a big flaw in it (which I can't right now).

I changed it in 49aca00d63e14df8bc68fc4329e6cbc9c9805eb8.

"We" is the people working on clang-format :) I hope that we have a common goal of making clang-format as easy to maintain as we can.

FWIW, I once had the same opinion as you about best doing all parsing as early as possible, but djasper convinced me that the split was a good idea, and in the end, I think it turns out to be significantly less brittle to do more complex annotation in TokenAnnotator. E.g. we now have a lookahead limit of 50, which seems rather arbitrary, while in TokenAnnotator we could simply limit lookahead towards the current UnwrappedLine. Similarly, in TokenAnnotator, we already have all the parens connected, so we could simply look from requires l_paren to the corresponding r_paren and whether the next token is an l_brace. If I can find a bit of time I'll take an attempt at implementing it.

In D119138#3951850, @klimek wrote:

I changed it in 49aca00d63e14df8bc68fc4329e6cbc9c9805eb8.

"We" is the people working on clang-format :) I hope that we have a common goal of making clang-format as easy to maintain as we can.

FWIW, I once had the same opinion as you about best doing all parsing as early as possible, but djasper convinced me that the split was a good idea, and in the end, I think it turns out to be significantly less brittle to do more complex annotation in TokenAnnotator. E.g. we now have a lookahead limit of 50, which seems rather arbitrary, while in TokenAnnotator we could simply limit lookahead towards the current UnwrappedLine. Similarly, in TokenAnnotator, we already have all the parens connected, so we could simply look from requires l_paren to the corresponding r_paren and whether the next token is an l_brace. If I can find a bit of time I'll take an attempt at implementing it.

Your commit is in my view a an example of making that maintaining a bit harder, it didn't went through review, had you not posted it here I'd never seen it. LLVM receives to many commits to scan them for changes in clang-format. And as someone who isn't that long involved in clang-format I think there is an overview really missing.

For non-functional clean-ups generally llvm doesn't require pre-commit review - I did communicate here so people involved in the original change wouldn't miss the clean-up. I do agree that what commits to pre-review is a fine line, and usually try to err on the side of pre-review; I'll take your feedback into consideration for future changes.

Regarding a better overview, you're 100% right. This is something we've definitely not been good enough and we need to get better at.

Revision Contents

Path

Size

clang/

lib/

Format/

UnwrappedLineParser.h

5 lines

UnwrappedLineParser.cpp

214 lines

unittests/

Format/

TokenAnnotatorTest.cpp

126 lines

Diff 409012

clang/lib/Format/UnwrappedLineParser.h

Show First 20 Lines • Show All 127 Lines • ▼ Show 20 Lines	private:
void parseSwitch();		void parseSwitch();
void parseNamespace();		void parseNamespace();
void parseModuleImport();		void parseModuleImport();
void parseNew();		void parseNew();
void parseAccessSpecifier();		void parseAccessSpecifier();
bool parseEnum();		bool parseEnum();
bool parseStructLike();		bool parseStructLike();
void parseConcept();		void parseConcept();
void parseRequiresClause();		bool parseRequires();
void parseRequiresExpression();		void parseRequiresClause(FormatToken *RequiresToken);
		void parseRequiresExpression(FormatToken *RequiresToken);
void parseConstraintExpression();		void parseConstraintExpression();
void parseJavaEnumBody();		void parseJavaEnumBody();
// Parses a record (aka class) as a top level element. If ParseAsExpr is true,		// Parses a record (aka class) as a top level element. If ParseAsExpr is true,
// parses the record as a child block, i.e. if the class declaration is an		// parses the record as a child block, i.e. if the class declaration is an
// expression.		// expression.
void parseRecord(bool ParseAsExpr = false);		void parseRecord(bool ParseAsExpr = false);
void parseObjCLightweightGenerics();		void parseObjCLightweightGenerics();
void parseObjCMethod();		void parseObjCMethod();
▲ Show 20 Lines • Show All 192 Lines • Show Last 20 Lines

clang/lib/Format/UnwrappedLineParser.cpp

Show All 34 Lines public:

// Returns the token preceding the token returned by the last call to // Returns the token preceding the token returned by the last call to

// getNextToken() in the token stream, or nullptr if no such token exists. // getNextToken() in the token stream, or nullptr if no such token exists.

virtual FormatToken *getPreviousToken() = 0; virtual FormatToken *getPreviousToken() = 0;

// Returns the token that would be returned by the next call to // Returns the token that would be returned by the next call to

// getNextToken(). // getNextToken().

virtual FormatToken *peekNextToken() = 0; virtual FormatToken *peekNextToken() = 0;

// Returns the token that would be returned after the next N calls to

// getNextToken(). N needs to be greater than zero, and small enough that

// there are still tokens. Check for tok::eof with N-1 before calling it with

// N.

virtual FormatToken *peekNextToken(int N) = 0;

// Returns whether we are at the end of the file. // Returns whether we are at the end of the file.

// This can be different from whether getNextToken() returned an eof token // This can be different from whether getNextToken() returned an eof token

// when the FormatTokenSource is a view on a part of the token stream. // when the FormatTokenSource is a view on a part of the token stream.

virtual bool isEOF() = 0; virtual bool isEOF() = 0;

// Gets the current position in the token stream, to be used by setPosition(). // Gets the current position in the token stream, to be used by setPosition().

virtual unsigned getPosition() = 0; virtual unsigned getPosition() = 0;

▲ Show 20 Lines • Show All 81 Lines • ▼ Show 20 Lines public:

} }

FormatToken *peekNextToken() override { FormatToken *peekNextToken() override {

if (eof()) if (eof())

return &FakeEOF; return &FakeEOF;

return PreviousTokenSource->peekNextToken(); return PreviousTokenSource->peekNextToken();

} }

FormatToken *peekNextToken(int N) override {

assert(N > 0);

if (eof())

return &FakeEOF;

return PreviousTokenSource->peekNextToken(N);

}

bool isEOF() override { return PreviousTokenSource->isEOF(); } bool isEOF() override { return PreviousTokenSource->isEOF(); }

unsigned getPosition() override { return PreviousTokenSource->getPosition(); } unsigned getPosition() override { return PreviousTokenSource->getPosition(); }

FormatToken *setPosition(unsigned Position) override { FormatToken *setPosition(unsigned Position) override {

PreviousToken = nullptr; PreviousToken = nullptr;

Token = PreviousTokenSource->setPosition(Position); Token = PreviousTokenSource->setPosition(Position);

return Token; return Token;

▲ Show 20 Lines • Show All 104 Lines • ▼ Show 20 Lines FormatToken *peekNextToken() override {

int Next = Position + 1; int Next = Position + 1;

LLVM_DEBUG({ LLVM_DEBUG({

llvm::dbgs() << "Peeking "; llvm::dbgs() << "Peeking ";

dbgToken(Next); dbgToken(Next);

}); });

return Tokens[Next]; return Tokens[Next];

} }

FormatToken *peekNextToken(int N) override {

assert(N > 0);

int Next = Position + N;

LLVM_DEBUG({

llvm::dbgs() << "Peeking (+" << (N - 1) << ") ";

dbgToken(Next);

});

return Tokens[Next];

}

bool isEOF() override { return Tokens[Position]->is(tok::eof); } bool isEOF() override { return Tokens[Position]->is(tok::eof); }

unsigned getPosition() override { unsigned getPosition() override {

LLVM_DEBUG(llvm::dbgs() << "Getting Position: " << Position << "\n"); LLVM_DEBUG(llvm::dbgs() << "Getting Position: " << Position << "\n");

assert(Position >= 0); assert(Position >= 0);

return Position; return Position;

} }

▲ Show 20 Lines • Show All 1,264 Lines • ▼ Show 20 Lines case tok::at:

return; return;

default: default:

break; break;

} }

break; break;

case tok::kw_concept: case tok::kw_concept:

parseConcept(); parseConcept();

return; return;

case tok::kw_requires: case tok::kw_requires: {

parseRequiresClause(); bool ParsedClause = parseRequires();

if (ParsedClause)

curdeiusUnsubmitted

Done

case tok::kw_requires: {

- bool Return = parseRequires();

- if (Return)

+ bool ParsedClause = parseRequires();

+ if (ParsedClause)

return;

curdeius:

return; return;

break;

}

case tok::kw_enum: case tok::kw_enum:

// Ignore if this is part of "template <enum ...". // Ignore if this is part of "template <enum ...".

if (Previous && Previous->is(tok::less)) { if (Previous && Previous->is(tok::less)) {

nextToken(); nextToken();

break; break;

} }

// parseEnum falls through and does not yet add an unwrapped line as an // parseEnum falls through and does not yet add an unwrapped line as an

▲ Show 20 Lines • Show All 650 Lines • ▼ Show 20 Lines do {

case tok::identifier: case tok::identifier:

if (Style.isJavaScript() && if (Style.isJavaScript() &&

(FormatTok->is(Keywords.kw_function) || (FormatTok->is(Keywords.kw_function) ||

FormatTok->startsSequence(Keywords.kw_async, Keywords.kw_function))) FormatTok->startsSequence(Keywords.kw_async, Keywords.kw_function)))

tryToParseJSFunction(); tryToParseJSFunction();

else else

nextToken(); nextToken();

break; break;

case tok::kw_requires: case tok::kw_requires: {

parseRequiresExpression(); auto RequiresToken = FormatTok;

nextToken();

parseRequiresExpression(RequiresToken);

break; break;

}

case tok::ampamp: case tok::ampamp:

if (AmpAmpTokenType != TT_Unknown) if (AmpAmpTokenType != TT_Unknown)

FormatTok->setType(AmpAmpTokenType); FormatTok->setType(AmpAmpTokenType);

LLVM_FALLTHROUGH; LLVM_FALLTHROUGH;

default: default:

nextToken(); nextToken();

break; break;

} }

▲ Show 20 Lines • Show All 558 Lines • ▼ Show 20 Lines if (!FormatTok->Tok.is(tok::equal))

return; return;

nextToken(); nextToken();

parseConstraintExpression(); parseConstraintExpression();

if (FormatTok->Tok.is(tok::semi)) if (FormatTok->Tok.is(tok::semi))

nextToken(); nextToken();

addUnwrappedLine(); addUnwrappedLine();

} }

/// \brief Parses a requires, decides if it is a clause or an expression.

/// \pre The current token has to be the requires keyword.

/// \returns true if it parsed a clause.

curdeiusUnsubmitted

Done

/// \pre The current token has to be the requires keyword.

- /// \returns If it parsed a clause.

+ /// \returns true if it parsed a clause.

bool clang::format::UnwrappedLineParser::parseRequires() {

curdeius:

bool clang::format::UnwrappedLineParser::parseRequires() {

assert(FormatTok->Tok.is(tok::kw_requires) && "'requires' expected");

auto RequiresToken = FormatTok;

// We try to guess if it is a requires clause, or a requires expression. For

// that we first consume the keyword and check the next token.

nextToken();

switch (FormatTok->Tok.getKind()) {

case tok::l_brace:

// This can only be an expression, never a clause.

QuuxplusoneUnsubmitted

Done

s/doesn't/don't/
IIUC you're talking about, like, void member() && requires ( — is that right?
It might help the reader to give an example snippet right here. (OTOH, it might be "obvious," I don't know. I'm not in the target audience for this code.)
...Ah, I see you give a snippet on line 2837 that's basically what I mean; I just first felt the need for that snippet all the way up here.

Quuxplusone: s/doesn't/don't/ IIUC you're talking about, like, `void member() && requires (` — is that right?

parseRequiresExpression(RequiresToken);

return false;

case tok::l_paren:

// Clauses and expression can start with a paren, it's unclear what we have.

break;

default:

// All other tokens can only be a clause.

parseRequiresClause(RequiresToken);

return true;

}

// Looking forward we would have to decide if there are function declaration

// like arguments to the requires expression:

// requires (T t) {

// Or there is a constraint expression for the requires clause:

// requires (C<T> && ...

curdeiusUnsubmitted

Done

default:

- // Is most definitly an expression.

+ // It is most definitely an expression.

return true;

curdeius:

// But first let's look behind.

auto *PreviousNonComment = RequiresToken->getPreviousNonComment();

if (!PreviousNonComment ||

PreviousNonComment->is(TT_RequiresExpressionLBrace)) {

// If there is no token, or an expression left brace, we are a requires

// clause within a requires expression.

curdeiusUnsubmitted

Done

if (!LastParenContent) {

- // No Token is invalid code, just do whatever you want.

+ // Missing token is invalid code, just do whatever you want.

return true;

curdeius:

parseRequiresClause(RequiresToken);

return true;

}

switch (PreviousNonComment->Tok.getKind()) {

case tok::greater:

case tok::r_paren:

case tok::kw_noexcept:

case tok::kw_const:

// This is a requires clause.

parseRequiresClause(RequiresToken);

return true;

case tok::amp:

case tok::ampamp: {

// This can be either:

QuuxplusoneUnsubmitted

Done

I think it's weird that your heuristic parses backward rather than forward. I would think that the next token after the requires keyword tells you what it is with pretty high probability:
requires requires — it's a clause
requires identifier — it's a clause
requires { — it's an expression
requires ( — unclear, apply further heuristics

Or are those heuristics already present in trunk, and this PR is just dealing with the "unclear" case?

Quuxplusone: I think it's weird that your heuristic parses backward rather than forward. I would think that…

HazardyKnusperkeksAuthorUnsubmitted

Done

That would be so much better, but I can't easily look forward. Next is still nullptr, until I call nextToken(), but then I'm already moved along.

But this got me thinking, at least for the easy stuff I can just go forward and don't start on the keyword in parseRequiresClause() and parseRequiresExpression(). The paren case is more tricky, but I will try something.

HazardyKnusperkeks: That would be so much better, but I can't easily look forward. `Next` is still `nullptr`, until…

HazardyKnusperkeksAuthorUnsubmitted

Done

Present in main everything is a clause, except for requires expressions in a constraint expression. So the stuff where you use the requires expression in a "normal" boolean expression are misparsed and thus most likely misformatted.

There is actually a peekToken(), let's see if this is better.

HazardyKnusperkeks: Present in main everything is a clause, except for requires expressions in a constraint…

// if (... && requires (T t) ...)

// Or

// void member(...) && requires (C<T> ...

// We check the one token before that for a const:

// void member(...) const && requires (C<T> ...

auto PrevPrev = PreviousNonComment->getPreviousNonComment();

if (PrevPrev && PrevPrev->is(tok::kw_const)) {

parseRequiresClause(RequiresToken);

return true;

}

break;

}

default:

curdeiusUnsubmitted

Done

if (LastParenContent->isSimpleTypeSpecifier()) {

- // Definetly function delcaration.

+ // Definitely function declaration.

return false;

curdeius:

// It's an expression.

parseRequiresExpression(RequiresToken);

return false;

}

// Now we look forward and try to check if the paren content is a parameter

// list. The parameters can be cv-qualified and contain references or

// pointers.

curdeiusUnsubmitted

Done

if (!BeforeLastParenContent) {

- // No Token is invalid code, just do whatever you want.

+ // Missing token is invalid code, just do whatever you want.

return true;

curdeius:

// So we want basically to check for TYPE NAME, but TYPE can contain all kinds

// of stuff: typename, const, *, &, &&, ::, identifiers.

int NextTokenOffset = 1;

auto NextToken = Tokens->peekNextToken(NextTokenOffset);

auto PeekNext = [&NextTokenOffset, &NextToken, this] {

++NextTokenOffset;

NextToken = Tokens->peekNextToken(NextTokenOffset);

};

bool FoundType = false;

bool LastWasColonColon = false;

int OpenAngles = 0;

for (; NextTokenOffset < 50; PeekNext()) {

switch (NextToken->Tok.getKind()) {

case tok::kw_volatile:

case tok::kw_const:

case tok::comma:

parseRequiresExpression(RequiresToken);

QuuxplusoneUnsubmitted

Done

if (BeforeLastParenContent->isSimpleTypeSpecifier()) {

- // Definetly function delcaration.

+ // Definitely a function declaration.

return false;

Quuxplusone:

return false;

case tok::r_paren:

case tok::pipepipe:

parseRequiresClause(RequiresToken);

return true;

case tok::eof:

// Break out of the loop.

NextTokenOffset = 50;

break;

case tok::coloncolon:

LastWasColonColon = true;

break;

case tok::identifier:

if (FoundType && !LastWasColonColon && OpenAngles == 0) {

parseRequiresExpression(RequiresToken);

return false;

}

FoundType = true;

LastWasColonColon = false;

break;

case tok::less:

++OpenAngles;

break;

case tok::greater:

--OpenAngles;

break;

default:

if (NextToken->isSimpleTypeSpecifier()) {

parseRequiresExpression(RequiresToken);

return false;

}

break;

}

// This seems to be a complicated expression, just assume it's a clause.

parseRequiresClause(RequiresToken);

return true;

}

/// \brief Parses a requires clause. /// \brief Parses a requires clause.

/// \pre The current token needs to be the requires keyword. /// \param RequiresToken The requires keyword token, which starts this clause.

/// \pre We need to be on the next token after the requires keyword.

/// \sa parseRequiresExpression /// \sa parseRequiresExpression

/// ///

/// Returns if it either has finished parsing the clause, or it detects, that /// Returns if it either has finished parsing the clause, or it detects, that

/// the clause is incorrect. /// the clause is incorrect.

void UnwrappedLineParser::parseRequiresClause() { void UnwrappedLineParser::parseRequiresClause(FormatToken *RequiresToken) {

assert(FormatTok->Tok.is(tok::kw_requires) && "'requires' expected"); assert(FormatTok->getPreviousNonComment() == RequiresToken);

assert(FormatTok->getType() == TT_Unknown); assert(RequiresToken->Tok.is(tok::kw_requires) && "'requires' expected");

assert(RequiresToken->getType() == TT_Unknown);

// If there is no previous token, we are within a requires expression, // If there is no previous token, we are within a requires expression,

// otherwise we will always have the template or function declaration in front // otherwise we will always have the template or function declaration in front

// of it. // of it.

bool InRequiresExpression = bool InRequiresExpression =

!FormatTok->Previous || !RequiresToken->Previous ||

FormatTok->Previous->is(TT_RequiresExpressionLBrace); RequiresToken->Previous->is(TT_RequiresExpressionLBrace);

FormatTok->setType(InRequiresExpression RequiresToken->setType(InRequiresExpression

? TT_RequiresClauseInARequiresExpression ? TT_RequiresClauseInARequiresExpression

: TT_RequiresClause); : TT_RequiresClause);

nextToken();

parseConstraintExpression(); parseConstraintExpression();

if (!InRequiresExpression) if (!InRequiresExpression)

FormatTok->Previous->ClosesRequiresClause = true; FormatTok->Previous->ClosesRequiresClause = true;

} }

/// \brief Parses a requires expression. /// \brief Parses a requires expression.

/// \pre The current token needs to be the requires keyword. /// \param RequiresToken The requires keyword token, which starts this clause.

/// \pre We need to be on the next token after the requires keyword.

/// \sa parseRequiresClause /// \sa parseRequiresClause

/// ///

/// Returns if it either has finished parsing the expression, or it detects, /// Returns if it either has finished parsing the expression, or it detects,

/// that the expression is incorrect. /// that the expression is incorrect.

void UnwrappedLineParser::parseRequiresExpression() { void UnwrappedLineParser::parseRequiresExpression(FormatToken *RequiresToken) {

assert(FormatTok->Tok.is(tok::kw_requires) && "'requires' expected"); assert(FormatTok->getPreviousNonComment() == RequiresToken);

assert(FormatTok->getType() == TT_Unknown); assert(RequiresToken->Tok.is(tok::kw_requires) && "'requires' expected");

assert(RequiresToken->getType() == TT_Unknown);

FormatTok->setType(TT_RequiresExpression); RequiresToken->setType(TT_RequiresExpression);

nextToken();

if (FormatTok->is(tok::l_paren)) { if (FormatTok->is(tok::l_paren)) {

FormatTok->setType(TT_RequiresExpressionLParen); FormatTok->setType(TT_RequiresExpressionLParen);

parseParens(); parseParens();

} }

if (FormatTok->is(tok::l_brace)) { if (FormatTok->is(tok::l_brace)) {

FormatTok->setType(TT_RequiresExpressionLBrace); FormatTok->setType(TT_RequiresExpressionLBrace);

parseChildBlock(/*CanContainBracedList=*/false, parseChildBlock(/*CanContainBracedList=*/false,

/*NextLBracesType=*/TT_CompoundRequirementLBrace); /*NextLBracesType=*/TT_CompoundRequirementLBrace);

} }

/// \brief Parses a constraint expression. /// \brief Parses a constraint expression.

/// ///

/// This is either the definition of a concept, or the body of a requires /// This is either the definition of a concept, or the body of a requires

/// clause. It returns, when the parsing is complete, or the expression is /// clause. It returns, when the parsing is complete, or the expression is

/// incorrect. /// incorrect.

void UnwrappedLineParser::parseConstraintExpression() { void UnwrappedLineParser::parseConstraintExpression() {

do { do {

switch (FormatTok->Tok.getKind()) { switch (FormatTok->Tok.getKind()) {

case tok::kw_requires: case tok::kw_requires: {

parseRequiresExpression(); auto RequiresToken = FormatTok;

nextToken();

parseRequiresExpression(RequiresToken);

break; break;

}

case tok::l_paren: case tok::l_paren:

parseParens(/*AmpAmpTokenType=*/TT_BinaryOperator); parseParens(/*AmpAmpTokenType=*/TT_BinaryOperator);

break; break;

case tok::l_square: case tok::l_square:

if (!tryToParseLambda()) if (!tryToParseLambda())

return; return;

▲ Show 20 Lines • Show All 936 Lines • Show Last 20 Lines

clang/unittests/Format/TokenAnnotatorTest.cpp

Show First 20 Lines • Show All 135 Lines • ▼ Show 20 Lines	TEST_F(TokenAnnotatorTest, UnderstandsRequiresClausesAndConcepts) {
EXPECT_TOKEN(Tokens[13], tok::ampamp, TT_BinaryOperator);		EXPECT_TOKEN(Tokens[13], tok::ampamp, TT_BinaryOperator);
EXPECT_TOKEN(Tokens[16], tok::ampamp, TT_BinaryOperator);		EXPECT_TOKEN(Tokens[16], tok::ampamp, TT_BinaryOperator);

Tokens = annotate("template <typename T>\n"		Tokens = annotate("template <typename T>\n"
"concept C = requires(T t) {\n"		"concept C = requires(T t) {\n"
" { t.foo() };\n"		" { t.foo() };\n"
"} && Bar<T> && Baz<T>;");		"} && Bar<T> && Baz<T>;");
ASSERT_EQ(Tokens.size(), 35u) << Tokens;		ASSERT_EQ(Tokens.size(), 35u) << Tokens;
		EXPECT_TOKEN(Tokens[8], tok::kw_requires, TT_RequiresExpression);
		EXPECT_TOKEN(Tokens[9], tok::l_paren, TT_RequiresExpressionLParen);
		EXPECT_TOKEN(Tokens[13], tok::l_brace, TT_RequiresExpressionLBrace);
EXPECT_TOKEN(Tokens[23], tok::ampamp, TT_BinaryOperator);		EXPECT_TOKEN(Tokens[23], tok::ampamp, TT_BinaryOperator);
EXPECT_TOKEN(Tokens[28], tok::ampamp, TT_BinaryOperator);		EXPECT_TOKEN(Tokens[28], tok::ampamp, TT_BinaryOperator);

Tokens = annotate("template<typename T>\n"		Tokens = annotate("template<typename T>\n"
"requires C1<T> && (C21<T> \|\| C22<T> && C2e<T>) && C3<T>\n"		"requires C1<T> && (C21<T> \|\| C22<T> && C2e<T>) && C3<T>\n"
"struct Foo;");		"struct Foo;");
ASSERT_EQ(Tokens.size(), 36u) << Tokens;		ASSERT_EQ(Tokens.size(), 36u) << Tokens;
		EXPECT_TOKEN(Tokens[5], tok::kw_requires, TT_RequiresClause);
EXPECT_TOKEN(Tokens[6], tok::identifier, TT_Unknown);		EXPECT_TOKEN(Tokens[6], tok::identifier, TT_Unknown);
EXPECT_EQ(Tokens[6]->FakeLParens.size(), 1u);		EXPECT_EQ(Tokens[6]->FakeLParens.size(), 1u);
EXPECT_TOKEN(Tokens[10], tok::ampamp, TT_BinaryOperator);		EXPECT_TOKEN(Tokens[10], tok::ampamp, TT_BinaryOperator);
EXPECT_TOKEN(Tokens[16], tok::pipepipe, TT_BinaryOperator);		EXPECT_TOKEN(Tokens[16], tok::pipepipe, TT_BinaryOperator);
EXPECT_TOKEN(Tokens[21], tok::ampamp, TT_BinaryOperator);		EXPECT_TOKEN(Tokens[21], tok::ampamp, TT_BinaryOperator);
EXPECT_TOKEN(Tokens[27], tok::ampamp, TT_BinaryOperator);		EXPECT_TOKEN(Tokens[27], tok::ampamp, TT_BinaryOperator);
EXPECT_TOKEN(Tokens[31], tok::greater, TT_TemplateCloser);		EXPECT_TOKEN(Tokens[31], tok::greater, TT_TemplateCloser);
EXPECT_EQ(Tokens[31]->FakeRParens, 1u);		EXPECT_EQ(Tokens[31]->FakeRParens, 1u);
EXPECT_TRUE(Tokens[31]->ClosesRequiresClause);		EXPECT_TRUE(Tokens[31]->ClosesRequiresClause);

Tokens =		Tokens =
annotate("template<typename T>\n"		annotate("template<typename T>\n"
"requires (C1<T> && (C21<T> \|\| C22<T> && C2e<T>) && C3<T>)\n"		"requires (C1<T> && (C21<T> \|\| C22<T> && C2e<T>) && C3<T>)\n"
"struct Foo;");		"struct Foo;");
ASSERT_EQ(Tokens.size(), 38u) << Tokens;		ASSERT_EQ(Tokens.size(), 38u) << Tokens;
		EXPECT_TOKEN(Tokens[5], tok::kw_requires, TT_RequiresClause);
EXPECT_TOKEN(Tokens[7], tok::identifier, TT_Unknown);		EXPECT_TOKEN(Tokens[7], tok::identifier, TT_Unknown);
EXPECT_EQ(Tokens[7]->FakeLParens.size(), 1u);		EXPECT_EQ(Tokens[7]->FakeLParens.size(), 1u);
EXPECT_TOKEN(Tokens[11], tok::ampamp, TT_BinaryOperator);		EXPECT_TOKEN(Tokens[11], tok::ampamp, TT_BinaryOperator);
EXPECT_TOKEN(Tokens[17], tok::pipepipe, TT_BinaryOperator);		EXPECT_TOKEN(Tokens[17], tok::pipepipe, TT_BinaryOperator);
EXPECT_TOKEN(Tokens[22], tok::ampamp, TT_BinaryOperator);		EXPECT_TOKEN(Tokens[22], tok::ampamp, TT_BinaryOperator);
EXPECT_TOKEN(Tokens[28], tok::ampamp, TT_BinaryOperator);		EXPECT_TOKEN(Tokens[28], tok::ampamp, TT_BinaryOperator);
EXPECT_TOKEN(Tokens[32], tok::greater, TT_TemplateCloser);		EXPECT_TOKEN(Tokens[32], tok::greater, TT_TemplateCloser);
EXPECT_EQ(Tokens[32]->FakeRParens, 1u);		EXPECT_EQ(Tokens[32]->FakeRParens, 1u);
EXPECT_TOKEN(Tokens[33], tok::r_paren, TT_Unknown);		EXPECT_TOKEN(Tokens[33], tok::r_paren, TT_Unknown);
EXPECT_TRUE(Tokens[33]->ClosesRequiresClause);		EXPECT_TRUE(Tokens[33]->ClosesRequiresClause);

		Tokens = annotate("template <typename T>\n"
		"void foo(T) noexcept requires Bar<T>;");
		ASSERT_EQ(Tokens.size(), 18u) << Tokens;
		EXPECT_TOKEN(Tokens[11], tok::kw_requires, TT_RequiresClause);

		Tokens = annotate("template <typename T>\n"
		"struct S {\n"
		" void foo() const requires Bar<T>;\n"
		" void bar() const & requires Baz<T>;\n"
		" void bar() && requires Baz2<T>;\n"
		" void baz() const & noexcept requires Baz<T>;\n"
		" void baz() && noexcept requires Baz2<T>;\n"
		"};\n"
		"\n"
		"void S::bar() const & requires Baz<T> { }");
		ASSERT_EQ(Tokens.size(), 85u) << Tokens;
		EXPECT_TOKEN(Tokens[13], tok::kw_requires, TT_RequiresClause);
		EXPECT_TOKEN(Tokens[25], tok::kw_requires, TT_RequiresClause);
		EXPECT_TOKEN(Tokens[36], tok::kw_requires, TT_RequiresClause);
		EXPECT_TOKEN(Tokens[49], tok::kw_requires, TT_RequiresClause);
		EXPECT_TOKEN(Tokens[61], tok::kw_requires, TT_RequiresClause);
		EXPECT_TOKEN(Tokens[77], tok::kw_requires, TT_RequiresClause);

		Tokens = annotate("void Class::member() && requires(Constant) {}");
		ASSERT_EQ(Tokens.size(), 14u) << Tokens;
		EXPECT_TOKEN(Tokens[7], tok::kw_requires, TT_RequiresClause);

		Tokens = annotate("void Class::member() && requires(Constant<T>) {}");
		ASSERT_EQ(Tokens.size(), 17u) << Tokens;
		EXPECT_TOKEN(Tokens[7], tok::kw_requires, TT_RequiresClause);

		Tokens =
		annotate("void Class::member() && requires(Namespace::Constant<T>) {}");
		ASSERT_EQ(Tokens.size(), 19u) << Tokens;
		EXPECT_TOKEN(Tokens[7], tok::kw_requires, TT_RequiresClause);

		Tokens = annotate("void Class::member() && requires(typename "
		"Namespace::Outer<T>::Inner::Constant) {}");
		ASSERT_EQ(Tokens.size(), 24u) << Tokens;
		EXPECT_TOKEN(Tokens[7], tok::kw_requires, TT_RequiresClause);
		}

		TEST_F(TokenAnnotatorTest, UnderstandsRequiresExpressions) {
		auto Tokens = annotate("bool b = requires(int i) { i + 5; };");
		ASSERT_EQ(Tokens.size(), 16u) << Tokens;
		EXPECT_TOKEN(Tokens[3], tok::kw_requires, TT_RequiresExpression);
		EXPECT_TOKEN(Tokens[4], tok::l_paren, TT_RequiresExpressionLParen);
		EXPECT_TOKEN(Tokens[8], tok::l_brace, TT_RequiresExpressionLBrace);

		Tokens = annotate("if (requires(int i) { i + 5; }) return;");
		ASSERT_EQ(Tokens.size(), 17u) << Tokens;
		EXPECT_TOKEN(Tokens[2], tok::kw_requires, TT_RequiresExpression);
		EXPECT_TOKEN(Tokens[3], tok::l_paren, TT_RequiresExpressionLParen);
		EXPECT_TOKEN(Tokens[7], tok::l_brace, TT_RequiresExpressionLBrace);

		Tokens = annotate("if (func() && requires(int i) { i + 5; }) return;");
		ASSERT_EQ(Tokens.size(), 21u) << Tokens;
		EXPECT_TOKEN(Tokens[6], tok::kw_requires, TT_RequiresExpression);
		EXPECT_TOKEN(Tokens[7], tok::l_paren, TT_RequiresExpressionLParen);
		EXPECT_TOKEN(Tokens[11], tok::l_brace, TT_RequiresExpressionLBrace);

		Tokens = annotate("foo(requires(const T t) {});");
		ASSERT_EQ(Tokens.size(), 13u) << Tokens;
		EXPECT_TOKEN(Tokens[2], tok::kw_requires, TT_RequiresExpression);
		EXPECT_TOKEN(Tokens[3], tok::l_paren, TT_RequiresExpressionLParen);
		EXPECT_TOKEN(Tokens[8], tok::l_brace, TT_RequiresExpressionLBrace);

		Tokens = annotate("foo(requires(const int t) {});");
		ASSERT_EQ(Tokens.size(), 13u) << Tokens;
		EXPECT_TOKEN(Tokens[2], tok::kw_requires, TT_RequiresExpression);
		EXPECT_TOKEN(Tokens[3], tok::l_paren, TT_RequiresExpressionLParen);
		EXPECT_TOKEN(Tokens[8], tok::l_brace, TT_RequiresExpressionLBrace);

		Tokens = annotate("foo(requires(const T t) {});");
		ASSERT_EQ(Tokens.size(), 13u) << Tokens;
		EXPECT_TOKEN(Tokens[2], tok::kw_requires, TT_RequiresExpression);
		EXPECT_TOKEN(Tokens[3], tok::l_paren, TT_RequiresExpressionLParen);
		EXPECT_TOKEN(Tokens[8], tok::l_brace, TT_RequiresExpressionLBrace);

		Tokens = annotate("foo(requires(int const* volatile t) {});");
		ASSERT_EQ(Tokens.size(), 15u) << Tokens;
		EXPECT_TOKEN(Tokens[2], tok::kw_requires, TT_RequiresExpression);
		EXPECT_TOKEN(Tokens[3], tok::l_paren, TT_RequiresExpressionLParen);
		EXPECT_TOKEN(Tokens[10], tok::l_brace, TT_RequiresExpressionLBrace);

		Tokens = annotate("foo(requires(T const* volatile t) {});");
		ASSERT_EQ(Tokens.size(), 15u) << Tokens;
		EXPECT_TOKEN(Tokens[2], tok::kw_requires, TT_RequiresExpression);
		EXPECT_TOKEN(Tokens[3], tok::l_paren, TT_RequiresExpressionLParen);
		EXPECT_TOKEN(Tokens[10], tok::l_brace, TT_RequiresExpressionLBrace);

		Tokens =
		annotate("foo(requires(const typename Outer<T>::Inner * const t) {});");
		ASSERT_EQ(Tokens.size(), 21u) << Tokens;
		EXPECT_TOKEN(Tokens[2], tok::kw_requires, TT_RequiresExpression);
		EXPECT_TOKEN(Tokens[3], tok::l_paren, TT_RequiresExpressionLParen);
		EXPECT_TOKEN(Tokens[16], tok::l_brace, TT_RequiresExpressionLBrace);

		Tokens = annotate("template <typename T>\n"
		"concept C = requires(T T) {\n"
		" requires Bar<T> && Foo<T>;\n"
		"};");
		ASSERT_EQ(Tokens.size(), 28u) << Tokens;
		EXPECT_TOKEN(Tokens[8], tok::kw_requires, TT_RequiresExpression);
		EXPECT_TOKEN(Tokens[9], tok::l_paren, TT_RequiresExpressionLParen);
		EXPECT_TOKEN(Tokens[13], tok::l_brace, TT_RequiresExpressionLBrace);
		EXPECT_TOKEN(Tokens[14], tok::kw_requires,
		TT_RequiresClauseInARequiresExpression);

		Tokens = annotate("template <typename T>\n"
		"concept C = requires(T T) {\n"
		" { t.func() } -> std::same_as<int>;"
		" requires Bar<T> && Foo<T>;\n"
		"};");
		ASSERT_EQ(Tokens.size(), 43u) << Tokens;
		EXPECT_TOKEN(Tokens[8], tok::kw_requires, TT_RequiresExpression);
		EXPECT_TOKEN(Tokens[9], tok::l_paren, TT_RequiresExpressionLParen);
		EXPECT_TOKEN(Tokens[13], tok::l_brace, TT_RequiresExpressionLBrace);
		EXPECT_TOKEN(Tokens[29], tok::kw_requires,
		TT_RequiresClauseInARequiresExpression);
}		}

TEST_F(TokenAnnotatorTest, RequiresDoesNotChangeParsingOfTheRest) {		TEST_F(TokenAnnotatorTest, RequiresDoesNotChangeParsingOfTheRest) {
auto NumberOfAdditionalRequiresClauseTokens = 5u;		auto NumberOfAdditionalRequiresClauseTokens = 5u;
auto NumberOfTokensBeforeRequires = 5u;		auto NumberOfTokensBeforeRequires = 5u;

auto BaseTokens = annotate("template<typename T>\n"		auto BaseTokens = annotate("template<typename T>\n"
"T Pi = 3.14;");		"T Pi = 3.14;");
▲ Show 20 Lines • Show All 204 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[clang-format] Further improve support for requires expressionsClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 409012

clang/lib/Format/UnwrappedLineParser.h

clang/lib/Format/UnwrappedLineParser.cpp

clang/unittests/Format/TokenAnnotatorTest.cpp

[clang-format] Further improve support for requires expressions
ClosedPublic