This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/
-
lib/Format/
-
Format/
3/6
TokenAnnotator.cpp
-
unittests/Format/
-
Format/
-
FormatTest.cpp

Differential D60362

[clang-format] [PR39719] clang-format converting object-like macro to function-like macro
AbandonedPublic

Authored by MyDeveloperDay on Apr 6 2019, 5:41 AM.

Download Raw Diff

Details

Reviewers

klimek
djasper
reuk
russellmcc
owenpan
sammccall

Summary

Clang format is incorrectly formatting macros where keywords are redefined

See https://bugs.llvm.org/show_bug.cgi?id=39719

The following code

- #define true ((foo)1)
- #define false ((foo)0)

becomes

+ #define true((foo)1)
+ #define false((foo)0)

Diff Detail

Event Timeline

MyDeveloperDay created this revision.Apr 6 2019, 5:41 AM

MyDeveloperDay added a reviewer: owenpan.Apr 7 2019, 12:50 PM

owenpan requested changes to this revision.Apr 7 2019, 11:52 PM

owenpan added inline comments.

clang/lib/Format/TokenAnnotator.cpp
2467–2470	I think it can be more precise and simplified to something like this: if (Left.Previous && Left.Previous->is(tok::pp_define) && Left.isNot(tok::identifier) && Right.is(tok::l_paren))

This revision now requires changes to proceed.Apr 7 2019, 11:52 PM

owenpan added a reviewer: sammccall.Apr 7 2019, 11:53 PM

klimek added inline comments.Apr 8 2019, 1:08 AM

clang/lib/Format/TokenAnnotator.cpp
2467–2470	Why don't we have the same problem for identifier? Is that already solved and the problem is that this is a keyword redefinition?

MyDeveloperDay marked 2 inline comments as done.Apr 8 2019, 2:00 AM

MyDeveloperDay added inline comments.

clang/lib/Format/TokenAnnotator.cpp
2467–2470	Yes the identifier seems to work ok, but when its a keyword redfinition the identifier is replaced with the token for the keyword i.e. tok::kw_true or tok::kw_false

klimek added inline comments.Apr 8 2019, 2:04 AM

clang/lib/Format/TokenAnnotator.cpp
2467–2470	And the idea is that for non-ID #define true(x) x won't work anyway? (otherwise this patch would be incorrect, right?) Have you looked at where we detect the diff between #define a(x) x and #define a (x) in the identifier case and looked we could add common keyword macro cases there?

MyDeveloperDay planned changes to this revision.Apr 8 2019, 5:48 AM

MyDeveloperDay marked 2 inline comments as done.

MyDeveloperDay added inline comments.

clang/lib/Format/TokenAnnotator.cpp
2467–2470	I see what you mean, this path will reformat the false #define incorrectly #define true ((foo)1) #define false(x) x will be transformed to #define true ((foo)1) #define false (x) x

lebedev.ri set the repository for this revision to rC Clang.Apr 8 2019, 6:23 AM

lebedev.ri edited projects, added Restricted Project; removed Restricted Project.

lebedev.ri edited subscribers, added: cfe-commits; removed: llvm-commits.

klimek added inline comments.Apr 8 2019, 8:14 AM

clang/lib/Format/TokenAnnotator.cpp
2467–2470	Exactly.

@klimek one possible solution to this might be to replace the "keyword" back to an identifier in a '#define <keywoord>' scenario

Maybe something like this?

bool FormatTokenLexer::tryConvertKeyWordDefines() {
  // ensure #define keyword x = tok::hash,tok::identifier,tok::identifier
  if (Tokens.size() < 3)
    return false;

  auto &Hash = *(Tokens.end() - 3);
  auto &Define = *(Tokens.end() - 2);
  auto &Keyword = *(Tokens.end() - 1);

  if (!Hash->is(tok::hash))
    return false;

  if (!Define->is(tok::identifier))
    return false;

  // Already an identifier
  if (Keyword->is(tok::identifier))
    return false;

  if (!Define->Tok.getIdentifierInfo() ||
      Define->Tok.getIdentifierInfo()->getPPKeywordID() != tok::pp_define)
    return false;

  // switch the type to be an identifier
  Keyword->Tok.setKind(tok::identifier);
  return true;
}

A more straightforward way, IMO, is to add to the spaceRequiredBetween function a separate if statement that returns false for the sequence of tokens: #, define, tok::identifier, and (

Actually, there is a neater way: https://reviews.llvm.org/D60853

Abandoning in favor of D60853: clang-format converts a keyword macro definition to a macro function

Revision Contents

Path

Size

clang/

lib/

Format/

TokenAnnotator.cpp

7 lines

unittests/

Format/

FormatTest.cpp

19 lines

Diff 194016

clang/lib/Format/TokenAnnotator.cpp

Show First 20 Lines • Show All 2,456 Lines • ▼ Show 20 Lines	bool TokenAnnotator::spaceRequiredBeforeParens(const FormatToken &Right) const {
return Style.SpaceBeforeParens == FormatStyle::SBPO_Always \|\|		return Style.SpaceBeforeParens == FormatStyle::SBPO_Always \|\|
(Style.SpaceBeforeParens == FormatStyle::SBPO_NonEmptyParentheses &&		(Style.SpaceBeforeParens == FormatStyle::SBPO_NonEmptyParentheses &&
Right.ParameterCount > 0);		Right.ParameterCount > 0);
}		}

bool TokenAnnotator::spaceRequiredBetween(const AnnotatedLine &Line,		bool TokenAnnotator::spaceRequiredBetween(const AnnotatedLine &Line,
const FormatToken &Left,		const FormatToken &Left,
const FormatToken &Right) {		const FormatToken &Right) {
		// if in a #define and a keyword is being defined e.g. #define true (1)
		// ensure the space between the keyword and '(' is preserved
		if (Line.InPPDirective && Right.is(tok::l_paren) &&
		!Left.is(tok::identifier) && Left.Previous &&
		Left.Previous->is(tok::identifier) && Left.Previous->Previous &&
		Left.Previous->Previous->is(tok::hash))
		owenpanUnsubmitted Not Done Reply Inline Actions I think it can be more precise and simplified to something like this: if (Left.Previous && Left.Previous->is(tok::pp_define) && Left.isNot(tok::identifier) && Right.is(tok::l_paren)) owenpan: I think it can be more precise and simplified to something like this: ``` if (Left.Previous…
		klimekUnsubmitted Done Reply Inline Actions Why don't we have the same problem for identifier? Is that already solved and the problem is that this is a keyword redefinition? klimek: Why don't we have the same problem for identifier? Is that already solved and the problem is…
		MyDeveloperDayAuthorUnsubmitted Done Reply Inline Actions Yes the identifier seems to work ok, but when its a keyword redfinition the identifier is replaced with the token for the keyword i.e. tok::kw_true or tok::kw_false MyDeveloperDay: Yes the identifier seems to work ok, but when its a keyword redfinition the identifier is…
		klimekUnsubmitted Not Done Reply Inline Actions And the idea is that for non-ID #define true(x) x won't work anyway? (otherwise this patch would be incorrect, right?) Have you looked at where we detect the diff between #define a(x) x and #define a (x) in the identifier case and looked we could add common keyword macro cases there? klimek: And the idea is that for non-ID #define true(x) x won't work anyway? (otherwise this patch…
		MyDeveloperDayAuthorUnsubmitted Done Reply Inline Actions I see what you mean, this path will reformat the false #define incorrectly #define true ((foo)1) #define false(x) x will be transformed to #define true ((foo)1) #define false (x) x MyDeveloperDay: I see what you mean, this path will reformat the false #define incorrectly ``` #define true…
		klimekUnsubmitted Not Done Reply Inline Actions Exactly. klimek: Exactly.
		return true;
if (Left.is(tok::kw_return) && Right.isNot(tok::semi))		if (Left.is(tok::kw_return) && Right.isNot(tok::semi))
return true;		return true;
if (Left.is(Keywords.kw_assert) && Style.Language == FormatStyle::LK_Java)		if (Left.is(Keywords.kw_assert) && Style.Language == FormatStyle::LK_Java)
return true;		return true;
if (Style.ObjCSpaceAfterProperty && Line.Type == LT_ObjCProperty &&		if (Style.ObjCSpaceAfterProperty && Line.Type == LT_ObjCProperty &&
Left.Tok.getObjCKeywordID() == tok::objc_property)		Left.Tok.getObjCKeywordID() == tok::objc_property)
return true;		return true;
if (Right.is(tok::hashhash))		if (Right.is(tok::hashhash))
▲ Show 20 Lines • Show All 1,004 Lines • Show Last 20 Lines

clang/unittests/Format/FormatTest.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 13,417 Lines • ▼ Show 20 Lines	TEST_F(FormatTest, GuessLanguageWithChildLines) {
EXPECT_EQ(		EXPECT_EQ(
FormatStyle::LK_Cpp,		FormatStyle::LK_Cpp,
guessLanguage("foo.h", "#define FOO ({ foo(); ({ std::string s; }) })"));		guessLanguage("foo.h", "#define FOO ({ foo(); ({ std::string s; }) })"));
EXPECT_EQ(		EXPECT_EQ(
FormatStyle::LK_ObjC,		FormatStyle::LK_ObjC,
guessLanguage("foo.h", "#define FOO ({ foo(); ({ NSString *s; }) })"));		guessLanguage("foo.h", "#define FOO ({ foo(); ({ NSString *s; }) })"));
}		}

		TEST_F(FormatTest, MacroKeyWordsAndParents) {

		format::FormatStyle Style = format::getLLVMStyle();
		verifyFormat("#define TRUE ((foo)1)", Style);
		verifyFormat("#define throw ((foo)1)", Style);
		verifyFormat("#define true ((foo)1)", Style);
		verifyFormat("#define false ((foo)1)", Style);
		verifyFormat("#define sizeof ((foo)1)", Style);
		verifyFormat("#define new ((foo)1)", Style);
		verifyFormat("#define delete ((foo)1)", Style);
		verifyFormat("#define for ((foo)1)", Style);
		verifyFormat("#define override ((foo)1)", Style);
		verifyFormat("#define else ((foo)1)", Style);
		verifyFormat("#define true 1", Style);
		verifyFormat("#define true foo", Style);
		verifyFormat("#define true foo()", Style);
		verifyFormat("#define NO_LENGTH (~(uint64_t)0)", Style);
		}

} // end namespace		} // end namespace
} // end namespace format		} // end namespace format
} // end namespace clang		} // end namespace clang