This is an archive of the discontinued LLVM Phabricator instance.

Differential D134942

[Lex] Simplify and cleanup the updateConsecutiveMacroArgTokens implementation.
Closed, Public

Authored by hokein on Sep 30 2022, 2:26 AM.

Details

Reviewers

sammccall
nickdesaulniers

Commits

rG74e4f778cf16: [Lex] Simplify and cleanup the updateConsecutiveMacroArgTokens implementation.

Summary

The code falls back to the pre-2011 partition-file-id solution (see for
details).

This patch simplifies/rewrites the code based on the partition-based-on-file-id
idea. The new implementation is optimized by reducing the number of
getFileID calls (~40% drop).

Despite the large drop in getFileID calls, this is only a marginal improvement in
speed (because the number of non-cached getFileID calls is roughly
the same). It removes the evaluation-order performance gap between gcc-built-clang
and clang-built-clang.
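
The partition idea can be sketched outside of Clang. This is a minimal illustration under stated assumptions, not the patch itself: source locations are modelled as plain integer offsets, each "file" owns a half-open offset range, and `FileRange`, `lookupFileEnd`, and `partitionLength` are hypothetical names. The point it shows is that one lookup for the first token yields a bound, after which every other token needs only a cheap comparison.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a source-location entry: offsets [Begin, End).
struct FileRange {
  unsigned Begin;
  unsigned End;
};

// Stand-in for the expensive lookup. Returns the end of the range that
// contains Offset, and counts calls so the saving is observable.
inline unsigned lookupFileEnd(const std::vector<FileRange> &Files,
                              unsigned Offset, unsigned &Lookups) {
  ++Lookups;
  for (const FileRange &F : Files)
    if (Offset >= F.Begin && Offset < F.End)
      return F.End;
  return Offset; // Not found: an empty range, so no token will match.
}

// Returns how many leading tokens lie at or after the first token and
// inside the same range as the first token.
inline std::size_t partitionLength(const std::vector<FileRange> &Files,
                                   const std::vector<unsigned> &TokenOffsets,
                                   unsigned &Lookups) {
  if (TokenOffsets.empty())
    return 0;
  const unsigned BeginLoc = TokenOffsets.front();
  // One lookup for the whole partition, then range comparisons only.
  const unsigned Limit = lookupFileEnd(Files, BeginLoc, Lookups);
  std::size_t N = 0;
  while (N < TokenOffsets.size() && TokenOffsets[N] >= BeginLoc &&
         TokenOffsets[N] < Limit)
    ++N;
  return N;
}
```

With two ranges `[0, 100)` and `[100, 250)` and token offsets `10, 20, 90, 120, 130`, the first partition has three tokens and costs a single lookup.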

SemaExpr.cpp:

before: 315063 SLocEntries, FileID scans: 388230 linear, 1393437 binary, 458893 cache hits, 672299 getFileID calls
after: 313494 SLocEntries, FileID scans: 397525 linear, 1451890 binary, 176714 cache hits, 397144 getFileID calls

FindTarget.cpp:

before: 279984 SLocEntries, FileID scans: 361926 linear, 1275930 binary, 436072 cache hits, 632150 getFileID calls
after: 278426 SLocEntries, FileID scans: 371279 linear, 1333963 binary, 153705 cache hits, 356814 getFileID calls
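
The "~40% drop" and the "roughly the same" claim can be checked against the counts listed above. The helpers below are hypothetical, and treating "non-cached calls" as total calls minus cache hits is an assumption about how the counters relate, not something the summary states.

```cpp
// Percentage reduction going from Before to After.
inline double percentDrop(double Before, double After) {
  return 100.0 * (Before - After) / Before;
}

// Assumption: every getFileID call is either a cache hit or a non-cached call.
inline long uncachedCalls(long Calls, long CacheHits) {
  return Calls - CacheHits;
}
```

Under that assumption, getFileID calls drop by about 41% for SemaExpr.cpp and about 44% for FindTarget.cpp, while non-cached calls move from 213406 to 220430 and from 196078 to 203109 respectively, which fits the marginal speed change reported in the summary.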

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

hokein created this revision. Sep 30 2022, 2:26 AM

Herald added a project: Restricted Project. Sep 30 2022, 2:26 AM

hokein requested review of this revision. Sep 30 2022, 2:26 AM

hokein mentioned this in D20401: [Lexer] Don't merge macro args from different macro files. Sep 30 2022, 2:40 AM

Harbormaster completed remote builds in B189623: Diff 464188. Sep 30 2022, 2:59 AM

Some more perf data on building linux kernel (x86_64)

before: getFileID 2.4% (1.10% on getFileIDSlow)
after: getFileID 2.35% (1.05% on getFileIDSlow)

In D134942#3827216, @hokein wrote:

Some more perf data on building linux kernel (x86_64)

before: getFileID 2.4% (1.10% on getFileIDSlow)
after: getFileID 2.35% (1.05% on getFileIDSlow)

What compiler was used as the bootstrap?

For me bootstrapping w/ clang, I observed:

Before:
+    2.11%  clang-15                           [.] clang::SourceManager::getFileIDLocal
After:
+    2.01%  clang-15                           [.] clang::SourceManager::getFileIDLocal

I can test again bootstrapping with gcc.

This is such a small improvement that I'm tempted to say "look elsewhere." But it is an improvement in the hottest method, and gets rid of that awful magic constant 50. It might take a few refactorings to get this method out of the top spot of profiles.

nickdesaulniers added inline comments. Sep 30 2022, 3:18 PM

clang/lib/Lex/TokenLexer.cpp
1006–1011	Could you just check that all of the tokens in the partition have the same FileID as the first token?
	`FileID FirstFID = SM.getFileID(Partition[0].getLocation());`
	`llvm::all_of(Partition, [&SM, &FirstFID](const Token &T) { return SM.getFileID(T.getLocation()) == FirstFID; });`
	Or move the assertion into the take_while above so we iterate less?

update

In D134942#3828449, @nickdesaulniers wrote:
In D134942#3827216, @hokein wrote:

Some more perf data on building linux kernel (x86_64)

before: getFileID 2.4% (1.10% on getFileIDSlow)
after: getFileID 2.35% (1.05% on getFileIDSlow)

What compiler was used as the bootstrap?

For me bootstrapping w/ clang, I observed:
Before:
+    2.11%  clang-15                           [.] clang::SourceManager::getFileIDLocal
After:
+    2.01%  clang-15                           [.] clang::SourceManager::getFileIDLocal

That's interesting. I also used clang (v14) for bootstrapping.

What's your base revision? I'm using d32b8fdbdb4b99a5cc21604db6211fc506eb1f9b. Looking at your profile (clang-15), I think your base revision is older than mine (my profile shows clang-16).

clang/lib/Lex/TokenLexer.cpp
1006–1011	The optimization for this case is that we don't call `getFileID` at all; it is only needed in the assert sanity check, so moving the assertion into `take_while` doesn't really work. I adjusted the code to save some unnecessary `getFileID` calls in the assert.

Harbormaster completed remote builds in B190130: Diff 464905. Oct 4 2022, 1:45 AM

Thanks, I think this is worthwhile for the simpler code, better (non-lying) comments, avoiding arg-evaluation-order stuff.

-0.05% time and -0.5% SLocEntries is interesting too! Clearly not earth-shattering but the offset table idea gives us a lead to follow.
I do have one idea to skip a getFileID() that might not be cached...

clang/lib/Lex/TokenLexer.cpp
999	I think this comment can be shorter while still getting its point across.
	// Consecutive tokens not written in macros must be from the same file.
	// (Neither #include nor eof can occur inside a macro argument.)
1006	This assertion seems to belong outside the if() - it applies to both the file/macro case?
	I'd suggest asserting nonempty first and then the rest as another assertion.
	Also missing an assertion that if there are any more tokens, the next token has a different FileID.
	That said, with these assertions we should probably check we're not regressing debug performance too much!
1017	This getFileID() call is unnecessary when `All.empty() || All.front().getLocation().isFileID()`.
	Worth checking whether bailing out in that case is profitable? You could do it right at the top:
	if (All.size() == 1 || All[0].getLocation().isFileID() != All[1].getLocation().isFileID())
	  return All.take_front(1);
1021	nit: s/begin/front/ if you're using back()
1037	hmm, actually maybe just before this line would be the best place to assert that T and BeginLoc are in the same FileID, as it justifies the subtraction
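
The early bail-out suggested for line 1017 can be sketched in isolation. This is an illustration only: `Tok`, its `IsFileLoc` flag, and `isSingletonPartition` are hypothetical stand-ins for `Token` and `SourceLocation::isFileID()`, and the sketch assumes, as the suggestion does, that a file location and a macro location never share a partition.

```cpp
#include <vector>

// Hypothetical token: an offset plus a flag saying whether the location is
// a plain file location (true) or a macro location (false).
struct Tok {
  unsigned Offset;
  bool IsFileLoc;
};

// True when the partition is necessarily the first token alone, which can
// be decided without any expensive lookup.
inline bool isSingletonPartition(const std::vector<Tok> &All) {
  if (All.empty())
    return false;
  return All.size() == 1 || All[0].IsFileLoc != All[1].IsFileLoc;
}
```

When this returns true, a caller could return the first token as its own partition before doing any other work.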

nickdesaulniers added inline comments. Oct 4 2022, 1:17 PM

clang/lib/Lex/TokenLexer.cpp
1006	Right, I do all development and profiles with Release builds with assertions enabled. So avoiding getFileID in release+no_asserts builds is a win, but am somewhat bummed to not get as much a win for my rel+assert builds.
1017	Good point; I'd say "avoid getFileID" at all costs, even to readability/style.

nickdesaulniers added a reviewer: nickdesaulniers. Oct 4 2022, 1:17 PM

nickdesaulniers removed a subscriber: nickdesaulniers.

address comments

hokein added inline comments. Oct 5 2022, 1:48 AM

clang/lib/Lex/TokenLexer.cpp
1006	> this assertion seems to belong outside the if() - it applies to both the file/macro case?
	Yeah, it should apply to the `else` case. Moved it outside. It seems unnecessary to check the `else` case, since we actually do the partition based on the file id (it is somewhat like `int s = 1+1; assert(s == 2);`), but it might be good for code readability.
	> I do all development and profiles with Release builds with assertions enabled
	Hmm, I think when profiling we should probably use a release build without assertions enabled to get a more accurate result, since assertions slow everything down.
1017	We have handled the 1-element special case in the caller, so All.size() > 1 in this function. Added an assertion.
	> All[0].getLocation().isFileID() != All[1].getLocation().isFileID()
	I tried it when writing this patch. I can't find the result now, but IIRC the performance difference is negligible. I'm happy to add this special case since it won't hurt the readability too much.
1037	I agree that this is the best place to put the check-file-id assertion. Simply adding `assert(getFileID(BeginLoc) == getFileID(T.getLocation()))` can work, but it creates N-1 unnecessary getFileID calls on BeginLoc for assert builds, so it is better to avoid it. The only way I can think of is:
	#ifndef NDEBUG
	FileID BeginFID = SM.getFileID(BeginLoc);
	assert(BeginFID == getFileID(T.getLocation()));
	#endif

Harbormaster completed remote builds in B190432: Diff 465321. Oct 5 2022, 2:38 AM

sammccall accepted this revision. Oct 5 2022, 3:30 AM

sammccall added inline comments.

clang/lib/Lex/TokenLexer.cpp
1037	I assume you mean with the #if once above, but the assert outside the #if and inside the loop? That LGTM. There's also #ifdef EXPENSIVE_CHECKS, maybe not needed here though.
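
The arrangement discussed for line 1037 (compute the reference value once under `#ifndef NDEBUG`, keep the `assert` itself inside the loop) can be sketched as follows. `fileOf` and `shiftAll` are hypothetical names, and `fileOf` is a toy stand-in for an expensive lookup such as getFileID; this is not the code that landed.

```cpp
#include <cassert>
#include <vector>

// Toy stand-in for an expensive lookup: 100 offsets per "file".
inline unsigned fileOf(unsigned Offset) { return Offset / 100; }

// Shifts every offset by Delta, asserting that all offsets share one file.
inline void shiftAll(std::vector<unsigned> &Offsets, unsigned Delta) {
  if (Offsets.empty())
    return;
#ifndef NDEBUG
  // Computed once, and only in builds where assert() is active.
  const unsigned BeginFile = fileOf(Offsets.front());
#endif
  for (unsigned &O : Offsets) {
    // With NDEBUG defined this expands to nothing, so BeginFile is never
    // referenced and the loop does no lookups at all.
    assert(fileOf(O) == BeginFile && "all offsets must share one file");
    O += Delta;
  }
}
```

The lookup on the first element happens once per call instead of once per iteration in assert builds, and not at all in release builds without assertions.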

This revision is now accepted and ready to land. Oct 5 2022, 3:30 AM

nickdesaulniers accepted this revision. Oct 6 2022, 11:28 AM

nickdesaulniers added inline comments.

clang/lib/Lex/TokenLexer.cpp
1019	Please consider moving this to `#ifdef EXPENSIVE_CHECKS` rather than asserts.

This revision was landed with ongoing or failed builds. Oct 7 2022, 12:16 AM

Closed by commit rG74e4f778cf16: [Lex] Simplify and cleanup the updateConsecutiveMacroArgTokens implementation. (authored by hokein).

This revision was automatically updated to reflect the committed changes.

hokein marked an inline comment as done.

hokein added a commit: rG74e4f778cf16: [Lex] Simplify and cleanup the updateConsecutiveMacroArgTokens implementation..

hokein mentioned this in D136539: [Lex] Bring back the magic number 50 in updateConsecutiveMacroArgTokens. Oct 22 2022, 1:46 PM

alexfh added a reverting change: rGe86161076e4a: Revert "[TokenLexer][NFC] Rename the InstLoc to ExpandLoc". Oct 23 2022, 2:14 PM

alexfh added a reverting change: rGe7656daea872: Revert "[Lex] Simplify and cleanup the updateConsecutiveMacroArgTokens….

hokein mentioned this in rG11c1d8b7fd82: [Lex] Bring back the magic number 50 in updateConsecutiveMacroArgTokens. Oct 26 2022, 3:07 AM

Revision Contents

Path: clang/lib/Lex/TokenLexer.cpp
Size: 88 lines

Diff 465983

clang/lib/Lex/TokenLexer.cpp

(19 lines not shown)
 #include "clang/Lex/LexDiagnostic.h"
 #include "clang/Lex/Lexer.h"
 #include "clang/Lex/MacroArgs.h"
 #include "clang/Lex/MacroInfo.h"
 #include "clang/Lex/Preprocessor.h"
 #include "clang/Lex/Token.h"
 #include "clang/Lex/VariadicMacroSupport.h"
 #include "llvm/ADT/ArrayRef.h"
+#include "llvm/ADT/STLExtras.h"
 #include "llvm/ADT/SmallString.h"
 #include "llvm/ADT/SmallVector.h"
 #include "llvm/ADT/iterator_range.h"
 #include <cassert>
 #include <cstring>

 using namespace clang;

(946 lines not shown)
 /// for the 'foo', '==', 'bar' tokens will point inside that chunk.
 ///
 /// \arg begin_tokens will be updated to a position past all the found
 /// consecutive tokens.
 static void updateConsecutiveMacroArgTokens(SourceManager &SM,
                                             SourceLocation InstLoc,
                                             Token *&begin_tokens,
                                             Token * end_tokens) {
-  assert(begin_tokens < end_tokens);
-
-  SourceLocation FirstLoc = begin_tokens->getLocation();
-  SourceLocation CurLoc = FirstLoc;
+  assert(begin_tokens + 1 < end_tokens);
+  SourceLocation BeginLoc = begin_tokens->getLocation();
+  llvm::MutableArrayRef<Token> All(begin_tokens, end_tokens);
+  llvm::MutableArrayRef<Token> Partition;

-  // Compare the source location offset of tokens and group together tokens that
-  // are close, even if their locations point to different FileIDs. e.g.
-  //
-  // |bar | foo | cake | (3 tokens from 3 consecutive FileIDs)
-  // ^ ^
-  // |bar foo cake| (one SLocEntry chunk for all tokens)
-  //
-  // we can perform this "merge" since the token's spelling location depends
-  // on the relative offset.
-
-  Token *NextTok = begin_tokens + 1;
-  for (; NextTok < end_tokens; ++NextTok) {
-    SourceLocation NextLoc = NextTok->getLocation();
-    if (CurLoc.isFileID() != NextLoc.isFileID())
-      break; // Token from different kind of FileID.
-
-    SourceLocation::IntTy RelOffs;
-    if (!SM.isInSameSLocAddrSpace(CurLoc, NextLoc, &RelOffs))
-      break; // Token from different local/loaded location.
-    // Check that token is not before the previous token or more than 50
-    // "characters" away.
-    if (RelOffs < 0 || RelOffs > 50)
-      break;
-
-    if (CurLoc.isMacroID() && !SM.isWrittenInSameFile(CurLoc, NextLoc))
-      break; // Token from a different macro.
-
-    CurLoc = NextLoc;
-  }
+  // Partition the tokens by their FileID.
+  // This is a hot function, and calling getFileID can be expensive, the
+  // implementation is optimized by reducing the number of getFileID.
+  if (BeginLoc.isFileID()) {
+    // Consecutive tokens not written in macros must be from the same file.
+    // (Neither #include nor eof can occur inside a macro argument.)
+    Partition = All.take_while([&](const Token &T) {
+      return T.getLocation().isFileID();
+    });
+  } else {
+    // Call getFileID once to calculate the bounds, and use the cheaper
+    // sourcelocation-against-bounds comparison.
+    FileID BeginFID = SM.getFileID(BeginLoc);
+    SourceLocation Limit =
+        SM.getComposedLoc(BeginFID, SM.getFileIDSize(BeginFID));
+    Partition = All.take_while([&](const Token &T) {
+      return T.getLocation() >= BeginLoc && T.getLocation() < Limit;
+    });
+  }
+  assert(!Partition.empty());

   // For the consecutive tokens, find the length of the SLocEntry to contain
   // all of them.
-  Token &LastConsecutiveTok = *(NextTok-1);
-  SourceLocation::IntTy LastRelOffs = 0;
-  SM.isInSameSLocAddrSpace(FirstLoc, LastConsecutiveTok.getLocation(),
-                           &LastRelOffs);
   SourceLocation::UIntTy FullLength =
-      LastRelOffs + LastConsecutiveTok.getLength();
-
+      Partition.back().getEndLoc().getRawEncoding() -
+      Partition.front().getLocation().getRawEncoding();
   // Create a macro expansion SLocEntry that will "contain" all of the tokens.
   SourceLocation Expansion =
-      SM.createMacroArgExpansionLoc(FirstLoc, InstLoc,FullLength);
+      SM.createMacroArgExpansionLoc(BeginLoc, InstLoc, FullLength);

+#ifdef EXPENSIVE_CHECKS
+  assert(llvm::all_of(Partition.drop_front(),
+                      [&SM, ID = SM.getFileID(Partition.front().getLocation())](
+                          const Token &T) {
+                        return ID == SM.getFileID(T.getLocation());
+                      }) &&
+         "Must have the same FIleID!");
+#endif
   // Change the location of the tokens from the spelling location to the new
   // expanded location.
-  for (; begin_tokens < NextTok; ++begin_tokens) {
-    Token &Tok = *begin_tokens;
-    SourceLocation::IntTy RelOffs = 0;
-    SM.isInSameSLocAddrSpace(FirstLoc, Tok.getLocation(), &RelOffs);
-    Tok.setLocation(Expansion.getLocWithOffset(RelOffs));
-  }
+  for (Token& T : Partition) {
+    SourceLocation::IntTy RelativeOffset =
+        T.getLocation().getRawEncoding() - BeginLoc.getRawEncoding();
+    T.setLocation(Expansion.getLocWithOffset(RelativeOffset));
+  }
+  begin_tokens = &Partition.back() + 1;
 }

 /// Creates SLocEntries and updates the locations of macro argument
 /// tokens to their new expanded locations.
 ///
 /// \param ArgIdSpellLoc the location of the macro argument id inside the macro
 /// definition.
 void TokenLexer::updateLocForMacroArgTokens(SourceLocation ArgIdSpellLoc,
(25 lines not shown)
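
The last step of the new function, computing one length that covers the whole partition and then moving each token by its offset relative to the first token, can be sketched with plain integers. `Tok`, `remapPartition`, and `ExpansionBase` are hypothetical names; the real code works on `SourceLocation` raw encodings and obtains the base from `createMacroArgExpansionLoc`.

```cpp
#include <vector>

// Hypothetical token: a start offset and a length in characters.
struct Tok {
  unsigned Loc;
  unsigned Length;
};

// Moves every token of a non-empty partition into a fresh range starting at
// ExpansionBase, preserving relative offsets. Returns the length needed to
// contain the whole partition (end of last token minus start of first).
inline unsigned remapPartition(std::vector<Tok> &Partition,
                               unsigned ExpansionBase) {
  if (Partition.empty())
    return 0;
  const unsigned BeginLoc = Partition.front().Loc;
  const unsigned FullLength =
      Partition.back().Loc + Partition.back().Length - BeginLoc;
  for (Tok &T : Partition)
    T.Loc = ExpansionBase + (T.Loc - BeginLoc);
  return FullLength;
}
```

The subtraction is only meaningful when all tokens lie in the same file and at or after the first token, which is the property the partition step and the assertions discussed in the review are meant to guarantee.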