This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang-tools-extra/clangd/unittests/
-
clangd/
-
unittests/
-
SelectionTests.cpp
-
clang/
-
include/clang/Basic/
-
clang/
-
Basic/
-
SourceManager.h
-
lib/Basic/
-
Basic/
1/2
SourceManager.cpp
-
unittests/Basic/
-
Basic/
-
SourceManagerTest.cpp

Differential D134685

Fix SourceManager::isBeforeInTranslationUnit bug with token-pasting
ClosedPublic

Authored by sammccall on Sep 26 2022, 4:59 PM.

Download Raw Diff

Details

Reviewers

ilya-biryukov
hokein

Commits

rG41b51007e637: Fix SourceManager::isBeforeInTranslationUnit bug with token-pasting

Summary

isBeforeInTranslationUnit compares SourceLocations across FileIDs by
mapping them onto a common ancestor file, following include/expansion edges.

It is possible to get a tie in the common ancestor, because multiple
"chunks" of a macro arg will expand to the same macro param token in the body:

#define ID(X) X
#define TWO 2
ID(1 TWO)

Here two FileIDs both expand into X in ID's expansion:

one containing 1 and spelled on line 3
one containing 2 and spelled by the macro expansion of TWO

isBeforeInTranslationUnit breaks this tie by comparing the two FileIDs:
the one "on the left" is always created first and is numerically smaller.
This seems correct so far.

Prior to this patch it also takes a shortcut (unclear if intentionally).
Instead of comparing the two FileIDs that directly expand to the same location,
it compares the original FileIDs being compared. These may not be the
same if there are multiple macro expansions in between.
This *almost* always yields the right answer, because macro expansion
yields "trees" of FileIDs allocated in a contiguous range: when comparing tree A
to tree B, it doesn't matter what representative you pick.

However, the splitting of >> tokens is modeled as macro expansion (as if
the first '>' was a macro that expands to a '>' spelled a scratch buffer).
This splitting occurs retroactively when parsing, so the FileID allocated is
larger than expected if it were a real macro expansion performed during lexing.
As a result, macro tree A can be on the left of tree B, and yet contain
a token-split FileID whose numeric value is *greator* than those in B.
In this case the tiebreak gives the wrong answer.

Concretely:

#define ID(X) X
template <typename> class S{};
ID(
  ID(S<S<int>> x);
  int y;
)

Given Greater = (typeloc of S<int>).getEndLoc();
      Y       = (decl of y).getLocation();
isBeforeInTranslationUnit(Greater, Y) should return true, but returns false.

Here the common FileID of (Greater, Y) is the body of the outer ID
expansion, and they both expand to X within it.
With the current tiebreak rules, we compare the FileID of Greater (a split)
to the FileID of Y (a macro arg expansion into X of the outer ID).
The former is larger because the token split occurred relatively late.

This patch fixes the issue by removing the shortcut. It tracks the immediate
FileIDs used to reach the common file, and uses these IDs to break ties.
In the example, we now compare the macro arg expansion of the inner ID()
to the macro arg expansion of Y, and find that it is smaller.

This requires some changes to the InBeforeInTUCacheEntry (sic).
We store a little more data so it's probably slightly slower.
It was difficult to resist more invasive changes:

performance: the sizing is very suspicious, and once the cache "fills up" we're thrashing a single entry
API: the class seems to be needlessly complicated

However I tried to avoid mixing these with subtle behavior changes, and
will send a followup instead.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

sammccall created this revision.Sep 26 2022, 4:59 PM

Herald added a project: Restricted Project. · View Herald TranscriptSep 26 2022, 4:59 PM

Herald added subscribers: kadircet, arphaman. · View Herald Transcript

sammccall requested review of this revision.Sep 26 2022, 4:59 PM

Herald added projects: Restricted Project, Restricted Project. · View Herald TranscriptSep 26 2022, 4:59 PM

Herald added a subscriber: cfe-commits. · View Herald Transcript

Harbormaster completed remote builds in B188813: Diff 463053.Sep 26 2022, 5:39 PM

However I tried to avoid mixing these with subtle behavior changes, and will send a followup instead.

D134694 if you're interested. I guess I should try to get performance measures though...

Thanks for digging into the rabbit role and the great analysis. It looks good from my side.

Re the test case ID(I TWO 3) we discussed yesterday, I verified that there is no issue. (we don't merge the 1 2 3 into a single SLOCEntry, each one have a dedicated FileID, and FileID(2) < FileID(3)).

However I tried to avoid mixing these with subtle behavior changes, and will send a followup instead.

D134694 if you're interested. I guess I should try to get performance measures though...

While doing the review, I agree that this part of code is unnecessary complicated, I think doing cleanup is probably a good idea.

clang/lib/Basic/SourceManager.cpp
2108–2111	nit: I found the first/second usage really hurts the code readability here (using the struct `Entry` int all places is probably better, but this requires some changes to the existing interface, no required action here).
2126	maybe add an assertion for "local and load FileIDs are never mixed".

This revision is now accepted and ready to land.Sep 29 2022, 12:53 AM

This revision was landed with ongoing or failed builds.Oct 5 2022, 9:29 AM

Closed by commit rG41b51007e637: Fix SourceManager::isBeforeInTranslationUnit bug with token-pasting (authored by sammccall). · Explain Why

This revision was automatically updated to reflect the committed changes.

sammccall marked an inline comment as done.

sammccall added a commit: rG41b51007e637: Fix SourceManager::isBeforeInTranslationUnit bug with token-pasting.

Revision Contents

Path

Size

clang-tools-extra/

clangd/

unittests/

SelectionTests.cpp

18 lines

clang/

include/

clang/

Basic/

SourceManager.h

34 lines

lib/

Basic/

SourceManager.cpp

77 lines

unittests/

Basic/

SourceManagerTest.cpp

77 lines

Diff 465437

clang-tools-extra/clangd/unittests/SelectionTests.cpp

Show First 20 Lines • Show All 700 Lines • ▼ Show 20 Lines	TEST(SelectionTest, MacroArgExpansion) {
Case = R"cpp(		Case = R"cpp(
void die(const char*);		void die(const char*);
#define assert(x) (x ? (void)0 : die(#x))		#define assert(x) (x ? (void)0 : die(#x))
void foo() { assert(^42); }		void foo() { assert(^42); }
)cpp";		)cpp";
Test = Annotations(Case);		Test = Annotations(Case);
AST = TestTU::withCode(Test.code()).build();		AST = TestTU::withCode(Test.code()).build();
T = makeSelectionTree(Case, AST);		T = makeSelectionTree(Case, AST);

EXPECT_EQ("IntegerLiteral", T.commonAncestor()->kind());		EXPECT_EQ("IntegerLiteral", T.commonAncestor()->kind());

		// Reduced from private bug involving RETURN_IF_ERROR.
		// Due to >>-splitting and a bug in isBeforeInTranslationUnit, the inner
		// S<int> would claim way too many tokens.
		Case = R"cpp(
		#define ID(x) x
		template <typename T> class S {};
		ID(
		ID(S<S<int>> x);
		int ^y;
		)
		)cpp";
		Test = Annotations(Case);
		AST = TestTU::withCode(Test.code()).build();
		T = makeSelectionTree(Case, AST);
		// not TemplateSpecializationTypeLoc!
		EXPECT_EQ("VarDecl", T.commonAncestor()->kind());
}		}

TEST(SelectionTest, Implicit) {		TEST(SelectionTest, Implicit) {
const char *Test = R"cpp(		const char *Test = R"cpp(
struct S { S(const char*); };		struct S { S(const char*); };
int f(S);		int f(S);
int x = f("^");		int x = f("^");
)cpp";		)cpp";
▲ Show 20 Lines • Show All 93 Lines • Show Last 20 Lines

clang/include/clang/Basic/SourceManager.h

	Show First 20 Lines • Show All 536 Lines • ▼ Show 20 Lines
	/// The cache structure is complex enough to be worth breaking out of			/// The cache structure is complex enough to be worth breaking out of
	/// SourceManager.			/// SourceManager.
	class InBeforeInTUCacheEntry {			class InBeforeInTUCacheEntry {
	/// The FileID's of the cached query.			/// The FileID's of the cached query.
	///			///
	/// If these match up with a subsequent query, the result can be reused.			/// If these match up with a subsequent query, the result can be reused.
	FileID LQueryFID, RQueryFID;			FileID LQueryFID, RQueryFID;

	/// True if LQueryFID was created before RQueryFID.			/// The relative order of FileIDs that the CommonFID immediately includes.
	///			///
	/// This is used to compare macro expansion locations.			/// This is used to compare macro expansion locations.
	bool IsLQFIDBeforeRQFID;			bool LChildBeforeRChild;

	/// The file found in common between the two \#include traces, i.e.,			/// The file found in common between the two \#include traces, i.e.,
	/// the nearest common ancestor of the \#include tree.			/// the nearest common ancestor of the \#include tree.
	FileID CommonFID;			FileID CommonFID;

	/// The offset of the previous query in CommonFID.			/// The offset of the previous query in CommonFID.
	///			///
	/// Usually, this represents the location of the \#include for QueryFID, but			/// Usually, this represents the location of the \#include for QueryFID, but
	/// if LQueryFID is a parent of RQueryFID (or vice versa) then these can be a			/// if LQueryFID is a parent of RQueryFID (or vice versa) then these can be a
	/// random token in the parent.			/// random token in the parent.
	unsigned LCommonOffset, RCommonOffset;			unsigned LCommonOffset, RCommonOffset;

	public:			public:
				InBeforeInTUCacheEntry() = default;
				InBeforeInTUCacheEntry(FileID L, FileID R) : LQueryFID(L), RQueryFID(R) {
				assert(L != R);
				}

	/// Return true if the currently cached values match up with			/// Return true if the currently cached values match up with
	/// the specified LHS/RHS query.			/// the specified LHS/RHS query.
	///			///
	/// If not, we can't use the cache.			/// If not, we can't use the cache.
	bool isCacheValid(FileID LHS, FileID RHS) const {			bool isCacheValid() const {
	return LQueryFID == LHS && RQueryFID == RHS;			return CommonFID.isValid();
	}			}

	/// If the cache is valid, compute the result given the			/// If the cache is valid, compute the result given the
	/// specified offsets in the LHS/RHS FileID's.			/// specified offsets in the LHS/RHS FileID's.
	bool getCachedResult(unsigned LOffset, unsigned ROffset) const {			bool getCachedResult(unsigned LOffset, unsigned ROffset) const {
	// If one of the query files is the common file, use the offset. Otherwise,			// If one of the query files is the common file, use the offset. Otherwise,
	// use the #include loc in the common file.			// use the #include loc in the common file.
	if (LQueryFID != CommonFID) LOffset = LCommonOffset;			if (LQueryFID != CommonFID) LOffset = LCommonOffset;
	if (RQueryFID != CommonFID) ROffset = RCommonOffset;			if (RQueryFID != CommonFID) ROffset = RCommonOffset;

	// It is common for multiple macro expansions to be "included" from the same			// It is common for multiple macro expansions to be "included" from the same
	// location (expansion location), in which case use the order of the FileIDs			// location (expansion location), in which case use the order of the FileIDs
	// to determine which came first. This will also take care the case where			// to determine which came first. This will also take care the case where
	// one of the locations points at the inclusion/expansion point of the other			// one of the locations points at the inclusion/expansion point of the other
	// in which case its FileID will come before the other.			// in which case its FileID will come before the other.
	if (LOffset == ROffset)			if (LOffset == ROffset)
	return IsLQFIDBeforeRQFID;			return LChildBeforeRChild;

	return LOffset < ROffset;			return LOffset < ROffset;
	}			}

	/// Set up a new query.			/// Set up a new query.
	void setQueryFIDs(FileID LHS, FileID RHS, bool isLFIDBeforeRFID) {			/// If it matches the old query, we can keep the cached answer.
				void setQueryFIDs(FileID LHS, FileID RHS) {
	assert(LHS != RHS);			assert(LHS != RHS);
				if (LQueryFID != LHS \|\| RQueryFID != RHS) {
	LQueryFID = LHS;			LQueryFID = LHS;
	RQueryFID = RHS;			RQueryFID = RHS;
	IsLQFIDBeforeRQFID = isLFIDBeforeRFID;			CommonFID = FileID();
	}			}

	void clear() {
	LQueryFID = RQueryFID = FileID();
	IsLQFIDBeforeRQFID = false;
	}			}

	void setCommonLoc(FileID commonFID, unsigned lCommonOffset,			void setCommonLoc(FileID commonFID, unsigned lCommonOffset,
	unsigned rCommonOffset) {			unsigned rCommonOffset, bool LParentBeforeRParent) {
	CommonFID = commonFID;			CommonFID = commonFID;
	LCommonOffset = lCommonOffset;			LCommonOffset = lCommonOffset;
	RCommonOffset = rCommonOffset;			RCommonOffset = rCommonOffset;
				LChildBeforeRChild = LParentBeforeRParent;
	}			}
	};			};

	/// The stack used when building modules on demand, which is used			/// The stack used when building modules on demand, which is used
	/// to provide a link between the source managers of the different compiler			/// to provide a link between the source managers of the different compiler
	/// instances.			/// instances.
	using ModuleBuildStack = ArrayRef<std::pair<std::string, FullSourceLoc>>;			using ModuleBuildStack = ArrayRef<std::pair<std::string, FullSourceLoc>>;

	▲ Show 20 Lines • Show All 1,331 Lines • Show Last 20 Lines

clang/lib/Basic/SourceManager.cpp

Show First 20 Lines • Show All 1,986 Lines • ▼ Show 20 Lines

/// Return the cache entry for comparing the given file IDs		/// Return the cache entry for comparing the given file IDs
/// for isBeforeInTranslationUnit.		/// for isBeforeInTranslationUnit.
InBeforeInTUCacheEntry &SourceManager::getInBeforeInTUCache(FileID LFID,		InBeforeInTUCacheEntry &SourceManager::getInBeforeInTUCache(FileID LFID,
FileID RFID) const {		FileID RFID) const {
// This is a magic number for limiting the cache size. It was experimentally		// This is a magic number for limiting the cache size. It was experimentally
// derived from a small Objective-C project (where the cache filled		// derived from a small Objective-C project (where the cache filled
// out to ~250 items). We can make it larger if necessary.		// out to ~250 items). We can make it larger if necessary.
		// FIXME: this is almost certainly full these days. Use an LRU cache?
enum { MagicCacheSize = 300 };		enum { MagicCacheSize = 300 };
IsBeforeInTUCacheKey Key(LFID, RFID);		IsBeforeInTUCacheKey Key(LFID, RFID);

// If the cache size isn't too large, do a lookup and if necessary default		// If the cache size isn't too large, do a lookup and if necessary default
// construct an entry. We can then return it to the caller for direct		// construct an entry. We can then return it to the caller for direct
// use. When they update the value, the cache will get automatically		// use. When they update the value, the cache will get automatically
// updated as well.		// updated as well.
if (IBTUCache.size() < MagicCacheSize)		if (IBTUCache.size() < MagicCacheSize)
return IBTUCache[Key];		return IBTUCache.try_emplace(Key, LFID, RFID).first->second;

// Otherwise, do a lookup that will not construct a new value.		// Otherwise, do a lookup that will not construct a new value.
InBeforeInTUCache::iterator I = IBTUCache.find(Key);		InBeforeInTUCache::iterator I = IBTUCache.find(Key);
if (I != IBTUCache.end())		if (I != IBTUCache.end())
return I->second;		return I->second;

// Fall back to the overflow value.		// Fall back to the overflow value.
		IBTUCacheOverflow.setQueryFIDs(LFID, RFID);
return IBTUCacheOverflow;		return IBTUCacheOverflow;
}		}

/// Determines the order of 2 source locations in the translation unit.		/// Determines the order of 2 source locations in the translation unit.
///		///
/// \returns true if LHS source location comes before RHS, false otherwise.		/// \returns true if LHS source location comes before RHS, false otherwise.
bool SourceManager::isBeforeInTranslationUnit(SourceLocation LHS,		bool SourceManager::isBeforeInTranslationUnit(SourceLocation LHS,
SourceLocation RHS) const {		SourceLocation RHS) const {
▲ Show 20 Lines • Show All 58 Lines • ▼ Show 20 Lines	std::pair<bool, bool> SourceManager::isInTheSameTranslationUnit(

// If we are comparing a source location with multiple locations in the same		// If we are comparing a source location with multiple locations in the same
// file, we get a big win by caching the result.		// file, we get a big win by caching the result.
InBeforeInTUCacheEntry &IsBeforeInTUCache =		InBeforeInTUCacheEntry &IsBeforeInTUCache =
getInBeforeInTUCache(LOffs.first, ROffs.first);		getInBeforeInTUCache(LOffs.first, ROffs.first);

// If we are comparing a source location with multiple locations in the same		// If we are comparing a source location with multiple locations in the same
// file, we get a big win by caching the result.		// file, we get a big win by caching the result.
if (IsBeforeInTUCache.isCacheValid(LOffs.first, ROffs.first))		if (IsBeforeInTUCache.isCacheValid())
return std::make_pair(		return std::make_pair(
true, IsBeforeInTUCache.getCachedResult(LOffs.second, ROffs.second));		true, IsBeforeInTUCache.getCachedResult(LOffs.second, ROffs.second));

// Okay, we missed in the cache, start updating the cache for this query.		// Okay, we missed in the cache, we'll compute the answer and populate it.
IsBeforeInTUCache.setQueryFIDs(LOffs.first, ROffs.first,
/isLFIDBeforeRFID=/LOffs.first.ID < ROffs.first.ID);

// We need to find the common ancestor. The only way of doing this is to		// We need to find the common ancestor. The only way of doing this is to
// build the complete include chain for one and then walking up the chain		// build the complete include chain for one and then walking up the chain
// of the other looking for a match.		// of the other looking for a match.
// We use a map from FileID to Offset to store the chain. Easier than writing
// a custom set hash info that only depends on the first part of a pair.		// A location within a FileID on the path up from LOffs to the main file.
using LocSet = llvm::SmallDenseMap<FileID, unsigned, 16>;		struct Entry {
LocSet LChain;		unsigned Offset;
		FileID ParentFID; // Used for breaking ties.
		};
		llvm::SmallDenseMap<FileID, Entry, 16> LChain;

		FileID Parent;
do {		do {
LChain.insert(LOffs);		LChain.try_emplace(LOffs.first, Entry{LOffs.second, Parent});
// We catch the case where LOffs is in a file included by ROffs and		// We catch the case where LOffs is in a file included by ROffs and
// quit early. The other way round unfortunately remains suboptimal.		// quit early. The other way round unfortunately remains suboptimal.
} while (LOffs.first != ROffs.first && !MoveUpIncludeHierarchy(LOffs, *this));		if (LOffs.first == ROffs.first)
LocSet::iterator I;		break;
while((I = LChain.find(ROffs.first)) == LChain.end()) {		Parent = LOffs.first;
if (MoveUpIncludeHierarchy(ROffs, *this))		} while (!MoveUpIncludeHierarchy(LOffs, *this));
		hokeinUnsubmitted Not Done Reply Inline Actions nit: I found the first/second usage really hurts the code readability here (using the struct `Entry` int all places is probably better, but this requires some changes to the existing interface, no required action here). hokein: nit: I found the first/second usage really hurts the code readability here (using the struct…
break; // Met at topmost file.
}		Parent = FileID();
if (I != LChain.end())		do {
LOffs = *I;		auto I = LChain.find(ROffs.first);
		if (I != LChain.end()) {
// If we exited because we found a nearest common ancestor, compare the		// Compare the locations within the common file and cache them.
// locations within the common file and cache them.		LOffs.first = I->first;
if (LOffs.first == ROffs.first) {		LOffs.second = I->second.Offset;
IsBeforeInTUCache.setCommonLoc(LOffs.first, LOffs.second, ROffs.second);		// The relative order of LParent and RParent is a tiebreaker when
		// - locs expand to the same location (occurs in macro arg expansion)
		// - one loc is a parent of the other (we consider the parent as "first")
		// For the parent to be first, the invalid file ID must compare smaller.
		// However loaded FileIDs are <0, so we perform unsigned comparison!
		// This changes the relative order of local vs loaded FileIDs, but it
		// doesn't matter as these are never mixed in macro expansion.
		hokeinUnsubmitted Done Reply Inline Actions maybe add an assertion for "local and load FileIDs are never mixed". hokein: maybe add an assertion for "local and load FileIDs are never mixed".
		unsigned LParent = I->second.ParentFID.ID;
		unsigned RParent = Parent.ID;
		assert((LOffs.second != ROffs.second) \|\| (LParent == 0 \|\| RParent == 0) \|\|
		isInSameSLocAddrSpace(getComposedLoc(I->second.ParentFID, 0),
		getComposedLoc(Parent, 0), nullptr) &&
		"Mixed local/loaded FileIDs with same include location?");
		IsBeforeInTUCache.setCommonLoc(LOffs.first, LOffs.second, ROffs.second,
		LParent < RParent);
return std::make_pair(		return std::make_pair(
true, IsBeforeInTUCache.getCachedResult(LOffs.second, ROffs.second));		true, IsBeforeInTUCache.getCachedResult(LOffs.second, ROffs.second));
}		}
// Clear the lookup cache, it depends on a common location.		Parent = ROffs.first;
IsBeforeInTUCache.clear();		} while (!MoveUpIncludeHierarchy(ROffs, *this));

		// If we found no match, we're not in the same TU.
		// We don't cache this, but it is rare.
return std::make_pair(false, false);		return std::make_pair(false, false);
}		}

void SourceManager::PrintStats() const {		void SourceManager::PrintStats() const {
llvm::errs() << "\n*** Source Manager Stats:\n";		llvm::errs() << "\n*** Source Manager Stats:\n";
llvm::errs() << FileInfos.size() << " files mapped, " << MemBufferInfos.size()		llvm::errs() << FileInfos.size() << " files mapped, " << MemBufferInfos.size()
<< " mem buffers mapped.\n";		<< " mem buffers mapped.\n";
llvm::errs() << LocalSLocEntryTable.size() << " local SLocEntry's allocated ("		llvm::errs() << LocalSLocEntryTable.size() << " local SLocEntry's allocated ("
▲ Show 20 Lines • Show All 139 Lines • Show Last 20 Lines

clang/unittests/Basic/SourceManagerTest.cpp

Show First 20 Lines • Show All 165 Lines • ▼ Show 20 Lines	TEST_F(SourceManagerTest, isBeforeInTranslationUnit) {
ASSERT_EQ(")", PP.getSpelling(macroExpEndLoc, str));		ASSERT_EQ(")", PP.getSpelling(macroExpEndLoc, str));

EXPECT_TRUE(SourceMgr.isBeforeInTranslationUnit(lsqrLoc, idLoc));		EXPECT_TRUE(SourceMgr.isBeforeInTranslationUnit(lsqrLoc, idLoc));
EXPECT_TRUE(SourceMgr.isBeforeInTranslationUnit(idLoc, rsqrLoc));		EXPECT_TRUE(SourceMgr.isBeforeInTranslationUnit(idLoc, rsqrLoc));
EXPECT_TRUE(SourceMgr.isBeforeInTranslationUnit(macroExpStartLoc, idLoc));		EXPECT_TRUE(SourceMgr.isBeforeInTranslationUnit(macroExpStartLoc, idLoc));
EXPECT_TRUE(SourceMgr.isBeforeInTranslationUnit(idLoc, macroExpEndLoc));		EXPECT_TRUE(SourceMgr.isBeforeInTranslationUnit(idLoc, macroExpEndLoc));
}		}

		TEST_F(SourceManagerTest, isBeforeInTranslationUnitWithTokenSplit) {
		const char *main = R"cpp(
		#define ID(X) X
		ID(
		ID(a >> b)
		c
		)
		)cpp";

		SourceMgr.setMainFileID(
		SourceMgr.createFileID(llvm::MemoryBuffer::getMemBuffer(main)));

		TrivialModuleLoader ModLoader;
		HeaderSearch HeaderInfo(std::make_shared<HeaderSearchOptions>(), SourceMgr,
		Diags, LangOpts, &*Target);
		Preprocessor PP(std::make_shared<PreprocessorOptions>(), Diags, LangOpts,
		SourceMgr, HeaderInfo, ModLoader,
		/IILookup =/nullptr,
		/OwnsHeaderSearch =/false);
		PP.Initialize(*Target);
		PP.EnterMainSourceFile();
		llvm::SmallString<8> Scratch;

		std::vector<Token> toks;
		while (1) {
		Token tok;
		PP.Lex(tok);
		if (tok.is(tok::eof))
		break;
		toks.push_back(tok);
		}

		// Make sure we got the tokens that we expected.
		ASSERT_EQ(4U, toks.size()) << "a >> b c";
		// Sanity check their order.
		for (unsigned I = 0; I < toks.size() - 1; ++I) {
		EXPECT_TRUE(SourceMgr.isBeforeInTranslationUnit(toks[I].getLocation(),
		toks[I + 1].getLocation()));
		EXPECT_FALSE(SourceMgr.isBeforeInTranslationUnit(toks[I + 1].getLocation(),
		toks[I].getLocation()));
		}

		// Split the >> into two > tokens, as happens when parsing nested templates.
		unsigned RightShiftIndex = 1;
		SourceLocation RightShift = toks[RightShiftIndex].getLocation();
		EXPECT_EQ(">>", Lexer::getSpelling(SourceMgr.getSpellingLoc(RightShift),
		Scratch, SourceMgr, LangOpts));
		SourceLocation Greater1 = PP.SplitToken(RightShift, /Length=/1);
		SourceLocation Greater2 = RightShift.getLocWithOffset(1);
		EXPECT_TRUE(Greater1.isMacroID());
		EXPECT_EQ(">", Lexer::getSpelling(SourceMgr.getSpellingLoc(Greater1), Scratch,
		SourceMgr, LangOpts));
		EXPECT_EQ(">", Lexer::getSpelling(SourceMgr.getSpellingLoc(Greater2), Scratch,
		SourceMgr, LangOpts));
		EXPECT_EQ(SourceMgr.getImmediateExpansionRange(Greater1).getBegin(),
		RightShift);

		for (unsigned I = 0; I < toks.size(); ++I) {
		SCOPED_TRACE("Token " + std::to_string(I));
		// Right-shift is the parent of Greater1, so it compares less.
		EXPECT_EQ(
		SourceMgr.isBeforeInTranslationUnit(toks[I].getLocation(), Greater1),
		I <= RightShiftIndex);
		EXPECT_EQ(
		SourceMgr.isBeforeInTranslationUnit(toks[I].getLocation(), Greater2),
		I <= RightShiftIndex);
		EXPECT_EQ(
		SourceMgr.isBeforeInTranslationUnit(Greater1, toks[I].getLocation()),
		RightShiftIndex < I);
		EXPECT_EQ(
		SourceMgr.isBeforeInTranslationUnit(Greater2, toks[I].getLocation()),
		RightShiftIndex < I);
		}
		EXPECT_TRUE(SourceMgr.isBeforeInTranslationUnit(Greater1, Greater2));
		EXPECT_FALSE(SourceMgr.isBeforeInTranslationUnit(Greater2, Greater1));
		}

TEST_F(SourceManagerTest, getColumnNumber) {		TEST_F(SourceManagerTest, getColumnNumber) {
const char *Source =		const char *Source =
"int x;\n"		"int x;\n"
"int y;";		"int y;";

std::unique_ptr<llvm::MemoryBuffer> Buf =		std::unique_ptr<llvm::MemoryBuffer> Buf =
llvm::MemoryBuffer::getMemBuffer(Source);		llvm::MemoryBuffer::getMemBuffer(Source);
FileID MainFileID = SourceMgr.createFileID(std::move(Buf));		FileID MainFileID = SourceMgr.createFileID(std::move(Buf));
▲ Show 20 Lines • Show All 405 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

Fix SourceManager::isBeforeInTranslationUnit bug with token-pastingClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 465437

clang-tools-extra/clangd/unittests/SelectionTests.cpp

clang/include/clang/Basic/SourceManager.h

clang/lib/Basic/SourceManager.cpp

clang/unittests/Basic/SourceManagerTest.cpp

Fix SourceManager::isBeforeInTranslationUnit bug with token-pasting
ClosedPublic