This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang-tools-extra/trunk/clangd/
-
trunk/
-
clangd/
-
Selection.h
-
Selection.cpp
-
refactor/
-
Tweak.cpp
-
unittests/
-
SelectionTests.cpp

Differential D65486

[clangd] Ignore semicolons, whitespace, and comments in SelectionTree.
ClosedPublic

Authored by sammccall on Jul 30 2019, 3:50 PM.

Download Raw Diff

Details

Reviewers

SureYeaah
kadircet

Commits

rGabe3c29a8c83: [clangd] Ignore semicolons, whitespace, and comments in SelectionTree.
rCTE367453: [clangd] Ignore semicolons, whitespace, and comments in SelectionTree.
rL367453: [clangd] Ignore semicolons, whitespace, and comments in SelectionTree.

Summary

Whitespace and comments are a clear bugfix: selecting some
comments/space near a statement doesn't mean you're selecting the
surrounding block.

Semicolons are less obvious, but for similar reasons: these tokens
aren't actually claimed by any AST node (usually), so an AST-based model
like SelectionTree shouldn't take them into account.

Callers may still sometimes care about semis of course:

when the selection is an expr with a non-expr parent, selection of the semicolon indicates intent to select the statement.
when a statement with a trailing semi is selected, we need to know its range to ensure it can be removed.

SelectionTree may or may not play a role here, but these are separate questions
from its core function of describing which AST nodes were selected.

The mechanism here is the TokenBuffer from syntax-trees. We use it in a
fairly low-level way (just to get boundaries of raw spelled tokens). The
actual mapping of AST nodes to coordinates continues to use the (fairly
mature) SourceLocation based logic. TokenBuffer/Syntax trees
don't currently offer an alternative to getFileRange(), I think.

Diff Detail

Repository: rL LLVM

Event Timeline

sammccall created this revision.Jul 30 2019, 3:50 PM

Herald added a project: Restricted Project. · View Herald TranscriptJul 30 2019, 3:50 PM

Herald added subscribers: cfe-commits, arphaman, jkorous, MaskRay. · View Herald Transcript

Harbormaster completed remote builds in B35850: Diff 212460.Jul 30 2019, 3:53 PM

It occurs to me that claim is O(node_tokens + log total_tokens) which is bad when the nodes are large.

Indeed for an input like namespace { namespace { namespace { ... } } } time is quadratic.

I think this is probably fine in practice. Against adversarial input clang certainly can take exponential time, and easily crash too.

If we do need to fix it my best idea is to give each "uninteresting" TokInfo (that is, !selected || claimed) a pointer to the next TokInfo with different flags. This would allow iteration to quickly skip over contiguous ranges one they had been traversed once.

Improve comments in SelectedTokens.

Harbormaster completed remote builds in B35878: Diff 212536.Jul 31 2019, 1:58 AM

Thanks for the comments, LGTM!

This revision is now accepted and ready to land.Jul 31 2019, 6:19 AM

SureYeaah added inline comments.Jul 31 2019, 7:14 AM

clang-tools-extra/clangd/Selection.cpp
50 ↗	(On Diff #212536)	Would this work correctly for nested templates? Or do we need to use the specialized token length function that we use for toHalfOpenFileRange?

sammccall marked 2 inline comments as done.Jul 31 2019, 10:45 AM

sammccall added inline comments.

clang-tools-extra/clangd/Selection.cpp
50 ↗	(On Diff #212536)	This uses the token list as an intermediary for matching selected chars with AST nodes. TokenBuffer will indeed by default lex `>>` as a right shift. So we're buggy here, but I think it mostly doesn't matter. If it's a double template: the innermost template will claim it first, if the template range touches the selection. Problem: if only the first > is selected, the inner template will only be partially selected. the outermost template will not get to claim it at all (if the inner template range touches the selection). Problem: if this is the only part of the outer template that's selected, it will be marked unselected. (This should be rare) If the inner template doesn't touch the selection, then the outer template will be selected but only partially, which is actually correct. Examples: a<b<c>> ~ b=partial (correct) ~~~~ b=partial (incorrect: b=complete) <-- this is the worst case ~~~~~~~ a=complete,b=complete (correct) ~ a=partial (correct) ~~ b=partial (incorrect: a=partial,b=partial) I'll send a followup to fix this case tomorrow (I think we can just always split the token in half) but I don't think it's critical.

Closed by commit rL367453: [clangd] Ignore semicolons, whitespace, and comments in SelectionTree. (authored by sammccall). · Explain WhyJul 31 2019, 10:54 AM

This revision was automatically updated to reflect the committed changes.

sammccall marked an inline comment as done.

Herald added a project: Restricted Project. · View Herald TranscriptJul 31 2019, 10:54 AM

Herald added a subscriber: llvm-commits. · View Herald Transcript

Revision Contents

Path

Size

clang-tools-extra/

trunk/

clangd/

Selection.h

12 lines

Selection.cpp

174 lines

refactor/

Tweak.cpp

2 lines

unittests/

SelectionTests.cpp

9 lines

Diff 212626

clang-tools-extra/trunk/clangd/Selection.h

	Show All 29 Lines
	// by the selection.			// by the selection.
	// - we expose a tree of the selected nodes and their lexical parents.			// - we expose a tree of the selected nodes and their lexical parents.
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef LLVM_CLANG_TOOLS_EXTRA_CLANGD_SELECTION_H			#ifndef LLVM_CLANG_TOOLS_EXTRA_CLANGD_SELECTION_H
	#define LLVM_CLANG_TOOLS_EXTRA_CLANGD_SELECTION_H			#define LLVM_CLANG_TOOLS_EXTRA_CLANGD_SELECTION_H
	#include "clang/AST/ASTTypeTraits.h"			#include "clang/AST/ASTTypeTraits.h"
	#include "clang/AST/PrettyPrinter.h"			#include "clang/AST/PrettyPrinter.h"
				#include "clang/Tooling/Syntax/Tokens.h"
	#include "llvm/ADT/SmallVector.h"			#include "llvm/ADT/SmallVector.h"

	namespace clang {			namespace clang {
	namespace clangd {			namespace clangd {
	class ParsedAST;			class ParsedAST;

	// A selection can partially or completely cover several AST nodes.			// A selection can partially or completely cover several AST nodes.
	// The SelectionTree contains nodes that are covered, and their parents.			// The SelectionTree contains nodes that are covered, and their parents.
	// SelectionTree does not contain all AST nodes, rather only:			// SelectionTree does not contain all AST nodes, rather only:
	// Decl, Stmt, TypeLoc, NestedNamespaceSpecifierLoc, CXXCtorInitializer.			// Decl, Stmt, TypeLoc, NestedNamespaceSpecifierLoc, CXXCtorInitializer.
	// (These are the nodes with source ranges that fit in DynTypedNode).			// (These are the nodes with source ranges that fit in DynTypedNode).
	//			//
	// Usually commonAncestor() is the place to start:			// Usually commonAncestor() is the place to start:
	// - it's the simplest answer to "what node is under the cursor"			// - it's the simplest answer to "what node is under the cursor"
	// - the selected Expr (for example) can be found by walking up the parent			// - the selected Expr (for example) can be found by walking up the parent
	// chain and checking Node->ASTNode.			// chain and checking Node->ASTNode.
	// - if you want to traverse the selected nodes, they are all under			// - if you want to traverse the selected nodes, they are all under
	// commonAncestor() in the tree.			// commonAncestor() in the tree.
	//			//
	// SelectionTree tries to behave sensibly in the presence of macros, but does			// SelectionTree tries to behave sensibly in the presence of macros, but does
	// not model any preprocessor concepts: the output is a subset of the AST.			// not model any preprocessor concepts: the output is a subset of the AST.
	// Currently comments, directives etc are treated as part of the lexically			//
	// containing AST node, (though we may want to change this in future).			// Comments, directives and whitespace are completely ignored.
				// Semicolons are also ignored, as the AST generally does not model them well.
	//			//
	// The SelectionTree owns the Node structures, but the ASTNode attributes			// The SelectionTree owns the Node structures, but the ASTNode attributes
	// point back into the AST it was constructed with.			// point back into the AST it was constructed with.
	class SelectionTree {			class SelectionTree {
	public:			public:
	// Creates a selection tree at the given byte offset in the main file.			// Creates a selection tree at the given byte offset in the main file.
	// This is approximately equivalent to a range of one character.			// This is approximately equivalent to a range of one character.
	// (Usually, the character to the right of Offset, sometimes to the left).			// (Usually, the character to the right of Offset, sometimes to the left).
	SelectionTree(ASTContext &AST, unsigned Offset);			SelectionTree(ASTContext &AST, const syntax::TokenBuffer &Tokens,
				unsigned Offset);
	// Creates a selection tree for the given range in the main file.			// Creates a selection tree for the given range in the main file.
	// The range includes bytes [Start, End).			// The range includes bytes [Start, End).
	// If Start == End, uses the same heuristics as SelectionTree(AST, Start).			// If Start == End, uses the same heuristics as SelectionTree(AST, Start).
	SelectionTree(ASTContext &AST, unsigned Start, unsigned End);			SelectionTree(ASTContext &AST, const syntax::TokenBuffer &Tokens,
				unsigned Start, unsigned End);

	// Describes to what extent an AST node is covered by the selection.			// Describes to what extent an AST node is covered by the selection.
	enum Selection {			enum Selection {
	// The AST node owns no characters covered by the selection.			// The AST node owns no characters covered by the selection.
	// Note that characters owned by children don't count:			// Note that characters owned by children don't count:
	// if (x == 0) scream();			// if (x == 0) scream();
	// ^^^^^^			// ^^^^^^
	// The IfStmt would be Unselected because all the selected characters are			// The IfStmt would be Unselected because all the selected characters are
	▲ Show 20 Lines • Show All 51 Lines • Show Last 20 Lines

clang-tools-extra/trunk/clangd/Selection.cpp

Show All 11 Lines
#include "SourceCode.h"		#include "SourceCode.h"
#include "clang/AST/ASTTypeTraits.h"		#include "clang/AST/ASTTypeTraits.h"
#include "clang/AST/PrettyPrinter.h"		#include "clang/AST/PrettyPrinter.h"
#include "clang/AST/RecursiveASTVisitor.h"		#include "clang/AST/RecursiveASTVisitor.h"
#include "clang/AST/TypeLoc.h"		#include "clang/AST/TypeLoc.h"
#include "clang/Basic/SourceLocation.h"		#include "clang/Basic/SourceLocation.h"
#include "clang/Basic/SourceManager.h"		#include "clang/Basic/SourceManager.h"
#include "clang/Basic/TokenKinds.h"		#include "clang/Basic/TokenKinds.h"
		#include "clang/Tooling/Syntax/Tokens.h"
#include "llvm/ADT/STLExtras.h"		#include "llvm/ADT/STLExtras.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"
#include <algorithm>		#include <algorithm>
#include <string>		#include <string>

namespace clang {		namespace clang {
namespace clangd {		namespace clangd {
namespace {		namespace {
using Node = SelectionTree::Node;		using Node = SelectionTree::Node;
using ast_type_traits::DynTypedNode;		using ast_type_traits::DynTypedNode;

// Stores a collection of (possibly-overlapping) integer ranges.		// Identifies which tokens are selected, and evaluates claims of source ranges
// When new ranges are added, hit-tests them against existing ones.		// by AST nodes. Tokens may be claimed only once: first-come, first-served.
class RangeSet {		class SelectedTokens {
public:		public:
// Returns true if any new offsets are covered.		SelectedTokens(llvm::ArrayRef<syntax::Token> Spelled, const SourceManager &SM,
// This is naive (linear in number of successful add() calls), but ok for now.		unsigned SelBegin, unsigned SelEnd)
bool add(unsigned Begin, unsigned End) {		: SelBegin(SelBegin), SelEnd(SelEnd) {
assert(std::is_sorted(Ranges.begin(), Ranges.end()));		// Extract bounds and selected-ness for all tokens spelled in the file.
assert(Begin < End);		Tokens.reserve(Spelled.size());
		for (const auto& Tok : Spelled) {
if (covered(Begin, End))		// As well as comments, don't count semicolons as real tokens.
return false;		// They're not properly claimed as expr-statement is missing from the AST.
auto Pair = std::make_pair(Begin, End);		if (Tok.kind() == tok::comment \|\| Tok.kind() == tok::semi)
Ranges.insert(llvm::upper_bound(Ranges, Pair), Pair);		continue;
return true;
		Tokens.emplace_back();
		TokInfo &S = Tokens.back();
		S.StartOffset = SM.getFileOffset(Tok.location());
		S.EndOffset = S.StartOffset + Tok.length();
		if (S.StartOffset >= SelBegin && S.EndOffset <= SelEnd)
		S.Selected = SelectionTree::Complete;
		else if (S.EndOffset > SelBegin && S.StartOffset < SelEnd)
		S.Selected = SelectionTree::Partial;
		else
		S.Selected = SelectionTree::Unselected;
		S.Claimed = false;
		}
}		}

private:		// Associates any tokens overlapping [Begin, End) with an AST node.
bool covered(unsigned Begin, unsigned End) {		// Tokens that were already claimed by another AST node are not claimed again.
assert(Begin < End);		// Returns whether the node is selected in the sense of SelectionTree.
for (const auto &R : Ranges) {		SelectionTree::Selection claim(unsigned Begin, unsigned End) {
if (Begin < R.first)		assert(Begin <= End);
return false; // The prefix [Begin, R.first) is not covered.
if (Begin < R.second) {		// Fast-path for missing the selection entirely.
Begin = R.second; // Prefix is covered, truncate the range.		if (Begin >= SelEnd \|\| End <= SelBegin)
if (Begin >= End)		return SelectionTree::Unselected;
return true;
		// We will consider the range (at least partially) selected if it hit any
		// selected and previously unclaimed token.
		bool ClaimedAnyToken = false;
		// The selection is (at most) partial if:
		// - any claimed token is partially selected
		// - any token in the range is unselected
		bool PartialSelection = false;

		// Find the first token that (maybe) overlaps the claimed range.
		auto Start = llvm::partition_point(Tokens, [&](const TokInfo &Tok) {
		return Tok.EndOffset <= Begin;
		});
		// Iterate over every token that overlaps the range.
		// Claim selected tokens, and update the two result flags.
		for (auto It = Start; It != Tokens.end() && It->StartOffset < End; ++It) {
		if (It->Selected) {
		if (!It->Claimed) {
		// Token is selected, in the node's range, and unclaimed; claim it.
		It->Claimed = true;
		ClaimedAnyToken = true;
		// If the token was only partially selected, so is the node.
		PartialSelection \|= (It->Selected == SelectionTree::Partial);
}		}
		} else {
		// If the node covers an unselected token, it's not completely selected.
		PartialSelection = true;
}		}
return false;
}		}

std::vector<std::pair<unsigned, unsigned>> Ranges; // Always sorted.		if (!ClaimedAnyToken)
		return SelectionTree::Unselected;
		return PartialSelection ? SelectionTree::Partial : SelectionTree::Complete;
		}

		private:
		struct TokInfo {
		unsigned StartOffset;
		unsigned EndOffset;
		SelectionTree::Selection Selected;
		bool Claimed;
		bool operator<(const TokInfo &Other) const {
		return StartOffset < Other.StartOffset;
		}
		};
		std::vector<TokInfo> Tokens;
		unsigned SelBegin, SelEnd;
};		};

// Show the type of a node for debugging.		// Show the type of a node for debugging.
void printNodeKind(llvm::raw_ostream &OS, const DynTypedNode &N) {		void printNodeKind(llvm::raw_ostream &OS, const DynTypedNode &N) {
if (const TypeLoc *TL = N.get<TypeLoc>()) {		if (const TypeLoc *TL = N.get<TypeLoc>()) {
// TypeLoc is a hierarchy, but has only a single ASTNodeKind.		// TypeLoc is a hierarchy, but has only a single ASTNodeKind.
// Synthesize the name from the Type subclass (except for QualifiedTypeLoc).		// Synthesize the name from the Type subclass (except for QualifiedTypeLoc).
if (TL->getTypeLocClass() == TypeLoc::Qualified)		if (TL->getTypeLocClass() == TypeLoc::Qualified)
Show All 22 Lines
// we decide whether to keep them or not. To be kept, they must either be		// we decide whether to keep them or not. To be kept, they must either be
// selected or contain some nodes that are.		// selected or contain some nodes that are.
//		//
// For simple cases (not inside macros) we prune subtrees that don't intersect.		// For simple cases (not inside macros) we prune subtrees that don't intersect.
class SelectionVisitor : public RecursiveASTVisitor<SelectionVisitor> {		class SelectionVisitor : public RecursiveASTVisitor<SelectionVisitor> {
public:		public:
// Runs the visitor to gather selected nodes and their ancestors.		// Runs the visitor to gather selected nodes and their ancestors.
// If there is any selection, the root (TUDecl) is the first node.		// If there is any selection, the root (TUDecl) is the first node.
static std::deque<Node> collect(ASTContext &AST, const PrintingPolicy &PP,		static std::deque<Node> collect(ASTContext &AST,
unsigned Begin, unsigned End, FileID File) {		const syntax::TokenBuffer &Tokens,
SelectionVisitor V(AST, PP, Begin, End, File);		const PrintingPolicy &PP, unsigned Begin,
		unsigned End, FileID File) {
		SelectionVisitor V(AST, Tokens, PP, Begin, End, File);
V.TraverseAST(AST);		V.TraverseAST(AST);
assert(V.Stack.size() == 1 && "Unpaired push/pop?");		assert(V.Stack.size() == 1 && "Unpaired push/pop?");
assert(V.Stack.top() == &V.Nodes.front());		assert(V.Stack.top() == &V.Nodes.front());
// We selected TUDecl if characters were unclaimed (or the file is empty).		// We selected TUDecl if tokens were unclaimed (or the file is empty).
if (V.Nodes.size() == 1 \|\| V.Claimed.add(Begin, End)) {		if (V.Nodes.size() == 1 \|\| V.Claimed.claim(Begin, End)) {
StringRef FileContent = AST.getSourceManager().getBufferData(File);		StringRef FileContent = AST.getSourceManager().getBufferData(File);
// Don't require the trailing newlines to be selected.		// Don't require the trailing newlines to be selected.
bool SelectedAll = Begin == 0 && End >= FileContent.rtrim().size();		bool SelectedAll = Begin == 0 && End >= FileContent.rtrim().size();
V.Stack.top()->Selected =		V.Stack.top()->Selected =
SelectedAll ? SelectionTree::Complete : SelectionTree::Partial;		SelectedAll ? SelectionTree::Complete : SelectionTree::Partial;
}		}
return std::move(V.Nodes);		return std::move(V.Nodes);
}		}
▲ Show 20 Lines • Show All 53 Lines • ▼ Show 20 Lines	public:
}		}
// Uninteresting parts of the AST that don't have locations within them.		// Uninteresting parts of the AST that don't have locations within them.
bool TraverseNestedNameSpecifier(NestedNameSpecifier *) { return true; }		bool TraverseNestedNameSpecifier(NestedNameSpecifier *) { return true; }
bool TraverseType(QualType) { return true; }		bool TraverseType(QualType) { return true; }

private:		private:
using Base = RecursiveASTVisitor<SelectionVisitor>;		using Base = RecursiveASTVisitor<SelectionVisitor>;

SelectionVisitor(ASTContext &AST, const PrintingPolicy &PP, unsigned SelBegin,		SelectionVisitor(ASTContext &AST, const syntax::TokenBuffer &Tokens,
unsigned SelEnd, FileID SelFile)		const PrintingPolicy &PP, unsigned SelBegin, unsigned SelEnd,
		FileID SelFile)
: SM(AST.getSourceManager()), LangOpts(AST.getLangOpts()),		: SM(AST.getSourceManager()), LangOpts(AST.getLangOpts()),
#ifndef NDEBUG		#ifndef NDEBUG
PrintPolicy(PP),		PrintPolicy(PP),
#endif		#endif
SelBegin(SelBegin), SelEnd(SelEnd), SelFile(SelFile),		Claimed(Tokens.spelledTokens(SelFile), SM, SelBegin, SelEnd),
		SelFile(SelFile),
SelBeginTokenStart(SM.getFileOffset(Lexer::GetBeginningOfToken(		SelBeginTokenStart(SM.getFileOffset(Lexer::GetBeginningOfToken(
SM.getComposedLoc(SelFile, SelBegin), SM, LangOpts))) {		SM.getComposedLoc(SelFile, SelBegin), SM, LangOpts))),
		SelEnd(SelEnd) {
// Ensure we have a node for the TU decl, regardless of traversal scope.		// Ensure we have a node for the TU decl, regardless of traversal scope.
Nodes.emplace_back();		Nodes.emplace_back();
Nodes.back().ASTNode = DynTypedNode::create(*AST.getTranslationUnitDecl());		Nodes.back().ASTNode = DynTypedNode::create(*AST.getTranslationUnitDecl());
Nodes.back().Parent = nullptr;		Nodes.back().Parent = nullptr;
Nodes.back().Selected = SelectionTree::Unselected;		Nodes.back().Selected = SelectionTree::Unselected;
Stack.push(&Nodes.back());		Stack.push(&Nodes.back());
}		}

▲ Show 20 Lines • Show All 117 Lines • ▼ Show 20 Lines	SelectionTree::Selection claimRange(SourceRange S) {
auto Range = toHalfOpenFileRange(SM, LangOpts, S);		auto Range = toHalfOpenFileRange(SM, LangOpts, S);
assert(Range && "We should be able to get the File Range");		assert(Range && "We should be able to get the File Range");
dlog("{1}claimRange: {0}", Range->printToString(SM), indent());		dlog("{1}claimRange: {0}", Range->printToString(SM), indent());
auto B = SM.getDecomposedLoc(Range->getBegin());		auto B = SM.getDecomposedLoc(Range->getBegin());
auto E = SM.getDecomposedLoc(Range->getEnd());		auto E = SM.getDecomposedLoc(Range->getEnd());
// Otherwise, nodes in macro expansions can't be selected.		// Otherwise, nodes in macro expansions can't be selected.
if (B.first != SelFile \|\| E.first != SelFile)		if (B.first != SelFile \|\| E.first != SelFile)
return SelectionTree::Unselected;		return SelectionTree::Unselected;
// Is there any overlap at all between the selection and range?
if (B.second >= SelEnd \|\| E.second < SelBegin)
return SelectionTree::Unselected;
// We may have hit something.
auto PreciseBounds = std::make_pair(B.second, E.second);
// Trim range using the selection, drop it if empty.
B.second = std::max(B.second, SelBegin);
E.second = std::min(E.second, SelEnd);
if (B.second >= E.second)
return SelectionTree::Unselected;
// Attempt to claim the remaining range. If there's nothing to claim, only		// Attempt to claim the remaining range. If there's nothing to claim, only
// children were selected.		// children were selected.
if (!Claimed.add(B.second, E.second))		SelectionTree::Selection Result = Claimed.claim(B.second, E.second);
return SelectionTree::Unselected;		if (Result)
dlog("{1}hit selection: {0}",		dlog("{1}hit selection: {0}",
SourceRange(SM.getComposedLoc(B.first, B.second),		SourceRange(SM.getComposedLoc(B.first, B.second),
SM.getComposedLoc(E.first, E.second))		SM.getComposedLoc(E.first, E.second))
.printToString(SM),		.printToString(SM),
indent());		indent());
// Some of our own characters are covered, this is a true hit.		return Result;
// Determine whether the node was completely covered.
return (PreciseBounds.first >= SelBegin && PreciseBounds.second <= SelEnd)
? SelectionTree::Complete
: SelectionTree::Partial;
}		}

std::string indent(int Offset = 0) {		std::string indent(int Offset = 0) {
// Cast for signed arithmetic.		// Cast for signed arithmetic.
int Amount = int(Stack.size()) + Offset;		int Amount = int(Stack.size()) + Offset;
assert(Amount >= 0);		assert(Amount >= 0);
return std::string(Amount, ' ');		return std::string(Amount, ' ');
}		}

SourceManager &SM;		SourceManager &SM;
const LangOptions &LangOpts;		const LangOptions &LangOpts;
#ifndef NDEBUG		#ifndef NDEBUG
const PrintingPolicy &PrintPolicy;		const PrintingPolicy &PrintPolicy;
#endif		#endif
std::stack<Node *> Stack;		std::stack<Node *> Stack;
RangeSet Claimed;		SelectedTokens Claimed;
std::deque<Node> Nodes; // Stable pointers as we add more nodes.		std::deque<Node> Nodes; // Stable pointers as we add more nodes.
// Half-open selection range.
unsigned SelBegin;
unsigned SelEnd;
FileID SelFile;		FileID SelFile;
// If the selection start slices a token in half, the beginning of that token.		// If the selection start slices a token in half, the beginning of that token.
// This is useful for checking whether the end of a token range overlaps		// This is useful for checking whether the end of a token range overlaps
// the selection: range.end < SelBeginTokenStart is equivalent to		// the selection: range.end < SelBeginTokenStart is equivalent to
// range.end + measureToken(range.end) < SelBegin (assuming range.end points		// range.end + measureToken(range.end) < SelBegin (assuming range.end points
// to a token), and it saves a lex every time.		// to a token), and it saves a lex every time.
unsigned SelBeginTokenStart;		unsigned SelBeginTokenStart;
		unsigned SelEnd;
};		};

} // namespace		} // namespace

void SelectionTree::print(llvm::raw_ostream &OS, const SelectionTree::Node &N,		void SelectionTree::print(llvm::raw_ostream &OS, const SelectionTree::Node &N,
int Indent) const {		int Indent) const {
if (N.Selected)		if (N.Selected)
OS.indent(Indent - 1) << (N.Selected == SelectionTree::Complete ? '*'		OS.indent(Indent - 1) << (N.Selected == SelectionTree::Complete ? '*'
Show All 29 Lines	static std::pair<unsigned, unsigned> pointBounds(unsigned Offset, FileID FID,
// We could choose either this byte or the previous. Usually we prefer the		// We could choose either this byte or the previous. Usually we prefer the
// character on the right of the cursor (or under a block cursor).		// character on the right of the cursor (or under a block cursor).
// But if that's whitespace, we likely want the token on the left.		// But if that's whitespace, we likely want the token on the left.
if (isWhitespace(Buf[Offset]) && !isWhitespace(Buf[Offset - 1]))		if (isWhitespace(Buf[Offset]) && !isWhitespace(Buf[Offset - 1]))
return {Offset - 1, Offset};		return {Offset - 1, Offset};
return {Offset, Offset + 1};		return {Offset, Offset + 1};
}		}

SelectionTree::SelectionTree(ASTContext &AST, unsigned Begin, unsigned End)		SelectionTree::SelectionTree(ASTContext &AST, const syntax::TokenBuffer &Tokens,
		unsigned Begin, unsigned End)
: PrintPolicy(AST.getLangOpts()) {		: PrintPolicy(AST.getLangOpts()) {
// No fundamental reason the selection needs to be in the main file,		// No fundamental reason the selection needs to be in the main file,
// but that's all clangd has needed so far.		// but that's all clangd has needed so far.
const SourceManager &SM = AST.getSourceManager();		const SourceManager &SM = AST.getSourceManager();
FileID FID = SM.getMainFileID();		FileID FID = SM.getMainFileID();
if (Begin == End)		if (Begin == End)
std::tie(Begin, End) = pointBounds(Begin, FID, AST);		std::tie(Begin, End) = pointBounds(Begin, FID, AST);
PrintPolicy.TerseOutput = true;		PrintPolicy.TerseOutput = true;
PrintPolicy.IncludeNewlines = false;		PrintPolicy.IncludeNewlines = false;

dlog("Computing selection for {0}",		dlog("Computing selection for {0}",
SourceRange(SM.getComposedLoc(FID, Begin), SM.getComposedLoc(FID, End))		SourceRange(SM.getComposedLoc(FID, Begin), SM.getComposedLoc(FID, End))
.printToString(SM));		.printToString(SM));
Nodes = SelectionVisitor::collect(AST, PrintPolicy, Begin, End, FID);		Nodes = SelectionVisitor::collect(AST, Tokens, PrintPolicy, Begin, End, FID);
Root = Nodes.empty() ? nullptr : &Nodes.front();		Root = Nodes.empty() ? nullptr : &Nodes.front();
dlog("Built selection tree\n{0}", *this);		dlog("Built selection tree\n{0}", *this);
}		}

SelectionTree::SelectionTree(ASTContext &AST, unsigned Offset)		SelectionTree::SelectionTree(ASTContext &AST, const syntax::TokenBuffer &Tokens,
: SelectionTree(AST, Offset, Offset) {}		unsigned Offset)
		: SelectionTree(AST, Tokens, Offset, Offset) {}

const Node *SelectionTree::commonAncestor() const {		const Node *SelectionTree::commonAncestor() const {
const Node *Ancestor = Root;		const Node *Ancestor = Root;
while (Ancestor->Children.size() == 1 && !Ancestor->Selected)		while (Ancestor->Children.size() == 1 && !Ancestor->Selected)
Ancestor = Ancestor->Children.front();		Ancestor = Ancestor->Children.front();
// Returning nullptr here is a bit unprincipled, but it makes the API safer:		// Returning nullptr here is a bit unprincipled, but it makes the API safer:
// the TranslationUnitDecl contains all of the preamble, so traversing it is a		// the TranslationUnitDecl contains all of the preamble, so traversing it is a
// performance cliff. Callers can check for null and use root() if they want.		// performance cliff. Callers can check for null and use root() if they want.
Show All 25 Lines

clang-tools-extra/trunk/clangd/refactor/Tweak.cpp

Show All 35 Lines	#ifndef NDEBUG
}		}
#endif		#endif
}		}
} // namespace		} // namespace

Tweak::Selection::Selection(ParsedAST &AST, unsigned RangeBegin,		Tweak::Selection::Selection(ParsedAST &AST, unsigned RangeBegin,
unsigned RangeEnd)		unsigned RangeEnd)
: AST(AST), SelectionBegin(RangeBegin), SelectionEnd(RangeEnd),		: AST(AST), SelectionBegin(RangeBegin), SelectionEnd(RangeEnd),
ASTSelection(AST.getASTContext(), RangeBegin, RangeEnd) {		ASTSelection(AST.getASTContext(), AST.getTokens(), RangeBegin, RangeEnd) {
auto &SM = AST.getSourceManager();		auto &SM = AST.getSourceManager();
Code = SM.getBufferData(SM.getMainFileID());		Code = SM.getBufferData(SM.getMainFileID());
Cursor = SM.getComposedLoc(SM.getMainFileID(), RangeBegin);		Cursor = SM.getComposedLoc(SM.getMainFileID(), RangeBegin);
}		}

std::vector<std::unique_ptr<Tweak>>		std::vector<std::unique_ptr<Tweak>>
prepareTweaks(const Tweak::Selection &S,		prepareTweaks(const Tweak::Selection &S,
llvm::function_ref<bool(const Tweak &)> Filter) {		llvm::function_ref<bool(const Tweak &)> Filter) {
Show All 33 Lines

clang-tools-extra/trunk/clangd/unittests/SelectionTests.cpp

Show All 17 Lines
namespace clangd {		namespace clangd {
namespace {		namespace {
using ::testing::UnorderedElementsAreArray;		using ::testing::UnorderedElementsAreArray;

SelectionTree makeSelectionTree(const StringRef MarkedCode, ParsedAST &AST) {		SelectionTree makeSelectionTree(const StringRef MarkedCode, ParsedAST &AST) {
Annotations Test(MarkedCode);		Annotations Test(MarkedCode);
switch (Test.points().size()) {		switch (Test.points().size()) {
case 1: // Point selection.		case 1: // Point selection.
return SelectionTree(AST.getASTContext(),		return SelectionTree(AST.getASTContext(), AST.getTokens(),
cantFail(positionToOffset(Test.code(), Test.point())));		cantFail(positionToOffset(Test.code(), Test.point())));
case 2: // Range selection.		case 2: // Range selection.
return SelectionTree(		return SelectionTree(
AST.getASTContext(),		AST.getASTContext(), AST.getTokens(),
cantFail(positionToOffset(Test.code(), Test.points()[0])),		cantFail(positionToOffset(Test.code(), Test.points()[0])),
cantFail(positionToOffset(Test.code(), Test.points()[1])));		cantFail(positionToOffset(Test.code(), Test.points()[1])));
default:		default:
ADD_FAILURE() << "Expected 1-2 points for selection.\n" << MarkedCode;		ADD_FAILURE() << "Expected 1-2 points for selection.\n" << MarkedCode;
return SelectionTree(AST.getASTContext(), 0u, 0u);		return SelectionTree(AST.getASTContext(), AST.getTokens(), 0u, 0u);
}		}
}		}

Range nodeRange(const SelectionTree::Node *N, ParsedAST &AST) {		Range nodeRange(const SelectionTree::Node *N, ParsedAST &AST) {
if (!N)		if (!N)
return Range{};		return Range{};
const SourceManager &SM = AST.getSourceManager();		const SourceManager &SM = AST.getSourceManager();
const LangOptions &LangOpts = AST.getASTContext().getLangOpts();		const LangOptions &LangOpts = AST.getASTContext().getLangOpts();
▲ Show 20 Lines • Show All 170 Lines • ▼ Show 20 Lines	Case Cases[] = {
// Point selections.		// Point selections.
{"void foo() { [[^foo]](); }", "DeclRefExpr"},		{"void foo() { [[^foo]](); }", "DeclRefExpr"},
{"void foo() { [[f^oo]](); }", "DeclRefExpr"},		{"void foo() { [[f^oo]](); }", "DeclRefExpr"},
{"void foo() { [[fo^o]](); }", "DeclRefExpr"},		{"void foo() { [[fo^o]](); }", "DeclRefExpr"},
{"void foo() { [[foo^()]]; }", "CallExpr"},		{"void foo() { [[foo^()]]; }", "CallExpr"},
{"void foo() { [[foo^]] (); }", "DeclRefExpr"},		{"void foo() { [[foo^]] (); }", "DeclRefExpr"},
{"int bar; void foo() [[{ foo (); }]]^", "CompoundStmt"},		{"int bar; void foo() [[{ foo (); }]]^", "CompoundStmt"},

		// Ignores whitespace, comments, and semicolons in the selection.
		{"void foo() { [[foo^()]]; /comment/^}", "CallExpr"},

// Tricky case: FunctionTypeLoc in FunctionDecl has a hole in it.		// Tricky case: FunctionTypeLoc in FunctionDecl has a hole in it.
{"[[^void]] foo();", "BuiltinTypeLoc"},		{"[[^void]] foo();", "BuiltinTypeLoc"},
{"[[void foo^()]];", "FunctionProtoTypeLoc"},		{"[[void foo^()]];", "FunctionProtoTypeLoc"},
{"[[^void foo^()]];", "FunctionDecl"},		{"[[^void foo^()]];", "FunctionDecl"},
{"[[void ^foo()]];", "FunctionDecl"},		{"[[void ^foo()]];", "FunctionDecl"},
// Tricky case: two VarDecls share a specifier.		// Tricky case: two VarDecls share a specifier.
{"[[int ^a]], b;", "VarDecl"},		{"[[int ^a]], b;", "VarDecl"},
{"[[int a, ^b]];", "VarDecl"},		{"[[int a, ^b]];", "VarDecl"},
▲ Show 20 Lines • Show All 136 Lines • Show Last 20 Lines