Diff 125939

docs/ClangFormatStyleOptions.rst

[...]
BreakConstructorInitializers (``BreakConstructorInitializersStyle``)

Possible values:		Possible values:

* ``BCIS_BeforeColon`` (in configuration: ``BeforeColon``)		* ``BCIS_BeforeColon`` (in configuration: ``BeforeColon``)
Break constructor initializers before the colon and after the commas.		Break constructor initializers before the colon and after the commas.

.. code-block:: c++		.. code-block:: c++

Constructor()		Constructor()
: initializer1(),		: initializer1(),
initializer2()		initializer2()

* ``BCIS_BeforeComma`` (in configuration: ``BeforeComma``)		* ``BCIS_BeforeComma`` (in configuration: ``BeforeComma``)
Break constructor initializers before the colon and commas, and align		Break constructor initializers before the colon and commas, and align
the commas with the colon.		the commas with the colon.

.. code-block:: c++		.. code-block:: c++

Constructor()		Constructor()
: initializer1()		: initializer1()
, initializer2()		, initializer2()

* ``BCIS_AfterColon`` (in configuration: ``AfterColon``)		* ``BCIS_AfterColon`` (in configuration: ``AfterColon``)
Break constructor initializers after the colon and commas.		Break constructor initializers after the colon and commas.

.. code-block:: c++		.. code-block:: c++

Constructor() :		Constructor() :
initializer1(),		initializer1(),
initializer2()		initializer2()



BreakStringLiterals (``bool``)		BreakStringLiterals (``bool``)
Allow breaking string literals when formatting.		Allow breaking string literals when formatting.

ColumnLimit (``unsigned``)		ColumnLimit (``unsigned``)
The column limit.		The column limit.
[...]
.. code-block:: c++

#include "b.h" into #include "a.h"		#include "b.h" into #include "a.h"
#include "b.h"		#include "b.h"
#include <lib/main.h> #include <lib/main.h>		#include <lib/main.h> #include <lib/main.h>
#include "a.h"		#include "a.h"

* ``IBS_Regroup`` (in configuration: ``Regroup``)		* ``IBS_Regroup`` (in configuration: ``Regroup``)
Merge multiple ``#include`` blocks together and sort as one.		Merge multiple ``#include`` blocks together and sort as one.
Then split into groups based on category priority. See ``IncludeCategories``.		Then split into groups based on category priority. See
		``IncludeCategories``.

.. code-block:: c++		.. code-block:: c++

#include "b.h" into #include "a.h"		#include "b.h" into #include "a.h"
#include "b.h"		#include "b.h"
#include <lib/main.h>		#include <lib/main.h>
#include "a.h" #include <lib/main.h>		#include "a.h" #include <lib/main.h>

[...]
* ``PAS_Middle`` (in configuration: ``Middle``)

.. code-block:: c++		.. code-block:: c++

int * a;		int * a;



RawStringFormats (``std::vector<RawStringFormat>``)		RawStringFormats (``std::vector<RawStringFormat>``)
Raw string delimiters denoting that the raw string contents are		Defines hints for detecting supported languages code blocks in
code in a particular language and can be reformatted.		raw strings.

A raw string with a matching delimiter will be reformatted assuming the		A raw string with a matching delimiter or a matching enclosing function
specified language based on a predefined style given by 'BasedOnStyle'.		name will be reformatted assuming the specified language based on a
If 'BasedOnStyle' is not found, the formatting is based on llvm style.		predefined style for that language defined in the .clang-format file.

		A matching delimiter of a raw string takes precedence over a matching
		enclosing function name for determining the language of the raw string
		contents.

		If a canonical delimiter is specified, occurrences of other delimiters for
		the same language will be updated to the canonical if possible.

		For a particular supported language at most one specification should
		exist.

To configure this in the .clang-format file, use:		To configure this in the .clang-format file, use:

.. code-block:: yaml		.. code-block:: yaml

RawStringFormats:		RawStringFormats:
- Delimiter: 'pb'		- Language: TextProto
Language: TextProto		Delimiters:
BasedOnStyle: llvm		- 'pb'
- Delimiter: 'proto'		- 'proto'
Language: TextProto		EnclosingFunctionNames:
BasedOnStyle: google		- 'PARSE_TEXT_PROTO'
		- Language: Cpp
		Delimiters:
		- 'cc'
		- 'cpp'
		CanonicalDelimiter: 'cc'
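
As an illustration of the configuration above, the following is a minimal sketch of C++ source that such a setup would be expected to pick up. The helper ``PARSE_TEXT_PROTO`` defined here is a hypothetical stand-in (only its name matters for the ``EnclosingFunctionNames`` lookup), and the function names ``viaDelimiter``, ``viaFunction`` and ``viaCpp`` are made up for the example.

```cpp
#include <string>

// Hypothetical stand-in for a text-proto parsing helper. Only the name is
// relevant here: it is the one listed under EnclosingFunctionNames.
inline std::string PARSE_TEXT_PROTO(const std::string &Text) { return Text; }

// The delimiter 'pb' is listed under Delimiters for TextProto, so the
// contents would be treated as a text proto.
inline std::string viaDelimiter() { return R"pb(name: "a" id: 1)pb"; }

// No delimiter, but the raw string is the argument of PARSE_TEXT_PROTO, so
// the enclosing function name would select TextProto.
inline std::string viaFunction() {
  return PARSE_TEXT_PROTO(R"(name: "b" id: 2)");
}

// The delimiter 'cpp' is listed for Cpp. With CanonicalDelimiter: 'cc', the
// delimiter would be rewritten to R"cc(...)cc" when the contents allow it.
inline std::string viaCpp() { return R"cpp(int x = 1;)cpp"; }
```

The delimiter is not part of the string value, so rewriting ``cpp`` to ``cc`` leaves the program's behavior unchanged.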

ReflowComments (``bool``)		ReflowComments (``bool``)
If ``true``, clang-format will attempt to re-flow comments.		If ``true``, clang-format will attempt to re-flow comments.

.. code-block:: c++		.. code-block:: c++

false:		false:
// veryVeryVeryVeryVeryVeryVeryVeryVeryVeryVeryLongComment with plenty of information		// veryVeryVeryVeryVeryVeryVeryVeryVeryVeryVeryLongComment with plenty of information
[...]

include/clang/Format/Format.h

[...]
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef LLVM_CLANG_FORMAT_FORMAT_H		#ifndef LLVM_CLANG_FORMAT_FORMAT_H
#define LLVM_CLANG_FORMAT_FORMAT_H		#define LLVM_CLANG_FORMAT_FORMAT_H

#include "clang/Basic/LangOptions.h"		#include "clang/Basic/LangOptions.h"
#include "clang/Tooling/Core/Replacement.h"		#include "clang/Tooling/Core/Replacement.h"
#include "llvm/ADT/ArrayRef.h"		#include "llvm/ADT/ArrayRef.h"
		#include <array>
#include <system_error>		#include <system_error>

namespace clang {		namespace clang {

class Lexer;		class Lexer;
class SourceManager;		class SourceManager;
class DiagnosticConsumer;		class DiagnosticConsumer;

[...]
enum LanguageKind {
LK_ObjC,		LK_ObjC,
/// Should be used for Protocol Buffers		/// Should be used for Protocol Buffers
/// (https://developers.google.com/protocol-buffers/).		/// (https://developers.google.com/protocol-buffers/).
LK_Proto,		LK_Proto,
/// Should be used for TableGen code.		/// Should be used for TableGen code.
LK_TableGen,		LK_TableGen,
/// Should be used for Protocol Buffer messages in text format		/// Should be used for Protocol Buffer messages in text format
/// (https://developers.google.com/protocol-buffers/).		/// (https://developers.google.com/protocol-buffers/).
LK_TextProto		LK_TextProto,
};		};
		djasper (Done): Let's find a way to implement this without it being in the public header file.
		// Keep up-to-date with the size of the above enum.
		static constexpr size_t kLanguageKindSize = 1 + LK_TextProto;

bool isCpp() const { return Language == LK_Cpp || Language == LK_ObjC; }		bool isCpp() const { return Language == LK_Cpp || Language == LK_ObjC; }

/// \brief Language, this format style is targeted at.		/// \brief Language, this format style is targeted at.
LanguageKind Language;		LanguageKind Language;

/// \brief A regular expression matching macros that start a block.		/// \brief A regular expression matching macros that start a block.
/// \code		/// \code
/// # With:		/// # With:
[...]
enum PointerAlignmentStyle {
PAS_Middle		PAS_Middle
};		};

/// \brief Pointer and reference alignment style.		/// \brief Pointer and reference alignment style.
PointerAlignmentStyle PointerAlignment;		PointerAlignmentStyle PointerAlignment;

/// See documentation of ``RawStringFormats``.		/// See documentation of ``RawStringFormats``.
struct RawStringFormat {		struct RawStringFormat {
/// \brief The delimiter that this raw string format matches.		/// \brief The language of this raw string format.
std::string Delimiter;
/// \brief The language of this raw string.
LanguageKind Language;		LanguageKind Language;
/// \brief The style name on which this raw string format is based on.		/// \brief A list of raw string delimiters that match this language.
/// If not specified, the raw string format is based on the style that this		std::vector<std::string> Delimiters;
/// format is based on.		/// \brief A list of enclosing function names that match this language.
std::string BasedOnStyle;		std::vector<std::string> EnclosingFunctionNames;
		/// \brief The canonical delimiter for this language.
		djasper (Not Done): Can you pull apart this patch? In my view, it has three parts that have an ordering but are actually fairly independent:
		1. Propagate all configured languages to the formatting library. First patch to land; should not affect the visible behavior.
		2. Restructure RawStringFormat to be centered around each language. This is a restructuring to make things easier and uses #1.
		3. Add a CanonicalDelimiter and make clang-format canonicalize it.
		I'll focus my comments on what's required for #1 for now, as that is already complicated (IMO).
		krasimir (author, Not Done): I believe these should all go together: the reason that we propagate all configured languages to the formatting library is to be able to use them as a replacement for the BasedOnStyle in RawStringFormat. To make this possible, we need to update the internal structure of RawStringFormat itself to base it around each language. The canonical delimiter part is just a convenience for this, I guess, and could be split out. My biggest concern with (1) is that, since it has no visible behavior and no other uses except for the adaptation of (2), it is not testable by itself, and it's not evident that a patch doing just (1) would handle things correctly.
		djasper (Not Done): OK, if you wish; this is not an unreasonable argument. But let's still do the code review in two steps: let's for now just get the part that handles multiple languages straight and figure out the rest once we are sure that part is fine. (I do think you can test it, though, but it depends on whether I can convince you to go with the FormatStyleSet approach ;) )
		klimek (Not Done): On a philosophical level, shouldn't something that has no visible behavior and just restructures the code be tested by existing tests? Enclosing function names also seem like an extra feature that could be pulled out, btw.
		std::string CanonicalDelimiter;

bool operator==(const RawStringFormat &Other) const {		bool operator==(const RawStringFormat &Other) const {
return Delimiter == Other.Delimiter && Language == Other.Language &&		return Language == Other.Language && Delimiters == Other.Delimiters &&
BasedOnStyle == Other.BasedOnStyle;		EnclosingFunctionNames == Other.EnclosingFunctionNames &&
		CanonicalDelimiter == Other.CanonicalDelimiter;
}		}
};		};

/// \brief Raw string delimiters denoting that the raw string contents are		/// \brief Defines hints for detecting supported languages code blocks in
/// code in a particular language and can be reformatted.		/// raw strings.
		///
		/// A raw string with a matching delimiter or a matching enclosing function
		/// name will be reformatted assuming the specified language based on a
		/// predefined style for that language defined in the .clang-format file.
		///
		/// A matching delimiter of a raw string takes precedence over a matching
		/// enclosing function name for determining the language of the raw string
		/// contents.
///		///
/// A raw string with a matching delimiter will be reformatted assuming the		/// If a canonical delimiter is specified, occurrences of other delimiters for
/// specified language based on a predefined style given by 'BasedOnStyle'.		/// the same language will be updated to the canonical if possible.
/// If 'BasedOnStyle' is not found, the formatting is based on llvm style.		///
		/// For a particular supported language at most one specification should
		/// exist.
///		///
/// To configure this in the .clang-format file, use:		/// To configure this in the .clang-format file, use:
/// \code{.yaml}		/// \code{.yaml}
/// RawStringFormats:		/// RawStringFormats:
/// - Delimiter: 'pb'		/// - Language: TextProto
/// Language: TextProto		/// Delimiters:
/// BasedOnStyle: llvm		/// - 'pb'
/// - Delimiter: 'proto'		/// - 'proto'
/// Language: TextProto		/// EnclosingFunctionNames:
/// BasedOnStyle: google		/// - 'PARSE_TEXT_PROTO'
		/// - Language: Cpp
		/// Delimiters:
		/// - 'cc'
		/// - 'cpp'
		/// CanonicalDelimiter: 'cc'
/// \endcode		/// \endcode
std::vector<RawStringFormat> RawStringFormats;		std::vector<RawStringFormat> RawStringFormats;

/// \brief If ``true``, clang-format will attempt to re-flow comments.		/// \brief If ``true``, clang-format will attempt to re-flow comments.
/// \code		/// \code
/// false:		/// false:
/// // veryVeryVeryVeryVeryVeryVeryVeryVeryVeryVeryLongComment with plenty of information		/// // veryVeryVeryVeryVeryVeryVeryVeryVeryVeryVeryLongComment with plenty of information
/// /* second veryVeryVeryVeryVeryVeryVeryVeryVeryVeryVeryLongComment with plenty of information */		/// /* second veryVeryVeryVeryVeryVeryVeryVeryVeryVeryVeryLongComment with plenty of information */
[...]
return AccessModifierOffset == R.AccessModifierOffset &&
SpacesInAngles == R.SpacesInAngles &&		SpacesInAngles == R.SpacesInAngles &&
SpacesInContainerLiterals == R.SpacesInContainerLiterals &&		SpacesInContainerLiterals == R.SpacesInContainerLiterals &&
SpacesInCStyleCastParentheses == R.SpacesInCStyleCastParentheses &&		SpacesInCStyleCastParentheses == R.SpacesInCStyleCastParentheses &&
SpacesInParentheses == R.SpacesInParentheses &&		SpacesInParentheses == R.SpacesInParentheses &&
SpacesInSquareBrackets == R.SpacesInSquareBrackets &&		SpacesInSquareBrackets == R.SpacesInSquareBrackets &&
Standard == R.Standard && TabWidth == R.TabWidth &&		Standard == R.Standard && TabWidth == R.TabWidth &&
UseTab == R.UseTab;		UseTab == R.UseTab;
}		}
};

		// A mapping from FormatStyle::LanguageKind to a format style to use for
		// code blocks in that language.
		std::array<std::shared_ptr<FormatStyle>, FormatStyle::kLanguageKindSize>
		AdditionalLanguageStyles;
		};
/// \brief Returns a format style complying with the LLVM coding standards:		/// \brief Returns a format style complying with the LLVM coding standards:
/// http://llvm.org/docs/CodingStandards.html.		/// http://llvm.org/docs/CodingStandards.html.
FormatStyle getLLVMStyle();		FormatStyle getLLVMStyle();

/// \brief Returns a format style complying with one of Google's style guides:		/// \brief Returns a format style complying with one of Google's style guides:
/// http://google-styleguide.googlecode.com/svn/trunk/cppguide.xml.		/// http://google-styleguide.googlecode.com/svn/trunk/cppguide.xml.
/// http://google-styleguide.googlecode.com/svn/trunk/javascriptguide.xml.		/// http://google-styleguide.googlecode.com/svn/trunk/javascriptguide.xml.
/// https://developers.google.com/protocol-buffers/docs/style.		/// https://developers.google.com/protocol-buffers/docs/style.
[...]

lib/Format/ContinuationIndenter.h

	[...]
	class BreakableToken;			class BreakableToken;
	struct FormatToken;			struct FormatToken;
	struct LineState;			struct LineState;
	struct ParenState;			struct ParenState;
	struct RawStringFormatStyleManager;			struct RawStringFormatStyleManager;
	class WhitespaceManager;			class WhitespaceManager;

	struct RawStringFormatStyleManager {			struct RawStringFormatStyleManager {
	llvm::StringMap<FormatStyle> DelimiterStyle;			llvm::StringMap<const FormatStyle*> DelimiterStyle;
				llvm::StringMap<const FormatStyle*> EnclosingFunctionNameStyle;

	RawStringFormatStyleManager(const FormatStyle &CodeStyle);			RawStringFormatStyleManager(const FormatStyle &CodeStyle);

	llvm::Optional<FormatStyle> get(StringRef Delimiter) const;			const FormatStyle *getDelimiterStyle(StringRef Delimiter) const;

				const FormatStyle *
				getEnclosingFunctionNameStyle(StringRef EnclosingFunctionName) const;
	};			};

	class ContinuationIndenter {			class ContinuationIndenter {
	public:			public:
	/// \brief Constructs a \c ContinuationIndenter to format \p Line starting in			/// \brief Constructs a \c ContinuationIndenter to format \p Line starting in
	/// column \p FirstIndent.			/// column \p FirstIndent.
	ContinuationIndenter(const FormatStyle &Style,			ContinuationIndenter(const FormatStyle &Style,
	const AdditionalKeywords &Keywords,			const AdditionalKeywords &Keywords,
	[...]

lib/Format/ContinuationIndenter.cpp

[...]
static llvm::Optional<StringRef> getRawStringDelimiter(StringRef TokenText) {
size_t RParenPos = TokenText.size() - Delimiter.size() - 2;		size_t RParenPos = TokenText.size() - Delimiter.size() - 2;
if (TokenText[RParenPos] != ')')		if (TokenText[RParenPos] != ')')
return None;		return None;
if (!TokenText.substr(RParenPos + 1).startswith(Delimiter))		if (!TokenText.substr(RParenPos + 1).startswith(Delimiter))
return None;		return None;
return Delimiter;		return Delimiter;
}		}

		static StringRef
		getCanonicalRawStringDelimiter(const FormatStyle &Style,
		FormatStyle::LanguageKind Language) {
		for (const auto &Format : llvm::reverse(Style.RawStringFormats)) {
		if (Format.Language == Language)
		return StringRef(Format.CanonicalDelimiter);
		}
		return "";
		}

RawStringFormatStyleManager::RawStringFormatStyleManager(		RawStringFormatStyleManager::RawStringFormatStyleManager(
const FormatStyle &CodeStyle) {		const FormatStyle &CodeStyle) {
for (const auto &RawStringFormat : CodeStyle.RawStringFormats) {		for (const auto &RawStringFormat : CodeStyle.RawStringFormats) {
FormatStyle Style;		for (StringRef Delimiter : RawStringFormat.Delimiters) {
if (!getPredefinedStyle(RawStringFormat.BasedOnStyle,		DelimiterStyle.insert(
RawStringFormat.Language, &Style)) {		{Delimiter,
Style = getLLVMStyle();		CodeStyle.AdditionalLanguageStyles[RawStringFormat.Language].get()});
Style.Language = RawStringFormat.Language;		}
		for (StringRef EnclosingFunctionName :
		RawStringFormat.EnclosingFunctionNames) {
		EnclosingFunctionNameStyle.insert(
		{EnclosingFunctionName,
		CodeStyle.AdditionalLanguageStyles[RawStringFormat.Language].get()});
}		}
Style.ColumnLimit = CodeStyle.ColumnLimit;
DelimiterStyle.insert({RawStringFormat.Delimiter, Style});
}		}
}		}

llvm::Optional<FormatStyle>		const FormatStyle *
RawStringFormatStyleManager::get(StringRef Delimiter) const {		RawStringFormatStyleManager::getDelimiterStyle(StringRef Delimiter) const {
auto It = DelimiterStyle.find(Delimiter);		auto It = DelimiterStyle.find(Delimiter);
if (It == DelimiterStyle.end())		if (It == DelimiterStyle.end())
return None;		return nullptr;
		return It->second;
		}

		const FormatStyle *RawStringFormatStyleManager::getEnclosingFunctionNameStyle(
		StringRef EnclosingFunctionName) const {
		auto It = EnclosingFunctionNameStyle.find(EnclosingFunctionName);
		if (It == EnclosingFunctionNameStyle.end())
		return nullptr;
return It->second;		return It->second;
}		}

ContinuationIndenter::ContinuationIndenter(const FormatStyle &Style,		ContinuationIndenter::ContinuationIndenter(const FormatStyle &Style,
const AdditionalKeywords &Keywords,		const AdditionalKeywords &Keywords,
const SourceManager &SourceMgr,		const SourceManager &SourceMgr,
WhitespaceManager &Whitespaces,		WhitespaceManager &Whitespaces,
encoding::Encoding Encoding,		encoding::Encoding Encoding,
[...]
return encoding::columnWidthWithTabs(Text.substr(LastNewlinePos),
/*StartColumn=*/0, TabWidth, Encoding);		/*StartColumn=*/0, TabWidth, Encoding);
}		}
}		}

unsigned ContinuationIndenter::reformatRawStringLiteral(		unsigned ContinuationIndenter::reformatRawStringLiteral(
const FormatToken &Current, LineState &State,		const FormatToken &Current, LineState &State,
const FormatStyle &RawStringStyle, bool DryRun) {		const FormatStyle &RawStringStyle, bool DryRun) {
unsigned StartColumn = State.Column - Current.ColumnWidth;		unsigned StartColumn = State.Column - Current.ColumnWidth;
auto Delimiter = *getRawStringDelimiter(Current.TokenText);		StringRef OldDelimiter = *getRawStringDelimiter(Current.TokenText);
		StringRef NewDelimiter =
		getCanonicalRawStringDelimiter(Style, RawStringStyle.Language);
		if (NewDelimiter.empty() || OldDelimiter.empty())
		NewDelimiter = OldDelimiter;
// The text of a raw string is between the leading 'R"delimiter(' and the		// The text of a raw string is between the leading 'R"delimiter(' and the
// trailing 'delimiter)"'.		// trailing 'delimiter)"'.
unsigned PrefixSize = 3 + Delimiter.size();		unsigned OldPrefixSize = 3 + OldDelimiter.size();
unsigned SuffixSize = 2 + Delimiter.size();		unsigned OldSuffixSize = 2 + OldDelimiter.size();
		std::string RawText =
		Current.TokenText.substr(OldPrefixSize).drop_back(OldSuffixSize);
		if (NewDelimiter != OldDelimiter) {
		// Don't update to the canonical delimiter 'deli' if ')deli"' occurs in the
		// raw string.
		std::string CanonicalDelimiterSuffix = (")" + NewDelimiter + "\"").str();
		if (StringRef(RawText).contains(CanonicalDelimiterSuffix))
		NewDelimiter = OldDelimiter;
		}

		unsigned NewPrefixSize = 3 + NewDelimiter.size();
		unsigned NewSuffixSize = 2 + NewDelimiter.size();

// The first start column is the column the raw text starts.		// The first start column is the column the raw text starts after formatting.
unsigned FirstStartColumn = StartColumn + PrefixSize;		unsigned FirstStartColumn = StartColumn + NewPrefixSize;

// The next start column is the intended indentation a line break inside		// The next start column is the intended indentation a line break inside
// the raw string at level 0. It is determined by the following rules:		// the raw string at level 0. It is determined by the following rules:
// - if the content starts on newline, it is one level more than the current		// - if the content starts on newline, it is one level more than the current
// indent, and		// indent, and
// - if the content does not start on a newline, it is the first start		// - if the content does not start on a newline, it is the first start
// column.		// column.
// These rules have the advantage that the formatted content both does not		// These rules have the advantage that the formatted content both does not
// violate the rectangle rule and visually flows within the surrounding		// violate the rectangle rule and visually flows within the surrounding
// source.		// source.
bool ContentStartsOnNewline = Current.TokenText[PrefixSize] == '\n';		bool ContentStartsOnNewline = Current.TokenText[OldPrefixSize] == '\n';
unsigned NextStartColumn = ContentStartsOnNewline		unsigned NextStartColumn = ContentStartsOnNewline
? State.Stack.back().Indent + Style.IndentWidth		? State.Stack.back().Indent + Style.IndentWidth
: FirstStartColumn;		: FirstStartColumn;

// The last start column is the column the raw string suffix starts if it is		// The last start column is the column the raw string suffix starts if it is
// put on a newline.		// put on a newline.
// The last start column is the intended indentation of the raw string postfix		// The last start column is the intended indentation of the raw string postfix
// if it is put on a newline. It is determined by the following rules:		// if it is put on a newline. It is determined by the following rules:
// - if the raw string prefix starts on a newline, it is the column where		// - if the raw string prefix starts on a newline, it is the column where
// that raw string prefix starts, and		// that raw string prefix starts, and
// - if the raw string prefix does not start on a newline, it is the current		// - if the raw string prefix does not start on a newline, it is the current
// indent.		// indent.
unsigned LastStartColumn = Current.NewlinesBefore		unsigned LastStartColumn = Current.NewlinesBefore
? FirstStartColumn - PrefixSize		? FirstStartColumn - NewPrefixSize
: State.Stack.back().Indent;		: State.Stack.back().Indent;

std::string RawText =
Current.TokenText.substr(PrefixSize).drop_back(SuffixSize);

std::pair<tooling::Replacements, unsigned> Fixes = internal::reformat(		std::pair<tooling::Replacements, unsigned> Fixes = internal::reformat(
RawStringStyle, RawText, {tooling::Range(0, RawText.size())},		RawStringStyle, RawText, {tooling::Range(0, RawText.size())},
FirstStartColumn, NextStartColumn, LastStartColumn, "<stdin>",		FirstStartColumn, NextStartColumn, LastStartColumn, "<stdin>",
/*Status=*/nullptr);		/*Status=*/nullptr);

auto NewCode = applyAllReplacements(RawText, Fixes.first);		auto NewCode = applyAllReplacements(RawText, Fixes.first);
tooling::Replacements NoFixes;		tooling::Replacements NoFixes;
if (!NewCode) {		if (!NewCode) {
State.Column += Current.ColumnWidth;		State.Column += Current.ColumnWidth;
return 0;		return 0;
}		}
if (!DryRun) {		if (!DryRun) {
		if (NewDelimiter != OldDelimiter) {
		// In 'R"delimiter(...', the delimiter starts 2 characters after the start
		// of the token.
		SourceLocation PrefixDelimiterStart =
		Current.Tok.getLocation().getLocWithOffset(2);
		auto PrefixErr = Whitespaces.addReplacement(tooling::Replacement(
		SourceMgr, PrefixDelimiterStart, OldDelimiter.size(), NewDelimiter));
		if (PrefixErr) {
		llvm::errs()
		<< "Failed to update the prefix delimiter of a raw string: "
		<< llvm::toString(std::move(PrefixErr)) << "\n";
		}
		// In 'R"delimiter(...)delimiter"', the suffix delimiter starts at
		// position length - 1 - |delimiter|.
		SourceLocation SuffixDelimiterStart =
		Current.Tok.getLocation().getLocWithOffset(Current.TokenText.size() -
		1 - OldDelimiter.size());
		auto SuffixErr = Whitespaces.addReplacement(tooling::Replacement(
		SourceMgr, SuffixDelimiterStart, OldDelimiter.size(), NewDelimiter));
		if (SuffixErr) {
		llvm::errs()
		<< "Failed to update the suffix delimiter of a raw string: "
		<< llvm::toString(std::move(SuffixErr)) << "\n";
		}
		}
SourceLocation OriginLoc =		SourceLocation OriginLoc =
Current.Tok.getLocation().getLocWithOffset(PrefixSize);		Current.Tok.getLocation().getLocWithOffset(OldPrefixSize);
for (const tooling::Replacement &Fix : Fixes.first) {		for (const tooling::Replacement &Fix : Fixes.first) {
auto Err = Whitespaces.addReplacement(tooling::Replacement(		auto Err = Whitespaces.addReplacement(tooling::Replacement(
SourceMgr, OriginLoc.getLocWithOffset(Fix.getOffset()),		SourceMgr, OriginLoc.getLocWithOffset(Fix.getOffset()),
Fix.getLength(), Fix.getReplacementText()));		Fix.getLength(), Fix.getReplacementText()));
if (Err) {		if (Err) {
llvm::errs() << "Failed to reformat raw string: "		llvm::errs() << "Failed to reformat raw string: "
<< llvm::toString(std::move(Err)) << "\n";		<< llvm::toString(std::move(Err)) << "\n";
}		}
}		}
}		}
unsigned RawLastLineEndColumn = getLastLineEndColumn(		unsigned RawLastLineEndColumn = getLastLineEndColumn(
*NewCode, FirstStartColumn, Style.TabWidth, Encoding);		*NewCode, FirstStartColumn, Style.TabWidth, Encoding);
State.Column = RawLastLineEndColumn + SuffixSize;		State.Column = RawLastLineEndColumn + NewSuffixSize;
return Fixes.second;		return Fixes.second;
}		}
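
The delimiter selection performed at the top of `reformatRawStringLiteral` can be sketched in isolation as follows. This is a simplified illustration that uses `std::string` in place of `StringRef`; the function name `chooseDelimiter` is hypothetical and does not appear in the patch.

```cpp
#include <string>

// Returns the delimiter to emit for a raw string literal.
// Mirrors the rules in the patch:
//  - if either the canonical or the existing delimiter is empty, keep the
//    existing one;
//  - if the raw text contains ')deli"' for the canonical delimiter 'deli',
//    switching would terminate the literal early, so keep the existing one;
//  - otherwise switch to the canonical delimiter.
std::string chooseDelimiter(const std::string &OldDelimiter,
                            const std::string &CanonicalDelimiter,
                            const std::string &RawText) {
  if (CanonicalDelimiter.empty() || OldDelimiter.empty())
    return OldDelimiter;
  if (CanonicalDelimiter == OldDelimiter)
    return OldDelimiter;
  const std::string CanonicalSuffix = ")" + CanonicalDelimiter + "\"";
  if (RawText.find(CanonicalSuffix) != std::string::npos)
    return OldDelimiter;
  return CanonicalDelimiter;
}
```

Because the prefix is `R"delimiter(` (3 + delimiter length) and the suffix is `)delimiter"` (2 + delimiter length), a change of delimiter also changes the column at which the raw text starts, which is why the patch tracks old and new prefix sizes separately.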

unsigned ContinuationIndenter::addMultilineToken(const FormatToken &Current,		unsigned ContinuationIndenter::addMultilineToken(const FormatToken &Current,
LineState &State) {		LineState &State) {
// Break before further function parameters on all levels.		// Break before further function parameters on all levels.
for (unsigned i = 0, e = State.Stack.size(); i != e; ++i)		for (unsigned i = 0, e = State.Stack.size(); i != e; ++i)
State.Stack[i].BreakBeforeParameter = true;		State.Stack[i].BreakBeforeParameter = true;
[...]
unsigned ContinuationIndenter::handleEndOfLine(const FormatToken &Current,
}		}
if (State.Column > getColumnLimit(State)) {		if (State.Column > getColumnLimit(State)) {
unsigned ExcessCharacters = State.Column - getColumnLimit(State);		unsigned ExcessCharacters = State.Column - getColumnLimit(State);
Penalty += Style.PenaltyExcessCharacter * ExcessCharacters;		Penalty += Style.PenaltyExcessCharacter * ExcessCharacters;
}		}
return Penalty;		return Penalty;
}		}

		static StringRef getEnclosingFunctionName(const FormatToken& Current) {
		// Look for: 'function(' or 'function<templates>(' before Current.
		auto Tok = Current.getPreviousNonComment();
		if (!Tok || !Tok->is(tok::l_paren)) return "";
		Tok = Tok->getPreviousNonComment();
		if (!Tok) return "";
		if (Tok->is(TT_TemplateCloser)) {
		Tok = Tok->MatchingParen;
		if (Tok)
		Tok = Tok->getPreviousNonComment();
		}
		if (!Tok || !Tok->is(tok::identifier)) return "";
		return Tok->TokenText;
		}

llvm::Optional<FormatStyle>		llvm::Optional<FormatStyle>
ContinuationIndenter::getRawStringStyle(const FormatToken &Current,		ContinuationIndenter::getRawStringStyle(const FormatToken &Current,
const LineState &State) {		const LineState &State) {
if (!Current.isStringLiteral())		if (!Current.isStringLiteral())
return None;		return None;
auto Delimiter = getRawStringDelimiter(Current.TokenText);		auto Delimiter = getRawStringDelimiter(Current.TokenText);
if (!Delimiter)		if (!Delimiter)
return None;		return None;
auto RawStringStyle = RawStringFormats.get(*Delimiter);		auto RawStringStyle = RawStringFormats.getDelimiterStyle(*Delimiter);
		if (!RawStringStyle)
		RawStringStyle = RawStringFormats.getEnclosingFunctionNameStyle(
		getEnclosingFunctionName(Current));
if (!RawStringStyle)		if (!RawStringStyle)
return None;		return None;
RawStringStyle->ColumnLimit = getColumnLimit(State);
return RawStringStyle;		FormatStyle ResultStyle = *RawStringStyle;
		ResultStyle.ColumnLimit = getColumnLimit(State);
		return ResultStyle;
}		}
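
The lookup order used by `getRawStringStyle` (delimiter first, enclosing function name second) can be sketched with standard containers. This is only an illustration under simplifying assumptions: the struct `RawStringLanguageLookup` is hypothetical, and it maps to language names as plain strings rather than to `FormatStyle` pointers.

```cpp
#include <map>
#include <string>

// Simplified stand-in for RawStringFormatStyleManager: both tables map a key
// to a language name instead of a FormatStyle.
struct RawStringLanguageLookup {
  std::map<std::string, std::string> ByDelimiter;
  std::map<std::string, std::string> ByEnclosingFunction;

  // Returns the language for a raw string, or nullptr if nothing matches.
  // A matching delimiter takes precedence over a matching enclosing
  // function name.
  const std::string *find(const std::string &Delimiter,
                          const std::string &EnclosingFunction) const {
    auto It = ByDelimiter.find(Delimiter);
    if (It != ByDelimiter.end())
      return &It->second;
    It = ByEnclosingFunction.find(EnclosingFunction);
    if (It != ByEnclosingFunction.end())
      return &It->second;
    return nullptr;
  }
};
```

With `cc` registered for Cpp and `PARSE_TEXT_PROTO` registered for TextProto, a literal such as `PARSE_TEXT_PROTO(R"cc(...)cc")` would resolve to Cpp, since the delimiter is consulted first.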

std::unique_ptr<BreakableToken> ContinuationIndenter::createBreakableToken(		std::unique_ptr<BreakableToken> ContinuationIndenter::createBreakableToken(
const FormatToken &Current, LineState &State, bool AllowBreak) {		const FormatToken &Current, LineState &State, bool AllowBreak) {
unsigned StartColumn = State.Column - Current.ColumnWidth;		unsigned StartColumn = State.Column - Current.ColumnWidth;
if (Current.isStringLiteral()) {		if (Current.isStringLiteral()) {
// FIXME: String literal breaking is currently disabled for Java and JS, as		// FIXME: String literal breaking is currently disabled for Java and JS, as
// it requires strings to be merged using "+" which we don't support.		// it requires strings to be merged using "+" which we don't support.
[...]

lib/Format/Format.cpp

[...]
#include "clang/Lex/Lexer.h"		#include "clang/Lex/Lexer.h"
#include "llvm/ADT/STLExtras.h"		#include "llvm/ADT/STLExtras.h"
#include "llvm/Support/Allocator.h"		#include "llvm/Support/Allocator.h"
#include "llvm/Support/Debug.h"		#include "llvm/Support/Debug.h"
#include "llvm/Support/Path.h"		#include "llvm/Support/Path.h"
#include "llvm/Support/Regex.h"		#include "llvm/Support/Regex.h"
#include "llvm/Support/YAMLTraits.h"		#include "llvm/Support/YAMLTraits.h"
#include <algorithm>		#include <algorithm>
#include <memory>
#include <string>		#include <string>

#define DEBUG_TYPE "format-formatter"		#define DEBUG_TYPE "format-formatter"

using clang::format::FormatStyle;		using clang::format::FormatStyle;

LLVM_YAML_IS_SEQUENCE_VECTOR(clang::format::FormatStyle::IncludeCategory)		LLVM_YAML_IS_SEQUENCE_VECTOR(clang::format::FormatStyle::IncludeCategory)
LLVM_YAML_IS_SEQUENCE_VECTOR(clang::format::FormatStyle::RawStringFormat)		LLVM_YAML_IS_SEQUENCE_VECTOR(clang::format::FormatStyle::RawStringFormat)
[... 400 lines not shown ...]	static void enumeration(IO &IO, FormatStyle::IncludeBlocksStyle &Value) {
IO.enumCase(Value, "Preserve", FormatStyle::IBS_Preserve);		IO.enumCase(Value, "Preserve", FormatStyle::IBS_Preserve);
IO.enumCase(Value, "Merge", FormatStyle::IBS_Merge);		IO.enumCase(Value, "Merge", FormatStyle::IBS_Merge);
IO.enumCase(Value, "Regroup", FormatStyle::IBS_Regroup);		IO.enumCase(Value, "Regroup", FormatStyle::IBS_Regroup);
}		}
};		};

template <> struct MappingTraits<FormatStyle::RawStringFormat> {		template <> struct MappingTraits<FormatStyle::RawStringFormat> {
static void mapping(IO &IO, FormatStyle::RawStringFormat &Format) {		static void mapping(IO &IO, FormatStyle::RawStringFormat &Format) {
IO.mapOptional("Delimiter", Format.Delimiter);
IO.mapOptional("Language", Format.Language);		IO.mapOptional("Language", Format.Language);
IO.mapOptional("BasedOnStyle", Format.BasedOnStyle);		IO.mapOptional("Delimiters", Format.Delimiters);
		IO.mapOptional("EnclosingFunctionNames", Format.EnclosingFunctionNames);
		IO.mapOptional("CanonicalDelimiter", Format.CanonicalDelimiter);
}		}
};		};

// Allows to read vector<FormatStyle> while keeping default values.		// Allows to read vector<FormatStyle> while keeping default values.
// IO.getContext() should contain a pointer to the FormatStyle structure, that		// IO.getContext() should contain a pointer to the FormatStyle structure, that
// will be used to get default values for missing keys.		// will be used to get default values for missing keys.
// If the first element has no Language specified, it will be treated as the		// If the first element has no Language specified, it will be treated as the
// default one for the following elements.		// default one for the following elements.
[... 167 lines not shown ...]	FormatStyle getLLVMStyle() {
LLVMStyle.NamespaceIndentation = FormatStyle::NI_None;		LLVMStyle.NamespaceIndentation = FormatStyle::NI_None;
LLVMStyle.ObjCBlockIndentWidth = 2;		LLVMStyle.ObjCBlockIndentWidth = 2;
LLVMStyle.ObjCSpaceAfterProperty = false;		LLVMStyle.ObjCSpaceAfterProperty = false;
LLVMStyle.ObjCSpaceBeforeProtocolList = true;		LLVMStyle.ObjCSpaceBeforeProtocolList = true;
LLVMStyle.PointerAlignment = FormatStyle::PAS_Right;		LLVMStyle.PointerAlignment = FormatStyle::PAS_Right;
LLVMStyle.SpacesBeforeTrailingComments = 1;		LLVMStyle.SpacesBeforeTrailingComments = 1;
LLVMStyle.Standard = FormatStyle::LS_Cpp11;		LLVMStyle.Standard = FormatStyle::LS_Cpp11;
LLVMStyle.UseTab = FormatStyle::UT_Never;		LLVMStyle.UseTab = FormatStyle::UT_Never;
LLVMStyle.RawStringFormats = {{"pb", FormatStyle::LK_TextProto, "google"}};
LLVMStyle.ReflowComments = true;		LLVMStyle.ReflowComments = true;
LLVMStyle.SpacesInParentheses = false;		LLVMStyle.SpacesInParentheses = false;
LLVMStyle.SpacesInSquareBrackets = false;		LLVMStyle.SpacesInSquareBrackets = false;
LLVMStyle.SpaceInEmptyParentheses = false;		LLVMStyle.SpaceInEmptyParentheses = false;
LLVMStyle.SpacesInContainerLiterals = true;		LLVMStyle.SpacesInContainerLiterals = true;
LLVMStyle.SpacesInCStyleCastParentheses = false;		LLVMStyle.SpacesInCStyleCastParentheses = false;
LLVMStyle.SpaceAfterCStyleCast = false;		LLVMStyle.SpaceAfterCStyleCast = false;
LLVMStyle.SpaceAfterTemplateKeyword = true;		LLVMStyle.SpaceAfterTemplateKeyword = true;
[... 37 lines not shown ...]	FormatStyle getGoogleStyle(FormatStyle::LanguageKind Language) {
GoogleStyle.IncludeCategories = {		GoogleStyle.IncludeCategories = {
{"^<ext/.*\\.h>", 2}, {"^<.*\\.h>", 1}, {"^<.*", 2}, {".*", 3}};		{"^<ext/.*\\.h>", 2}, {"^<.*\\.h>", 1}, {"^<.*", 2}, {".*", 3}};
GoogleStyle.IncludeIsMainRegex = "([-_](test|unittest))?$";		GoogleStyle.IncludeIsMainRegex = "([-_](test|unittest))?$";
GoogleStyle.IndentCaseLabels = true;		GoogleStyle.IndentCaseLabels = true;
GoogleStyle.KeepEmptyLinesAtTheStartOfBlocks = false;		GoogleStyle.KeepEmptyLinesAtTheStartOfBlocks = false;
GoogleStyle.ObjCSpaceAfterProperty = false;		GoogleStyle.ObjCSpaceAfterProperty = false;
GoogleStyle.ObjCSpaceBeforeProtocolList = false;		GoogleStyle.ObjCSpaceBeforeProtocolList = false;
GoogleStyle.PointerAlignment = FormatStyle::PAS_Left;		GoogleStyle.PointerAlignment = FormatStyle::PAS_Left;
		GoogleStyle.RawStringFormats = {
		{FormatStyle::LK_TextProto,
		/*Delimiters=*/
		{
		"pb",
		"PB",
		"proto",
		"PROTO",
		"textproto",
		"TEXTPROTO",
		},
		/*EnclosingFunctionNames=*/
		{
		"EqualsProto",
		"PARSE_TEXT_PROTO",
		"ParseTextProto",
		},
		/*CanonicalDelimiter=*/""},
		};
GoogleStyle.SpacesBeforeTrailingComments = 2;		GoogleStyle.SpacesBeforeTrailingComments = 2;
GoogleStyle.Standard = FormatStyle::LS_Auto;		GoogleStyle.Standard = FormatStyle::LS_Auto;

GoogleStyle.PenaltyReturnTypeOnItsOwnLine = 200;		GoogleStyle.PenaltyReturnTypeOnItsOwnLine = 200;
GoogleStyle.PenaltyBreakBeforeFirstCallParameter = 1;		GoogleStyle.PenaltyBreakBeforeFirstCallParameter = 1;

if (Language == FormatStyle::LK_Java) {		if (Language == FormatStyle::LK_Java) {
GoogleStyle.AlignAfterOpenBracket = FormatStyle::BAS_DontAlign;		GoogleStyle.AlignAfterOpenBracket = FormatStyle::BAS_DontAlign;
[... 177 lines not shown ...]	for (unsigned j = 0; j < i; ++j) {
<< " and " << i << "\n");		<< " and " << i << "\n");
return make_error_code(ParseError::Error);		return make_error_code(ParseError::Error);
}		}
}		}
}		}
// Look for a suitable configuration starting from the end, so we can		// Look for a suitable configuration starting from the end, so we can
// find the configuration for the specific language first, and the default		// find the configuration for the specific language first, and the default
// configuration (which can only be at slot 0) after it.		// configuration (which can only be at slot 0) after it.
		bool LanguageFound = false;
for (int i = Styles.size() - 1; i >= 0; --i) {		for (int i = Styles.size() - 1; i >= 0; --i) {
if (Styles[i].Language == Language ||		if (!LanguageFound && (Styles[i].Language == Language ||
Styles[i].Language == FormatStyle::LK_None) {		Styles[i].Language == FormatStyle::LK_None)) {
*Style = Styles[i];		*Style = Styles[i];
Style->Language = Language;		Style->Language = Language;
return make_error_code(ParseError::Success);		LanguageFound = true;
		break;
}		}
}		}
		if (!LanguageFound)
		djasper [Done]: Prefer early exit, i.e. if (!LanguageFound) return make_error_code(ParseError::Unsuitable); ...
return make_error_code(ParseError::Unsuitable);		return make_error_code(ParseError::Unsuitable);
		djasper [Not Done]: I think this is getting a bit convoluted, and I don't even understand whether we are doing what is documented (even before this patch). In lines 892-905, we verify that: (1) only the first Style in the file is allowed to be LK_None; (2) no language is duplicated. That seems good. According to the documentation: "The first section may have no language set, it will set the default style options for all languages." Does the latter part actually happen? It seems to me that we are just setting "Style" to the style configured for a specific language, completely ignoring values that might have been set in the LK_None style. Or is that somehow happening when reading the JSON? Independent of that, I think we should use this structure more explicitly. I think we should create an additional class (FormatStyles or FormatStyleSet or something) that is returned by this function and handed to the formatting library. This function then doesn't need to look at the language anymore. That class should then have a function getFormatStyle(LanguageKind Language); that returns the style for a particular language (doing the default logic, etc.). Internally, it can likely just have a map<LK, Style>, and I don't think you need to pre-fill that for all language kinds. If a language kind is not in the map, you can just return what's stored for LK_None. WDYT?
		krasimir (author) [Not Done]: Yes, defaulting to None for missing language specifications is handled at line 912: if (!LanguageFound && (Styles[i].Language == Language || Styles[i].Language == FormatStyle::LK_None ... I was thinking of the FormatStyleSet approach, but the problem is that it has repercussions all over the library. We could indeed update this specific function that way, but fundamentally the additional language styles are part of the FormatStyle and need to be recorded inside it somehow. That's why I went with KISS and made this function handle that case directly.
		djasper [Not Done]: But what we say we implement is not just defaulting to LK_None. I think the documentation suggests that we implement very basic inheritance. E.g. if the style for LK_None sets the ColumnLimit to 42, I would expect the styles for all other languages that don't explicitly set a ColumnLimit to also use 42. I don't think this is currently implemented, and I don't think this patch implements it. But I think we should :). I agree that the FormatStyleSet approach would have some consequences, but I also think that it is much cleaner. Your current solution feels like we are working around technical debt and creating more technical debt to do it :(. Maybe Manuel has thoughts here?
		klimek [Not Done]: I agree that we should test the inheritance if it's documented :) I don't have super strong feelings about where the logic implementing that inheritance lives; both the function that parses the data and the data structure we hand around seem fine for that, as it's easy to change. That said, I do agree that FormatStyle being a recursive graph data structure is weird and unexpected; I had to ask Daniel what he actually meant here, and then went "how does this even work??!" before realizing that FormatStyle is both a single style and a container for all other styles. In conclusion, I agree with Daniel: we should have a FormatStyleSet and pass that around everywhere. That's a pure refactoring that looks like it would already make the current code better, so I also agree it should be done first and be covered by existing tests.
		for (int i = Styles.size() - 1; i >= 0; --i) {
		if (Styles[i].Language == FormatStyle::LK_None) {
		for (size_t AdditionalLanguage = 0;
		AdditionalLanguage < FormatStyle::kLanguageKindSize;
		++AdditionalLanguage) {
		FormatStyle AdditionalLanguageStyle = Styles[i];
		AdditionalLanguageStyle.Language =
		static_cast<FormatStyle::LanguageKind>(AdditionalLanguage);
		if (!Style->AdditionalLanguageStyles[AdditionalLanguage])
		Style->AdditionalLanguageStyles[AdditionalLanguage].reset(
		new FormatStyle(AdditionalLanguageStyle));
		}
		continue;
		}
		Style->AdditionalLanguageStyles[Styles[i].Language].reset(
		new FormatStyle(Styles[i]));
		}
		return make_error_code(ParseError::Success);
}		}
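
The inline review thread in the function above proposes a FormatStyleSet-like class with a getFormatStyle(LanguageKind) accessor, backed by a map and falling back to the LK_None entry. A minimal sketch of that lookup might look like the following. It assumes simplified stand-in types: Style, LanguageKind, and the add/get member names are hypothetical and are not clang's API. It implements only whole-style fallback, not the per-option inheritance that djasper asks about.

```cpp
#include <map>

// Hypothetical stand-ins for FormatStyle::LanguageKind and FormatStyle.
enum class LanguageKind { None, Cpp, Java, TextProto };

struct Style {
  LanguageKind Language = LanguageKind::None;
  unsigned ColumnLimit = 80;
};

// Hypothetical container along the lines suggested in the review.
class FormatStyleSet {
public:
  // Stores (or overwrites) the style registered for S.Language.
  void add(const Style &S) { Styles[S.Language] = S; }

  // Looks up the style for Language. If none is registered, falls back to the
  // entry stored under LanguageKind::None. Returns false if neither exists.
  bool get(LanguageKind Language, Style &Result) const {
    std::map<LanguageKind, Style>::const_iterator It = Styles.find(Language);
    if (It == Styles.end())
      It = Styles.find(LanguageKind::None);
    if (It == Styles.end())
      return false;
    Result = It->second;
    Result.Language = Language; // The returned style always reports Language.
    return true;
  }

private:
  std::map<LanguageKind, Style> Styles;
};
```

With this shape the map does not need to be pre-filled for every language kind, which is the property the reviewer points out; the copy loop over kLanguageKindSize in the patch would become unnecessary under that assumption.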

std::string configurationAsText(const FormatStyle &Style) {		std::string configurationAsText(const FormatStyle &Style) {
std::string Text;		std::string Text;
llvm::raw_string_ostream Stream(Text);		llvm::raw_string_ostream Stream(Text);
llvm::yaml::Output Output(Stream);		llvm::yaml::Output Output(Stream);
// We use the same mapping method for input and output, so we need a non-const		// We use the same mapping method for input and output, so we need a non-const
// reference here.		// reference here.
[... 1,306 lines not shown ...]

unittests/Format/FormatTest.cpp

[... 10,396 lines not shown ...]	CHECK_PARSE("IncludeCategories:\n"
" Priority: 2\n"		" Priority: 2\n"
" - Regex: .*\n"		" - Regex: .*\n"
" Priority: 1",		" Priority: 1",
IncludeCategories, ExpectedCategories);		IncludeCategories, ExpectedCategories);
CHECK_PARSE("IncludeIsMainRegex: 'abc$'", IncludeIsMainRegex, "abc$");		CHECK_PARSE("IncludeIsMainRegex: 'abc$'", IncludeIsMainRegex, "abc$");

Style.RawStringFormats.clear();		Style.RawStringFormats.clear();
std::vector<FormatStyle::RawStringFormat> ExpectedRawStringFormats = {		std::vector<FormatStyle::RawStringFormat> ExpectedRawStringFormats = {
{"pb", FormatStyle::LK_TextProto, "llvm"},		{FormatStyle::LK_TextProto,
{"cpp", FormatStyle::LK_Cpp, "google"}};		{"pb", "proto"},
		{"PARSE_TEXT_PROTO"},
		"textproto"},
		{FormatStyle::LK_Cpp, {"cc", "cpp"}, {"C_CODEBLOCK", "CPPEVAL"}, ""}};

CHECK_PARSE("RawStringFormats:\n"		CHECK_PARSE("RawStringFormats:\n"
" - Delimiter: 'pb'\n"		" - Language: TextProto\n"
" Language: TextProto\n"		" Delimiters:\n"
" BasedOnStyle: llvm\n"		" - 'pb'\n"
" - Delimiter: 'cpp'\n"		" - 'proto'\n"
" Language: Cpp\n"		" EnclosingFunctionNames:\n"
" BasedOnStyle: google",		" - 'PARSE_TEXT_PROTO'\n"
		" CanonicalDelimiter: 'textproto'\n"
		" - Language: Cpp\n"
		" Delimiters:\n"
		" - 'cc'\n"
		" - 'cpp'\n"
		" EnclosingFunctionNames:\n"
		" - 'C_CODEBLOCK'\n"
		" - 'CPPEVAL'",
RawStringFormats, ExpectedRawStringFormats);		RawStringFormats, ExpectedRawStringFormats);
}		}

TEST_F(FormatTest, ParsesConfigurationWithLanguages) {		TEST_F(FormatTest, ParsesConfigurationWithLanguages) {
FormatStyle Style = {};		FormatStyle Style = {};
Style.Language = FormatStyle::LK_Cpp;		Style.Language = FormatStyle::LK_Cpp;
CHECK_PARSE("Language: Cpp\n"		CHECK_PARSE("Language: Cpp\n"
"IndentWidth: 12",		"IndentWidth: 12",
[... 1,412 lines not shown ...]

unittests/Format/FormatTestRawStrings.cpp

[... 59 lines not shown ...]	FormatStyle getLLVMStyleWithColumns(unsigned ColumnLimit) {
return getStyleWithColumns(getLLVMStyle(), ColumnLimit);		return getStyleWithColumns(getLLVMStyle(), ColumnLimit);
}		}

int ReplacementCount;		int ReplacementCount;

FormatStyle getRawStringPbStyleWithColumns(unsigned ColumnLimit) {		FormatStyle getRawStringPbStyleWithColumns(unsigned ColumnLimit) {
FormatStyle Style = getLLVMStyle();		FormatStyle Style = getLLVMStyle();
Style.ColumnLimit = ColumnLimit;		Style.ColumnLimit = ColumnLimit;
Style.RawStringFormats = {{/*Delimiter=*/"pb",		Style.AdditionalLanguageStyles[FormatStyle::LK_TextProto] =
/*Kind=*/FormatStyle::LK_TextProto,		std::make_shared<FormatStyle>(
/*BasedOnStyle=*/"google"}};		getGoogleStyle(FormatStyle::LK_TextProto));
		Style.RawStringFormats = {{/*Language=*/FormatStyle::LK_TextProto,
		/*Delimiters=*/{"pb"},
		/*EnclosingFunctionNames=*/{},
		/*CanonicalDelimiter=*/""}};
return Style;		return Style;
}		}

FormatStyle getRawStringLLVMCppStyleBasedOn(std::string BasedOnStyle) {		FormatStyle getRawStringLLVMCppStyleBasedOn(std::string Name) {
FormatStyle Style = getLLVMStyle();		FormatStyle Style = getLLVMStyle();
Style.RawStringFormats = {{/*Delimiter=*/"cpp",		FormatStyle BasedOnStyle = getLLVMStyle();
/*Kind=*/FormatStyle::LK_Cpp, BasedOnStyle}};		getPredefinedStyle(Name, FormatStyle::LK_Cpp, &BasedOnStyle);
		Style.AdditionalLanguageStyles[FormatStyle::LK_Cpp] =
		std::make_shared<FormatStyle>(BasedOnStyle);
		Style.RawStringFormats = {{/*Language=*/FormatStyle::LK_Cpp,
		/*Delimiters=*/{"cpp"},
		/*EnclosingFunctionNames=*/{},
		/*CanonicalDelimiter=*/""}};
return Style;		return Style;
}		}

FormatStyle getRawStringGoogleCppStyleBasedOn(std::string BasedOnStyle) {		FormatStyle getRawStringGoogleCppStyleBasedOn(std::string Name) {
FormatStyle Style = getGoogleStyle(FormatStyle::LK_Cpp);		FormatStyle Style = getGoogleStyle(FormatStyle::LK_Cpp);
Style.RawStringFormats = {{/*Delimiter=*/"cpp",		FormatStyle BasedOnStyle = getLLVMStyle();
/*Kind=*/FormatStyle::LK_Cpp, BasedOnStyle}};		getPredefinedStyle(Name, FormatStyle::LK_Cpp, &BasedOnStyle);
		Style.AdditionalLanguageStyles[FormatStyle::LK_Cpp] =
		std::make_shared<FormatStyle>(BasedOnStyle);
		Style.RawStringFormats = {{/*Language=*/FormatStyle::LK_Cpp,
		/*Delimiters=*/{"cpp"},
		/*EnclosingFunctionNames=*/{},
		/*CanonicalDelimiter=*/""}};
return Style;		return Style;
}		}

// Gcc 4.8 doesn't support raw string literals in macros, which breaks some		// Gcc 4.8 doesn't support raw string literals in macros, which breaks some
// build bots. We use this function instead.		// build bots. We use this function instead.
void expect_eq(const std::string Expected, const std::string Actual) {		void expect_eq(const std::string Expected, const std::string Actual) {
EXPECT_EQ(Expected, Actual);		EXPECT_EQ(Expected, Actual);
}		}
};		};

TEST_F(FormatTestRawStrings, ReformatsAccordingToBaseStyle) {		TEST_F(FormatTestRawStrings, ReformatsAccordingToBaseStyle) {
// llvm style puts '*' on the right.		// llvm style puts '*' on the right.
// google style puts '*' on the left.		// google style puts '*' on the left.

// Use the llvm style if the raw string style has no BasedOnStyle.		// Use llvm style outside and the google style inside if the raw string style
		// is based on google.
expect_eq(R"test(int *i = R"cpp(int *p = nullptr;)cpp")test",		expect_eq(R"test(int *i = R"cpp(int* p = nullptr;)cpp")test",
format(R"test(int * i = R"cpp(int * p = nullptr;)cpp")test",		format(R"test(int * i = R"cpp(int * p = nullptr;)cpp")test",
getRawStringLLVMCppStyleBasedOn("")));		getRawStringLLVMCppStyleBasedOn("google")));

// Use the google style if the raw string style has BasedOnStyle=google.		// Use llvm style if the raw string style has no BasedOnStyle.
expect_eq(R"test(int *i = R"cpp(int* p = nullptr;)cpp")test",		expect_eq(R"test(int *i = R"cpp(int *p = nullptr;)cpp")test",
format(R"test(int * i = R"cpp(int * p = nullptr;)cpp")test",		format(R"test(int * i = R"cpp(int * p = nullptr;)cpp")test",
getRawStringLLVMCppStyleBasedOn("google")));		getRawStringLLVMCppStyleBasedOn("")));

// Use the llvm style if the raw string style has no BasedOnStyle=llvm.		// Use google style outside and the llvm style inside if the raw string style
		// is based on llvm.
expect_eq(R"test(int* i = R"cpp(int *p = nullptr;)cpp")test",		expect_eq(R"test(int* i = R"cpp(int *p = nullptr;)cpp")test",
format(R"test(int * i = R"cpp(int * p = nullptr;)cpp")test",		format(R"test(int * i = R"cpp(int * p = nullptr;)cpp")test",
getRawStringGoogleCppStyleBasedOn("llvm")));		getRawStringGoogleCppStyleBasedOn("llvm")));
}		}

TEST_F(FormatTestRawStrings, MatchesDelimitersCaseSensitively) {		TEST_F(FormatTestRawStrings, MatchesDelimitersCaseSensitively) {
// Don't touch the 'PB' raw string, format the 'pb' raw string.		// Don't touch the 'PB' raw string, format the 'pb' raw string.
expect_eq(R"test(		expect_eq(R"test(
s = R"PB(item:1)PB";		s = R"PB(item:1)PB";
t = R"pb(item: 1)pb";)test",		t = R"pb(item: 1)pb";)test",
format(R"test(		format(R"test(
s = R"PB(item:1)PB";		s = R"PB(item:1)PB";
t = R"pb(item:1)pb";)test",		t = R"pb(item:1)pb";)test",
getRawStringPbStyleWithColumns(40)));		getRawStringPbStyleWithColumns(40)));

FormatStyle MixedStyle = getLLVMStyle();
MixedStyle.RawStringFormats = {
{/*Delimiter=*/"cpp", /*Kind=*/FormatStyle::LK_Cpp,
/*BasedOnStyle=*/"llvm"},
{/*Delimiter=*/"CPP", /*Kind=*/FormatStyle::LK_Cpp,
/*BasedOnStyle=*/"google"}};

// Format the 'cpp' raw string with '*' on the right.
// Format the 'CPP' raw string with '*' on the left.
// Do not format the 'Cpp' raw string.
// Do not format non-raw strings.
expect_eq(R"test(
a = R"cpp(int *i = 0;)cpp";
b = R"CPP(int* j = 0;)CPP";
c = R"Cpp(int * k = 0;)Cpp";
d = R"cpp(int * k = 0;)Cpp";)test",
format(R"test(
a = R"cpp(int * i = 0;)cpp";
b = R"CPP(int * j = 0;)CPP";
c = R"Cpp(int * k = 0;)Cpp";
d = R"cpp(int * k = 0;)Cpp";)test",
MixedStyle));
}		}

TEST_F(FormatTestRawStrings, ReformatsShortRawStringsOnSingleLine) {		TEST_F(FormatTestRawStrings, ReformatsShortRawStringsOnSingleLine) {
expect_eq(		expect_eq(
R"test(P p = TP(R"pb()pb");)test",		R"test(P p = TP(R"pb()pb");)test",
format(		format(
R"test(P p = TP(R"pb( )pb");)test",		R"test(P p = TP(R"pb( )pb");)test",
getRawStringPbStyleWithColumns(40)));		getRawStringPbStyleWithColumns(40)));
[... 50 lines not shown ...]	TPPPPPPPPPPPPPPP(
format(R"test(		format(R"test(
P p = TPPPPPPPPPPPPPPP(R"pb(item_1: 1, item_2: 2, item_3: 3)pb");)test",		P p = TPPPPPPPPPPPPPPP(R"pb(item_1: 1, item_2: 2, item_3: 3)pb");)test",
getRawStringPbStyleWithColumns(40)));		getRawStringPbStyleWithColumns(40)));

expect_eq(R"test(		expect_eq(R"test(
P p = TP(R"pb(item_1 <1>		P p = TP(R"pb(item_1 <1>
item_2: <2>		item_2: <2>
item_3 {})pb");)test",		item_3 {})pb");)test",
format(R"test(		format(R"test(
P p = TP(R"pb(item_1<1> item_2:<2> item_3{ })pb");)test",		P p = TP(R"pb(item_1<1> item_2:<2> item_3{ })pb");)test",
getRawStringPbStyleWithColumns(40)));		getRawStringPbStyleWithColumns(40)));

expect_eq(		expect_eq(
R"test(		R"test(
P p = TP(R"pb(item_1: 1,		P p = TP(R"pb(item_1: 1,
item_2: 2,		item_2: 2,
item_3: 3,		item_3: 3,
item_4: 4)pb");)test",		item_4: 4)pb");)test",
format(		format(
[... 286 lines not shown ...]	ASSERT_TRUE(ParseFromString(R"pb(item_1: 1, item_2: 2)pb"), ptr);)test",
expect_eq(R"test(		expect_eq(R"test(
ASSERT_TRUE(		ASSERT_TRUE(
ParseFromString(		ParseFromString(
R"pb(item_1: 1 item_2: 2)pb"),		R"pb(item_1: 1 item_2: 2)pb"),
ptr);)test",		ptr);)test",
format(R"test(		format(R"test(
ASSERT_TRUE(ParseFromString(R"pb(item_1: 1 item_2: 2)pb"), ptr);)test",		ASSERT_TRUE(ParseFromString(R"pb(item_1: 1 item_2: 2)pb"), ptr);)test",
getRawStringPbStyleWithColumns(40)));		getRawStringPbStyleWithColumns(40)));

}		}

TEST_F(FormatTestRawStrings, RawStringsInOperands) {		TEST_F(FormatTestRawStrings, RawStringsInOperands) {
// Formats the raw string first operand of a binary operator expression.		// Formats the raw string first operand of a binary operator expression.
expect_eq(R"test(auto S = R"pb(item_1: 1)pb" + rest;)test",		expect_eq(R"test(auto S = R"pb(item_1: 1)pb" + rest;)test",
format(R"test(auto S = R"pb(item_1:1)pb" + rest;)test",		format(R"test(auto S = R"pb(item_1:1)pb" + rest;)test",
getRawStringPbStyleWithColumns(40)));		getRawStringPbStyleWithColumns(40)));

[... 110 lines not shown ...]	auto S =
(count < 3)		(count < 3)
? R"pb(item_1: 1)pb"		? R"pb(item_1: 1)pb"
: R"pb(item_2: 2, item_3: 3)pb";		: R"pb(item_2: 2, item_3: 3)pb";
)test",		)test",
format(R"test(		format(R"test(
auto S=(count<3)?R"pb(item_1:1)pb":R"pb(item_2:2,item_3:3)pb";		auto S=(count<3)?R"pb(item_1:1)pb":R"pb(item_2:2,item_3:3)pb";
)test",		)test",
getRawStringPbStyleWithColumns(40)));		getRawStringPbStyleWithColumns(40)));

}		}

TEST_F(FormatTestRawStrings, PrefixAndSuffixAlignment) {		TEST_F(FormatTestRawStrings, PrefixAndSuffixAlignment) {
// Keep the suffix at the end of line if not on newline.		// Keep the suffix at the end of line if not on newline.
expect_eq(R"test(		expect_eq(R"test(
int s() {		int s() {
auto S = PTP(		auto S = PTP(
R"pb(		R"pb(
[... 69 lines not shown ...]
}		}

TEST_F(FormatTestRawStrings, DontFormatNonRawStrings) {		TEST_F(FormatTestRawStrings, DontFormatNonRawStrings) {
expect_eq(R"test(a = R"pb(key:value)";)test",		expect_eq(R"test(a = R"pb(key:value)";)test",
format(R"test(a = R"pb(key:value)";)test",		format(R"test(a = R"pb(key:value)";)test",
getRawStringPbStyleWithColumns(20)));		getRawStringPbStyleWithColumns(20)));
}		}

		TEST_F(FormatTestRawStrings, UpdatesToCanonicalDelimiters) {
		FormatStyle Style = getRawStringPbStyleWithColumns(25);
		Style.RawStringFormats[0].CanonicalDelimiter = "proto";
		expect_eq(R"test(a = R"proto(key: value)proto";)test",
		format(R"test(a = R"pb(key:value)pb";)test", Style));

		// Don't update to canonical delimiter if it occurs as a raw string suffix in
		// the raw string content.
		expect_eq(R"test(a = R"pb(key: ")proto")pb";)test",
		format(R"test(a = R"pb(key:")proto")pb";)test", Style));
		}
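
The second expectation in the test above relies on a safety check: the canonical delimiter must not be used if the sequence `)delimiter"` already occurs in the raw string contents, because that would terminate the literal early. A minimal sketch of such a check follows. The helper name chooseDelimiter and its signature are hypothetical, and treating an empty canonical delimiter as "keep the original" is an assumption based on the `/*CanonicalDelimiter=*/""` initializers in the tests, not a confirmed detail of the patch.

```cpp
#include <string>

// Hypothetical helper: picks the delimiter to emit for a raw string literal.
// Assumption: an empty CanonicalDelimiter means "leave the delimiter alone".
std::string chooseDelimiter(const std::string &OldDelimiter,
                            const std::string &CanonicalDelimiter,
                            const std::string &Contents) {
  if (CanonicalDelimiter.empty() || CanonicalDelimiter == OldDelimiter)
    return OldDelimiter;
  // A raw string R"d(...)d" ends at the first occurrence of )d" in its body,
  // so switching to a delimiter whose terminator appears in the contents
  // would change the meaning of the literal.
  const std::string Terminator = ")" + CanonicalDelimiter + "\"";
  if (Contents.find(Terminator) != std::string::npos)
    return OldDelimiter;
  return CanonicalDelimiter;
}
```

For the two cases in the test, the contents `key: value` allow the switch from `pb` to `proto`, while the contents `key: ")proto"` contain `)proto"` and so keep `pb`.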

		TEST_F(FormatTestRawStrings, FormatsRawStringsWithEnclosingFunctionName) {
		FormatStyle Style = getRawStringPbStyleWithColumns(40);
		Style.RawStringFormats[0].EnclosingFunctionNames.push_back(
		"PARSE_TEXT_PROTO");
		Style.RawStringFormats[0].EnclosingFunctionNames.push_back("ParseTextProto");
		expect_eq(R"test(a = PARSE_TEXT_PROTO(R"(key: value)");)test",
		format(R"test(a = PARSE_TEXT_PROTO(R"(key:value)");)test", Style));

		expect_eq(R"test(
		a = PARSE_TEXT_PROTO /**/ (
		/**/ R"(key: value)");)test",
		format(R"test(
		a = PARSE_TEXT_PROTO/**/(/**/R"(key:value)");)test",
		Style));

		expect_eq(R"test(
		a = ParseTextProto<ProtoType>(
		R"(key: value)");)test",
		format(R"test(
		a = ParseTextProto<ProtoType>(R"(key:value)");)test",
		Style));
		}
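
The tests above exercise two ways of recognizing a raw string's language: by its delimiter and by the name of the enclosing function. A rough sketch of such a lookup follows, using a simplified mirror of the RawStringFormat fields mapped in Format.cpp (Language, Delimiters, EnclosingFunctionNames, CanonicalDelimiter). The function languageForRawString, the contains helper, and the choice to check delimiters before function names are assumptions for illustration; the patch may order or structure this differently.

```cpp
#include <algorithm>
#include <string>
#include <vector>

// Hypothetical stand-in for FormatStyle::LanguageKind.
enum class LanguageKind { None, Cpp, TextProto };

// Simplified mirror of the RawStringFormat fields shown in the diff.
struct RawStringFormat {
  LanguageKind Language;
  std::vector<std::string> Delimiters;
  std::vector<std::string> EnclosingFunctionNames;
  std::string CanonicalDelimiter;
};

// Exact, case-sensitive membership test.
static bool contains(const std::vector<std::string> &Names,
                     const std::string &Name) {
  return std::find(Names.begin(), Names.end(), Name) != Names.end();
}

// Returns the language configured for a raw string, or LanguageKind::None if
// neither the delimiter nor the enclosing function name matches any format.
// Assumption: a delimiter match takes precedence over a function-name match.
LanguageKind languageForRawString(const std::vector<RawStringFormat> &Formats,
                                  const std::string &Delimiter,
                                  const std::string &EnclosingFunction) {
  for (const RawStringFormat &Format : Formats)
    if (contains(Format.Delimiters, Delimiter))
      return Format.Language;
  for (const RawStringFormat &Format : Formats)
    if (contains(Format.EnclosingFunctionNames, EnclosingFunction))
      return Format.Language;
  return LanguageKind::None;
}
```

Because the comparison is exact, a `PB` delimiter does not match a format that lists only `pb`, which is consistent with the MatchesDelimitersCaseSensitively test.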

} // end namespace		} // end namespace
} // end namespace format		} // end namespace format
} // end namespace clang		} // end namespace clang

This is an archive of the discontinued LLVM Phabricator instance.

[clang-format] Reorganize raw string delimiters
Needs Review, Public

Diff 125939

docs/ClangFormatStyleOptions.rst

include/clang/Format/Format.h

lib/Format/ContinuationIndenter.h

lib/Format/ContinuationIndenter.cpp

lib/Format/Format.cpp

unittests/Format/FormatTest.cpp

unittests/Format/FormatTestRawStrings.cpp