Download Raw Diff

Details

Reviewers

Commits

rGaecc59c5f948: [LibTooling] Change Transformer's TextGenerator to a partial function.
rC359574: [LibTooling] Change Transformer's TextGenerator to a partial function.
rL359574: [LibTooling] Change Transformer's TextGenerator to a partial function.

Summary

Changes the signature of the std::function to return an Expected<std::string>
instead of std::string to allow for (non-fatal) failures. Previously, we
expected that any failures would be expressed with assertions. However, that's
unfriendly to running the code in servers or other places that don't want their
library calls to crash the program.

Diff Detail

Repository

rG LLVM Github Monorepo

Build Status

Buildable 31108
Build 31107: arc lint + arc unit

Event Timeline

ymandel created this revision.Apr 23 2019, 6:53 AM

Herald added a project: Restricted Project. · View Herald TranscriptApr 23 2019, 6:53 AM

Harbormaster completed remote builds in B30886: Diff 196242.Apr 23 2019, 6:53 AM

Why would we consider this a legitimate failure, rather than a programming error?
Same argument could be made about any form of format-string-like functions, e.g. llvm::formatv or sprintf.
Yet, they return strings and not Expected<string> or their equivalent.

In D61015#1478539, @ilya-biryukov wrote:

Why would we consider this a legitimate failure, rather than a programming error?
Same argument could be made about any form of format-string-like functions, e.g. llvm::formatv or sprintf.
Yet, they return strings and not Expected<string> or their equivalent.

It's not that it's a legitimate failure so much as an invalid argument -- that is, the caller caused it and therefore should handle the failure. For a standalone tool making the call, I'd argue they should treat it as a programming error and assert to crash. However, a server that takes the input from, say, a user will want to propagate that back to the user.

As for formatv and sprintf -- I think the difference is that this has a more dynamic design. Both of those take explicit arguments at the callsite, vs. TextGenerator which takes a MatchResult. But, that's just a detail -- the key thing is that we're trying to support a usecase where the input may be flawed and we want to fail gracefully. We do so at the cost of the added complexity of Expected<>.

Alternatives:

add a separate validation function. But, this will complicate the design since we'd need to pass two functions rather than one. That is, TextGenerator would need to be a pair of functions.
return an empty string on failure. This has the typical tradeoffs of using a sentinel value.

I'd argue it's the server's job to validate the inputs in that case.

The code that landed so far clearly looks like a C++ DSL to describe transformations of the source. While it can be used a dependency in the server-side, I don't see why doing user-input checking should be done by the library, rather than the server itself.
User input would have to be transformed into the calls of the C++ API somehow, that looks like a proper layer to do the validation.

In D61015#1478586, @ilya-biryukov wrote:

I'd argue it's the server's job to validate the inputs in that case.

The code that landed so far clearly looks like a C++ DSL to describe transformations of the source. While it can be used a dependency in the server-side, I don't see why doing user-input checking should be done by the library, rather than the server itself.
User input would have to be transformed into the calls of the C++ API somehow, that looks like a proper layer to do the validation.

The problem is that validation can't* be done in the abstract. It has to be done with respect to a specific match result. Unfortunately, the server won't be layered directly on top of the call to the TextGenerator -- the rewrite rule is interposed between them. That is, the client of the TG is the rewriterule and that's where the validation has to happen, but the TG is opaque to the rewrite rule, so it can't hardcode that validation logic. So, we'd need to change TextGenerator to bundle a validation function and string generator. Is it still worth it in that case?

*"can't" is a bit strong. I could imagine a design which allows full analysis and validation of rewrite rules before they are executed, but it would be far more sophisticated than the current design.

In D61015#1478886, @ymandel wrote:

The problem is that validation can't* be done in the abstract. It has to be done with respect to a specific match result. Unfortunately, the server won't be layered directly on top of the call to the TextGenerator -- the rewrite rule is interposed between them. That is, the client of the TG is the rewriterule and that's where the validation has to happen, but the TG is opaque to the rewrite rule, so it can't hardcode that validation logic. So, we'd need to change TextGenerator to bundle a validation function and string generator. Is it still worth it in that case?

*"can't" is a bit strong. I could imagine a design which allows full analysis and validation of rewrite rules before they are executed, but it would be far more sophisticated than the current design.

Could you provide more details into how the server app is layered? Specifically, what are the user inputs and how do they get translated into the transformer library calls?
I imagine their should be a syntax for textual representation of the generator, the representation should allow referring to the named match results. While parsing this representation, it should be possible to collect all matches of named nodes and make sure there are corresponding binding in the rewrite rule.

If the setup is different, I may be missing where the complexity comes from and I would love to learn about it.

Updated comment to more explicity describe motivation for new signature.

ymandel retitled this revision from [LibTooing] Change Transformer's TextGenerator to a partial function. to [LibTooling] Change Transformer's TextGenerator to a partial function..Apr 29 2019, 10:53 AM

Harbormaster completed remote builds in B31108: Diff 197146.Apr 29 2019, 10:55 AM

Add test for new behavior.

In the process, tweak the handling of errors from TextGenerators in Transformer:
instead of printing to llvm::errs, we set the error in the AtomicChange.

Herald added a subscriber: jfb. · View Herald TranscriptApr 29 2019, 7:59 PM

Harbormaster completed remote builds in B31135: Diff 197250.Apr 29 2019, 8:02 PM

Updates comments on Transformer to make explicit the error reporting.

Harbormaster completed remote builds in B31136: Diff 197255.Apr 29 2019, 8:08 PM

ilya-biryukov added inline comments.Apr 30 2019, 3:28 AM

clang/include/clang/Tooling/Refactoring/Transformer.h
47–56	NIT: maybe shorten a bit, still capturing the essence? Something like the following should be enough: Note that \p TextGenerator is allowed to fail, e.g. when trying to access a matched node that was not bound. Allowing this to fail simplifies error handling for interactive tools like clang-query.
58	Maybe drop `trivially successful`? Does not seem to be super-important.
240	s/it's/its
clang/lib/Tooling/Refactoring/Transformer.cpp
164–169	Maybe follow a typical pattern for handling errors here (to avoid `OrErr` suffixes and an extra `Err` variable)? I.e. auto Replacement = Edit.Replacement(Result); if (!Replacement) return Replacement.takeError(); T.Replacement = std::move(*Replacement);
204	This looks super-complicated. Having `Error` in `AtomicChange` seems like a bad idea in the first place, why would we choose to use it here? The following alternatives would encourage clients to handle errors properly: accept an `Expected<AtomicChange>` in our callback, provide a separate callback to consume errors. WDYT about picking one of those two?

ymandel marked 2 inline comments as done.Apr 30 2019, 5:34 AM

ymandel added inline comments.

clang/lib/Tooling/Refactoring/Transformer.cpp
204	Agreed! I was using `setError` on the assumption that it was the "standard" way to express errors. Given that it seems to be totally ignored otherwise, let's go with option 1. I'll update the revision.

Addresses comments, including error handling style and signature of ChangeConsumer.

Updates testing code to use new ChangeConsumer signature.

Harbormaster completed remote builds in B31154: Diff 197316.Apr 30 2019, 6:44 AM

ymandel marked 4 inline comments as done.Apr 30 2019, 6:51 AM

ymandel added inline comments.

clang/lib/Tooling/Refactoring/Transformer.cpp
164–169	Here and elsewhere.

Thanks! LGTM

This revision is now accepted and ready to land.Apr 30 2019, 9:13 AM

Closed by commit rL359574: [LibTooling] Change Transformer's TextGenerator to a partial function. (authored by ymandel). · Explain WhyApr 30 2019, 9:47 AM

This revision was automatically updated to reflect the committed changes.

ymandel marked an inline comment as done.

Herald added a project: Restricted Project. · View Herald TranscriptApr 30 2019, 9:47 AM

Herald added a subscriber: llvm-commits. · View Herald Transcript

This breaks a test: http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-bootstrap-msan/builds/12112/steps/check-llvm%20check-clang%20stage3%2Fmsan/logs/stdio

[----------] 1 test from TransformerTest
[ RUN      ] TransformerTest.NodePartNameDeclRefFailure
/b/sanitizer-x86_64-linux-bootstrap-msan/build/llvm/tools/clang/unittests/Tooling/TransformerTest.cpp:66: Failure
Value of: MaybeActual
  Actual: false
Expected: true
Rewrite failed. Expecting: 
    struct Y {
      int operator*();
    };
    int neutral(int x) {
      Y y;
      int (Y::*ptr)() = &Y::operator*;
      return *y + x;
    }
  
[  FAILED  ] TransformerTest.NodePartNameDeclRefFailure (83 ms)
[----------] 1 test from TransformerTest (83 ms total)

Can you take a look?

In D61015#1484669, @thakis wrote:

This breaks a test: http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-bootstrap-msan/builds/12112/steps/check-llvm%20check-clang%20stage3%2Fmsan/logs/stdio

[----------] 1 test from TransformerTest
[ RUN      ] TransformerTest.NodePartNameDeclRefFailure
/b/sanitizer-x86_64-linux-bootstrap-msan/build/llvm/tools/clang/unittests/Tooling/TransformerTest.cpp:66: Failure
Value of: MaybeActual
  Actual: false
Expected: true
Rewrite failed. Expecting: 
    struct Y {
      int operator*();
    };
    int neutral(int x) {
      Y y;
      int (Y::*ptr)() = &Y::operator*;
      return *y + x;
    }
  
[  FAILED  ] TransformerTest.NodePartNameDeclRefFailure (83 ms)
[----------] 1 test from TransformerTest (83 ms total)

Can you take a look?

Fixed in r359578.

Diff 197146

clang/include/clang/Tooling/Refactoring/Transformer.h

Show All 38 Lines	enum class NodePart {
Node,		Node,
/// Given a \c MemberExpr, selects the member's token.		/// Given a \c MemberExpr, selects the member's token.
Member,		Member,
/// Given a \c NamedDecl or \c CxxCtorInitializer, selects that token of the		/// Given a \c NamedDecl or \c CxxCtorInitializer, selects that token of the
/// relevant name, not including qualifiers.		/// relevant name, not including qualifiers.
Name,		Name,
};		};

using TextGenerator =		// \c TextGenerator may fail, because it processes dynamically-bound match
std::function<std::string(const ast_matchers::MatchFinder::MatchResult &)>;		// results. For example, a typo in the name of a bound node or a mismatch in
		// the node's type can lead to a failure in the string generation code. We
		// allow the generator to return \c Expected, rather than assert on such
		// failures, so that the Transformer client can choose how to handle the error.
		// For example, if used in a UI (for example, clang-query or a web app), in
		// which the user specifies the rewrite rule, the backend might choose to return
		// a diagnostic error, rather than crash.
		using TextGenerator = std::function<Expected<std::string>(
		const ast_matchers::MatchFinder::MatchResult &)>;
		ilya-biryukovUnsubmitted Done Reply Inline Actions NIT: maybe shorten a bit, still capturing the essence? Something like the following should be enough: Note that \p TextGenerator is allowed to fail, e.g. when trying to access a matched node that was not bound. Allowing this to fail simplifies error handling for interactive tools like clang-query. ilya-biryukov: NIT: maybe shorten a bit, still capturing the essence? Something like the following should be…

/// Wraps a string as a TextGenerator.		/// Wraps a string as a (trivially successful) TextGenerator.
		ilya-biryukovUnsubmitted Done Reply Inline Actions Maybe drop `trivially successful`? Does not seem to be super-important. ilya-biryukov: Maybe drop `trivially successful`? Does not seem to be super-important.
inline TextGenerator text(std::string M) {		inline TextGenerator text(std::string M) {
return [M](const ast_matchers::MatchFinder::MatchResult &) { return M; };		return [M](const ast_matchers::MatchFinder::MatchResult &)
		-> Expected<std::string> { return M; };
}		}

// Description of a source-code edit, expressed in terms of an AST node.		// Description of a source-code edit, expressed in terms of an AST node.
// Includes: an ID for the (bound) node, a selector for source related to the		// Includes: an ID for the (bound) node, a selector for source related to the
// node, a replacement and, optionally, an explanation for the edit.		// node, a replacement and, optionally, an explanation for the edit.
//		//
// * Target: the source code impacted by the rule. This identifies an AST node,		// * Target: the source code impacted by the rule. This identifies an AST node,
// or part thereof (\c Part), whose source range indicates the extent of the		// or part thereof (\c Part), whose source range indicates the extent of the
▲ Show 20 Lines • Show All 162 Lines • ▼ Show 20 Lines
public:		public:
using ChangeConsumer =		using ChangeConsumer =
std::function<void(const clang::tooling::AtomicChange &Change)>;		std::function<void(const clang::tooling::AtomicChange &Change)>;

/// \param Consumer Receives each successful rewrite as an \c AtomicChange.		/// \param Consumer Receives each successful rewrite as an \c AtomicChange.
/// Note that clients are responsible for handling the case that independent		/// Note that clients are responsible for handling the case that independent
/// \c AtomicChanges conflict with each other.		/// \c AtomicChanges conflict with each other.
Transformer(RewriteRule Rule, ChangeConsumer Consumer)		Transformer(RewriteRule Rule, ChangeConsumer Consumer)
: Rule(std::move(Rule)), Consumer(std::move(Consumer)) {}		: Rule(std::move(Rule)), Consumer(std::move(Consumer)) {}
		ilya-biryukovUnsubmitted Done Reply Inline Actions s/it's/its ilya-biryukov: s/it's/its

/// N.B. Passes `this` pointer to `MatchFinder`. So, this object should not		/// N.B. Passes `this` pointer to `MatchFinder`. So, this object should not
/// be moved after this call.		/// be moved after this call.
void registerMatchers(ast_matchers::MatchFinder *MatchFinder);		void registerMatchers(ast_matchers::MatchFinder *MatchFinder);

/// Not called directly by users -- called by the framework, via base class		/// Not called directly by users -- called by the framework, via base class
/// pointer.		/// pointer.
void run(const ast_matchers::MatchFinder::MatchResult &Result) override;		void run(const ast_matchers::MatchFinder::MatchResult &Result) override;
Show All 10 Lines

clang/lib/Tooling/Refactoring/Transformer.cpp

Show First 20 Lines • Show All 151 Lines • ▼ Show 20 Lines	tooling::translateEdits(const MatchResult &Result,
for (const auto &Edit : Edits) {		for (const auto &Edit : Edits) {
auto It = NodesMap.find(Edit.Target);		auto It = NodesMap.find(Edit.Target);
assert(It != NodesMap.end() && "Edit target must be bound in the match.");		assert(It != NodesMap.end() && "Edit target must be bound in the match.");

Expected<CharSourceRange> RangeOrErr = getTargetRange(		Expected<CharSourceRange> RangeOrErr = getTargetRange(
Edit.Target, It->second, Edit.Kind, Edit.Part, *Result.Context);		Edit.Target, It->second, Edit.Kind, Edit.Part, *Result.Context);
if (auto Err = RangeOrErr.takeError())		if (auto Err = RangeOrErr.takeError())
return std::move(Err);		return std::move(Err);
Transformation T;		auto &Range = *RangeOrErr;
T.Range = *RangeOrErr;		if (Range.isInvalid() \|\|
if (T.Range.isInvalid() \|\|		isOriginMacroBody(*Result.SourceManager, Range.getBegin()))
isOriginMacroBody(*Result.SourceManager, T.Range.getBegin()))
return SmallVector<Transformation, 0>();		return SmallVector<Transformation, 0>();
T.Replacement = Edit.Replacement(Result);		auto ReplacementOrErr = Edit.Replacement(Result);
		if (auto Err = ReplacementOrErr.takeError())
		return std::move(Err);
		Transformation T;
		T.Range = Range;
		T.Replacement = std::move(*ReplacementOrErr);
		ilya-biryukovUnsubmitted Done Reply Inline Actions Maybe follow a typical pattern for handling errors here (to avoid `OrErr` suffixes and an extra `Err` variable)? I.e. auto Replacement = Edit.Replacement(Result); if (!Replacement) return Replacement.takeError(); T.Replacement = std::move(Replacement); ilya-biryukov:* Maybe follow a typical pattern for handling errors here (to avoid `OrErr` suffixes and an…
		ymandelAuthorUnsubmitted Done Reply Inline Actions Here and elsewhere. ymandel: Here and elsewhere.
Transformations.push_back(std::move(T));		Transformations.push_back(std::move(T));
}		}
return Transformations;		return Transformations;
}		}

RewriteRule tooling::makeRule(ast_matchers::internal::DynTypedMatcher M,		RewriteRule tooling::makeRule(ast_matchers::internal::DynTypedMatcher M,
SmallVector<ASTEdit, 1> Edits) {		SmallVector<ASTEdit, 1> Edits) {
M.setAllowBind(true);		M.setAllowBind(true);
Show All 18 Lines	void Transformer::run(const MatchResult &Result) {
assert(Root != NodesMap.end() && "Transformation failed: missing root node.");		assert(Root != NodesMap.end() && "Transformation failed: missing root node.");
SourceLocation RootLoc = Result.SourceManager->getExpansionLoc(		SourceLocation RootLoc = Result.SourceManager->getExpansionLoc(
Root->second.getSourceRange().getBegin());		Root->second.getSourceRange().getBegin());
assert(RootLoc.isValid() && "Invalid location for Root node of match.");		assert(RootLoc.isValid() && "Invalid location for Root node of match.");

auto TransformationsOrErr = translateEdits(Result, Rule.Edits);		auto TransformationsOrErr = translateEdits(Result, Rule.Edits);
if (auto Err = TransformationsOrErr.takeError()) {		if (auto Err = TransformationsOrErr.takeError()) {
llvm::errs() << "Transformation failed: " << llvm::toString(std::move(Err))		llvm::errs() << "Transformation failed: " << llvm::toString(std::move(Err))
<< "\n";		<< "\n";
		ilya-biryukovUnsubmitted Done Reply Inline Actions This looks super-complicated. Having `Error` in `AtomicChange` seems like a bad idea in the first place, why would we choose to use it here? The following alternatives would encourage clients to handle errors properly: accept an `Expected<AtomicChange>` in our callback, provide a separate callback to consume errors. WDYT about picking one of those two? ilya-biryukov: This looks super-complicated. Having `Error` in `AtomicChange` seems like a bad idea in the…
		ymandelAuthorUnsubmitted Done Reply Inline Actions Agreed! I was using `setError` on the assumption that it was the "standard" way to express errors. Given that it seems to be totally ignored otherwise, let's go with option 1. I'll update the revision. ymandel: Agreed! I was using `setError` on the assumption that it was the "standard" way to express…
return;		return;
}		}
auto &Transformations = *TransformationsOrErr;		auto &Transformations = *TransformationsOrErr;
if (Transformations.empty()) {		if (Transformations.empty()) {
// No rewrite applied (but no error encountered either).		// No rewrite applied (but no error encountered either).
RootLoc.print(llvm::errs() << "note: skipping match at loc ",		RootLoc.print(llvm::errs() << "note: skipping match at loc ",
*Result.SourceManager);		*Result.SourceManager);
llvm::errs() << "\n";		llvm::errs() << "\n";
Show All 14 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[LibTooling] Change Transformer's TextGenerator to a partial function.
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 197146

clang/include/clang/Tooling/Refactoring/Transformer.h

clang/lib/Tooling/Refactoring/Transformer.cpp

This is an archive of the discontinued LLVM Phabricator instance.

[LibTooling] Change Transformer's TextGenerator to a partial function.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 197146

clang/include/clang/Tooling/Refactoring/Transformer.h

clang/lib/Tooling/Refactoring/Transformer.cpp

[LibTooling] Change Transformer's TextGenerator to a partial function.
ClosedPublic