This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/
-
include/clang/Analysis/Analyses/
-
clang/
-
Analysis/
-
Analyses/
1
UnsafeBufferUsageGadgets.def
-
lib/Analysis/
-
Analysis/
6/15
UnsafeBufferUsage.cpp
-
test/SemaCXX/
-
SemaCXX/
1
warn-unsafe-buffer-usage-fixits-deref-simple-ptr-arith.cpp

Differential D142795

[-Wunsafe-buffer-usage] Add Fixable for dereference of simple ptr arithmetic
ClosedPublic

Authored by ziqingluo-90 on Jan 27 2023, 6:26 PM.

Download Raw Diff

Details

Reviewers

NoQ
malavikasamak
t-rasmud
jkorous
aaron.ballman
xazax.hun
gribozavr
ymandel
sgatev

Commits

rG6a0f2e539b8e: [-Wunsafe-buffer-usage] Add Fixable for dereference of simple ptr arithmetic

Summary

For each expression e of the form *(DRE + n) (or *(n + DRE)), where DRE has a pointer type and n is an integer literal, e will be transformed to DRE[n] (or n[DRE] respectively), if

1. e is at the left-hand side of an assignment or is an lvalue being casted to an rvalue; and
2. the variable referred by DRE is going to be transformed to be of std::span type.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

jkorous created this revision.Jan 27 2023, 6:26 PM

Herald added a project: Restricted Project. · View Herald TranscriptJan 27 2023, 6:26 PM

Herald added a subscriber: ChuanqiXu. · View Herald Transcript

jkorous requested review of this revision.Jan 27 2023, 6:26 PM

jkorous added a parent revision: D142794: [-Wunsafe-buffer-usage] Fixits for assignment to array subscript expr.

Harbormaster completed remote builds in B210506: Diff 492962.Jan 27 2023, 6:27 PM

I think that my current name of the Fixable is pretty bad - open to suggestions!

clang/lib/Analysis/UnsafeBufferUsage.cpp
608	I promise I will change the format - this one is just natural to me and I used it to create the matcher.

jkorous added inline comments.Jan 27 2023, 6:33 PM

clang/lib/Analysis/UnsafeBufferUsage.cpp
839	We will do this in: https://reviews.llvm.org/D139737 I will remove it from this patch.

Nice! I have some nitpicks.

clang/include/clang/Analysis/Analyses/UnsafeBufferUsageGadgets.def
33–36	Spacing is a bit weird here.
clang/lib/Analysis/UnsafeBufferUsage.cpp
608	Looks quite readable to me!
633–635	Interesting, you're using `std::nullopt` to indicate both that fix isn't implemented yet (below) and that the fix is fundamentally impossible (here). I think a slightly cleaner design would be to make this check part of the matcher: if it's not fixable, let's not construct a Fixable. Then `return std::nullopt` will always indicate "well, it's probably fixable but not implemented yet". This way later we'll be able to "profile" our fixit coverage by catching these `std::nullopt` return values and classifying them; it'd be harder to do if there are "valid" `std::nullopt` results. This also implies that `std::nullopt` will never indicate that "the strategy isn't appropriate for this fixable pattern". The gadget will need a different channel to communicate that. But then I realize that this same gadget is quite fixable for eg. iterator strategy (which can deal with negative values easily). So if we add this check to the matcher, we'll have to make a separate gadget for potentially negative values. Which may start conflicting with this gadget because the new gadget will also happily accept positive values. So I guess a better thing to do would be to make the "different channel" more flexible, so that it could say "Well, I matched, but I'm not going to do this strategy because I'm negative, please try a different strategy". In this case we'll move the check from the matcher to the strategy feedback method. So, yeah, there's some room for design discussions, but as of today we probably don't care.
647	This needs some clang-format.
clang/test/SemaCXX/warn-unsafe-buffer-usage-fixits-deref-simple-ptr-arith.cpp
77

jkorous added inline comments.Jan 30 2023, 3:32 PM

clang/lib/Analysis/UnsafeBufferUsage.cpp
633–635	Oh! This is interesting! Let me start by saying that I agree that at some point we should discuss the contract of this part of the machinery. I actually would be quite surprised if we won't have more broader design discussion once we look into things like more advanced type replacement strategies, etc. in next couple months. That makes me want to stick to our current design for a bit longer because despite its limitations it will allow us to still get more data that we can use to plan the next iteration better. Now, about the current design. My understanding was that the AST pattern defines "area of authority" for a particular `Fixable` and returning `std::nullopt` from `getFixits` method just means that a given strategy can't be achieved (for whatever reason). Since we expect to have a pretty diverse set of strategies (e. g. array, vector, span, span_iterator, ...) I implicitly assumed that it is almost inevitable that some Fixables won't be able to provide a Fix-It for each and every strategy and thought that's how the design is intended to work. But I guess we never really discussed these details because we didn't have a specific example and we all just assumed something - not necessarily the same thing though :) Related to how do we learn about what hasn't been implemented - I actually started imagining we might use some logging feature (possibly only for the debug mode) which would tell us why are fixits not emitted. But since it is just a solution and not necessarily the optimal one I'd also prefer to first finish our initial batch of Fixable and go through the exercise of verifying them against real codebases before we decide on "if && what" is necessary. Anyways, would you find it reasonable to add a FIXME here and/or to the `FixableGadget::getFixits()` declaration but keep using `std::nullopt` for both cases described above for now?

rebased

Harbormaster completed remote builds in B211374: Diff 494151.Feb 1 2023, 8:29 PM

malavikasamak added a child revision: D143206: [-Wunsafe-buffer-usage] Add Fixable for simple pointer dereference.Feb 2 2023, 11:43 AM

jkorous mentioned this in D139737: [-Wunsafe-buffer-usage] Initiate Fix-it generation for local variable declarations.Feb 2 2023, 7:11 PM

ziqingluo-90 added inline comments.Feb 9 2023, 3:36 PM

clang/lib/Analysis/UnsafeBufferUsage.cpp
568	The parameter `Loc` is always the begin location of some token. So maybe we could instead use a for-loop over the token length to avoid any risk of looping forever?
645	I wonder if this would be an at-least-not-worse fix-it: replacing `*(pointer + 123)` with `pointer[123]` in one step? I think it could reduce the whitespace problem.

NoQ added inline comments.Feb 16 2023, 6:11 PM

clang/lib/Analysis/UnsafeBufferUsage.cpp
569	Looks like this will crash with assertion failure if `Loc` is at the beginning of the file. Probably not a real issue but might be worth checking.
633–635	Ok sure let's add a FIXME and handle this later!

ziqingluo-90 added inline comments.Feb 21 2023, 6:14 PM

clang/lib/Analysis/UnsafeBufferUsage.cpp
648	Just realized that we can achieve the same goal without using `getBeginOfPrecHWSpace`.
649

Taking care of this revision on behalf of @jkorous

Address some of the comments.
Add handling for both *(ptr + n) and *(n + ptr).
Add handling for *((..(ptr + n) .. )
Remove the white space handling

ziqingluo-90 retitled this revision from [-Wunsafe-buffer-usage][WIP] Add Fixable for dereference of simple ptr arithmetic to [-Wunsafe-buffer-usage] Add Fixable for dereference of simple ptr arithmetic.Feb 22 2023, 2:36 PM

ziqingluo-90 edited the summary of this revision. (Show Details)

ziqingluo-90 added reviewers: aaron.ballman, xazax.hun, gribozavr, ymandel, sgatev.

Herald added a subscriber: rnkovacs. · View Herald TranscriptFeb 22 2023, 2:36 PM

NoQ added inline comments.Feb 22 2023, 5:17 PM

clang/lib/Analysis/UnsafeBufferUsage.cpp
868–869	You can combine `dyn_cast()` with assertion by using `cast()` instead. But in any case, I'm not sure the assertion is actually correct here (see also), I think it's a good idea to add a test case for `BindingDecl` here as well.

Harbormaster completed remote builds in B215353: Diff 499636.Feb 22 2023, 7:11 PM

Looking over this patch, I'm a little concerned that you seem to have re-invented the infrastructure of RewriteRules from Transformer. Not the details of the patch itself, but the infrastructure its built on -- the FixableGadget and FixitLists all seem very similar. While you're welcome to do as you please, I imagine the project would be best off if there was a common infrastructure. Have you taken a look at clang::transformer? If so, is there a reason that it doesn't meet your needs? https://github.com/llvm/llvm-project/blob/main/clang/include/clang/Tooling/Transformer/RewriteRule.h is the starting point, if you're not familiar with it. The doc page is here: https://intel.github.io/llvm-docs/clang/ClangTransformerTutorial.html.

@ymandel I think we should definitely try it.

I don't think our FixableGadgets correspond nicely to RewriteRules. Our story is that we gather *global* understanding of the entire function by collecting gadgets inside it, which is then used for discovering the best "strategy" for transforming the function, which may activate certain (not necessarily all) fixable gadgets in a specific manner. For instance, once D143133 gets fully developed, it'll involve fixpoint iteration over all fixables to discover variables that need to be transformed simultaneously. So I suspect that RewriteRule is too high-level / restrictive for our purposes.

But I do think that we can use the EditGenerator infrastructure to greatly simplify fixit generation once we figure out the strategy. So we can have something like

class FixableGadget {
  virtual EditGenerator generator(Strategy &S) = 0;

  FixItList getFixits(Strategy &S) { // non-virtual!
    return generator(S)(MatchResult).somehowFlattenAllTheseFixits();
  }
};

class DerefSimplePtrArithFixableGadget : public FixableGadget {
  static constexpr const char *const AddOpTag = "AddOp";
  static constexpr const char *const OffsetTag = "Offset";

  static StmtMatcher matcher() { /* matcher that makes all these bindings */ }

  EditGenerator generator(Strategy &S) override {
    return changeTo(node(AddOpTag), cat(node(BaseDRETag), "[", node(OffsetTag), "]"));
  }
};

which is much simpler than what we have. So I agree that we should consider this, at least for new code.

Things I don't immediately understand, which we'll need to figure out:

How do we preserve MatchResult long enough? It needs to be kept alive while we build the Strategy object, so that later we didn't have to rerun the matchers for fixit purposes.
We're somewhat conscious about minimizing our fixit's source ranges. In particular, we really need the fixit replacement range to not overlap with the OffsetTag range in this example. This is because the offset may contain other fixable operations inside it! So we won't be happy if the fixit produced by the generator throws out the entire expression and replaces it with concatenated text, we need something more targeted.

How do we preserve MatchResult long enough?

(it looks easily copyable as it's basically a std::map<std::string, Stmt *>; but it also looks like nobody tried this before, so we'll have to define a copy constructor, or like a .clone() method if we're worried about accidental copies)

It also sounds like this is going to be the first use of libClangTransformers in clang proper, so we'll have to link the clang binary to it.

Also, @ymandel would you mind if we commit this patch as-is and experiment with Transformers in a follow-up patch? 'Cause we have like 6 patches waiting for this one to land, and we'd rather avoid large-scale rebasing as we fight to keep our backlogs short.

Very sorry for the delayed response! Also, feel free to move this discussion to some other forum. Responses inline:

In D142795#4148991, @NoQ wrote:

@ymandel I think we should definitely try it.

I don't think our FixableGadgets correspond nicely to RewriteRules. Our story is that we gather *global* understanding of the entire function by collecting gadgets inside it, which is then used for discovering the best "strategy" for transforming the function, which may activate certain (not necessarily all) fixable gadgets in a specific manner. For instance, once D143133 gets fully developed, it'll involve fixpoint iteration over all fixables to discover variables that need to be transformed simultaneously. So I suspect that RewriteRule is too high-level / restrictive for our purposes.

This makes a lot of sense. We've actually moved in the same direction ourself, away from one-shot rules (though we still use them for simpler tasks) towards analysis (or even, an analysis pipeline) followed by application of edits. I think the RewriteRule concept needs to expand to encompass this approach and would hope that feedback from your use case could help drive that. In the meantime, we've found taken two approaches:

Embed the analysis in the matchers and thread that state to the edit generator. So, the rule looks something like makeRule(matcherThatAnchorsAnalysis(SharedState), generatorFromState(SharedState)). Sometimes the generator is a custom function, but we can often use the standard combinators.
Generate _metadata_ from the rewrite rule, and don't generate edits. Then, followup passes can consume the metadata and turn them into edits (using EditGenerators or otherwise). The metadata can also just bundle edits directly, but in a form suited to followup processing rather than the standard output from rules that's intended for direct application.

Regardless, I'd love to work together to find a format that works for these use cases.

But I do think that we can use the EditGenerator infrastructure to greatly simplify fixit generation once we figure out the strategy. So we can have something like
class FixableGadget {
  virtual EditGenerator generator(Strategy &S) = 0;

  FixItList getFixits(Strategy &S) { // non-virtual!
    return generator(S)(MatchResult).somehowFlattenAllTheseFixits();
  }
};

class DerefSimplePtrArithFixableGadget : public FixableGadget {
  static constexpr const char *const AddOpTag = "AddOp";
  static constexpr const char *const OffsetTag = "Offset";

  static StmtMatcher matcher() { /* matcher that makes all these bindings */ }

  EditGenerator generator(Strategy &S) override {
    return changeTo(node(AddOpTag), cat(node(BaseDRETag), "[", node(OffsetTag), "]"));
  }
};
which is much simpler than what we have. So I agree that we should consider this, at least for new code.

Things I don't immediately understand, which we'll need to figure out:

How do we preserve MatchResult long enough? It needs to be kept alive while we build the Strategy object, so that later we didn't have to rerun the matchers for fixit purposes.

I'm not sure I understand the issue, can you expand? I'd sooner expect that you would eagerly generate the Edits, rather than building up EditGenerators and then you don't need to keep around the MatchResult.

We're somewhat conscious about minimizing our fixit's source ranges. In particular, we really need the fixit replacement range to not overlap with the OffsetTag range in this example. This is because the offset may contain other fixable operations inside it! So we won't be happy if the fixit produced by the generator throws out the entire expression and replaces it with concatenated text, we need something more targeted.

Yes, this is a common concern. We tend to apply targeted fixes where possible, using the combinators to correctly select the parts of the syntax to target. Ideally, the framework could do that for you -- that is, you could do larger edit for clarity and it would diff only produce the smaller edit -- but that's "future work" with no plans at this point.

In D142795#4165331, @ymandel wrote:

In D142795#4148991, @NoQ wrote:

@ymandel I think we should definitely try it.

I don't think our FixableGadgets correspond nicely to RewriteRules. Our story is that we gather *global* understanding of the entire function by collecting gadgets inside it, which is then used for discovering the best "strategy" for transforming the function, which may activate certain (not necessarily all) fixable gadgets in a specific manner. For instance, once D143133 gets fully developed, it'll involve fixpoint iteration over all fixables to discover variables that need to be transformed simultaneously. So I suspect that RewriteRule is too high-level / restrictive for our purposes.

This makes a lot of sense. We've actually moved in the same direction ourself, away from one-shot rules (though we still use them for simpler tasks) towards analysis (or even, an analysis pipeline) followed by application of edits. I think the RewriteRule concept needs to expand to encompass this approach and would hope that feedback from your use case could help drive that. In the meantime, we've found taken two approaches:

Embed the analysis in the matchers and thread that state to the edit generator. So, the rule looks something like makeRule(matcherThatAnchorsAnalysis(SharedState), generatorFromState(SharedState)). Sometimes the generator is a custom function, but we can often use the standard combinators.

Generate _metadata_ from the rewrite rule, and don't generate edits. Then, followup passes can consume the metadata and turn them into edits (using EditGenerators or otherwise). The metadata can also just bundle edits directly, but in a form suited to followup processing rather than the standard output from rules that's intended for direct application.

Regardless, I'd love to work together to find a format that works for these use cases.

Yeah this is a really interesting subject, I'd love to have a bigger conversation about it. We've got a lot of very interesting examples of after-the-fact interactions, and a lot of them seem to be specific to our work. A lot of them stem from the fact that we aren't settling for any fix that would compile and silence the warning, but we aspire to generate code with the maximum security benefit. And when we cannot, we find it better to abandon the entire fixit and recommend manual transformation, than to recommend a less-than-ideal automatic solution. Because the less-than-ideal solution would silence the warning, and the code won't be revisited for later security improvements or flagged for future security audit. So it's better to have our tool teach the developer by example how to write better code whenever it can, and offer the developer a chance to demonstrate what they learned when it cannot find a good solution automatically.

The ideal solution often involves substituting multiple variables at a time, so that, say, the span object didn't need to be unpacked and repacked. In fact, because span size is a thing (but pointer size isn't), there's no easy analysis that would help us eliminate span-repacking code after the fact! Partially spanified code is much harder to analyze than fully unspanified code because there's now this entire extra aspect of the program's behavior (span sizes) that we need to either preserve or prove away as irrelevant. Whereas in the original unspanified program it's obviously irrelevant from the start. So even though we want to successfully analyze partially spanified code in some cases, it's clearly a much harder problem and ideally we don't want to block the tool on solving it, so instead we're trying to make larger leaps from unspanified code directly to ideal code.

(This sounds as if it relates to our discussion in D143128 but it really doesn't, that patch's dilemma was about two fixes that are both equally (un-)ideal, and it's probably easy to transform one into another "after the fact".)

How do we preserve MatchResult long enough? It needs to be kept alive while we build the Strategy object, so that later we didn't have to rerun the matchers for fixit purposes.

I'm not sure I understand the issue, can you expand? I'd sooner expect that you would eagerly generate the Edits, rather than building up EditGenerators and then you don't need to keep around the MatchResult.

I suspect that as our machine grows larger, that'd cause us to eagerly generate around 5-10x more edits than we ever emit. Just because a code pattern is fixable, doesn't mean we want to fix it. Even if we do, there's also like five different ways to fix it (encoded in our Strategy class, currently we implement just one way: "span"). And answers to these questions - whether we want to fix the pattern, and how - depend not only on this pattern, but also on properties of the space of all patterns discovered in the function. This isn't something we can decide upon in the constructor of the fixable pattern, when half of patterns aren't even constructed yet. We either need to see all MatchResults together before emitting any edits, or ~90% of our edits go to waste.

We're somewhat conscious about minimizing our fixit's source ranges. In particular, we really need the fixit replacement range to not overlap with the OffsetTag range in this example. This is because the offset may contain other fixable operations inside it! So we won't be happy if the fixit produced by the generator throws out the entire expression and replaces it with concatenated text, we need something more targeted.

Yes, this is a common concern. We tend to apply targeted fixes where possible, using the combinators to correctly select the parts of the syntax to target. Ideally, the framework could do that for you -- that is, you could do larger edit for clarity and it would diff only produce the smaller edit -- but that's "future work" with no plans at this point.

That's another fascinating subject.

We have a weak signal that it might not even be the right thing to do. Say, if you're replacing a function call foo with foo2, it's arguably better to have the fixit look like this:

foo();
^~~
foo2

than to have it look like this:

foo();
   ^
   2

The second solution may be alright in some cases (especially when you hardcode the names of the function in the compiler, so you know for sure that "adding a suffix to the function" is an accurate way to describe the fix from user's point of view) but in the general case (say, when you discover functions through attributes) the second solution may be very surprising. But a purely textual "fixit minimizer" machine will not be able to tell the difference and recognize that the first approach is more appropriate in this case.

It could also be bad in the same way as the command-line diff tool sometimes completely misunderstands the nature of the diff (say, when it tells us that we've added } foo() { instead of foo() {}). So I think there's a few good reasons to retain full control over the shape of the edits.

ziqingluo-90 added inline comments.Mar 9 2023, 5:15 PM

clang/lib/Analysis/UnsafeBufferUsage.cpp
868–869	Good point! Actually an example with `BindingDecl`s could break our code. The `Tracker` collects all `declRefExpr(to(varDecl()))` but most of the Gadget matchers simply look for `declRefExpr()`s. If a Gadget is associated to a DRE to a `BindingDecl`, it has an unclaimed DRE. We probably need another patch to add `to(varDecl())` to places where its missing and add proper tests.

We agreed with @ymandel to have a separate discussion about how can Safe Buffers Fix-Its benefit from using libClangTransformers .
Our tentative plan is to it for new code that we write and eventually change all our code to use it.

Other than the to(varDecl()) discussion, LGTM!

clang/lib/Analysis/UnsafeBufferUsage.cpp
868–869	Aha, sure, we probably need to fix all places, but I also don't want this patch to introduce a new source of crashes, where other gadgets already act defensively. So let's at least replace `assert` with an early return before we land this patch?

Address comments

Harbormaster completed remote builds in B219960: Diff 505933.Mar 16 2023, 2:49 PM

ziqingluo-90 marked 2 inline comments as done.Mar 16 2023, 2:49 PM

LGTM

This revision is now accepted and ready to land.Mar 20 2023, 2:41 PM

NoQ accepted this revision.Mar 20 2023, 2:48 PM

This revision was landed with ongoing or failed builds.Mar 20 2023, 5:07 PM

Closed by commit rG6a0f2e539b8e: [-Wunsafe-buffer-usage] Add Fixable for dereference of simple ptr arithmetic (authored by ziqingluo-90). · Explain Why

This revision was automatically updated to reflect the committed changes.

ziqingluo-90 added a commit: rG6a0f2e539b8e: [-Wunsafe-buffer-usage] Add Fixable for dereference of simple ptr arithmetic.

Herald added a project: Restricted Project. · View Herald TranscriptMar 20 2023, 5:07 PM

Herald added a subscriber: cfe-commits. · View Herald Transcript

Revision Contents

Path

Size

clang/

include/

clang/

Analysis/

Analyses/

UnsafeBufferUsageGadgets.def

1 line

lib/

Analysis/

UnsafeBufferUsage.cpp

102 lines

test/

SemaCXX/

warn-unsafe-buffer-usage-fixits-deref-simple-ptr-arith.cpp

199 lines

Diff 506782

clang/include/clang/Analysis/Analyses/UnsafeBufferUsageGadgets.def

Show All 24 Lines

#define FIXABLE_GADGET(name) GADGET(name)

#endif

WARNING_GADGET(Increment)

WARNING_GADGET(Decrement)

WARNING_GADGET(ArraySubscript)

WARNING_GADGET(PointerArithmetic)

WARNING_GADGET(UnsafeBufferUsageAttr)

FIXABLE_GADGET(ULCArraySubscript)

FIXABLE_GADGET(DerefSimplePtrArithFixable)

#undef FIXABLE_GADGET

NoQUnsubmitted

Not Done

WARNING_GADGET(PointerArithmetic)

- FIXABLE_GADGET(ULCArraySubscript)

+ FIXABLE_GADGET(ULCArraySubscript)

FIXABLE_GADGET(DerefSimplePtrArithFixable)

#undef FIXABLE_GADGET

Spacing is a bit weird here.

NoQ: Spacing is a bit weird here.

#undef WARNING_GADGET

#undef GADGET

clang/lib/Analysis/UnsafeBufferUsage.cpp

//===- UnsafeBufferUsage.cpp - Replace pointers with modern C++ -----------===// //===- UnsafeBufferUsage.cpp - Replace pointers with modern C++ -----------===//

// //

// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions. // Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.

// See https://llvm.org/LICENSE.txt for license information. // See https://llvm.org/LICENSE.txt for license information.

// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception // SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception

// //

//===----------------------------------------------------------------------===// //===----------------------------------------------------------------------===//

#include "clang/Analysis/Analyses/UnsafeBufferUsage.h" #include "clang/Analysis/Analyses/UnsafeBufferUsage.h"

#include "clang/AST/Decl.h"

#include "clang/AST/RecursiveASTVisitor.h" #include "clang/AST/RecursiveASTVisitor.h"

#include "clang/ASTMatchers/ASTMatchFinder.h" #include "clang/ASTMatchers/ASTMatchFinder.h"

#include "clang/Lex/Lexer.h" #include "clang/Lex/Lexer.h"

#include "clang/Lex/Preprocessor.h" #include "clang/Lex/Preprocessor.h"

#include "llvm/ADT/SmallVector.h" #include "llvm/ADT/SmallVector.h"

#include <memory> #include <memory>

#include <optional> #include <optional>

▲ Show 20 Lines • Show All 535 Lines • ▼ Show 20 Lines Kind lookup(const VarDecl *VD) const {

if (I == Map.end()) if (I == Map.end())

return Kind::Wontfix; return Kind::Wontfix;

return I->second; return I->second;

} }

}; };

} // namespace } // namespace

// Representing a fixable expression of the form `*(ptr + 123)` or `*(123 +

// ptr)`:

class DerefSimplePtrArithFixableGadget : public FixableGadget {

static constexpr const char *const BaseDeclRefExprTag = "BaseDRE";

static constexpr const char *const DerefOpTag = "DerefOp";

static constexpr const char *const AddOpTag = "AddOp";

static constexpr const char *const OffsetTag = "Offset";

ziqingluo-90AuthorUnsubmitted

Not Done

The parameter Loc is always the begin location of some token. So maybe we could instead use a for-loop over the token length to avoid any risk of looping forever?

ziqingluo-90: The parameter `Loc` is always the begin location of some token. So maybe we could instead use…

NoQUnsubmitted

Not Done

Looks like this will crash with assertion failure if Loc is at the beginning of the file. Probably not a real issue but might be worth checking.

NoQ: Looks like this will crash with assertion failure if `Loc` is at the beginning of the file.

const DeclRefExpr *BaseDeclRefExpr = nullptr;

const UnaryOperator *DerefOp = nullptr;

const BinaryOperator *AddOp = nullptr;

const IntegerLiteral *Offset = nullptr;

public:

DerefSimplePtrArithFixableGadget(const MatchFinder::MatchResult &Result)

: FixableGadget(Kind::DerefSimplePtrArithFixable),

BaseDeclRefExpr(

Result.Nodes.getNodeAs<DeclRefExpr>(BaseDeclRefExprTag)),

DerefOp(Result.Nodes.getNodeAs<UnaryOperator>(DerefOpTag)),

AddOp(Result.Nodes.getNodeAs<BinaryOperator>(AddOpTag)),

Offset(Result.Nodes.getNodeAs<IntegerLiteral>(OffsetTag)) {}

static Matcher matcher() {

// clang-format off

auto ThePtr = expr(hasPointerType(),

ignoringImpCasts(declRefExpr(to(varDecl())).bind(BaseDeclRefExprTag)));

auto PlusOverPtrAndInteger = expr(anyOf(

binaryOperator(hasOperatorName("+"), hasLHS(ThePtr),

hasRHS(integerLiteral().bind(OffsetTag)))

.bind(AddOpTag),

binaryOperator(hasOperatorName("+"), hasRHS(ThePtr),

hasLHS(integerLiteral().bind(OffsetTag)))

.bind(AddOpTag)));

return isInUnspecifiedLvalueContext(unaryOperator(

hasOperatorName("*"),

hasUnaryOperand(ignoringParens(PlusOverPtrAndInteger)))

.bind(DerefOpTag));

// clang-format on

}

virtual std::optional<FixItList> getFixits(const Strategy &s) const final;

// TODO remove this method from FixableGadget interface

virtual const Stmt *getBaseStmt() const final { return nullptr; }

virtual DeclUseList getClaimedVarUseSites() const final {

return {BaseDeclRefExpr};

jkorousUnsubmitted

Done

I promise I will change the format - this one is just natural to me and I used it to create the matcher.

jkorous: I promise I will change the format - this one is just natural to me and I used it to create the…

NoQUnsubmitted

Not Done

Looks quite readable to me!

NoQ: Looks quite readable to me!

}

};

/// Scan the function and return a list of gadgets found with provided kits. /// Scan the function and return a list of gadgets found with provided kits.

static std::tuple<FixableGadgetList, WarningGadgetList, DeclUseTracker> static std::tuple<FixableGadgetList, WarningGadgetList, DeclUseTracker>

findGadgets(const Decl *D, const UnsafeBufferUsageHandler &Handler) { findGadgets(const Decl *D, const UnsafeBufferUsageHandler &Handler) {

struct GadgetFinderCallback : MatchFinder::MatchCallback { struct GadgetFinderCallback : MatchFinder::MatchCallback {

FixableGadgetList FixableGadgets; FixableGadgetList FixableGadgets;

WarningGadgetList WarningGadgets; WarningGadgetList WarningGadgets;

DeclUseTracker Tracker; DeclUseTracker Tracker;

void run(const MatchFinder::MatchResult &Result) override { void run(const MatchFinder::MatchResult &Result) override {

// In debug mode, assert that we've found exactly one gadget. // In debug mode, assert that we've found exactly one gadget.

// This helps us avoid conflicts in .bind() tags. // This helps us avoid conflicts in .bind() tags.

#if NDEBUG #if NDEBUG

#define NEXT return #define NEXT return

#else #else

[[maybe_unused]] int numFound = 0; [[maybe_unused]] int numFound = 0;

#define NEXT ++numFound #define NEXT ++numFound

#endif #endif

if (const auto *DRE = Result.Nodes.getNodeAs<DeclRefExpr>("any_dre")) { if (const auto *DRE = Result.Nodes.getNodeAs<DeclRefExpr>("any_dre")) {

Tracker.discoverUse(DRE); Tracker.discoverUse(DRE);

NEXT; NEXT;

} }

NoQUnsubmitted

Not Done

Interesting, you're using std::nullopt to indicate both that fix isn't implemented yet (below) and that the fix is fundamentally impossible (here). I think a slightly cleaner design would be to make this check part of the matcher: if it's not fixable, let's not construct a Fixable. Then return std::nullopt will always indicate "well, it's probably fixable but not implemented yet". This way later we'll be able to "profile" our fixit coverage by catching these std::nullopt return values and classifying them; it'd be harder to do if there are "valid" std::nullopt results.

This also implies that std::nullopt will never indicate that "the strategy isn't appropriate for this fixable pattern". The gadget will need a different channel to communicate that. But then I realize that this same gadget is quite fixable for eg. iterator strategy (which can deal with negative values easily). So if we add this check to the matcher, we'll have to make a separate gadget for potentially negative values. Which may start conflicting with this gadget because the new gadget will also happily accept positive values. So I guess a better thing to do would be to make the "different channel" more flexible, so that it could say "Well, I matched, but I'm not going to do *this* strategy because I'm negative, please try a different strategy". In this case we'll move the check from the matcher to the strategy feedback method.

So, yeah, there's some room for design discussions, but as of today we probably don't care.

NoQ: Interesting, you're using `std::nullopt` to indicate both that fix isn't implemented yet…

jkorousUnsubmitted

Done

Oh! This is interesting!

Let me start by saying that I agree that at some point we should discuss the contract of this part of the machinery. I actually would be quite surprised if we won't have more broader design discussion once we look into things like more advanced type replacement strategies, etc. in next couple months. That makes me want to stick to our current design for a bit longer because despite its limitations it will allow us to still get more data that we can use to plan the next iteration better.

Now, about the current design. My understanding was that the AST pattern defines "area of authority" for a particular Fixable and returning std::nullopt from getFixits method just means that a given strategy can't be achieved (for whatever reason).
Since we expect to have a pretty diverse set of strategies (e. g. array, vector, span, span_iterator, ...) I implicitly assumed that it is almost inevitable that some Fixables won't be able to provide a Fix-It for each and every strategy and thought that's how the design is intended to work. But I guess we never really discussed these details because we didn't have a specific example and we all just assumed something - not necessarily the same thing though :)

Related to how do we learn about what hasn't been implemented - I actually started imagining we might use some logging feature (possibly only for the debug mode) which would tell us why are fixits not emitted. But since it is just a solution and not necessarily the optimal one I'd also prefer to first finish our initial batch of Fixable and go through the exercise of verifying them against real codebases before we decide on "if && what" is necessary.

Anyways, would you find it reasonable to add a FIXME here and/or to the FixableGadget::getFixits() declaration but keep using std::nullopt for both cases described above for now?

jkorous: Oh! This is interesting! Let me start by saying that I agree that at some point we should…

NoQUnsubmitted

Not Done

Ok sure let's add a FIXME and handle this later!

NoQ: Ok sure let's add a FIXME and handle this later!

if (const auto *DS = Result.Nodes.getNodeAs<DeclStmt>("any_ds")) { if (const auto *DS = Result.Nodes.getNodeAs<DeclStmt>("any_ds")) {

Tracker.discoverDecl(DS); Tracker.discoverDecl(DS);

NEXT; NEXT;

} }

// Figure out which matcher we've found, and call the appropriate // Figure out which matcher we've found, and call the appropriate

// subclass constructor. // subclass constructor.

// FIXME: Can we do this more logarithmically? // FIXME: Can we do this more logarithmically?

#define FIXABLE_GADGET(name) \ #define FIXABLE_GADGET(name) \

if (Result.Nodes.getNodeAs<Stmt>(#name)) { \ if (Result.Nodes.getNodeAs<Stmt>(#name)) { \

ziqingluo-90AuthorUnsubmitted

Not Done

I wonder if this would be an at-least-not-worse fix-it: replacing *(pointer + 123) with pointer[123] in one step? I think it could reduce the whitespace problem.

ziqingluo-90: I wonder if this would be an at-least-not-worse fix-it: replacing `*(pointer + 123)` with…

FixableGadgets.push_back(std::make_unique<name##Gadget>(Result)); \ FixableGadgets.push_back(std::make_unique<name##Gadget>(Result)); \

NEXT; \ NEXT; \

NoQUnsubmitted

Not Done

This needs some clang-format.

NoQ: This needs some clang-format.

} }

ziqingluo-90AuthorUnsubmitted

Not Done

CharSourceRange StarWithTrailWhitespace = clang::CharSourceRange::getCharRange(DerefOp->getOperatorLoc(), BaseDeclRefExpr->getBeginLoc());

- CharSourceRange PlusWithSurroundingWhitespace = clang::CharSourceRange::getCharRange(getBeginOfPrecHWSpace(AddOp->getOperatorLoc(), SM), RHS->getLocation());

+ CharSourceRange PlusWithSurroundingWhitespace = clang::CharSourceRange::getCharRange(getPastLoc(AddOp->getLHS(), SM, Ctx.getLangOpts()), RHS->getLocation());

CharSourceRange ClosingParenWithPrecWhitespace = clang::CharSourceRange::getCharRange(getBeginOfPrecHWSpace(ParenEx->getEndLoc(), SM), ParenEx->getRParen().getLocWithOffset(1));

Just realized that we can achieve the same goal without using getBeginOfPrecHWSpace.

ziqingluo-90: Just realized that we can achieve the same goal without using `getBeginOfPrecHWSpace`.

#include "clang/Analysis/Analyses/UnsafeBufferUsageGadgets.def" #include "clang/Analysis/Analyses/UnsafeBufferUsageGadgets.def"

ziqingluo-90AuthorUnsubmitted

Not Done

CharSourceRange PlusWithSurroundingWhitespace = clang::CharSourceRange::getCharRange(getBeginOfPrecHWSpace(AddOp->getOperatorLoc(), SM), RHS->getLocation());

- CharSourceRange ClosingParenWithPrecWhitespace = clang::CharSourceRange::getCharRange(getBeginOfPrecHWSpace(ParenEx->getEndLoc(), SM), ParenEx->getRParen().getLocWithOffset(1));

+ CharSourceRange ClosingParenWithPrecWhitespace = clang::CharSourceRange::getCharRange(getPastLoc(AddOp, SM, Ctx.getLangOpts()), getPastLoc(ParenEx, SM, Ctx.getLangOpts()));

return FixItList{{

ziqingluo-90:

#define WARNING_GADGET(name) \ #define WARNING_GADGET(name) \

if (Result.Nodes.getNodeAs<Stmt>(#name)) { \ if (Result.Nodes.getNodeAs<Stmt>(#name)) { \

WarningGadgets.push_back(std::make_unique<name##Gadget>(Result)); \ WarningGadgets.push_back(std::make_unique<name##Gadget>(Result)); \

NEXT; \ NEXT; \

} }

#include "clang/Analysis/Analyses/UnsafeBufferUsageGadgets.def" #include "clang/Analysis/Analyses/UnsafeBufferUsageGadgets.def"

assert(numFound >= 1 && "Gadgets not found in match result!"); assert(numFound >= 1 && "Gadgets not found in match result!");

▲ Show 20 Lines • Show All 173 Lines • ▼ Show 20 Lines

} }

// Return the text representation of the given `APInt Val`: // Return the text representation of the given `APInt Val`:

static std::string getAPIntText(APInt Val) { static std::string getAPIntText(APInt Val) {

SmallVector<char> Txt; SmallVector<char> Txt;

Val.toString(Txt, 10, true); Val.toString(Txt, 10, true);

// APInt::toString does not add '\0' to the end of the string for us: // APInt::toString does not add '\0' to the end of the string for us:

Txt.push_back('\0'); Txt.push_back('\0');

return Txt.data(); return Txt.data();

jkorousUnsubmitted

Done

We will do this in: https://reviews.llvm.org/D139737
I will remove it from this patch.

jkorous: We will do this in: https://reviews.llvm.org/D139737 I will remove it from this patch.

} }

// Return the source location of the last character of the AST `Node`. // Return the source location of the last character of the AST `Node`.

template <typename NodeTy> template <typename NodeTy>

static SourceLocation getEndCharLoc(const NodeTy *Node, const SourceManager &SM, static SourceLocation getEndCharLoc(const NodeTy *Node, const SourceManager &SM,

const LangOptions &LangOpts) { const LangOptions &LangOpts) {

return Lexer::getLocForEndOfToken(Node->getEndLoc(), 1, SM, LangOpts); return Lexer::getLocForEndOfToken(Node->getEndLoc(), 1, SM, LangOpts);

} }

Show All 10 Lines static StringRef getExprText(const Expr *E, const SourceManager &SM,

const LangOptions &LangOpts) { const LangOptions &LangOpts) {

SourceLocation LastCharLoc = getPastLoc(E, SM, LangOpts); SourceLocation LastCharLoc = getPastLoc(E, SM, LangOpts);

return Lexer::getSourceText( return Lexer::getSourceText(

CharSourceRange::getCharRange(E->getBeginLoc(), LastCharLoc), SM, CharSourceRange::getCharRange(E->getBeginLoc(), LastCharLoc), SM,

LangOpts); LangOpts);

} }

std::optional<FixItList>

DerefSimplePtrArithFixableGadget::getFixits(const Strategy &s) const {

const VarDecl *VD = dyn_cast<VarDecl>(BaseDeclRefExpr->getDecl());

NoQUnsubmitted

Done

You can combine dyn_cast() with assertion by using cast() instead. But in any case, I'm not sure the assertion is actually correct here (see also), I think it's a good idea to add a test case for BindingDecl here as well.

NoQ: You can combine `dyn_cast()` with assertion by using `cast()` instead. But in any case, I'm not…

ziqingluo-90AuthorUnsubmitted

Done

Good point! Actually an example with BindingDecls could break our code. The Tracker collects all declRefExpr(to(varDecl())) but most of the Gadget matchers simply look for declRefExpr()s. If a Gadget is associated to a DRE to a BindingDecl, it has an unclaimed DRE.

We probably need another patch to add to(varDecl()) to places where its missing and add proper tests.

ziqingluo-90: Good point! Actually an example with `BindingDecl`s could break our code. The `Tracker`…

NoQUnsubmitted

Done

Aha, sure, we probably need to fix all places, but I also don't want this patch to introduce a new source of crashes, where other gadgets already act defensively. So let's at least replace assert with an early return before we land this patch?

NoQ: Aha, sure, we probably need to fix all places, but I also don't want this patch to introduce a…

if (VD && s.lookup(VD) == Strategy::Kind::Span) {

ASTContext &Ctx = VD->getASTContext();

// std::span can't represent elements before its begin()

if (auto ConstVal = Offset->getIntegerConstantExpr(Ctx))

if (ConstVal->isNegative())

return std::nullopt;

// note that the expr may (oddly) has multiple layers of parens

// example:

// *((..(pointer + 123)..))

// goal:

// pointer[123]

// Fix-It:

// remove '*('

// replace ' + ' with '['

// replace ')' with ']'

// example:

// *((..(123 + pointer)..))

// goal:

// 123[pointer]

// Fix-It:

// remove '*('

// replace ' + ' with '['

// replace ')' with ']'

const Expr *LHS = AddOp->getLHS(), *RHS = AddOp->getRHS();

const SourceManager &SM = Ctx.getSourceManager();

const LangOptions &LangOpts = Ctx.getLangOpts();

CharSourceRange StarWithTrailWhitespace =

clang::CharSourceRange::getCharRange(DerefOp->getOperatorLoc(),

LHS->getBeginLoc());

CharSourceRange PlusWithSurroundingWhitespace =

clang::CharSourceRange::getCharRange(getPastLoc(LHS, SM, LangOpts),

RHS->getBeginLoc());

CharSourceRange ClosingParenWithPrecWhitespace =

clang::CharSourceRange::getCharRange(getPastLoc(AddOp, SM, LangOpts),

getPastLoc(DerefOp, SM, LangOpts));

return FixItList{

{FixItHint::CreateRemoval(StarWithTrailWhitespace),

FixItHint::CreateReplacement(PlusWithSurroundingWhitespace, "["),

FixItHint::CreateReplacement(ClosingParenWithPrecWhitespace, "]")}};

}

return std::nullopt; // something wrong or unsupported, give up

}

// For a non-null initializer `Init` of `T *` type, this function returns // For a non-null initializer `Init` of `T *` type, this function returns

// `FixItHint`s producing a list initializer `{Init, S}` as a part of a fix-it // `FixItHint`s producing a list initializer `{Init, S}` as a part of a fix-it

// to output stream. // to output stream.

// In many cases, this function cannot figure out the actual extent `S`. It // In many cases, this function cannot figure out the actual extent `S`. It

// then will use a place holder to replace `S` to ask users to fill `S` in. The // then will use a place holder to replace `S` to ask users to fill `S` in. The

// initializer shall be used to initialize a variable of type `std::span<T>`. // initializer shall be used to initialize a variable of type `std::span<T>`.

// //

// FIXME: Support multi-level pointers // FIXME: Support multi-level pointers

▲ Show 20 Lines • Show All 263 Lines • Show Last 20 Lines

clang/test/SemaCXX/warn-unsafe-buffer-usage-fixits-deref-simple-ptr-arith.cpp

This file was added.

// RUN: %clang_cc1 -std=c++20 -Wunsafe-buffer-usage -fdiagnostics-parseable-fixits -fsyntax-only %s 2>&1 | FileCheck %s

// TODO test we don't mess up vertical whitespace

// TODO test different whitespaces

// TODO test different contexts

// when it's on the right side

void basic() {

int *ptr;

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:3-[[@LINE-1]]:11}:"std::span<int> ptr"

*(ptr+5)=1;

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:3-[[@LINE-1]]:5}:""

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:8-[[@LINE-2]]:9}:"["

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:10-[[@LINE-3]]:11}:"]"

}

// The weird preceding semicolon ensures that we preserve that range intact.

void char_ranges() {

int *p;

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:3-[[@LINE-1]]:9}:"std::span<int> p"

;* ( p + 5 ) = 1;

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:4-[[@LINE-1]]:8}:""

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:9-[[@LINE-2]]:12}:"["

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:13-[[@LINE-3]]:15}:"]"

;* (p+5)= 1;

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:4-[[@LINE-1]]:9}:""

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:10-[[@LINE-2]]:11}:"["

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:12-[[@LINE-3]]:13}:"]"

;*( p+5)= 1;

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:4-[[@LINE-1]]:9}:""

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:10-[[@LINE-2]]:11}:"["

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:12-[[@LINE-3]]:13}:"]"

;*( p+5)= 1;

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:4-[[@LINE-1]]:9}:""

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:10-[[@LINE-2]]:11}:"["

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:12-[[@LINE-3]]:13}:"]"

;*( p +5)= 1;

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:4-[[@LINE-1]]:7}:""

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:8-[[@LINE-2]]:12}:"["

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:13-[[@LINE-3]]:14}:"]"

;*(p+ 5)= 1;

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:4-[[@LINE-1]]:6}:""

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:7-[[@LINE-2]]:11}:"["

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:12-[[@LINE-3]]:13}:"]"

;*(p+ 5 )= 1;

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:4-[[@LINE-1]]:6}:""

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:7-[[@LINE-2]]:9}:"["

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:10-[[@LINE-3]]:14}:"]"

;*(p+ 5) = 1;

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:4-[[@LINE-1]]:6}:""

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:7-[[@LINE-2]]:9}:"["

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:10-[[@LINE-3]]:11}:"]"

; *(p+5)= 1;

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:7-[[@LINE-1]]:9}:""

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:10-[[@LINE-2]]:11}:"["

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:12-[[@LINE-3]]:13}:"]"

;*(p+123456)= 1;

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:4-[[@LINE-1]]:6}:""

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:7-[[@LINE-2]]:8}:"["

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:14-[[@LINE-3]]:15}:"]"

;* (p+123456)= 1;

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:4-[[@LINE-1]]:9}:""

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:10-[[@LINE-2]]:11}:"["

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:17-[[@LINE-3]]:18}:"]"

;*( p+123456)= 1;

NoQUnsubmitted

Not Done

// CHECK-NOT: [

- // Array subsctipt opertor of std::span accepts unsigned integer.

+ // Array subscript operator of std::span accepts unsigned integer.

void negative() {

NoQ:

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:4-[[@LINE-1]]:9}:""

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:10-[[@LINE-2]]:11}:"["

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:17-[[@LINE-3]]:18}:"]"

;*( p+123456)= 1;

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:4-[[@LINE-1]]:9}:""

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:10-[[@LINE-2]]:11}:"["

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:17-[[@LINE-3]]:18}:"]"

;*(p +123456)= 1;

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:4-[[@LINE-1]]:6}:""

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:7-[[@LINE-2]]:11}:"["

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:17-[[@LINE-3]]:18}:"]"

;*(p+ 123456)= 1;

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:4-[[@LINE-1]]:6}:""

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:7-[[@LINE-2]]:11}:"["

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:17-[[@LINE-3]]:18}:"]"

;*(p+123456 )= 1;

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:4-[[@LINE-1]]:6}:""

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:7-[[@LINE-2]]:8}:"["

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:14-[[@LINE-3]]:18}:"]"

;*(p+123456) = 1;

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:4-[[@LINE-1]]:6}:""

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:7-[[@LINE-2]]:8}:"["

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:14-[[@LINE-3]]:15}:"]"

int *ptrrrrrr;

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:3-[[@LINE-1]]:16}:"std::span<int> ptrrrrrr"

;* ( ptrrrrrr + 123456 )= 1;

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:4-[[@LINE-1]]:8}:""

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:16-[[@LINE-2]]:19}:"["

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:25-[[@LINE-3]]:27}:"]"

;* (ptrrrrrr+123456)= 1;

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:4-[[@LINE-1]]:9}:""

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:17-[[@LINE-2]]:18}:"["

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:24-[[@LINE-3]]:25}:"]"

;*( ptrrrrrr+123456)= 1;

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:4-[[@LINE-1]]:9}:""

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:17-[[@LINE-2]]:18}:"["

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:24-[[@LINE-3]]:25}:"]"

;*( ptrrrrrr+123456)= 1;

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:4-[[@LINE-1]]:9}:""

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:17-[[@LINE-2]]:18}:"["

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:24-[[@LINE-3]]:25}:"]"

;*(ptrrrrrr +123456)= 1;

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:4-[[@LINE-1]]:6}:""

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:14-[[@LINE-2]]:18}:"["

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:24-[[@LINE-3]]:25}:"]"

;*(ptrrrrrr+ 123456)= 1;

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:4-[[@LINE-1]]:6}:""

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:14-[[@LINE-2]]:18}:"["

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:24-[[@LINE-3]]:25}:"]"

;*(ptrrrrrr+123456 )= 1;

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:4-[[@LINE-1]]:6}:""

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:14-[[@LINE-2]]:15}:"["

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:21-[[@LINE-3]]:25}:"]"

;*(ptrrrrrr+123456) = 1;

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:4-[[@LINE-1]]:6}:""

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:14-[[@LINE-2]]:15}:"["

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:21-[[@LINE-3]]:22}:"]"

}

void base_on_rhs() {

int* ptr;

*(10 + ptr) = 1;

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:3-[[@LINE-1]]:5}:""

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:7-[[@LINE-2]]:10}:"["

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:13-[[@LINE-3]]:14}:"]"

}

void many_parens() {

int* ptr;

*(( (10 + ptr)) ) = 1;

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:3-[[@LINE-1]]:8}:""

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:10-[[@LINE-2]]:13}:"["

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:16-[[@LINE-3]]:20}:"]"

}

void lvaue_to_rvalue() {

int * ptr;

int tmp = *(ptr + 10);

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:13-[[@LINE-1]]:15}:""

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:18-[[@LINE-2]]:21}:"["

// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:23-[[@LINE-3]]:24}:"]"

}

// Fixits emitted for the cases below would be incorrect.

// CHECK-NOT: fix-it:

// Array subsctipt opertor of std::span accepts unsigned integer.

void negative() {

int* ptr;

*(ptr + -5) = 1; // skip

}

void subtraction() {

int* ptr;

*(ptr - 5) = 1; // skip

}

void subtraction_of_negative() {

int* ptr;

*(ptr - -5) = 1; // FIXME: implement fixit (uncommon case - low priority)

}

void bindingDecl(int *p, int *q) {

int * a[2] = {p, q};

auto [x, y] = a;

*(x + 1) = 1; // FIXME: deal with `BindingDecl`s

}

This is an archive of the discontinued LLVM Phabricator instance.

[-Wunsafe-buffer-usage] Add Fixable for dereference of simple ptr arithmeticClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 506782

clang/include/clang/Analysis/Analyses/UnsafeBufferUsageGadgets.def

clang/lib/Analysis/UnsafeBufferUsage.cpp

clang/test/SemaCXX/warn-unsafe-buffer-usage-fixits-deref-simple-ptr-arith.cpp

[-Wunsafe-buffer-usage] Add Fixable for dereference of simple ptr arithmetic
ClosedPublic