This is an archive of the discontinued LLVM Phabricator instance.

Differential D145739

[-Wunsafe-buffer-usage] Group variables associated by pointer assignments
ClosedPublic

Authored by t-rasmud on Mar 9 2023, 4:14 PM.

Details

Reviewers

jkorous
NoQ
ziqingluo-90
malavikasamak
aaron.ballman
gribozavr
ymandel

Commits

rG171dfc5462a2: [-Wunsafe-buffer-usage] Group variables associated by pointer assignments
rGee6b08e99375: [-Wunsafe-buffer-usage] Group variables associated by pointer assignments

Summary

For assignments involving pointer types, if the machine decides that the LHS pointer has to be fixed to std::span, this patch introduces the same fixit for the RHS pointer.
It groups all such pointer variables (that transitively depend on the unsafe pointer) and suggests that they all be fixed atomically to correctly propagate bounds information among them.
TODOs: UUC, variables claimed by multiple gadgets (ex: int *p = q), handle parameters.

Event Timeline

t-rasmud created this revision. Mar 9 2023, 4:14 PM

Herald added a project: Restricted Project. Mar 9 2023, 4:14 PM

t-rasmud requested review of this revision. Mar 9 2023, 4:14 PM

Harbormaster completed remote builds in B218543: Diff 503962. Mar 9 2023, 4:15 PM

Brilliant!!

I guess we can abandon D143133 now in favor of this patch?

Before I jump to nitpicking, I think we should sync up on what the end result should look like, in terms of the warnings and notes that we display. This may affect the overall algorithm, for example by requiring us to gather more data.

clang/test/SemaCXX/warn-unsafe-buffer-usage-multi-decl-warnings.cpp
5–14	We probably want to build some finer narrative here 🤔 In this case `p` and `q` are both independently unsafe and there's a one-way connection between them. Would it make sense to emit just one warning for both of them? I suspect it also makes sense to leave a note at `q = p` with a text like "note: bounds information needs to propagate from `p` to `q` here" (I've no idea if it'll scale to more than 2 variables, or how much more information we'll need to gather).
57–63	In this case we definitely want to spanify `q`, for two independent reasons. We need to figure out how to tell that to the user though 🤔

t-rasmud mentioned this in D143133: [-Wunsafe-buffer-usage][WIP] Add fixit for variables that have an assignment from another spanified pointer. Mar 14 2023, 11:42 AM

I have one general question: suppose we have two variables A and B in the same group. So the fix-its emitted for A include the fix-its for B and vice versa, right? Does it mean that we cannot apply all fix-its at once from the command line?

clang/include/clang/Analysis/Analyses/UnsafeBufferUsage.h
42	It would be nice to have documentation for this function since it is in the header.
clang/lib/Analysis/UnsafeBufferUsage.cpp
277	So every Fixable needs to implement this. I would love to see documentation for this function.
587	`DeclPtrAtLeft` is not used.
1828	It seems like we do not care about the direction of edges in the graph, maybe we should call it `UndirectedGraph`?
clang/lib/Sema/AnalysisBasedWarnings.cpp
2267	Would it be better to make `VariableGroups` a map from `VarDecl`s to groups?

t-rasmud added a parent revision: D143048: [-Wunsafe-buffer-usage] Add T* -> span<T> Fix-Its for function parameters. Mar 15 2023, 2:44 PM

t-rasmud updated this revision to Diff 507863. Mar 23 2023, 1:40 PM

Harbormaster completed remote builds in B221405: Diff 507863. Mar 23 2023, 1:41 PM

ziqingluo-90 added inline comments. Mar 29 2023, 11:03 AM

clang/lib/Sema/AnalysisBasedWarnings.cpp
2250	These note messages are nice. Maybe we can have some tests for them. It will also help telling whether variables are grouped correctly.
clang/test/SemaCXX/warn-unsafe-buffer-usage-multi-decl-fixits-test.cpp
139	Could we have an example where there is a cyclic dependency? Such as `p = q; q = r; r = p;`.

Adds tests that were accidentally deleted in the last diff.

Harbormaster completed remote builds in B222577: Diff 509440. Mar 29 2023, 12:32 PM

Handle the case where implied variables don't have a fixit strategy
TODO: address feedback for the previous diff

Harbormaster completed remote builds in B223894: Diff 511227. Apr 5 2023, 3:54 PM

Address review comments.

Harbormaster completed remote builds in B224074: Diff 511492. Apr 6 2023, 11:33 AM

t-rasmud marked 4 inline comments as done. Apr 6 2023, 11:48 AM

t-rasmud added inline comments.

clang/lib/Analysis/UnsafeBufferUsage.cpp
1828	I changed it to `PtrAssignmentGraph`, which seems (at least to me) to give more context. It is still a directed graph because `Deps[0]` represents the LHS pointer and `Deps[1]` the RHS pointer. Hopefully the new documentation on `getStrategyImplications` makes this clear!
clang/test/SemaCXX/warn-unsafe-buffer-usage-multi-decl-warnings.cpp
5–14	The new note is more descriptive and summarizes the reason for grouping variables together. We still don't inform the user of the direction of bounds propagation or the fact that buffers are independently unsafe. Like we discussed offline, we could consider these refinements in future versions of the tool.

t-rasmud edited the summary of this revision. Apr 6 2023, 11:52 AM

ziqingluo-90 added inline comments. Apr 6 2023, 2:44 PM

clang/lib/Analysis/UnsafeBufferUsage.cpp
579	I was wondering if we need this sub-matcher `declRefExpr(expr()).bind(PointerAssignRHSTag)`. It is eventually used as an inner matcher of `binaryOperator`. I doubt it will match anything. Ditto for line 582-583.
1658	How about using `FixablesForUnsafeVars.byVar[Var]` instead of the outer loop?

Address review comments.

Harbormaster completed remote builds in B224127: Diff 511552. Apr 6 2023, 3:39 PM

t-rasmud marked 2 inline comments as done. Apr 6 2023, 3:40 PM

t-rasmud added inline comments.

clang/lib/Analysis/UnsafeBufferUsage.cpp
579	Great catch @ziqingluo-90! I was experimenting to match assignments at declarations and never intended for those sub-matchers to be part of this patch.

I think it'd be really valuable to document the algorithm.

It's essential for the reader to understand why this is a two-step process and why a simple flood-fill through a bidirectional implications graph isn't going to be correct.

Then, you did something very clever to avoid finding intersections between directed "zones of influence" of every variable (and then joining them), and I really like it. In the inline comments I have a couple of suggestions on how to demonstrate to the reader that it's equivalent to (my) intuitive understanding of the problem. It looks like your algorithm is pretty much linear over the size of the AST, which is great because this means we don't have to worry too much about budgets! This is probably worth proclaiming as well.

clang/lib/Analysis/UnsafeBufferUsage.cpp
277–281	You're saying this is a two-element list with well-defined LHS and RHS, but the caller implementation accepts an arbitrarily long list and treats it as an implication from item [0] to items [1 ... N-1]. I'm not really sure how it should behave in the more-than-two-elements case. IIRC our primary motivating example for more than 2 elements is `foo(p1, p2, ..., pN)` where `foo` has attribute `[[unsafe_buffer_usage]]` with respect to all N parameters and has a safe version that accepts N spans, but no safe versions for smaller subsets of parameters. If we try to represent it with these "directed" vectors, it becomes quadratic over N as you'll need edges from each argument to each argument. Maybe before we get to handling the other case, it makes sense to have an abstract class `StrategyImplication`, so that assignment-like implication was one concrete subclass, and a symmetric set of N related variables was another concrete subclass. Then the call site can handle these cases separately. For the purposes of this patch it probably makes sense for this function to just return `optional<pair<VarDecl *>>`, so as to manage the reader's expectations :)
1760	Probably better to compare pointers. There could be variables with the same name in the same function (in smaller scopes). Also IIRC `getName()` crashes on anonymous declarations (not sure if it can happen in this case).
1820	I was about to say that there could be fixables that don't connect any unsafe variables, but connect two implicated variables together, so you need to iterate over all fixables instead. But then I noticed that `FixablesForUnsafeVars` is actually mis-named 🙁 According to `groupFixablesByVar()`, it contains fixables for all variables, sorted by variable, and nobody ever checks whether a variable is unsafe. It'd be really valuable if we could rename it before or together with this patch! – as it makes the code much harder to read and reason about, there could even be existing bugs based on this misunderstanding. Also, iterating over `FixablesForUnsafeVars.byVar` here would cause you to visit multivariable fixables twice (once for each variable), so there could be a bit of duplicated work here, maybe it still makes sense to iterate over the whole list.
1849–1850	Using `Var` here instead of `CurrentVar` is really clever. I think there needs to be a comment bragging about this decision! I was quite confused initially when I was thinking of it as essential to the algorithm, whereas in reality it looks like it's just an optimization. If `CurrentVar` was used, this would have straightforwardly meant "Just keep the edges in the original graph that are reachable from unsafe variables, and add reverse edges; and also no need to explore from the same var more than once". It's somewhat obvious that it would correctly define the graph for the second part of the algorithm. By using `Var` you're most likely achieving the same result, but dramatically cutting the amount of hoops the second part of the algorithm needs to jump through, given that a lot of these connections become "direct". But this results in a dramatically different graph, and the shape of that graph is non-deterministic (depends on iteration order in this loop). So I guess my point is, it's valuable to tell the reader that they shouldn't try to imagine how the graph is transformed because of using `Var` instead of `CurrentVar`. It's still essentially the same graph, just a bit shallower. The reader can continue reading as if `CurrentVar` was used, and think about why it's equivalent later.
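
To make the two-step idea discussed above concrete, here is a minimal sketch. It is not the code from the patch: variables are plain strings instead of `const VarDecl *`, and the names `keepReachableEdges` and `connectedGroups` are hypothetical. It follows the simpler `CurrentVar` formulation described in the comment above (keep the edges reachable from unsafe variables, add reverse edges, then flood-fill), not the `Var` shortcut.

```cpp
#include <cassert>
#include <map>
#include <queue>
#include <set>
#include <string>
#include <vector>

// Hypothetical stand-ins for illustration only.
using Var = std::string;
using Graph = std::map<Var, std::set<Var>>;

// Step 1: walk the directed assignment graph (edges go from the LHS of an
// assignment to its RHS) starting at every unsafe variable. Every edge that
// is reached is kept and mirrored, so the result is undirected.
Graph keepReachableEdges(const Graph &Assignments,
                         const std::set<Var> &Unsafe) {
  Graph Undirected;
  std::set<Var> Visited;
  for (const Var &Start : Unsafe) {
    if (!Visited.insert(Start).second)
      continue; // already explored from an earlier start
    std::queue<Var> Work;
    Work.push(Start);
    while (!Work.empty()) {
      Var Current = Work.front();
      Work.pop();
      Undirected[Current]; // a variable with no edges is still a node
      auto It = Assignments.find(Current);
      if (It == Assignments.end())
        continue;
      for (const Var &Adj : It->second) {
        Undirected[Current].insert(Adj);
        Undirected[Adj].insert(Current);
        if (Visited.insert(Adj).second)
          Work.push(Adj);
      }
    }
  }
  return Undirected;
}

// Step 2: an ordinary flood-fill over the undirected graph; each connected
// component is one group of variables to fix together.
std::vector<std::set<Var>> connectedGroups(const Graph &G) {
  std::vector<std::set<Var>> Groups;
  std::set<Var> Seen;
  for (const auto &Entry : G) {
    if (!Seen.insert(Entry.first).second)
      continue;
    std::set<Var> Group;
    std::queue<Var> Work;
    Work.push(Entry.first);
    while (!Work.empty()) {
      Var Current = Work.front();
      Work.pop();
      Group.insert(Current);
      assert(G.count(Current) == 1 && "every neighbour is also a key");
      for (const Var &Adj : G.at(Current))
        if (Seen.insert(Adj).second)
          Work.push(Adj);
    }
    Groups.push_back(Group);
  }
  return Groups;
}
```

With assignments `q = p` and `r = p` where only `q` is unsafe, this yields the single group `{p, q}`. A plain flood-fill over the bidirectional graph would also pull in `r`, which is the case the two-step process is meant to avoid. Each variable is explored at most once in either step.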

This looks good!
I am trying to catch up with this patch and my comments are meant as "Did we consider case X?" rather than "This doesn't work for case X."

clang/lib/Analysis/UnsafeBufferUsage.cpp
577	Shall we back-port how `matcher()` and `getFixits()` methods split responsibilities that we've started to use in more recent patches? For other fixables we now check if the DRE refers to a var decl already in the matcher like this: `declRefExpr(to(varDecl()))`
1654	Array subscript operator adds an entry to the map if the key is not found. https://en.cppreference.com/w/cpp/container/map/operator_at That might or might not be ok here. But we might also just take `FixablesForUnsafeVars` as a `const` reference and use `find()` instead. https://en.cppreference.com/w/cpp/container/map/find
1695	I am trying to understand how the Fixables matched to other variables in the `VarGroupForVD` learn that they shouldn't emit any Fix-It. I was naively expecting that we'd do: `for (const VarDecl * V : VarGroupForVD) { FixItsForVariable.erase(VD); }`
1699	Note: `VarGrpMap` being `std::map` inserts a record if `VD` is not among the keys yet.
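
The `std::map` behaviour mentioned in the two notes above can be illustrated with a small standalone sketch. The map type and the helper names `findGroup` and `sizeAfterSubscript` are assumptions made up for this example (string keys instead of `const VarDecl *`); only `std::map::operator[]` and `std::map::find` are real library calls.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <string>
#include <vector>

// Hypothetical stand-in for a variable-group map.
using GroupMap = std::map<std::string, std::vector<std::string>>;

// Read-only lookup: works on a const reference and never inserts.
const std::vector<std::string> *findGroup(const GroupMap &Groups,
                                          const std::string &Key) {
  auto It = Groups.find(Key);
  return It == Groups.end() ? nullptr : &It->second;
}

// Lookup through operator[]: a missing key is default-inserted as a side
// effect. The map is taken by value so the caller's copy stays untouched.
std::size_t sizeAfterSubscript(GroupMap Groups, const std::string &Key) {
  (void)Groups[Key];
  return Groups.size();
}
```

Taking the map as a `const` reference, as suggested above, also makes the compiler reject accidental uses of `operator[]`, since it is not a `const` member function.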

jkorous added inline comments. Apr 7 2023, 6:04 PM

clang/lib/Analysis/UnsafeBufferUsage.cpp
1760	+1 to comparing pointers. That's what we do above.
1827	IIUC we're copying elements from an `std::vector` to an `std::set`. Nit: Maybe we could avoid the explicit iteration? `assert(Deps.size() > 1); PtrAssignmentGraph[Deps[0]].insert(Deps.begin() + 1, Deps.end());` But I feel we might want to handle the case where `Deps` has exactly one element regardless.
1849–1850	+1 to adding comments. Our future selves might be pretty grateful if they have to debug this code :)
clang/test/SemaCXX/warn-unsafe-buffer-usage-multi-decl-warnings.cpp
2	I am looking for a test that checks that if one of the variables in the group is referred to by at least one DRE for which we don't have a Fix-It (e. g. negative index for `span` strategy) that the whole group will remain Fix-It-less (and note-less). I can't find any and maybe that's just because I didn't look hard enough but it makes me wonder - should we add a short explanation to such test (either an existing one or the one we should add)?
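
The nit above about copying `Deps` into the graph without an explicit loop could look roughly like the following sketch. The string-based types and the function name `addImplication` are hypothetical; here a list with fewer than two elements is silently skipped rather than asserted on, which is only one possible answer to the "exactly one element" question raised in the comment.

```cpp
#include <cassert>
#include <map>
#include <set>
#include <string>
#include <vector>

// Hypothetical stand-ins for illustration only.
using Var = std::string;
using Graph = std::map<Var, std::set<Var>>;

// Records "Deps[0] implies Deps[1..N-1]" using the iterator-range overload
// of std::set::insert instead of a hand-written loop.
void addImplication(Graph &G, const std::vector<Var> &Deps) {
  if (Deps.size() < 2)
    return; // nothing to record for an empty or single-element list
  G[Deps[0]].insert(Deps.begin() + 1, Deps.end());
}
```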

Address partial feedback.

Harbormaster completed remote builds in B225182: Diff 512973. Apr 12 2023, 2:12 PM

I'm working on fixing multiple parameters at a time. It is based on this patch. So I suddenly had a few more questions.

clang/lib/Analysis/UnsafeBufferUsage.cpp
1712	If `VD` is removed (it means giving up on fixing `VD`), do we also need to remove the whole group of `VD`?
1764	Could we do the grouping after all variables having their fix-its generated so that we can directly copy their fix-its instead of re-generating them? This requires fix-its generated for two different variables to be independent. And, I think so far they are independent.

ziqingluo-90 removed a parent revision: D143048: [-Wunsafe-buffer-usage] Add T* -> span<T> Fix-Its for function parameters. Apr 14 2023, 11:11 AM

ziqingluo-90 added a child revision: D143048: [-Wunsafe-buffer-usage] Add T* -> span<T> Fix-Its for function parameters.

ziqingluo-90 added a parent revision: D143628: [-Wunsafe-buffer-usage][PoC][WIP] Add #include <span> to emitted Fix-Its.

t-rasmud marked 6 inline comments as done. Apr 25 2023, 12:00 PM

t-rasmud added inline comments.

clang/lib/Analysis/UnsafeBufferUsage.cpp
1712	That is right, and a good optimization to have. I'll include it in the next iteration of the patch.

Address more feedback.

Harbormaster completed remote builds in B228100: Diff 516910. Apr 25 2023, 2:26 PM

Minor fix.

t-rasmud marked 2 inline comments as done. Apr 25 2023, 2:38 PM

Harbormaster completed remote builds in B228113: Diff 516923. Apr 25 2023, 2:41 PM

t-rasmud marked an inline comment as done. Apr 25 2023, 3:26 PM

Address feedback.

Harbormaster completed remote builds in B228444: Diff 517385. Apr 26 2023, 4:42 PM

t-rasmud marked 3 inline comments as done. Apr 26 2023, 4:44 PM

t-rasmud added inline comments.

clang/lib/Analysis/UnsafeBufferUsage.cpp
277–281	You are right, I hadn't considered the case with more than two elements and will address it in a separate patch.

Add documentation.

t-rasmud marked 2 inline comments as done. Apr 28 2023, 2:59 PM

Harbormaster completed remote builds in B228934: Diff 518064. Apr 28 2023, 3:00 PM

t-rasmud retitled this revision from [-Wunsafe-buffer-usage][WIP] Group variables associated by pointer assignments to [-Wunsafe-buffer-usage] Group variables associated by pointer assignments. Apr 28 2023, 3:00 PM

t-rasmud added reviewers: aaron.ballman, gribozavr, ymandel.

Alright I think everything looks mostly good from high-level perspective! I have a few minor nitpicks.

clang/include/clang/Analysis/Analyses/UnsafeBufferUsage.h
35–37	We can remove the old interface now right? You removed both the definition and the call site.
43	We should probably turn `VarGrpMap` into a const reference. Otherwise it'll be deep-copied every time we call the function. It's probably also better to make a typedef for this map type, as it shows up multiple times. Does this map need to be ordered/tree-based? Maybe we should use `std::unordered_map` or `llvm::DenseMap`?
clang/lib/Analysis/UnsafeBufferUsage.cpp
579	Looks like the `hasOperatorName("=")` check is duplicated on both sides.
597–598	We're confident that this cast always succeeds. And we really don't want to see null pointers inside these pairs.
1125–1126	(valid use case for `auto` according to https://llvm.org/docs/CodingStandards.html#use-auto-type-deduction-to-make-code-more-readable)
clang/lib/Sema/AnalysisBasedWarnings.cpp
2215	We should probably find a way to put all these plain text pieces into `DiagnosticSemaKinds.td` (except probably the `and` part). I wonder if a `... %select{|, and change %2 to %select{std::span|std::array|std::span::iterator}1 to propagate bounds information between them'.}` would work (like, nest more format specifiers into a `%select`). Or we could keep using the old diagnostic id when we don't need extra stuff at the end. (It might be a good idea to make a new format specifier for this purpose, like `plural` but with lists. But this is probably overkill if there's just one warning of this kind.)
clang/test/SemaCXX/warn-unsafe-buffer-usage-multi-decl-warnings.cpp
44	This is a FIXME test right? In theory we want to propagate all the way up to `a`, but we only support assignments so far, not initializations.

TODOs: UUC

I think now is a good time to address this. The patch isn't going to be correct without it so we can't land it until the context is checked.

Address comments.

Harbormaster completed remote builds in B229832: Diff 519273. May 3 2023, 3:09 PM

t-rasmud marked 4 inline comments as done. May 3 2023, 3:10 PM

t-rasmud marked an inline comment as done.

t-rasmud added inline comments. May 3 2023, 3:21 PM

clang/include/clang/Analysis/Analyses/UnsafeBufferUsage.h
35–37	Assuming you're referring to `handleFixableVariable` I have already removed it in this patch. Did I miss something here?

Modify diagnostic message.

Harbormaster completed remote builds in B229849: Diff 519297. May 3 2023, 4:24 PM

ziqingluo-90 removed a parent revision: D143628: [-Wunsafe-buffer-usage][PoC][WIP] Add #include <span> to emitted Fix-Its. May 5 2023, 11:33 AM

ziqingluo-90 added a parent revision: D146773: [-Wunsafe-buffer-usage] Make raw (ungrouped) warnings a bit more verbose..

t-rasmud marked an inline comment as done. May 9 2023, 11:28 AM

Use nested %select in format specifier of the diagnostic message.

Harbormaster completed remote builds in B230965: Diff 520821. May 9 2023, 2:40 PM

t-rasmud marked an inline comment as done. May 9 2023, 2:42 PM

Friendly Ping.

Ok this looks great to me, almost ready to land! Still needs the UUC work.

clang/include/clang/Analysis/Analyses/UnsafeBufferUsage.h
35–37	Yeah nvm I misread, you're absolutely right!
clang/include/clang/Basic/DiagnosticSemaKinds.td
11793–11796	Ok this one's also unused right?
11796	Found a typo! Also I think we shouldn't have single quotes around `%0` here, but instead we're supposed to stuff our `const VarDecl *` directly into the diagnostic stream, just like we already do in `warn_unsafe_buffer_variable`, which automatically surrounds it with single quotes. (unfortunately we can't do the same with `%2` so we add quotes manually when we build the dynamic string)
clang/lib/Analysis/UnsafeBufferUsage.cpp
557

t-rasmud mentioned this in D150489: [-Wunsafe-buffer-usage] Handle pointer initializations for grouping related variables. May 12 2023, 3:07 PM

t-rasmud added a child revision: D150489: [-Wunsafe-buffer-usage] Handle pointer initializations for grouping related variables. May 12 2023, 3:10 PM

t-rasmud added a child revision: D150811: [-Wunsafe-buffer-usage] Handle pointer assignments only in unspecified contexts. May 17 2023, 1:08 PM

NoQ mentioned this in D150811: [-Wunsafe-buffer-usage] Handle pointer assignments only in unspecified contexts. May 17 2023, 1:10 PM

Squash changes from https://reviews.llvm.org/D150811.

Harbormaster completed remote builds in B233036: Diff 523590. May 18 2023, 3:47 PM

Address comments.

Harbormaster completed remote builds in B233039: Diff 523596. May 18 2023, 4:10 PM

t-rasmud marked 3 inline comments as done. May 18 2023, 4:11 PM

Ok I think this is good to go! Amazing work!!

This revision is now accepted and ready to land. May 22 2023, 11:55 AM

This revision was landed with ongoing or failed builds. May 24 2023, 4:21 PM

Closed by commit rGee6b08e99375: [-Wunsafe-buffer-usage] Group variables associated by pointer assignments (authored by t-rasmud).

This revision was automatically updated to reflect the committed changes.

t-rasmud added a commit: rGee6b08e99375: [-Wunsafe-buffer-usage] Group variables associated by pointer assignments.

Herald added a project: Restricted Project. May 24 2023, 4:21 PM

Herald added a subscriber: cfe-commits.

Hi, just a heads-up, some bots seem to be unhappy with the test:

https://lab.llvm.org/buildbot/#/builders/216/builds/21765

error: 'note' diagnostics expected but not seen: 
  File Z:\b\llvm-clang-x86_64-sie-win\llvm-project\clang\test\SemaCXX\warn-unsafe-buffer-usage-multi-decl-warnings.cpp Line 169: {{^change type of 'q' to 'std::span' to preserve bounds information, and change 'r' and 'p' to 'std::span' to propagate bounds information between them$}}
error: 'note' diagnostics seen but not expected: 
  File Z:\b\llvm-clang-x86_64-sie-win\llvm-project\clang\test\SemaCXX\warn-unsafe-buffer-usage-multi-decl-warnings.cpp Line 169: change type of 'q' to 'std::span' to preserve bounds information, and change 'p' and 'r' to 'std::span' to propagate bounds information between them
2 errors generated.

In D145739#4370674, @barannikov88 wrote:

Hi, just a heads-up, some bots seem to be unhappy with the test:

https://lab.llvm.org/buildbot/#/builders/216/builds/21765

error: 'note' diagnostics expected but not seen: 
  File Z:\b\llvm-clang-x86_64-sie-win\llvm-project\clang\test\SemaCXX\warn-unsafe-buffer-usage-multi-decl-warnings.cpp Line 169: {{^change type of 'q' to 'std::span' to preserve bounds information, and change 'r' and 'p' to 'std::span' to propagate bounds information between them$}}
error: 'note' diagnostics seen but not expected: 
  File Z:\b\llvm-clang-x86_64-sie-win\llvm-project\clang\test\SemaCXX\warn-unsafe-buffer-usage-multi-decl-warnings.cpp Line 169: change type of 'q' to 'std::span' to preserve bounds information, and change 'p' and 'r' to 'std::span' to propagate bounds information between them
2 errors generated.

The test seems to randomly pass and fail on the bot. It seems that the order of 'p' and 'r' in the output string may not be deterministic? Can you make the test more reliable or make it handle each situation (if appropriate)?
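
One generic way to make such a message independent of container iteration order is to sort the names before joining them. The sketch below is only an illustration of that idea; it is not taken from the patch and is not necessarily how the re-landed commit addressed the problem. The helper name `joinNames` and the exact list punctuation are assumptions.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical helper: builds "'a', 'b' and 'c'" from variable names.
// The names are sorted first, so the result does not depend on the order
// in which the caller happened to collect them.
std::string joinNames(std::vector<std::string> Names) {
  std::sort(Names.begin(), Names.end());
  std::string Out;
  for (std::size_t I = 0; I < Names.size(); ++I) {
    if (I > 0)
      Out += (I + 1 == Names.size()) ? " and " : ", ";
    Out += "'" + Names[I] + "'";
  }
  return Out;
}
```

Containers keyed or ordered by pointer value are a common source of this kind of run-to-run variation, because the addresses of the pointed-to objects are not stable between runs.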

dyung added a reverting change: rG2e6325c71feb: Revert "[-Wunsafe-buffer-usage] Group variables associated by pointer…. May 25 2023, 2:10 AM

@t-rasmud I am sorry, but I had to revert your change: it is randomly failing on the Windows bots, causing a lot of instability. See the revert commit message for examples.

Hi Douglas,

No worries, I know the root cause for the issue. I will make the necessary changes and re-commit the patch.

Thanks,
Rashmi

t-rasmud added a commit: rG171dfc5462a2: [-Wunsafe-buffer-usage] Group variables associated by pointer assignments. May 25 2023, 11:32 AM

Thank you Sergei for catching this! Thank you Rashmi for fixing this!

NoQ removed a parent revision: D146773: [-Wunsafe-buffer-usage] Make raw (ungrouped) warnings a bit more verbose.. Aug 23 2023, 5:31 PM

Revision Contents

Path	Size
clang/include/clang/Analysis/Analyses/UnsafeBufferUsage.h	9 lines
clang/include/clang/Analysis/Analyses/UnsafeBufferUsageGadgets.def	1 line
clang/include/clang/Basic/DiagnosticSemaKinds.td	2 lines
clang/lib/Analysis/UnsafeBufferUsage.cpp	246 lines
clang/lib/Sema/AnalysisBasedWarnings.cpp	56 lines
clang/test/SemaCXX/warn-unsafe-buffer-usage-multi-decl-fixits-test.cpp	138 lines
clang/test/SemaCXX/warn-unsafe-buffer-usage-multi-decl-warnings.cpp	346 lines

Diff 512973

clang/include/clang/Analysis/Analyses/UnsafeBufferUsage.h

(26 lines hidden)
  public:
  UnsafeBufferUsageHandler() = default;
  virtual ~UnsafeBufferUsageHandler() = default;

  /// This analyses produces large fixits that are organized into lists
  /// of primitive fixits (individual insertions/removals/replacements).
  using FixItList = llvm::SmallVectorImpl<FixItHint>;

  /// Invoked when an unsafe operation over raw pointers is found.
  virtual void handleUnsafeOperation(const Stmt *Operation,
  bool IsRelatedToDecl) = 0;

NoQ: We can remove the old interface now right? You removed both the definition and the call site.
t-rasmud: Assuming you're referring to `handleFixableVariable` I have already removed it in this patch. Did I miss something here?
NoQ: Yeah nvm I misread, you're absolutely right!

- /// Invoked when a fix is suggested against a variable.
- virtual void handleFixableVariable(const VarDecl *Variable,
- FixItList &&List) = 0;
+ /// Invoked when a fix is suggested against a variable. This function groups
+ /// all variables that must be fixed together (i.e their types must be changed to the
+ /// same target type to prevent type mismatches) into a single fixit.
+ virtual void handleUnsafeVariableGroup(const VarDecl *Variable,

ziqingluo-90: It could be nice to have a document for this function since it is in the header.

+ std::map<const VarDecl *, std::vector<const VarDecl *>> VarGrpMap,

NoQ: We should probably turn `VarGrpMap` into a const reference. Otherwise it'll be deep-copied every time we call the function. It's probably also better to make a typedef for this map type, as it shows up multiple times. Does this map need to be ordered/tree-based? Maybe we should use `std::unordered_map` or `llvm::DenseMap`?

+ FixItList &&Fixes) = 0;

  /// Returns a reference to the `Preprocessor`:
  virtual bool isSafeBufferOptOut(const SourceLocation &Loc) const = 0;

  /// Returns the text indicating that the user needs to provide input there:
  virtual std::string
  getUserFillPlaceHolder(StringRef HintTextToUser = "placeholder") const {
  std::string s = std::string("<# ");
(20 lines hidden)

clang/include/clang/Analysis/Analyses/UnsafeBufferUsageGadgets.def

(30 lines hidden)
  WARNING_GADGET(PointerArithmetic)
  WARNING_GADGET(UnsafeBufferUsageAttr)

  FIXABLE_GADGET(ULCArraySubscript) // `DRE[any]` in an Unspecified Lvalue Context
  FIXABLE_GADGET(PointerDereference)
  FIXABLE_GADGET(UPCAddressofArraySubscript) // '&DRE[any]' in an Unspecified Pointer Context
  FIXABLE_GADGET(DerefSimplePtrArithFixable)
  FIXABLE_GADGET(PointerCtxAccess)
+ FIXABLE_GADGET(PointerAssignment)

  #undef FIXABLE_GADGET
  #undef WARNING_GADGET
  #undef GADGET

clang/include/clang/Basic/DiagnosticSemaKinds.td

(11,784 lines hidden; enclosing context: def warn_unsafe_buffer_variable : Warning<)
  "does not perform bounds checks}1">,
  InGroup<UnsafeBufferUsage>, DefaultIgnore;
  def warn_unsafe_buffer_operation : Warning<
  "%select{unsafe pointer operation|unsafe pointer arithmetic|"
  "unsafe buffer access|function introduces unsafe buffer manipulation}0">,
  InGroup<UnsafeBufferUsage>, DefaultIgnore;
  def note_unsafe_buffer_operation : Note<
  "used%select{| in pointer arithmetic| in buffer access}0 here">;
  def note_unsafe_buffer_variable_fixit : Note<
  "change type of '%0' to '%select{std::span|std::array|std::span::iterator}1' to preserve bounds information">;
+ def note_unsafe_buffer_variable_fixit_group : Note<
+ "change type of '%0' to '%select{std::span|std::array|std::span::iterator}1' to preserve bounds information%2">;

NoQ (suggested edit):

  def note_unsafe_buffer_variable_fixit_group : Note<
- "change type of '%0' to '%select{std::span|std::array|std::span::iterator}1' to preserve bounds information%select{|, and change %2 to '%select{std::span|std::array|std::span::iterator}1' to propagate bounds information betwen them}3">;
+ "change type of '%0' to '%select{std::span|std::array|std::span::iterator}1' to preserve bounds information%select{|, and change %2 to '%select{std::span|std::array|std::span::iterator}1' to propagate bounds information between them}3">;
  def err_loongarch_builtin_requires_la32 : Error<

Found a typo!

Also I think we shouldn't have single quotes around `%0` here, but instead we're supposed to stuff our `const VarDecl *` directly into the diagnostic stream, just like we already do in `warn_unsafe_buffer_variable`, which automatically surrounds it with single quotes.

(unfortunately we can't do the same with `%2` so we add quotes manually when we build the dynamic string)

NoQ: Ok this one's also unused right?

  def err_loongarch_builtin_requires_la32 : Error<
  "this builtin requires target: loongarch32">;
  } // end of sema component.

clang/lib/Analysis/UnsafeBufferUsage.cpp

(18 lines hidden)
  #include "clang/Lex/Lexer.h"
  #include "clang/Lex/Preprocessor.h"
  #include "clang/Tooling/Inclusions/HeaderIncludes.h"
  #include "clang/Tooling/Inclusions/IncludeStyle.h"
  #include "clang/Sema/Lookup.h"
  #include "llvm/ADT/SmallVector.h"
  #include <memory>
  #include <optional>
+ #include <queue>
  using namespace llvm;
  using namespace clang;
  using namespace ast_matchers;
  namespace clang::ast_matchers {
  // A `RecursiveASTVisitor` that traverses all descendants of a given node "n"
  // except for those belonging to a different callable of "n".
(156 lines hidden)
  }
  } // namespace clang::ast_matchers
  namespace {
  // Because the analysis revolves around variables and their types, we'll need to
  // track uses of variables (aka DeclRefExprs).
  using DeclUseList = SmallVector<const DeclRefExpr *, 1>;
+ using ImplicationsList = std::vector<const VarDecl *>;
  // Convenience typedef.
  using FixItList = SmallVector<FixItHint, 4>;
  // Defined below.
  class Strategy;
  } // namespace
  namespace {
(58 lines hidden; enclosing context: public:)
  bool isWarningGadget() const final { return false; }
  /// Returns a fixit that would fix the current gadget according to
  /// the current strategy. Returns None if the fix cannot be produced;
  /// returns an empty list if no fixes are necessary.

virtual std::optional<FixItList> getFixits(const Strategy &) const { virtual std::optional<FixItList> getFixits(const Strategy &) const {

return std::nullopt; return std::nullopt;

} }

/// Returns a list of two elements where the first element is the LHS of a pointer assignment

ziqingluo-90Unsubmitted

Done

So every Fixable needs to implement this. I would love to see documentation for this function.


/// statement and the second element is the RHS. This two-element list represents the fact that

/// the LHS buffer gets its bounds information from the RHS buffer. This information will be used

/// later to group all those variables whose types must be modified together to prevent type

/// mismatches.

NoQUnsubmitted

Done

You're saying this is a two-element list with well-defined LHS and RHS, but the caller implementation accepts an arbitrarily long list and treats it as an implication from item [0] to items [1 ... N-1].

I'm not really sure how it *should* behave in the more-than-two-elements case.

IIRC our primary motivating example for more than 2 elements is foo(p1, p2, ..., pN) where foo has attribute [[unsafe_buffer_usage]] with respect to all N parameters and has a safe version that accepts N spans, but no safe versions for smaller subsets of parameters. If we try to represent it with these "directed" vectors, it becomes quadratic over N as you'll need edges from each argument to each argument.

Maybe before we get to handling the other case, it makes sense to have an abstract class StrategyImplication, so that the assignment-like implication is one concrete subclass, and a symmetric set of N related variables is another concrete subclass. Then the call site can handle these cases separately. For the purposes of this patch it probably makes sense for this function to just return optional<pair<const VarDecl *, const VarDecl *>>, so as to manage the reader's expectations :)
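To make the cost argument above concrete, here is a minimal sketch, not taken from the patch. It assumes plain strings as a hypothetical stand-in for `const VarDecl *`, and the helper names `allPairsEdges` and `assignmentImplication` are invented for illustration. It counts the directed edges needed to relate N variables pairwise and contrasts that with a single LHS/RHS pair.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for `const VarDecl *`.
using Var = std::string;

// Directed edges needed so that every variable in a group implies every other
// one. For N variables this yields N * (N - 1) edges, i.e. quadratic in N.
std::vector<std::pair<Var, Var>> allPairsEdges(const std::vector<Var> &Group) {
  std::vector<std::pair<Var, Var>> Edges;
  for (std::size_t I = 0; I < Group.size(); ++I)
    for (std::size_t J = 0; J < Group.size(); ++J)
      if (I != J)
        Edges.emplace_back(Group[I], Group[J]);
  return Edges;
}

// An assignment-like implication: exactly one LHS and one RHS.
std::pair<Var, Var> assignmentImplication(const Var &LHS, const Var &RHS) {
  return {LHS, RHS};
}
```

A symmetric "set of N related variables" representation would store the N names once instead of enumerating all ordered pairs.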


t-rasmudAuthorUnsubmitted

Done

You are right, hadn't considered the case with more than two elements and will address it in a separate patch.


virtual std::optional<ImplicationsList> getStrategyImplications() const {

return std::nullopt;

}

}; };

using FixableGadgetList = std::vector<std::unique_ptr<FixableGadget>>; using FixableGadgetList = std::vector<std::unique_ptr<FixableGadget>>;

using WarningGadgetList = std::vector<std::unique_ptr<WarningGadget>>; using WarningGadgetList = std::vector<std::unique_ptr<WarningGadget>>;

/// An increment of a pointer-type value is unsafe as it may run the pointer /// An increment of a pointer-type value is unsafe as it may run the pointer

/// out of bounds. /// out of bounds.

class IncrementGadget : public WarningGadget { class IncrementGadget : public WarningGadget {

▲ Show 20 Lines • Show All 250 Lines • ▼ Show 20 Lines public:

virtual DeclUseList getClaimedVarUseSites() const override { virtual DeclUseList getClaimedVarUseSites() const override {

if (const auto *DRE = dyn_cast<DeclRefExpr>(Node)) { if (const auto *DRE = dyn_cast<DeclRefExpr>(Node)) {

return {DRE}; return {DRE};

} }

return {}; return {};

} }

}; };

/// A pointer assignment expression of the form:

/// \code

/// p = q;

/// \endcode

class PointerAssignmentGadget : public FixableGadget {

private:

static constexpr const char *const PointerAssignemntTag = "ptrAssign";

NoQUnsubmitted

Done

private:

- static constexpr const char *const PointerAssignemntTag = "ptrAssign";

+ static constexpr const char *const PointerAssignmentTag = "ptrAssign";

static constexpr const char *const PointerAssignLHSTag = "ptrLHS";


static constexpr const char *const PointerAssignLHSTag = "ptrLHS";

static constexpr const char *const PointerAssignRHSTag = "ptrRHS";

const BinaryOperator *PA; // pointer assignment expression

const DeclRefExpr * PtrLHS; // the LHS pointer expression in `PA`

const DeclRefExpr * PtrRHS; // the RHS pointer expression in `PA`

public:

PointerAssignmentGadget(const MatchFinder::MatchResult &Result)

: FixableGadget(Kind::PointerAssignment),

PA(Result.Nodes.getNodeAs<BinaryOperator>(PointerAssignemntTag)),

PtrLHS(Result.Nodes.getNodeAs<DeclRefExpr>(PointerAssignLHSTag)),

PtrRHS(Result.Nodes.getNodeAs<DeclRefExpr>(PointerAssignRHSTag)) {}

static bool classof(const Gadget *G) {

return G->getKind() == Kind::PointerAssignment;

}

static Matcher matcher() {

auto PtrAtRight = allOf(hasOperatorName("="),

hasRHS(ignoringParenImpCasts(declRefExpr(hasPointerType()).

jkorousUnsubmitted

Done

Shall we back-port how the matcher() and getFixits() methods split responsibilities, as we've started to do in more recent patches?
For other fixables we now check if the DRE refers to a var decl already in the matcher like this:
declRefExpr(to(varDecl()))


bind(PointerAssignRHSTag))));

auto PtrAtLeft = allOf(hasOperatorName("="),

ziqingluo-90Unsubmitted

Done

I was wondering if we need this sub-matcher declRefExpr(expr()).bind(PointerAssignRHSTag). It is eventually used as an inner matcher of binaryOperator. I doubt it will match anything.

Ditto for lines 582-583.


t-rasmudAuthorUnsubmitted

Done

Great catch @ziqingluo-90! I was experimenting to match assignments at declarations and never intended for those sub-matchers to be part of this patch.


NoQUnsubmitted

Done

Looks like the hasOperatorName("=") check is duplicated on both sides.


hasLHS(declRefExpr(hasPointerType()).bind(PointerAssignLHSTag)));

//FIXME: Handle declarations at assignments

return stmt(binaryOperator(allOf(PtrAtLeft, PtrAtRight)));

}

virtual std::optional<FixItList> getFixits(const Strategy &S) const override;

ziqingluo-90Unsubmitted

Done

DeclPtrAtLeft is not used.


virtual const Stmt *getBaseStmt() const override { return PA; }

virtual DeclUseList getClaimedVarUseSites() const override {

return DeclUseList{PtrLHS, PtrRHS};

}

virtual std::optional<ImplicationsList> getStrategyImplications()

const override {

return ImplicationsList{dyn_cast<VarDecl>(PtrLHS->getDecl()),

dyn_cast<VarDecl>(PtrRHS->getDecl())};

NoQUnsubmitted

Done

getStrategyImplications() const override {

- return std::make_pair(dyn_cast<VarDecl>(PtrLHS->getDecl()),

- dyn_cast<VarDecl>(PtrRHS->getDecl()));

+ return std::make_pair(cast<VarDecl>(PtrLHS->getDecl()),

+ cast<VarDecl>(PtrRHS->getDecl()));

}

};

We're confident that this cast always succeeds. And we really don't want to see null pointers inside these pairs.


}

};

class PointerDereferenceGadget : public FixableGadget { class PointerDereferenceGadget : public FixableGadget {

static constexpr const char *const BaseDeclRefExprTag = "BaseDRE"; static constexpr const char *const BaseDeclRefExprTag = "BaseDRE";

static constexpr const char *const OperatorTag = "op"; static constexpr const char *const OperatorTag = "op";

const DeclRefExpr *BaseDeclRefExpr = nullptr; const DeclRefExpr *BaseDeclRefExpr = nullptr;

const UnaryOperator *Op; const UnaryOperator *Op;

public: public:

▲ Show 20 Lines • Show All 375 Lines • ▼ Show 20 Lines for (const auto &G : CB.FixableGadgets) {

} }

return {std::move(CB.FixableGadgets), std::move(CB.WarningGadgets), return {std::move(CB.FixableGadgets), std::move(CB.WarningGadgets),

std::move(CB.Tracker)}; std::move(CB.Tracker)};

} }

struct WarningGadgetSets { struct WarningGadgetSets {

std::map<const VarDecl *, std::set<std::unique_ptr<WarningGadget>>> byVar; std::map<const VarDecl *, std::set<const WarningGadget *>> byVar;

// These Gadgets are not related to pointer variables (e. g. temporaries). // These Gadgets are not related to pointer variables (e. g. temporaries).

llvm::SmallVector<std::unique_ptr<WarningGadget>, 16> noVar; llvm::SmallVector<const WarningGadget *, 16> noVar;

}; };

static WarningGadgetSets static WarningGadgetSets

groupWarningGadgetsByVar(WarningGadgetList &&AllUnsafeOperations) { groupWarningGadgetsByVar(const WarningGadgetList &AllUnsafeOperations) {

WarningGadgetSets result; WarningGadgetSets result;

// If some gadgets cover more than one // If some gadgets cover more than one

// variable, they'll appear more than once in the map. // variable, they'll appear more than once in the map.

for (auto &G : AllUnsafeOperations) { for (auto &G : AllUnsafeOperations) {

DeclUseList ClaimedVarUseSites = G->getClaimedVarUseSites(); DeclUseList ClaimedVarUseSites = G->getClaimedVarUseSites();

bool AssociatedWithVarDecl = false; bool AssociatedWithVarDecl = false;

for (const DeclRefExpr *DRE : ClaimedVarUseSites) { for (const DeclRefExpr *DRE : ClaimedVarUseSites) {

if (const auto *VD = dyn_cast<VarDecl>(DRE->getDecl())) { if (const auto *VD = dyn_cast<VarDecl>(DRE->getDecl())) {

result.byVar[VD].emplace(std::move(G)); result.byVar[VD].insert(G.get());

AssociatedWithVarDecl = true; AssociatedWithVarDecl = true;

} }

if (!AssociatedWithVarDecl) { if (!AssociatedWithVarDecl) {

result.noVar.emplace_back(std::move(G)); result.noVar.push_back(G.get());

continue; continue;

} }

return result; return result;

} }

struct FixableGadgetSets { struct FixableGadgetSets {

std::map<const VarDecl *, std::set<std::unique_ptr<FixableGadget>>> byVar; std::map<const VarDecl *, std::set<const FixableGadget *>> byVar;

}; };

static FixableGadgetSets static FixableGadgetSets

groupFixablesByVar(FixableGadgetList &&AllFixableOperations) { groupFixablesByVar(FixableGadgetList &&AllFixableOperations) {

FixableGadgetSets FixablesForUnsafeVars; FixableGadgetSets FixablesForUnsafeVars;

for (auto &F : AllFixableOperations) { for (auto &F : AllFixableOperations) {

DeclUseList DREs = F->getClaimedVarUseSites(); DeclUseList DREs = F->getClaimedVarUseSites();

for (const DeclRefExpr *DRE : DREs) { for (const DeclRefExpr *DRE : DREs) {

if (const auto *VD = dyn_cast<VarDecl>(DRE->getDecl())) { if (const auto *VD = dyn_cast<VarDecl>(DRE->getDecl())) {

FixablesForUnsafeVars.byVar[VD].emplace(std::move(F)); FixablesForUnsafeVars.byVar[VD].insert(F.get());

} }

return FixablesForUnsafeVars; return FixablesForUnsafeVars;

} }

bool clang::internal::anyConflict(const SmallVectorImpl<FixItHint> &FixIts, bool clang::internal::anyConflict(const SmallVectorImpl<FixItHint> &FixIts,

const SourceManager &SM) { const SourceManager &SM) {

▲ Show 20 Lines • Show All 72 Lines • ▼ Show 20 Lines if (const auto *VD = dyn_cast<VarDecl>(DREs.front()->getDecl())) {

case Strategy::Kind::Array: case Strategy::Kind::Array:

case Strategy::Kind::Vector: case Strategy::Kind::Vector:

llvm_unreachable("unsupported strategies for FixableGadgets"); llvm_unreachable("unsupported strategies for FixableGadgets");

} }

return std::nullopt; // something went wrong, no fix-it return std::nullopt; // something went wrong, no fix-it

} }

std::optional<FixItList>

PointerAssignmentGadget::getFixits(const Strategy &S) const {

if (const VarDecl *LeftVD = dyn_cast<VarDecl>(PtrLHS->getDecl()))

if (const VarDecl *RightVD = dyn_cast<VarDecl>(PtrRHS->getDecl())) {

NoQUnsubmitted

Done

PointerAssignmentGadget::getFixits(const Strategy &S) const {

- if (const VarDecl *LeftVD = dyn_cast<VarDecl>(PtrLHS->getDecl()))

- if (const VarDecl *RightVD = dyn_cast<VarDecl>(PtrRHS->getDecl())) {

+ const auto *LeftVD = cast<VarDecl>(PtrLHS->getDecl());

+ const auto *RightVD = cast<VarDecl>(PtrRHS->getDecl());

switch (S.lookup(LeftVD)) {

(valid use case for auto according to https://llvm.org/docs/CodingStandards.html#use-auto-type-deduction-to-make-code-more-readable)


switch (S.lookup(LeftVD)) {

case Strategy::Kind::Span:

if (S.lookup(RightVD) == Strategy::Kind::Span)

return FixItList{};

return std::nullopt;

case Strategy::Kind::Wontfix:

return std::nullopt;

case Strategy::Kind::Iterator:

case Strategy::Kind::Array:

case Strategy::Kind::Vector:

llvm_unreachable("unsupported strategies for FixableGadgets");

}

return std::nullopt;

}

// Return the text representation of the given `APInt Val`: // Return the text representation of the given `APInt Val`:

static std::string getAPIntText(APInt Val) { static std::string getAPIntText(APInt Val) {

SmallVector<char> Txt; SmallVector<char> Txt;

Val.toString(Txt, 10, true); Val.toString(Txt, 10, true);

// APInt::toString does not add '\0' to the end of the string for us: // APInt::toString does not add '\0' to the end of the string for us:

Txt.push_back('\0'); Txt.push_back('\0');

return Txt.data(); return Txt.data();

} }

▲ Show 20 Lines • Show All 492 Lines • ▼ Show 20 Lines case Strategy::Kind::Vector:

return "vector"; return "vector";

case Strategy::Kind::Iterator: case Strategy::Kind::Iterator:

return "span"; return "span";

case Strategy::Kind::Wontfix: case Strategy::Kind::Wontfix:

assert(false && "Wonfix strategy shouldn't be used to generate Fix-Its"); assert(false && "Wonfix strategy shouldn't be used to generate Fix-Its");

}; };

} }

static bool impossibleToFixForVar(const FixableGadgetSets &FixablesForUnsafeVars,

const Strategy &S,

const VarDecl * Var) {

for (const auto &F : FixablesForUnsafeVars.byVar.find(Var)->second) {

jkorousUnsubmitted

Done

Array subscript operator adds an entry to the map if the key is not found.
https://en.cppreference.com/w/cpp/container/map/operator_at
That might or might not be ok here.
But we might also just take FixablesForUnsafeVars as a const reference and use find() instead.
https://en.cppreference.com/w/cpp/container/map/find
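The difference between the two lookups can be shown in isolation. This is a small sketch, with `GroupMap` and both helper functions being hypothetical names and strings standing in for `const VarDecl *`. It relies only on documented `std::map` behavior: `operator[]` default-constructs a missing entry, while `find()` leaves the map untouched and works on a const reference.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <string>
#include <vector>

// Hypothetical stand-in for a map keyed by `const VarDecl *`.
using GroupMap = std::map<std::string, std::vector<std::string>>;

// Lookup through operator[]: needs a non-const map and default-constructs an
// entry when the key is absent.
std::size_t groupSizeWithSubscript(GroupMap &M, const std::string &Key) {
  return M[Key].size();
}

// Lookup through find(): works on a const map and never modifies it.
std::size_t groupSizeWithFind(const GroupMap &M, const std::string &Key) {
  auto It = M.find(Key);
  return It == M.end() ? 0 : It->second.size();
}
```

Both helpers report a size of zero for a missing key, but only the subscript version changes the number of entries in the map.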


std::optional<FixItList> Fixits = F->getFixits(S);

if (!Fixits) {

return true;

}

ziqingluo-90Unsubmitted

Done

How about using FixablesForUnsafeVars.byVar[Var] instead of the outer loop?


}

return false;

}

static std::map<const VarDecl *, FixItList> static std::map<const VarDecl *, FixItList>

getFixIts(FixableGadgetSets &FixablesForUnsafeVars, const Strategy &S, getFixIts(FixableGadgetSets &FixablesForUnsafeVars, const Strategy &S,

const DeclUseTracker &Tracker, const ASTContext &Ctx, Sema &SA, const DeclUseTracker &Tracker, const ASTContext &Ctx, Sema &SA,

UnsafeBufferUsageHandler &Handler) { UnsafeBufferUsageHandler &Handler,

const std::map<const VarDecl *, std::vector<const VarDecl *>> &VarGrpMap) {

const SourceManager &SM = Ctx.getSourceManager(); const SourceManager &SM = Ctx.getSourceManager();

std::map<const VarDecl *, FixItList> FixItsForVariable; std::map<const VarDecl *, FixItList> FixItsForVariable;

for (const auto &[VD, Fixables] : FixablesForUnsafeVars.byVar) { for (const auto &[VD, Fixables] : FixablesForUnsafeVars.byVar) {

const Strategy::Kind ReplacementTypeForVD = S.lookup(VD); const Strategy::Kind ReplacementTypeForVD = S.lookup(VD);

FixItsForVariable[VD] = FixItsForVariable[VD] =

fixVariable(VD, ReplacementTypeForVD, Tracker, Ctx, SA, Handler); fixVariable(VD, ReplacementTypeForVD, Tracker, Ctx, SA, Handler);

// If we fail to produce Fix-It for the declaration we have to skip the // If we fail to produce Fix-It for the declaration we have to skip the

// variable entirely. // variable entirely.

Show All 11 Lines for (const auto &F : Fixables) {

} else { } else {

const FixItList CorrectFixes = Fixits.value(); const FixItList CorrectFixes = Fixits.value();

FixItsForVD.insert(FixItsForVD.end(), CorrectFixes.begin(), FixItsForVD.insert(FixItsForVD.end(), CorrectFixes.begin(),

CorrectFixes.end()); CorrectFixes.end());

} }

if (ImpossibleToFix) { if (ImpossibleToFix) {

FixItsForVariable.erase(VD); FixItsForVariable.erase(VD);

jkorousUnsubmitted

Done

I am trying to understand how the Fixables matched to other variables in the VarGroupForVD learn that they shouldn't emit any Fix-It.
I was naively expecting that we'd do:

for (const VarDecl * V : VarGroupForVD) {
    FixItsForVariable.erase(V);
}


continue; continue;

} }

const auto VarGroupForVD = VarGrpMap.find(VD);

jkorousUnsubmitted

Done

Note: since VarGrpMap is a std::map, looking up VD with operator[] inserts a record if VD is not among the keys yet.


if (VarGroupForVD != VarGrpMap.end()) {

for (const VarDecl * V : VarGroupForVD->second) {

if (V == VD) {

continue;

}

if (impossibleToFixForVar(FixablesForUnsafeVars, S, V)) {

ImpossibleToFix = true;

break;

}

if (ImpossibleToFix) {

FixItsForVariable.erase(VD);

ziqingluo-90Unsubmitted

Done

If VD is removed (it means giving up on fixing VD), do we also need to remove the whole group of VD?


t-rasmudAuthorUnsubmitted

Done

That is right and a good optimization to have. I'll include it in the next iteration of the patch.


continue;

}

FixItsForVariable[VD].insert(FixItsForVariable[VD].end(), FixItsForVariable[VD].insert(FixItsForVariable[VD].end(),

FixItsForVD.begin(), FixItsForVD.end()); FixItsForVD.begin(), FixItsForVD.end());

// Fix-it shall not overlap with macros or/and templates: // Fix-it shall not overlap with macros or/and templates:

if (overlapWithMacro(FixItsForVariable[VD]) || if (overlapWithMacro(FixItsForVariable[VD]) ||

clang::internal::anyConflict(FixItsForVariable[VD], clang::internal::anyConflict(FixItsForVariable[VD],

Ctx.getSourceManager())) { Ctx.getSourceManager())) {

FixItsForVariable.erase(VD); FixItsForVariable.erase(VD);

Show All 25 Lines std::optional<tooling::Replacement> Include = HeaderIncls.insert(

getHeaderFilenameForStrategyKind(ReplacementTypeForVD), true, getHeaderFilenameForStrategyKind(ReplacementTypeForVD), true,

clang::tooling::IncludeDirective::Include); clang::tooling::IncludeDirective::Include);

if (Include.has_value()) { if (Include.has_value()) {

SourceLocation HeaderInsertLoc = SourceLocation HeaderInsertLoc =

SM.getComposedLoc(FileOfVarDeclFixIt, Include->getOffset()); SM.getComposedLoc(FileOfVarDeclFixIt, Include->getOffset());

FixItsForVariable[VD].push_back(FixItHint::CreateInsertion( FixItsForVariable[VD].push_back(FixItHint::CreateInsertion(

HeaderInsertLoc, Include->getReplacementText())); HeaderInsertLoc, Include->getReplacementText()));

} }

if (VarGroupForVD != VarGrpMap.end()) {

for (const VarDecl * Var : VarGroupForVD->second) {

NoQUnsubmitted

Done

Probably better to compare pointers. There could be variables with the same name in the same function (in smaller scopes). Also IIRC getName() crashes on anonymous declarations (not sure if it can happen in this case).
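The shadowing concern can be illustrated without Clang. In this sketch `Decl` is a hypothetical, minimal stand-in for a declaration node (it is not `clang::VarDecl`), and the two comparison helpers are invented for illustration. Two distinct declarations sharing a name compare equal by name but not by address.

```cpp
#include <cassert>
#include <string>

// Hypothetical, minimal stand-in for a declaration node.
struct Decl {
  std::string Name;
};

// Name-based comparison: conflates distinct declarations that share a name,
// such as a variable and another one shadowing it in a nested scope.
bool sameByName(const Decl *A, const Decl *B) { return A->Name == B->Name; }

// Identity-based comparison: true only for the very same declaration object.
bool sameByIdentity(const Decl *A, const Decl *B) { return A == B; }
```

Identity comparison also sidesteps any question about how unnamed declarations should be handled, since no name is ever read.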


jkorousUnsubmitted

Done

+1 to comparing pointers
That's what we do above.


if (Var == VD) {

continue;

}

FixItList GroupFix = fixVariable(Var, ReplacementTypeForVD, Tracker,

ziqingluo-90Unsubmitted

Done

Could we do the grouping after all variables have had their fix-its generated, so that we can directly copy their fix-its instead of re-generating them?
This requires fix-its generated for two different variables to be independent. And, I think so far they are independent.
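One way to picture this suggestion is the sketch below. It is not the patch's implementation: `Var`, `FixIts`, and `mergeFixItsByGroup` are hypothetical names, strings stand in for `const VarDecl *` and `FixItHint`, and per-variable fix-its are assumed to be independent, as the comment itself assumes. Fix-its are generated once per variable, then concatenated per group, and a group with a member that has no fix-its is dropped as a whole.

```cpp
#include <cassert>
#include <map>
#include <string>
#include <utility>
#include <vector>

using Var = std::string;                 // hypothetical stand-in for `const VarDecl *`
using FixIts = std::vector<std::string>; // hypothetical stand-in for `FixItList`

// Combine already-generated per-variable fix-its into per-group fix-its.
// `Groups` maps a group's representative variable to all of its members.
// A group is kept only if every member has an entry in `PerVar`.
std::map<Var, FixIts>
mergeFixItsByGroup(const std::map<Var, FixIts> &PerVar,
                   const std::map<Var, std::vector<Var>> &Groups) {
  std::map<Var, FixIts> Result;
  for (const auto &[Leader, Members] : Groups) {
    FixIts Combined;
    bool Complete = true;
    for (const Var &M : Members) {
      auto It = PerVar.find(M);
      if (It == PerVar.end()) {
        Complete = false; // One unfixable member makes the group unfixable.
        break;
      }
      Combined.insert(Combined.end(), It->second.begin(), It->second.end());
    }
    if (Complete)
      Result[Leader] = std::move(Combined);
  }
  return Result;
}
```

The sketch uses structured bindings and therefore assumes C++17 or later.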


Var->getASTContext(), SA, Handler);

for (auto Fix : GroupFix) {

FixItsForVariable[VD].push_back(Fix);

}

} }

return FixItsForVariable; return FixItsForVariable;

} }

static Strategy static Strategy

getNaiveStrategy(const llvm::SmallVectorImpl<const VarDecl *> &UnsafeVars) { getNaiveStrategy(const llvm::SmallVectorImpl<const VarDecl *> &UnsafeVars) {

Strategy S; Strategy S;

for (const VarDecl *VD : UnsafeVars) { for (const VarDecl *VD : UnsafeVars) {

S.set(VD, Strategy::Kind::Span); S.set(VD, Strategy::Kind::Span);

} }

return S; return S;

} }

void clang::checkUnsafeBufferUsage(const Decl *D, void clang::checkUnsafeBufferUsage(const Decl *D,

UnsafeBufferUsageHandler &Handler, UnsafeBufferUsageHandler &Handler,

bool EmitFixits, Sema &S) { bool EmitFixits, Sema &S) {

assert(D && D->getBody()); assert(D && D->getBody());

WarningGadgetSets UnsafeOps; WarningGadgetSets UnsafeOps;

FixableGadgetSets FixablesForUnsafeVars; FixableGadgetSets FixablesForAllVars;

DeclUseTracker Tracker;

{ auto [FixableGadgets, WarningGadgets, Tracker] = findGadgets(D, Handler);

auto [FixableGadgets, WarningGadgets, TrackerRes] = findGadgets(D, Handler);

UnsafeOps = groupWarningGadgetsByVar(std::move(WarningGadgets)); UnsafeOps = groupWarningGadgetsByVar(std::move(WarningGadgets));

FixablesForUnsafeVars = groupFixablesByVar(std::move(FixableGadgets)); FixablesForAllVars = groupFixablesByVar(std::move(FixableGadgets));

Tracker = std::move(TrackerRes);

}

std::map<const VarDecl *, FixItList> FixItsForVariable; std::map<const VarDecl *, FixItList> FixItsForVariable;

std::map<const VarDecl *, FixItList> FixItsForVariableGroup;

std::map<const VarDecl *, std::vector<const VarDecl *>> VariableGroupsMap{};

if (EmitFixits) { if (EmitFixits) {

// Filter out non-local vars and vars with unclaimed DeclRefExpr-s. // Filter out non-local vars and vars with unclaimed DeclRefExpr-s.

for (auto it = FixablesForUnsafeVars.byVar.cbegin(); for (auto it = FixablesForAllVars.byVar.cbegin();

it != FixablesForUnsafeVars.byVar.cend();) { it != FixablesForAllVars.byVar.cend();) {

if (Tracker.hasUnclaimedUses(it->first)) { if (Tracker.hasUnclaimedUses(it->first)) {

it = FixablesForUnsafeVars.byVar.erase(it); it = FixablesForAllVars.byVar.erase(it);

} else { } else {

++it; ++it;

} }

llvm::SmallVector<const VarDecl *, 16> UnsafeVars; llvm::SmallVector<const VarDecl *, 16> UnsafeVars;

for (const auto &[VD, ignore] : FixablesForUnsafeVars.byVar) for (const auto &[VD, ignore] : FixablesForAllVars.byVar)

UnsafeVars.push_back(VD); UnsafeVars.push_back(VD);

// Fixpoint iteration for pointer assignments

using DepMapTy = DenseMap<const VarDecl *, std::set<const VarDecl *>>;

DepMapTy DependenciesMap{};

DepMapTy PtrAssignmentGraph{};

for (auto it : FixablesForAllVars.byVar) {

NoQUnsubmitted

Done

I was about to say that there could be fixables that don't connect any unsafe variables but connect two implicated variables together, so you need to iterate over *all* fixables instead.

But then I noticed that FixablesForUnsafeVars is actually mis-named 🙁 According to groupFixablesByVar(), it contains fixables for *all* variables, sorted by variable, and nobody ever checks whether a variable is unsafe. It'd be really valuable if we could rename it before or together with this patch, as the current name makes the code much harder to read and reason about; there could even be existing bugs based on this misunderstanding.

Also, iterating over FixablesForUnsafeVars.byVar here would cause you to visit multivariable fixables twice (once for each variable), so there could be a bit of duplicated work here, maybe it still makes sense to iterate over the whole list.


for (const FixableGadget *fixable : it.second) {

std::optional<ImplicationsList> ImplList =

fixable->getStrategyImplications();

if (ImplList) {

ImplicationsList Deps = ImplList.value();

assert(Deps.size() > 1);

PtrAssignmentGraph[Deps[0]].insert(Deps.begin() + 1, Deps.end());

jkorousUnsubmitted

Done

IIUC we're copying elements from an std::vector to an std::set.
Nit: Maybe we could avoid the explicit iteration?

assert(Deps.size() > 1);
PtrAssignmentGraph[Deps[0]].insert(Deps.begin() + 1, Deps.end());

But I feel we might want to handle the case where Deps has exactly one element regardless.


}

ziqingluo-90Unsubmitted

Done

It seems like we do not care about the direction of edges in the graph, maybe we should call it UndirectedGraph?


t-rasmudAuthorUnsubmitted

Done

I changed it to PtrAssignmentGraph, which seems (at least to me) to give more context. It is still a directed graph because Deps[0] represents the LHS pointer and Deps[1] the RHS pointer. Hopefully the new documentation on getStrategyImplications makes this clear!


}

std::set<const VarDecl *> VisitedVarsDirected{};

for (const auto &[Var, ignore] : UnsafeOps.byVar) {

if (VisitedVarsDirected.find(Var) == VisitedVarsDirected.end()) {

std::queue<const VarDecl*> QueueDirected{};

QueueDirected.push(Var);

while(!QueueDirected.empty()) {

const VarDecl* CurrentVar = QueueDirected.front();

QueueDirected.pop();

VisitedVarsDirected.insert(CurrentVar);

auto AdjacentNodes = PtrAssignmentGraph[CurrentVar];

for (const VarDecl *Adj : AdjacentNodes) {

if (VisitedVarsDirected.find(Adj) == VisitedVarsDirected.end()) {

QueueDirected.push(Adj);

}

DependenciesMap[Var].insert(Adj);

DependenciesMap[Adj].insert(Var);

}

NoQUnsubmitted

Done

Using Var here instead of CurrentVar is really clever. I think there needs to be a comment bragging about this decision! I was quite confused initially when I was thinking of it as essential to the algorithm, whereas in reality it looks like it's just an optimization.

If CurrentVar was used, this would have straightforwardly meant "Just keep the edges in the original graph that are reachable from unsafe variables, and add reverse edges; and also no need to explore from the same var more than once". It's somewhat obvious that it would correctly define the graph for the second part of the algorithm.

By using Var you're most likely achieving the same result, but dramatically cutting the number of hoops the second part of the algorithm needs to jump through, given that a lot of these connections become "direct". But this results in a dramatically different graph, and the shape of that graph is non-deterministic (depends on iteration order in this loop).

So I guess my point is, it's valuable to tell the reader that they shouldn't try to imagine how the graph is transformed because of using Var instead of CurrentVar. It's still essentially the same graph, just a bit shallower. The reader can continue reading as if CurrentVar was used, and think about why it's equivalent later.
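The two phases discussed above can be sketched in isolation. This is a simplified illustration under stated assumptions, not the code from the patch: strings stand in for `const VarDecl *`, and `Graph`, `buildDependencies`, and `componentOf` are hypothetical names. The first function links every variable reachable from an unsafe root directly to that root (the `Var` rather than `CurrentVar` choice); the second collects a connected component of the resulting undirected graph.

```cpp
#include <cassert>
#include <map>
#include <queue>
#include <set>
#include <string>

using Var = std::string; // hypothetical stand-in for `const VarDecl *`
using Graph = std::map<Var, std::set<Var>>;

// Phase 1: from each unsafe root, walk the directed assignment graph and link
// every reached variable directly to the root, in both directions.
Graph buildDependencies(const Graph &Assignments, const std::set<Var> &Unsafe) {
  Graph Deps;
  std::set<Var> Visited;
  for (const Var &Root : Unsafe) {
    if (Visited.count(Root))
      continue;
    std::queue<Var> Queue;
    Queue.push(Root);
    while (!Queue.empty()) {
      Var Current = Queue.front();
      Queue.pop();
      Visited.insert(Current);
      auto It = Assignments.find(Current);
      if (It == Assignments.end())
        continue;
      for (const Var &Adj : It->second) {
        if (!Visited.count(Adj))
          Queue.push(Adj);
        Deps[Root].insert(Adj); // Root, not Current: a "shortcut" edge.
        Deps[Adj].insert(Root);
      }
    }
  }
  return Deps;
}

// Phase 2: the connected component of Start in the undirected graph Deps.
std::set<Var> componentOf(const Graph &Deps, const Var &Start) {
  std::set<Var> Seen{Start};
  std::queue<Var> Queue;
  Queue.push(Start);
  while (!Queue.empty()) {
    Var Current = Queue.front();
    Queue.pop();
    auto It = Deps.find(Current);
    if (It == Deps.end())
      continue;
    for (const Var &Adj : It->second)
      if (Seen.insert(Adj).second)
        Queue.push(Adj);
  }
  return Seen;
}
```

For a chain of assignments `p = q; q = r;` with only `p` unsafe, the sketch links `q` and `r` straight to `p` and creates no direct `q`/`r` edge, yet the component of `p` still contains all three variables.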


jkorousUnsubmitted

Done

+1 to adding comments
Our future selves might be pretty grateful if they have to debug this code :)


}

// Group Connected Components for Unsafe Vars

// (Dependencies based on pointer assignments)

std::set<const VarDecl *> VisitedVars{};

for (const auto &[Var, ignore] : UnsafeOps.byVar) {

+ if (VisitedVars.find(Var) == VisitedVars.end()) {
+   std::vector<const VarDecl *> VarGroup{};
+
+   std::queue<const VarDecl*> Queue{};
+   Queue.push(Var);
+   while(!Queue.empty()) {
+     const VarDecl* CurrentVar = Queue.front();
+     Queue.pop();
+     VisitedVars.insert(CurrentVar);
+     VarGroup.push_back(CurrentVar);
+     auto AdjacentNodes = DependenciesMap[CurrentVar];
+     for (const VarDecl *Adj : AdjacentNodes) {
+       if (VisitedVars.find(Adj) == VisitedVars.end()) {
+         Queue.push(Adj);
+       }
+     }
+   }
+   for (const VarDecl * V : VarGroup) {
+     if (UnsafeOps.byVar.find(V) != UnsafeOps.byVar.end()) {
+       VariableGroupsMap[V] = VarGroup;
+     }
+   }
+ }

    Strategy NaiveStrategy = getNaiveStrategy(UnsafeVars);
-   FixItsForVariable = getFixIts(FixablesForUnsafeVars, NaiveStrategy, Tracker,
-                                 D->getASTContext(), S, Handler);
+   FixItsForVariableGroup =
+       getFixIts(FixablesForAllVars, NaiveStrategy, Tracker,
+                 D->getASTContext(), S, Handler, VariableGroupsMap);

    // FIXME Detect overlapping FixIts.
  }

  for (const auto &G : UnsafeOps.noVar) {
    Handler.handleUnsafeOperation(G->getBaseStmt(), /*IsRelatedToDecl=*/false);
  }

  for (const auto &[VD, WarningGadgets] : UnsafeOps.byVar) {
-   auto FixItsIt =
-       EmitFixits ? FixItsForVariable.find(VD) : FixItsForVariable.end();
-   Handler.handleFixableVariable(VD, FixItsIt != FixItsForVariable.end()
-                                         ? std::move(FixItsIt->second)
-                                         : FixItList{});
+   auto FixItsIt = FixItsForVariableGroup.find(VD);
+   Handler.handleUnsafeVariableGroup(VD, VariableGroupsMap, FixItsIt !=
+                                         FixItsForVariableGroup.end()
+                                         ? std::move(FixItsIt->second)
+                                         : FixItList{});
    for (const auto &G : WarningGadgets) {
      Handler.handleUnsafeOperation(G->getBaseStmt(), /*IsRelatedToDecl=*/true);
    }
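
The grouping step in UnsafeBufferUsage.cpp is a breadth-first search over `DependenciesMap`, started from each unsafe variable that has not been visited yet. A minimal standalone sketch of that idea follows, assuming plain strings in place of `const VarDecl *` and an edge from each assignment's LHS to its RHS (the edge direction is an assumption here, and `groupVariables` is a hypothetical name, not part of the patch). Unlike the patch, the sketch marks a variable as visited when it is enqueued, so no variable can enter the queue twice.

```cpp
#include <map>
#include <queue>
#include <set>
#include <string>
#include <vector>

// Hypothetical stand-in for `const VarDecl *`; names keep the sketch standalone.
using Var = std::string;

// For every unsafe variable, collect the variables reachable from it through
// `Deps` with one breadth-first search per unvisited start variable, and
// record the resulting group under each unsafe member of that group.
std::map<Var, std::vector<Var>>
groupVariables(const std::set<Var> &Unsafe,
               const std::map<Var, std::set<Var>> &Deps) {
  std::map<Var, std::vector<Var>> Groups;
  std::set<Var> Visited;
  for (const Var &Start : Unsafe) {
    if (Visited.count(Start))
      continue;
    std::vector<Var> Group;
    std::queue<Var> Queue;
    Queue.push(Start);
    Visited.insert(Start);
    while (!Queue.empty()) {
      Var Current = Queue.front();
      Queue.pop();
      Group.push_back(Current);
      auto It = Deps.find(Current);
      if (It == Deps.end())
        continue; // no outgoing edges for this variable
      for (const Var &Adj : It->second)
        if (Visited.insert(Adj).second) // true only on first visit
          Queue.push(Adj);
    }
    // Only unsafe variables get an entry, mirroring the byVar lookup above.
    for (const Var &V : Group)
      if (Unsafe.count(V))
        Groups[V] = Group;
  }
  return Groups;
}
```

With `p` as the only unsafe variable and the assignments `p = r; q = r;`, the sketch groups `p` with `r` and leaves `q` out, because `q` is not reachable from `p` along LHS-to-RHS edges.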

clang/lib/Sema/AnalysisBasedWarnings.cpp

      if (const auto *ASE = dyn_cast<ArraySubscriptExpr>(Operation)) {
        Range = Operation->getSourceRange();
      }
      if (IsRelatedToDecl)
        S.Diag(Loc, diag::note_unsafe_buffer_operation) << MsgParam << Range;
      else
        S.Diag(Loc, diag::warn_unsafe_buffer_operation) << MsgParam << Range;
    }

-   // FIXME: rename to handleUnsafeVariable
-   void handleFixableVariable(const VarDecl *Variable,
+   void handleUnsafeVariableGroup(const VarDecl *Variable,
+       std::map<const VarDecl *, std::vector<const VarDecl *>> VarGrpMap,
        FixItList &&Fixes) override {
      S.Diag(Variable->getLocation(), diag::warn_unsafe_buffer_variable)
          << Variable << (Variable->getType()->isPointerType() ? 0 : 1)
          << Variable->getSourceRange();
+     std::vector<const VarDecl *> VarGroupForVD = VarGrpMap[Variable];
      if (!Fixes.empty()) {
-       unsigned FixItStrategy = 0; // For now we only has 'std::span' strategy
-       const auto &FD = S.Diag(Variable->getLocation(),
-                               diag::note_unsafe_buffer_variable_fixit);
+       unsigned FixItStrategy = 0; // For now we only have 'std::span' strategy
+       Sema::SemaDiagnosticBuilder const &FD = S.Diag(Variable->getLocation(),
+                               diag::note_unsafe_buffer_variable_fixit_group);

        FD << Variable->getName() << FixItStrategy;
+       std::string AllVars = "";
+       if (VarGroupForVD.size() > 1) {
+         AllVars.append(", and change ");

NoQ: We should probably find a way to put all these plain text pieces into `DiagnosticSemaKinds.td` (except probably the `and` part). I wonder if a ... %select{|, and change %2 to %select{std::span|std::array|std::span::iterator}1 to propagate bounds information between them'.}` would work (like, nest more format specifiers into a `%select`). Or we could keep using the old diagnostic id when we don't need extra stuff at the end. (It might be a good idea to make a new format specifier for this purpose, like `plural` but with lists. But this is probably overkill if there's just one warning of this kind.)

+         if (VarGroupForVD.size() == 2) {
+           if (VarGroupForVD[0] == Variable) {
+             AllVars.append("'" + VarGroupForVD[1]->getName().str() + "'");
+           } else {
+             AllVars.append("'" + VarGroupForVD[0]->getName().str() + "'");
+           }
+         } else {
+           bool first = false;
+           if (VarGroupForVD.size() == 3) {
+             for (const VarDecl * V : VarGroupForVD) {
+               if (V == Variable) {
+                 continue;
+               }
+               if (!first) {
+                 first = true;
+                 AllVars.append("'" + V->getName().str() + "'" + " and ");
+               } else {
+                 AllVars.append("'" + V->getName().str() + "'");
+               }
+             }
+           } else {
+             for (const VarDecl * V : VarGroupForVD) {
+               if (V == Variable) {
+                 continue;
+               }
+               if (VarGroupForVD.back() != V) {
+                 AllVars.append("'" + V->getName().str() + "'" + ", ");
+               } else {
+                 AllVars.append("and '" + V->getName().str() + "'");
+               }
+             }
+           }
+         }
+         // FIXME: change std::span to the correct type (array/iterator)
+         AllVars.append(" to 'std::span' to propagate bounds information between them");

ziqingluo-90: These note messages are nice. Maybe we can have some tests for them. It will also help in telling whether variables are grouped correctly.

+       }
+       FD << AllVars;

        for (const auto &F : Fixes)
          FD << F;
      }
    }

    bool isSafeBufferOptOut(const SourceLocation &Loc) const override {
      return S.PP.isSafeBufferOptOut(S.getSourceManager(), Loc);
    }
  };
  } // namespace

  //===----------------------------------------------------------------------===//
  // AnalysisBasedWarnings - Worker object used by Sema to execute analysis-based
  // warnings on a function, method, or block.

ziqingluo-90: Would it be better to make `VariableGroups` a map from `VarDecl`s to groups?

  //===----------------------------------------------------------------------===//

  sema::AnalysisBasedWarnings::Policy::Policy() {
    enableCheckFallThrough = 1;
    enableCheckUnreachable = 0;
    enableThreadSafetyAnalysis = 0;
    enableConsumedAnalysis = 0;
  }
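
The note text built in `handleUnsafeVariableGroup` lists the other members of the group as an English list ("'q'", "'r' and 'q'", and so on). A minimal standalone sketch of that joining step is shown here, assuming plain strings instead of `VarDecl` names; `listOtherVars` is a hypothetical helper, not part of the patch. It filters out the variable itself before joining, so the result does not depend on where that variable sits in the group, whereas the `VarGroupForVD.back() != V` check above appears to leave a trailing comma when the variable itself is the last element of a group of four or more.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical helper: lists every name in `Group` except `Self`, quoted and
// joined as "'a'", "'a' and 'b'", or "'a', 'b', and 'c'".
std::string listOtherVars(const std::string &Self,
                          const std::vector<std::string> &Group) {
  // Drop the variable the note is attached to, quoting the rest.
  std::vector<std::string> Others;
  for (const std::string &Name : Group)
    if (Name != Self)
      Others.push_back("'" + Name + "'");

  std::string Result;
  for (std::size_t I = 0; I < Others.size(); ++I) {
    if (I > 0) {
      if (Others.size() == 2)
        Result += " and ";   // exactly two names: no comma
      else if (I + 1 == Others.size())
        Result += ", and ";  // last of three or more
      else
        Result += ", ";
    }
    Result += Others[I];
  }
  return Result;
}
```

For the group `{p, r, q}` and the variable `p`, the helper yields `'r' and 'q'`, which matches the wording the tests in this revision expect for `foo1b`.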

clang/test/SemaCXX/warn-unsafe-buffer-usage-multi-decl-fixits-test.cpp

This file was added.

				// RUN: %clang_cc1 -std=c++20 -Wunsafe-buffer-usage -fdiagnostics-parseable-fixits %s 2>&1 | FileCheck %s

				void foo1a() {
				int *r = new int[7];
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:3-[[@LINE-1]]:11}:"std::span<int> r"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:12-[[@LINE-2]]:12}:"{"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:22-[[@LINE-3]]:22}:", 7}"
				int *p = new int[4];
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:3-[[@LINE-1]]:11}:"std::span<int> p"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:12-[[@LINE-2]]:12}:"{"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:22-[[@LINE-3]]:22}:", 4}"
				p = r;
				int tmp = p[9];
				int *q;
				// CHECK-NOT: fix-it:"{{.*}}":{[[@LINE-1]]:3-[[@LINE-1]]:11}:"std::span<int> q"
				q = r;
				}

				void foo1b() {
				int *r = new int[7];
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:3-[[@LINE-1]]:11}:"std::span<int> r"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:12-[[@LINE-2]]:12}:"{"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:22-[[@LINE-3]]:22}:", 7}"
				int *p = new int[4];
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:3-[[@LINE-1]]:11}:"std::span<int> p"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:12-[[@LINE-2]]:12}:"{"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:22-[[@LINE-3]]:22}:", 4}"
				p = r;
				int tmp = p[9];
				int *q = new int[4];
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:3-[[@LINE-1]]:11}:"std::span<int> q"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:12-[[@LINE-2]]:12}:"{"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:22-[[@LINE-3]]:22}:", 4}"
				q = r;
				tmp = q[9];
				}

				void foo1c() {
				int *r = new int[7];
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:3-[[@LINE-1]]:11}:"std::span<int> r"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:12-[[@LINE-2]]:12}:"{"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:22-[[@LINE-3]]:22}:", 7}"
				int *p = new int[4];
				// CHECK-NOT: fix-it:"{{.*}}":{[[@LINE-1]]:3-[[@LINE-1]]:11}:"std::span<int> p"
				p = r;
				int tmp = r[9];
				int *q;
				// CHECK-NOT: fix-it:"{{.*}}":{[[@LINE-1]]:3-[[@LINE-1]]:11}:"std::span<int> q"
				q = r;
				tmp = q[9];
				}

				void foo2a() {
				int *r = new int[7];
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:3-[[@LINE-1]]:11}:"std::span<int> r"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:12-[[@LINE-2]]:12}:"{"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:22-[[@LINE-3]]:22}:", 7}"
				int *p = new int[5];
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:3-[[@LINE-1]]:11}:"std::span<int> p"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:12-[[@LINE-2]]:12}:"{"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:22-[[@LINE-3]]:22}:", 5}"
				int *q = new int[4];
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:3-[[@LINE-1]]:11}:"std::span<int> q"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:12-[[@LINE-2]]:12}:"{"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:22-[[@LINE-3]]:22}:", 4}"
				p = q;
				int tmp = p[8];
				q = r;
				}

				void foo2b() {
				int *r = new int[7];
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:3-[[@LINE-1]]:11}:"std::span<int> r"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:12-[[@LINE-2]]:12}:"{"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:22-[[@LINE-3]]:22}:", 7}"
				int *p = new int[5];
				// CHECK-NOT: fix-it:"{{.*}}":{[[@LINE-1]]:3-[[@LINE-1]]:11}:"std::span<int> p"
				// CHECK-NOT: fix-it:"{{.*}}":{[[@LINE-2]]:12-[[@LINE-2]]:12}:"{"
				// CHECK-NOT: fix-it:"{{.*}}":{[[@LINE-3]]:22-[[@LINE-3]]:22}:", 5}"
				int *q = new int[4];
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:3-[[@LINE-1]]:11}:"std::span<int> q"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:12-[[@LINE-2]]:12}:"{"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:22-[[@LINE-3]]:22}:", 4}"
				p = q;
				int tmp = q[8];
				q = r;
				}

				void foo2c() {
				int *r = new int[7];
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:3-[[@LINE-1]]:11}:"std::span<int> r"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:12-[[@LINE-2]]:12}:"{"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:22-[[@LINE-3]]:22}:", 7}"
				int *p = new int[5];
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:3-[[@LINE-1]]:11}:"std::span<int> p"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:12-[[@LINE-2]]:12}:"{"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:22-[[@LINE-3]]:22}:", 5}"
				int *q = new int[4];
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:3-[[@LINE-1]]:11}:"std::span<int> q"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:12-[[@LINE-2]]:12}:"{"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:22-[[@LINE-3]]:22}:", 4}"
				p = q;
				int tmp = p[8];
				q = r;
				tmp = q[8];
				}

				void foo3a() {
				int *r = new int[7];
				// CHECK-NOT: fix-it:"{{.*}}":{[[@LINE-1]]:3-[[@LINE-1]]:11}:"std::span<int> r"
				int *p = new int[5];
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:3-[[@LINE-1]]:11}:"std::span<int> p"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:12-[[@LINE-2]]:12}:"{"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:22-[[@LINE-3]]:22}:", 5}"
				int *q = new int[4];
				// CHECK-NOT: fix-it:"{{.*}}":{[[@LINE-1]]:3-[[@LINE-1]]:11}:"std::span<int> q"
				q = p;
				int tmp = p[8];
				q = r;
				}

				void foo3b() {
				int *r = new int[10];
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:3-[[@LINE-1]]:11}:"std::span<int> r"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:12-[[@LINE-2]]:12}:"{"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:23-[[@LINE-3]]:23}:", 10}"
				int *p = new int[10];
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:3-[[@LINE-1]]:11}:"std::span<int> p"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:12-[[@LINE-2]]:12}:"{"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:23-[[@LINE-3]]:23}:", 10}"
				int *q = new int[10];
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-1]]:3-[[@LINE-1]]:11}:"std::span<int> q"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-2]]:12-[[@LINE-2]]:12}:"{"
				// CHECK-DAG: fix-it:"{{.*}}":{[[@LINE-3]]:23-[[@LINE-3]]:23}:", 10}"
				q = p;
				int tmp = q[8];
				q = r;
				}
ziqingluo-90: Could we have an example where there is a cyclic dependency? Such as `p = q; q = r; r = p;`.

clang/test/SemaCXX/warn-unsafe-buffer-usage-multi-decl-warnings.cpp

This file was added.

				// RUN: %clang_cc1 -std=c++20 -Wunsafe-buffer-usage -verify %s

jkorous: I am looking for a test that checks that if one of the variables in the group is referred to by at least one DRE for which we don't have a Fix-It (e.g. negative index for `span` strategy) that the whole group will remain Fix-It-less (and note-less). I can't find any and maybe that's just because I didn't look hard enough but it makes me wonder - should we add a short explanation to such test (either an existing one or the one we should add)?
				namespace std {
				class type_info { };
				}

				void local_assign_both_span() {
				int tmp;
				int* p = new int[10]; // expected-warning{{'p' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'p' to 'std::span' to preserve bounds information, and change 'q' to 'std::span' to propagate bounds information between them$}}}}
				tmp = p[4]; // expected-note{{used in buffer access here}}

				int* q = new int[10]; // expected-warning{{'q' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'q' to 'std::span' to preserve bounds information, and change 'p' to 'std::span' to propagate bounds information between them$}}}}
				tmp = q[4]; // expected-note{{used in buffer access here}}

NoQ: We probably want to build some finer narrative here 🤔 In this case `p` and `q` are both independently unsafe and there's a one-way connection between them. Would it make sense to emit just one warning for both of them? I suspect it also makes sense to leave a note at `q = p` with a text like "note: bounds information needs to propagate from `p` to `q` here" (I've no idea if it'll scale to more than 2 variables, or how much more information we'll need to gather).

t-rasmud (author): The new note is more descriptive and summarizes the reason for grouping variables together. We still don't inform the user of the direction of bounds propagation or the fact that buffers are independently unsafe. Like we discussed offline, we could consider these refinements in future versions of the tool.
				q = p;
				}

				void local_assign_rhs_span() {
				int tmp;
				int* p = new int[10];
				int* q = new int[10]; // expected-warning{{'q' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'q' to 'std::span' to preserve bounds information$}}}}
				tmp = q[4]; // expected-note{{used in buffer access here}}
				p = q;
				}

				void local_assign_no_span() {
				int tmp;
				int* p = new int[10];
				int* q = new int[10];
				p = q;
				}

				void local_assign_lhs_span() {
				int tmp;
				int* p = new int[10]; // expected-warning{{'p' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'p' to 'std::span' to preserve bounds information, and change 'q' to 'std::span' to propagate bounds information between them$}}}}
				tmp = p[4]; // expected-note{{used in buffer access here}}
				int* q = new int[10];

				p = q;
				}


				void lhs_span_multi_assign() {
				int *a = new int[2];
NoQ: This is a FIXME test, right? In theory we want to propagate all the way up to `a`, but we only support assignments so far, not initializations.
				int *b = a;
				int *c = b;
				int *d = c; // expected-warning{{'d' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'd' to 'std::span' to preserve bounds information$}}}}
				int tmp = d[2]; // expected-note{{used in buffer access here}}
				}

				void rhs_span() {
				int *x = new int[3];
				int *y; // expected-warning{{'y' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'y' to 'std::span' to preserve bounds information$}}}}
				y[5] = 10; // expected-note{{used in buffer access here}}

				x = y;
				}

				void rhs_span1() {
				int *q = new int[12];
				int *p = q; // expected-warning{{'p' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'p' to 'std::span' to preserve bounds information$}}}}
				p[5] = 10; // expected-note{{used in buffer access here}}
				int *r = q; // expected-warning{{'r' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'r' to 'std::span' to preserve bounds information$}}}}
NoQ: In this case we definitely want to spanify `q`, for two independent reasons. We need to figure out how to tell that to the user though 🤔
				r[10] = 5; // expected-note{{used in buffer access here}}
				}

				void rhs_span2() {
				int *q = new int[6];
				int *p = q; // expected-warning{{'p' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'p' to 'std::span' to preserve bounds information$}}}}
				p[5] = 10; // expected-note{{used in buffer access here}}
				int *r = q;
				}

				void test_grouping() {
				int *z = new int[8];
				int tmp;
				int *y = new int[10]; // expected-warning{{'y' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'y' to 'std::span' to preserve bounds information$}}}}
				tmp = y[5]; // expected-note{{used in buffer access here}}

				int *x = new int[10];
				x = y;

				int *w = z;
				}

				void test_grouping1() {
				int tmp;
				int *y = new int[10]; // expected-warning{{'y' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'y' to 'std::span' to preserve bounds information$}}}}
				tmp = y[5]; // expected-note{{used in buffer access here}}
				int *x = new int[10];
				x = y;

				int *w = new int[10]; // expected-warning{{'w' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'w' to 'std::span' to preserve bounds information$}}}}
				tmp = w[5]; // expected-note{{used in buffer access here}}
				int *z = new int[10];
				z = w;
				}

				void foo1a() {
				int *r = new int[7];
				int *p = new int[4]; // expected-warning{{'p' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'p' to 'std::span' to preserve bounds information, and change 'r' to 'std::span' to propagate bounds information between them$}}}}
				p = r;
				int tmp = p[9]; // expected-note{{used in buffer access here}}
				int *q;
				q = r;
				}

				void foo1b() {
				int *r = new int[7];
				int *p = new int[4]; // expected-warning{{'p' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'p' to 'std::span' to preserve bounds information, and change 'r' and 'q' to 'std::span' to propagate bounds information between them$}}}}
				p = r;
				int tmp = p[9]; // expected-note{{used in buffer access here}}
				int *q; // expected-warning{{'q' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'q' to 'std::span' to preserve bounds information, and change 'p' and 'r' to 'std::span' to propagate bounds information between them$}}}}
				q = r;
				tmp = q[9]; // expected-note{{used in buffer access here}}
				}

				void foo1c() {
				int *r = new int[7]; // expected-warning{{'r' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'r' to 'std::span' to preserve bounds information, and change 'q' to 'std::span' to propagate bounds information between them$}}}}
				int *p = new int[4];
				p = r;
				int tmp = r[9]; // expected-note{{used in buffer access here}}
				int *q; // expected-warning{{'q' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'q' to 'std::span' to preserve bounds information, and change 'r' to 'std::span' to propagate bounds information between them$}}}}
				q = r;
				tmp = q[9]; // expected-note{{used in buffer access here}}
				}

				void foo2a() {
				int *r = new int[7];
				int *p = new int[5]; // expected-warning{{'p' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'p' to 'std::span' to preserve bounds information, and change 'r' and 'q' to 'std::span' to propagate bounds information between them$}}}}
				int *q = new int[4];
				p = q;
				int tmp = p[8]; // expected-note{{used in buffer access here}}
				q = r;
				}

				void foo2b() {
				int *r = new int[7];
				int *p = new int[5];
				int *q = new int[4]; // expected-warning{{'q' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'q' to 'std::span' to preserve bounds information, and change 'r' to 'std::span' to propagate bounds information between them$}}}}
				p = q;
				int tmp = q[8]; // expected-note{{used in buffer access here}}
				q = r;
				}

				void foo2c() {
				int *r = new int[7];
				int *p = new int[5]; // expected-warning{{'p' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'p' to 'std::span' to preserve bounds information, and change 'r' and 'q' to 'std::span' to propagate bounds information between them$}}}}
				int *q = new int[4]; // expected-warning{{'q' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'q' to 'std::span' to preserve bounds information, and change 'p' and 'r' to 'std::span' to propagate bounds information between them$}}}}
				p = q;
				int tmp = p[8]; // expected-note{{used in buffer access here}}
				q = r;
				tmp = q[8]; // expected-note{{used in buffer access here}}
				}

				void foo3a() {
				int *r = new int[7];
				int *p = new int[5]; // expected-warning{{'p' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'p' to 'std::span' to preserve bounds information$}}}}
				int *q = new int[4];
				q = p;
				int tmp = p[8]; // expected-note{{used in buffer access here}}
				q = r;
				}

				void foo3b() {
				int *r = new int[7];
				int *p = new int[5];
				int *q = new int[4]; // expected-warning{{'q' is an unsafe pointer used for buffer access}} //expected-note-re{{{{^change type of 'q' to 'std::span' to preserve bounds information, and change 'r' and 'p' to 'std::span' to propagate bounds information between them$}}}}
				q = p;
				int tmp = q[8]; // expected-note{{used in buffer access here}}
				q = r;
				}

				void test_crash() {
				int *r = new int[8];
				int *q = r;
				int *p; // expected-warning{{'p' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'p' to 'std::span' to preserve bounds information, and change 'q' to 'std::span' to propagate bounds information between them$}}}}
				p = q;
				int tmp = p[9]; // expected-note{{used in buffer access here}}
				}

				void foo_uuc() {
				int *ptr;
				int *local; // expected-warning{{'local' is an unsafe pointer used for buffer access}}
				local = ptr;
				local++; // expected-note{{used in pointer arithmetic here}}

				(local = ptr) += 5; // expected-warning{{unsafe pointer arithmetic}}
				}

				void check_rhs_fix() {
				int *r = new int[8]; // expected-warning{{'r' is an unsafe pointer used for buffer access}} // expected-note-re{{{{^change type of 'r' to 'std::span' to preserve bounds information, and change 'x' to 'std::span' to propagate bounds information between them$}}}}
				int *x;
				r[7] = 9; // expected-note{{used in buffer access here}}
				r = x;
				}

				void check_rhs_nofix() {
				int *r = new int[8]; // expected-warning{{'r' is an unsafe pointer used for buffer access}}
				int *x; // expected-warning{{'x' is an unsafe pointer used for buffer access}}
				r[7] = 9; // expected-note{{used in buffer access here}}
				r = x;
				x++; // expected-note{{used in pointer arithmetic here}}
				}

				void check_rhs_nofix_order() {
				int *r = new int[8]; // expected-warning{{'r' is an unsafe pointer used for buffer access}}
				int *x; // expected-warning{{'x' is an unsafe pointer used for buffer access}}
				x++; // expected-note{{used in pointer arithmetic here}}
				r[7] = 9; // expected-note{{used in buffer access here}}
				r = x;
				}

				void check_rhs_nofix_order1() {
				int *r = new int[8]; // expected-warning{{'r' is an unsafe pointer used for buffer access}}
				r[7] = 9; // expected-note{{used in buffer access here}}
				int *x; // expected-warning{{'x' is an unsafe pointer used for buffer access}}
				x++; // expected-note{{used in pointer arithmetic here}}
				r = x;
				}

				void check_rhs_nofix_order2() {
				int *x; // expected-warning{{'x' is an unsafe pointer used for buffer access}}
				int *r = new int[8]; // expected-warning{{'r' is an unsafe pointer used for buffer access}}
				r[7] = 9; // expected-note{{used in buffer access here}}
				x++; // expected-note{{used in pointer arithmetic here}}
				r = x;
				}

				void check_rhs_nofix_order3() {
				int *x; // expected-warning{{'x' is an unsafe pointer used for buffer access}}
				int *r = new int[8]; // expected-warning{{'r' is an unsafe pointer used for buffer access}}
				r = x;
				r[7] = 9; // expected-note{{used in buffer access here}}
				x++; // expected-note{{used in pointer arithmetic here}}
				}

				void check_rhs_nofix_order4() {
				int *x; // expected-warning{{'x' is an unsafe pointer used for buffer access}}
				int *r = new int[8]; // expected-warning{{'r' is an unsafe pointer used for buffer access}}
				r[7] = 9; // expected-note{{used in buffer access here}}
				r = x;
				x++; // expected-note{{used in pointer arithmetic here}}
				}

				void no_unhandled_lhs() {
				int *r = new int[8]; // expected-warning{{'r' is an unsafe pointer used for buffer access}} // expected-note-re{{{{^change type of 'r' to 'std::span' to preserve bounds information, and change 'x' to 'std::span' to propagate bounds information between them$}}}}
				r[7] = 9; // expected-note{{used in buffer access here}}
				int *x;
				r = x;
				}

				const std::type_info unhandled_lhs() {
				int *r = new int[8]; // expected-warning{{'r' is an unsafe pointer used for buffer access}}
				r[7] = 9; // expected-note{{used in buffer access here}}
				int *x;
				r = x;
				return typeid(*r);
				}

				const std::type_info unhandled_rhs() {
				int *r = new int[8]; // expected-warning{{'r' is an unsafe pointer used for buffer access}}
				r[7] = 9; // expected-note{{used in buffer access here}}
				int *x;
				r = x;
				return typeid(*x);
				}

				void test_negative_index() {
				int *x = new int[4]; // expected-warning{{'x' is an unsafe pointer used for buffer access}}
				int *p; // expected-warning{{'p' is an unsafe pointer used for buffer access}}
				p = &x[1]; // expected-note{{used in buffer access here}}
				p[-1] = 9; // expected-note{{used in buffer access here}}
				}

				void test_unfixable() {
				int *r = new int[8]; // expected-warning{{'r' is an unsafe pointer used for buffer access}}
				int *x; // expected-warning{{'x' is an unsafe pointer used for buffer access}}
				x[7] = 9; // expected-note{{used in buffer access here}}
				r = x;
				r++; // expected-note{{used in pointer arithmetic here}}
				}

				void test_cyclic_deps() {
				int *r = new int[10]; // expected-warning{{'r' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'r' to 'std::span' to preserve bounds information, and change 'q' and 'p' to 'std::span' to propagate bounds information between them$}}}}
				int *q;
				q = r;
				int *p;
				p = q;
				r[3] = 9; // expected-note{{used in buffer access here}}
				r = p;
				}

				void test_cyclic_deps_a() {
				int *r = new int[10]; // expected-warning{{'r' is an unsafe pointer used for buffer access}}
				int *q;
				q = r;
				int *p; // expected-warning{{'p' is an unsafe pointer used for buffer access}}
				p = q;
				r[3] = 9; // expected-note{{used in buffer access here}}
				r = p;
				p++; // expected-note{{used in pointer arithmetic here}}
				}

				void test_cyclic_deps1() {
				int *r = new int[10];
				int *q;
				q = r;
				int *p; // expected-warning{{'p' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'p' to 'std::span' to preserve bounds information, and change 'r' and 'q' to 'std::span' to propagate bounds information between them$}}}}
				p = q;
				p[3] = 9; // expected-note{{used in buffer access here}}
				r = p;
				}

				void test_cyclic_deps2() {
				int *r = new int[10];
				int *q; // expected-warning{{'q' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'q' to 'std::span' to preserve bounds information, and change 'r' and 'p' to 'std::span' to propagate bounds information between them$}}}}
				q = r;
				int *p;
				p = q;
				q[3] = 9; // expected-note{{used in buffer access here}}
				r = p;
				}

				void test_cyclic_deps3() {
				int *r = new int[10];
				int *q; // expected-warning{{'q' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'q' to 'std::span' to preserve bounds information, and change 'r' and 'p' to 'std::span' to propagate bounds information between them$}}}}
				q = r;
				int *p; // expected-warning{{'p' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'p' to 'std::span' to preserve bounds information, and change 'q' and 'r' to 'std::span' to propagate bounds information between them$}}}}
				p = q;
				q[3] = 9; // expected-note{{used in buffer access here}}
				p[4] = 7; // expected-note{{used in buffer access here}}
				r = p;
				}

				void test_cyclic_deps4() {
				int *r = new int[10]; // expected-warning{{'r' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'r' to 'std::span' to preserve bounds information, and change 'q' and 'p' to 'std::span' to propagate bounds information between them$}}}}
				int *q; // expected-warning{{'q' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'q' to 'std::span' to preserve bounds information, and change 'r' and 'p' to 'std::span' to propagate bounds information between them$}}}}
				q = r;
				int *p; // expected-warning{{'p' is an unsafe pointer used for buffer access}} expected-note-re{{{{^change type of 'p' to 'std::span' to preserve bounds information, and change 'r' and 'q' to 'std::span' to propagate bounds information between them$}}}}
				p = q;
				q[3] = 9; // expected-note{{used in buffer access here}}
				p[4] = 7; // expected-note{{used in buffer access here}}
				r[1] = 5; // expected-note{{used in buffer access here}}
				r = p;
				}
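
Several tests in this file (`check_rhs_nofix`, `test_unfixable`, `test_cyclic_deps_a`) expect a warning with no fix-it note once some variable in the group has an operation that cannot be fixed, such as pointer arithmetic. A minimal sketch of that all-or-nothing rule follows, assuming variables are modeled as strings; `groupIsFixable` and the `Unfixable` set are hypothetical names used only for illustration and do not appear in the patch.

```cpp
#include <algorithm>
#include <set>
#include <string>
#include <vector>

// Hypothetical model: a group may receive fix-its only if none of its members
// is in `Unfixable`, the set of variables with an operation that has no fix-it.
bool groupIsFixable(const std::vector<std::string> &Group,
                    const std::set<std::string> &Unfixable) {
  return std::none_of(Group.begin(), Group.end(),
                      [&](const std::string &V) {
                        return Unfixable.count(V) != 0;
                      });
}
```

In `check_rhs_nofix`, for instance, the group `{r, x}` would be rejected because `x++` has no fix-it, so neither declaration gets a note.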

Revision Contents

Diff 512973
