This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
cfe/trunk/
-
trunk/
-
include/clang/StaticAnalyzer/Core/
-
clang/
-
StaticAnalyzer/
-
Core/
-
AnalyzerOptions.def
-
BugReporter/
-
BugReporter.h
-
lib/StaticAnalyzer/Core/
-
StaticAnalyzer/
-
Core/
-
BugReporterVisitors.cpp
-
test/Analysis/
-
Analysis/
-
analyzer-config.c
-
track-control-dependency-conditions.cpp

Differential D62883

[analyzer] Track terminator conditions on which a tracked expressions depends
ClosedPublic

Authored by Szelethus on Jun 4 2019, 3:51 PM.

Download Raw Diff

Details

Reviewers

NoQ
dcoughlin
a.sidorin
baloghadamsoftware
xazax.hun
Charusso
rnkovacs

Commits

rG258e5e457a6d: [analyzer] Track terminator conditions on which a tracked expression depends
rC365207: [analyzer] Track terminator conditions on which a tracked expression depends
rL365207: [analyzer] Track terminator conditions on which a tracked expression depends

Summary

This patch implements the idea discussed on the mailing list, in fact, the included testfile contains the functions example_1 and example_2 exactly how it's described there.

The idea is to, as the title says, to track the value of the condition of the terminator statement on which the reported node depends on:

01 int flag;
02 bool coin();
03 
04 void foo() {
05   flag = coin(); // no note
06 }
07 
08 int main() {
09   int *x = 0; // x initialized to 0
10   flag = 1;
11   foo();
12   if (flag) // assumed false
13     x = new int;
14   foo();
15 
16   if (flag) // assumed true
17     *x = 5; // warn
18 }

We emit a warning at statement 17. The new BugReporter visitor figures out that statement 16 is in fact a control dependency if the reported node, and uses trackExpressionValue() to track it's condition, in this case, flag, resulting in new notes being placed at for the call to foo() on line 14 and a note about flag being invalidated on line 5.

Now, whether this change is any good is practically impossible to tell without evaluation on production code, so I'll get back with that once I gather some data.

Diff Detail

Repository: rL LLVM

Event Timeline

Szelethus created this revision.Jun 4 2019, 3:51 PM

Herald added subscribers: cfe-commits, gamesh411, dkrupp and 4 others. · View Herald TranscriptJun 4 2019, 3:51 PM

Szelethus added a parent revision: D62619: [analyzer][IDF] Add a control dependency calculator + a new debug checker.Jun 4 2019, 3:52 PM

For the test files where plist files have changed: (note to self: there has got to be a better way to show this off)

track_cond_undef_value_param_before.html24 KBDownload

track_cond_undef_value_param_after.html24 KBDownload

track_cond_unix_fns_before.html53 KBDownload

track_cond_unix_fns_after.html53 KBDownload

track_cond_retain_release_m_objc_1_before.html395 KBDownload

track_cond_retain_release_m_objc_1_after.html396 KBDownload

(same for the .*objcpp file)

track_cond_retain_release_m_objc_2_before.html397 KBDownload

track_cond_retain_release_m_objc_2_after.html397 KBDownload

track_cond_edges_new_before.html92 KBDownload

track_cond_edges_new_after.html92 KBDownload

track_cond_cxx_for_range_before.html25 KBDownload

track_cond_cxx_for_range_after.html25 KBDownload

clang/lib/StaticAnalyzer/Core/BugReporterVisitors.cpp
1863–1864 ↗	(On Diff #203040)	@NoQ Any feelings on this?

Yay, this thing really works!

Now, whether this change is any good is practically impossible to tell without evaluation on production code, so I'll get back with that once I gather some data.

Yes. This patch deserves some data on how much have reports grown in length (i.e., how many have grown in length and how much did they do so). Ideally we should also have a look at how many of the new notes were truly useful. Eg., you can start with the old report, understand it (or fail to do so), then take the new report. If you get this feeling that's like "ugh, if i've seen this note before i would have figured this out much sooner!", then it's a good note. If the bug report was obvious to begin with and the note didn't add much, it's probably not a good note. But it's not necessarily bad as long as the report remains readable.

Additionally, it's likely that some reports will in fact get suppressed due to inline defensive checks suppressions kicking in over condition values. That'd be a lot of fun to see as well.

clang/lib/StaticAnalyzer/Core/BugReporterVisitors.cpp
1863–1864 ↗	(On Diff #203040)	This makes some sense in the long run but i think you should give up here for now. Unlike `CXXForRangeStmt`, its Objective-C counterpart doesn't mock up the AST that would have made it look like a regular for-loop, so there just isn't an AST node that would represent the part of it that corresponds to "`__begin != __end`". Even in the C++ case, i'm not sure your current behavior would make much sense. We should probably delegate this work to a checker that knows how collections work and what makes them empty or have a certain size, something similar to what the `IteratorChecker` seems to be becoming :) Should we give mark the report as invalid when we give up here? I've no idea, i guess i'll have to gather more empirical evidence on that.
1894–1897 ↗	(On Diff #203040)	What if they're in the same function but in different stack frames, i.e. due to recursion? I think you should just compare stack frame contexts here.
1899–1901 ↗	(On Diff #203040)	I didn't really understand this part.
clang/test/Analysis/diagnostics/no-store-func-path-notes.m
26 ↗	(On Diff #203040)	I suspect we'll have to play with the wording a little bit. The current `trackExpressionValue()` was worded for the scenario in which there's only one value that we're tracking. So it keeps saying things like "THE VALUE" as if we're focused on one value. In reality, however, "the value 0" isn't the same value as the "undefined or garbage value" that the report is about. I think we should discriminate between these values somehow. Eg., "Passing the condition value 0 via 2nd parameter 'param'". We have this information when we're attaching the visitor, so we can keep passing it around.
clang/test/Analysis/track-control-dependency-conditions.cpp
34 ↗	(On Diff #203040)	Why?

xazax.hun requested changes to this revision.Jun 5 2019, 2:35 AM

xazax.hun added inline comments.

clang/include/clang/StaticAnalyzer/Core/BugReporter/BugReporterVisitors.h
176 ↗	(On Diff #203040)	I think we should make clear that this visitor only operates within one function and does not track controll dependencies across functions.
clang/lib/StaticAnalyzer/Core/BugReporterVisitors.cpp
680 ↗	(On Diff #203040)	What was the reason of adding these begin/ends? I prefer to keep such refactorings separate, also it might be controversial whether this change is desired.
1863–1864 ↗	(On Diff #203040)	I vaguely recall some problem with the results of getTerminatorStmt for logical operators like &&. I believe what we really want is the last expression we evaluated in the basic block which will be the last Stmt of the basic block. So if we can settle with the last stmt we can get rid of this code.
1899–1901 ↗	(On Diff #203040)	I guess detecting control depdendency only makes sense for the last statement of the basic block (which is the first one we see in the visitor order). But instead of maintaining a state with the visited nodes we could also check if the statement is the last one of its basic block.

This revision now requires changes to proceed.Jun 5 2019, 2:35 AM

I think we should make clear that this visitor only operates within one function and does not track controll dependencies across functions.

Hmm, maybe it's worth investigating whether if we pop the call stack, does the bug report improve if we update the visitor to build control dependencies for the recent function, and update the origin block to be the one where the function call was located.

I implemented the fixes you commented on, but during evaluation it turned out that my visitor eats ram for breakfast, and then goes for seconds. I mean, like 5-30x the normal memory consumption, and the same for analysis speed. I counted the number of concurrent instances (incrementing a static counter in ctor, decrementing in dtor), and it seems to go up into the hundreds of thousands, even millions before restarting the counter. I killed the process before crippling the server.

I'm not sure how long it'll take for me to figure out what's wrong, but these numbers are so ridiculous, I suspect a programming error rather then an algorithmic issue.

edit: I seem to have found a solution, I'll share more later when I get to evaluate this :)

In D62883#1536894, @Szelethus wrote:

I'm not sure how long it'll take for me to figure out what's wrong, but these numbers are so ridiculous, I suspect a programming error rather then an algorithmic issue.

edit: I seem to have found a solution, I'll share more later when I get to evaluate this :)

At a glance, it might be that bugreporter::trackExpressionValue() is called on every node within the block, rather than only once in a lifetime of the visitor.

Resolved some reviewer comments
Added a BugReport level set to avoid tracking the same condition (which would result in an almost infinite loop)

Aaaand I have some results to show: http://cc.elte.hu:15001/Default/#

Sort by "Storage date", and look for "LLVM/Clang/Clang-tools-extra BEFORE tracking conditions" and "LLVM/Clang/Clang-tools-extra AFTER tracking conditions". I didn't have much time to draw conclusions yet, but it's clear that the amount of extra notes are intolerable.

Also, this visitor should be TU-local, I could have better comments, formatting, etc, but decided to update the patch because this is the version I used for evaluation.

Szelethus marked 6 inline comments as done.Jun 13 2019, 8:45 AM

Szelethus added inline comments.

clang/test/Analysis/track-control-dependency-conditions.cpp
34 ↗	(On Diff #203040)	I'll remove it in the next diff pinky swear.

I marked reports, confusingly, "Confirmed" if the extra notes were meaningful, "Intentional" if they were meaningless, and "False positive" if it's in between.

I took a look at the results. You can take a look at selected ones here:

http://cc.elte.hu:15001/Default/#report-hash=&run=LLVM%2FClang%2FClang-tools-extra%20AFTER%20tracking%20conditions&review-status=Confirmed&review-status=False%20positive&review-status=Intentional&detection-status=New&detection-status=Reopened&detection-status=Unresolved&tab=LLVM%2FClang%2FClang-tools-extra%20AFTER%20tracking%20conditions&is-unique=on

Due to the lack of better categorization, I used the following (so this has in fact no relation to whether the report is correct or not):

Intentional reports (green x) are the ones where the bugreport got unquestionably worse.
Confirmed reports (red check mark) got definitely better
False positives (grey line) are in between.

If you hover your mouse over the icon, you can also read a comment of mine. When you open a report, in the right corner of the source code you'll find a dropdown menu, allowing you to see the report without this patch applied:

There, you can observe the original bug report without this patch.

Some conclusions:

Cases where the condition is also a variable initialization tend to expand the bug path greatly. This isn't always bad, but should be noted. In general, unless we have a good heuristic to figure out whether there is meaningful information on the right hand side of the initialization, I think we just shouldn't track these. A good example for this is this one. The 37th event contains pretty much every information we need, it's obvious that the optional could either be None or non-None, since it's in a condition. dyn_cast is a prime example too: in this case, note 9 is all we really need.
We should probably believe that operator bool() is implemented correctly, and shouldn't track the value all the way there (at least, when we're only tracking the condition). Example
We shouldn't ever track assert-like conditions. Example (note 38-41)
When the report didn't suffer from any of the above issues, I found the extra notes to be helpful! :D

I'm doing another analysis to help a bit on evaluation with the following modification:

diff --git a/clang/lib/StaticAnalyzer/Core/BugReporterVisitors.cpp b/clang/lib/StaticAnalyzer/Core/BugReporterVisitors.cpp
index 9c40ebead56..4ac4b873675 100644
--- a/clang/lib/StaticAnalyzer/Core/BugReporterVisitors.cpp
+++ b/clang/lib/StaticAnalyzer/Core/BugReporterVisitors.cpp
@@ -1892,12 +1892,28 @@ TrackControlDependencyCondBRVisitor::VisitNode(const ExplodedNode *N,
   if (!OriginB || !NB)
     return nullptr;
 
-  if (ControlDepTree.isControlDependency(OriginB, NB))
-    if (const Expr *Condition = getTerminatorCondition(NB))
-      if (BR.addTrackedCondition(Condition))
+  if (ControlDepTree.isControlDependency(OriginB, NB)) {
+    if (const Expr *Condition = getTerminatorCondition(NB)) {
+      if (BR.addTrackedCondition(Condition)) {
         bugreporter::trackExpressionValue(
             N, Condition, BR, /*EnableNullFPSuppression=*/false);
 
+        if (BRC.getAnalyzerOptions().AnalysisDiagOpt == PD_NONE)
+          return nullptr;
+
+        std::string ConditionText = Lexer::getSourceText(
+            CharSourceRange::getTokenRange(Condition->getSourceRange()),
+                                           BRC.getSourceManager(),
+                                           BRC.getASTContext().getLangOpts());
+
+        return std::make_shared<PathDiagnosticEventPiece>(
+            PathDiagnosticLocation::createEnd(
+                Condition, BRC.getSourceManager(), N->getLocationContext()),
+                "Tracking condition " + ConditionText);
+      }
+    }
+  }
+
   return nullptr;
 }

*sloowly catches up >.<*

sanaanajjar231288 changed the visibility from "Public (No Login Required)" to "All Users".Jun 14 2019, 5:31 AM

I looked at bug path with the Optional. I did not debug into the analyzer but my intuition is the following: DefChainEnd is interesting. TerminatedPaths is the control dependency of DefChainEnd. And there are probably other things that are the control and/or value dependency of TerminatedPaths . One trick we could try is to limit the number of dependencies we track. For example, we could try what happens if we only track the immediate control dependencies of the original interesting region/symbol and does not add more control dependency recursively.

In case we find limiting the number of dependencies useful, I could imagine a user-facing flag like report-verbosity. The default one could include only the notes we find useful after limiting the tracking. A detailed option could add all the notes from the transitive tracking of dependencies. A verbose option could remove path pruning as well. This is just random brainstorming :) First, we need to so whether non-transitive tracking it is actually improving the situation.

I am not sure about assuming operator bool being correct. I think we first could think about other tricks to limit the tracking (see my idea above) and if we fail I would only add such rules as a last resort.

clang/include/clang/StaticAnalyzer/Core/BugReporter/BugReporter.h
157 ↗	(On Diff #204554)	Do we need this? I wonder if marking the result of the condition as interesting is sufficient.
clang/lib/StaticAnalyzer/Core/BugReporterVisitors.cpp
1850 ↗	(On Diff #204554)	Why do we need a ptr to non-const CFGBlock?
1895 ↗	(On Diff #204554)	Maybe this is the worng place to comment, but `isControlDependency` sounds a bit weird. How about `isControlDependenent`?

I don't understand why the visibility had to be changed (especially without explanation), and unless there's a good reason for it, I strongly disagree with it.

I am not sure about assuming operator bool being correct. I think we first could think about other tricks to limit the tracking (see my idea above) and if we fail I would only add such rules as a last resort.

I think this depends greatly on the stack frame layout. I seem to be perfectly fine with not tracking the value within foo() in

if (foo()) {
  ...
}

whenever foo() returns a bool. Simply because "why else would you add a check if it always returns the same value?".

However, this heuristic breaks when the same code appears in an inlined stack frame. Because given the context, we need to prove that this check makes sense *in this context*.

I strongly suspect that all of the heuristics that you're talking about depend significantly on the surrounding call stacks. Like, i suspect it so strongly that i'm ready to admit that i've no idea what we're doing anymore, and it makes sense to take a pen and a sheet of paper and try to write down the tracking rules, and then come up with an easily editable and extensible architecture for tweaking those rules as we discover more and more of them.

In D62883#1542802, @Szelethus wrote:

Some conclusions:

Cases where the condition is also a variable initialization tend to expand the bug path greatly. This isn't always bad, but should be noted. In general, unless we have a good heuristic to figure out whether there is meaningful information on the right hand side of the initialization, I think we just shouldn't track these. A good example for this is this one. The 37th event contains pretty much every information we need, it's obvious that the optional could either be None or non-None, since it's in a condition. dyn_cast is a prime example too: in this case, note 9 is all we really need.

We should probably believe that operator bool() is implemented correctly, and shouldn't track the value all the way there (at least, when we're only tracking the condition). Example

We shouldn't ever track assert-like conditions. Example (note 38-41)

When the report didn't suffer from any of the above issues, I found the extra notes to be helpful! :D

On the other hand, all of these problems seem to be examples of the problem of D62978. Might it be that it's the only thing that we're missing? Like, we need to formulate a rule that'll give us an understanding of when do we track the value past the point where it has been constrained to true or false - do we care about the origins of the value or do we care about its constraints only? In case of flag in the test examples, we clearly do. In case of these bools that come from boolean conversions and assertions and such, we clearly don't. What's the difference?

clang/test/Analysis/track-control-dependency-conditions.cpp
66 ↗	(On Diff #204554)	It's invalidation of globals by `coin()` which potentially writes to `bar`.
80–81 ↗	(On Diff #204554)	These notes should be in the opposite order. It doesn't matter for the test case, but it improves readability of the test case :)

How about we track the condition value past its collapse point only if it involves an overwrite of a variable (or other region) from which the value is loaded? Like, if there are no overwrites, stop at the collapse point. If there are overwrites, stop at the oldest overwrite point (i.e., the value was loaded from 'x' which was previously overwritten by the value of value 'y' which was previously overwritten by the initial value of variable 'z' => stop at the overwrite point of 'y').

(then, again, not sure what happens if more nested stack frames are around)

In D62883#1544283, @NoQ wrote:

However, this heuristic breaks when the same code appears in an inlined stack frame. Because given the context, we need to prove that this check makes sense *in this context*. I strongly suspect that all of the heuristics that you're talking about depend significantly on the surrounding call stacks.

I don't follow you here, unfortunately, could you please elaborate? Also, I'll come out with this right now: I never really understood these contexts, is there a handy doc or a paper I could take a look at?

Like, i suspect it so strongly that i'm ready to admit that i've no idea what we're doing anymore, and it makes sense to take a pen and a sheet of paper and try to write down the tracking rules, and then come up with an easily editable and extensible architecture for tweaking those rules as we discover more and more of them.

That sounds great. The should-not-have-happened and the must-have-happened cases will all rely on this (since in both cases, at least for now, we'd like to use the extra information to track conditions), so maybe it's due to refresh trackExpressionValue. It would also make publishing and reviewing my works a lot easier.

In D62883#1544302, @NoQ wrote:

On the other hand, all of these problems seem to be examples of the problem of D62978. Might it be that it's the only thing that we're missing? Like, we need to formulate a rule that'll give us an understanding of when do we track the value past the point where it has been constrained to true or false - do we care about the origins of the value or do we care about its constraints only? In case of flag in the test examples, we clearly do. In case of these bools that come from boolean conversions and assertions and such, we clearly don't. What's the difference?

How about we track the condition value past its collapse point only if it involves an overwrite of a variable (or other region) from which the value is loaded? Like, if there are no overwrites, stop at the collapse point. If there are overwrites, stop at the oldest overwrite point (i.e., the value was loaded from 'x' which was previously overwritten by the value of value 'y' which was previously overwritten by the initial value of variable 'z' => stop at the overwrite point of 'y').

(then, again, not sure what happens if more nested stack frames are around)

For now, I'd like to restrict my efforts to simply "we should track this" or "we shouldn't track this". I think simply not tracking cases where the condition is a single variable assignment is a good initial approach. Let how the tracking is done be a worry for another day. Also, I'm a little confused, isn't this comment about how D62978 could be solved?

In D62883#1542802, @Szelethus wrote:

If you hover your mouse over the icon, you can also read a comment of mine. When you open a report, in the right corner of the source code you'll find a dropdown menu, allowing you to see the report without this patch applied:

I added another run that displays notes at each condition that this new visitor tracks.

Szelethus marked an inline comment as done.Jun 16 2019, 11:44 AM

Szelethus added inline comments.

clang/include/clang/StaticAnalyzer/Core/BugReporter/BugReporter.h
157 ↗	(On Diff #204554)	Later on, when I'm covering the "should-not-have-happened" case, it's possible that a condition would be added that wasn't even explored by the analyzer, so this is kinda necessary.

Szelethus mentioned this in D63389: [IDF] Generalize IDFCalculator to be used with Clang's CFG.Jun 16 2019, 3:12 PM

Rebase on the rest of the patches
Make TrackControlDependencyCondBRVisitor local to BugReporterVisitors.cpp
Hide the current implementation behind the off-by-default analyzer config "track-conditions"
Add two more testcases I'd like to improve on in followup patches

Do you think that this patch looks good enough now that it's off by default? Maybe it'd be easier to develop/review followup changes separately.

Szelethus marked 8 inline comments as done.Jun 16 2019, 4:13 PM

Szelethus added inline comments.

clang/lib/StaticAnalyzer/Core/BugReporterVisitors.cpp
1850 ↗	(On Diff #204554)	`llvm::IDFCalculator` takes the blocks as a non-const pointer.

Hey! It is a cool patch as the smallest the scope the better to understand what is going on. I like the direction, but we saw with @NoQ, we have to think about "In which range do we track?".

In D62883#1545282, @Szelethus wrote:

Hide the current implementation behind the off-by-default analyzer config "track-conditions"

Do you think that this patch looks good enough now that it's off by default? Maybe it'd be easier to develop/review followup changes separately.

It would be helpful to see what is exactly new, what trackExpressionValue() cannot provide. Could you adjust the test case with a second run, where you switch off that new feature, please? (I highly recommend that for future developments also.)

In D62883#1544609, @Szelethus wrote:

In D62883#1544302, @NoQ wrote:

On the other hand, all of these problems seem to be examples of the problem of D62978. Might it be that it's the only thing that we're missing? Like, we need to formulate a rule that'll give us an understanding of when do we track the value past the point where it has been constrained to true or false - do we care about the origins of the value or do we care about its constraints only? In case of flag in the test examples, we clearly do. In case of these bools that come from boolean conversions and assertions and such, we clearly don't. What's the difference?

How about we track the condition value past its collapse point only if it involves an overwrite of a variable (or other region) from which the value is loaded? Like, if there are no overwrites, stop at the collapse point. If there are overwrites, stop at the oldest overwrite point (i.e., the value was loaded from 'x' which was previously overwritten by the value of value 'y' which was previously overwritten by the initial value of variable 'z' => stop at the overwrite point of 'y').

(then, again, not sure what happens if more nested stack frames are around)

For now, I'd like to restrict my efforts to simply "we should track this" or "we shouldn't track this". I think simply not tracking cases where the condition is a single variable assignment is a good initial approach. Let how the tracking is done be a worry for another day. Also, I'm a little confused, isn't this comment about how D62978 could be solved?

As @NoQ pointed out we have some problem with that function. We are tracking *values* without using the redecl()-chain (see in https://clang.llvm.org/doxygen/DeclBase_8h_source.html#l00948), or without tracking conditions. You have solved(?) the latter, but the former is equally important to solve.

As I see that is the current situation:

ReturnVisitor: give us too much unrelated information as we go out of scope.
FindLastStoreBRVisitor: do not use redecls() so gets confused with all the hackish SVal conversions.
TrackControlDependencyCondBRVisitor: swaps notes which is questionable if I want to see what the above two kindly provides for me, or I would remove them instead.

I believe this patch should be on by default, but it requires to fix the "In which range do we track?" problem with D62978.

This revision now requires changes to proceed.Jun 16 2019, 5:01 PM

In D62883#1545324, @Charusso wrote:

Hey! It is a cool patch as the smallest the scope the better to understand what is going on. I like the direction, but we saw with @NoQ, we have to think about "In which range do we track?".

In D62883#1545282, @Szelethus wrote:

Hide the current implementation behind the off-by-default analyzer config "track-conditions"

Do you think that this patch looks good enough now that it's off by default? Maybe it'd be easier to develop/review followup changes separately.

It would be helpful to see what is exactly new, what trackExpressionValue() cannot provide. Could you adjust the test case with a second run, where you switch off that new feature, please? (I highly recommend that for future developments also.)

You can see how this would look like by checking the history of this revision: https://reviews.llvm.org/D62883?vs=204554&id=204973&whitespace=ignore-most#toc. Adding a second RUN: line might be a good idea though.

In D62883#1544609, @Szelethus wrote:

In D62883#1544302, @NoQ wrote:

On the other hand, all of these problems seem to be examples of the problem of D62978. Might it be that it's the only thing that we're missing? Like, we need to formulate a rule that'll give us an understanding of when do we track the value past the point where it has been constrained to true or false - do we care about the origins of the value or do we care about its constraints only? In case of flag in the test examples, we clearly do. In case of these bools that come from boolean conversions and assertions and such, we clearly don't. What's the difference?

How about we track the condition value past its collapse point only if it involves an overwrite of a variable (or other region) from which the value is loaded? Like, if there are no overwrites, stop at the collapse point. If there are overwrites, stop at the oldest overwrite point (i.e., the value was loaded from 'x' which was previously overwritten by the value of value 'y' which was previously overwritten by the initial value of variable 'z' => stop at the overwrite point of 'y').

(then, again, not sure what happens if more nested stack frames are around)

For now, I'd like to restrict my efforts to simply "we should track this" or "we shouldn't track this". I think simply not tracking cases where the condition is a single variable assignment is a good initial approach. Let how the tracking is done be a worry for another day. Also, I'm a little confused, isn't this comment about how D62978 could be solved?

As @NoQ pointed out we have some problem with that function. We are tracking *values* without using the redecl()-chain (see in https://clang.llvm.org/doxygen/DeclBase_8h_source.html#l00948), or without tracking conditions. You have solved(?) the latter, but the former is equally important to solve.

I'm a little confused, which is the "former" and "latter" you're referring to?

As I see that is the current situation:

ReturnVisitor: give us too much unrelated information as we go out of scope.

FindLastStoreBRVisitor: do not use redecls() so gets confused with all the hackish SVal conversions.

TrackControlDependencyCondBRVisitor: swaps notes which is questionable if I want to see what the above two kindly provides for me, or I would remove them instead.

I believe this patch should be on by default, but it requires to fix the "In which range do we track?" problem with D62978.

I disagree with this. The reason why I'd like to make this an off-by-default option is to implement my followup improvements incrementally (not only that, but a whole family of conditions is going to be added), and allow us to observe the changes those make in relation to this patch -- besides, I don't really see this patch changing even if I manage to fix those issues. Would you like to see a change being made to this specific patch?

In D62883#1545341, @Szelethus wrote:

In D62883#1545324, @Charusso wrote:

As @NoQ pointed out we have some problem with that function. We are tracking *values* without using the redecl()-chain (see in https://clang.llvm.org/doxygen/DeclBase_8h_source.html#l00948), or without tracking conditions. You have solved(?) the latter, but the former is equally important to solve.

I'm a little confused, which is the "former" and "latter" you're referring to?

I believe you have solved the condition tracking as you move in-place what is going on.

I believe this patch should be on by default, but it requires to fix the "In which range do we track?" problem with D62978.

I disagree with this. The reason why I'd like to make this an off-by-default option is to implement my followup improvements incrementally (not only that, but a whole family of conditions is going to be added), and allow us to observe the changes those make in relation to this patch -- besides, I don't really see this patch changing even if I manage to fix those issues. Would you like to see a change being made to this specific patch?

As you moving stuff around it just bypasses the usefulness of the mentioned two visitors, which is a problem. Also you have mentioned some very crazy bad side-effects which is yes, related to in which range do we operate.

Add another RUN: line to see more clearly the effects of this patch.

In D62883#1545343, @Charusso wrote:

In D62883#1545341, @Szelethus wrote:

In D62883#1545324, @Charusso wrote:

As @NoQ pointed out we have some problem with that function. We are tracking *values* without using the redecl()-chain (see in https://clang.llvm.org/doxygen/DeclBase_8h_source.html#l00948), or without tracking conditions. You have solved(?) the latter, but the former is equally important to solve.

I'm a little confused, which is the "former" and "latter" you're referring to?

I believe you have solved the condition tracking as you move in-place what is going on.

I'm still not sure I follow :) What are the two distinct things to be solved here?

I believe this patch should be on by default, but it requires to fix the "In which range do we track?" problem with D62978.

I disagree with this. The reason why I'd like to make this an off-by-default option is to implement my followup improvements incrementally (not only that, but a whole family of conditions is going to be added), and allow us to observe the changes those make in relation to this patch -- besides, I don't really see this patch changing even if I manage to fix those issues. Would you like to see a change being made to this specific patch?

As you moving stuff around it just bypasses the usefulness of the mentioned two visitors, which is a problem. Also you have mentioned some very crazy bad side-effects which is yes, related to in which range do we operate.

Could you please elaborate? The particular thing I'm missing, I guess, is the need to get those issues fixed before this patch, and making this on-by-default. If anything, enabling a flag like this could really demonstrate changes on those problems. I also like to think that tracking a condition (especially a condition of a control dependency of yet another condition) is a drastically different case that tracking a "regular" variable (in particular, I think more aggressive pruning could be used), and such a change would definitely be out of scope for this patch.

I like to think that most of the unimportant notes can be easily categorized, as seen above. With some, admittedly crude heuristics, we could cut these down greatly, and leave some breathing room for fine-tuning it.

My opinion based on the Summary, which is the kind of the opposite what we have. No invalidation happens, so no swap happens, so it is kind of misleading like writing out "Returning something" where we cannot be sure what we return ("resulting in new notes being placed at for the call to foo() on line 14 and a note about flag being invalidated on line 5.").
Please update the Summary according to the changes in the patch.

Also please note that, we are very pessimistic and everything is uninteresting (like returning unknown values), you only could markInteresting().

In D62883#1545347, @Szelethus wrote:

I like to think that most of the unimportant notes can be easily categorized, as seen above. With some, admittedly crude heuristics, we could cut these down greatly, and leave some breathing room for fine-tuning it.

I am not against that direction, just you throw oil on fire, and we already have a huge fire (D62978). I believe with that we could see more bad-patterns and we could answer "In which range do we track?".
I hope you find the answer, and thanks for working on that to solve the problem!

NoQ added inline comments.Jun 17 2019, 2:29 PM

clang/lib/StaticAnalyzer/Core/BugReporterVisitors.cpp
1609–1613 ↗	(On Diff #204980)	A bit clearner: auto StmtElem = B->rbegin().getAs<CFGStmt>(); if (!StmtElem) return nullptr; const Stmt *S = StmtElem->getStmt(); Also how about `CFGBlock::getTerminatorCondition()`?
1640–1642 ↗	(On Diff #204980)	`// TODO: This can be cached.`
1646 ↗	(On Diff #204980)	All right, i still don't understand this caching based on condition expression. You mean, like, if we're encountering the same condition multiple times (say, in a loop), we should only track it once? Why? Like, its values (which are the thing we'll really be tracking) may be different (say, on different loop iterations).

Szelethus marked 3 inline comments as done.Jun 17 2019, 3:17 PM

Szelethus added inline comments.

clang/lib/StaticAnalyzer/Core/BugReporterVisitors.cpp
1609–1613 ↗	(On Diff #204980)	I peeked at it's implementation, and it seems to be incorrect. Referencing @xazax.hun's inline: I vaguely recall some problem with the results of getTerminatorStmt for logical operators like &&. I believe what we really want is the last expression we evaluated in the basic block which will be the last Stmt of the basic block. So if we can settle with the last stmt we can get rid of this code. And this is what `CFGBlock::getTerminatorCondition()` does: Stmt Terminator = getTerminatorStmt(); if (!Terminator) return nullptr; Expr E = nullptr; switch (Terminator->getStmtClass()) { // ... case Stmt::BinaryOperatorClass: // '&&' and '\|\|' E = cast<BinaryOperator>(Terminator)->getLHS(); // .... } return E; But nevertheless, maybe it'd be good to fix this in there and use it here.
1640–1642 ↗	(On Diff #204980)	Like, caching the `CFGBlock`s for given `ExplodedNode`s?
1646 ↗	(On Diff #204980)	Yup, can't argue with that, I'll revise this.

NoQ added inline comments.Jun 17 2019, 3:23 PM

clang/lib/StaticAnalyzer/Core/BugReporterVisitors.cpp
1640–1642 ↗	(On Diff #204980)	I mean, on this line it's the same block every time (per instance of a visitor).

Szelethus set the repository for this revision to rG LLVM Github Monorepo.Jun 20 2019, 2:29 PM

Now using CFGBlock::getTerminatorConditionExpr()
Uniqueing already tracked conditions as an (Expr, ExplodedNode) pair instead of on Expr
Add a TODO: about caching
Add plenty of new test cases for good measure (examples from my latest mail http://lists.llvm.org/pipermail/cfe-dev/2019-June/062613.html)

Szelethus marked 3 inline comments as done.Jun 20 2019, 3:42 PM

Szelethus added a parent revision: D63538: [CFG] Add a new function to get the proper condition of a CFGBlock.

In D62883#1552870, @Szelethus wrote:

Now using CFGBlock::getTerminatorConditionExpr()

Uniqueing already tracked conditions as an (Expr, ExplodedNode) pair instead of on Expr

I would be surprised to see the same ExploderNode multiple time a lot. Also, ExploderNode will have a program point, so having both an ExploderNode and an Expr feels redundant to me.

Add a TODO: about caching

Add plenty of new test cases for good measure (examples from my latest mail http://lists.llvm.org/pipermail/cfe-dev/2019-June/062613.html)

In D62883#1552969, @xazax.hun wrote:

In D62883#1552870, @Szelethus wrote:

Uniqueing already tracked conditions as an (Expr, ExplodedNode) pair instead of on Expr

I would be surprised to see the same ExploderNode multiple time a lot. Also, ExploderNode will have a program point, so having both an ExploderNode and an Expr feels redundant to me.

(Expr, LocationContext) sounds right.

Oh, wait, no, loops.

Collect only ExplodedNodes. Lesson learned!
Add a TODO: in trackExpressionValue about maybe tracking conditions to all bug locations, rather than only for tracked variables.

Szelethus marked an inline comment as done.Jun 21 2019, 2:53 AM

Szelethus added a child revision: D63642: [analyzer] Add a debug analyzer config to place an event for each tracked condition.Jun 21 2019, 4:21 AM

Szelethus retitled this revision from [analyzer] Track conditions of terminator statements on which the reported node depends on to [analyzer] Track terminator conditions on which a tracked expressions depends.Jun 21 2019, 4:28 AM

Szelethus set the repository for this revision to rG LLVM Github Monorepo.

In D62883#1553248, @Szelethus wrote:

Add a TODO: in trackExpressionValue about maybe tracking conditions to all bug locations, rather than only for tracked variables.

What do you mean by "all bug locations"?

clang/include/clang/StaticAnalyzer/Core/BugReporter/BugReporter.h
355–357 ↗	(On Diff #205954)	Pls add a comment that explains when does this function return true or false. I always forget what does insert().second do :)
clang/lib/StaticAnalyzer/Core/BugReporterVisitors.cpp
1628 ↗	(On Diff #205954)	Maybe let's add a comment that this is for inter-visitor communication only. Because otherwise we won't visit the same node twice in the same visitor.

In D62883#1554339, @NoQ wrote:

In D62883#1553248, @Szelethus wrote:

Add a TODO: in trackExpressionValue about maybe tracking conditions to all bug locations, rather than only for tracked variables.

What do you mean by "all bug locations"?

This visitor is added when a variable is tracked, yet I feel the control dependency of every report is important (else we would never have discovered the error).

Aha, ok, got it. I guess the official term is "error node" (where "error" means "warning").

clang/test/Analysis/track-control-dependency-conditions.cpp
1–8 ↗	(On Diff #205954)	Btw did you try `-verify=` as in D60732? It might make the test more readable.

It should be pretty easy to implement, just add your new visitor to the list of default visitors in findValidReport().

Woohoo! Thanks for everything, this is the most fun I've had working on this project! Let's wait for @xazax.hun to have the final say.

In D62883#1554494, @NoQ wrote:

It should be pretty easy to implement, just add your new visitor to the list of default visitors in findValidReport().

The thinking was to preserve this patch (roughly) in the state as the analyses were run. I also plan to patch CodeChecker so that it can diff two runs only only by bug report hash, but also by bug report length, which should make evaluation a lot easier. This may imply that I'll delay tracking control dependencies to all error nodes for later, as LLVM literally takes a day to analyze each time (and the sheer size of it gives invaluable data, which is why I plan doing it, but will check other projects too).

Szelethus marked an inline comment as done.Jun 21 2019, 4:32 PM

Szelethus added inline comments.

clang/test/Analysis/track-control-dependency-conditions.cpp
1–8 ↗	(On Diff #205954)	So thats how you do it! Yea, this looks a lot better.

In D62883#1554514, @Szelethus wrote:

In D62883#1554494, @NoQ wrote:

It should be pretty easy to implement, just add your new visitor to the list of default visitors in findValidReport().

The thinking was to preserve this patch (roughly) in the state as the analyses were run.

Sure!

Artem had some comments that are not marked as done, but LGTM!

clang/include/clang/StaticAnalyzer/Core/BugReporter/BugReporter.h
355–357 ↗	(On Diff #205954)	This is not done yet :)
157 ↗	(On Diff #204554)	When the analyzer did not explore a given code snippet we will not have an exploded node, but we can change this in a followup patch.
clang/lib/StaticAnalyzer/Core/BugReporterVisitors.cpp
1628 ↗	(On Diff #205954)	Also not done.

This revision is now accepted and ready to land.Jun 25 2019, 6:37 AM

I usually do the final update right before commiting. There still are non-accepted dependencies, who knows what'll happen.

clang/include/clang/StaticAnalyzer/Core/BugReporter/BugReporter.h
157 ↗	(On Diff #204554)	Ugh, right. Let's keep this as is is for now, dunno how I forgot about it.

Szelethus mentioned this in rL364911: [IDF] Generalize IDFCalculator to be used with Clang's CFG.Jul 2 2019, 4:30 AM

Szelethus mentioned this in rG9353421ecd12: [IDF] Generalize IDFCalculator to be used with Clang's CFG.Jul 2 2019, 4:35 AM

Add two more test cases when a "Returning value" note is meaningful, and one where it's not
Fix inlines!

Szelethus mentioned this in D64232: [analyzer] Prune calls to functions with linear CFGs that return a non-zero constrained value.Jul 4 2019, 5:08 PM

Accept².

clang/lib/StaticAnalyzer/Core/BugReporterVisitors.cpp
1824–1825 ↗	(On Diff #208084)	To think: not sure you want to do this for memory leak reports, because the place where the leak is discovered (i.e., where we randomly decided to run garbage collection and noticed a leak) may be fairly weird. Path to that point may still be moderately interesting (after all, it presumably doesn't leak on other paths), but i'm still worried that we may bring in some completely unrelated stuff. I might be wrong.

Closed by commit rL365207: [analyzer] Track terminator conditions on which a tracked expression depends (authored by Szelethus). · Explain WhyJul 5 2019, 6:30 AM

This revision was automatically updated to reflect the committed changes.

Herald added a project: Restricted Project. · View Herald TranscriptJul 5 2019, 6:30 AM

Herald added a subscriber: llvm-commits. · View Herald Transcript

Szelethus mentioned this in rL368771: [analyzer] Prune calls to functions with linear CFGs that return a non-zero….Aug 13 2019, 4:21 PM

Szelethus mentioned this in rG46929df72333: [analyzer] Prune calls to functions with linear CFGs that return a non-zero….

Revision Contents

Path

Size

cfe/

trunk/

include/

clang/

StaticAnalyzer/

Core/

AnalyzerOptions.def

5 lines

BugReporter/

BugReporter.h

10 lines

lib/

StaticAnalyzer/

Core/

BugReporterVisitors.cpp

93 lines

test/

Analysis/

analyzer-config.c

3 lines

track-control-dependency-conditions.cpp

285 lines

Diff 208167

cfe/trunk/include/clang/StaticAnalyzer/Core/AnalyzerOptions.def

Show First 20 Lines • Show All 285 Lines • ▼ Show 20 Lines	ANALYZER_OPTION(bool, ShouldDisplayMacroExpansions, "expand-macros",
"expanded and included in the plist output.",		"expanded and included in the plist output.",
false)		false)

ANALYZER_OPTION(bool, DisplayCTUProgress, "display-ctu-progress",		ANALYZER_OPTION(bool, DisplayCTUProgress, "display-ctu-progress",
"Whether to emit verbose output about "		"Whether to emit verbose output about "
"the analyzer's progress related to ctu.",		"the analyzer's progress related to ctu.",
false)		false)

		ANALYZER_OPTION(bool, ShouldTrackConditions, "track-conditions",
		"Whether to track conditions that are a control dependency of "
		"an already tracked variable.",
		false)

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Unsinged analyzer options.		// Unsinged analyzer options.
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

ANALYZER_OPTION(		ANALYZER_OPTION(
unsigned, AlwaysInlineSize, "ipa-always-inline-size",		unsigned, AlwaysInlineSize, "ipa-always-inline-size",
"The size of the functions (in basic blocks), which should be considered "		"The size of the functions (in basic blocks), which should be considered "
"to be small enough to always inline.",		"to be small enough to always inline.",
▲ Show 20 Lines • Show All 79 Lines • Show Last 20 Lines

cfe/trunk/include/clang/StaticAnalyzer/Core/BugReporter/BugReporter.h

Show First 20 Lines • Show All 147 Lines • ▼ Show 20 Lines	protected:

/// If non-empty, this bug report is likely a false positive and should not be		/// If non-empty, this bug report is likely a false positive and should not be
/// shown to the user.		/// shown to the user.
///		///
/// \sa markInvalid		/// \sa markInvalid
/// \sa removeInvalidation		/// \sa removeInvalidation
llvm::SmallSet<InvalidationRecord, 4> Invalidations;		llvm::SmallSet<InvalidationRecord, 4> Invalidations;

		/// Conditions we're already tracking.
		llvm::SmallSet<const ExplodedNode *, 4> TrackedConditions;

private:		private:
// Used internally by BugReporter.		// Used internally by BugReporter.
Symbols &getInterestingSymbols();		Symbols &getInterestingSymbols();
Regions &getInterestingRegions();		Regions &getInterestingRegions();

void lazyInitializeInterestingSets();		void lazyInitializeInterestingSets();
void pushInterestingSymbolsAndRegions();		void pushInterestingSymbolsAndRegions();
void popInterestingSymbolsAndRegions();		void popInterestingSymbolsAndRegions();
▲ Show 20 Lines • Show All 180 Lines • ▼ Show 20 Lines	public:

/// Remove all visitors attached to this bug report.		/// Remove all visitors attached to this bug report.
void clearVisitors();		void clearVisitors();

/// Iterators through the custom diagnostic visitors.		/// Iterators through the custom diagnostic visitors.
visitor_iterator visitor_begin() { return Callbacks.begin(); }		visitor_iterator visitor_begin() { return Callbacks.begin(); }
visitor_iterator visitor_end() { return Callbacks.end(); }		visitor_iterator visitor_end() { return Callbacks.end(); }

		/// Notes that the condition of the CFGBlock associated with \p Cond is
		/// being tracked.
		/// \returns false if the condition is already being tracked.
		bool addTrackedCondition(const ExplodedNode *Cond) {
		return TrackedConditions.insert(Cond).second;
		}

/// Profile to identify equivalent bug reports for error report coalescing.		/// Profile to identify equivalent bug reports for error report coalescing.
/// Reports are uniqued to ensure that we do not emit multiple diagnostics		/// Reports are uniqued to ensure that we do not emit multiple diagnostics
/// for each bug.		/// for each bug.
virtual void Profile(llvm::FoldingSetNodeID& hash) const;		virtual void Profile(llvm::FoldingSetNodeID& hash) const;
};		};

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// BugTypes (collections of related reports).		// BugTypes (collections of related reports).
▲ Show 20 Lines • Show All 297 Lines • Show Last 20 Lines

cfe/trunk/lib/StaticAnalyzer/Core/BugReporterVisitors.cpp

Show All 16 Lines
#include "clang/AST/DeclBase.h"		#include "clang/AST/DeclBase.h"
#include "clang/AST/DeclCXX.h"		#include "clang/AST/DeclCXX.h"
#include "clang/AST/Expr.h"		#include "clang/AST/Expr.h"
#include "clang/AST/ExprCXX.h"		#include "clang/AST/ExprCXX.h"
#include "clang/AST/ExprObjC.h"		#include "clang/AST/ExprObjC.h"
#include "clang/AST/Stmt.h"		#include "clang/AST/Stmt.h"
#include "clang/AST/Type.h"		#include "clang/AST/Type.h"
#include "clang/ASTMatchers/ASTMatchFinder.h"		#include "clang/ASTMatchers/ASTMatchFinder.h"
		#include "clang/Analysis/Analyses/Dominators.h"
#include "clang/Analysis/AnalysisDeclContext.h"		#include "clang/Analysis/AnalysisDeclContext.h"
#include "clang/Analysis/CFG.h"		#include "clang/Analysis/CFG.h"
#include "clang/Analysis/CFGStmtMap.h"		#include "clang/Analysis/CFGStmtMap.h"
#include "clang/Analysis/ProgramPoint.h"		#include "clang/Analysis/ProgramPoint.h"
#include "clang/Basic/IdentifierTable.h"		#include "clang/Basic/IdentifierTable.h"
#include "clang/Basic/LLVM.h"		#include "clang/Basic/LLVM.h"
#include "clang/Basic/SourceLocation.h"		#include "clang/Basic/SourceLocation.h"
#include "clang/Basic/SourceManager.h"		#include "clang/Basic/SourceManager.h"
▲ Show 20 Lines • Show All 1,581 Lines • ▼ Show 20 Lines	if (TerminatorLoc.isMacroID()) {
}		}
return nullptr;		return nullptr;
}		}
}		}
return nullptr;		return nullptr;
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
		// TrackControlDependencyCondBRVisitor.
		//===----------------------------------------------------------------------===//

		namespace {
		/// Tracks the expressions that are a control dependency of the node that was
		/// supplied to the constructor.
		/// For example:
		///
		/// cond = 1;
		/// if (cond)
		/// 10 / 0;
		///
		/// An error is emitted at line 3. This visitor realizes that the branch
		/// on line 2 is a control dependency of line 3, and tracks it's condition via
		/// trackExpressionValue().
		class TrackControlDependencyCondBRVisitor final : public BugReporterVisitor {
		const ExplodedNode *Origin;
		ControlDependencyCalculator ControlDeps;
		llvm::SmallSet<const CFGBlock *, 32> VisitedBlocks;

		public:
		TrackControlDependencyCondBRVisitor(const ExplodedNode *O)
		: Origin(O), ControlDeps(&O->getCFG()) {}

		void Profile(llvm::FoldingSetNodeID &ID) const override {
		static int x = 0;
		ID.AddPointer(&x);
		}

		std::shared_ptr<PathDiagnosticPiece> VisitNode(const ExplodedNode *N,
		BugReporterContext &BRC,
		BugReport &BR) override;
		};
		} // end of anonymous namespace

		static CFGBlock GetRelevantBlock(const ExplodedNode Node) {
		if (auto SP = Node->getLocationAs<StmtPoint>()) {
		const Stmt *S = SP->getStmt();
		assert(S);

		return const_cast<CFGBlock *>(Node->getLocationContext()
		->getAnalysisDeclContext()->getCFGStmtMap()->getBlock(S));
		}

		return nullptr;
		}

		std::shared_ptr<PathDiagnosticPiece>
		TrackControlDependencyCondBRVisitor::VisitNode(const ExplodedNode *N,
		BugReporterContext &BRC,
		BugReport &BR) {
		// We can only reason about control dependencies within the same stack frame.
		if (Origin->getStackFrame() != N->getStackFrame())
		return nullptr;

		CFGBlock *NB = GetRelevantBlock(N);

		// Skip if we already inspected this block.
		if (!VisitedBlocks.insert(NB).second)
		return nullptr;

		CFGBlock *OriginB = GetRelevantBlock(Origin);

		// TODO: Cache CFGBlocks for each ExplodedNode.
		if (!OriginB \|\| !NB)
		return nullptr;

		if (ControlDeps.isControlDependent(OriginB, NB)) {
		if (const Expr *Condition = NB->getLastCondition()) {
		// Keeping track of the already tracked conditions on a visitor level
		// isn't sufficient, because a new visitor is created for each tracked
		// expression, hence the BugReport level set.
		if (BR.addTrackedCondition(N)) {
		bugreporter::trackExpressionValue(
		N, Condition, BR, /EnableNullFPSuppression=/false);
		}
		}
		}

		return nullptr;
		}

		//===----------------------------------------------------------------------===//
// Implementation of trackExpressionValue.		// Implementation of trackExpressionValue.
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

static const MemRegion getLocationRegionIfReference(const Expr E,		static const MemRegion getLocationRegionIfReference(const Expr E,
const ExplodedNode *N) {		const ExplodedNode *N) {
if (const auto *DR = dyn_cast<DeclRefExpr>(E)) {		if (const auto *DR = dyn_cast<DeclRefExpr>(E)) {
if (const auto *VD = dyn_cast<VarDecl>(DR->getDecl())) {		if (const auto *VD = dyn_cast<VarDecl>(DR->getDecl())) {
if (!VD->getType()->isReferenceType())		if (!VD->getType()->isReferenceType())
▲ Show 20 Lines • Show All 99 Lines • ▼ Show 20 Lines	bool bugreporter::trackExpressionValue(const ExplodedNode *InputNode,

const Expr *Inner = peelOffOuterExpr(E, InputNode);		const Expr *Inner = peelOffOuterExpr(E, InputNode);
const ExplodedNode *LVNode = findNodeForExpression(InputNode, Inner);		const ExplodedNode *LVNode = findNodeForExpression(InputNode, Inner);
if (!LVNode)		if (!LVNode)
return false;		return false;

ProgramStateRef LVState = LVNode->getState();		ProgramStateRef LVState = LVNode->getState();

		// We only track expressions if we believe that they are important. Chances
		// are good that control dependencies to the tracking point are also improtant
		// because of this, let's explain why we believe control reached this point.
		// TODO: Shouldn't we track control dependencies of every bug location, rather
		// than only tracked expressions?
		if (LVState->getAnalysisManager().getAnalyzerOptions().ShouldTrackConditions)
		report.addVisitor(llvm::make_unique<TrackControlDependencyCondBRVisitor>(
		InputNode));

// The message send could be nil due to the receiver being nil.		// The message send could be nil due to the receiver being nil.
// At this point in the path, the receiver should be live since we are at the		// At this point in the path, the receiver should be live since we are at the
// message send expr. If it is nil, start tracking it.		// message send expr. If it is nil, start tracking it.
if (const Expr *Receiver = NilReceiverBRVisitor::getNilReceiver(Inner, LVNode))		if (const Expr *Receiver = NilReceiverBRVisitor::getNilReceiver(Inner, LVNode))
trackExpressionValue(LVNode, Receiver, report, EnableNullFPSuppression);		trackExpressionValue(LVNode, Receiver, report, EnableNullFPSuppression);

// Track the index if this is an array subscript.		// Track the index if this is an array subscript.
if (const auto *Arr = dyn_cast<ArraySubscriptExpr>(Inner))		if (const auto *Arr = dyn_cast<ArraySubscriptExpr>(Inner))
▲ Show 20 Lines • Show All 946 Lines • Show Last 20 Lines

cfe/trunk/test/Analysis/analyzer-config.c

	Show First 20 Lines • Show All 78 Lines • ▼ Show 20 Lines
	// CHECK-NEXT: prune-paths = true			// CHECK-NEXT: prune-paths = true
	// CHECK-NEXT: region-store-small-struct-limit = 2			// CHECK-NEXT: region-store-small-struct-limit = 2
	// CHECK-NEXT: report-in-main-source-file = false			// CHECK-NEXT: report-in-main-source-file = false
	// CHECK-NEXT: serialize-stats = false			// CHECK-NEXT: serialize-stats = false
	// CHECK-NEXT: stable-report-filename = false			// CHECK-NEXT: stable-report-filename = false
	// CHECK-NEXT: suppress-c++-stdlib = true			// CHECK-NEXT: suppress-c++-stdlib = true
	// CHECK-NEXT: suppress-inlined-defensive-checks = true			// CHECK-NEXT: suppress-inlined-defensive-checks = true
	// CHECK-NEXT: suppress-null-return-paths = true			// CHECK-NEXT: suppress-null-return-paths = true
				// CHECK-NEXT: track-conditions = false
	// CHECK-NEXT: unix.DynamicMemoryModeling:Optimistic = false			// CHECK-NEXT: unix.DynamicMemoryModeling:Optimistic = false
	// CHECK-NEXT: unroll-loops = false			// CHECK-NEXT: unroll-loops = false
	// CHECK-NEXT: widen-loops = false			// CHECK-NEXT: widen-loops = false
	// CHECK-NEXT: [stats]			// CHECK-NEXT: [stats]
	// CHECK-NEXT: num-entries = 85			// CHECK-NEXT: num-entries = 86

cfe/trunk/test/Analysis/track-control-dependency-conditions.cpp

				// RUN: %clang_analyze_cc1 %s \
				// RUN: -verify=expected,tracking \
				// RUN: -analyzer-config track-conditions=true \
				// RUN: -analyzer-output=text \
				// RUN: -analyzer-checker=core
				//
				// RUN: %clang_analyze_cc1 %s -verify \
				// RUN: -analyzer-output=text \
				// RUN: -analyzer-checker=core

				namespace example_1 {
				int flag;
				bool coin();

				void foo() {
				flag = coin(); // tracking-note{{Value assigned to 'flag'}}
				}

				void test() {
				int *x = 0; // expected-note{{'x' initialized to a null pointer value}}
				flag = 1;

				foo(); // TODO: Add nodes here about flag's value being invalidated.
				if (flag) // expected-note {{Assuming 'flag' is 0}}
				// expected-note@-1{{Taking false branch}}
				x = new int;

				foo(); // tracking-note{{Calling 'foo'}}
				// tracking-note@-1{{Returning from 'foo'}}

				if (flag) // expected-note {{Assuming 'flag' is not equal to 0}}
				// expected-note@-1{{Taking true branch}}
				*x = 5; // expected-warning{{Dereference of null pointer}}
				// expected-note@-1{{Dereference of null pointer}}
				}
				} // end of namespace example_1

				namespace example_2 {
				int flag;
				bool coin();

				void foo() {
				flag = coin(); // tracking-note{{Value assigned to 'flag'}}
				}

				void test() {
				int *x = 0;
				flag = 1;

				foo();
				if (flag) // expected-note {{Assuming 'flag' is 0}}
				// expected-note@-1{{Taking false branch}}
				x = new int;

				x = 0; // expected-note{{Null pointer value stored to 'x'}}

				foo(); // tracking-note{{Calling 'foo'}}
				// tracking-note@-1{{Returning from 'foo'}}

				if (flag) // expected-note {{Assuming 'flag' is not equal to 0}}
				// expected-note@-1{{Taking true branch}}
				*x = 5; // expected-warning{{Dereference of null pointer}}
				// expected-note@-1{{Dereference of null pointer}}
				}
				} // end of namespace example_2

				namespace global_variable_invalidation {
				int flag;
				bool coin();

				void foo() {
				// coin() could write bar, do it's invalidated.
				flag = coin(); // tracking-note{{Value assigned to 'flag'}}
				// tracking-note@-1{{Value assigned to 'bar'}}
				}

				int bar;

				void test() {
				int *x = 0; // expected-note{{'x' initialized to a null pointer value}}
				flag = 1;

				foo(); // tracking-note{{Calling 'foo'}}
				// tracking-note@-1{{Returning from 'foo'}}

				if (bar) // expected-note {{Assuming 'bar' is not equal to 0}}
				// expected-note@-1{{Taking true branch}}
				if (flag) // expected-note {{Assuming 'flag' is not equal to 0}}
				// expected-note@-1{{Taking true branch}}
				*x = 5; // expected-warning{{Dereference of null pointer}}
				// expected-note@-1{{Dereference of null pointer}}
				}
				} // end of namespace global_variable_invalidation

				namespace variable_declaration_in_condition {
				bool coin();

				bool foo() {
				return coin(); // tracking-note{{Returning value}}
				}

				int bar;

				void test() {
				int *x = 0; // expected-note{{'x' initialized to a null pointer value}}

				if (int flag = foo()) // tracking-note{{Calling 'foo'}}
				// tracking-note@-1{{Returning from 'foo'}}
				// tracking-note@-2{{'flag' initialized here}}

				// expected-note@-4{{Assuming 'flag' is not equal to 0}}
				// expected-note@-5{{Taking true branch}}

				*x = 5; // expected-warning{{Dereference of null pointer}}
				// expected-note@-1{{Dereference of null pointer}}
				}
				} // end of namespace variable_declaration_in_condition

				namespace conversion_to_bool {
				bool coin();

				struct ConvertsToBool {
				operator bool() const { return coin(); } // tracking-note{{Returning value}}
				};

				void test() {
				int *x = 0; // expected-note{{'x' initialized to a null pointer value}}

				if (ConvertsToBool())
				// tracking-note@-1 {{Calling 'ConvertsToBool::operator bool'}}
				// tracking-note@-2{{Returning from 'ConvertsToBool::operator bool'}}

				// expected-note@-4{{Assuming the condition is true}}
				// expected-note@-5{{Taking true branch}}
				*x = 5; // expected-warning{{Dereference of null pointer}}
				// expected-note@-1{{Dereference of null pointer}}
				}

				} // end of namespace variable_declaration_in_condition

				namespace unimportant_returning_value_note {
				bool coin();

				bool flipCoin() { return coin(); } // tracking-note{{Returning value}}

				void i(int *ptr) {
				if (ptr) // expected-note{{Assuming 'ptr' is null}}
				// expected-note@-1{{Taking false branch}}
				;
				if (!flipCoin())
				// tracking-note@-1{{Calling 'flipCoin'}}
				// tracking-note@-2{{Returning from 'flipCoin'}}
				// expected-note@-3{{Assuming the condition is true}}
				// expected-note@-4{{Taking true branch}}
				*ptr = 5; // expected-warning{{Dereference of null pointer}}
				// expected-note@-1{{Dereference of null pointer}}
				}
				} // end of namespace unimportant_returning_value_note

				namespace important_returning_value_note {
				bool coin();

				bool flipCoin() {
				if (coin()) // tracking-note{{Assuming the condition is false}}
				// tracking-note@-1{{Taking false branch}}
				return true;
				return coin(); // tracking-note{{Returning value}}
				}

				void i(int *ptr) {
				if (ptr) // expected-note{{Assuming 'ptr' is null}}
				// expected-note@-1{{Taking false branch}}
				;
				if (!flipCoin())
				// tracking-note@-1{{Calling 'flipCoin'}}
				// tracking-note@-2{{Returning from 'flipCoin'}}
				// expected-note@-3{{Assuming the condition is true}}
				// expected-note@-4{{Taking true branch}}
				*ptr = 5; // expected-warning{{Dereference of null pointer}}
				// expected-note@-1{{Dereference of null pointer}}
				}
				} // end of namespace important_returning_value_note

				namespace tracked_condition_is_only_initialized {
				int getInt();

				void f() {
				int flag = getInt(); // tracking-note{{'flag' initialized here}}
				int *x = 0; // expected-note{{'x' initialized to a null pointer value}}
				if (flag) // expected-note{{Assuming 'flag' is not equal to 0}}
				// expected-note@-1{{Taking true branch}}
				*x = 5; // expected-warning{{Dereference of null pointer}}
				// expected-note@-1{{Dereference of null pointer}}
				}
				} // end of namespace tracked_condition_is_only_initialized

				namespace tracked_condition_written_in_same_stackframe {
				int flag;
				int getInt();

				void f(int y) {
				y = 1; // tracking-note{{The value 1 is assigned to 'y'}}
				flag = y; // tracking-note{{The value 1 is assigned to 'flag'}}

				int *x = 0; // expected-note{{'x' initialized to a null pointer value}}
				if (flag) // expected-note{{'flag' is 1}}
				// expected-note@-1{{Taking true branch}}
				*x = 5; // expected-warning{{Dereference of null pointer}}
				// expected-note@-1{{Dereference of null pointer}}
				}
				} // end of namespace tracked_condition_written_in_same_stackframe

				namespace tracked_condition_written_in_nested_stackframe {
				int flag;
				int getInt();

				void foo() {
				int y;
				y = 1; // tracking-note{{The value 1 is assigned to 'y'}}
				flag = y; // tracking-note{{The value 1 is assigned to 'flag'}}

				}

				void f(int y) {
				int *x = 0; // expected-note{{'x' initialized to a null pointer value}}

				foo(); // tracking-note{{Calling 'foo'}}
				// tracking-note@-1{{Returning from 'foo'}}

				if (flag) // expected-note{{'flag' is 1}}
				// expected-note@-1{{Taking true branch}}
				*x = 5; // expected-warning{{Dereference of null pointer}}
				// expected-note@-1{{Dereference of null pointer}}
				}
				} // end of namespace tracked_condition_written_in_nested_stackframe

				namespace collapse_point_not_in_condition {

				[[noreturn]] void halt();

				void assert(int b) {
				if (!b) // tracking-note{{Assuming 'b' is not equal to 0}}
				// tracking-note@-1{{Taking false branch}}
				halt();
				}

				void f(int flag) {
				int *x = 0; // expected-note{{'x' initialized to a null pointer value}}

				assert(flag); // tracking-note{{Calling 'assert'}}
				// tracking-note@-1{{Returning from 'assert'}}

				if (flag) // expected-note{{'flag' is not equal to 0}}
				// expected-note@-1{{Taking true branch}}
				*x = 5; // expected-warning{{Dereference of null pointer}}
				// expected-note@-1{{Dereference of null pointer}}
				}

				} // end of namespace collapse_point_not_in_condition

				namespace unimportant_write_before_collapse_point {

				[[noreturn]] void halt();

				void assert(int b) {
				if (!b) // tracking-note{{Assuming 'b' is not equal to 0}}
				// tracking-note@-1{{Taking false branch}}
				halt();
				}
				int getInt();

				void f(int flag) {
				int *x = 0; // expected-note{{'x' initialized to a null pointer value}}

				flag = getInt(); // tracking-note{{Value assigned to 'flag'}}
				assert(flag); // tracking-note{{Calling 'assert'}}
				// tracking-note@-1{{Returning from 'assert'}}

				if (flag) // expected-note{{'flag' is not equal to 0}}
				// expected-note@-1{{Taking true branch}}
				*x = 5; // expected-warning{{Dereference of null pointer}}
				// expected-note@-1{{Dereference of null pointer}}
				}

				} // end of namespace unimportant_write_before_collapse_point

This is an archive of the discontinued LLVM Phabricator instance.

[analyzer] Track terminator conditions on which a tracked expressions dependsClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 208167

cfe/trunk/include/clang/StaticAnalyzer/Core/AnalyzerOptions.def

cfe/trunk/include/clang/StaticAnalyzer/Core/BugReporter/BugReporter.h

cfe/trunk/lib/StaticAnalyzer/Core/BugReporterVisitors.cpp

cfe/trunk/test/Analysis/analyzer-config.c

cfe/trunk/test/Analysis/track-control-dependency-conditions.cpp

[analyzer] Track terminator conditions on which a tracked expressions depends
ClosedPublic