This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/
-
include/clang/StaticAnalyzer/Core/PathSensitive/
-
clang/
-
StaticAnalyzer/
-
Core/
-
PathSensitive/
-
RangedConstraintManager.h
-
lib/StaticAnalyzer/Core/
-
StaticAnalyzer/
-
Core/
22/22
RangeConstraintManager.cpp
-
test/Analysis/
-
Analysis/
5/5
find-binop-constraints.cpp

Differential D103314

[Analyzer][solver] Simplify existing constraints when a new constraint is added
ClosedPublic

Authored by martong on May 28 2021, 6:26 AM.

Download Raw Diff

Details

Reviewers

vsavchenko
NoQ
steakhal
Szelethus

Commits

rG8ddbb442b6e8: [Analyzer][solver] Simplify existing eq classes and constraints when a new…

Summary

Update setConstraint to simplify existing constraints (and adding the
simplified constraint) when a new constraint is added. In this patch we just
simply iterate over all existing constraints and try to simplfy them with
simplifySVal. This solves the simplest problematic cases where we have two
symbols in the tree, e.g.:

int test_rhs_further_constrained(int x, int y) {
  if (x + y != 0)
    return 0;
  if (y != 0)
    return 0;
  clang_analyzer_eval(x + y == 0); // expected-warning{{TRUE}}
  clang_analyzer_eval(y == 0);     // expected-warning{{TRUE}}
  return 0;
}

This patch is the first step of a sequence of patches, and not intended to be
commited as a standalone change. The sequence of patches (and the plan) is
described here: https://reviews.llvm.org/D102696#2784624

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

martong created this revision.May 28 2021, 6:26 AM

Herald added a reviewer: Szelethus. · View Herald TranscriptMay 28 2021, 6:26 AM

Herald added subscribers: ASDenysPetrov, gamesh411, dkrupp and 9 others. · View Herald Transcript

martong requested review of this revision.May 28 2021, 6:26 AM

Herald added a project: Restricted Project. · View Herald TranscriptMay 28 2021, 6:26 AM

Herald added a subscriber: cfe-commits. · View Herald Transcript

martong added a child revision: D103317: [Analyzer][Core] Make SValBuilder to better simplify svals with 3 symbols in the tree.May 28 2021, 6:40 AM

martong mentioned this in D102696: [Analyzer] Find constraints that are directly attached to a BinOp.May 28 2021, 6:43 AM

Hey, great job! This is really something that we need, but it's implemented not entirely correctly.
I tried to cover it in the inline comment.

clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp
1586–1602	I tried to cover it in the comment to another patch. This solution includes a lot of extra work and it will lose equality/disequality information for simplified expressions, and I think it's safe to say that if `a == b` then `simplify(a) == b`. Let's start with `getConstraintMap`. It is a completely artificial data structure (and function) that exists for Z3 refutation. It's not what we keep in the state and it has a lot of duplicated constraints. If we have an equivalence class `{a, b, c, d, e, f}`, we store only one constraint for all of them (thus when we update the class, or one of the members receives a new constraint, we can update all of them). `getConstraintMap` returns a map where `a`, `b`, `c`, `d`, `e`, and `f` are mapped to the same constraint. It's not super bad, but it's extra work constructing this map and then processing it. Another, and more important aspect is that when you `setConstraint`, you lose information that this symbol is equal/disequal to other symbols. One example here would be a situation where `x + y == z`, and we find out that `y == 0`, we should update equivalence class `{x + y, z}` to be a class `{x, z}`. In order to do this, you need to update two maps: `ClassMap` (it's mapping `x + y` to `{x + y, z}`) and `ClassMembers` (it's mapping `{x + y, z}` to `x + y` and `z`). Similar example can be made with `x + y != z`, but updating `ClassMap` and `ClassMembers` will fix it. And you don't even need to touch the actual mapping with the actual constraints.

This revision now requires changes to proceed.May 28 2021, 6:53 AM

Thanks Valeriy for the quick review and guidance! I am planning to do the changes and continue next week :)

vsavchenko added inline comments.May 28 2021, 7:31 AM

clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp
1585	Also I think we can introduce a simple, but efficient optimization of kicking off the simplification process only when `Constraint` is a constant.

Harbormaster completed remote builds in B106698: Diff 348508.May 28 2021, 7:45 AM

martong marked an inline comment as done.May 31 2021, 8:26 AM

martong added inline comments.

clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp
1585	Yes, good point.
1586–1602	Absolutely, great findings! I think the most straightforward and consistent implementation of updating `ClassMap` and `ClassMembers` is to directly use the `merge` method. I.e. we can merge the simplified symbol (as a trivial eq class) to the existing equivalence class. Using `merge`, however, would not remove the non-simplified original symbol. But this might not be a problem; rather it is a necessity (as the child patch demonstrates) it might be very useful if we can find the symbol (without simplification, i.e. as written) in the `ConstraintRange` map. Do you see any drawbacks of reusing `merge` here?

Merge the simplified symbol to the old class

That's awesome, just a few stylistic tweaks and tests and we are ready to land!

clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp
1586–1587	I think we need a comment why we care about this early exit.
1586–1602	Oh, that's actually even better! If we consider the following example. Let `a + b == c` and `a == d` be known and `b == 0` to be a new constraint. Then your approach will help us to figure out that `c == d`. So, you found a great way! I think that we should still add the test cases I briefly described in my previous comment and that one from above.
1589	Here I also think that we need to give more context to the readers, so they understand what simplification you are talking about here.
1591–1594	You don't actually use constraints here, so (let me write it in python) instead of: [update(classMap[class]) for class, constraint in constraints.items()] you can use [update(members) for class, members in classMap.items()]
1600	It would be great if we provide some justification why we do merge here.

vsavchenko added inline comments.May 31 2021, 9:08 AM

clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp
1601–1602	Uh-oh, almost let yet another null-state bug to happen! During this iteration, `State` can become null, so we need to check for it.

Harbormaster completed remote builds in B106922: Diff 348808.May 31 2021, 9:16 AM

I had another thought, merge is usually called in situations when we found out that two symbols should be marked equal (and checked that it's possible to begin with), which is not true in your case.

If we update my case from before, we can get: a + b == c and a != c as given, and b == 0 as a new constraint. In this situation, you will merge classes {a + b, c} and {a}, which contradicts our existing disequality information.

In D103314#2789754, @vsavchenko wrote:

I had another thought, merge is usually called in situations when we found out that two symbols should be marked equal (and checked that it's possible to begin with), which is not true in your case.

If we update my case from before, we can get: a + b == c and a != c as given, and b == 0 as a new constraint. In this situation, you will merge classes {a + b, c} and {a}, which contradicts our existing disequality information.

Yes, we must check the disequivalence classes to discover such a contradiction, I updated the code to do so. Also, added a test case for the contradiction handling.

clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp
1591–1594	Actually, trivial equivalence classes (those that have only one symbol member) are not stored in the State. Thus, we must skim through the constraints as well in order to be able to simplify symbols in the constraints. In short, we have to iterate both collections.
1601–1602	Good catch!

Simplify equivalence classes when iterate over ClassMap, simplify constraints by iterating over the ConstraintsMap

I was wondering if there is a direct way to check the equivalence classes?
I am thinking about to add a clang_annalyzer_dump_equivalence_classes function to the ExprInspection checker.

Awesome!
I know, I said that we are ready to land, but I think I was too excited about this change. We probably should have some data on how it performs on real-life codebases.

clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp
1580	Maybe it should be a `simplify` method of the class itself?
1589–1594	I think we can add a method `isDisequalTo` or just use `areEqual` in a this way: are equal? [Yes] -> nothing to do here [No] -> return nullptr [Don't know] -> merge
1591–1594	Ah, I see. Then I would say that your previous solution is more readable (if we keep `simplify`, of course).

Harbormaster completed remote builds in B107020: Diff 348945.Jun 1 2021, 6:44 AM

martong marked 3 inline comments as done.Jun 2 2021, 7:06 AM

martong added inline comments.

clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp
1580	Yeah, makes sense.
1589–1594	Good point, I've added a new overload to the static `areEqual` and added a method `isEqualTo` that uses `areEqual`.
1591–1594	My previous solution might be more readable, though, that's not working. Actually, I think I failed to explain properly why do we have to iterate both collections. We have to iterate the ConstraintMap because trivial constraints are not stored in the State but we want to simplify symbols in the constraints. So, if we were to iterate over only the ClassMap then the simplest test-case would fail: int test_rhs_further_constrained(int x, int y) { if (x + y != 0) return 0; if (y != 0) return 0; clang_analyzer_eval(x + y == 0); // expected-warning{{TRUE}} clang_analyzer_eval(y == 0); // expected-warning{{TRUE}} FAIL return 0; } We have to iterate the ClassMap in order to update all equivalence classes that we store in the State. Consider the example you brought up before: void test_equivalence_classes_are_updated(int a, int b, int c, int d) { if (a + b != c) return; if (a != d) return; if (b != 0) return; // Keep the symbols and the constraints! alive. (void)(a * b * c * d); clang_analyzer_eval(c == d); // expected-warning{{TRUE}} return; } Before we start to simulate `b==0`, we have only these equivalence classes in the State: E1{`a+b`, `c`} and E2{`a`, `d`}. And we have these constraints: SymExpr(`a+b==c`) -> out-of [0, 0], SymExpr(`a==d`) -> out-of [0, 0]. Now, when we evaluate `b==0`in setConstraint when iterating the ConstraintMap then SymExpr(`a+b==c`) becomes SymExpr(`a==c`). But the equality classes are not updated. And we can update them if we scan through the ClassMap. Another alternative solution could be to re-trigger the `track` mechanism when we iterate over the ConstraintMap, but `track` seemed to be an exclusive interface towards the higher abstraction RangedConstraintManager. On the other hand, reusing the `track` mechanism could result better performance than doing another iteration on the ClassMap. Do you think it would be a better approach? And how could we reuse the `track` mechanism without getting confused with the `Adjustment` stuff?

Add isEqualTo and simplify members to EquivalenceClass

Harbormaster completed remote builds in B107241: Diff 349261.Jun 2 2021, 8:04 AM

I am terribly sorry, but I uploaded an unfinished Diff previously, please disregard that. So these are the changes:

Add isEqualTo and simplify members to EquivalenceClass

Harbormaster completed remote builds in B107304: Diff 349352.Jun 2 2021, 1:42 PM

In D103314#2790868, @vsavchenko wrote:

Awesome!
I know, I said that we are ready to land, but I think I was too excited about this change. We probably should have some data on how it performs on real-life codebases.

Just some quick update on the status of this patch. I've done some measurements on smaller open source C projects (e.g tmux) and didn't see any noticeable slow-down. However, I've run into a bad-bad assertion failure in my favorite Checker (StdLibraryFu...). The assertion indicates that neither !State nor State is feasible, so this throws me back to the debugger for a while.

Simplify the symbol before eq tracking as well

In D103314#2798968, @martong wrote:

In D103314#2790868, @vsavchenko wrote:

Awesome!
I know, I said that we are ready to land, but I think I was too excited about this change. We probably should have some data on how it performs on real-life codebases.

Just some quick update on the status of this patch. I've done some measurements on smaller open source C projects (e.g tmux) and didn't see any noticeable slow-down. However, I've run into a bad-bad assertion failure in my favorite Checker (StdLibraryFu...). The assertion indicates that neither !State nor State is feasible, so this throws me back to the debugger for a while.

Finally, I could boil down the infeasible parent state problem and added a test case test_deferred_contradiction to catch that. The solution is surprisingly simple: just try to simplify the symbolic expression of an equivalency before we start to update the State with the equivalency info.

Harbormaster completed remote builds in B108417: Diff 350898.Jun 9 2021, 8:43 AM

OK, we definitely need to know about performance.
Plus, I'm still curious about the crash. I didn't get how simplification helped/caused that crash.

I have one thought here. If the lack of simplification indeed caused the crash, we are in trouble with this patch. IMO simplification in just one place should make it better, but shouldn't produce infeasible states for us. In other words, any number simplifications is a conservative operation that makes our lives a bit better. The moment they become a requirement (i.e. simplifications call for more simplifications or we crash) this solution from this patch has to become much harder. This is because whenever we do merge, we essentially can create another situation when we find out that some symbolic expression is a constant. Let's say that we are merging classes A and B which have constraints [INT_MIN, 42] and [42, INT_MAX]. After the merge, we are positive that all the members of this new class are equal to 42. And if so, we can further simplify classes and their members. This algorithm turns into a fixed point algorithm, which has a good chance to sabotage our performance.

This being said, can we re-iterate on that crash and the proposed fix in much more detail?

clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp
1787	very opinionated nit: can you please add extra new line after this?
1975–1985	Now, since you put this logic into `merge`, you can just merge.
clang/test/Analysis/find-binop-constraints.cpp
151	It's not really connected to your patch, but this confuses me! Why does the analyzer think that `b0` is guaranteed to be 2 after this statement. Even if we eagerly assume here, shouldn't it mean that there are still two paths `b0 == 2` and `b0 != 2`?
156–159	Hmm, I don't see how simplification helped here. After the previous `if` statement, we should have had two equivalence classes known to be disequal: `reg_$2<int b1>` and `(reg_$0<int e0>) - (reg_$1<int b0>)`. Further, we directly compare these two symbols. We can figure it out without any simplifications. Am I missing something here?

I have one thought here. If the lack of simplification indeed caused the crash, we are in trouble with this patch. IMO simplification in just one place should make it better, but shouldn't produce infeasible states for us. In other words, any number simplifications is a conservative operation that makes our lives a bit better. The moment they become a requirement (i.e. simplifications call for more simplifications or we crash) this solution from this patch has to become much harder. This is because whenever we do merge, we essentially can create another situation when we find out that some symbolic expression is a constant. Let's say that we are merging classes A and B which have constraints [INT_MIN, 42] and [42, INT_MAX]. After the merge, we are positive that all the members of this new class are equal to 42. And if so, we can further simplify classes and their members. This algorithm turns into a fixed point algorithm, which has a good chance to sabotage our performance.

Yes, good point(s). I am trying to avoid turning into a fixed point algorithm by directly iterating over the equivalence classes instead of reusing the existing track mechanism. On the other hand, perhaps with some budge the fixpoint algo would be worth to experiment with.

clang/test/Analysis/find-binop-constraints.cpp
151	Don't be puzzled by this. This indeed bifurcates. The interesting path is where `b0 == 2` is true. I am going to update this line with `if (b0 ==2) {` to achieve a similar effect. (I was using creduce and tried to simplify even more after that, but i missed this.)
156–159	When we evaluate `e2 > 0` then we will set `e1` as disequal to `b1`. However, at this point because of the eager constant folding `e1` is `e0 - 2` (on the path where `b0 == 2` is true). So, when we evaluate `b1 == e1` then this is the diseq info we have in the State (I used `dumpDisEq` from D103967): reg_$2<int b1> DisequalTo: (reg_$0<int e0>) - 2 (reg_$0<int e0>) - 2 DisequalTo: reg_$2<int b1> And indeed we ask directly whether the LHS (`reg_$2<int b1>`) is equal to RHS`(reg_$0<int e0>) - (reg_$1<int b0>)`. This is because the` DeclRefExpr` of `e1` is still bound to SVal which originates from the time before we constrained b0 to 2. With other words: the `Environment` is not changed by introducing a new constraint. BTW, this test fails even in llvm/main.

martong marked 2 inline comments as done.Jun 9 2021, 11:25 AM

martong added inline comments.

clang/test/Analysis/find-binop-constraints.cpp
156–159	With other words: the Environment is not changed by introducing a new constraint. This suggests that another approach could be to do change the `Environment` when we add a new constraint. I am not sure about the pros/cons atm, but might be worth to experiment. What do you think?

OK, we definitely need to know about performance.

Couldn't agree more. I am in the middle of a performance measurement that I do with csa-testbench (on memchached,tmux,curl,twin,redis,vim,openssl,sqlite,ffmpeg,postgresql,tinyxml2,libwebm,xerces,bitcoin,protobuf). Hopefully I can give you some results soon.

Plus, I'm still curious about the crash. I didn't get how simplification helped/caused that crash.

So, the crash was actually an assertion failure in StdLibraryFunctionsChecker, which came when I made a test analysis run on the twin project. The assertion was here:

if (FailureSt && !SuccessSt) {
  if (ExplodedNode *N = C.generateErrorNode(NewState))
    reportBug(Call, N, Constraint.get(), Summary, C);
  break;
} else {
  // We will apply the constraint even if we cannot reason about the
  // argument. This means both SuccessSt and FailureSt can be true. If we
  // weren't applying the constraint that would mean that symbolic
  // execution continues on a code whose behaviour is undefined.
  assert(SuccessSt);                   // <----------------------------------------------------------------- This fired !!!
  NewState = SuccessSt;
}

With multiple creduce iterations below is a minimal example with StdLibraryFunctionsChecker. That crashed when we applied the BufferSize constraint of fread.

typedef int FILE;
long b;
unsigned long fread(void *__restrict, unsigned long, unsigned long,
                    FILE *__restrict);
void foo();
void c(int *a, int e0) {

  int e1 = e0 - b;
  b == 2;
  foo();

  int e2 = e1 - b;
  if (e2 > 0 && b == e1) {
    (void)a; (void)e1; (void)c;
    fread(a, sizeof(char), e1, c);
  }
}

Turned out, the checker had the assertion because before applying the arg constraint and its negated counterpart, the state was already infeasible. (But the analyzer recognized this only when it added the new assumptions when checking the applicability of the arg constraint.)
Thus, I could remove fread and the Checker from the problem set and could create the test case that synthesizes the unfeasible state.

Remove isEqualTo

martong marked an inline comment as done.Jun 9 2021, 11:52 AM

martong added inline comments.

clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp
1787	Sure.
1975–1985	Wow, good catch.

Harbormaster completed remote builds in B108465: Diff 350966.Jun 9 2021, 12:46 PM

I have the first measurements results in the attached zip file. The file contains the html file generated by csa-testbench. It's name contains CTU but actually it was a regular non-CTU analysis. The most interesting is probably the run-times, where we can notice a small increase:

Other than that, the number of the warnings seems to be unchanged. The most notable change in the statistics is in the number of paths explored by the analyzer: in some cases (e.g. twin) it increased with 2-3 %.

CTU_20results_20on_20open_20projects_201.zip20 KBDownload

In D103314#2810795, @martong wrote:

I have the first measurements results in the attached zip file. The file contains the html file generated by csa-testbench. It's name contains CTU but actually it was a regular non-CTU analysis. The most interesting is probably the run-times, where we can notice a small increase:

Other than that, the number of the warnings seems to be unchanged. The most notable change in the statistics is in the number of paths explored by the analyzer: in some cases (e.g. twin) it increased with 2-3 %.
CTU_20results_20on_20open_20projects_201.zip20 KBDownload

This sounds amazing! Great job!

vsavchenko accepted this revision.Jun 13 2021, 2:26 AM

This revision is now accepted and ready to land.Jun 13 2021, 2:26 AM

This revision was landed with ongoing or failed builds.Jun 14 2021, 3:19 AM

Closed by commit rG8ddbb442b6e8: [Analyzer][solver] Simplify existing eq classes and constraints when a new… (authored by martong). · Explain Why

This revision was automatically updated to reflect the committed changes.

martong added a commit: rG8ddbb442b6e8: [Analyzer][solver] Simplify existing eq classes and constraints when a new….

This patch is the first step of a sequence of patches, and not intended to be commited as a standalone change.

Although I planned to commit this in a lock-step when subsequent patches are also accepted, it makes sense to commit now since it's an obvious improvement and the performance penalty remains below a reasonable limit.

Hi,

I'm seeing a failed assertion with this patch.
Reproduce with

clang --analyze bbi-57338.c

Result:

clang: /repo/uabelho/master-github/llvm/include/llvm/ADT/APSInt.h:148: bool llvm::APSInt::operator<(const llvm::APSInt &) const: Assertion `IsUnsigned == RHS.IsUnsigned && "Signedness mismatch!"' failed.

bbi-57338.c147 BDownload

In D103314#2829806, @uabelho wrote:
Hi,

I'm seeing a failed assertion with this patch.
Reproduce with
clang --analyze bbi-57338.c
Result:
clang: /repo/uabelho/master-github/llvm/include/llvm/ADT/APSInt.h:148: bool llvm::APSInt::operator<(const llvm::APSInt &) const: Assertion `IsUnsigned == RHS.IsUnsigned && "Signedness mismatch!"' failed.
bbi-57338.c147 BDownload

Good that we found it that early! Thanks Mikael!

In D103314#2829806, @uabelho wrote:
Hi,

I'm seeing a failed assertion with this patch.
Reproduce with
clang --analyze bbi-57338.c
Result:
clang: /repo/uabelho/master-github/llvm/include/llvm/ADT/APSInt.h:148: bool llvm::APSInt::operator<(const llvm::APSInt &) const: Assertion `IsUnsigned == RHS.IsUnsigned && "Signedness mismatch!"' failed.
bbi-57338.c147 BDownload

Thanks Mikael for the reproducer, I am going to debug tomorrow.

Hi,

Another failed assertion that started appearing with this patch:

clang --analyze bbi-57589.c

which results in:

clang: ../lib/Support/APInt.cpp:284: int llvm::APInt::compareSigned(const llvm::APInt &) const: Assertion `BitWidth == RHS.BitWidth && "Bit widths must be same for comparison"' failed.

bbi-57589.c198 BDownload

Maybe it's the same root problem, but please make sure you fix both.
Thanks!

In D103314#2837907, @uabelho wrote:
Hi,

Another failed assertion that started appearing with this patch:
clang --analyze bbi-57589.c
which results in:
clang: ../lib/Support/APInt.cpp:284: int llvm::APInt::compareSigned(const llvm::APInt &) const: Assertion `BitWidth == RHS.BitWidth && "Bit widths must be same for comparison"' failed.
bbi-57589.c198 BDownload

Maybe it's the same root problem, but please make sure you fix both.
Thanks!

Thanks again Mikael for the report. I could find the root cause and I have a solution that solves the assertions (both test cases are fixed). I am going to upload the fix soon.

In D103314#2838065, @martong wrote:
In D103314#2837907, @uabelho wrote:
Hi,

Another failed assertion that started appearing with this patch:
clang --analyze bbi-57589.c
which results in:
clang: ../lib/Support/APInt.cpp:284: int llvm::APInt::compareSigned(const llvm::APInt &) const: Assertion `BitWidth == RHS.BitWidth && "Bit widths must be same for comparison"' failed.
bbi-57589.c198 BDownload

Maybe it's the same root problem, but please make sure you fix both.
Thanks!
Thanks again Mikael for the report. I could find the root cause and I have a solution that solves the assertions (both test cases are fixed). I am going to upload the fix soon.

Great! Ping me when it's on review, I'll try to look into it ASAP!

In D103314#2838065, @martong wrote:
In D103314#2837907, @uabelho wrote:
Hi,

Another failed assertion that started appearing with this patch:
clang --analyze bbi-57589.c
which results in:
clang: ../lib/Support/APInt.cpp:284: int llvm::APInt::compareSigned(const llvm::APInt &) const: Assertion `BitWidth == RHS.BitWidth && "Bit widths must be same for comparison"' failed.
bbi-57589.c198 BDownload

Maybe it's the same root problem, but please make sure you fix both.
Thanks!
Thanks again Mikael for the report. I could find the root cause and I have a solution that solves the assertions (both test cases are fixed). I am going to upload the fix soon.

Here it is: https://reviews.llvm.org/D104844

martong mentioned this in D106823: [analyzer][solver] Iterate to a fixpoint during symbol simplification with constants.Jul 26 2021, 1:58 PM

I believe this commit exposed a new false-positive bug in [core.DivideZero]. I've filed the report here: https://bugs.llvm.org/show_bug.cgi?id=51940

I believe this exposed another odd issue where a true positive (enabled by this commit) disappears when unrelated code is not present. Bug filed as: https://bugs.llvm.org/show_bug.cgi?id=51950.

martong mentioned this in rG806329da0700: [analyzer][solver] Iterate to a fixpoint during symbol simplification with….Nov 12 2021, 2:58 AM

martong removed a child revision: D103317: [Analyzer][Core] Make SValBuilder to better simplify svals with 3 symbols in the tree.Nov 12 2021, 4:09 AM

Revision Contents

Path

Size

clang/

include/

clang/

StaticAnalyzer/

Core/

PathSensitive/

RangedConstraintManager.h

2 lines

lib/

StaticAnalyzer/

Core/

RangeConstraintManager.cpp

97 lines

test/

Analysis/

find-binop-constraints.cpp

163 lines

Diff 351809

clang/include/clang/StaticAnalyzer/Core/PathSensitive/RangedConstraintManager.h

Show First 20 Lines • Show All 250 Lines • ▼ Show 20 Lines	public:
static void Profile(llvm::FoldingSetNodeID &ID, const RangeSet &RS) {		static void Profile(llvm::FoldingSetNodeID &ID, const RangeSet &RS) {
ID.AddPointer(RS.Impl);		ID.AddPointer(RS.Impl);
}		}

/// Profile - Generates a hash profile of this RangeSet for use		/// Profile - Generates a hash profile of this RangeSet for use
/// by FoldingSet.		/// by FoldingSet.
void Profile(llvm::FoldingSetNodeID &ID) const { Profile(ID, *this); }		void Profile(llvm::FoldingSetNodeID &ID) const { Profile(ID, *this); }

/// getConcreteValue - If a symbol is contrained to equal a specific integer		/// getConcreteValue - If a symbol is constrained to equal a specific integer
/// constant then this method returns that value. Otherwise, it returns		/// constant then this method returns that value. Otherwise, it returns
/// NULL.		/// NULL.
const llvm::APSInt *getConcreteValue() const {		const llvm::APSInt *getConcreteValue() const {
return Impl->size() == 1 ? begin()->getConcreteValue() : nullptr;		return Impl->size() == 1 ? begin()->getConcreteValue() : nullptr;
}		}

/// Get the minimal value covered by the ranges in the set.		/// Get the minimal value covered by the ranges in the set.
///		///
▲ Show 20 Lines • Show All 125 Lines • Show Last 20 Lines

clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp

Show All 15 Lines
#include "clang/StaticAnalyzer/Core/PathSensitive/ProgramState.h"		#include "clang/StaticAnalyzer/Core/PathSensitive/ProgramState.h"
#include "clang/StaticAnalyzer/Core/PathSensitive/ProgramStateTrait.h"		#include "clang/StaticAnalyzer/Core/PathSensitive/ProgramStateTrait.h"
#include "clang/StaticAnalyzer/Core/PathSensitive/RangedConstraintManager.h"		#include "clang/StaticAnalyzer/Core/PathSensitive/RangedConstraintManager.h"
#include "clang/StaticAnalyzer/Core/PathSensitive/SValVisitor.h"		#include "clang/StaticAnalyzer/Core/PathSensitive/SValVisitor.h"
#include "llvm/ADT/FoldingSet.h"		#include "llvm/ADT/FoldingSet.h"
#include "llvm/ADT/ImmutableSet.h"		#include "llvm/ADT/ImmutableSet.h"
#include "llvm/ADT/STLExtras.h"		#include "llvm/ADT/STLExtras.h"
#include "llvm/ADT/StringExtras.h"		#include "llvm/ADT/StringExtras.h"
		#include "llvm/ADT/SmallSet.h"
#include "llvm/Support/Compiler.h"		#include "llvm/Support/Compiler.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"
#include <algorithm>		#include <algorithm>
#include <iterator>		#include <iterator>

using namespace clang;		using namespace clang;
using namespace ento;		using namespace ento;

▲ Show 20 Lines • Show All 545 Lines • ▼ Show 20 Lines	markDisequal(BasicValueFactory &BV, RangeSet::Factory &F,
ProgramStateRef State, EquivalenceClass Other) const;		ProgramStateRef State, EquivalenceClass Other) const;
LLVM_NODISCARD static inline ClassSet		LLVM_NODISCARD static inline ClassSet
getDisequalClasses(ProgramStateRef State, SymbolRef Sym);		getDisequalClasses(ProgramStateRef State, SymbolRef Sym);
LLVM_NODISCARD inline ClassSet		LLVM_NODISCARD inline ClassSet
getDisequalClasses(ProgramStateRef State) const;		getDisequalClasses(ProgramStateRef State) const;
LLVM_NODISCARD inline ClassSet		LLVM_NODISCARD inline ClassSet
getDisequalClasses(DisequalityMapTy Map, ClassSet::Factory &Factory) const;		getDisequalClasses(DisequalityMapTy Map, ClassSet::Factory &Factory) const;

		LLVM_NODISCARD static inline Optional<bool> areEqual(ProgramStateRef State,
		EquivalenceClass First,
		EquivalenceClass Second);
LLVM_NODISCARD static inline Optional<bool>		LLVM_NODISCARD static inline Optional<bool>
areEqual(ProgramStateRef State, SymbolRef First, SymbolRef Second);		areEqual(ProgramStateRef State, SymbolRef First, SymbolRef Second);

		/// Iterate over all symbols and try to simplify them.
		LLVM_NODISCARD ProgramStateRef simplify(SValBuilder &SVB,
		RangeSet::Factory &F,
		ProgramStateRef State);

/// Check equivalence data for consistency.		/// Check equivalence data for consistency.
LLVM_NODISCARD LLVM_ATTRIBUTE_UNUSED static bool		LLVM_NODISCARD LLVM_ATTRIBUTE_UNUSED static bool
isClassDataConsistent(ProgramStateRef State);		isClassDataConsistent(ProgramStateRef State);

LLVM_NODISCARD QualType getType() const {		LLVM_NODISCARD QualType getType() const {
return getRepresentativeSymbol()->getType();		return getRepresentativeSymbol()->getType();
}		}

▲ Show 20 Lines • Show All 774 Lines • ▼ Show 20 Lines	RangeSet SymbolicRangeInferrer::VisitBinaryOperator<BO_Rem>(Range LHS,
// for any sign of either LHS, or RHS.		// for any sign of either LHS, or RHS.
return {RangeFactory, ValueFactory.getValue(Min), ValueFactory.getValue(Max)};		return {RangeFactory, ValueFactory.getValue(Min), ValueFactory.getValue(Max)};
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Constraint manager implementation details		// Constraint manager implementation details
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

		static SymbolRef simplify(ProgramStateRef State, SymbolRef Sym) {
		SValBuilder &SVB = State->getStateManager().getSValBuilder();
		SVal SimplifiedVal = SVB.simplifySVal(State, SVB.makeSymbolVal(Sym));
		return SimplifiedVal.getAsSymbol();
		}

class RangeConstraintManager : public RangedConstraintManager {		class RangeConstraintManager : public RangedConstraintManager {
public:		public:
RangeConstraintManager(ExprEngine *EE, SValBuilder &SVB)		RangeConstraintManager(ExprEngine *EE, SValBuilder &SVB)
: RangedConstraintManager(EE, SVB), F(getBasicVals()) {}		: RangedConstraintManager(EE, SVB), F(getBasicVals()) {}

//===------------------------------------------------------------------===//		//===------------------------------------------------------------------===//
// Implementation for interface from ConstraintManager.		// Implementation for interface from ConstraintManager.
//===------------------------------------------------------------------===//		//===------------------------------------------------------------------===//
▲ Show 20 Lines • Show All 97 Lines • ▼ Show 20 Lines	private:
template <bool EQ>		template <bool EQ>
ProgramStateRef track(RangeSet NewConstraint, ProgramStateRef State,		ProgramStateRef track(RangeSet NewConstraint, ProgramStateRef State,
SymbolRef Sym, const llvm::APSInt &Int,		SymbolRef Sym, const llvm::APSInt &Int,
const llvm::APSInt &Adjustment) {		const llvm::APSInt &Adjustment) {
if (NewConstraint.isEmpty())		if (NewConstraint.isEmpty())
// This is an infeasible assumption.		// This is an infeasible assumption.
return nullptr;		return nullptr;

		if (SymbolRef SimplifiedSym = simplify(State, Sym))
		Sym = SimplifiedSym;

if (ProgramStateRef NewState = setConstraint(State, Sym, NewConstraint)) {		if (ProgramStateRef NewState = setConstraint(State, Sym, NewConstraint)) {
if (auto Equality = EqualityInfo::extract(Sym, Int, Adjustment)) {		if (auto Equality = EqualityInfo::extract(Sym, Int, Adjustment)) {
// If the original assumption is not Sym + Adjustment !=/</> Int,		// If the original assumption is not Sym + Adjustment !=/</> Int,
// we should invert IsEquality flag.		// we should invert IsEquality flag.
Equality->IsEquality = Equality->IsEquality != EQ;		Equality->IsEquality = Equality->IsEquality != EQ;
return track(NewState, *Equality);		return track(NewState, *Equality);
}		}

▲ Show 20 Lines • Show All 50 Lines • ▼ Show 20 Lines	if (const llvm::APSInt *Point = Constraint.getConcreteValue())
}		}

assert(areFeasible(Constraints) && "Constraint manager shouldn't produce "		assert(areFeasible(Constraints) && "Constraint manager shouldn't produce "
"a state with infeasible constraints");		"a state with infeasible constraints");

return State->set<ConstraintRange>(Constraints);		return State->set<ConstraintRange>(Constraints);
}		}

		// Associate a constraint to a symbolic expression. First, we set the
		// constraint in the State, then we try to simplify existing symbolic
		// expressions based on the newly set constraint.
LLVM_NODISCARD inline ProgramStateRef		LLVM_NODISCARD inline ProgramStateRef
setConstraint(ProgramStateRef State, SymbolRef Sym, RangeSet Constraint) {		setConstraint(ProgramStateRef State, SymbolRef Sym, RangeSet Constraint) {
return setConstraint(State, EquivalenceClass::find(State, Sym), Constraint);		assert(State);
		vsavchenkoUnsubmitted Done Reply Inline Actions Maybe it should be a `simplify` method of the class itself? vsavchenko: Maybe it should be a `simplify` method of the class itself?
		martongAuthorUnsubmitted Done Reply Inline Actions Yeah, makes sense. martong: Yeah, makes sense.

		State = setConstraint(State, EquivalenceClass::find(State, Sym), Constraint);
		if (!State)
		return nullptr;

		vsavchenkoUnsubmitted Done Reply Inline Actions Also I think we can introduce a simple, but efficient optimization of kicking off the simplification process only when `Constraint` is a constant. vsavchenko: Also I think we can introduce a simple, but efficient optimization of kicking off the…
		martongAuthorUnsubmitted Done Reply Inline Actions Yes, good point. martong: Yes, good point.
		// We have a chance to simplify existing symbolic values if the new
		// constraint is a constant.
		vsavchenkoUnsubmitted Done Reply Inline Actions I think we need a comment why we care about this early exit. vsavchenko: I think we need a comment why we care about this early exit.
		if (!Constraint.getConcreteValue())
		return State;
		vsavchenkoUnsubmitted Done Reply Inline Actions Here I also think that we need to give more context to the readers, so they understand what simplification you are talking about here. vsavchenko: Here I also think that we need to give more context to the readers, so they understand what…

		llvm::SmallSet<EquivalenceClass, 4> SimplifiedClasses;
		// Iterate over all equivalence classes and try to simplify them.
		ClassMembersTy Members = State->get<ClassMembers>();
		for (std::pair<EquivalenceClass, SymbolSet> ClassToSymbolSet : Members) {
		vsavchenkoUnsubmitted Done Reply Inline Actions You don't actually use constraints here, so (let me write it in python) instead of: [update(classMap[class]) for class, constraint in constraints.items()] you can use [update(members) for class, members in classMap.items()] vsavchenko: You don't actually use constraints here, so (let me write it in python) instead of: ``` [update…
		martongAuthorUnsubmitted Done Reply Inline Actions Actually, trivial equivalence classes (those that have only one symbol member) are not stored in the State. Thus, we must skim through the constraints as well in order to be able to simplify symbols in the constraints. In short, we have to iterate both collections. martong: Actually, trivial equivalence classes (those that have only one symbol member) are not stored…
		vsavchenkoUnsubmitted Done Reply Inline Actions Ah, I see. Then I would say that your previous solution is more readable (if we keep `simplify`, of course). vsavchenko: Ah, I see. Then I would say that your previous solution is more readable (if we keep…
		martongAuthorUnsubmitted Done Reply Inline Actions My previous solution might be more readable, though, that's not working. Actually, I think I failed to explain properly why do we have to iterate both collections. We have to iterate the ConstraintMap because trivial constraints are not stored in the State but we want to simplify symbols in the constraints. So, if we were to iterate over only the ClassMap then the simplest test-case would fail: int test_rhs_further_constrained(int x, int y) { if (x + y != 0) return 0; if (y != 0) return 0; clang_analyzer_eval(x + y == 0); // expected-warning{{TRUE}} clang_analyzer_eval(y == 0); // expected-warning{{TRUE}} FAIL return 0; } We have to iterate the ClassMap in order to update all equivalence classes that we store in the State. Consider the example you brought up before: void test_equivalence_classes_are_updated(int a, int b, int c, int d) { if (a + b != c) return; if (a != d) return; if (b != 0) return; // Keep the symbols and the constraints! alive. (void)(a * b * c * d); clang_analyzer_eval(c == d); // expected-warning{{TRUE}} return; } Before we start to simulate `b==0`, we have only these equivalence classes in the State: E1{`a+b`, `c`} and E2{`a`, `d`}. And we have these constraints: SymExpr(`a+b==c`) -> out-of [0, 0], SymExpr(`a==d`) -> out-of [0, 0]. Now, when we evaluate `b==0`in setConstraint when iterating the ConstraintMap then SymExpr(`a+b==c`) becomes SymExpr(`a==c`). But the equality classes are not updated. And we can update them if we scan through the ClassMap. Another alternative solution could be to re-trigger the `track` mechanism when we iterate over the ConstraintMap, but `track` seemed to be an exclusive interface towards the higher abstraction RangedConstraintManager. On the other hand, reusing the `track` mechanism could result better performance than doing another iteration on the ClassMap. Do you think it would be a better approach? And how could we reuse the `track` mechanism without getting confused with the `Adjustment` stuff? martong: My previous solution might be more readable, though, that's not working. Actually, I think I…
		vsavchenkoUnsubmitted Done Reply Inline Actions I think we can add a method `isDisequalTo` or just use `areEqual` in a this way: are equal? [Yes] -> nothing to do here [No] -> return nullptr [Don't know] -> merge vsavchenko: I think we can add a method `isDisequalTo` or just use `areEqual` in a this way: are equal?
		martongAuthorUnsubmitted Done Reply Inline Actions Good point, I've added a new overload to the static `areEqual` and added a method `isEqualTo` that uses `areEqual`. martong: Good point, I've added a new overload to the static `areEqual` and added a method `isEqualTo`…
		EquivalenceClass Class = ClassToSymbolSet.first;
		State = Class.simplify(getSValBuilder(), F, State);
		if (!State)
		return nullptr;
		SimplifiedClasses.insert(Class);
		}
		vsavchenkoUnsubmitted Done Reply Inline Actions It would be great if we provide some justification why we do merge here. vsavchenko: It would be great if we provide some justification why we do merge here.

		// Trivial equivalence classes (those that have only one symbol member) are
		vsavchenkoUnsubmitted Done Reply Inline Actions I tried to cover it in the comment to another patch. This solution includes a lot of extra work and it will lose equality/disequality information for simplified expressions, and I think it's safe to say that if `a == b` then `simplify(a) == b`. Let's start with `getConstraintMap`. It is a completely artificial data structure (and function) that exists for Z3 refutation. It's not what we keep in the state and it has a lot of duplicated constraints. If we have an equivalence class `{a, b, c, d, e, f}`, we store only one constraint for all of them (thus when we update the class, or one of the members receives a new constraint, we can update all of them). `getConstraintMap` returns a map where `a`, `b`, `c`, `d`, `e`, and `f` are mapped to the same constraint. It's not super bad, but it's extra work constructing this map and then processing it. Another, and more important aspect is that when you `setConstraint`, you lose information that this symbol is equal/disequal to other symbols. One example here would be a situation where `x + y == z`, and we find out that `y == 0`, we should update equivalence class `{x + y, z}` to be a class `{x, z}`. In order to do this, you need to update two maps: `ClassMap` (it's mapping `x + y` to `{x + y, z}`) and `ClassMembers` (it's mapping `{x + y, z}` to `x + y` and `z`). Similar example can be made with `x + y != z`, but updating `ClassMap` and `ClassMembers` will fix it. And you don't even need to touch the actual mapping with the actual constraints. vsavchenko: I tried to cover it in the comment to another patch. This solution includes a lot of extra…
		martongAuthorUnsubmitted Done Reply Inline Actions Absolutely, great findings! I think the most straightforward and consistent implementation of updating `ClassMap` and `ClassMembers` is to directly use the `merge` method. I.e. we can merge the simplified symbol (as a trivial eq class) to the existing equivalence class. Using `merge`, however, would not remove the non-simplified original symbol. But this might not be a problem; rather it is a necessity (as the child patch demonstrates) it might be very useful if we can find the symbol (without simplification, i.e. as written) in the `ConstraintRange` map. Do you see any drawbacks of reusing `merge` here? martong: Absolutely, great findings! I think the most straightforward and consistent implementation of…
		vsavchenkoUnsubmitted Done Reply Inline Actions Oh, that's actually even better! If we consider the following example. Let `a + b == c` and `a == d` be known and `b == 0` to be a new constraint. Then your approach will help us to figure out that `c == d`. So, you found a great way! I think that we should still add the test cases I briefly described in my previous comment and that one from above. vsavchenko: Oh, that's actually even better! If we consider the following example. Let `a + b == c` and `a…
		vsavchenkoUnsubmitted Done Reply Inline Actions Uh-oh, almost let yet another null-state bug to happen! During this iteration, `State` can become null, so we need to check for it. vsavchenko: Uh-oh, almost let yet another null-state bug to happen! During this iteration, `State` can…
		martongAuthorUnsubmitted Done Reply Inline Actions Good catch! martong: Good catch!
		// not stored in the State. Thus, we must skim through the constraints as
		// well. And we try to simplify symbols in the constraints.
		ConstraintRangeTy Constraints = State->get<ConstraintRange>();
		for (std::pair<EquivalenceClass, RangeSet> ClassConstraint : Constraints) {
		EquivalenceClass Class = ClassConstraint.first;
		if (SimplifiedClasses.count(Class)) // Already simplified.
		continue;
		State = Class.simplify(getSValBuilder(), F, State);
		if (!State)
		return nullptr;
		}

		return State;
}		}
};		};

} // end anonymous namespace		} // end anonymous namespace

std::unique_ptr<ConstraintManager>		std::unique_ptr<ConstraintManager>
ento::CreateRangeConstraintManager(ProgramStateManager &StMgr,		ento::CreateRangeConstraintManager(ProgramStateManager &StMgr,
ExprEngine *Eng) {		ExprEngine *Eng) {
Show All 19 Lines
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// EqualityClass implementation details		// EqualityClass implementation details
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

inline EquivalenceClass EquivalenceClass::find(ProgramStateRef State,		inline EquivalenceClass EquivalenceClass::find(ProgramStateRef State,
SymbolRef Sym) {		SymbolRef Sym) {
		assert(State && "State should not be null");
		assert(Sym && "Symbol should not be null");
// We store far from all Symbol -> Class mappings		// We store far from all Symbol -> Class mappings
if (const EquivalenceClass *NontrivialClass = State->get<ClassMap>(Sym))		if (const EquivalenceClass *NontrivialClass = State->get<ClassMap>(Sym))
return *NontrivialClass;		return *NontrivialClass;

// This is a trivial class of Sym.		// This is a trivial class of Sym.
return Sym;		return Sym;
}		}

▲ Show 20 Lines • Show All 115 Lines • ▼ Show 20 Lines	EquivalenceClass::mergeImpl(BasicValueFactory &ValueFactory,
//		//
// No need in tracking members of a now-dissolved class.		// No need in tracking members of a now-dissolved class.
Members = MF.remove(Members, Other);		Members = MF.remove(Members, Other);
// Now only the current class is mapped to all the symbols.		// Now only the current class is mapped to all the symbols.
Members = MF.add(Members, *this, NewClassMembers);		Members = MF.add(Members, *this, NewClassMembers);

// 4. Update disequality relations		// 4. Update disequality relations
ClassSet DisequalToOther = Other.getDisequalClasses(DisequalityInfo, CF);		ClassSet DisequalToOther = Other.getDisequalClasses(DisequalityInfo, CF);
		// We are about to merge two classes but they are already known to be
		// non-equal. This is a contradiction.
		if (DisequalToOther.contains(*this))
		return nullptr;
		vsavchenkoUnsubmitted Done Reply Inline Actions very opinionated nit: can you please add extra new line after this? vsavchenko: very opinionated nit: can you please add extra new line after this?
		martongAuthorUnsubmitted Done Reply Inline Actions Sure. martong: Sure.

if (!DisequalToOther.isEmpty()) {		if (!DisequalToOther.isEmpty()) {
ClassSet DisequalToThis = getDisequalClasses(DisequalityInfo, CF);		ClassSet DisequalToThis = getDisequalClasses(DisequalityInfo, CF);
DisequalityInfo = DF.remove(DisequalityInfo, Other);		DisequalityInfo = DF.remove(DisequalityInfo, Other);

for (EquivalenceClass DisequalClass : DisequalToOther) {		for (EquivalenceClass DisequalClass : DisequalToOther) {
DisequalToThis = CF.add(DisequalToThis, DisequalClass);		DisequalToThis = CF.add(DisequalToThis, DisequalClass);

// Disequality is a symmetric relation meaning that if		// Disequality is a symmetric relation meaning that if
▲ Show 20 Lines • Show All 130 Lines • ▼ Show 20 Lines	if (const RangeSet *SecondConstraint = Constraints.lookup(Second))
}		}

return true;		return true;
}		}

inline Optional<bool> EquivalenceClass::areEqual(ProgramStateRef State,		inline Optional<bool> EquivalenceClass::areEqual(ProgramStateRef State,
SymbolRef FirstSym,		SymbolRef FirstSym,
SymbolRef SecondSym) {		SymbolRef SecondSym) {
EquivalenceClass First = find(State, FirstSym);		return EquivalenceClass::areEqual(State, find(State, FirstSym),
EquivalenceClass Second = find(State, SecondSym);		find(State, SecondSym));
		}

		inline Optional<bool> EquivalenceClass::areEqual(ProgramStateRef State,
		EquivalenceClass First,
		EquivalenceClass Second) {
// The same equivalence class => symbols are equal.		// The same equivalence class => symbols are equal.
if (First == Second)		if (First == Second)
return true;		return true;

// Let's check if we know anything about these two classes being not equal to		// Let's check if we know anything about these two classes being not equal to
// each other.		// each other.
ClassSet DisequalToFirst = First.getDisequalClasses(State);		ClassSet DisequalToFirst = First.getDisequalClasses(State);
if (DisequalToFirst.contains(Second))		if (DisequalToFirst.contains(Second))
return false;		return false;

// It is not clear.		// It is not clear.
return llvm::None;		return llvm::None;
}		}

		// Iterate over all symbols and try to simplify them. Once a symbol is
		// simplified then we check if we can merge the simplified symbol's equivalence
		// class to this class. This way, we simplify not just the symbols but the
		// classes as well: we strive to keep the number of the classes to be the
		// absolute minimum.
		LLVM_NODISCARD ProgramStateRef EquivalenceClass::simplify(
		SValBuilder &SVB, RangeSet::Factory &F, ProgramStateRef State) {
		SymbolSet ClassMembers = getClassMembers(State);
		for (const SymbolRef &MemberSym : ClassMembers) {
		SymbolRef SimplifiedMemberSym = ::simplify(State, MemberSym);
		if (SimplifiedMemberSym && MemberSym != SimplifiedMemberSym) {
		EquivalenceClass ClassOfSimplifiedSym =
		EquivalenceClass::find(State, SimplifiedMemberSym);
		// The simplified symbol should be the member of the original Class,
		// however, it might be in another existing class at the moment. We
		// have to merge these classes.
		State = merge(SVB.getBasicValueFactory(), F, State, ClassOfSimplifiedSym);
		if (!State)
		return nullptr;
		}
		}
		return State;
		}

inline ClassSet EquivalenceClass::getDisequalClasses(ProgramStateRef State,		inline ClassSet EquivalenceClass::getDisequalClasses(ProgramStateRef State,
SymbolRef Sym) {		SymbolRef Sym) {
return find(State, Sym).getDisequalClasses(State);		return find(State, Sym).getDisequalClasses(State);
}		}

inline ClassSet		inline ClassSet
		vsavchenkoUnsubmitted Done Reply Inline Actions Now, since you put this logic into `merge`, you can just merge. vsavchenko: Now, since you put this logic into `merge`, you can just merge.
		martongAuthorUnsubmitted Done Reply Inline Actions Wow, good catch. martong: Wow, good catch.
EquivalenceClass::getDisequalClasses(ProgramStateRef State) const {		EquivalenceClass::getDisequalClasses(ProgramStateRef State) const {
return getDisequalClasses(State->get<DisequalityMap>(),		return getDisequalClasses(State->get<DisequalityMap>(),
State->get_context<ClassSet>());		State->get_context<ClassSet>());
}		}

inline ClassSet		inline ClassSet
EquivalenceClass::getDisequalClasses(DisequalityMapTy Map,		EquivalenceClass::getDisequalClasses(DisequalityMapTy Map,
ClassSet::Factory &Factory) const {		ClassSet::Factory &Factory) const {
▲ Show 20 Lines • Show All 535 Lines • Show Last 20 Lines

clang/test/Analysis/find-binop-constraints.cpp

This file was added.

				// RUN: %clang_analyze_cc1 %s \
				// RUN: -analyzer-checker=core \
				// RUN: -analyzer-checker=debug.ExprInspection \
				// RUN: -analyzer-config eagerly-assume=false \
				// RUN: -verify

				void clang_analyzer_eval(bool);
				void clang_analyzer_warnIfReached();

				int test_legacy_behavior(int x, int y) {
				if (y != 0)
				return 0;
				if (x + y != 0)
				return 0;
				clang_analyzer_eval(x + y == 0); // expected-warning{{TRUE}}
				clang_analyzer_eval(y == 0); // expected-warning{{TRUE}}
				return y / (x + y); // expected-warning{{Division by zero}}
				}

				int test_rhs_further_constrained(int x, int y) {
				if (x + y != 0)
				return 0;
				if (y != 0)
				return 0;
				clang_analyzer_eval(x + y == 0); // expected-warning{{TRUE}}
				clang_analyzer_eval(y == 0); // expected-warning{{TRUE}}
				return 0;
				}

				int test_lhs_further_constrained(int x, int y) {
				if (x + y != 0)
				return 0;
				if (x != 0)
				return 0;
				clang_analyzer_eval(x + y == 0); // expected-warning{{TRUE}}
				clang_analyzer_eval(x == 0); // expected-warning{{TRUE}}
				return 0;
				}

				int test_lhs_and_rhs_further_constrained(int x, int y) {
				if (x % y != 1)
				return 0;
				if (x != 1)
				return 0;
				if (y != 2)
				return 0;
				clang_analyzer_eval(x % y == 1); // expected-warning{{TRUE}}
				clang_analyzer_eval(y == 2); // expected-warning{{TRUE}}
				return 0;
				}

				int test_commutativity(int x, int y) {
				if (x + y != 0)
				return 0;
				if (y != 0)
				return 0;
				clang_analyzer_eval(y + x == 0); // expected-warning{{TRUE}}
				clang_analyzer_eval(y == 0); // expected-warning{{TRUE}}
				return 0;
				}

				int test_binop_when_height_is_2_r(int a, int x, int y, int z) {
				switch (a) {
				case 1: {
				if (x + y + z != 0)
				return 0;
				if (z != 0)
				return 0;
				clang_analyzer_eval(x + y + z == 0); // expected-warning{{TRUE}}
				clang_analyzer_eval(z == 0); // expected-warning{{TRUE}}
				break;
				}
				case 2: {
				if (x + y + z != 0)
				return 0;
				if (y != 0)
				return 0;
				clang_analyzer_eval(x + y + z == 0); // expected-warning{{TRUE}}
				clang_analyzer_eval(y == 0); // expected-warning{{TRUE}}
				break;
				}
				case 3: {
				if (x + y + z != 0)
				return 0;
				if (x != 0)
				return 0;
				clang_analyzer_eval(x + y + z == 0); // expected-warning{{TRUE}}
				clang_analyzer_eval(x == 0); // expected-warning{{TRUE}}
				break;
				}
				case 4: {
				if (x + y + z != 0)
				return 0;
				if (x + y != 0)
				return 0;
				clang_analyzer_eval(x + y + z == 0); // expected-warning{{TRUE}}
				clang_analyzer_eval(x + y == 0); // expected-warning{{TRUE}}
				break;
				}
				case 5: {
				if (z != 0)
				return 0;
				if (x + y + z != 0)
				return 0;
				clang_analyzer_eval(x + y + z == 0); // expected-warning{{TRUE}}
				if (y != 0)
				return 0;
				clang_analyzer_eval(y == 0); // expected-warning{{TRUE}}
				clang_analyzer_eval(z == 0); // expected-warning{{TRUE}}
				clang_analyzer_eval(x + y + z == 0); // expected-warning{{TRUE}}
				break;
				}

				}
				return 0;
				}

				void test_equivalence_classes_are_updated(int a, int b, int c, int d) {
				if (a + b != c)
				return;
				if (a != d)
				return;
				if (b != 0)
				return;
				clang_analyzer_eval(c == d); // expected-warning{{TRUE}}
				// Keep the symbols and the constraints! alive.
				(void)(a * b * c * d);
				return;
				}

				void test_contradiction(int a, int b, int c, int d) {
				if (a + b != c)
				return;
				if (a == c)
				return;
				clang_analyzer_warnIfReached(); // expected-warning{{REACHABLE}}

				// Bring in the contradiction.
				if (b != 0)
				return;
				clang_analyzer_warnIfReached(); // no-warning, i.e. UNREACHABLE
				// Keep the symbols and the constraints! alive.
				(void)(a * b * c * d);
				return;
				}

				void test_deferred_contradiction(int e0, int b0, int b1) {

				int e1 = e0 - b0; // e1 is bound to (reg_$0<int e0>) - (reg_$1<int b0>)
				(void)(b0 == 2); // bifurcate

				vsavchenkoUnsubmitted Done Reply Inline Actions It's not really connected to your patch, but this confuses me! Why does the analyzer think that `b0` is guaranteed to be 2 after this statement. Even if we eagerly assume here, shouldn't it mean that there are still two paths `b0 == 2` and `b0 != 2`? vsavchenko: It's not really connected to your patch, but this confuses me! Why does the analyzer think…
				martongAuthorUnsubmitted Done Reply Inline Actions Don't be puzzled by this. This indeed bifurcates. The interesting path is where `b0 == 2` is true. I am going to update this line with `if (b0 ==2) {` to achieve a similar effect. (I was using creduce and tried to simplify even more after that, but i missed this.) martong: Don't be puzzled by this. This indeed bifurcates. The interesting path is where `b0 == 2` is…
				int e2 = e1 - b1;
				if (e2 > 0) { // b1 != e1
				clang_analyzer_warnIfReached(); // expected-warning{{REACHABLE}}
				// Here, e1 is still bound to (reg_$0<int e0>) - (reg_$1<int b0>) but we
				// should be able to simplify it to (reg_$0<int e0>) - 2 and thus realize
				// the contradiction.
				if (b1 == e1) {
				clang_analyzer_warnIfReached(); // no-warning, i.e. UNREACHABLE
				vsavchenkoUnsubmitted Done Reply Inline Actions Hmm, I don't see how simplification helped here. After the previous `if` statement, we should have had two equivalence classes known to be disequal: `reg_$2<int b1>` and `(reg_$0<int e0>) - (reg_$1<int b0>)`. Further, we directly compare these two symbols. We can figure it out without any simplifications. Am I missing something here? vsavchenko: Hmm, I don't see how simplification helped here. After the previous `if` statement, we should…
				martongAuthorUnsubmitted Done Reply Inline Actions When we evaluate `e2 > 0` then we will set `e1` as disequal to `b1`. However, at this point because of the eager constant folding `e1` is `e0 - 2` (on the path where `b0 == 2` is true). So, when we evaluate `b1 == e1` then this is the diseq info we have in the State (I used `dumpDisEq` from D103967): reg_$2<int b1> DisequalTo: (reg_$0<int e0>) - 2 (reg_$0<int e0>) - 2 DisequalTo: reg_$2<int b1> And indeed we ask directly whether the LHS (`reg_$2<int b1>`) is equal to RHS`(reg_$0<int e0>) - (reg_$1<int b0>)`. This is because the` DeclRefExpr` of `e1` is still bound to SVal which originates from the time before we constrained b0 to 2. With other words: the `Environment` is not changed by introducing a new constraint. BTW, this test fails even in llvm/main. martong: When we evaluate `e2 > 0` then we will set `e1` as disequal to `b1`. However, at this point…
				martongAuthorUnsubmitted Done Reply Inline Actions With other words: the Environment is not changed by introducing a new constraint. This suggests that another approach could be to do change the `Environment` when we add a new constraint. I am not sure about the pros/cons atm, but might be worth to experiment. What do you think? martong: > With other words: the Environment is not changed by introducing a new constraint. This…
				(void)(b0 * b1 * e0 * e1 * e2);
				}
				}
				}

This is an archive of the discontinued LLVM Phabricator instance.

[Analyzer][solver] Simplify existing constraints when a new constraint is addedClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 351809

clang/include/clang/StaticAnalyzer/Core/PathSensitive/RangedConstraintManager.h

clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp

clang/test/Analysis/find-binop-constraints.cpp

[Analyzer][solver] Simplify existing constraints when a new constraint is added
ClosedPublic