This is an archive of the discontinued LLVM Phabricator instance.


Differential D103314

[Analyzer][solver] Simplify existing constraints when a new constraint is added
ClosedPublic

Authored by martong on May 28 2021, 6:26 AM.


Details

Reviewers

vsavchenko
NoQ
steakhal
Szelethus

Commits

rG8ddbb442b6e8: [Analyzer][solver] Simplify existing eq classes and constraints when a new…

Summary

Update setConstraint to simplify existing constraints (and add the
simplified constraint) when a new constraint is added. In this patch we
simply iterate over all existing constraints and try to simplify them with
simplifySVal. This solves the simplest problematic cases where we have two
symbols in the tree, e.g.:

int test_rhs_further_constrained(int x, int y) {
  if (x + y != 0)
    return 0;
  if (y != 0)
    return 0;
  clang_analyzer_eval(x + y == 0); // expected-warning{{TRUE}}
  clang_analyzer_eval(y == 0);     // expected-warning{{TRUE}}
  return 0;
}
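The idea in the summary can be sketched with a toy model. This is only an illustration under stated assumptions, not the analyzer's implementation: `Expr`, `Range`, `Constraints`, `intersect` and `toySetConstraint` are hypothetical names, an expression is modelled as a sum of distinct variables, and only the case "a variable becomes 0" is simplified.

```cpp
#include <algorithm>
#include <cassert>
#include <map>
#include <set>
#include <string>
#include <utility>
#include <vector>

// Toy model (not the analyzer's data structures): an expression is a sum of
// distinct variables, e.g. {"x", "y"} stands for `x + y`.
using Expr = std::set<std::string>;
// A closed integer interval [first, second].
using Range = std::pair<int, int>;
using Constraints = std::map<Expr, Range>;

// Intersect two intervals; the result is empty if first > second.
Range intersect(Range A, Range B) {
  return {std::max(A.first, B.first), std::min(A.second, B.second)};
}

// Record `Sym in R`. If that pins a single variable to 0, revisit every
// existing constraint, drop the variable from the expression, and attach the
// constraint to the simplified expression as well.
// Returns false if some constraint becomes empty (infeasible state).
bool toySetConstraint(Constraints &CS, const Expr &Sym, Range R) {
  assert(!Sym.empty() && R.first <= R.second && "expected a non-empty input");
  auto It = CS.find(Sym);
  Range New = It == CS.end() ? R : intersect(It->second, R);
  if (New.first > New.second)
    return false;
  CS[Sym] = New;

  bool IsZeroVar = Sym.size() == 1 && New.first == 0 && New.second == 0;
  if (!IsZeroVar)
    return true;

  // Collect first, so that entries added below are not revisited in this pass.
  std::vector<std::pair<Expr, Range>> Simplified;
  for (const auto &Entry : CS) {
    Expr E = Entry.first;
    if (E.size() > 1 && E.erase(*Sym.begin()) == 1)
      Simplified.push_back({E, Entry.second});
  }
  for (const auto &S : Simplified) {
    auto Old = CS.find(S.first);
    Range Merged =
        Old == CS.end() ? S.second : intersect(Old->second, S.second);
    if (Merged.first > Merged.second)
      return false;
    CS[S.first] = Merged;
  }
  return true;
}
```

With this sketch, recording `x + y` in [0, 0] and then `y` in [0, 0] also leaves a constraint [0, 0] attached to `x`.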

This patch is the first step of a sequence of patches, and not intended to be
committed as a standalone change. The sequence of patches (and the plan) is
described here: https://reviews.llvm.org/D102696#2784624

Diff Detail

Repository: rG LLVM Github Monorepo

Unit Tests: Failed

	Time	Test
	25,290 ms	x64 debian > LeakSanitizer-AddressSanitizer-x86_64.TestCases/Linux::libdl_deadlock.cpp

Event Timeline

martong created this revision. May 28 2021, 6:26 AM

Herald added a reviewer: Szelethus. May 28 2021, 6:26 AM

Herald added subscribers: ASDenysPetrov, gamesh411, dkrupp and 9 others.

martong requested review of this revision. May 28 2021, 6:26 AM

Herald added a project: Restricted Project. May 28 2021, 6:26 AM

Herald added a subscriber: cfe-commits.

martong added a child revision: D103317: [Analyzer][Core] Make SValBuilder to better simplify svals with 3 symbols in the tree. May 28 2021, 6:40 AM

martong mentioned this in D102696: [Analyzer] Find constraints that are directly attached to a BinOp. May 28 2021, 6:43 AM

Hey, great job! This is really something that we need, but it's not implemented entirely correctly.
I tried to cover it in the inline comment.

clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp
1597–1613	I tried to cover it in the comment to another patch. This solution includes a lot of extra work and it will lose equality/disequality information for simplified expressions, and I think it's safe to say that if `a == b` then `simplify(a) == b`.

Let's start with `getConstraintMap`. It is a completely artificial data structure (and function) that exists for Z3 refutation. It's not what we keep in the state and it has a lot of duplicated constraints. If we have an equivalence class `{a, b, c, d, e, f}`, we store only one constraint for all of them (thus when we update the class, or one of the members receives a new constraint, we can update all of them). `getConstraintMap` returns a map where `a`, `b`, `c`, `d`, `e`, and `f` are mapped to the same constraint. It's not super bad, but it's extra work constructing this map and then processing it.

Another, and more important, aspect is that when you `setConstraint`, you lose information that this symbol is equal/disequal to other symbols. One example here would be a situation where `x + y == z`, and we find out that `y == 0`: we should update equivalence class `{x + y, z}` to be a class `{x, z}`. In order to do this, you need to update two maps: `ClassMap` (it's mapping `x + y` to `{x + y, z}`) and `ClassMembers` (it's mapping `{x + y, z}` to `x + y` and `z`). A similar example can be made with `x + y != z`, but updating `ClassMap` and `ClassMembers` will fix it. And you don't even need to touch the actual mapping with the actual constraints.
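The two-map update described in this comment could look roughly like the following sketch. It is a simplified illustration, not the analyzer's code: `ToyState`, `Sym`, `ClassId` and `replaceMember` are hypothetical names, symbols are plain strings, and the sketch assumes the simplified symbol is not yet a member of any other class.

```cpp
#include <cassert>
#include <map>
#include <set>
#include <string>

// Toy stand-ins (hypothetical, not the analyzer's types): symbols are
// strings and equivalence classes are small integer ids.
using Sym = std::string;
using ClassId = int;

struct ToyState {
  std::map<Sym, ClassId> ClassMap;               // symbol -> its class
  std::map<ClassId, std::set<Sym>> ClassMembers; // class  -> its members
};

// Replace `Original` with `Simplified` inside Original's class, keeping both
// maps consistent. Assumes `Simplified` does not already belong to another
// class; that case would need a merge of the two classes instead.
void replaceMember(ToyState &S, const Sym &Original, const Sym &Simplified) {
  auto It = S.ClassMap.find(Original);
  assert(It != S.ClassMap.end() && "Original must be tracked");
  assert(S.ClassMap.count(Simplified) == 0 && "would require a merge");
  ClassId Cls = It->second;

  S.ClassMap.erase(It);
  S.ClassMap[Simplified] = Cls;

  std::set<Sym> &Members = S.ClassMembers[Cls];
  Members.erase(Original);
  Members.insert(Simplified);
}
```

Starting from the class `{x + y, z}`, calling `replaceMember(S, "x + y", "x")` leaves the class as `{x, z}` without touching any stored constraint.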

This revision now requires changes to proceed. May 28 2021, 6:53 AM

Thanks Valeriy for the quick review and guidance! I am planning to do the changes and continue next week :)

vsavchenko added inline comments. May 28 2021, 7:31 AM

clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp
1596	Also I think we can introduce a simple, but efficient optimization of kicking off the simplification process only when `Constraint` is a constant.
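A minimal sketch of this gating idea, assuming a toy range set: `ToyRangeSet` and `shouldSimplifyExistingConstraints` are hypothetical names, and only the shape of `getConcreteValue` (non-null for a single-value set) follows the `RangeSet` method shown in the diff.

```cpp
#include <cassert>
#include <utility>
#include <vector>

// Toy range set (hypothetical): a list of closed intervals.
struct ToyRangeSet {
  std::vector<std::pair<long, long>> Ranges;

  // Mirrors the idea of RangeSet::getConcreteValue: non-null only when the
  // set is a single interval that contains exactly one value.
  const long *getConcreteValue() const {
    if (Ranges.size() != 1)
      return nullptr;
    assert(Ranges[0].first <= Ranges[0].second && "malformed interval");
    return Ranges[0].first == Ranges[0].second ? &Ranges[0].first : nullptr;
  }
};

// Gate for the (potentially expensive) simplification pass: only a constant
// constraint can turn a symbol into a concrete value worth propagating.
bool shouldSimplifyExistingConstraints(const ToyRangeSet &NewConstraint) {
  return NewConstraint.getConcreteValue() != nullptr;
}
```

A constraint such as [0, 0] passes the gate, while [0, 10] or a union of two points does not.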

Harbormaster completed remote builds in B106698: Diff 348508. May 28 2021, 7:45 AM

martong marked an inline comment as done. May 31 2021, 8:26 AM

martong added inline comments.

clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp
1596	Yes, good point.
1597–1613	Absolutely, great findings! I think the most straightforward and consistent implementation of updating `ClassMap` and `ClassMembers` is to directly use the `merge` method, i.e. we can merge the simplified symbol (as a trivial eq class) into the existing equivalence class. Using `merge`, however, would not remove the non-simplified original symbol. But this might not be a problem; rather, it is a necessity (as the child patch demonstrates): it might be very useful if we can find the symbol (without simplification, i.e. as written) in the `ConstraintRange` map. Do you see any drawbacks of reusing `merge` here?

Merge the simplified symbol to the old class

That's awesome, just a few stylistic tweaks and tests and we are ready to land!

clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp
1597–1598	I think we need a comment why we care about this early exit.
1597–1613	Oh, that's actually even better! Consider the following example: let `a + b == c` and `a == d` be known and `b == 0` be a new constraint. Then your approach will help us to figure out that `c == d`. So, you found a great way! I think that we should still add the test cases I briefly described in my previous comment and that one from above.
1600	Here I also think that we need to give more context to the readers, so they understand what simplification you are talking about here.
1602–1605	You don't actually use constraints here, so (let me write it in Python) instead of `[update(classMap[cls]) for cls, constraint in constraints.items()]` you can use `[update(members) for cls, members in classMap.items()]`.
1611	It would be great if we provide some justification why we do merge here.

vsavchenko added inline comments. May 31 2021, 9:08 AM

clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp
1612–1613	Uh-oh, we almost let yet another null-state bug happen! During this iteration, `State` can become null, so we need to check for it.

Harbormaster completed remote builds in B106922: Diff 348808. May 31 2021, 9:16 AM

I had another thought: merge is usually called in situations when we found out that two symbols should be marked equal (and checked that it's possible to begin with), which is not true in your case.

If we update my case from before, we can get: a + b == c and a != c as given, and b == 0 as a new constraint. In this situation, you will merge classes {a + b, c} and {a}, which contradicts our existing disequality information.
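The contradiction in this example can be reproduced with a small toy model. This is a sketch under assumptions, not the analyzer's `merge`: `ToyEqState` and its members are hypothetical, classes are integer ids, and a `false` return stands in for the null `State` the real code would produce.

```cpp
#include <cassert>
#include <map>
#include <set>
#include <string>
#include <utility>

// Toy model (hypothetical names): every symbol belongs to a class id, and
// disequalities are recorded between class ids.
class ToyEqState {
  std::map<std::string, int> ClassOf;
  std::set<std::pair<int, int>> Disequal; // normalized: first < second
  int NextId = 0;

  static std::pair<int, int> key(int A, int B) {
    return A < B ? std::make_pair(A, B) : std::make_pair(B, A);
  }

public:
  // Return the class of a symbol, creating a trivial class on first use.
  int find(const std::string &Sym) {
    auto It = ClassOf.find(Sym);
    if (It != ClassOf.end())
      return It->second;
    int Id = NextId++;
    ClassOf[Sym] = Id;
    return Id;
  }

  void markDisequal(const std::string &L, const std::string &R) {
    int A = find(L), B = find(R);
    assert(A != B && "a class cannot be disequal to itself");
    Disequal.insert(key(A, B));
  }

  // Merge the classes of L and R. Returns false (think: null State) when the
  // two classes are already known to be disequal.
  bool merge(const std::string &L, const std::string &R) {
    int A = find(L), B = find(R);
    if (A == B)
      return true;
    if (Disequal.count(key(A, B)))
      return false;
    for (auto &Entry : ClassOf)
      if (Entry.second == B)
        Entry.second = A;
    // Re-point disequalities that mentioned B at the surviving class A.
    std::set<std::pair<int, int>> Updated;
    for (const auto &P : Disequal)
      Updated.insert(key(P.first == B ? A : P.first,
                         P.second == B ? A : P.second));
    Disequal.swap(Updated);
    return true;
  }
};
```

With `a + b == c` merged and `a != c` recorded, simplifying `a + b` to `a` (because `b == 0`) asks for `merge("a + b", "a")`, which the sketch rejects.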

In reply to D103314#2789754 (@vsavchenko):

Yes, we must check the disequivalence classes to discover such a contradiction, I updated the code to do so. Also, added a test case for the contradiction handling.

clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp
1602–1605	Actually, trivial equivalence classes (those that have only one symbol member) are not stored in the State. Thus, we must skim through the constraints as well in order to be able to simplify symbols in the constraints. In short, we have to iterate both collections.
1612–1613	Good catch!

Simplify equivalence classes when iterating over ClassMap; simplify constraints by iterating over the ConstraintsMap

I was wondering if there is a direct way to check the equivalence classes?
I am thinking about adding a clang_analyzer_dump_equivalence_classes function to the ExprInspection checker.

Awesome!
I know, I said that we are ready to land, but I think I was too excited about this change. We probably should have some data on how it performs on real-life codebases.

clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp
1559	Maybe it should be a `simplify` method of the class itself?
1568–1573	I think we can add a method `isDisequalTo` or just use `areEqual` in this way:
	are equal?
	  [Yes] -> nothing to do here
	  [No] -> return nullptr
	  [Don't know] -> merge
1602–1605	Ah, I see. Then I would say that your previous solution is more readable (if we keep `simplify`, of course).
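The three-way `areEqual` decision suggested for lines 1568–1573 might be sketched as follows. All names here (`Equality`, `Action`, `ToyKnowledge`, `decide`) are hypothetical, and the knowledge base is reduced to a set of class-id pairs known to be disequal.

```cpp
#include <cassert>
#include <set>
#include <utility>

enum class Equality { Equal, Disequal, Unknown };
enum class Action { NothingToDo, Infeasible, Merge };

// Toy knowledge base over integer class ids (hypothetical, for illustration).
struct ToyKnowledge {
  std::set<std::pair<int, int>> KnownDisequal; // stored with first < second

  Equality areEqual(int A, int B) const {
    if (A == B)
      return Equality::Equal;
    std::pair<int, int> Key =
        A < B ? std::make_pair(A, B) : std::make_pair(B, A);
    return KnownDisequal.count(Key) ? Equality::Disequal : Equality::Unknown;
  }
};

// Decide what to do with a class and the class of a simplified member.
Action decide(const ToyKnowledge &K, int Class, int ClassOfSimplified) {
  switch (K.areEqual(Class, ClassOfSimplified)) {
  case Equality::Equal:
    return Action::NothingToDo; // already the same class
  case Equality::Disequal:
    return Action::Infeasible; // the suggestion is `return nullptr` here
  case Equality::Unknown:
    return Action::Merge; // no contradiction known, so merge
  }
  assert(false && "all enumerators are handled above");
  return Action::Merge;
}
```

Returning an explicit action keeps the contradiction check and the merge in one place, which is one way to read the reviewer's suggestion.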

Harbormaster completed remote builds in B107020: Diff 348945. Jun 1 2021, 6:44 AM

martong marked 3 inline comments as done. Jun 2 2021, 7:06 AM

martong added inline comments.

clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp
1559	Yeah, makes sense.
1568–1573	Good point, I've added a new overload to the static `areEqual` and added a method `isEqualTo` that uses `areEqual`.
1602–1605	My previous solution might be more readable, but it does not work. Actually, I think I failed to explain properly why we have to iterate both collections.

We have to iterate the ConstraintMap because trivial constraints are not stored in the State but we want to simplify symbols in the constraints. So, if we were to iterate over only the ClassMap then the simplest test-case would fail:

int test_rhs_further_constrained(int x, int y) {
  if (x + y != 0)
    return 0;
  if (y != 0)
    return 0;
  clang_analyzer_eval(x + y == 0); // expected-warning{{TRUE}}
  clang_analyzer_eval(y == 0);     // expected-warning{{TRUE}} FAIL
  return 0;
}

We have to iterate the ClassMap in order to update all equivalence classes that we store in the State. Consider the example you brought up before:

void test_equivalence_classes_are_updated(int a, int b, int c, int d) {
  if (a + b != c)
    return;
  if (a != d)
    return;
  if (b != 0)
    return;
  // Keep the symbols and the constraints! alive.
  (void)(a * b * c * d);
  clang_analyzer_eval(c == d); // expected-warning{{TRUE}}
  return;
}

Before we start to simulate `b==0`, we have only these equivalence classes in the State: E1{`a+b`, `c`} and E2{`a`, `d`}. And we have these constraints: SymExpr(`a+b==c`) -> out-of [0, 0], SymExpr(`a==d`) -> out-of [0, 0]. Now, when we evaluate `b==0` in setConstraint, while iterating the ConstraintMap, SymExpr(`a+b==c`) becomes SymExpr(`a==c`). But the equality classes are not updated. And we can update them if we scan through the ClassMap.

An alternative solution could be to re-trigger the `track` mechanism when we iterate over the ConstraintMap, but `track` seemed to be an exclusive interface towards the higher abstraction RangedConstraintManager. On the other hand, reusing the `track` mechanism could result in better performance than doing another iteration on the ClassMap. Do you think it would be a better approach? And how could we reuse the `track` mechanism without getting confused with the `Adjustment` stuff?

Add isEqualTo and simplify members to EquivalenceClass

Harbormaster completed remote builds in B107241: Diff 349261. Jun 2 2021, 8:04 AM

I am terribly sorry, but I uploaded an unfinished Diff previously, please disregard that. So these are the changes:

Add isEqualTo and simplify members to EquivalenceClass

Harbormaster completed remote builds in B107304: Diff 349352. Jun 2 2021, 1:42 PM

In reply to D103314#2790868 (@vsavchenko):

Just a quick update on the status of this patch. I've done some measurements on smaller open source C projects (e.g. tmux) and didn't see any noticeable slow-down. However, I've run into a bad assertion failure in my favorite Checker (StdLibraryFu...). The assertion indicates that neither !State nor State is feasible, so this throws me back to the debugger for a while.

Simplify the symbol before eq tracking as well

In reply to D103314#2798968 (@martong):

Finally, I could boil down the infeasible parent state problem and added a test case test_deferred_contradiction to catch that. The solution is surprisingly simple: just try to simplify the symbolic expression of an equivalency before we start to update the State with the equivalency info.

Harbormaster completed remote builds in B108417: Diff 350898. Jun 9 2021, 8:43 AM

OK, we definitely need to know about performance.
Plus, I'm still curious about the crash. I didn't get how simplification helped/caused that crash.

I have one thought here. If the lack of simplification indeed caused the crash, we are in trouble with this patch. IMO simplification in just one place should make it better, but shouldn't produce infeasible states for us. In other words, any number of simplifications is a conservative operation that makes our lives a bit better. The moment they become a requirement (i.e. simplifications call for more simplifications or we crash), the solution from this patch becomes much harder. This is because whenever we do a merge, we can essentially create another situation where we find out that some symbolic expression is a constant. Let's say that we are merging classes A and B which have constraints [INT_MIN, 42] and [42, INT_MAX]. After the merge, we are positive that all the members of this new class are equal to 42. And if so, we can further simplify classes and their members. This algorithm turns into a fixed point algorithm, which has a good chance to sabotage our performance.
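The [INT_MIN, 42] and [42, INT_MAX] example can be checked with a tiny interval sketch. `Interval`, `intersect` and `concreteValue` are hypothetical helpers, not the analyzer's `RangeSet` API; the point is only that merging two non-constant constraints can yield a constant, which is what would re-trigger simplification.

```cpp
#include <algorithm>
#include <cassert>
#include <climits>

// A closed integer interval [Lo, Hi]; empty when Lo > Hi.
struct Interval {
  int Lo, Hi;
  bool isEmpty() const { return Lo > Hi; }
  bool isPoint() const { return Lo == Hi; }
};

// Constraint of a merged class: the values allowed by both operands.
Interval intersect(Interval A, Interval B) {
  Interval R = {std::max(A.Lo, B.Lo), std::min(A.Hi, B.Hi)};
  return R;
}

// The single value of a point interval.
int concreteValue(Interval I) {
  assert(I.isPoint() && "interval does not denote a constant");
  return I.Lo;
}
```

Intersecting the two example constraints gives the point interval [42, 42], so every member of the merged class is now a constant even though neither input constraint was.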

This being said, can we re-iterate on that crash and the proposed fix in much more detail?

clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp
1800	very opinionated nit: can you please add extra new line after this?
1979–1989	Now, since you put this logic into `merge`, you can just merge.
clang/test/Analysis/find-binop-constraints.cpp
151	It's not really connected to your patch, but this confuses me! Why does the analyzer think that `b0` is guaranteed to be 2 after this statement. Even if we eagerly assume here, shouldn't it mean that there are still two paths `b0 == 2` and `b0 != 2`?
156–159	Hmm, I don't see how simplification helped here. After the previous `if` statement, we should have had two equivalence classes known to be disequal: `reg_$2<int b1>` and `(reg_$0<int e0>) - (reg_$1<int b0>)`. Further, we directly compare these two symbols. We can figure it out without any simplifications. Am I missing something here?

I have one thought here. If the lack of simplification indeed caused the crash, we are in trouble with this patch. IMO simplification in just one place should make it better, but shouldn't produce infeasible states for us. In other words, any number of simplifications is a conservative operation that makes our lives a bit better. The moment they become a requirement (i.e. simplifications call for more simplifications or we crash), the solution from this patch becomes much harder. This is because whenever we do a merge, we can essentially create another situation where we find out that some symbolic expression is a constant. Let's say that we are merging classes A and B which have constraints [INT_MIN, 42] and [42, INT_MAX]. After the merge, we are positive that all the members of this new class are equal to 42. And if so, we can further simplify classes and their members. This algorithm turns into a fixed point algorithm, which has a good chance to sabotage our performance.

Yes, good point(s). I am trying to avoid turning this into a fixed point algorithm by directly iterating over the equivalence classes instead of reusing the existing track mechanism. On the other hand, perhaps with some budget the fixpoint algorithm would be worth experimenting with.

clang/test/Analysis/find-binop-constraints.cpp
151	Don't be puzzled by this. This indeed bifurcates. The interesting path is where `b0 == 2` is true. I am going to update this line with `if (b0 == 2) {` to achieve a similar effect. (I was using creduce and tried to simplify even more after that, but I missed this.)
156–159	When we evaluate `e2 > 0` then we will set `e1` as disequal to `b1`. However, at this point because of the eager constant folding `e1` is `e0 - 2` (on the path where `b0 == 2` is true). So, when we evaluate `b1 == e1` then this is the diseq info we have in the State (I used `dumpDisEq` from D103967):

reg_$2<int b1> DisequalTo: (reg_$0<int e0>) - 2
(reg_$0<int e0>) - 2 DisequalTo: reg_$2<int b1>

And indeed we ask directly whether the LHS (`reg_$2<int b1>`) is equal to the RHS (`(reg_$0<int e0>) - (reg_$1<int b0>)`). This is because the `DeclRefExpr` of `e1` is still bound to the SVal which originates from the time before we constrained b0 to 2. In other words: the `Environment` is not changed by introducing a new constraint. BTW, this test fails even in llvm/main.

martong marked 2 inline comments as done. Jun 9 2021, 11:25 AM

martong added inline comments.

clang/test/Analysis/find-binop-constraints.cpp
156–159	"In other words: the `Environment` is not changed by introducing a new constraint." This suggests that another approach could be to change the `Environment` when we add a new constraint. I am not sure about the pros/cons atm, but it might be worth experimenting with. What do you think?

OK, we definitely need to know about performance.

Couldn't agree more. I am in the middle of a performance measurement that I do with csa-testbench (on memcached, tmux, curl, twin, redis, vim, openssl, sqlite, ffmpeg, postgresql, tinyxml2, libwebm, xerces, bitcoin, protobuf). Hopefully I can give you some results soon.

Plus, I'm still curious about the crash. I didn't get how simplification helped/caused that crash.

So, the crash was actually an assertion failure in StdLibraryFunctionsChecker, which came when I made a test analysis run on the twin project. The assertion was here:

if (FailureSt && !SuccessSt) {
  if (ExplodedNode *N = C.generateErrorNode(NewState))
    reportBug(Call, N, Constraint.get(), Summary, C);
  break;
} else {
  // We will apply the constraint even if we cannot reason about the
  // argument. This means both SuccessSt and FailureSt can be true. If we
  // weren't applying the constraint that would mean that symbolic
  // execution continues on a code whose behaviour is undefined.
  assert(SuccessSt);                   // <----------------------------------------------------------------- This fired !!!
  NewState = SuccessSt;
}

With multiple creduce iterations below is a minimal example with StdLibraryFunctionsChecker. That crashed when we applied the BufferSize constraint of fread.

typedef int FILE;
long b;
unsigned long fread(void *__restrict, unsigned long, unsigned long,
                    FILE *__restrict);
void foo();
void c(int *a, int e0) {

  int e1 = e0 - b;
  b == 2;
  foo();

  int e2 = e1 - b;
  if (e2 > 0 && b == e1) {
    (void)a; (void)e1; (void)c;
    fread(a, sizeof(char), e1, c);
  }
}

It turned out that the checker hit the assertion because, before applying the arg constraint and its negated counterpart, the state was already infeasible. (But the analyzer recognized this only when it added the new assumptions when checking the applicability of the arg constraint.)
Thus, I could remove fread and the Checker from the problem set and could create the test case that synthesizes the infeasible state.

Remove isEqualTo

martong marked an inline comment as done. Jun 9 2021, 11:52 AM

martong added inline comments.

clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp
1800	Sure.
1979–1989	Wow, good catch.

Harbormaster completed remote builds in B108465: Diff 350966. Jun 9 2021, 12:46 PM

I have the first measurement results in the attached zip file. The file contains the html file generated by csa-testbench. Its name contains CTU but actually it was a regular non-CTU analysis. The most interesting part is probably the run-times, where we can notice a small increase.

Other than that, the number of the warnings seems to be unchanged. The most notable change in the statistics is in the number of paths explored by the analyzer: in some cases (e.g. twin) it increased by 2-3%.

Attachment: CTU_20results_20on_20open_20projects_201.zip (20 KB)

In reply to D103314#2810795 (@martong):

This sounds amazing! Great job!

vsavchenko accepted this revision. Jun 13 2021, 2:26 AM

This revision is now accepted and ready to land. Jun 13 2021, 2:26 AM

This revision was landed with ongoing or failed builds. Jun 14 2021, 3:19 AM

Closed by commit rG8ddbb442b6e8: [Analyzer][solver] Simplify existing eq classes and constraints when a new… (authored by martong).

This revision was automatically updated to reflect the committed changes.

martong added a commit: rG8ddbb442b6e8: [Analyzer][solver] Simplify existing eq classes and constraints when a new….

This patch is the first step of a sequence of patches, and not intended to be committed as a standalone change.

Although I planned to commit this in a lock-step when subsequent patches are also accepted, it makes sense to commit now since it's an obvious improvement and the performance penalty remains below a reasonable limit.

Hi,

I'm seeing a failed assertion with this patch.
Reproduce with

clang --analyze bbi-57338.c

Result:

clang: /repo/uabelho/master-github/llvm/include/llvm/ADT/APSInt.h:148: bool llvm::APSInt::operator<(const llvm::APSInt &) const: Assertion `IsUnsigned == RHS.IsUnsigned && "Signedness mismatch!"' failed.

Attachment: bbi-57338.c (147 B)

In reply to D103314#2829806 (@uabelho):

Good that we found it that early! Thanks Mikael!

In reply to D103314#2829806 (@uabelho):

Thanks Mikael for the reproducer, I am going to debug tomorrow.

Hi,

Another failed assertion that started appearing with this patch:

clang --analyze bbi-57589.c

which results in:

clang: ../lib/Support/APInt.cpp:284: int llvm::APInt::compareSigned(const llvm::APInt &) const: Assertion `BitWidth == RHS.BitWidth && "Bit widths must be same for comparison"' failed.

Attachment: bbi-57589.c (198 B)

Maybe it's the same root problem, but please make sure you fix both.
Thanks!

In reply to D103314#2837907 (@uabelho):

Thanks again Mikael for the report. I could find the root cause and I have a solution that solves the assertions (both test cases are fixed). I am going to upload the fix soon.

In reply to D103314#2838065 (@martong):

Great! Ping me when it's on review, I'll try to look into it ASAP!

In reply to D103314#2838065 (@martong):

Here it is: https://reviews.llvm.org/D104844

martong mentioned this in D106823: [analyzer][solver] Iterate to a fixpoint during symbol simplification with constants. Jul 26 2021, 1:58 PM

I believe this commit exposed a new false-positive bug in [core.DivideZero]. I've filed the report here: https://bugs.llvm.org/show_bug.cgi?id=51940

I believe this exposed another odd issue where a true positive (enabled by this commit) disappears when unrelated code is not present. Bug filed as: https://bugs.llvm.org/show_bug.cgi?id=51950.

martong mentioned this in rG806329da0700: [analyzer][solver] Iterate to a fixpoint during symbol simplification with…. Nov 12 2021, 2:58 AM

martong removed a child revision: D103317: [Analyzer][Core] Make SValBuilder to better simplify svals with 3 symbols in the tree. Nov 12 2021, 4:09 AM

Revision Contents

Path	Size
clang/include/clang/StaticAnalyzer/Core/PathSensitive/RangedConstraintManager.h	2 lines
clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp	77 lines
clang/test/Analysis/find-binop-constraints.cpp	145 lines

Diff 348945

clang/include/clang/StaticAnalyzer/Core/PathSensitive/RangedConstraintManager.h

[... 250 lines not shown ...]
 public:
   static void Profile(llvm::FoldingSetNodeID &ID, const RangeSet &RS) {
     ID.AddPointer(RS.Impl);
   }

   /// Profile - Generates a hash profile of this RangeSet for use
   /// by FoldingSet.
   void Profile(llvm::FoldingSetNodeID &ID) const { Profile(ID, *this); }

-  /// getConcreteValue - If a symbol is contrained to equal a specific integer
+  /// getConcreteValue - If a symbol is constrained to equal a specific integer
   /// constant then this method returns that value. Otherwise, it returns
   /// NULL.
   const llvm::APSInt *getConcreteValue() const {
     return Impl->size() == 1 ? begin()->getConcreteValue() : nullptr;
   }

   /// Get the minimal value covered by the ranges in the set.
   ///
[... 125 lines not shown ...]

clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp

[... 14 lines not shown ...]
 #include "clang/StaticAnalyzer/Core/PathSensitive/APSIntType.h"
 #include "clang/StaticAnalyzer/Core/PathSensitive/ProgramState.h"
 #include "clang/StaticAnalyzer/Core/PathSensitive/ProgramStateTrait.h"
 #include "clang/StaticAnalyzer/Core/PathSensitive/RangedConstraintManager.h"
 #include "clang/StaticAnalyzer/Core/PathSensitive/SValVisitor.h"
 #include "llvm/ADT/FoldingSet.h"
 #include "llvm/ADT/ImmutableSet.h"
 #include "llvm/ADT/STLExtras.h"
+#include "llvm/ADT/SmallSet.h"
 #include "llvm/Support/Compiler.h"
 #include "llvm/Support/raw_ostream.h"
 #include <algorithm>
 #include <iterator>

 using namespace clang;
 using namespace ento;

▲ Show 20 Lines • Show All 1,514 Lines • ▼ Show 20 Lines	if (const llvm::APSInt *Point = Constraint.getConcreteValue())
}		}

assert(areFeasible(Constraints) && "Constraint manager shouldn't produce "		assert(areFeasible(Constraints) && "Constraint manager shouldn't produce "
"a state with infeasible constraints");		"a state with infeasible constraints");

return State->set<ConstraintRange>(Constraints);		return State->set<ConstraintRange>(Constraints);
}		}

		// Iterate over all symbols in an equivalence class and try to simplify them.
		// Once a symbol is simplified then we check if we can merge the simplified
		// symbol's equivalence class to the original class. This way, we simplify the
		// classes as well: we strive to keep the number of the classes to be the
		// absolute minimum.
		LLVM_NODISCARD ProgramStateRef simplifyEquivalenceClass(
		vsavchenkoUnsubmitted Done Reply Inline Actions Maybe it should be a `simplify` method of the class itself? vsavchenko: Maybe it should be a `simplify` method of the class itself?
		martongAuthorUnsubmitted Done Reply Inline Actions Yeah, makes sense. martong: Yeah, makes sense.
		ProgramStateRef State, EquivalenceClass Class, SymbolSet ClassMembers) {
		SValBuilder &SVB = getSValBuilder();
		for (const SymbolRef &MemberSym : ClassMembers) {
		SVal SimplifiedMemberVal =
		SVB.simplifySVal(State, SVB.makeSymbolVal(MemberSym));
		SymbolRef SimplifiedMemberSym = SimplifiedMemberVal.getAsSymbol();
		if (SimplifiedMemberSym && MemberSym != SimplifiedMemberSym) {
		ClassSet DisequalClasses = Class.getDisequalClasses(State);
		EquivalenceClass ClassOfSimplifiedSym =
		EquivalenceClass::find(State, SimplifiedMemberSym);
		// We are about to add the newly simplified symbol to the existing
		// equivalence class, but they are known to be non-equal. This is a
		// contradiction.
		if (DisequalClasses.contains(ClassOfSimplifiedSym))
		vsavchenko: I think we can add a method `isDisequalTo`, or just use `areEqual` in this way: are equal? [Yes] -> nothing to do here; [No] -> return nullptr; [Don't know] -> merge.
		martong (author): Good point, I've added a new overload to the static `areEqual` and added a method `isEqualTo` that uses `areEqual`.
		return nullptr;
		// The simplified symbol should be the member of the original Class,
		// however, it might be in another existing Class at the moment. We
		// have to merge these classes.
		State = Class.merge(getBasicVals(), F, State, ClassOfSimplifiedSym);
		if (!State)
		return nullptr;
		}
		}
		return State;
		}

		// Associate a constraint to a symbolic expression. First, we set the
		// constraint in the State, then we try to simplify existing symbolic
		// expressions based on the newly set constraint.
LLVM_NODISCARD inline ProgramStateRef		LLVM_NODISCARD inline ProgramStateRef
setConstraint(ProgramStateRef State, SymbolRef Sym, RangeSet Constraint) {		setConstraint(ProgramStateRef State, SymbolRef Sym, RangeSet Constraint) {
return setConstraint(State, EquivalenceClass::find(State, Sym), Constraint);		assert(State);

		State = setConstraint(State, EquivalenceClass::find(State, Sym), Constraint);
		Lint (pre-merge checks): clang-format: please reformat the code
		- State = setConstraint(State, EquivalenceClass::find(State, Sym), Constraint);
		+ State =
		+ setConstraint(State, EquivalenceClass::find(State, Sym), Constraint);
		if (!State)
		return nullptr;

		vsavchenko: Also, I think we can introduce a simple but efficient optimization: kick off the simplification process only when `Constraint` is a constant.
		martong (author): Yes, good point.
		// We have a chance to simplify existing symbolic values if the new
		// constraint is a constant.
		vsavchenko: I think we need a comment explaining why we care about this early exit.
		if (!Constraint.getConcreteValue())
		return State;
		vsavchenko: Here I also think that we need to give more context to the readers, so they understand what simplification you are talking about here.

		llvm::SmallSet<EquivalenceClass, 4> SimplifiedClasses;
		// Iterate over all equivalence classes and try to simplify them.
		ClassMembersTy Members = State->get<ClassMembers>();
		for (std::pair<EquivalenceClass, SymbolSet> ClassToSymbolSet : Members) {
		vsavchenko: You don't actually use constraints here, so (let me write it in Python) instead of:
		    [update(classMap[class]) for class, constraint in constraints.items()]
		you can use:
		    [update(members) for class, members in classMap.items()]
		martong (author): Actually, trivial equivalence classes (those that have only one symbol member) are not stored in the State. Thus, we must skim through the constraints as well in order to be able to simplify symbols in the constraints. In short, we have to iterate both collections.
		vsavchenko: Ah, I see. Then I would say that your previous solution is more readable (if we keep `simplify`, of course).
		martong (author): My previous solution might be more readable, but it does not work. Actually, I think I failed to explain properly why we have to iterate both collections.
		We have to iterate the ConstraintMap because trivial constraints are not stored in the State, but we want to simplify symbols in the constraints. So, if we were to iterate over only the ClassMap, then the simplest test case would fail:
		    int test_rhs_further_constrained(int x, int y) {
		      if (x + y != 0)
		        return 0;
		      if (y != 0)
		        return 0;
		      clang_analyzer_eval(x + y == 0); // expected-warning{{TRUE}}
		      clang_analyzer_eval(y == 0);     // expected-warning{{TRUE}} FAIL
		      return 0;
		    }
		We have to iterate the ClassMap in order to update all equivalence classes that we store in the State. Consider the example you brought up before:
		    void test_equivalence_classes_are_updated(int a, int b, int c, int d) {
		      if (a + b != c)
		        return;
		      if (a != d)
		        return;
		      if (b != 0)
		        return;
		      // Keep the symbols and the constraints! alive.
		      (void)(a * b * c * d);
		      clang_analyzer_eval(c == d); // expected-warning{{TRUE}}
		      return;
		    }
		Before we start to simulate `b == 0`, we have only these equivalence classes in the State: E1{`a+b`, `c`} and E2{`a`, `d`}. And we have these constraints: SymExpr(`a+b==c`) -> out-of [0, 0], SymExpr(`a==d`) -> out-of [0, 0]. Now, when we evaluate `b == 0` in setConstraint while iterating the ConstraintMap, SymExpr(`a+b==c`) becomes SymExpr(`a==c`). But the equivalence classes are not updated. We can update them if we scan through the ClassMap.
		An alternative solution could be to re-trigger the `track` mechanism when we iterate over the ConstraintMap, but `track` seemed to be an exclusive interface towards the higher abstraction, RangedConstraintManager. On the other hand, reusing the `track` mechanism could result in better performance than doing another iteration over the ClassMap. Do you think it would be a better approach? And how could we reuse the `track` mechanism without getting confused by the `Adjustment` stuff?
		EquivalenceClass Class = ClassToSymbolSet.first;
		SymbolSet ClassMembers = ClassToSymbolSet.second;
		State = simplifyEquivalenceClass(State, Class, ClassMembers);
		if (!State)
		return nullptr;
		SimplifiedClasses.insert(Class);
		vsavchenko: It would be great if we provided some justification for why we merge here.
		}

		vsavchenko: I tried to cover it in the comment to another patch. This solution includes a lot of extra work and it will lose equality/disequality information for simplified expressions, and I think it's safe to say that if `a == b` then `simplify(a) == b`.
		Let's start with `getConstraintMap`. It is a completely artificial data structure (and function) that exists for Z3 refutation. It's not what we keep in the state and it has a lot of duplicated constraints. If we have an equivalence class `{a, b, c, d, e, f}`, we store only one constraint for all of them (thus when we update the class, or one of the members receives a new constraint, we can update all of them). `getConstraintMap` returns a map where `a`, `b`, `c`, `d`, `e`, and `f` are mapped to the same constraint. It's not super bad, but it's extra work constructing this map and then processing it.
		Another, and more important, aspect is that when you `setConstraint`, you lose the information that this symbol is equal/disequal to other symbols. One example here would be a situation where `x + y == z`, and we find out that `y == 0`: we should update the equivalence class `{x + y, z}` to be the class `{x, z}`. In order to do this, you need to update two maps: `ClassMap` (it's mapping `x + y` to `{x + y, z}`) and `ClassMembers` (it's mapping `{x + y, z}` to `x + y` and `z`).
		A similar example can be made with `x + y != z`, but updating `ClassMap` and `ClassMembers` will fix it. And you don't even need to touch the actual mapping with the actual constraints.
		vsavchenko: Uh-oh, I almost let yet another null-state bug happen! During this iteration, `State` can become null, so we need to check for it.
		martong (author): Absolutely, great findings! I think the most straightforward and consistent implementation of updating `ClassMap` and `ClassMembers` is to directly use the `merge` method. I.e., we can merge the simplified symbol (as a trivial equivalence class) into the existing equivalence class. Using `merge`, however, would not remove the non-simplified original symbol. But this might not be a problem; rather, it is a necessity (as the child patch demonstrates): it might be very useful if we can find the symbol (without simplification, i.e. as written) in the `ConstraintRange` map. Do you see any drawbacks of reusing `merge` here?
		vsavchenko: Oh, that's actually even better! Consider the following example. Let `a + b == c` and `a == d` be known, and let `b == 0` be a new constraint. Then your approach will help us figure out that `c == d`. So, you found a great way! I think that we should still add the test cases I briefly described in my previous comment and the one from above.
		martong (author): Good catch!
		// Trivial equivalence classes (those that have only one symbol member) are
		// not stored in the State. Thus, we must skim through the constraints as
		// well. And we try to simplify symbols in the constraints.
		ConstraintRangeTy Constraints = State->get<ConstraintRange>();
		for (std::pair<EquivalenceClass, RangeSet> ClassConstraint : Constraints) {
		EquivalenceClass Class = ClassConstraint.first;
		if (SimplifiedClasses.count(Class)) // Already simplified.
		continue;
		SymbolSet ClassMembers = Class.getClassMembers(State);
		State = simplifyEquivalenceClass(State, Class, ClassMembers);
		if (!State)
		return nullptr;
		}

		return State;
}		}
};		};

} // end anonymous namespace		} // end anonymous namespace

std::unique_ptr<ConstraintManager>		std::unique_ptr<ConstraintManager>
ento::CreateRangeConstraintManager(ProgramStateManager &StMgr,		ento::CreateRangeConstraintManager(ProgramStateManager &StMgr,
ExprEngine *Eng) {		ExprEngine *Eng) {
Show All 19 Lines
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// EqualityClass implementation details		// EqualityClass implementation details
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

inline EquivalenceClass EquivalenceClass::find(ProgramStateRef State,		inline EquivalenceClass EquivalenceClass::find(ProgramStateRef State,
SymbolRef Sym) {		SymbolRef Sym) {
		assert(State && "State should not be null");
		assert(Sym && "Symbol should not be null");
// We store far from all Symbol -> Class mappings		// We store far from all Symbol -> Class mappings
if (const EquivalenceClass *NontrivialClass = State->get<ClassMap>(Sym))		if (const EquivalenceClass *NontrivialClass = State->get<ClassMap>(Sym))
return *NontrivialClass;		return *NontrivialClass;

// This is a trivial class of Sym.		// This is a trivial class of Sym.
return Sym;		return Sym;
}		}

▲ Show 20 Lines • Show All 118 Lines • ▼ Show 20 Lines	EquivalenceClass::mergeImpl(BasicValueFactory &ValueFactory,
// Now only the current class is mapped to all the symbols.		// Now only the current class is mapped to all the symbols.
Members = MF.add(Members, *this, NewClassMembers);		Members = MF.add(Members, *this, NewClassMembers);

// 4. Update disequality relations		// 4. Update disequality relations
ClassSet DisequalToOther = Other.getDisequalClasses(DisequalityInfo, CF);		ClassSet DisequalToOther = Other.getDisequalClasses(DisequalityInfo, CF);
if (!DisequalToOther.isEmpty()) {		if (!DisequalToOther.isEmpty()) {
ClassSet DisequalToThis = getDisequalClasses(DisequalityInfo, CF);		ClassSet DisequalToThis = getDisequalClasses(DisequalityInfo, CF);
DisequalityInfo = DF.remove(DisequalityInfo, Other);		DisequalityInfo = DF.remove(DisequalityInfo, Other);

		vsavchenko: Very opinionated nit: can you please add an extra new line after this?
		martong (author): Sure.
for (EquivalenceClass DisequalClass : DisequalToOther) {		for (EquivalenceClass DisequalClass : DisequalToOther) {
DisequalToThis = CF.add(DisequalToThis, DisequalClass);		DisequalToThis = CF.add(DisequalToThis, DisequalClass);

// Disequality is a symmetric relation meaning that if		// Disequality is a symmetric relation meaning that if
// DisequalToOther not null then the set for DisequalClass is not		// DisequalToOther not null then the set for DisequalClass is not
// empty and has at least Other.		// empty and has at least Other.
ClassSet OriginalSetLinkedToOther =		ClassSet OriginalSetLinkedToOther =
*DisequalityInfo.lookup(DisequalClass);		*DisequalityInfo.lookup(DisequalClass);
▲ Show 20 Lines • Show All 162 Lines • ▼ Show 20 Lines
inline ClassSet		inline ClassSet
EquivalenceClass::getDisequalClasses(DisequalityMapTy Map,		EquivalenceClass::getDisequalClasses(DisequalityMapTy Map,
ClassSet::Factory &Factory) const {		ClassSet::Factory &Factory) const {
if (const ClassSet *DisequalClasses = Map.lookup(*this))		if (const ClassSet *DisequalClasses = Map.lookup(*this))
return *DisequalClasses;		return *DisequalClasses;

return Factory.getEmptySet();		return Factory.getEmptySet();
}		}

bool EquivalenceClass::isClassDataConsistent(ProgramStateRef State) {		bool EquivalenceClass::isClassDataConsistent(ProgramStateRef State) {
ClassMembersTy Members = State->get<ClassMembers>();		ClassMembersTy Members = State->get<ClassMembers>();

for (std::pair<EquivalenceClass, SymbolSet> ClassMembersPair : Members) {		for (std::pair<EquivalenceClass, SymbolSet> ClassMembersPair : Members) {
for (SymbolRef Member : ClassMembersPair.second) {		for (SymbolRef Member : ClassMembersPair.second) {
// Every member of the class should have a mapping back to the class.		// Every member of the class should have a mapping back to the class.
if (find(State, Member) == ClassMembersPair.first) {		if (find(State, Member) == ClassMembersPair.first) {
continue;		continue;
}		}

		vsavchenko: Now, since you put this logic into `merge`, you can just merge.
		martong (author): Wow, good catch.
return false;		return false;
}		}
}		}

DisequalityMapTy Disequalities = State->get<DisequalityMap>();		DisequalityMapTy Disequalities = State->get<DisequalityMap>();
for (std::pair<EquivalenceClass, ClassSet> DisequalityInfo : Disequalities) {		for (std::pair<EquivalenceClass, ClassSet> DisequalityInfo : Disequalities) {
EquivalenceClass Class = DisequalityInfo.first;		EquivalenceClass Class = DisequalityInfo.first;
ClassSet DisequalClasses = DisequalityInfo.second;		ClassSet DisequalClasses = DisequalityInfo.second;
▲ Show 20 Lines • Show All 511 Lines • Show Last 20 Lines
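
The merge-on-simplification idea discussed in the inline comments above can be illustrated outside the analyzer. What follows is only a minimal sketch under loud assumptions: `ToyState`, `assumeZero`, `assumeDisequal`, and the string-based `simplify` are hypothetical names invented for this illustration, symbols are plain strings, and only expressions of the form `x+y` are simplified. It is not the code from this patch and does not use Clang's `EquivalenceClass`, `ClassMap`, or `ClassMembers`. It only mirrors the two behaviours the reviewers describe: merging a simplified symbol into its original class, and reporting infeasibility when the two are already known to be disequal.

```cpp
#include <cassert>
#include <map>
#include <set>
#include <string>
#include <utility>

// Toy stand-in for the analyzer state. None of these names exist in Clang.
struct ToyState {
  std::map<std::string, std::string> Parent; // union-find links; roots absent
  std::set<std::pair<std::string, std::string>> Diseq; // known a != b facts
  std::set<std::string> Zero;    // symbols constrained to the constant 0
  std::set<std::string> Symbols; // every symbol seen so far

  // Representative of Sym's class; a symbol with no link is its own class.
  std::string find(const std::string &Sym) {
    auto It = Parent.find(Sym);
    if (It == Parent.end())
      return Sym;
    std::string Root = find(It->second);
    Parent[Sym] = Root; // path compression
    return Root;
  }

  bool areDisequal(const std::string &A, const std::string &B) {
    const std::string RA = find(A), RB = find(B);
    for (const auto &Fact : Diseq) {
      const std::string RX = find(Fact.first), RY = find(Fact.second);
      if ((RX == RA && RY == RB) || (RX == RB && RY == RA))
        return true;
    }
    return false;
  }

  // Record A != B; returns false if A and B are already known to be equal.
  bool assumeDisequal(const std::string &A, const std::string &B) {
    Symbols.insert(A);
    Symbols.insert(B);
    if (find(A) == find(B))
      return false;
    Diseq.insert(std::make_pair(A, B));
    return true;
  }

  // Merge the classes of A and B; returns false on a contradiction.
  bool merge(const std::string &A, const std::string &B) {
    Symbols.insert(A);
    Symbols.insert(B);
    if (areDisequal(A, B))
      return false;
    const std::string RA = find(A), RB = find(B);
    if (RA != RB)
      Parent[RB] = RA;
    return true;
  }

  // Toy simplifier: "x+y" becomes "x" if y is known to be 0, and vice versa.
  std::string simplify(const std::string &Sym) const {
    const auto Plus = Sym.find('+');
    if (Plus == std::string::npos)
      return Sym;
    const std::string L = Sym.substr(0, Plus), R = Sym.substr(Plus + 1);
    if (Zero.count(R))
      return L;
    if (Zero.count(L))
      return R;
    return Sym;
  }

  // New constraint Sym == 0: re-simplify every known symbol and merge each
  // simplified form into the class of the original. False means infeasible.
  bool assumeZero(const std::string &Sym) {
    Zero.insert(Sym);
    const std::set<std::string> Snapshot = Symbols; // merge() may add symbols
    for (const std::string &Member : Snapshot) {
      const std::string Simplified = simplify(Member);
      if (Simplified != Member && !merge(Member, Simplified))
        return false;
    }
    return true;
  }
};

void runToyExamples() {
  ToyState S; // a + b == c and a == d, then b == 0 implies c == d
  S.merge("a+b", "c");
  S.merge("a", "d");
  assert(S.find("c") != S.find("d"));
  const bool Feasible = S.assumeZero("b");
  assert(Feasible && S.find("c") == S.find("d"));
  (void)Feasible;

  ToyState T; // a + b == c and a != c, then b == 0 is a contradiction
  T.merge("a+b", "c");
  T.assumeDisequal("a", "c");
  const bool StillFeasible = T.assumeZero("b");
  assert(!StillFeasible);
  (void)StillFeasible;
}
```

The two cases in `runToyExamples` correspond loosely to `test_equivalence_classes_are_updated` and `test_contradiction` in the test file of this revision. Note that the toy keeps the unsimplified symbol `a+b` in its class after merging, which matches the observation in the review that `merge` does not remove the original symbol.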

clang/test/Analysis/find-binop-constraints.cpp

This file was added.

				// RUN: %clang_analyze_cc1 %s \
				// RUN: -analyzer-checker=core \
				// RUN: -analyzer-checker=debug.ExprInspection \
				// RUN: -analyzer-config eagerly-assume=false \
				// RUN: -verify

				void clang_analyzer_eval(bool);
				void clang_analyzer_warnIfReached();

				int test_legacy_behavior(int x, int y) {
				if (y != 0)
				return 0;
				if (x + y != 0)
				return 0;
				clang_analyzer_eval(x + y == 0); // expected-warning{{TRUE}}
				clang_analyzer_eval(y == 0); // expected-warning{{TRUE}}
				return y / (x + y); // expected-warning{{Division by zero}}
				}

				int test_rhs_further_constrained(int x, int y) {
				if (x + y != 0)
				return 0;
				if (y != 0)
				return 0;
				clang_analyzer_eval(x + y == 0); // expected-warning{{TRUE}}
				clang_analyzer_eval(y == 0); // expected-warning{{TRUE}}
				return 0;
				}

				int test_lhs_further_constrained(int x, int y) {
				if (x + y != 0)
				return 0;
				if (x != 0)
				return 0;
				clang_analyzer_eval(x + y == 0); // expected-warning{{TRUE}}
				clang_analyzer_eval(x == 0); // expected-warning{{TRUE}}
				return 0;
				}

				int test_lhs_and_rhs_further_constrained(int x, int y) {
				if (x % y != 1)
				return 0;
				if (x != 1)
				return 0;
				if (y != 2)
				return 0;
				clang_analyzer_eval(x % y == 1); // expected-warning{{TRUE}}
				clang_analyzer_eval(y == 2); // expected-warning{{TRUE}}
				return 0;
				}

				int test_commutativity(int x, int y) {
				if (x + y != 0)
				return 0;
				if (y != 0)
				return 0;
				clang_analyzer_eval(y + x == 0); // expected-warning{{TRUE}}
				clang_analyzer_eval(y == 0); // expected-warning{{TRUE}}
				return 0;
				}

				int test_binop_when_height_is_2_r(int a, int x, int y, int z) {
				switch (a) {
				case 1: {
				if (x + y + z != 0)
				return 0;
				if (z != 0)
				return 0;
				clang_analyzer_eval(x + y + z == 0); // expected-warning{{TRUE}}
				clang_analyzer_eval(z == 0); // expected-warning{{TRUE}}
				break;
				}
				case 2: {
				if (x + y + z != 0)
				return 0;
				if (y != 0)
				return 0;
				clang_analyzer_eval(x + y + z == 0); // expected-warning{{TRUE}}
				clang_analyzer_eval(y == 0); // expected-warning{{TRUE}}
				break;
				}
				case 3: {
				if (x + y + z != 0)
				return 0;
				if (x != 0)
				return 0;
				clang_analyzer_eval(x + y + z == 0); // expected-warning{{TRUE}}
				clang_analyzer_eval(x == 0); // expected-warning{{TRUE}}
				break;
				}
				case 4: {
				if (x + y + z != 0)
				return 0;
				if (x + y != 0)
				return 0;
				clang_analyzer_eval(x + y + z == 0); // expected-warning{{TRUE}}
				clang_analyzer_eval(x + y == 0); // expected-warning{{TRUE}}
				break;
				}
				case 5: {
				if (z != 0)
				return 0;
				if (x + y + z != 0)
				return 0;
				clang_analyzer_eval(x + y + z == 0); // expected-warning{{TRUE}}
				if (y != 0)
				return 0;
				clang_analyzer_eval(y == 0); // expected-warning{{TRUE}}
				clang_analyzer_eval(z == 0); // expected-warning{{TRUE}}
				clang_analyzer_eval(x + y + z == 0); // expected-warning{{TRUE}}
				break;
				}

				}
				return 0;
				}

				void test_equivalence_classes_are_updated(int a, int b, int c, int d) {
				if (a + b != c)
				return;
				if (a != d)
				return;
				if (b != 0)
				return;
				// Keep the symbols and the constraints! alive.
				(void)(a * b * c * d);
				clang_analyzer_eval(c == d); // expected-warning{{TRUE}}
				return;
				}

				void test_contradiction(int a, int b, int c, int d) {
				if (a + b != c)
				return;
				if (a == c)
				return;
				clang_analyzer_warnIfReached(); // expected-warning{{REACHABLE}}

				// Bring in the contradiction.
				if (b != 0)
				return;
				// Keep the symbols and the constraints! alive.
				(void)(a * b * c * d);
				clang_analyzer_warnIfReached(); // no-warning, i.e. UNREACHABLE
				return;
				}
				vsavchenko: It's not really connected to your patch, but this confuses me! Why does the analyzer think that `b0` is guaranteed to be 2 after this statement? Even if we eagerly assume here, shouldn't it mean that there are still two paths, `b0 == 2` and `b0 != 2`?
				martong (author): Don't be puzzled by this. This indeed bifurcates. The interesting path is where `b0 == 2` is true. I am going to update this line with `if (b0 == 2) {` to achieve a similar effect. (I was using creduce and tried to simplify even more after that, but I missed this.)
				vsavchenko: Hmm, I don't see how simplification helped here. After the previous `if` statement, we should have had two equivalence classes known to be disequal: `reg_$2<int b1>` and `(reg_$0<int e0>) - (reg_$1<int b0>)`. Further, we directly compare these two symbols. We can figure it out without any simplifications. Am I missing something here?
				martong (author): When we evaluate `e2 > 0`, we set `e1` as disequal to `b1`. However, at this point, because of the eager constant folding, `e1` is `e0 - 2` (on the path where `b0 == 2` is true). So, when we evaluate `b1 == e1`, this is the disequality info we have in the State (I used `dumpDisEq` from D103967):
				    reg_$2<int b1>
				    DisequalTo:
				    (reg_$0<int e0>) - 2

				    (reg_$0<int e0>) - 2
				    DisequalTo:
				    reg_$2<int b1>
				And indeed we ask directly whether the LHS (`reg_$2<int b1>`) is equal to the RHS (`(reg_$0<int e0>) - (reg_$1<int b0>)`). This is because the `DeclRefExpr` of `e1` is still bound to the SVal that originates from the time before we constrained `b0` to 2. In other words: the `Environment` is not changed by introducing a new constraint. BTW, this test fails even in llvm/main.
				martong (author):
				    > With other words: the Environment is not changed by introducing a new constraint.
				This suggests that another approach could be to change the `Environment` when we add a new constraint. I am not sure about the pros/cons at the moment, but it might be worth experimenting with. What do you think?
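
The stale-binding effect described in the last comments can be reproduced in a few lines. This is a minimal sketch with hypothetical names (`KnownConstants`, `DisequalPairs`, `simplifyDifference`, `knownDisequal`); expressions are plain strings, and only the shape `L - R` is handled. It does not model Clang's real `Environment` or `SVal` types. It only shows why a lookup with the expression as originally bound misses a disequality that was recorded in simplified form, and why simplifying before the lookup finds it.

```cpp
#include <cassert>
#include <map>
#include <set>
#include <string>
#include <utility>

// Hypothetical toy model; these are not Clang's Environment or ProgramState.
using KnownConstants = std::map<std::string, std::string>; // symbol -> constant
using DisequalPairs = std::set<std::pair<std::string, std::string>>;

// Rewrite "L - R", replacing each operand that has a known constant value.
std::string simplifyDifference(const std::string &Expr,
                               const KnownConstants &Known) {
  const std::string Op = " - ";
  const auto Pos = Expr.find(Op);
  if (Pos == std::string::npos)
    return Expr;
  std::string L = Expr.substr(0, Pos);
  std::string R = Expr.substr(Pos + Op.size());
  auto It = Known.find(L);
  if (It != Known.end())
    L = It->second;
  It = Known.find(R);
  if (It != Known.end())
    R = It->second;
  return L + Op + R;
}

// Disequality is symmetric, so check both orderings of the pair.
bool knownDisequal(const DisequalPairs &Pairs, const std::string &A,
                   const std::string &B) {
  return Pairs.count(std::make_pair(A, B)) != 0 ||
         Pairs.count(std::make_pair(B, A)) != 0;
}

void runStaleBindingExample() {
  // `e1` was bound to "e0 - b0" before `b0` was constrained to 2.
  std::map<std::string, std::string> Environment;
  Environment["e1"] = "e0 - b0";

  KnownConstants Known;
  Known["b0"] = "2";

  // The disequality was recorded after constant folding, as "e0 - 2".
  DisequalPairs Pairs;
  Pairs.insert(std::make_pair(std::string("b1"), std::string("e0 - 2")));

  const std::string Stale = Environment.at("e1");
  // Looking up the expression as bound misses the recorded fact...
  assert(!knownDisequal(Pairs, "b1", Stale));
  // ...while simplifying it first finds it.
  assert(knownDisequal(Pairs, "b1", simplifyDifference(Stale, Known)));
}
```

In this toy, the fix is applied on the query side by simplifying before the lookup. The alternative floated in the thread, updating the `Environment` itself when a constraint is added, would correspond to rewriting the stored binding instead; the sketch takes no position on which is preferable.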


Diff 348945
