This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
lib/StaticAnalyzer/Core/
-
StaticAnalyzer/
-
Core/
2/2
SimpleSValBuilder.cpp
-
test/Analysis/
-
Analysis/
-
explain-svals.cpp
-
inlining/
1/1
inline-defensive-checks.c
-
pointer-arithmetic.c

Differential D37478

[analyzer] Implement pointer arithmetic on constants
ClosedPublic

Authored by r.stahl on Sep 5 2017, 8:09 AM.

Download Raw Diff

Details

Reviewers

NoQ
xazax.hun
krememek
dcoughlin

Commits

rG21524676a2c9: [analyzer] Implement pointer arithmetic on constants
rC315296: [analyzer] Implement pointer arithmetic on constants
rL315296: [analyzer] Implement pointer arithmetic on constants

Summary

The "Multiplicand" variable in SimpleSValBuilder::evalBinOpLN was always initialized to zero, causing all pointer arithmetic on constant values to be no-ops.

This patch enables pointer arithmetic as long as the pointer is not a null pointer. The exception preserves useful warnings from the DereferenceChecker.

Diff Detail

Event Timeline

r.stahl created this revision.Sep 5 2017, 8:09 AM

Herald added a subscriber: eraman. · View Herald TranscriptSep 5 2017, 8:09 AM

One nit, otherwise LGTM! Thanks for fixing this!

lib/StaticAnalyzer/Core/SimpleSValBuilder.cpp
938	The rest of the code does not abbreviate the Type. I would prefer to name this `pointeeType`.

This revision is now accepted and ready to land.Sep 6 2017, 4:08 AM

I've seen this recently, and while i agree that the fix is correct, i'm not entirely sure that the test cases are correct. As weird as this may sound, null dereference is not an attempt to read from or write to memory address 0. Instead, it is about using a null pointer as if it was pointing to an actual object in memory, even if accessing it by a non-zero offset. For example, in

struct S {
  int x, y;
};

void foo() {
  struct S *s = NULL;
  s->y = 1;
}

we're in fact writing into *(0x4), not *(0x0), however it's intuitive that this code has a null pointer dereference, because we use a null pointer s as if it points to an actual object of type struct S. In this sense, i'd actually want the analyzer to warn in test4: it seems that the author of this code was expecting to find something useful by offset 1 to the pointer, so he must have made a mistake. Also i'm not entirely sure if i want the analyzer to warn in test1, test2, test3 (also this code pattern doesn't look widespread/idiomatic).

So the analyzer does this really really weird thing by treating many operations with concrete pointers as no-ops, keeping the pointer null when it was null before, and keeping it non-null when it was non-null before - not just in the place that you've fixed, but also when computing field or element or base-class offsets for null pointers. These are marked as FIXME all over the place, but taking up the fix would require to provide another heuristic to distinguish null dereferences from other fixed-address dereferences (i.e. the experimental FixedAddressChecker).

I guess there should be more comments on this issue near the FIXMEs you fixed, and more tests covering the intended behavior; adding them is definitely a good thing to do. I do not have any immediate ideas on how to fix the issue as a whole.

lib/StaticAnalyzer/Core/SimpleSValBuilder.cpp
938	Also `resultTy->getPointeeType()`. Note the fancy `operator->()` in `QualType`.

To be honest I was quite surprised that this change in behavior didn't cause more test failures, because for detecting null dereferences the old behavior is definitely more useful. Since it did not, I was convinced that this change is desired.

We use the analyzer for finding dereferences to fixed addresses - very similar to the FixedAddressChecker. For this purpose it is crucial that the execution engine works as perfect as possible, without "swallowing" any arithmetic.

For the struct example you mentioned you can still get the final address by asking the ASTContext if needed, but with pointer arithmetic the information is lost forever. Information is lost either way here. Either you forget that the arithmetic was based on a null pointer or you lose whatever was added to or subtracted from it.

So unless you can somehow tag the information in the SVal when an operation was based on a null pointer, this is pretty difficult. You also could introduce a heuristic that defines all dereferences around zero as null dereferences, but it would be very arbitrary and platform dependent. Or maybe the DereferenceChecker should explicitly break early on all statements that do arithmetic on pointers constrained to null. Overall I don't know enough about the analyzer to suggest more here.

Thanks for the comments, I will address them soon.

addressed the review comments

r.stahl marked 2 inline comments as done.Sep 18 2017, 2:54 AM

zaks.anna added a reviewer: dcoughlin.Sep 28 2017, 9:40 AM

Rafael: Thanks for the patch! @NoQ, @zaks.anna, and I spoke about this off-line yesterday.

While this patch improves the modeling of pointer arithmetic, we're worried about losing valuable alarms that rely on the existing behavior.

Here is a case where the analyzer would warn before your patch but doesn't with it:

void foo() {
  int *p = 0;
  int q = *(p + 5); // expected-warning {{Dereference of null pointer}}
}

The existing diagnostic machinery relies on the fact that the analyzer treats p + 5 as 0 to report the bad dereference. This comes up more often than you might think because the analyzer is sometimes quite aggressive about promoting a symbol constrained to 0 to be a concrete value of 0. For example:

void bar(int *p) {
  if (p)
    return;

  int q = *(p + 5); // expected-warning {{Dereference of null pointer}}
}

It would be good to add test cases for these diagnostics!

I think you can preserve the existing (although missing from the test suite) good diagnostics and still improve the modeling by skipping the addition/subtraction if the LHS is a concrete int with value 0. Doing so would be a very minor change to this patch.

Modeling the pointer arithmetic when the LHS is 0 while still keeping the diagnostics will likely be a more involved effort, with ramifications in multiple parts of the analyzer. We could discuss that, if you'd like to tackle it! But it would probably be good for you to get a couple more patches under your belt before taking that on.

test/Analysis/inlining/inline-defensive-checks.c
144	The analyzer doesn't warn on these on purpose. Throughout the analyzer, we have a broad heuristic that says: "if the programmer compares a pointer to NULL, then the analyzer should explicitly consider the case that the pointer is NULL". It will perform a case split in the symbolic execution: one case for when the value is definitely NULL and one case for one it is definitely not NULL. As a heuristic this works reasonably well: if the programmer bothered to add a check for NULL then they likely though the value could be NULL. However, when context-sensitive analysis via inlining was added, this heuristic broke down for functions that called other functions with what we call "inlined defensive checks" or null. Here is an example: void hasInlinedDefensiveCheck(int p) { if (!p) return; // Do something useful } void foo(int param) { hasInlinedDefensiveCheck(param); param = 7; } In this case the warning about `param = 7` is a false positive from foo's point of view because foo may have a strong invariant that param is not null; it doesn't care that hasInlinedDefensiveCheck() may have other callers that might call it with null. For this reason, we suppress reports about null pointer dereferences when we can detect an inlined defense check.

This revision now requires changes to proceed.Sep 29 2017, 5:16 PM

addressed review comments. updated summary.

Herald added a subscriber: szepet. · View Herald TranscriptOct 9 2017, 12:18 AM

This looks good to me! Thanks for adding this. Do you have commit access, or do you need someone to commit it for you?

This revision is now accepted and ready to land.Oct 9 2017, 9:14 PM

Since I do not have commit access, it would be nice if someone committed this for me. Thanks!

Closed by commit rL315296: [analyzer] Implement pointer arithmetic on constants (authored by xazax). · Explain WhyOct 10 2017, 4:01 AM

This revision was automatically updated to reflect the committed changes.

NoQ mentioned this in D65349: [analyzer] Be more careful with destructors of non-regions..Jul 29 2019, 3:19 PM

Revision Contents

Path

Size

lib/

StaticAnalyzer/

Core/

SimpleSValBuilder.cpp

2 lines

test/

Analysis/

explain-svals.cpp

3 lines

inlining/

inline-defensive-checks.c

3 lines

pointer-arithmetic.c

28 lines

Diff 113860

lib/StaticAnalyzer/Core/SimpleSValBuilder.cpp

Show First 20 Lines • Show All 929 Lines • ▼ Show 20 Lines	if (Optional<loc::ConcreteInt> lhsInt = lhs.getAs<loc::ConcreteInt>()) {
llvm::APSInt rightI(rhsInt->getValue(), /* isUnsigned */ true);		llvm::APSInt rightI(rhsInt->getValue(), /* isUnsigned */ true);

// Convert the bitwidth of rightI. This should deal with overflow		// Convert the bitwidth of rightI. This should deal with overflow
// since we are dealing with concrete values.		// since we are dealing with concrete values.
rightI = rightI.extOrTrunc(leftI.getBitWidth());		rightI = rightI.extOrTrunc(leftI.getBitWidth());

// Offset the increment by the pointer size.		// Offset the increment by the pointer size.
llvm::APSInt Multiplicand(rightI.getBitWidth(), /* isUnsigned */ true);		llvm::APSInt Multiplicand(rightI.getBitWidth(), /* isUnsigned */ true);
		QualType PteeTy = resultTy.getTypePtr()->castAs<PointerType>()->getPointeeType();
		xazax.hunUnsubmitted Done Reply Inline Actions The rest of the code does not abbreviate the Type. I would prefer to name this `pointeeType`. xazax.hun: The rest of the code does not abbreviate the Type. I would prefer to name this `pointeeType`.
		NoQUnsubmitted Done Reply Inline Actions Also `resultTy->getPointeeType()`. Note the fancy `operator->()` in `QualType`. NoQ: Also `resultTy->getPointeeType()`. Note the fancy `operator->()` in `QualType`.
		Multiplicand = getContext().getTypeSizeInChars(PteeTy).getQuantity();
rightI *= Multiplicand;		rightI *= Multiplicand;

// Compute the adjusted pointer.		// Compute the adjusted pointer.
switch (op) {		switch (op) {
case BO_Add:		case BO_Add:
rightI = leftI + rightI;		rightI = leftI + rightI;
break;		break;
case BO_Sub:		case BO_Sub:
▲ Show 20 Lines • Show All 134 Lines • Show Last 20 Lines

test/Analysis/explain-svals.cpp

	Show First 20 Lines • Show All 53 Lines • ▼ Show 20 Lines
	}			}

	void test_3(S s) {			void test_3(S s) {
	clang_analyzer_explain(&s); // expected-warning-re{{{{^pointer to parameter 's'$}}}}			clang_analyzer_explain(&s); // expected-warning-re{{{{^pointer to parameter 's'$}}}}
	clang_analyzer_explain(s.z); // expected-warning-re{{{{^initial value of field 'z' of parameter 's'$}}}}			clang_analyzer_explain(s.z); // expected-warning-re{{{{^initial value of field 'z' of parameter 's'$}}}}
	clang_analyzer_explain(&s.s2[5].y[3]); // expected-warning-re{{{{^pointer to element of type 'int' with index 3 of field 'y' of base object 'S::S3' inside element of type 'struct S::S2' with index 5 of field 's2' of parameter 's'$}}}}			clang_analyzer_explain(&s.s2[5].y[3]); // expected-warning-re{{{{^pointer to element of type 'int' with index 3 of field 'y' of base object 'S::S3' inside element of type 'struct S::S2' with index 5 of field 's2' of parameter 's'$}}}}
	if (!s.s2[7].x) {			if (!s.s2[7].x) {
	clang_analyzer_explain(s.s2[7].x); // expected-warning-re{{{{^concrete memory address '0'$}}}}			clang_analyzer_explain(s.s2[7].x); // expected-warning-re{{{{^concrete memory address '0'$}}}}
	// FIXME: we need to be explaining '1' rather than '0' here; not explainer bug.			clang_analyzer_explain(s.s2[7].x + 1); // expected-warning-re{{{{^concrete memory address '4'$}}}}
	clang_analyzer_explain(s.s2[7].x + 1); // expected-warning-re{{{{^concrete memory address '0'$}}}}
	}			}
	}			}

	void test_4(int x, int y) {			void test_4(int x, int y) {
	int z;			int z;
	static int stat;			static int stat;
	clang_analyzer_explain(x + 1); // expected-warning-re{{{{^$argument 'x'$ \+ 1$}}}}			clang_analyzer_explain(x + 1); // expected-warning-re{{{{^$argument 'x'$ \+ 1$}}}}
	clang_analyzer_explain(1 + y); // expected-warning-re{{{{^$argument 'y'$ \+ 1$}}}}			clang_analyzer_explain(1 + y); // expected-warning-re{{{{^$argument 'y'$ \+ 1$}}}}
	Show All 27 Lines

test/Analysis/inlining/inline-defensive-checks.c

	Show First 20 Lines • Show All 135 Lines • ▼ Show 20 Lines
	void idcTrackZeroThroughDoubleAssignemnt(int x) {			void idcTrackZeroThroughDoubleAssignemnt(int x) {
	idcZero(x);			idcZero(x);
	int y = x;			int y = x;
	int z = y;			int z = y;
	idcTriggerZeroValueThroughCall(z);			idcTriggerZeroValueThroughCall(z);
	}			}

	struct S {			struct S {
	int f1;			int f1;
				dcoughlinUnsubmitted Done Reply Inline Actions The analyzer doesn't warn on these on purpose. Throughout the analyzer, we have a broad heuristic that says: "if the programmer compares a pointer to NULL, then the analyzer should explicitly consider the case that the pointer is NULL". It will perform a case split in the symbolic execution: one case for when the value is definitely NULL and one case for one it is definitely not NULL. As a heuristic this works reasonably well: if the programmer bothered to add a check for NULL then they likely though the value could be NULL. However, when context-sensitive analysis via inlining was added, this heuristic broke down for functions that called other functions with what we call "inlined defensive checks" or null. Here is an example: void hasInlinedDefensiveCheck(int p) { if (!p) return; // Do something useful } void foo(int param) { hasInlinedDefensiveCheck(param); param = 7; } In this case the warning about `param = 7` is a false positive from foo's point of view because foo may have a strong invariant that param is not null; it doesn't care that hasInlinedDefensiveCheck() may have other callers that might call it with null. For this reason, we suppress reports about null pointer dereferences when we can detect an inlined defense check. dcoughlin: The analyzer doesn't warn on these on purpose. Throughout the analyzer, we have a broad…
	int f2;			int f2;
	};			};

	void idcTrackZeroValueThroughUnaryPointerOperators(struct S *s) {			void idcTrackZeroValueThroughUnaryPointerOperators(struct S *s) {
	idc(s);			idc(s);
	*(&(s->f1)) = 7; // no-warning			*(&(s->f1)) = 7; // no-warning
	}			}

	void idcTrackZeroValueThroughUnaryPointerOperatorsWithOffset1(struct S *s) {			void idcTrackZeroValueThroughUnaryPointerOperatorsWithOffset1(struct S *s) {
	idc(s);			idc(s);
	int *x = &(s->f2);			int *x = &(s->f2);
	*x = 7; // no-warning			*x = 7; // no-warning
	}			}

	void idcTrackZeroValueThroughUnaryPointerOperatorsWithOffset2(struct S *s) {			void idcTrackZeroValueThroughUnaryPointerOperatorsWithOffset2(struct S *s) {
	idc(s);			idc(s);
	int *x = &(s->f2) - 1;			int *x = &(s->f2) - 1;
	// FIXME: Should not warn.			*x = 7; // no-warning
	*x = 7; // expected-warning{{Dereference of null pointer}}
	}			}

	void idcTrackZeroValueThroughUnaryPointerOperatorsWithAssignment(struct S *s) {			void idcTrackZeroValueThroughUnaryPointerOperatorsWithAssignment(struct S *s) {
	idc(s);			idc(s);
	int *x = &(s->f1);			int *x = &(s->f1);
	*x = 7; // no-warning			*x = 7; // no-warning
	}			}

	Show All 9 Lines

test/Analysis/pointer-arithmetic.c

				// RUN: %clang_analyze_cc1 -analyzer-checker=core -verify %s

				int test1() {
				int p = (int )sizeof(int);
				p -= 1;
				return *p; // expected-warning {{Dereference of null pointer}}
				}

				int test2() {
				int p = (int )sizeof(int);
				p -= 2;
				p += 1;
				return *p; // expected-warning {{Dereference of null pointer}}
				}

				int test3() {
				int p = (int )sizeof(int);
				p++;
				p--;
				p--;
				return *p; // expected-warning {{Dereference of null pointer}}
				}

				int test4() {
				int *p = 0;
				p += 1;
				return *p; // no-warning
				}