This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/
-
include/clang/StaticAnalyzer/Core/PathSensitive/
-
clang/
-
StaticAnalyzer/
-
Core/
-
PathSensitive/
1/2
CheckerContext.h
-
lib/StaticAnalyzer/Checkers/
-
StaticAnalyzer/
-
Checkers/
2/6
StdLibraryFunctionsChecker.cpp
-
test/Analysis/
-
Analysis/
-
std-c-library-functions-arg-constraints.c

Differential D137722

[clang][analyzer] No new nodes when bug is detected in StdLibraryFunctionsChecker.
ClosedPublic

Authored by balazske on Nov 9 2022, 8:53 AM.

Download Raw Diff

Details

Reviewers

Szelethus
NoQ
gamesh411

Commits

rGda0660691f74: [clang][analyzer] No new nodes when bug is detected in…

Summary

The checker applies constraints in a sequence and adds new nodes for these states. If a constraint violation is found this sequence should be stopped with a sink (error) node. Instead the generateErrorNode did add a new error node as a new branch that is parallel to the other node sequence, the other branch was not stopped and analysis was continuing on that invalid branch. To add an error node after any previous node a new version of generateErrorNode is needed, this function is added here and used by StdLibraryFunctionsChecker. The added test executes a situation where the checker adds a number of constraints before it finds a constraint violation.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

balazske created this revision.Nov 9 2022, 8:53 AM

Herald added a reviewer: Szelethus. · View Herald TranscriptNov 9 2022, 8:53 AM

Herald added a reviewer: NoQ. · View Herald Transcript

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: steakhal, manas, ASDenysPetrov and 10 others. · View Herald Transcript

balazske requested review of this revision.Nov 9 2022, 8:53 AM

Herald added a project: Restricted Project. · View Herald TranscriptNov 9 2022, 8:53 AM

Herald added a subscriber: cfe-commits. · View Herald Transcript

Harbormaster completed remote builds in B196917: Diff 474289.Nov 9 2022, 9:52 AM

balazske mentioned this in D135247: [clang][analyzer] Add stream functions to StdLibraryFunctionsChecker..Nov 10 2022, 8:37 AM

balazske mentioned this in D135360: [clang][analyzer] Add some more functions to StreamChecker and StdLibraryFunctionsChecker..

balazske added a child revision: D135360: [clang][analyzer] Add some more functions to StreamChecker and StdLibraryFunctionsChecker..Nov 10 2022, 8:42 AM

balazske removed a child revision: D135360: [clang][analyzer] Add some more functions to StreamChecker and StdLibraryFunctionsChecker..

balazske added a child revision: D137790: [clang][analyzer] Remove report of null stream from StreamChecker..Nov 10 2022, 8:45 AM

balazske added a child revision: D135247: [clang][analyzer] Add stream functions to StdLibraryFunctionsChecker..Dec 7 2022, 12:17 AM

balazske removed a child revision: D137790: [clang][analyzer] Remove report of null stream from StreamChecker..

balazske edited the summary of this revision. (Show Details)Dec 7 2022, 12:37 AM

balazske added a reviewer: gamesh411.Dec 7 2022, 1:00 AM

Szelethus added inline comments.Dec 11 2022, 2:58 PM

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
954	Let me know if I got this right. The reason behind `generateErrorNode` not behaving like it usually does for other checkers is because of the explicitly supplied `NewState` parameter -- in its absence, the current path of execution is sunk. With this parameter, a new parallel node is. Correct?

balazske added inline comments.Dec 13 2022, 1:24 AM

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
954	The `NewState` only sets the state of the new error node, if it is nullptr the current state is used. A new node is always added. The other new node functions (`addTransition`, `generateNonFatalErrorNode`, `generateSink` and `addSink`) have a version that can take a predecessor node, only `generateErrorNode` did not have this (and I can not find out why part of these is called "generate" and other part "add" instead of using only "generate" or "add"). The new function is used when a node sequence `CurrentNode->A->B->ErrorNode` is needed. Without the new function it is only possible to make a `CurrentNode->ErrorNode` transition, and the following incorrect graph is created: CurrentNode->A->B \|->ErrorNode The code here does exactly this (before the fix), in `NewNode` a sequence of nodes is appended (like A and B above), and if then an error node is created it is added to the CurrentNode. Not this is needed here, the error node should come after B. Otherwise analysis can continue after node B (that path is invalid because a constraint violation was found). (The "CurrentNode" is a `Pred` value that is stored in `CheckerContext` and not changed if other nodes are added.)

Szelethus added inline comments.Dec 13 2022, 3:44 AM

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
954	I've been wondering that, especially looking at the test case. Seems like this loop runs only once, how come that new nodes are added on top of `CurrentNode` (which, in this case refers to `C.getPredecessor()`, right?)? I checked the checker's code, and I can't really see why `A` and `B` would ever appear. Isn't that a bug?

Szelethus added inline comments.Dec 13 2022, 3:46 AM

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
954	My thinking was that each checker, unless it does state splits, should really only create a single node per callback, right? The new state, however many changes it contains, should be added all at once in the single callback, no?

balazske added inline comments.Dec 13 2022, 5:54 AM

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
954	The problem is that multiple NoteTags are added. It is only possible to add a single NoteTag in a single transition. This is why in line 969 (in the currently shown code at time of this comment) `addTransition` is used for every new `SuccessSt` (otherwise `NewState` could be used without `NewNode`). Or is there a possibility for multiple NoteTags at one transition, or can such a feature be added? (But if the other state add functions all have a version that accepts a predecessor node, why is `generateErrorNode` exception?) (This state apply loop was changed in the recent time at least once.)

Folks, I'm glad you caught it! This is a classic mistake to make with addTransition() APIs.

I wish we had better APIs that don't have this problem, but instead make it very clear how many execution paths does the checker callback *intend* to create. Eg.,

C.addStateUpdate(State) would "chain" nodes by default if called multiple times, like you did in this patch;
C.addStateSplit(State1, Tag1, State2, Tag2, ..., StateN, TagN) would add at most N nodes (some may merge) and can be called only once per checker callback, otherwise it traps with assertion failure;
Similarly, mixing C.addStateUpdate() and C.addStateSplit() in the same checker callback will trap with assertion failure ("Make up your mind, are you trying to split the path or not?!");
Then we can have a Swiss-Army-knife function C.addArbitraryTransition(Pred, State, Tag) for all other use cases which *requires* you to specify the predecessor manually even if it's just C.getPredecessor().

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
954	I think you're right, even though technically it's always possible to make all updates in a single transition, in practice it often leads to annoying architectural problems. It's nice to have separation of concerns between different parts of checker code, and "chaining" nodes together is a neat way to achieve that.

This revision is now accepted and ready to land.Dec 13 2022, 2:35 PM

This revision was landed with ongoing or failed builds.Dec 14 2022, 12:52 AM

Closed by commit rGda0660691f74: [clang][analyzer] No new nodes when bug is detected in… (authored by balazske). · Explain Why

This revision was automatically updated to reflect the committed changes.

balazske added a commit: rGda0660691f74: [clang][analyzer] No new nodes when bug is detected in….

Szelethus added inline comments.Dec 19 2022, 5:40 AM

clang/include/clang/StaticAnalyzer/Core/PathSensitive/CheckerContext.h
219	What I'm missing here is some guidance. Why would I pick this overload instead of the 2-parameter one? Especially for beginners, this is very confusing.

balazske added inline comments.Dec 19 2022, 7:06 AM

clang/include/clang/StaticAnalyzer/Core/PathSensitive/CheckerContext.h
219	The whole documentation contains not enough information, at least not for beginners. This overload works the same way as at `generateNonFatalErrorNode` (and this documentation is made similar as at that function). `addTransition` works similar too, there is a bit more documentation. Probably somebody who understands `addTransition` and the `Pred` parameter can understand `generateErrorNode` too. If the name would be `addErrorNode` the similarity would be even stronger.

Revision Contents

Path

Size

clang/

include/

clang/

StaticAnalyzer/

Core/

PathSensitive/

CheckerContext.h

16 lines

lib/

StaticAnalyzer/

Checkers/

StdLibraryFunctionsChecker.cpp

2 lines

test/

Analysis/

std-c-library-functions-arg-constraints.c

15 lines

Diff 482737

clang/include/clang/StaticAnalyzer/Core/PathSensitive/CheckerContext.h

Show First 20 Lines • Show All 206 Lines • ▼ Show 20 Lines	public:
/// the default tag for the checker will be used.		/// the default tag for the checker will be used.
ExplodedNode *generateErrorNode(ProgramStateRef State = nullptr,		ExplodedNode *generateErrorNode(ProgramStateRef State = nullptr,
const ProgramPointTag *Tag = nullptr) {		const ProgramPointTag *Tag = nullptr) {
return generateSink(State, Pred,		return generateSink(State, Pred,
(Tag ? Tag : Location.getTag()));		(Tag ? Tag : Location.getTag()));
}		}

/// Generate a transition to a node that will be used to report		/// Generate a transition to a node that will be used to report
		/// an error. This node will be a sink. That is, it will stop exploration of
		/// the given path.
		///
		/// @param State The state of the generated node.
		/// @param Pred The transition will be generated from the specified Pred node
		SzelethusUnsubmitted Not Done Reply Inline Actions What I'm missing here is some guidance. Why would I pick this overload instead of the 2-parameter one? Especially for beginners, this is very confusing. Szelethus: What I'm missing here is some guidance. Why would I pick this overload instead of the 2…
		balazskeAuthorUnsubmitted Done Reply Inline Actions The whole documentation contains not enough information, at least not for beginners. This overload works the same way as at `generateNonFatalErrorNode` (and this documentation is made similar as at that function). `addTransition` works similar too, there is a bit more documentation. Probably somebody who understands `addTransition` and the `Pred` parameter can understand `generateErrorNode` too. If the name would be `addErrorNode` the similarity would be even stronger. balazske: The whole documentation contains not enough information, at least not for beginners. This…
		/// to the newly generated node.
		/// @param Tag The tag to uniquely identify the creation site. If null,
		/// the default tag for the checker will be used.
		ExplodedNode *generateErrorNode(ProgramStateRef State,
		ExplodedNode *Pred,
		const ProgramPointTag *Tag = nullptr) {
		return generateSink(State, Pred,
		(Tag ? Tag : Location.getTag()));
		}

		/// Generate a transition to a node that will be used to report
/// an error. This node will not be a sink. That is, exploration will		/// an error. This node will not be a sink. That is, exploration will
/// continue along this path.		/// continue along this path.
///		///
/// @param State The state of the generated node.		/// @param State The state of the generated node.
/// @param Tag The tag to uniquely identify the creation site. If null,		/// @param Tag The tag to uniquely identify the creation site. If null,
/// the default tag for the checker will be used.		/// the default tag for the checker will be used.
ExplodedNode *		ExplodedNode *
generateNonFatalErrorNode(ProgramStateRef State = nullptr,		generateNonFatalErrorNode(ProgramStateRef State = nullptr,
▲ Show 20 Lines • Show All 192 Lines • Show Last 20 Lines

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp

Show First 20 Lines • Show All 945 Lines • ▼ Show 20 Lines	void StdLibraryFunctionsChecker::checkPreCall(const CallEvent &Call,
ProgramStateRef NewState = State;		ProgramStateRef NewState = State;
ExplodedNode *NewNode = C.getPredecessor();		ExplodedNode *NewNode = C.getPredecessor();
for (const ValueConstraintPtr &Constraint : Summary.getArgConstraints()) {		for (const ValueConstraintPtr &Constraint : Summary.getArgConstraints()) {
ProgramStateRef SuccessSt = Constraint->apply(NewState, Call, Summary, C);		ProgramStateRef SuccessSt = Constraint->apply(NewState, Call, Summary, C);
ProgramStateRef FailureSt =		ProgramStateRef FailureSt =
Constraint->negate()->apply(NewState, Call, Summary, C);		Constraint->negate()->apply(NewState, Call, Summary, C);
// The argument constraint is not satisfied.		// The argument constraint is not satisfied.
if (FailureSt && !SuccessSt) {		if (FailureSt && !SuccessSt) {
if (ExplodedNode *N = C.generateErrorNode(NewState))		if (ExplodedNode *N = C.generateErrorNode(NewState, NewNode))
		SzelethusUnsubmitted Not Done Reply Inline Actions Let me know if I got this right. The reason behind `generateErrorNode` not behaving like it usually does for other checkers is because of the explicitly supplied `NewState` parameter -- in its absence, the current path of execution is sunk. With this parameter, a new parallel node is. Correct? Szelethus: Let me know if I got this right. The reason behind `generateErrorNode` not behaving like it…
		balazskeAuthorUnsubmitted Done Reply Inline Actions The `NewState` only sets the state of the new error node, if it is nullptr the current state is used. A new node is always added. The other new node functions (`addTransition`, `generateNonFatalErrorNode`, `generateSink` and `addSink`) have a version that can take a predecessor node, only `generateErrorNode` did not have this (and I can not find out why part of these is called "generate" and other part "add" instead of using only "generate" or "add"). The new function is used when a node sequence `CurrentNode->A->B->ErrorNode` is needed. Without the new function it is only possible to make a `CurrentNode->ErrorNode` transition, and the following incorrect graph is created: CurrentNode->A->B \|->ErrorNode The code here does exactly this (before the fix), in `NewNode` a sequence of nodes is appended (like A and B above), and if then an error node is created it is added to the CurrentNode. Not this is needed here, the error node should come after B. Otherwise analysis can continue after node B (that path is invalid because a constraint violation was found). (The "CurrentNode" is a `Pred` value that is stored in `CheckerContext` and not changed if other nodes are added.) balazske: The `NewState` only sets the state of the new error node, if it is nullptr the current state is…
		SzelethusUnsubmitted Not Done Reply Inline Actions I've been wondering that, especially looking at the test case. Seems like this loop runs only once, how come that new nodes are added on top of `CurrentNode` (which, in this case refers to `C.getPredecessor()`, right?)? I checked the checker's code, and I can't really see why `A` and `B` would ever appear. Isn't that a bug? Szelethus: I've been wondering that, especially looking at the test case. Seems like this loop runs only…
		SzelethusUnsubmitted Not Done Reply Inline Actions My thinking was that each checker, unless it does state splits, should really only create a single node per callback, right? The new state, however many changes it contains, should be added all at once in the single callback, no? Szelethus: My thinking was that each checker, unless it does state splits, should really only create a…
		balazskeAuthorUnsubmitted Done Reply Inline Actions The problem is that multiple NoteTags are added. It is only possible to add a single NoteTag in a single transition. This is why in line 969 (in the currently shown code at time of this comment) `addTransition` is used for every new `SuccessSt` (otherwise `NewState` could be used without `NewNode`). Or is there a possibility for multiple NoteTags at one transition, or can such a feature be added? (But if the other state add functions all have a version that accepts a predecessor node, why is `generateErrorNode` exception?) (This state apply loop was changed in the recent time at least once.) balazske: The problem is that multiple NoteTags are added. It is only possible to add a single NoteTag in…
		NoQUnsubmitted Not Done Reply Inline Actions I think you're right, even though technically it's always possible to make all updates in a single transition, in practice it often leads to annoying architectural problems. It's nice to have separation of concerns between different parts of checker code, and "chaining" nodes together is a neat way to achieve that. NoQ: I think you're right, even though technically it's always possible to make all updates in a…
reportBug(Call, N, Constraint.get(), Summary, C);		reportBug(Call, N, Constraint.get(), Summary, C);
break;		break;
}		}
// We will apply the constraint even if we cannot reason about the		// We will apply the constraint even if we cannot reason about the
// argument. This means both SuccessSt and FailureSt can be true. If we		// argument. This means both SuccessSt and FailureSt can be true. If we
// weren't applying the constraint that would mean that symbolic		// weren't applying the constraint that would mean that symbolic
// execution continues on a code whose behaviour is undefined.		// execution continues on a code whose behaviour is undefined.
assert(SuccessSt);		assert(SuccessSt);
▲ Show 20 Lines • Show All 2,056 Lines • Show Last 20 Lines

clang/test/Analysis/std-c-library-functions-arg-constraints.c

Show All 14 Lines
// RUN: -analyzer-checker=alpha.unix.StdCLibraryFunctionArgs \		// RUN: -analyzer-checker=alpha.unix.StdCLibraryFunctionArgs \
// RUN: -analyzer-checker=debug.StdCLibraryFunctionsTester \		// RUN: -analyzer-checker=debug.StdCLibraryFunctionsTester \
// RUN: -analyzer-checker=debug.ExprInspection \		// RUN: -analyzer-checker=debug.ExprInspection \
// RUN: -triple x86_64-unknown-linux-gnu \		// RUN: -triple x86_64-unknown-linux-gnu \
// RUN: -analyzer-output=text \		// RUN: -analyzer-output=text \
// RUN: -verify=bugpath		// RUN: -verify=bugpath

void clang_analyzer_eval(int);		void clang_analyzer_eval(int);
		void clang_analyzer_warnIfReached();

int glob;		int glob;

#define EOF -1		#define EOF -1

int isalnum(int);		int isalnum(int);

void test_alnum_concrete(int v) {		void test_alnum_concrete(int v) {
▲ Show 20 Lines • Show All 179 Lines • ▼ Show 20 Lines	if (!buf) // bugpath-note{{Assuming 'buf' is null}} \
// bugpath-note{{Taking true branch}}		// bugpath-note{{Taking true branch}}
fread(buf, sizeof(int), 10, fp); // \		fread(buf, sizeof(int), 10, fp); // \
// report-warning{{Function argument constraint is not satisfied}} \		// report-warning{{Function argument constraint is not satisfied}} \
// report-note{{}} \		// report-note{{}} \
// bugpath-warning{{Function argument constraint is not satisfied}} \		// bugpath-warning{{Function argument constraint is not satisfied}} \
// bugpath-note{{}} \		// bugpath-note{{}} \
// bugpath-note{{Function argument constraint is not satisfied}}		// bugpath-note{{Function argument constraint is not satisfied}}
}		}
		void test_no_node_after_bug(FILE fp, size_t size, size_t n, void buf) {
		if (fp) // \
		// bugpath-note{{Assuming 'fp' is null}} \
		// bugpath-note{{Taking false branch}}
		return;
		size_t ret = fread(buf, size, n, fp); // \
		// report-warning{{Function argument constraint is not satisfied}} \
		// report-note{{}} \
		// bugpath-warning{{Function argument constraint is not satisfied}} \
		// bugpath-note{{}} \
		// bugpath-note{{Function argument constraint is not satisfied}}
		clang_analyzer_warnIfReached(); // not reachable
		}

typedef __WCHAR_TYPE__ wchar_t;		typedef __WCHAR_TYPE__ wchar_t;
// This is one test case for the ARR38-C SEI-CERT rule.		// This is one test case for the ARR38-C SEI-CERT rule.
void ARR38_C_F(FILE *file) {		void ARR38_C_F(FILE *file) {
enum { BUFFER_SIZE = 1024 };		enum { BUFFER_SIZE = 1024 };
wchar_t wbuf[BUFFER_SIZE]; // bugpath-note{{'wbuf' initialized here}}		wchar_t wbuf[BUFFER_SIZE]; // bugpath-note{{'wbuf' initialized here}}

const size_t size = sizeof(*wbuf); // bugpath-note{{'size' initialized to}}		const size_t size = sizeof(*wbuf); // bugpath-note{{'size' initialized to}}
const size_t nitems = sizeof(wbuf); // bugpath-note{{'nitems' initialized to}}		const size_t nitems = sizeof(wbuf); // bugpath-note{{'nitems' initialized to}}
▲ Show 20 Lines • Show All 115 Lines • Show Last 20 Lines