This is an archive of the discontinued LLVM Phabricator instance.

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
452	Yeah, a must-have for this check to be enabled by default would be to be able to provide a specific warning message for every function. I guess we could include them in the summaries as an extra argument of `ArgConstraint`.
455	Let's test our notes. That'll be especially important when we get to non-concrete values, because the visitor might need to be expanded (or we might need a completely new visitor).

steakhal mentioned this in D73536: [analyzer][taint] Remove taint from symbolic expressions if used in comparisons.Feb 5 2020, 6:02 AM

martong added reviewers: Szelethus, baloghadamsoftware.Feb 6 2020, 2:08 AM

xazax.hun added inline comments.Feb 6 2020, 3:39 PM

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
442	Dealing with only concrete ints might be a good start, but we might want to handle symbolic cases in the future, like: if (v > 255) return isalpha(v); I am OK with not adding this in the first version, but adding TODOs and test cases upfront cannot hurt. So basically, I was wondering if we should query the solver for the result instead of matching the SVal kind, and just return early if we do not want to support a specific kind.

I wouldn't like to see reports emitted by a checker that resides in apiModeling. Could we create a new one? Some checkers, like the IteratorChecker, MallocChecker and CStringChecker implement a variety of user-facing checkers within the same class, that is also an option, if creating a new checker class is too much of a hassle.

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
704–706	This is true for the rest of the summaries as well, but shouldn't we retrieve the `unsigned char` size from `ASTContext`?

This revision now requires changes to proceed.Feb 7 2020, 4:40 AM

In D73898#1863710, @Szelethus wrote:

I wouldn't like to see reports emitted by a checker that resides in apiModeling. Could we create a new one? Some checkers, like the IteratorChecker, MallocChecker and CStringChecker implement a variety of user-facing checkers within the same class, that is also an option, if creating a new checker class is too much of a hassle.

Yes, we could split the warning-emitting part into a new checker. My concern is that in that case the argument-constraining part in checkPostCall would still be in this checker, because that is part of the modelling. And actually it makes sense to apply the argument constraints only if we know for sure that they are not violated. The violation would then be checked in the new checker, which seems a bit awkward to me, because checking the violation of the constraints and applying the constraints seem to be one cohesive action. I mean, it would not even make sense to turn off the warning checker, because then we'd be applying the constraints blindly.

martong marked an inline comment as done.Feb 7 2020, 8:24 AM

martong added inline comments.

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
704–706	Yes, this is a good idea. I will do this. What really bothers me, however, is that we should handle EOF in a platform-dependent way as well ... and I have absolutely no idea how to do that, given that it is defined by a macro in a platform-specific header file. I am desperately in need of help and ideas about how we could get the value of EOF for the analysed platform.

In D73898#1864066, @martong wrote:

In D73898#1863710, @Szelethus wrote:

I wouldn't like to see reports emitted by a checker that resides in apiModeling. Could we create a new one? Some checkers, like the IteratorChecker, MallocChecker and CStringChecker implement a variety of user-facing checkers within the same class, that is also an option, if creating a new checker class is too much of a hassle.

... And actually it makes sense to apply the argument constraints only if we know for sure that they are not violated. ...

What I mean by that is that we must do over-approximation if the argument is symbolic. I.e. we presume that the constraints do hold otherwise the program would be ill-formed and there is no point to continue the analysis on this path. It is very similar to what we do in case of the DivZero or the NullDeref Checkers: if there is no violation (no warning) and the variable is symbolic then we constrain the value by the condition. E.g. in DivZero::checkPreStmt we have:

// If we get here, then the denom should not be zero. We abandon the implicit
// zero denom case for now.
C.addTransition(stateNotZero);

Strictly speaking, these transitions should be part of the modeling then in this sense (and they should be in PostStmt?). Still they are not separated into a different checker.

What I mean by that is that we must do over-approximation if the argument is symbolic. I.e. we presume that the constraints do hold otherwise the program would be ill-formed and there is no point to continue the analysis on this path.

Sorry, that's actually under-approximation because we elide paths.
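
The check-then-constrain pattern described above can be sketched outside the analyzer. This is a toy model, not Clang code: `Interval`, `Outcome`, `Result` and `checkThenConstrain` are hypothetical names, and a single closed interval stands in for the analyzer's symbolic range constraints.

```cpp
#include <algorithm>
#include <cassert>
#include <optional>

// Toy stand-in for the set of values a symbol may take on the current path.
struct Interval {
  long Lo, Hi; // closed interval [Lo, Hi]
};

enum class Outcome { Violation, Constrained };

struct Result {
  Outcome Kind;
  std::optional<Interval> Narrowed; // set only when Kind == Constrained
};

// If no possible value satisfies the constraint, the call is certainly wrong:
// report and stop exploring this path. Otherwise assume the constraint holds
// and continue with the narrowed range, which drops the paths where it fails
// (the under-approximation mentioned in the discussion).
Result checkThenConstrain(Interval Possible, Interval Allowed) {
  assert(Possible.Lo <= Possible.Hi && Allowed.Lo <= Allowed.Hi);
  long Lo = std::max(Possible.Lo, Allowed.Lo);
  long Hi = std::min(Possible.Hi, Allowed.Hi);
  if (Lo > Hi)
    return {Outcome::Violation, std::nullopt};
  return {Outcome::Constrained, Interval{Lo, Hi}};
}
```

With an allowed range of [0, 255], a value known to be in [256, 1000] is a violation, while a value known only to be in [-100, 100] is silently narrowed to [0, 100].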

Based on our verbal discussion with @Szelethus and @steakhal and based on the mailing archives, I am going to do the following changes:

Add a new checker that is implemented in the StdLibraryFunctionsChecker class.
This new checker, if switched on, is responsible for emitting the warning. Even if it is turned off, the sink node is still generated if the argument violates the given condition.
This means, the new checker has the sole responsibility of emitting the warning, but nothing more.

martong added a parent revision: D74473: [analyzer] StdLibraryFunctionsChecker: Use platform dependent EOF and UCharMax.Feb 12 2020, 2:48 AM

Rebase to master

Harbormaster failed remote builds in B46416: Diff 244430!Feb 13 2020, 8:06 AM

balazske added a subscriber: balazske.Feb 14 2020, 8:04 AM

balazske added inline comments.

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
442	This check now works with concrete int values. We have a known value and a list of ranges with known limits, so testing whether the value is in any of the ranges works the same way as testing whether it is outside all of them, and testing whether the value is inside one of the ranges is simpler code. But I think symbolic evaluation with the "eval" and "assume" functions would be more generic here (and simpler code). Then the approach of cutting off the bad ranges is usable (though there is probably another solution still).
503	If `evalCall` is used, it could be simpler to test and apply the constraints for arguments and return value in a "single step".

Probably a better solution can be:
For every "case", build a single SVal that contains all argument constraints for that case. This is possible using multiple evalBinOp calls (with <=, >=, logical or) to build such a condition (or repeated calls to other assume functions to cut off the outer ranges). If the condition can be satisfied (by assume), add the new state; the condition for the return value can be added here too. Repeat this for every different case. If no applicable case is found, i.e. none of the conditions can be assumed, this means an argument constraint error.

Add new Checker that does the report
Refactor with negated RangeValues
Add overload to findFunctionSummary
Add tests for symbolic values
Add test file for bug path

I've done a major refactor with the handling of argument constraints. I am now reusing ValueRange::apply in checkPreCall on "negated" value ranges to check if the constraint is violated.
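
To illustrate the "negated" value ranges idea, here is a minimal standalone sketch. It is not the checker's code: `Range`, `RangeVector`, `negate`, `contains` and `violates` are hypothetical names, only concrete values are handled, and the type bounds are passed in by hand instead of being taken from the argument's type.

```cpp
#include <cassert>
#include <utility>
#include <vector>

using Range = std::pair<long, long>; // closed range [first, second]
using RangeVector = std::vector<Range>;

// Complement of sorted, non-overlapping ranges within [TypeMin, TypeMax].
RangeVector negate(const RangeVector &Ranges, long TypeMin, long TypeMax) {
  RangeVector Out;
  long Next = TypeMin; // smallest value not yet covered by an input range
  for (const Range &R : Ranges) {
    assert(R.first <= R.second && R.first >= Next && "ranges must be sorted");
    if (R.first > Next)
      Out.push_back({Next, R.first - 1});
    if (R.second == TypeMax)
      return Out; // nothing left above the last range
    Next = R.second + 1;
  }
  Out.push_back({Next, TypeMax});
  return Out;
}

bool contains(const RangeVector &Ranges, long V) {
  for (const Range &R : Ranges)
    if (R.first <= V && V <= R.second)
      return true;
  return false;
}

// A concrete value violates the constraint exactly when it satisfies the
// negated constraint.
bool violates(const RangeVector &Allowed, long V, long TypeMin, long TypeMax) {
  return contains(negate(Allowed, TypeMin, TypeMax), V);
}
```

For an allowed set of {-1} and [0, 255] (taking -1 as a stand-in for EOF) inside toy type bounds of [-1000, 1000], the negation is [-1000, -2] and [256, 1000], so 256 is a violation and 'a' is not.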

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
452	What about warning messages with placeholders? E.g. "Argument constraint of {nth} argument of {fun} is not satisfied. The value is {v}, however it should be in {range}." There will be a bunch of functions whose warning message template would be the same. On the other hand some others could have different warnings, and that justifies the need for specialized warnings. Still, I think the warning message in the summary should be optional, because otherwise it would be really hard to automatically add summaries from other sources (like from cppcheck). No matter how it turns out, this should be handled in a different patch.
455	Ok, I added a separate test file where the tests focus on the bug path.
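
A placeholder-based message like the one proposed above could be produced by a small helper. The sketch below is only an illustration: `formatArgConstraintWarning` and its parameters are hypothetical, and the wording follows the template from the comment, not any message the checker actually emits.

```cpp
#include <cassert>
#include <string>

// Fills the {nth}, {fun}, {v} and {range} placeholders of the proposed
// warning template. The range is passed as preformatted text.
std::string formatArgConstraintWarning(unsigned ArgNo,
                                       const std::string &Function, long Value,
                                       const std::string &Range) {
  assert(ArgNo >= 1 && "arguments are numbered from 1 in the message");
  return "Argument constraint of argument " + std::to_string(ArgNo) + " of '" +
         Function + "' is not satisfied. The value is " +
         std::to_string(Value) + ", however it should be in " + Range + ".";
}
```

A summary that needs a specialized warning could carry its own text and fall back to this template otherwise, which keeps the message optional as suggested.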

Remove leftover call from test

Harbormaster completed remote builds in B46921: Diff 245651.Feb 20 2020, 7:26 AM

Harbormaster completed remote builds in B46922: Diff 245652.Feb 20 2020, 8:12 AM

martong added reviewers: gamesh411, balazske.Feb 21 2020, 2:54 AM

steakhal added inline comments.Feb 21 2020, 4:05 AM

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
441	StringRef

martong added a parent revision: D74973: [analyzer] StdLibraryFunctionsChecker refactor w/ inheritance.Feb 21 2020, 9:54 AM

balazske added inline comments.Feb 23 2020, 11:44 PM

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
461	Is this `addTransition` needed? It would be OK to call `generateErrorNode` with `State`. Even if not, adding the transition before should not be needed?
719–720	Why is this `{128, UCharMax}` here and at the next entry needed?
728	Is this `ArgConstraint` intentionally added only to `isalnum`?

Use StringRef for Msg
Remove superfluous addTransition

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
461	Yes, you are right it is superfluous, I removed it.

martong added inline comments.Feb 24 2020, 6:12 AM

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
719–720	This is the locale-specific range, [128, 255]. There are characters like `ä` for which we don't know whether they are treated as alphanumerical characters or not. We can't really tell how a specific libc implementation classifies them. On the other hand, with English letters we can state the classes confidently.
728	Yes, I wanted to create first the infrastructure and then later to add all these constraints to the rest of the summaries with new tests.
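
The treatment of the locale-specific range can be pictured with a three-valued toy model. This is a sketch under assumptions, not the summary from the patch: `modelIsalnum` is a hypothetical name, the character set is assumed to be ASCII-compatible, and only the [0, 255] part of the argument constraint is modelled (EOF is left out).

```cpp
#include <cassert>
#include <optional>

// true/false: the result is known for every libc; nullopt: the result depends
// on the locale, so nothing can be assumed about the return value.
std::optional<bool> modelIsalnum(int C) {
  assert(C >= 0 && C <= 255 && "toy model covers only the unsigned char range");
  if ((C >= '0' && C <= '9') || (C >= 'A' && C <= 'Z') ||
      (C >= 'a' && C <= 'z'))
    return true;
  if (C >= 128)
    return std::nullopt; // e.g. 'ä' in a single-byte encoding
  return false;
}
```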

Harbormaster completed remote builds in B47126: Diff 246193.Feb 24 2020, 6:22 AM

Rebase on top of https://reviews.llvm.org/D74973

Harbormaster completed remote builds in B47140: Diff 246229.Feb 24 2020, 9:44 AM

martong added a child revision: D75063: [analyzer] StdLibraryFunctionsChecker: Add NotNull Arg Constraint.Feb 24 2020, 9:56 AM

martong marked an inline comment as done.Feb 24 2020, 10:00 AM

martong added inline comments.

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
165	This `default` branch is not needed here (actually gives a compiler warning too).

gamesh411 added inline comments.Feb 27 2020, 1:12 AM

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
704–706	If the EOF is not used in the TU analyzed, then there would be no way to find the specific `#define`. Another approach would be to check if the value is defined by an expression that is the EOF define (maybe transitively?).

It may be useful to make a "macro value map" kind of object. Some macros can be added to it as a string, and it is possible to lookup for an Expr if one of the added macros is used there. This can be done by checking the concrete (numeric) value of the Expr and compare to the value of the macro, or by checking if the expression comes from a macro and take this macro name (use string comparison). Such an object can be useful because the functionality is needed at more checkers, for example the ones I am working on (StreamChecker and ErrorReturnChecker too).

something like this:

class MacroUsageDetector {
public:
  void addMacroName(StringRef MName);
  bool isMacroUsed(StringRef MName, Expr *E, ???);
  APSInt getMacroValue(StringRef MName);
};

Or one that handles a single macro?

The high level idea and the implementation of the checker seems great. In general, things that you want to address in later patches should be stated in the code with a TODO. I wrote a couple nits that I don't want to delete, but maybe it'd be better to address them after the dependency patch is agreed upon.

clang/include/clang/StaticAnalyzer/Checkers/Checkers.td
299	How about we add an example as well?
clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
9–10	I suspect this comment is no longer relevant.
156	Maybe `complement` would be a better name? That sounds a lot more like a set operation. Also, this function highlights well that inheritance might not be the best solution here.
197	I think that is a rather poor example to help understand what `list of list of ranges` means :) -- Could you try to find something better?
437–445	While I find your usage of lambdas fascinating, this one seems a bit unnecessary :)
440	That is a `TODO`, rather :^)
clang/test/Analysis/std-c-library-functions-arg-constraints.c
2–8	Hmm, why do we have 2 different test files that essentially do the same? Shouldn't we only have a single one with `analyzer-output=text`?
clang/test/Analysis/std-c-library-functions.c
1–32	What a beautiful sight. Thanks.

Is it sure that the signedness in the ranges is handled correctly? The EOF is a negative value but the RangeInt is unsigned type. The tryExpandAsInteger returns int too that is put into an unsigned RangeInt later. Probably it is better to use APSInt for the ranges? (The problem exists already before this change.)

In D73898#1894923, @balazske wrote:

It may be useful to make a "macro value map" kind of object. Some macros can be added to it as a string, and it is possible to lookup for an Expr if one of the added macros is used there. This can be done by checking the concrete (numeric) value of the Expr and compare to the value of the macro, or by checking if the expression comes from a macro and take this macro name (use string comparison). Such an object can be useful because the functionality is needed at more checkers, for example the ones I am working on (StreamChecker and ErrorReturnChecker too).

Please see my previous answer to @gamesh411

clang/include/clang/StaticAnalyzer/Checkers/Checkers.td
299	You mean like NonNull or other constraints?
clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
9–10	Uh, yes.
156	Well, we check the argument constraint validity by trying to apply its logical negation. In the case of a range inclusion, this is being out of that range. In the case of non-null, this is being null. And so on. The logic of how we check an argument constraint is the same for all the different constraints. And that is the point: in order to support a new kind of constraint we just have to figure out how to "apply" and "negate" that constraint. In my opinion this is a perfect case for polymorphism.
197	Yeah, that part definitely should be reworded.
704–706	I believe that the given standard C lib implementation (e.g. glibc) must provide a header for the prototypes of these functions where EOF is also defined, transitively, in one of the dependent system headers. Otherwise user code could misuse the value of EOF and thus the program would behave in an undefined manner. C99 clearly states that you should #include <ctype.h> to use isalpha.
clang/test/Analysis/std-c-library-functions-arg-constraints.c
2–8	No, I wanted to have two different test files to test two different things: (1) We do have the constraints applied (here we don't care about the warnings and the path) (2) Check that we have a warning with the proper tracking and notes.
clang/test/Analysis/std-c-library-functions.c
1–32	Anytime :D

In D73898#1901142, @balazske wrote:

Is it sure that the signedness in the ranges is handled correctly? The EOF is a negative value but the RangeInt is unsigned type. The tryExpandAsInteger returns int too that is put into an unsigned RangeInt later. Probably it is better to use APSInt for the ranges? (The problem exists already before this change.)

That is not a problem, because finally in apply we use an APSInt that is constructed by considering the correct T type, e.g.:

const llvm::APSInt &Min = BVF.getValue(R[I].first, T);

We could consider RangeInt as a buffer that is big enough to hold the representation of the range values. The concrete interpretation of the bits (as T) is done by APSInt.
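
The "buffer" view of RangeInt can be demonstrated with plain integers. The sketch below only mimics the idea; it does not use APSInt or BasicValueFactory, the 64-bit width of `RangeInt` and the helper names `asSigned32` and `store` are assumptions for illustration, and the unsigned-to-signed conversion is implementation-defined before C++20 (two's complement wrap-around on mainstream compilers).

```cpp
#include <cassert>
#include <cstdint>

using RangeInt = uint64_t; // wide unsigned buffer for range bounds

// Interpret the low 32 bits of the buffer as a signed value, the way a typed
// integer would be built from the raw bits once the target type is known.
int32_t asSigned32(RangeInt Bits) {
  return static_cast<int32_t>(static_cast<uint32_t>(Bits));
}

// Store a signed bound in the buffer; the conversion wraps modulo 2^64.
RangeInt store(int32_t V) {
  RangeInt Bits = static_cast<RangeInt>(V);
  assert(asSigned32(Bits) == V && "the bound must survive the round trip");
  return Bits;
}
```

A negative bound such as -1 is stored as the all-ones bit pattern and reads back as -1 once it is interpreted with the signed type.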

Just littering some more inlines, don't mind me :) Lets still wait on the dependency patch before updating.

clang/include/clang/StaticAnalyzer/Checkers/Checkers.td
299	Like Check constraints of arguments of C standard library functions, such as whether the parameter of isalpha is in the range [0, 255] or is EOF.
clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
91–92	How about `ValueConstraintRef`?
156	We agreed on inheritance in the previous patch, and regarding the name, sure, leave it as-is. :)

Looks great as long as other reviewers are happy, thanks!

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
456–458	Maybe we should add an assertion that the same argument isn't specified multiple times.

martong marked 16 inline comments as done.Mar 17 2020, 10:25 AM

martong added inline comments.

clang/include/clang/StaticAnalyzer/Checkers/Checkers.td
299	Ok, done.
clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
91–92	Yeah, we have `ProgramStateRef` and `SymbolRef`. And both are actually just synonyms to smart pointers. I'd rather not call a pointer as a reference, because that can be confusing when reading the code. E.g. when I see that we return with a `nullptr` from a function that can return with a `...Ref` I start to scratch my head.
197	I added an example with `isalpha`.
437–445	Ok I moved it to be a member function named `ReportBug`.
456–458	I think there could be cases when we want to have e.g. a not-null constraint on the 1st argument, but we also want to express that the 1st argument's size is described by the 2nd argument. I am planning to implement such constraints in the future. In that case we would have two constraints on the 1st argument and the assert would fire.

Herald added a subscriber: DenisDvlp.Mar 17 2020, 10:25 AM

Address review comments

Harbormaster completed remote builds in B49451: Diff 250820.Mar 17 2020, 11:16 AM

tmp -> Tmp

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
165	Regarding "This default branch is not needed here (actually gives a compiler warning too)": I am not sure why I thought it was not needed; actually, we do need it. (Perhaps an intermediate version returned in each case.)

Herald added a subscriber: ASDenysPetrov.Mar 17 2020, 12:45 PM

Harbormaster completed remote builds in B49477: Diff 250869.Mar 17 2020, 1:31 PM

LGTM, aside from some checker tagging nightmare. It's a bit easy to mess up, so please allow me to have a final look before committing! :)

clang/include/clang/StaticAnalyzer/Checkers/Checkers.td
298	Just noticed, this checker still lies in the `apiModeling` package. Could we find a more appropriate place?
clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
91–92	Sure, I'm sold.
264	By passing `this`, the error message will be tied to the modeling checker, not to the one you just added. `BugType` has a constructor that accepts a string instead, pass `CheckNames[CK_StdCLibraryFunctionArgsChecker]` in there :) Also, how about `BT_InvalidArgument` or something?

This revision now requires changes to proceed.Mar 17 2020, 8:05 PM

balazske added inline comments.Mar 18 2020, 2:27 AM

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
112	Is it better done with `= 0`?
197	The "branches" are the structures that define relations between arguments and return values? This could be included in the description.
301	This should be called `reportBug`.

martong marked 11 inline comments as done.Mar 18 2020, 7:46 AM

martong added inline comments.

clang/include/clang/StaticAnalyzer/Checkers/Checkers.td
298	Technically speaking, this is still API modeling. In the mid-term we'd like to add support for more libc functions and for GNU and POSIX functions; they are all library functions, i.e. they provide some API. Of course, in the long term we'd like to experiment with getting some constraints from IR/Attributor, but we are still far from there.
clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
112	Not all of the constraint classes must implement this. Right now, e.g. the `ComparisonConstraint` does not implement this, because there is no such summary (yet) that uses a `ComparisonConstraint` as an argument constraint.
197	Not exactly. A branch represents a path in the exploded graph of a function (which is a tree). So, a branch is a series of assumptions. In other words, branches represent split states and additional assumptions on top of the splitting assumption. I added this explanation to the comments.
264	Thanks, good catch, I did not know about that. Please note that using `CheckNames` requires that we change the `BT` member to be lazily initialized. `CheckNames` is initialized only after the checker itself is created, so we cannot initialize `BT` during the checker's construction, because that would be before `CheckNames` is set. So, I changed `BT` to be a unique_ptr, and it is lazily initialized in `reportBug`.
301	Yeah, can't get used to this strange naming convention that LLVM uses. Fixed it.
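
The lazy initialization described for `BT` can be sketched without the analyzer headers. Everything below is a toy: `ToyBugType` and `ToyChecker` are hypothetical stand-ins, `CheckName` plays the role of the `CheckNames` entry that is filled in after construction, and the description string is a placeholder.

```cpp
#include <cassert>
#include <memory>
#include <string>

struct ToyBugType {
  std::string CheckerName;
  std::string Description;
};

class ToyChecker {
  // Cannot be built in the constructor: the checker name is not known yet.
  mutable std::unique_ptr<ToyBugType> BT_InvalidArg;

public:
  std::string CheckName; // set by registration, after the checker is created

  const ToyBugType &reportBug() const {
    assert(!CheckName.empty() && "registration must run before reporting");
    if (!BT_InvalidArg) // first report: create the bug type now
      BT_InvalidArg = std::make_unique<ToyBugType>(
          ToyBugType{CheckName, "placeholder description"});
    return *BT_InvalidArg;
  }
};
```

The bug type is created on the first report and reused afterwards, so it always carries the name that was set after construction.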

Add comments about what is a branch
Do not use 'this' for BugType
Lazily init BT and BT -> BT_InvalidArg
ReportBug -> reportBug

Harbormaster completed remote builds in B49598: Diff 251083.Mar 18 2020, 8:42 AM

Whoo! The patch looks great and well thought out, the tests look like they cover everything and we also talked about plans for future patches. Excellent!

I left a nit about merging the test files, but I'll leave it up to you to address or ignore it.

clang/test/Analysis/std-c-library-functions-arg-constraints.c
2–8	What if we had different `-verify`s? `clang/test/Analysis/track-conditions.cpp` is a great example.

This revision is now accepted and ready to land.Mar 19 2020, 12:52 PM

Use prefixes for -verify to check different things in the same test file

Thanks for the review guys!

clang/test/Analysis/std-c-library-functions-arg-constraints.c
2–8	Yeah, that's a very good approach, I just changed it like that. :)

Closed by commit rG94061df6e5f2: [analyzer] StdLibraryFunctionsChecker: Add argument constraints (authored by martong).Mar 20 2020, 8:39 AM

This revision was automatically updated to reflect the committed changes.

Harbormaster completed remote builds in B49893: Diff 251648.Mar 20 2020, 9:11 AM

NoQ added inline comments.Mar 25 2020, 12:06 AM

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
456–458	Wait, i misunderstood the code. It's even worse than that: you're adding transitions in a loop, so it'll cause state splits for every constraint. Because you do not intend to create multiple branches here, there needs to be exactly one `addTransition` performed every time `checkPreCall` is called. I.e., for now this code is breaking everything whenever there's more than one constraint, regardless of whether it's on the same argument.

martong marked an inline comment as done.Mar 25 2020, 7:33 AM

martong added inline comments.

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
456–458	Yeah, that's a very good catch, thanks! I am going to prepare a patch to fix this soon. My idea is to store the `SuccessSt` and apply the next argument constraint on that. And once the loop is finished, I'll call `addTransition()`.
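
The difference between adding a transition per constraint and threading one state through all constraints can be shown with a toy model. Nothing here is analyzer API: `ToyState`, `ToyContext`, `atLeast`, `atMost`, `applyInLoop` and `applyChained` are hypothetical, and a state is reduced to a single value range.

```cpp
#include <cassert>
#include <functional>
#include <optional>
#include <vector>

struct ToyState {
  long Lo, Hi; // possible values of one argument
};

// A constraint narrows the state, or returns nullopt if it cannot hold.
using Constraint = std::function<std::optional<ToyState>(const ToyState &)>;

Constraint atLeast(long Min) {
  return [Min](const ToyState &S) -> std::optional<ToyState> {
    if (S.Hi < Min)
      return std::nullopt;
    return ToyState{S.Lo < Min ? Min : S.Lo, S.Hi};
  };
}

Constraint atMost(long Max) {
  return [Max](const ToyState &S) -> std::optional<ToyState> {
    if (S.Lo > Max)
      return std::nullopt;
    return ToyState{S.Lo, S.Hi > Max ? Max : S.Hi};
  };
}

struct ToyContext {
  std::vector<ToyState> Transitions;
  void addTransition(const ToyState &S) {
    assert(S.Lo <= S.Hi && "infeasible state");
    Transitions.push_back(S);
  }
};

// Problematic shape: every constraint adds its own transition, so the path
// splits once per constraint and no branch carries all the constraints.
void applyInLoop(ToyContext &C, ToyState S, const std::vector<Constraint> &Cs) {
  for (const Constraint &Con : Cs)
    if (std::optional<ToyState> Next = Con(S))
      C.addTransition(*Next);
}

// Intended shape: carry the successful state forward, add one transition.
void applyChained(ToyContext &C, ToyState S,
                  const std::vector<Constraint> &Cs) {
  for (const Constraint &Con : Cs) {
    std::optional<ToyState> Next = Con(S);
    if (!Next)
      return; // violated; a real checker would report and sink here
    S = *Next;
  }
  C.addTransition(S);
}
```

Starting from [-10, 1000] with the constraints "at least 0" and "at most 255", the loop version yields two branches ([0, 1000] and [-10, 255]), while the chained version yields the single state [0, 255].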

NoQ added inline comments.Mar 25 2020, 8:15 AM

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
456–458	Yup, that's the common thing to do in such cases.

NoQ added inline comments.Mar 25 2020, 9:29 AM

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
456–458	While we're at it, could you try to come up with a runtime assertion that'll help us prevent these mistakes? Like, dunno, make `CheckerContext` crash whenever there's more than one branch being added, and then add a method to opt out when it's actually necessary to add more transitions (i.e., the user would say `C.setMaxTransitions(2)` at the beginning of their checker callback whenever they need to make a state split, defaulting to 1). It's a bit tricky because i still want to allow multiple transitions when they allow one branch (i.e., transitions chained together) but i think it'll take a lot of review anxiety from me because it's a very dangerous mistake to make and for now code review is the only way to catch it. So, yay, faster code reviews.
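
The proposed guard could look roughly like the following toy. It is purely hypothetical: `CheckerContext` has no `setMaxTransitions` in this review, `GuardedContext` is an invented class, an exception stands in for the crash so the behaviour can be observed, and the harder case of chained transitions that form one branch is not modelled.

```cpp
#include <cassert>
#include <cstddef>
#include <stdexcept>

class GuardedContext {
  std::size_t MaxTransitions = 1; // default: no state split allowed
  std::size_t NumTransitions = 0;

public:
  // Opt-in for callbacks that really need to split the state.
  void setMaxTransitions(std::size_t N) {
    assert(N >= 1);
    MaxTransitions = N;
  }

  void addTransition() {
    if (NumTransitions == MaxTransitions)
      throw std::logic_error("more transitions than the callback declared");
    ++NumTransitions;
  }

  std::size_t numTransitions() const { return NumTransitions; }
};
```

A callback that adds a second transition without opting in fails immediately, while one that first declares two transitions may add both.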

I just created a quick fix for the issue: https://reviews.llvm.org/D76790

martong marked an inline comment as done.Mar 25 2020, 11:23 AM

martong added inline comments.

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
456–458	Hmm, I see your point and I agree this would be a valuable sanity check. But if you don't mind, I'd like to address this in a different, stand-alone patch (independently from the quick fix https://reviews.llvm.org/D76790), because it does not seem trivial to me. My first concern is this: if we have `1` as the default value for `maxTransitions`, then we should add an extra `C.setMaxTransitions(N)` in every checker callback that does a state split, is that right?

Szelethus mentioned this in D79358: [analyzer] CERT: STR37-C.May 5 2020, 2:09 AM

Revision Contents

Path	Size
clang/include/clang/StaticAnalyzer/Checkers/Checkers.td	7 lines
clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp	173 lines
clang/test/Analysis/analyzer-enabled-checkers.c	1 line
clang/test/Analysis/std-c-library-functions-arg-constraints.c	61 lines
clang/test/Analysis/std-c-library-functions.c	36 lines

Diff 251656

clang/include/clang/StaticAnalyzer/Checkers/Checkers.td

[289 lines not shown]
 //===----------------------------------------------------------------------===//

 let ParentPackage = APIModeling in {

 def StdCLibraryFunctionsChecker : Checker<"StdCLibraryFunctions">,
 HelpText<"Improve modeling of the C standard library functions">,
 Documentation<NotDocumented>;

+def StdCLibraryFunctionArgsChecker : Checker<"StdCLibraryFunctionArgs">,
+HelpText<"Check constraints of arguments of C standard library functions, "
+"such as whether the parameter of isalpha is in the range [0, 255] "
+"or is EOF.">,
+Dependencies<[StdCLibraryFunctionsChecker]>,
+Documentation<NotDocumented>;
+
 def TrustNonnullChecker : Checker<"TrustNonnull">,
 HelpText<"Trust that returns from framework methods annotated with _Nonnull "
 "are not null">,
 Documentation<NotDocumented>;

 } // end "apiModeling"

 //===----------------------------------------------------------------------===//
[1,198 lines not shown]

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp

//=== StdLibraryFunctionsChecker.cpp - Model standard functions -- C++ --===//		//=== StdLibraryFunctionsChecker.cpp - Model standard functions -- C++ --===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
// This checker improves modeling of a few simple library functions.		// This checker improves modeling of a few simple library functions.
// It does not generate warnings.
//		//
		SzelethusUnsubmitted Done Reply Inline Actions I suspect this comment is no longer relevant. Szelethus: I suspect this comment is no longer relevant.
		martongAuthorUnsubmitted Done Reply Inline Actions Uh, yes. martong: Uh, yes.
// This checker provides a specification format - `Summary' - and		// This checker provides a specification format - `Summary' - and
// contains descriptions of some library functions in this format. Each		// contains descriptions of some library functions in this format. Each
// specification contains a list of branches for splitting the program state		// specification contains a list of branches for splitting the program state
// upon call, and range constraints on argument and return-value symbols that		// upon call, and range constraints on argument and return-value symbols that
// are satisfied on each branch. This spec can be expanded to include more		// are satisfied on each branch. This spec can be expanded to include more
// items, like external effects of the function.		// items, like external effects of the function.
//		//
// The main difference between this approach and the body farms technique is		// The main difference between this approach and the body farms technique is
Show All 26 Lines
// fwrite isalpha islower read		// fwrite isalpha islower read
// getc isascii isprint write		// getc isascii isprint write
// getchar isblank ispunct		// getchar isblank ispunct
// getdelim iscntrl isspace		// getdelim iscntrl isspace
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "clang/StaticAnalyzer/Checkers/BuiltinCheckerRegistration.h"		#include "clang/StaticAnalyzer/Checkers/BuiltinCheckerRegistration.h"
		#include "clang/StaticAnalyzer/Core/BugReporter/BugType.h"
#include "clang/StaticAnalyzer/Core/Checker.h"		#include "clang/StaticAnalyzer/Core/Checker.h"
#include "clang/StaticAnalyzer/Core/CheckerManager.h"		#include "clang/StaticAnalyzer/Core/CheckerManager.h"
#include "clang/StaticAnalyzer/Core/PathSensitive/CallEvent.h"		#include "clang/StaticAnalyzer/Core/PathSensitive/CallEvent.h"
#include "clang/StaticAnalyzer/Core/PathSensitive/CheckerContext.h"		#include "clang/StaticAnalyzer/Core/PathSensitive/CheckerContext.h"
#include "clang/StaticAnalyzer/Core/PathSensitive/CheckerHelpers.h"		#include "clang/StaticAnalyzer/Core/PathSensitive/CheckerHelpers.h"

using namespace clang;		using namespace clang;
using namespace clang::ento;		using namespace clang::ento;

namespace {		namespace {
class StdLibraryFunctionsChecker : public Checker<check::PostCall, eval::Call> {		class StdLibraryFunctionsChecker
		: public Checker<check::PreCall, check::PostCall, eval::Call> {
/// Below is a series of typedefs necessary to define function specs.		/// Below is a series of typedefs necessary to define function specs.
/// We avoid nesting types here because each additional qualifier		/// We avoid nesting types here because each additional qualifier
/// would need to be repeated in every function spec.		/// would need to be repeated in every function spec.
struct Summary;		struct Summary;

/// Specify how much the analyzer engine should entrust modeling this function		/// Specify how much the analyzer engine should entrust modeling this function
/// to us. If he doesn't, he performs additional invalidations.		/// to us. If he doesn't, he performs additional invalidations.
enum InvalidationKind { NoEvalCall, EvalCallAsPure };		enum InvalidationKind { NoEvalCall, EvalCallAsPure };
Show All 9 Lines	class StdLibraryFunctionsChecker
typedef std::vector<std::pair<RangeInt, RangeInt>> IntRangeVector;		typedef std::vector<std::pair<RangeInt, RangeInt>> IntRangeVector;

/// A reference to an argument or return value by its number.		/// A reference to an argument or return value by its number.
/// ArgNo in CallExpr and CallEvent is defined as Unsigned, but		/// ArgNo in CallExpr and CallEvent is defined as Unsigned, but
/// obviously uint32_t should be enough for all practical purposes.		/// obviously uint32_t should be enough for all practical purposes.
typedef uint32_t ArgNo;		typedef uint32_t ArgNo;
static const ArgNo Ret;		static const ArgNo Ret;

		class ValueConstraint;

		Szelethus: How about `ValueConstraintRef`?
		martong (Author): Yeah, we have `ProgramStateRef` and `SymbolRef`, and both are actually just synonyms for smart pointers. I'd rather not call a pointer a reference, because that can be confusing when reading the code. E.g. when I see that we return `nullptr` from a function that can return a `...Ref`, I start to scratch my head.
		Szelethus: Sure, I'm sold.
		// Pointer to the ValueConstraint. We need a copyable, polymorphic and
		// default initialize able type (vector needs that). A raw pointer was good,
		// however, we cannot default initialize that. unique_ptr makes the Summary
		// class non-copyable, therefore not an option. Releasing the copyability
		// requirement would render the initialization of the Summary map infeasible.
		using ValueConstraintPtr = std::shared_ptr<ValueConstraint>;
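
The copyability requirement described in the comment above can be illustrated with a small standalone sketch. It assumes the summary map is filled from a braced initializer list, which copies its elements; `ToyConstraint`, `ToyRange`, `ToySummary` and `makeToySummaryMap` are hypothetical stand-ins, not the checker's real types.

```cpp
#include <cassert>
#include <map>
#include <memory>
#include <string>
#include <vector>

// Toy stand-ins for the checker's types; none of this is the analyzer API.
struct ToyConstraint {
  virtual ~ToyConstraint() = default;
  virtual bool holds(int Value) const = 0;
};

struct ToyRange : ToyConstraint {
  int Lo, Hi;
  ToyRange(int Lo, int Hi) : Lo(Lo), Hi(Hi) {}
  bool holds(int Value) const override { return Lo <= Value && Value <= Hi; }
};

// shared_ptr is copyable, default-constructible and keeps polymorphism.
using ToyConstraintPtr = std::shared_ptr<ToyConstraint>;

struct ToySummary {
  std::vector<ToyConstraintPtr> ArgConstraints;
  ToySummary &ArgConstraint(ToyConstraintPtr VC) {
    ArgConstraints.push_back(std::move(VC));
    return *this;
  }
};

// Elements of a std::initializer_list can only be copied out of it, so the
// summary type must be copyable for this style of map initialization.
inline std::map<std::string, ToySummary> makeToySummaryMap() {
  return {
      {"isalpha",
       ToySummary().ArgConstraint(std::make_shared<ToyRange>(-1, 255))},
  };
}
```

With `std::unique_ptr` as the element type, copying a `ToySummary` out of the initializer list would not compile, which is presumably the infeasibility the comment refers to. The range `[-1, 255]` is only an illustrative value.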

/// Polymorphic base class that represents a constraint on a given argument		/// Polymorphic base class that represents a constraint on a given argument
/// (or return value) of a function. Derived classes implement different kind		/// (or return value) of a function. Derived classes implement different kind
/// of constraints, e.g range constraints or correlation between two		/// of constraints, e.g range constraints or correlation between two
/// arguments.		/// arguments.
class ValueConstraint {		class ValueConstraint {
public:		public:
ValueConstraint(ArgNo ArgN) : ArgN(ArgN) {}		ValueConstraint(ArgNo ArgN) : ArgN(ArgN) {}
virtual ~ValueConstraint() {}		virtual ~ValueConstraint() {}
/// Apply the effects of the constraint on the given program state. If null		/// Apply the effects of the constraint on the given program state. If null
/// is returned then the constraint is not feasible.		/// is returned then the constraint is not feasible.
virtual ProgramStateRef apply(ProgramStateRef State, const CallEvent &Call,		virtual ProgramStateRef apply(ProgramStateRef State, const CallEvent &Call,
const Summary &Summary) const = 0;		const Summary &Summary) const = 0;
		virtual ValueConstraintPtr negate() const {
		balazske: Is it better done with `= 0`?
		martong (Author): Not all of the constraint classes must implement this. Right now, e.g., `ComparisonConstraint` does not implement it, because there is no summary (yet) that uses a `ComparisonConstraint` as an argument constraint.
		llvm_unreachable("Not implemented");
		};
ArgNo getArgNo() const { return ArgN; }		ArgNo getArgNo() const { return ArgN; }

protected:		protected:
ArgNo ArgN; // Argument to which we apply the constraint.		ArgNo ArgN; // Argument to which we apply the constraint.
};		};

/// Given a range, should the argument stay inside or outside this range?		/// Given a range, should the argument stay inside or outside this range?
enum RangeKind { OutOfRange, WithinRange };		enum RangeKind { OutOfRange, WithinRange };
Show All 24 Lines	ProgramStateRef apply(ProgramStateRef State, const CallEvent &Call,
switch (Kind) {		switch (Kind) {
case OutOfRange:		case OutOfRange:
return applyAsOutOfRange(State, Call, Summary);		return applyAsOutOfRange(State, Call, Summary);
case WithinRange:		case WithinRange:
return applyAsWithinRange(State, Call, Summary);		return applyAsWithinRange(State, Call, Summary);
}		}
llvm_unreachable("Unknown range kind!");		llvm_unreachable("Unknown range kind!");
}		}

		ValueConstraintPtr negate() const override {
		Szelethus: Maybe `complement` would be a better name? That sounds a lot more like a set operation. Also, this function highlights well that inheritance might not be the best solution here.
		martong (Author): Well, we check the validity of an argument constraint by trying to apply its logical negation. In the case of range inclusion, this is being out of that range. In the case of non-null, this is being null. And so on. The logic for checking an argument constraint is the same for all the different kinds of constraints. And that is the point: in order to support a new kind of constraint we just have to figure out how to "apply" and "negate" one constraint. In my opinion this is a perfect case for polymorphism.
		Szelethus: We agreed on inheritance in the previous patch, and regarding the name, sure, leave it as-is. :)
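
The "apply and negate" check described in this thread can be sketched outside the analyzer. This is a toy model under stated assumptions: `ToyState` only records whether the argument has a known concrete value, and `ToyConstraint`, `ToyRange` and `isDefinitelyViolated` are hypothetical names, not part of the patch.

```cpp
#include <cassert>
#include <memory>
#include <optional>

// Toy model: the "state" only records whether the argument value is known.
struct ToyState {
  std::optional<int> KnownArg; // std::nullopt: nothing is known (symbolic)
};

class ToyConstraint {
public:
  virtual ~ToyConstraint() = default;
  // Returns the constrained state, or std::nullopt if it is infeasible.
  virtual std::optional<ToyState> apply(const ToyState &S) const = 0;
  virtual std::shared_ptr<ToyConstraint> negate() const = 0;
};

class ToyRange : public ToyConstraint {
  int Lo, Hi;
  bool Within; // true: value must be inside [Lo, Hi]; false: outside

public:
  ToyRange(int Lo, int Hi, bool Within = true)
      : Lo(Lo), Hi(Hi), Within(Within) {}

  std::optional<ToyState> apply(const ToyState &S) const override {
    if (!S.KnownArg)
      return S; // unknown value: constraint and negation are both feasible
    bool Inside = Lo <= *S.KnownArg && *S.KnownArg <= Hi;
    if (Inside == Within)
      return S;
    return std::nullopt;
  }

  std::shared_ptr<ToyConstraint> negate() const override {
    return std::make_shared<ToyRange>(Lo, Hi, !Within);
  }
};

// A violation is certain only if the negation is feasible and the
// constraint itself is not.
inline bool isDefinitelyViolated(const ToyConstraint &C, const ToyState &S) {
  std::optional<ToyState> SuccessSt = C.apply(S);
  std::optional<ToyState> FailureSt = C.negate()->apply(S);
  return FailureSt.has_value() && !SuccessSt.has_value();
}
```

For example, `isDefinitelyViolated(ToyRange(-1, 255), ToyState{300})` is true, while a state with an unknown argument is never reported, because both the constraint and its negation remain feasible. Supporting a new kind of constraint in this sketch only requires a new subclass with its own `apply` and `negate`.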
		RangeConstraint Tmp(*this);
		switch (Kind) {
		case OutOfRange:
		Tmp.Kind = WithinRange;
		break;
		case WithinRange:
		Tmp.Kind = OutOfRange;
		break;
		default:
		martong (Author): This `default` branch is not needed here (actually gives a compiler warning too).
		martong (Author): Regarding my comment above that this `default` branch is not needed: I am not sure why I thought that; we actually do need it. (Perhaps an intermediate version returned in each case.)
		llvm_unreachable("Unknown RangeConstraint kind!");
		}
		return std::make_shared<RangeConstraint>(Tmp);
		}
};		};

class ComparisonConstraint : public ValueConstraint {		class ComparisonConstraint : public ValueConstraint {
BinaryOperator::Opcode Opcode;		BinaryOperator::Opcode Opcode;
ArgNo OtherArgN;		ArgNo OtherArgN;

public:		public:
ComparisonConstraint(ArgNo ArgN, BinaryOperator::Opcode Opcode,		ComparisonConstraint(ArgNo ArgN, BinaryOperator::Opcode Opcode,
ArgNo OtherArgN)		ArgNo OtherArgN)
: ValueConstraint(ArgN), Opcode(Opcode), OtherArgN(OtherArgN) {}		: ValueConstraint(ArgN), Opcode(Opcode), OtherArgN(OtherArgN) {}
ArgNo getOtherArgNo() const { return OtherArgN; }		ArgNo getOtherArgNo() const { return OtherArgN; }
BinaryOperator::Opcode getOpcode() const { return Opcode; }		BinaryOperator::Opcode getOpcode() const { return Opcode; }
ProgramStateRef apply(ProgramStateRef State, const CallEvent &Call,		ProgramStateRef apply(ProgramStateRef State, const CallEvent &Call,
const Summary &Summary) const override;		const Summary &Summary) const override;
};		};

// Pointer to the ValueConstraint. We need a copyable, polymorphic and
// default initialize able type (vector needs that). A raw pointer was good,
// however, we cannot default initialize that. unique_ptr makes the Summary
// class non-copyable, therefore not an option. Releasing the copyability
// requirement would render the initialization of the Summary map infeasible.
using ValueConstraintPtr = std::shared_ptr<ValueConstraint>;
/// The complete list of constraints that defines a single branch.		/// The complete list of constraints that defines a single branch.
typedef std::vector<ValueConstraintPtr> ConstraintSet;		typedef std::vector<ValueConstraintPtr> ConstraintSet;

using ArgTypes = std::vector<QualType>;		using ArgTypes = std::vector<QualType>;
using Cases = std::vector<ConstraintSet>;		using Cases = std::vector<ConstraintSet>;

/// Includes information about function prototype (which is necessary to		/// Includes information about
		/// * function prototype (which is necessary to
/// ensure we're modeling the right function and casting values properly),		/// ensure we're modeling the right function and casting values properly),
/// approach to invalidation, and a list of branches - essentially, a list		/// * approach to invalidation,
/// of list of ranges - essentially, a list of lists of lists of segments.		/// * a list of branches - a list of list of ranges -
		/// A branch represents a path in the exploded graph of a function (which
		Szelethus: I think that is a rather poor example to help understand what `list of list of ranges` means :) -- Could you try to find something better?
		martong (Author): Yeah, that part definitely should be reworded.
		martong (Author): I added an example with `isalpha`.
		balazske: The "branches" are the structures that define relations between arguments and return values? This could be included in the description.
		martong (Author): Not exactly. A branch represents a path in the exploded graph of a function (which is a tree). So, a branch is a series of assumptions. In other words, branches represent split states and additional assumptions on top of the splitting assumption. I added this explanation to the comments.
		/// is a tree). So, a branch is a series of assumptions. In other words,
		/// branches represent split states and additional assumptions on top of
		/// the splitting assumption.
		/// For example, consider the branches in `isalpha(x)`
		/// Branch 1)
		/// x is in range ['A', 'Z'] or in ['a', 'z']
		/// then the return value is not 0. (I.e. out-of-range [0, 0])
		/// Branch 2)
		/// x is out-of-range ['A', 'Z'] and out-of-range ['a', 'z']
		/// then the return value is 0.
		/// * a list of argument constraints, that must be true on every branch.
		/// If these constraints are not satisfied that means a fatal error
		/// usually resulting in undefined behaviour.
struct Summary {		struct Summary {
const ArgTypes ArgTys;		const ArgTypes ArgTys;
const QualType RetTy;		const QualType RetTy;
const InvalidationKind InvalidationKd;		const InvalidationKind InvalidationKd;
Cases CaseConstraints;		Cases CaseConstraints;
ConstraintSet ArgConstraints;		ConstraintSet ArgConstraints;

Summary(ArgTypes ArgTys, QualType RetTy, InvalidationKind InvalidationKd)		Summary(ArgTypes ArgTys, QualType RetTy, InvalidationKind InvalidationKd)
: ArgTys(ArgTys), RetTy(RetTy), InvalidationKd(InvalidationKd) {}		: ArgTys(ArgTys), RetTy(RetTy), InvalidationKd(InvalidationKd) {}

Summary &Case(ConstraintSet&& CS) {		Summary &Case(ConstraintSet&& CS) {
CaseConstraints.push_back(std::move(CS));		CaseConstraints.push_back(std::move(CS));
return *this;		return *this;
}		}
		Summary &ArgConstraint(ValueConstraintPtr VC) {
		ArgConstraints.push_back(VC);
		return *this;
		}

private:		private:
static void assertTypeSuitableForSummary(QualType T) {		static void assertTypeSuitableForSummary(QualType T) {
assert(!T->isVoidType() &&		assert(!T->isVoidType() &&
"We should have had no significant void types in the spec");		"We should have had no significant void types in the spec");
assert(T.isCanonical() &&		assert(T.isCanonical() &&
"We should only have canonical types in the spec");		"We should only have canonical types in the spec");
// FIXME: lift this assert (but not the ones above!)		// FIXME: lift this assert (but not the ones above!)
Show All 19 Lines	class StdLibraryFunctionsChecker
// C++ function overloads, and also it can be used when the same function		// C++ function overloads, and also it can be used when the same function
// may have different definitions on different platforms.		// may have different definitions on different platforms.
typedef std::vector<Summary> Summaries;		typedef std::vector<Summary> Summaries;

// The map of all functions supported by the checker. It is initialized		// The map of all functions supported by the checker. It is initialized
// lazily, and it doesn't change after initialization.		// lazily, and it doesn't change after initialization.
mutable llvm::StringMap<Summaries> FunctionSummaryMap;		mutable llvm::StringMap<Summaries> FunctionSummaryMap;

		mutable std::unique_ptr<BugType> BT_InvalidArg;
		Szelethus: By passing `this`, the error message will be tied to the modeling checker, not to the one you just added. `BugType` has a constructor that accepts a string instead, pass `CheckNames[CK_StdCLibraryFunctionArgsChecker]` in there :) Also, how about `BT_InvalidArgument` or something?
		martong (Author): Thanks, good catch, I did not know about that. Please note that using `CheckNames` requires that we change the `BT` member to be lazily initialized. `CheckNames` is initialized only after the checker itself is created, so we cannot initialize `BT` during the checker's construction, because that would be before `CheckNames` is set. So, I changed `BT` to be a `unique_ptr`, and it is lazily initialized in `reportBug`.

// Auxiliary functions to support ArgNo within all structures		// Auxiliary functions to support ArgNo within all structures
// in a unified manner.		// in a unified manner.
static QualType getArgType(const Summary &Summary, ArgNo ArgN) {		static QualType getArgType(const Summary &Summary, ArgNo ArgN) {
return Summary.getArgType(ArgN);		return Summary.getArgType(ArgN);
}		}
static QualType getArgType(const CallEvent &Call, ArgNo ArgN) {		static QualType getArgType(const CallEvent &Call, ArgNo ArgN) {
return ArgN == Ret ? Call.getResultType().getCanonicalType()		return ArgN == Ret ? Call.getResultType().getCanonicalType()
: Call.getArgExpr(ArgN)->getType().getCanonicalType();		: Call.getArgExpr(ArgN)->getType().getCanonicalType();
}		}
static QualType getArgType(const CallExpr *CE, ArgNo ArgN) {		static QualType getArgType(const CallExpr *CE, ArgNo ArgN) {
return ArgN == Ret ? CE->getType().getCanonicalType()		return ArgN == Ret ? CE->getType().getCanonicalType()
: CE->getArg(ArgN)->getType().getCanonicalType();		: CE->getArg(ArgN)->getType().getCanonicalType();
}		}
static SVal getArgSVal(const CallEvent &Call, ArgNo ArgN) {		static SVal getArgSVal(const CallEvent &Call, ArgNo ArgN) {
return ArgN == Ret ? Call.getReturnValue() : Call.getArgSVal(ArgN);		return ArgN == Ret ? Call.getReturnValue() : Call.getArgSVal(ArgN);
}		}

public:		public:
		void checkPreCall(const CallEvent &Call, CheckerContext &C) const;
void checkPostCall(const CallEvent &Call, CheckerContext &C) const;		void checkPostCall(const CallEvent &Call, CheckerContext &C) const;
bool evalCall(const CallEvent &Call, CheckerContext &C) const;		bool evalCall(const CallEvent &Call, CheckerContext &C) const;

		enum CheckKind { CK_StdCLibraryFunctionArgsChecker, CK_NumCheckKinds };
		DefaultBool ChecksEnabled[CK_NumCheckKinds];
		CheckerNameRef CheckNames[CK_NumCheckKinds];

private:		private:
Optional<Summary> findFunctionSummary(const FunctionDecl *FD,		Optional<Summary> findFunctionSummary(const FunctionDecl *FD,
const CallExpr *CE,		const CallExpr *CE,
CheckerContext &C) const;		CheckerContext &C) const;
		Optional<Summary> findFunctionSummary(const CallEvent &Call,
		CheckerContext &C) const;

void initFunctionSummaries(CheckerContext &C) const;		void initFunctionSummaries(CheckerContext &C) const;

		void reportBug(const CallEvent &Call, ExplodedNode *N,
		balazske: This should be called `reportBug`.
		martong (Author): Yeah, can't get used to this strange naming convention that LLVM uses. Fixed it.
		CheckerContext &C) const {
		if (!ChecksEnabled[CK_StdCLibraryFunctionArgsChecker])
		return;
		// TODO Add detailed diagnostic.
		StringRef Msg = "Function argument constraint is not satisfied";
		if (!BT_InvalidArg)
		BT_InvalidArg = std::make_unique<BugType>(
		CheckNames[CK_StdCLibraryFunctionArgsChecker],
		"Unsatisfied argument constraints", categories::LogicError);
		auto R = std::make_unique<PathSensitiveBugReport>(*BT_InvalidArg, Msg, N);
		bugreporter::trackExpressionValue(N, Call.getArgExpr(0), *R);
		C.emitReport(std::move(R));
		}
};		};

const StdLibraryFunctionsChecker::ArgNo StdLibraryFunctionsChecker::Ret =		const StdLibraryFunctionsChecker::ArgNo StdLibraryFunctionsChecker::Ret =
std::numeric_limits<ArgNo>::max();		std::numeric_limits<ArgNo>::max();

} // end of anonymous namespace		} // end of anonymous namespace

ProgramStateRef StdLibraryFunctionsChecker::RangeConstraint::applyAsOutOfRange(		ProgramStateRef StdLibraryFunctionsChecker::RangeConstraint::applyAsOutOfRange(
▲ Show 20 Lines • Show All 97 Lines • ▼ Show 20 Lines	ProgramStateRef StdLibraryFunctionsChecker::ComparisonConstraint::apply(
// Note: we avoid integral promotion for comparison.		// Note: we avoid integral promotion for comparison.
OtherV = SVB.evalCast(OtherV, T, OtherT);		OtherV = SVB.evalCast(OtherV, T, OtherT);
if (auto CompV = SVB.evalBinOp(State, Op, V, OtherV, CondT)		if (auto CompV = SVB.evalBinOp(State, Op, V, OtherV, CondT)
.getAs<DefinedOrUnknownSVal>())		.getAs<DefinedOrUnknownSVal>())
State = State->assume(*CompV, true);		State = State->assume(*CompV, true);
return State;		return State;
}		}

void StdLibraryFunctionsChecker::checkPostCall(const CallEvent &Call,		void StdLibraryFunctionsChecker::checkPreCall(const CallEvent &Call,
CheckerContext &C) const {		CheckerContext &C) const {
const FunctionDecl *FD = dyn_cast_or_null<FunctionDecl>(Call.getDecl());		Optional<Summary> FoundSummary = findFunctionSummary(Call, C);
if (!FD)		if (!FoundSummary)
return;		return;

const CallExpr *CE = dyn_cast_or_null<CallExpr>(Call.getOriginExpr());		const Summary &Summary = *FoundSummary;
if (!CE)		ProgramStateRef State = C.getState();
return;

Optional<Summary> FoundSummary = findFunctionSummary(FD, CE, C);		for (const ValueConstraintPtr& VC : Summary.ArgConstraints) {
		ProgramStateRef SuccessSt = VC->apply(State, Call, Summary);
		ProgramStateRef FailureSt = VC->negate()->apply(State, Call, Summary);
		// The argument constraint is not satisfied.
		Szelethus: That is a `TODO`, rather :^)
		if (FailureSt && !SuccessSt) {
		steakhal: `StringRef`
		if (ExplodedNode *N = C.generateErrorNode(State))
		xazax.hun: Dealing with only concrete ints might be a good start, but we might want to handle symbolic cases in the future like: `if (v > 255) return isalpha(v);` I am OK with not adding this in the first version, but adding TODOs and test cases upfront cannot hurt. So basically, I was wondering if we should query the solver for the result instead of matching the sval kind, and just return early if we do not want to support a specific kind.
		balazske: This check now works with concrete int values. We have a known value and a list of ranges with known limits, so testing whether the value is in any of the ranges works the same way as testing whether it is outside all of the ranges. And testing if the value is inside one of the ranges is simpler code. But I think the symbolic evaluation with the "eval" and "assume" functions would be more generic here (and simpler code). Then the way of cutting off the bad ranges is usable (there is probably still another solution).
		reportBug(Call, N, C);
		break;
		} else {
		Szelethus: While I find your usage of lambdas fascinating, this one seems a bit unnecessary :)
		martong (Author): Ok, I moved it to be a member function named `ReportBug`.
		// Apply the constraint even if we cannot reason about the argument. This
		// means both SuccessSt and FailureSt can be true. If we weren't applying
		// the constraint that would mean that symbolic execution continues on a
		// code whose behaviour is undefined.
		assert(SuccessSt);
		C.addTransition(SuccessSt);
		}
		NoQ: Yeah, a must-have for this check to be enabled by default would be to be able to provide a specific warning message for every function. I guess we could include them in the summaries as an extra argument of `ArgConstraint`.
		martong (Author): What about warning messages with placeholders? E.g. "Argument constraint of {nth} argument of {fun} is not satisfied. The value is {v}, however it should be in {range}." There will be a bunch of functions whose warning message template would be the same. On the other hand, some others could have different warnings, and that justifies the need for specialized warnings. Still, I think the warning message in the summary should be optional, because otherwise it would be really hard to automatically add summaries from other sources (like from cppcheck). No matter how it turns out, this should be handled in a different patch.
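
The placeholder idea proposed above could look roughly like the following sketch. It is not part of the patch: `fillTemplate` is a hypothetical helper and the example values are made up for illustration.

```cpp
#include <cassert>
#include <map>
#include <string>

// Hypothetical helper: replaces every "{key}" in Template with Values[key].
// Unknown placeholders are left untouched. Substituted values are assumed
// not to contain placeholders themselves.
inline std::string
fillTemplate(std::string Template,
             const std::map<std::string, std::string> &Values) {
  for (const auto &KV : Values) {
    const std::string Placeholder = "{" + KV.first + "}";
    std::string::size_type Pos = 0;
    while ((Pos = Template.find(Placeholder, Pos)) != std::string::npos) {
      Template.replace(Pos, Placeholder.size(), KV.second);
      Pos += KV.second.size(); // continue after the inserted text
    }
  }
  return Template;
}
```

A shared default template could then serve most summaries, with an optional per-summary override for functions that need a specialized message, which matches the "optional" requirement raised in the comment.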
		}
		}

		NoQ: Let's test our notes. That'll be especially important when we get to non-concrete values, because the visitor might need to be expanded (or we might need a completely new visitor).
		martong (Author): Ok, I added a separate test file where the tests focus on the bug path.
		void StdLibraryFunctionsChecker::checkPostCall(const CallEvent &Call,
		CheckerContext &C) const {
		Optional<Summary> FoundSummary = findFunctionSummary(Call, C);
		NoQ: Maybe we should add an assertion that the same argument isn't specified multiple times.
		martong (Author): I think there could be cases when we want to have e.g. a not-null constraint on the 1st argument, but we also want to express that the 1st argument's size is described by the 2nd argument. I am planning to implement such constraints in the future. In that case we would have two constraints on the 1st argument and the assert would fire.
		NoQ: Wait, i misunderstood the code. It's even worse than that: you're adding transitions in a loop, so it'll cause state splits for every constraint. Because you do not intend to create multiple branches here, there needs to be exactly one `addTransition` performed every time `checkPreCall` is called. I.e., for now this code is breaking everything whenever there's more than one constraint, regardless of whether it's on the same argument.
		martong (Author): Yeah, that's a very good catch, thanks! I am going to prepare a patch to fix this soon. My idea is to store the `SuccessSt` and apply the next argument constraint on that. And once the loop is finished I'll call `addTransition()`.
		NoQ: Yup, that's the common thing to do in such cases.
		NoQ: While we're at it, could you try to come up with a runtime assertion that'll help us prevent these mistakes? Like, dunno, make `CheckerContext` crash whenever there's more than one branch being added, and then add a method to opt out when it's actually necessary to add more transitions (i.e., the user would say `C.setMaxTransitions(2)` at the beginning of their checker callback whenever they need to make a state split, defaulting to 1). It's a bit tricky because i still want to allow multiple transitions when they allow one branch (i.e., transitions chained together) but i think it'll take a lot of review anxiety from me because it's a very dangerous mistake to make and for now code review is the only way to catch it. So, yay, faster code reviews.
		martong (Author): Hmm, I see your point and I agree this would be a valuable sanity check. But if you don't mind, I'd like to address this in a different, stand-alone patch (independently from the quick-fix https://reviews.llvm.org/D76790) because it does not seem trivial to me. My first concern is this: if we have `1` as the default value for `maxTransitions`, then we should add an extra `C.setMaxTransitions(N)` in every checker callback that does a state split, is that right?
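
The fix agreed on in this thread (carry `SuccessSt` through the loop and add a single transition at the end) can be modeled with a small standalone sketch. Everything here is a toy assumption: `ToyState` is just an integer interval, `ToyContext` merely counts calls, and an infeasible constraint is treated as a definite violation, which simplifies the `FailureSt && !SuccessSt` test in the patch.

```cpp
#include <cassert>
#include <functional>
#include <optional>
#include <vector>

// Toy state: inclusive bounds currently known for the argument.
struct ToyState {
  int Lo, Hi;
};

// A constraint narrows the state or reports infeasibility via std::nullopt.
using ToyConstraint = std::function<std::optional<ToyState>(const ToyState &)>;

// Counts what a real CheckerContext would record.
struct ToyContext {
  int Transitions = 0;
  int Reports = 0;
  void addTransition(const ToyState &) { ++Transitions; }
  void reportBug() { ++Reports; }
};

inline ToyConstraint withinRange(int Lo, int Hi) {
  return [Lo, Hi](const ToyState &S) -> std::optional<ToyState> {
    int NewLo = S.Lo > Lo ? S.Lo : Lo;
    int NewHi = S.Hi < Hi ? S.Hi : Hi;
    if (NewLo > NewHi)
      return std::nullopt; // no value can satisfy the constraint
    return ToyState{NewLo, NewHi};
  };
}

inline void checkPreCallSketch(const std::vector<ToyConstraint> &ArgConstraints,
                               ToyState State, ToyContext &C) {
  for (const ToyConstraint &VC : ArgConstraints) {
    std::optional<ToyState> SuccessSt = VC(State);
    if (!SuccessSt) {
      C.reportBug(); // report and stop; no transition is added
      return;
    }
    State = *SuccessSt; // carry the narrowed state to the next constraint
  }
  C.addTransition(State); // exactly one transition per callback
}
```

With two constraints and a satisfiable starting state, the sketch records one transition and no report, whereas adding a transition inside the loop would have recorded two and split the state once per constraint.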
if (!FoundSummary)		if (!FoundSummary)
return;		return;

		balazske: Is this `addTransition` needed? It would be OK to call `generateErrorNode` with `State`. Even if not, adding the transition before should not be needed?
		martong (Author): Yes, you are right, it is superfluous; I removed it.
// Now apply the constraints.		// Now apply the constraints.
const Summary &Summary = *FoundSummary;		const Summary &Summary = *FoundSummary;
ProgramStateRef State = C.getState();		ProgramStateRef State = C.getState();

// Apply case/branch specifications.		// Apply case/branch specifications.
for (const auto &VRS : Summary.CaseConstraints) {		for (const auto &VRS : Summary.CaseConstraints) {
ProgramStateRef NewState = State;		ProgramStateRef NewState = State;
for (const auto &VR: VRS) {		for (const auto &VR: VRS) {
NewState = VR->apply(NewState, Call, Summary);		NewState = VR->apply(NewState, Call, Summary);
if (!NewState)		if (!NewState)
break;		break;
}		}

if (NewState && NewState != State)		if (NewState && NewState != State)
C.addTransition(NewState);		C.addTransition(NewState);
}		}
}		}

bool StdLibraryFunctionsChecker::evalCall(const CallEvent &Call,		bool StdLibraryFunctionsChecker::evalCall(const CallEvent &Call,
CheckerContext &C) const {		CheckerContext &C) const {
const auto *FD = dyn_cast_or_null<FunctionDecl>(Call.getDecl());		Optional<Summary> FoundSummary = findFunctionSummary(Call, C);
if (!FD)
return false;

const auto *CE = dyn_cast_or_null<CallExpr>(Call.getOriginExpr());
if (!CE)
return false;

Optional<Summary> FoundSummary = findFunctionSummary(FD, CE, C);
if (!FoundSummary)		if (!FoundSummary)
return false;		return false;

const Summary &Summary = *FoundSummary;		const Summary &Summary = *FoundSummary;
switch (Summary.InvalidationKd) {		switch (Summary.InvalidationKd) {
case EvalCallAsPure: {		case EvalCallAsPure: {
ProgramStateRef State = C.getState();		ProgramStateRef State = C.getState();
const LocationContext *LC = C.getLocationContext();		const LocationContext *LC = C.getLocationContext();
		const auto *CE = cast_or_null<CallExpr>(Call.getOriginExpr());
SVal V = C.getSValBuilder().conjureSymbolVal(		SVal V = C.getSValBuilder().conjureSymbolVal(
CE, LC, CE->getType().getCanonicalType(), C.blockCount());		CE, LC, CE->getType().getCanonicalType(), C.blockCount());
State = State->BindExpr(CE, LC, V);		State = State->BindExpr(CE, LC, V);
C.addTransition(State);		C.addTransition(State);
return true;		return true;
}		}
case NoEvalCall:		case NoEvalCall:
// Summary tells us to avoid performing eval::Call. The function is possibly		// Summary tells us to avoid performing eval::Call. The function is possibly
// evaluated by another checker, or evaluated conservatively.		// evaluated by another checker, or evaluated conservatively.
return false;		return false;
}		}
llvm_unreachable("Unknown invalidation kind!");		llvm_unreachable("Unknown invalidation kind!");
		balazske: If `evalCall` is used, it could be simpler to test and apply the constraints for arguments and return value in a "single step".
}		}

bool StdLibraryFunctionsChecker::Summary::matchesCall(		bool StdLibraryFunctionsChecker::Summary::matchesCall(
const CallExpr *CE) const {		const CallExpr *CE) const {
// Check number of arguments:		// Check number of arguments:
if (CE->getNumArgs() != ArgTys.size())		if (CE->getNumArgs() != ArgTys.size())
return false;		return false;

[... 51 lines elided ...]	StdLibraryFunctionsChecker::findFunctionSummary(const FunctionDecl *FD,
const Summaries &SpecVariants = FSMI->second;		const Summaries &SpecVariants = FSMI->second;
for (const Summary &Spec : SpecVariants)		for (const Summary &Spec : SpecVariants)
if (Spec.matchesCall(CE))		if (Spec.matchesCall(CE))
return Spec;		return Spec;

return None;		return None;
}		}

		Optional<StdLibraryFunctionsChecker::Summary>
		StdLibraryFunctionsChecker::findFunctionSummary(const CallEvent &Call,
		CheckerContext &C) const {
		const FunctionDecl *FD = dyn_cast_or_null<FunctionDecl>(Call.getDecl());
		if (!FD)
		return None;
		const CallExpr *CE = dyn_cast_or_null<CallExpr>(Call.getOriginExpr());
		if (!CE)
		return None;
		return findFunctionSummary(FD, CE, C);
		}

void StdLibraryFunctionsChecker::initFunctionSummaries(		void StdLibraryFunctionsChecker::initFunctionSummaries(
CheckerContext &C) const {		CheckerContext &C) const {
if (!FunctionSummaryMap.empty())		if (!FunctionSummaryMap.empty())
return;		return;

SValBuilder &SVB = C.getSValBuilder();		SValBuilder &SVB = C.getSValBuilder();
BasicValueFactory &BVF = SVB.getBasicValueFactory();		BasicValueFactory &BVF = SVB.getBasicValueFactory();
const ASTContext &ACtx = BVF.getContext();		const ASTContext &ACtx = BVF.getContext();
[... 78 lines elided ...]	auto Range = [](RangeInt b, RangeInt e) {
return IntRangeVector{std::pair<RangeInt, RangeInt>{b, e}};		return IntRangeVector{std::pair<RangeInt, RangeInt>{b, e}};
};		};
auto SingleValue = [](RangeInt v) {		auto SingleValue = [](RangeInt v) {
return IntRangeVector{std::pair<RangeInt, RangeInt>{v, v}};		return IntRangeVector{std::pair<RangeInt, RangeInt>{v, v}};
};		};
auto LessThanOrEq = BO_LE;		auto LessThanOrEq = BO_LE;

using RetType = QualType;		using RetType = QualType;

// Templates for summaries that are reused by many functions.		// Templates for summaries that are reused by many functions.
auto Getc = [&]() {		auto Getc = [&]() {
return Summary(ArgTypes{Irrelevant}, RetType{IntTy}, NoEvalCall)		return Summary(ArgTypes{Irrelevant}, RetType{IntTy}, NoEvalCall)
.Case({ReturnValueCondition(WithinRange,		.Case({ReturnValueCondition(WithinRange,
{{EOFv, EOFv}, {0, UCharRangeMax}})});		{{EOFv, EOFv}, {0, UCharRangeMax}})});
};		};
auto Read = [&](RetType R, RangeInt Max) {		auto Read = [&](RetType R, RangeInt Max) {
return Summary(ArgTypes{Irrelevant, Irrelevant, SizeTy}, RetType{R},		return Summary(ArgTypes{Irrelevant, Irrelevant, SizeTy}, RetType{R},
[... 11 lines elided ...]	void StdLibraryFunctionsChecker::initFunctionSummaries(
auto Getline = [&](RetType R, RangeInt Max) {		auto Getline = [&](RetType R, RangeInt Max) {
return Summary(ArgTypes{Irrelevant, Irrelevant, Irrelevant}, RetType{R},		return Summary(ArgTypes{Irrelevant, Irrelevant, Irrelevant}, RetType{R},
NoEvalCall)		NoEvalCall)
.Case({ReturnValueCondition(WithinRange, {{-1, -1}, {1, Max}})});		.Case({ReturnValueCondition(WithinRange, {{-1, -1}, {1, Max}})});
};		};

FunctionSummaryMap = {		FunctionSummaryMap = {
// The isascii() family of functions.		// The isascii() family of functions.
		// The behavior is undefined if the value of the argument is not
		// representable as unsigned char or is not equal to EOF. See e.g. C99
		// 7.4.1.2 The isalpha function (p: 181-182).
		Szelethus: This is true for the rest of the summaries as well, but shouldn't we retrieve the `unsigned char` size from `ASTContext`?
		martong (author): Yes, this is a good idea; I will do this. What really bothers me, however, is that we should handle EOF in a platform-dependent way as well, and I have no idea how to do that, given that it is defined by a macro in a platform-specific header file. I badly need help and ideas on how we could get the value of EOF for the analyzed platform.
		gamesh411: If EOF is not used in the TU being analyzed, then there would be no way to find the specific `#define`. Another approach would be to check whether the value is defined by an expression that is the EOF define (maybe transitively?).
		martong (author): I believe that the given standard C library implementation (e.g. glibc) must provide a header for the prototypes of these functions, where EOF is also defined transitively in one of the dependent system headers. Otherwise user code could misuse the value of EOF, and the program would have undefined behavior. C99 clearly states that you should `#include <ctype.h>` to use `isalpha`.
{		{
"isalnum",		"isalnum",
Summaries{		Summaries{
Summary(ArgTypes{IntTy}, RetType{IntTy}, EvalCallAsPure)		Summary(ArgTypes{IntTy}, RetType{IntTy}, EvalCallAsPure)
// Boils down to isupper() or islower() or isdigit().		// Boils down to isupper() or islower() or isdigit().
.Case(		.Case(
{ArgumentCondition(0U, WithinRange,		{ArgumentCondition(0U, WithinRange,
{{'0', '9'}, {'A', 'Z'}, {'a', 'z'}}),		{{'0', '9'}, {'A', 'Z'}, {'a', 'z'}}),
ReturnValueCondition(OutOfRange, SingleValue(0))})		ReturnValueCondition(OutOfRange, SingleValue(0))})
// The locale-specific range.		// The locale-specific range.
// No post-condition. We are completely unaware of		// No post-condition. We are completely unaware of
// locale-specific return values.		// locale-specific return values.
.Case({ArgumentCondition(0U, WithinRange,		.Case({ArgumentCondition(0U, WithinRange,
{{128, UCharRangeMax}})})		{{128, UCharRangeMax}})})
		balazske: Why is `{128, UCharMax}` needed here and in the next entry?
		martong (author): This is the locale-specific range, [128, 255]. There are characters like `ä` for which we don't know whether they are treated as alphanumeric or not. We can't really tell how a specific libc implementation classifies them. For English letters, on the other hand, we can state the classes confidently.
.Case({ArgumentCondition(0U, OutOfRange,		.Case({ArgumentCondition(0U, OutOfRange,
{{'0', '9'},		{{'0', '9'},
{'A', 'Z'},		{'A', 'Z'},
{'a', 'z'},		{'a', 'z'},
{128, UCharRangeMax}}),		{128, UCharRangeMax}}),
ReturnValueCondition(WithinRange, SingleValue(0))})},		ReturnValueCondition(WithinRange, SingleValue(0))})
		.ArgConstraint(ArgumentCondition(
		0U, WithinRange, {{EOFv, EOFv}, {0, UCharRangeMax}}))},
		balazske: Is this `ArgConstraint` intentionally added only to `isalnum`?
		martong (author): Yes, I wanted to create the infrastructure first and then later add all these constraints to the rest of the summaries, with new tests.
},		},
{		{
"isalpha",		"isalpha",
Summaries{		Summaries{
Summary(ArgTypes{IntTy}, RetType{IntTy}, EvalCallAsPure)		Summary(ArgTypes{IntTy}, RetType{IntTy}, EvalCallAsPure)
.Case({ArgumentCondition(0U, WithinRange,		.Case({ArgumentCondition(0U, WithinRange,
{{'A', 'Z'}, {'a', 'z'}}),		{{'A', 'Z'}, {'a', 'z'}}),
ReturnValueCondition(OutOfRange, SingleValue(0))})		ReturnValueCondition(OutOfRange, SingleValue(0))})
[... 161 lines elided ...]	FunctionSummaryMap = {
{"getline", Summaries{Getline(IntTy, IntMax), Getline(LongTy, LongMax),		{"getline", Summaries{Getline(IntTy, IntMax), Getline(LongTy, LongMax),
Getline(LongLongTy, LongLongMax)}},		Getline(LongLongTy, LongLongMax)}},
{"getdelim", Summaries{Getline(IntTy, IntMax), Getline(LongTy, LongMax),		{"getdelim", Summaries{Getline(IntTy, IntMax), Getline(LongTy, LongMax),
Getline(LongLongTy, LongLongMax)}},		Getline(LongLongTy, LongLongMax)}},
};		};
}		}
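
The `isalnum` summary above can be read as a small decision procedure: first the argument constraint, then the three cases. The following is a minimal, Clang-free sketch of that reading and not the checker's actual code. `modelIsAlnum` and `IsAlnumResult` are hypothetical names, `EOFv` is assumed to be -1 (as in the tests of this revision), and `UCharRangeMax` is taken from the host's `unsigned char` as a stand-in for a value that the checker would obtain for the analyzed target.

```cpp
#include <cassert>
#include <limits>
#include <utility>
#include <vector>

// Hypothetical stand-ins for the checker's RangeInt / IntRangeVector types.
using RangeInt = long long;
using IntRangeVector = std::vector<std::pair<RangeInt, RangeInt>>;

// Assumption: EOF is -1, as defined in the test files of this revision.
constexpr RangeInt EOFv = -1;
// Assumption: the host's unsigned char stands in for the analyzed target's.
constexpr RangeInt UCharRangeMax = std::numeric_limits<unsigned char>::max();

// Returns true if V lies in at least one of the closed ranges.
bool withinRange(RangeInt V, const IntRangeVector &Ranges) {
  for (const auto &R : Ranges) {
    assert(R.first <= R.second && "malformed range");
    if (R.first <= V && V <= R.second)
      return true;
  }
  return false;
}

enum class IsAlnumResult { ConstraintViolated, NonZero, Zero, Unknown };

// Mirrors the order of the summary: argument constraint first, then cases.
IsAlnumResult modelIsAlnum(RangeInt Arg) {
  // ArgConstraint: the argument must be EOF or representable as unsigned char.
  if (!withinRange(Arg, IntRangeVector{{EOFv, EOFv}, {0, UCharRangeMax}}))
    return IsAlnumResult::ConstraintViolated;
  // Case 1: ASCII digits and letters, the return value is non-zero.
  if (withinRange(Arg, IntRangeVector{{'0', '9'}, {'A', 'Z'}, {'a', 'z'}}))
    return IsAlnumResult::NonZero;
  // Case 2: the locale-specific range, no post-condition on the return value.
  if (withinRange(Arg, IntRangeVector{{128, UCharRangeMax}}))
    return IsAlnumResult::Unknown;
  // Case 3: everything else that is allowed, the return value is zero.
  return IsAlnumResult::Zero;
}
```

With these assumptions, `modelIsAlnum(256)` reports a violated constraint, which corresponds to the `isalnum(256)` call in the new test file.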

void ento::registerStdCLibraryFunctionsChecker(CheckerManager &mgr) {		void ento::registerStdCLibraryFunctionsChecker(CheckerManager &mgr) {
// If this checker grows large enough to support C++, Objective-C, or other
// standard libraries, we could use multiple register...Checker() functions,
// which would register various checkers with the help of the same Checker
// class, turning on different function summaries.
mgr.registerChecker<StdLibraryFunctionsChecker>();		mgr.registerChecker<StdLibraryFunctionsChecker>();
}		}

bool ento::shouldRegisterStdCLibraryFunctionsChecker(const LangOptions &LO) {		bool ento::shouldRegisterStdCLibraryFunctionsChecker(const LangOptions &LO) {
return true;		return true;
}		}

		#define REGISTER_CHECKER(name) \
		void ento::register##name(CheckerManager &mgr) { \
		StdLibraryFunctionsChecker *checker = \
		mgr.getChecker<StdLibraryFunctionsChecker>(); \
		checker->ChecksEnabled[StdLibraryFunctionsChecker::CK_##name] = true; \
		checker->CheckNames[StdLibraryFunctionsChecker::CK_##name] = \
		mgr.getCurrentCheckerName(); \
		} \
		\
		bool ento::shouldRegister##name(const LangOptions &LO) { return true; }

		REGISTER_CHECKER(StdCLibraryFunctionArgsChecker)
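
The `REGISTER_CHECKER` macro above follows the pattern in which one checker class hosts several user-facing checkers, each enabled by its own flag. The sketch below models only that bookkeeping, without any Clang dependency. `MiniChecker`, `registerSubChecker`, and `CK_NumCheckKinds` are hypothetical names chosen for illustration; only `ChecksEnabled`, `CheckNames`, and the `CK_` prefix come from the diff.

```cpp
#include <array>
#include <cassert>
#include <string>

// Hypothetical, Clang-free model of a checker class with sub-checkers.
class MiniChecker {
public:
  // One enumerator per user-facing sub-checker; the last one is the count.
  enum CheckKind { CK_StdCLibraryFunctionArgsChecker, CK_NumCheckKinds };

  std::array<bool, CK_NumCheckKinds> ChecksEnabled{};        // all false
  std::array<std::string, CK_NumCheckKinds> CheckNames{};    // all empty

  // A report should only be emitted if the matching sub-checker is enabled.
  bool shouldReport(CheckKind Kind) const { return ChecksEnabled[Kind]; }
};

// Plays the role of the generated register function: it does not create a
// new checker object, it only flips a flag on the existing one.
void registerSubChecker(MiniChecker &Checker, MiniChecker::CheckKind Kind,
                        const std::string &Name) {
  assert(Kind < MiniChecker::CK_NumCheckKinds && "unknown check kind");
  Checker.ChecksEnabled[Kind] = true;
  Checker.CheckNames[Kind] = Name;
}
```

The design choice is that modeling stays always on in the shared class, while warnings are gated by the per-checker flag.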

clang/test/Analysis/analyzer-enabled-checkers.c

	// RUN: %clang --analyze %s --target=x86_64-pc-linux-gnu \			// RUN: %clang --analyze %s --target=x86_64-pc-linux-gnu \
	// RUN: -Xclang -analyzer-list-enabled-checkers \			// RUN: -Xclang -analyzer-list-enabled-checkers \
	// RUN: -Xclang -analyzer-display-progress \			// RUN: -Xclang -analyzer-display-progress \
	// RUN: 2>&1 \| FileCheck %s --implicit-check-not=ANALYZE \			// RUN: 2>&1 \| FileCheck %s --implicit-check-not=ANALYZE \
	// RUN: --implicit-check-not=\.			// RUN: --implicit-check-not=\.

	// CHECK: OVERVIEW: Clang Static Analyzer Enabled Checkers List			// CHECK: OVERVIEW: Clang Static Analyzer Enabled Checkers List
	// CHECK-EMPTY:			// CHECK-EMPTY:
	// CHECK-NEXT: apiModeling.StdCLibraryFunctions			// CHECK-NEXT: apiModeling.StdCLibraryFunctions
				// CHECK-NEXT: apiModeling.StdCLibraryFunctionArgs
	// CHECK-NEXT: apiModeling.TrustNonnull			// CHECK-NEXT: apiModeling.TrustNonnull
	// CHECK-NEXT: apiModeling.llvm.CastValue			// CHECK-NEXT: apiModeling.llvm.CastValue
	// CHECK-NEXT: apiModeling.llvm.ReturnValue			// CHECK-NEXT: apiModeling.llvm.ReturnValue
	// CHECK-NEXT: core.CallAndMessage			// CHECK-NEXT: core.CallAndMessage
	// CHECK-NEXT: core.DivideZero			// CHECK-NEXT: core.DivideZero
	// CHECK-NEXT: core.DynamicTypePropagation			// CHECK-NEXT: core.DynamicTypePropagation
	// CHECK-NEXT: core.NonNullParamChecker			// CHECK-NEXT: core.NonNullParamChecker
	// CHECK-NEXT: core.NonnilStringConstants			// CHECK-NEXT: core.NonnilStringConstants
	[... 37 lines elided ...]

clang/test/Analysis/std-c-library-functions-arg-constraints.c

This file was added.

				// Check the basic reporting/warning and the application of constraints.
				// RUN: %clang_analyze_cc1 %s \
				// RUN: -analyzer-checker=core \
				// RUN: -analyzer-checker=apiModeling.StdCLibraryFunctions \
				// RUN: -analyzer-checker=apiModeling.StdCLibraryFunctionArgs \
				// RUN: -analyzer-checker=debug.ExprInspection \
				// RUN: -triple x86_64-unknown-linux-gnu \
				// RUN: -verify=report
				Szelethus: Hmm, why do we have two different test files that essentially do the same thing? Shouldn't we have only a single one with `analyzer-output=text`?
				martong (author): No, I wanted to have two different test files to test two different things: (1) that the constraints are applied (here we don't care about the warnings and the path), and (2) that we have a warning with the proper tracking and notes.
				Szelethus: What if we had different `-verify` prefixes? `clang/test/Analysis/track-conditions.cpp` is a great example.
				martong (author): Yeah, that's a very good approach, I just changed it like that. :)

				// Check the bugpath related to the reports.
				// RUN: %clang_analyze_cc1 %s \
				// RUN: -analyzer-checker=core \
				// RUN: -analyzer-checker=apiModeling.StdCLibraryFunctions \
				// RUN: -analyzer-checker=apiModeling.StdCLibraryFunctionArgs \
				// RUN: -analyzer-checker=debug.ExprInspection \
				// RUN: -triple x86_64-unknown-linux-gnu \
				// RUN: -analyzer-output=text \
				// RUN: -verify=bugpath

				void clang_analyzer_eval(int);

				int glob;

				#define EOF -1

				int isalnum(int);

				void test_alnum_concrete(int v) {
				int ret = isalnum(256); // \
				// report-warning{{Function argument constraint is not satisfied}} \
				// bugpath-warning{{Function argument constraint is not satisfied}} \
				// bugpath-note{{Function argument constraint is not satisfied}}
				(void)ret;
				}

				void test_alnum_symbolic(int x) {
				int ret = isalnum(x);
				(void)ret;

				clang_analyzer_eval(EOF <= x && x <= 255); // \
				// report-warning{{TRUE}} \
				// bugpath-warning{{TRUE}} \
				// bugpath-note{{TRUE}} \
				// bugpath-note{{Left side of '&&' is true}} \
				// bugpath-note{{'x' is <= 255}}

				}

				void test_alnum_symbolic2(int x) {
				if (x > 255) { // \
				// bugpath-note{{Assuming 'x' is > 255}} \
				// bugpath-note{{Taking true branch}}

				int ret = isalnum(x); // \
				// report-warning{{Function argument constraint is not satisfied}} \
				// bugpath-warning{{Function argument constraint is not satisfied}} \
				// bugpath-note{{Function argument constraint is not satisfied}}

				(void)ret;
				}
				}
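
The symbolic tests above rely on two effects: a call whose argument cannot satisfy the constraint is reported, and after a call that may be valid the argument is narrowed to the allowed ranges. The following sketch models this with plain closed intervals. It is a simplification under stated assumptions, not the analyzer's constraint manager, and `intersect` and `certainlyViolates` are hypothetical helper names.

```cpp
#include <algorithm>
#include <cassert>
#include <utility>
#include <vector>

using RangeInt = long long;
using Range = std::pair<RangeInt, RangeInt>; // closed interval [first, second]
using RangeSet = std::vector<Range>;

// Narrows what is known about a value to the parts that satisfy the
// constraint, i.e. the intersection of Known with every allowed range.
RangeSet intersect(const Range &Known, const RangeSet &Allowed) {
  assert(Known.first <= Known.second && "malformed range");
  RangeSet Result;
  for (const Range &A : Allowed) {
    RangeInt Lo = std::max(Known.first, A.first);
    RangeInt Hi = std::min(Known.second, A.second);
    if (Lo <= Hi)
      Result.push_back({Lo, Hi});
  }
  return Result;
}

// The constraint is certainly violated when no allowed value remains.
bool certainlyViolates(const Range &Known, const RangeSet &Allowed) {
  return intersect(Known, Allowed).empty();
}
```

For example, with the allowed set `{{-1, -1}, {0, 255}}`, a value known to be greater than 255 leaves an empty intersection (the `test_alnum_symbolic2` situation), while a completely unknown `int` is narrowed to exactly the allowed ranges (the `test_alnum_symbolic` situation).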

clang/test/Analysis/std-c-library-functions.c

	// RUN: %clang_analyze_cc1 -analyzer-checker=apiModeling.StdCLibraryFunctions,debug.ExprInspection -verify -analyzer-config eagerly-assume=false %s			// RUN: %clang_analyze_cc1 %s \
	// RUN: %clang_analyze_cc1 -triple i686-unknown-linux -analyzer-checker=apiModeling.StdCLibraryFunctions,debug.ExprInspection -verify -analyzer-config eagerly-assume=false %s			// RUN: -analyzer-checker=core \
	// RUN: %clang_analyze_cc1 -triple x86_64-unknown-linux -analyzer-checker=apiModeling.StdCLibraryFunctions,debug.ExprInspection -verify -analyzer-config eagerly-assume=false %s			// RUN: -analyzer-checker=apiModeling.StdCLibraryFunctions \
	// RUN: %clang_analyze_cc1 -triple armv7-a15-linux -analyzer-checker=apiModeling.StdCLibraryFunctions,debug.ExprInspection -verify -analyzer-config eagerly-assume=false %s			// RUN: -analyzer-checker=debug.ExprInspection \
	// RUN: %clang_analyze_cc1 -triple thumbv7-a15-linux -analyzer-checker=apiModeling.StdCLibraryFunctions,debug.ExprInspection -verify -analyzer-config eagerly-assume=false %s			// RUN: -analyzer-config eagerly-assume=false \
				// RUN: -triple i686-unknown-linux \
				// RUN: -verify

				// RUN: %clang_analyze_cc1 %s \
				// RUN: -analyzer-checker=core \
				// RUN: -analyzer-checker=apiModeling.StdCLibraryFunctions \
				// RUN: -analyzer-checker=debug.ExprInspection \
				// RUN: -analyzer-config eagerly-assume=false \
				// RUN: -triple x86_64-unknown-linux \
				// RUN: -verify

				// RUN: %clang_analyze_cc1 %s \
				// RUN: -analyzer-checker=core \
				// RUN: -analyzer-checker=apiModeling.StdCLibraryFunctions \
				// RUN: -analyzer-checker=debug.ExprInspection \
				// RUN: -analyzer-config eagerly-assume=false \
				// RUN: -triple armv7-a15-linux \
				// RUN: -verify

				// RUN: %clang_analyze_cc1 %s \
				// RUN: -analyzer-checker=core \
				// RUN: -analyzer-checker=apiModeling.StdCLibraryFunctions \
				// RUN: -analyzer-checker=debug.ExprInspection \
				// RUN: -analyzer-config eagerly-assume=false \
				// RUN: -triple thumbv7-a15-linux \
				// RUN: -verify

				Szelethus: What a beautiful sight. Thanks.
				martong (author): Anytime :D
	void clang_analyzer_eval(int);			void clang_analyzer_eval(int);

	int glob;			int glob;

	typedef struct FILE FILE;			typedef struct FILE FILE;
	#define EOF -1			#define EOF -1

	int getc(FILE *);			int getc(FILE *);
	[... 173 lines elided ...]

[analyzer] StdLibraryFunctionsChecker: Add argument constraints (Closed, Public)

Diff 251656

clang/include/clang/StaticAnalyzer/Checkers/Checkers.td

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp

clang/test/Analysis/analyzer-enabled-checkers.c

clang/test/Analysis/std-c-library-functions-arg-constraints.c

clang/test/Analysis/std-c-library-functions.c
