This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/
-
lib/StaticAnalyzer/Checkers/
-
StaticAnalyzer/
-
Checkers/
-
ExprInspectionChecker.cpp
17/17
StdLibraryFunctionsChecker.cpp
-
test/Analysis/
-
Analysis/
4/4
std-c-library-functions-arg-constraints-note-tags.cpp
2/2
std-c-library-functions-arg-constraints-notes.cpp
4/4
std-c-library-functions-arg-constraints.c

Differential D101526

[analyzer][StdLibraryFunctionsChecker] Add NoteTags for applied arg constraints
ClosedPublic

Authored by martong on Apr 29 2021, 5:45 AM.

Download Raw Diff

Details

Reviewers

vsavchenko
NoQ
steakhal
Szelethus

Commits

rG82a50812f7e5: [analyzer][StdLibraryFunctionsChecker] Add NoteTags for applied arg

Summary

In this patch I add a new NoteTag for each applied argument constraint.
This way, any other checker that reports a bug - where the applied
constraint is relevant - will display the corresponding note. With this
change we provide more information for the users to understand some
bug reports easier.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

martong created this revision.Apr 29 2021, 5:45 AM

Herald added a reviewer: Szelethus. · View Herald TranscriptApr 29 2021, 5:45 AM

Herald added subscribers: ASDenysPetrov, gamesh411, dkrupp and 9 others. · View Herald Transcript

martong requested review of this revision.Apr 29 2021, 5:45 AM

Herald added a project: Restricted Project. · View Herald TranscriptApr 29 2021, 5:45 AM

Herald added a subscriber: cfe-commits. · View Herald Transcript

Harbormaster completed remote builds in B101620: Diff 341488.Apr 29 2021, 6:43 AM

steakhal added inline comments.Apr 29 2021, 7:43 AM

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
969–970	This way each and every applied constraint will be displayed even if the given argument does not constitute to the bug condition. I recommend you branching within the lambda, on the interestingness of the given argument constraint.
clang/test/Analysis/std-c-library-functions-arg-constraints.c
241–243	nit. BTW I raised my related concerns about this elsewhere.
253–262	I was puzzled for a moment on why do you have two notes here. By checking the definition of `__arg_constrained_twice()`, I can see that it has two `ArgConstraint`s. Although, it shouldn't be a problem as on the UI we would visualize notes for only a single bugreport. So it is probably clear that both of the notes are valid and correspond to the given statement. It might look clunky in the LIT test, but should be 'somewhat' readable in real life. You should take no action here. I leave this comment just for the record.

NoQ added inline comments.Apr 29 2021, 4:45 PM

clang/test/Analysis/std-c-library-functions-arg-constraints-note-tags.cpp
17	This has to be a user-friendly message. "Constraints" is compiler jargon. We cannot afford shortening "argument" to "arg". Generally, the less machine-generated it looks the better (":" is definitely robotic).
clang/test/Analysis/std-c-library-functions-arg-constraints.c
42	This isn't part of this patch but what do you think about `{-1} U [0, 255]`? Or, you know, `[-1, 255]`.

martong planned changes to this revision.Apr 30 2021, 2:46 AM

martong marked 4 inline comments as done.

martong added inline comments.

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
969–970	Okay, good point, thanks for the feedback! I am planning to change to this direction.
clang/test/Analysis/std-c-library-functions-arg-constraints-note-tags.cpp
17	Okay, thanks for your comment. I can make it to be more similar to the other notes we already have. What about this? Assuming the 1st argument is within the range [1, 1] We cannot afford shortening "argument" to "arg". I'd like to address this in another following patch if you don't mind.
clang/test/Analysis/std-c-library-functions-arg-constraints.c
42	Yeah, good idea, `{-1} U [0, 255]` would be indeed nicer and shorter. However, `[-1, 255]` is hard(er) so I am going to start with the former in a follow up patch.

NoQ added inline comments.May 2 2021, 6:31 PM

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
969–970	Excellent catch @steakhal! I think you can always emit the note but only mark it as unprunable when the argument is interesting. This way it'd work identically to our normal "Assuming..." notes.
clang/test/Analysis/std-c-library-functions-arg-constraints-note-tags.cpp
17	This sounds good for a generic message. I still think that most of the time these messages should be part of the summary. Eg., Assuming the 1st argument is within range [33, 47] U [58, 64] U [91, 96] U [123, 125] ideally should be rephrased as Assuming the argument is a punctuation character in the summary of `ispunct()`.

martong marked 4 inline comments as done.May 3 2021, 2:14 AM

martong added inline comments.

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
969–970	I think you can always emit the note but only mark it as unprunable when the argument is interesting. This way it'd work identically to our normal "Assuming..." notes. `IsPrunable` is a `const` member in `NoteTag`. So, we have to decide about prunability when we call `getNoteTag`. To follow your suggestion, we should decide the prunability dynamically in `TagVisitor::VisitNode`. This would require some infrastructural changes in `NoteTag`. We could add e.g. another Callback member that would be able to decide the prunability with the help of a `BugReport&`. I am okay to go into that direction, but it should definitely be separated from this patch (follow-up). I am not sure if it is an absolutely needed dependency for this change, is it? (If yes then I am going to create the dependent patch first).

martong marked 2 inline comments as done.May 3 2021, 2:20 AM

martong added inline comments.

clang/test/Analysis/std-c-library-functions-arg-constraints-note-tags.cpp
17	Yes, absolutely, good idea. It makes sense to provide another member for the `Summary` that could specifically describe the function specific assumptions (or violations). However, before we would be able to go through all functions manually to create these specific messages we need a generic solution to have something that is more descriptive than the current solution.

Add the description to the note tag only if the SVal is interesting
Use 'Assuming the nth arg is in ...' form for the descriptions

Harbormaster completed remote builds in B102255: Diff 342342.May 3 2021, 3:35 AM

steakhal added inline comments.May 3 2021, 4:28 AM

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
805–820	I don't know. Report message construction always seemed clunky. Clang's or ClangTidy's approach seems superior in this regard. Do we have anything better for this @NoQ? Maybe `llvm::format()` could be an option. Regarding this patch: It's fine. Better than it was before!
971
974	Ah, there is a slight issue. You should mark some stuff interesting here, to make this interestingness propagate back transitively. Let's say `ArgSVal` is `x + y` which is considered to be out of range `[42,52]`. We should mark both `x` and `y` interesting because they themselves could have been constrained by the StdLibChecker previously. So, they must be interesting as well. On the same token, IMO `PathSensitiveBugReport::markInteresting(symbol)` should be transitive. So that all `SymbolData` in that symbolic expression tree are considered interesting. What do you think @NoQ? If we were doing this, @martong - you could simply acquire the assumption you constructed for the given `ValueConstraint`, and mark that interesting. Then all `SymbolData`s on both sides of the logic operator would become implicitly interesting.

Szelethus added inline comments.Jul 21 2021, 7:40 AM

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
148–149	How about we turn this into a print-like function and instead of returning with a string, we take an `llvm::raw_ostream` object as argument? `SmallString` + `raw_svector_stream` is how we construct most of our checker message strings.
974	On the same token, IMO PathSensitiveBugReport::markInteresting(symbol) should be transitive. So that all SymbolData in that symbolic expression tree are considered interesting. What do you think @NoQ? Thats how I'd expect this to work. This shouldn't be a burden on the checker developer (certainly not this kind of a checker), but rather be handled by `PathSensitiveBugReport`. So I think this is fine as it is.

Herald added a subscriber: manas. · View Herald TranscriptJul 21 2021, 7:40 AM

NoQ added inline comments.Jul 21 2021, 10:35 AM

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
974	Interestingness isn't a thing on its own; its meaning is entirely tied to the nature of the specific bug report. An interesting pointer in the null dereference checker is absolutely not the same thing as an interesting mutex pointer in `PthreadLockChecker` or a (de)allocated pointer in `MallocChecker`. I currently treat interestingness as a "GDM for visitors". It's their own way of communicating with themselves and with each other, a state they keep track of and update as they visit the bug report (initially populated during construction of the bug report). But the meaning of this state is entirely specific to the visitors. It is visitors who give interestingness a meaning (and the visitors are, naturally, also hand-picked during construction of the bug report). So I think the right question to ask is "what do you want interestingness to mean in your checker?" and build your visitors accordingly. Your visitors should provide enough information for the user to be able to understand the bug report. When the report says "$x + $y is in range [42, 52] and no values in that range are a valid input to that function", the user asks "why do you think $x + $y is in range [42, 52]?" and we'll have to answer that. For example, in if (x + y >= 42 && x + y <= 52) foo(x + y); there's no need to track ranges for $x and $y separately; it is sufficient to point the user to the constraint over $x + $y obtained from the if-statement. On the other hand, in if (x >= 44 && x <= 50) if (y >= -2 && y <= 2) foo(x + y); you'll have to explain both $x and $y in order for the user to understand that $x + $y is indeed in range [42, 52]. There are also other funny edge cases depending on the nature of the arithmetic, such as int z = x * y; if (x == 0) return 1 / z; where in order to explain the division by the zero value $x * $y it is sufficient to explain $x which makes $y redundant. And if they both are zero then we should probably flip a coin? So I think this is a non-trivial problem. Even on the examples above (let alone other bug types!) it's easy to see that interestingness of $x and $y doesn't always follow from the interestingness of $x + $y (but sometimes it indeed does). I think the answer lies somewhere in the underneathies of the constraint solver: we have to follow its logic in order to find out how the range was inferred (which it probably should annotate with more note tags so that we didn't have to reverse-engineer it in the visitor?) (@vsavchenko you may find this discussion particularly peculiar).

martong marked 7 inline comments as done.Sep 15 2022, 3:09 AM

martong added inline comments.

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
148–149	I don't see how that change would be relevant. Would we have a better run-time, or code that is easier to understand? please elaborate.
971	Thanks, changed it.
974	Guys, the thing is, we should have our discussion about "transitive interestingness" somewhere else. This patch is orthogonal to that problem. Actually, in this patch I add extra notes to already interesting SVals. Those SVals are marked to be interesting by other checkers. As @NoQ describes, I agree, that a checker could be responsible to describe how it wants to handle transitivity. But, once again, it is not the StdLibraryFunctionChecker that is marking the SVals interesting. Please take a look at the newly added test file: We have a division by zero there, thus the divisor is marked interesting by the DivZeroChecker (and transitivity is handled by trackExpressionValue). What we do in this patch is if we know that a value to be interesting and we know that had been constrained by an argument constraint, then we attach a note that describes this fact.

Herald added a project: Restricted Project. · View Herald TranscriptSep 15 2022, 3:09 AM

Rebase
move Msg into the lambda

Gentle ping @steakhal @NoQ
Trying to revive this after a year :) I am sorry it took so long to get back to this.

Harbormaster completed remote builds in B186814: Diff 460345.Sep 15 2022, 3:40 AM

Hi, looks great! I found a couple of typos and the amount of changes in tests is suspiciously low. And I want to make sure that the promise to change "arg" -> "argument" isn't lost (but I'll be happy if it's addressed in a follow-up patch).

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
139
719
755	I suspect this needs to be covered by tests.

Rebase

Changed to be more verbose: "arg" -> "argument"
Fixed "less than" to "greater than"
Added new tests
Fixed typos

In D101526#3804623, @NoQ wrote:

Hi, looks great! I found a couple of typos and the amount of changes in tests is suspiciously low. And I want to make sure that the promise to change "arg" -> "argument" isn't lost (but I'll be happy if it's addressed in a follow-up patch).

Ok, I've changed "arg" to "argument" in the latest update, plus added new test cases. Thanks for the review!

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp
755	Okay, I've added further tests for the "not-null" and the "buffer-size-constraint" cases.

Harbormaster completed remote builds in B192176: Diff 467773.Oct 14 2022, 8:21 AM

Ping

Looks great to me, thanks!!

clang/test/Analysis/std-c-library-functions-arg-constraints-notes.cpp
32–33	The warning is the same as the note here right? Our warnings traditionally describe the problem (the 1st argument is less than 10, and this is bad because...), not how things "should" be. I guess we can think more about that later.

This revision is now accepted and ready to land.Oct 25 2022, 3:21 PM

This revision was landed with ongoing or failed builds.Oct 26 2022, 7:34 AM

Closed by commit rG82a50812f7e5: [analyzer][StdLibraryFunctionsChecker] Add NoteTags for applied arg (authored by martong). · Explain Why

This revision was automatically updated to reflect the committed changes.

martong marked an inline comment as done.

martong added a commit: rG82a50812f7e5: [analyzer][StdLibraryFunctionsChecker] Add NoteTags for applied arg.

In D101526#3883871, @NoQ wrote:

Looks great to me, thanks!!

Thanks for the review!

clang/test/Analysis/std-c-library-functions-arg-constraints-notes.cpp
32–33	No, actually, the warning is different, it does not contain the text "should be". In this case this is it: Line 31: Function argument constraint is not satisfied, constraint: BufferSize [alpha.unix.StdCLibraryFunctionArgs] And then the notes basically further explain how the constraint is not satisfied. I did not put the check for the warnings here because this test file is responsible for checking the notes only, hence it has the name `std-c-library-functions-arg-constraints-notes.cpp`. The warnings are directly tested in `std-c-library-functions-arg-constraints.c`, however, I have to admit, probably we should have even more specific checks for the warning messages there.

Revision Contents

Path

Size

clang/

lib/

StaticAnalyzer/

Checkers/

ExprInspectionChecker.cpp

7 lines

StdLibraryFunctionsChecker.cpp

84 lines

test/

Analysis/

std-c-library-functions-arg-constraints-note-tags.cpp

51 lines

std-c-library-functions-arg-constraints-notes.cpp

10 lines

std-c-library-functions-arg-constraints.c

2 lines

Diff 470813

clang/lib/StaticAnalyzer/Checkers/ExprInspectionChecker.cpp

Show First 20 Lines • Show All 109 Lines • ▼ Show 20 Lines	FnCheck Handler =
&ExprInspectionChecker::analyzerGetExtent)		&ExprInspectionChecker::analyzerGetExtent)
.Case("clang_analyzer_printState",		.Case("clang_analyzer_printState",
&ExprInspectionChecker::analyzerPrintState)		&ExprInspectionChecker::analyzerPrintState)
.Case("clang_analyzer_numTimesReached",		.Case("clang_analyzer_numTimesReached",
&ExprInspectionChecker::analyzerNumTimesReached)		&ExprInspectionChecker::analyzerNumTimesReached)
.Case("clang_analyzer_hashDump",		.Case("clang_analyzer_hashDump",
&ExprInspectionChecker::analyzerHashDump)		&ExprInspectionChecker::analyzerHashDump)
.Case("clang_analyzer_denote", &ExprInspectionChecker::analyzerDenote)		.Case("clang_analyzer_denote", &ExprInspectionChecker::analyzerDenote)
.Case("clang_analyzer_express",		.Case("clang_analyzer_express", // This also marks the argument as
		// interesting.
&ExprInspectionChecker::analyzerExpress)		&ExprInspectionChecker::analyzerExpress)
.StartsWith("clang_analyzer_isTainted",		.StartsWith("clang_analyzer_isTainted",
&ExprInspectionChecker::analyzerIsTainted)		&ExprInspectionChecker::analyzerIsTainted)
.Default(nullptr);		.Default(nullptr);

if (!Handler)		if (!Handler)
return false;		return false;

▲ Show 20 Lines • Show All 398 Lines • ▼ Show 20 Lines	void ExprInspectionChecker::analyzerExpress(const CallExpr *CE,
CheckerContext &C) const {		CheckerContext &C) const {
const Expr *Arg = getArgExpr(CE, C);		const Expr *Arg = getArgExpr(CE, C);
if (!Arg)		if (!Arg)
return;		return;

SVal ArgVal = C.getSVal(CE->getArg(0));		SVal ArgVal = C.getSVal(CE->getArg(0));
SymbolRef Sym = ArgVal.getAsSymbol();		SymbolRef Sym = ArgVal.getAsSymbol();
if (!Sym) {		if (!Sym) {
reportBug("Not a symbol", C);		reportBug("Not a symbol", C, ArgVal);
return;		return;
}		}

SymbolExpressor V(C.getState());		SymbolExpressor V(C.getState());
auto Str = V.Visit(Sym);		auto Str = V.Visit(Sym);
if (!Str) {		if (!Str) {
reportBug("Unable to express", C);		reportBug("Unable to express", C, ArgVal);
return;		return;
}		}

reportBug(*Str, C, ArgVal);		reportBug(*Str, C, ArgVal);
}		}

void ExprInspectionChecker::analyzerIsTainted(const CallExpr *CE,		void ExprInspectionChecker::analyzerIsTainted(const CallExpr *CE,
CheckerContext &C) const {		CheckerContext &C) const {
Show All 16 Lines

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp

Show First 20 Lines • Show All 128 Lines • ▼ Show 20 Lines public:

// Return those arguments that should be tracked when we report a bug. By // Return those arguments that should be tracked when we report a bug. By

// default it is the argument that is constrained, however, in some special // default it is the argument that is constrained, however, in some special

// cases we need to track other arguments as well. E.g. a buffer size might // cases we need to track other arguments as well. E.g. a buffer size might

// be encoded in another argument. // be encoded in another argument.

virtual std::vector<ArgNo> getArgsToTrack() const { return {ArgN}; } virtual std::vector<ArgNo> getArgsToTrack() const { return {ArgN}; }

virtual StringRef getName() const = 0; virtual StringRef getName() const = 0;

// Represents that in which context do we require a description of the

// constraint.

enum class DescriptionKind {

NoQUnsubmitted

Done

// constraint.

- enum class DescritptionKind {

+ enum class DescriptionKind {

// The constraint is violated.

NoQ:

// The constraint is violated.

Violation,

// We assume that the constraint is satisfied.

Assumption

};

// Give a description that explains the constraint to the user. Used when // Give a description that explains the constraint to the user. Used when

// the bug is reported. // the bug is reported.

virtual std::string describe(ProgramStateRef State, virtual std::string describe(DescriptionKind DK, ProgramStateRef State,

const Summary &Summary) const { const Summary &Summary) const {

SzelethusUnsubmitted

Done

How about we turn this into a print-like function and instead of returning with a string, we take an llvm::raw_ostream object as argument? SmallString + raw_svector_stream is how we construct most of our checker message strings.

Szelethus: How about we turn this into a print-like function and instead of returning with a string, we…

martongAuthorUnsubmitted

Done

I don't see how that change would be relevant. Would we have a better run-time, or code that is easier to understand? please elaborate.

martong: I don't see how that change would be relevant. Would we have a better run-time, or code that is…

// There are some descendant classes that are not used as argument // There are some descendant classes that are not used as argument

// constraints, e.g. ComparisonConstraint. In that case we can safely // constraints, e.g. ComparisonConstraint. In that case we can safely

// ignore the implementation of this function. // ignore the implementation of this function.

llvm_unreachable("Not implemented"); llvm_unreachable("Not implemented");

} }

protected: protected:

ArgNo ArgN; // Argument to which we apply the constraint. ArgNo ArgN; // Argument to which we apply the constraint.

Show All 20 Lines class RangeConstraint : public ValueConstraint {

// is default initialized to be empty. // is default initialized to be empty.

IntRangeVector Ranges; IntRangeVector Ranges;

public: public:

StringRef getName() const override { return "Range"; } StringRef getName() const override { return "Range"; }

RangeConstraint(ArgNo ArgN, RangeKind Kind, const IntRangeVector &Ranges) RangeConstraint(ArgNo ArgN, RangeKind Kind, const IntRangeVector &Ranges)

: ValueConstraint(ArgN), Kind(Kind), Ranges(Ranges) {} : ValueConstraint(ArgN), Kind(Kind), Ranges(Ranges) {}

std::string describe(ProgramStateRef State, std::string describe(DescriptionKind DK, ProgramStateRef State,

const Summary &Summary) const override; const Summary &Summary) const override;

const IntRangeVector &getRanges() const { return Ranges; } const IntRangeVector &getRanges() const { return Ranges; }

private: private:

ProgramStateRef applyAsOutOfRange(ProgramStateRef State, ProgramStateRef applyAsOutOfRange(ProgramStateRef State,

const CallEvent &Call, const CallEvent &Call,

const Summary &Summary) const; const Summary &Summary) const;

▲ Show 20 Lines • Show All 53 Lines • ▼ Show 20 Lines class StdLibraryFunctionsChecker

}; };

class NotNullConstraint : public ValueConstraint { class NotNullConstraint : public ValueConstraint {

using ValueConstraint::ValueConstraint; using ValueConstraint::ValueConstraint;

// This variable has a role when we negate the constraint. // This variable has a role when we negate the constraint.

bool CannotBeNull = true; bool CannotBeNull = true;

public: public:

std::string describe(ProgramStateRef State, std::string describe(DescriptionKind DK, ProgramStateRef State,

const Summary &Summary) const override; const Summary &Summary) const override;

StringRef getName() const override { return "NonNull"; } StringRef getName() const override { return "NonNull"; }

ProgramStateRef apply(ProgramStateRef State, const CallEvent &Call, ProgramStateRef apply(ProgramStateRef State, const CallEvent &Call,

const Summary &Summary, const Summary &Summary,

CheckerContext &C) const override { CheckerContext &C) const override {

SVal V = getArgSVal(Call, getArgNo()); SVal V = getArgSVal(Call, getArgNo());

if (V.isUndef()) if (V.isUndef())

return State; return State;

▲ Show 20 Lines • Show All 55 Lines • ▼ Show 20 Lines std::vector<ArgNo> getArgsToTrack() const override {

std::vector<ArgNo> Result{ArgN}; std::vector<ArgNo> Result{ArgN};

if (SizeArgN) if (SizeArgN)

Result.push_back(*SizeArgN); Result.push_back(*SizeArgN);

if (SizeMultiplierArgN) if (SizeMultiplierArgN)

Result.push_back(*SizeMultiplierArgN); Result.push_back(*SizeMultiplierArgN);

return Result; return Result;

} }

std::string describe(ProgramStateRef State, std::string describe(DescriptionKind DK, ProgramStateRef State,

const Summary &Summary) const override; const Summary &Summary) const override;

ProgramStateRef apply(ProgramStateRef State, const CallEvent &Call, ProgramStateRef apply(ProgramStateRef State, const CallEvent &Call,

const Summary &Summary, const Summary &Summary,

CheckerContext &C) const override { CheckerContext &C) const override {

SValBuilder &SvalBuilder = C.getSValBuilder(); SValBuilder &SvalBuilder = C.getSValBuilder();

// The buffer argument. // The buffer argument.

SVal BufV = getArgSVal(Call, getArgNo()); SVal BufV = getArgSVal(Call, getArgNo());

▲ Show 20 Lines • Show All 371 Lines • ▼ Show 20 Lines void reportBug(const CallEvent &Call, ExplodedNode *N,

auto R = std::make_unique<PathSensitiveBugReport>(*BT_InvalidArg, Msg, N); auto R = std::make_unique<PathSensitiveBugReport>(*BT_InvalidArg, Msg, N);

for (ArgNo ArgN : VC->getArgsToTrack()) for (ArgNo ArgN : VC->getArgsToTrack())

bugreporter::trackExpressionValue(N, Call.getArgExpr(ArgN), *R); bugreporter::trackExpressionValue(N, Call.getArgExpr(ArgN), *R);

// Highlight the range of the argument that was violated. // Highlight the range of the argument that was violated.

R->addRange(Call.getArgSourceRange(VC->getArgNo())); R->addRange(Call.getArgSourceRange(VC->getArgNo()));

// Describe the argument constraint in a note. // Describe the argument constraint violation in a note.

R->addNote(VC->describe(C.getState(), Summary), R->getLocation(), std::string Descr = VC->describe(

Call.getArgSourceRange(VC->getArgNo())); ValueConstraint::DescriptionKind::Violation, C.getState(), Summary);

// Capitalize the first letter b/c we want a full sentence.

NoQUnsubmitted

Done

ValueConstraint::DescritptionKind::Violation, C.getState(), Summary);

- // Capitalize the firs letter b/c we want a full sentence.

+ // Capitalize the first letter b/c we want a full sentence.

Descr[0] = toupper(Descr[0]);

NoQ:

Descr[0] = toupper(Descr[0]);

R->addNote(Descr, R->getLocation(), Call.getArgSourceRange(VC->getArgNo()));

C.emitReport(std::move(R)); C.emitReport(std::move(R));

} }

/// These are the errno constraints that can be passed to summary cases. /// These are the errno constraints that can be passed to summary cases.

/// One of these should fit for a single summary case. /// One of these should fit for a single summary case.

/// Usually if a failure return value exists for function, that function /// Usually if a failure return value exists for function, that function

/// needs different cases for success and failure with different errno /// needs different cases for success and failure with different errno

Show All 12 Lines

static BasicValueFactory &getBVF(ProgramStateRef State) { static BasicValueFactory &getBVF(ProgramStateRef State) {

ProgramStateManager &Mgr = State->getStateManager(); ProgramStateManager &Mgr = State->getStateManager();

SValBuilder &SVB = Mgr.getSValBuilder(); SValBuilder &SVB = Mgr.getSValBuilder();

return SVB.getBasicValueFactory(); return SVB.getBasicValueFactory();

} }

std::string StdLibraryFunctionsChecker::NotNullConstraint::describe( std::string StdLibraryFunctionsChecker::NotNullConstraint::describe(

ProgramStateRef State, const Summary &Summary) const { DescriptionKind DK, ProgramStateRef State, const Summary &Summary) const {

SmallString<48> Result; SmallString<48> Result;

Result += "The "; const auto Violation = ValueConstraint::DescriptionKind::Violation;

Result += "the ";

Result += getArgDesc(ArgN); Result += getArgDesc(ArgN);

Result += " should not be NULL"; Result += DK == Violation ? " should not be NULL" : " is not NULL";

NoQUnsubmitted

Done

I suspect this needs to be covered by tests.

NoQ: I suspect this needs to be covered by tests.

martongAuthorUnsubmitted

Done

Okay, I've added further tests for the "not-null" and the "buffer-size-constraint" cases.

martong: Okay, I've added further tests for the "not-null" and the "buffer-size-constraint" cases.

return Result.c_str(); return Result.c_str();

} }

std::string StdLibraryFunctionsChecker::RangeConstraint::describe( std::string StdLibraryFunctionsChecker::RangeConstraint::describe(

ProgramStateRef State, const Summary &Summary) const { DescriptionKind DK, ProgramStateRef State, const Summary &Summary) const {

BasicValueFactory &BVF = getBVF(State); BasicValueFactory &BVF = getBVF(State);

QualType T = Summary.getArgType(getArgNo()); QualType T = Summary.getArgType(getArgNo());

SmallString<48> Result; SmallString<48> Result;

Result += "The "; const auto Violation = ValueConstraint::DescriptionKind::Violation;

Result += "the ";

Result += getArgDesc(ArgN); Result += getArgDesc(ArgN);

Result += " should be "; Result += DK == Violation ? " should be " : " is ";

// Range kind as a string. // Range kind as a string.

Kind == OutOfRange ? Result += "out of" : Result += "within"; Kind == OutOfRange ? Result += "out of" : Result += "within";

// Get the range values as a string. // Get the range values as a string.

Result += " the range "; Result += " the range ";

if (Ranges.size() > 1) if (Ranges.size() > 1)

Result += "["; Result += "[";

Show All 15 Lines std::string StdLibraryFunctionsChecker::RangeConstraint::describe(

return Result.c_str(); return Result.c_str();

} }

SmallString<8> SmallString<8>

StdLibraryFunctionsChecker::getArgDesc(StdLibraryFunctionsChecker::ArgNo ArgN) { StdLibraryFunctionsChecker::getArgDesc(StdLibraryFunctionsChecker::ArgNo ArgN) {

SmallString<8> Result; SmallString<8> Result;

Result += std::to_string(ArgN + 1); Result += std::to_string(ArgN + 1);

Result += llvm::getOrdinalSuffix(ArgN + 1); Result += llvm::getOrdinalSuffix(ArgN + 1);

Result += " arg"; Result += " argument";

return Result; return Result;

} }

std::string StdLibraryFunctionsChecker::BufferSizeConstraint::describe( std::string StdLibraryFunctionsChecker::BufferSizeConstraint::describe(

ProgramStateRef State, const Summary &Summary) const { DescriptionKind DK, ProgramStateRef State, const Summary &Summary) const {

SmallString<96> Result; SmallString<96> Result;

Result += "The size of the "; const auto Violation = ValueConstraint::DescriptionKind::Violation;

Result += "the size of the ";

Result += getArgDesc(ArgN); Result += getArgDesc(ArgN);

Result += " should be equal to or less than the value of "; Result += DK == Violation ? " should be " : " is ";

Result += "equal to or greater than the value of ";

if (ConcreteSize) { if (ConcreteSize) {

ConcreteSize->toString(Result); ConcreteSize->toString(Result);

} else if (SizeArgN) { } else if (SizeArgN) {

Result += "the "; Result += "the ";

Result += getArgDesc(*SizeArgN); Result += getArgDesc(*SizeArgN);

if (SizeMultiplierArgN) { if (SizeMultiplierArgN) {

Result += " times the "; Result += " times the ";

Result += getArgDesc(*SizeMultiplierArgN); Result += getArgDesc(*SizeMultiplierArgN);

steakhalUnsubmitted

Done

I don't know. Report message construction always seemed clunky.
Clang's or ClangTidy's approach seems superior in this regard.

Do we have anything better for this @NoQ?
Maybe llvm::format() could be an option.

Regarding this patch: It's fine. Better than it was before!

steakhal: I don't know. Report message construction always seemed clunky. Clang's or ClangTidy's approach…

} }

return Result.c_str(); return Result.c_str();

} }

ProgramStateRef StdLibraryFunctionsChecker::RangeConstraint::applyAsOutOfRange( ProgramStateRef StdLibraryFunctionsChecker::RangeConstraint::applyAsOutOfRange(

ProgramStateRef State, const CallEvent &Call, ProgramStateRef State, const CallEvent &Call,

const Summary &Summary) const { const Summary &Summary) const {

▲ Show 20 Lines • Show All 109 Lines • ▼ Show 20 Lines void StdLibraryFunctionsChecker::checkPreCall(const CallEvent &Call,

Optional<Summary> FoundSummary = findFunctionSummary(Call, C); Optional<Summary> FoundSummary = findFunctionSummary(Call, C);

if (!FoundSummary) if (!FoundSummary)

return; return;

const Summary &Summary = *FoundSummary; const Summary &Summary = *FoundSummary;

ProgramStateRef State = C.getState(); ProgramStateRef State = C.getState();

ProgramStateRef NewState = State; ProgramStateRef NewState = State;

ExplodedNode *NewNode = C.getPredecessor();

for (const ValueConstraintPtr &Constraint : Summary.getArgConstraints()) { for (const ValueConstraintPtr &Constraint : Summary.getArgConstraints()) {

ProgramStateRef SuccessSt = Constraint->apply(NewState, Call, Summary, C); ProgramStateRef SuccessSt = Constraint->apply(NewState, Call, Summary, C);

ProgramStateRef FailureSt = ProgramStateRef FailureSt =

Constraint->negate()->apply(NewState, Call, Summary, C); Constraint->negate()->apply(NewState, Call, Summary, C);

// The argument constraint is not satisfied. // The argument constraint is not satisfied.

if (FailureSt && !SuccessSt) { if (FailureSt && !SuccessSt) {

if (ExplodedNode *N = C.generateErrorNode(NewState)) if (ExplodedNode *N = C.generateErrorNode(NewState))

reportBug(Call, N, Constraint.get(), Summary, C); reportBug(Call, N, Constraint.get(), Summary, C);

break; break;

} else { }

// We will apply the constraint even if we cannot reason about the // We will apply the constraint even if we cannot reason about the

// argument. This means both SuccessSt and FailureSt can be true. If we // argument. This means both SuccessSt and FailureSt can be true. If we

// weren't applying the constraint that would mean that symbolic // weren't applying the constraint that would mean that symbolic

// execution continues on a code whose behaviour is undefined. // execution continues on a code whose behaviour is undefined.

assert(SuccessSt); assert(SuccessSt);

NewState = SuccessSt; NewState = SuccessSt;

if (NewState != State) {

SmallString<64> Msg;

Msg += "Assuming ";

Msg += Constraint->describe(ValueConstraint::DescriptionKind::Assumption,

NewState, Summary);

const auto ArgSVal = Call.getArgSVal(Constraint->getArgNo());

NewNode = C.addTransition(

NewState, NewNode,

steakhalUnsubmitted

Done

This way each and every applied constraint will be displayed even if the given argument does not constitute to the bug condition.
I recommend you branching within the lambda, on the interestingness of the given argument constraint.

steakhal: This way each and every applied constraint will be displayed even if the given argument does…

martongAuthorUnsubmitted

Done

Okay, good point, thanks for the feedback! I am planning to change to this direction.

martong: Okay, good point, thanks for the feedback! I am planning to change to this direction.

NoQUnsubmitted

Done

Excellent catch @steakhal!

I think you can always emit the note but only mark it as unprunable when the argument is interesting. This way it'd work identically to our normal "Assuming..." notes.

NoQ: Excellent catch @steakhal! I think you can always emit the note but only mark it as…

martongAuthorUnsubmitted

Done

I think you can always emit the note but only mark it as unprunable when the argument is interesting. This way it'd work identically to our normal "Assuming..." notes.

IsPrunable is a const member in NoteTag. So, we have to decide about prunability when we call getNoteTag. To follow your suggestion, we should decide the prunability dynamically in TagVisitor::VisitNode. This would require some infrastructural changes in NoteTag. We could add e.g. another Callback member that would be able to decide the prunability with the help of a BugReport&. I am okay to go into that direction, but it should definitely be separated from this patch (follow-up). I am not sure if it is an absolutely needed dependency for this change, is it? (If yes then I am going to create the dependent patch first).

martong: > I think you can always emit the note but only mark it as unprunable when the argument is…

C.getNoteTag([Msg = std::move(Msg), ArgSVal](

steakhalUnsubmitted

Done

NewState, NewNode,

- C.getNoteTag([Msg, ArgSVal](PathSensitiveBugReport &BR,

+ C.getNoteTag([Msg = std::move(Msg), ArgSVal](PathSensitiveBugReport &BR,

llvm::raw_ostream &OS) {

steakhal:

martongAuthorUnsubmitted

Done

Thanks, changed it.

martong: Thanks, changed it.

PathSensitiveBugReport &BR, llvm::raw_ostream &OS) {

if (BR.isInteresting(ArgSVal))

OS << Msg;

steakhalUnsubmitted

Done

Ah, there is a slight issue.
You should mark some stuff interesting here, to make this interestingness propagate back transitively.

Let's say ArgSVal is x + y which is considered to be out of range [42,52]. We should mark both x and y interesting because they themselves could have been constrained by the StdLibChecker previously. So, they must be interesting as well.

On the same token, IMO PathSensitiveBugReport::markInteresting(symbol) should be transitive. So that all SymbolData in that symbolic expression tree are considered interesting. What do you think @NoQ?
If we were doing this, @martong - you could simply acquire the assumption you constructed for the given ValueConstraint, and mark that interesting. Then all SymbolDatas on both sides of the logic operator would become implicitly interesting.

steakhal: Ah, there is a slight issue. You should mark some stuff interesting here, to make this…

SzelethusUnsubmitted

Done

On the same token, IMO PathSensitiveBugReport::markInteresting(symbol) should be transitive. So that all SymbolData in that symbolic expression tree are considered interesting. What do you think @NoQ?

Thats how I'd expect this to work. This shouldn't be a burden on the checker developer (certainly not this kind of a checker), but rather be handled by PathSensitiveBugReport.

So I think this is fine as it is.

Szelethus: >On the same token, IMO PathSensitiveBugReport::markInteresting(symbol) should be transitive.

NoQUnsubmitted

Done

Interestingness isn't a thing on its own; its meaning is entirely tied to the nature of the specific bug report. An interesting pointer in the null dereference checker is absolutely not the same thing as an interesting mutex pointer in PthreadLockChecker or a (de)allocated pointer in MallocChecker.

I currently treat interestingness as a "GDM for visitors". It's their own way of communicating with themselves and with each other, a state they keep track of and update as they visit the bug report (initially populated during construction of the bug report). But the meaning of this state is entirely specific to the visitors. It is visitors who give interestingness a meaning (and the visitors are, naturally, also hand-picked during construction of the bug report).

So I think the right question to ask is "what do you want interestingness to mean in your checker?" and build your visitors accordingly.

Your visitors should provide enough information for the user to be able to understand the bug report. When the report says "$x + $y is in range [42, 52] and no values in that range are a valid input to that function", the user asks "why do you think $x + $y is in range [42, 52]?" and we'll have to answer that.

For example, in

if (x + y >= 42 && x + y <= 52)
  foo(x + y);

there's no need to track ranges for $x and $y separately; it is sufficient to point the user to the constraint over $x + $y obtained from the if-statement. On the other hand, in

if (x >= 44 && x <= 50)
  if (y >= -2 && y <= 2)
    foo(x + y);

you'll have to explain both $x and $y in order for the user to understand that $x + $y is indeed in range [42, 52].

There are also other funny edge cases depending on the nature of the arithmetic, such as

int z = x * y;
if (x == 0)
  return 1 / z;

where in order to explain the division by the zero value $x * $y it is sufficient to explain $x which makes $y redundant. And if they both are zero then we should probably flip a coin?

So I think this is a non-trivial problem. Even on the examples above (let alone other bug types!) it's easy to see that interestingness of $x and $y doesn't always follow from the interestingness of $x + $y (but sometimes it indeed does). I think the answer lies somewhere in the underneathies of the constraint solver: we have to follow its logic in order to find out how the range was inferred (which it probably should annotate with more note tags so that we didn't have to reverse-engineer it in the visitor?) (@vsavchenko you may find this discussion particularly peculiar).

NoQ: Interestingness isn't a thing on its own; its meaning is entirely tied to the nature of the…

martongAuthorUnsubmitted

Done

Guys, the thing is, we should have our discussion about "transitive interestingness" somewhere else.

This patch is orthogonal to that problem. Actually, in this patch I add extra notes to already interesting SVals. Those SVals are marked to be interesting by other checkers. As @NoQ describes, I agree, that a checker could be responsible to describe how it wants to handle transitivity. But, once again, it is not the StdLibraryFunctionChecker that is marking the SVals interesting. Please take a look at the newly added test file: We have a division by zero there, thus the divisor is marked interesting by the DivZeroChecker (and transitivity is handled by trackExpressionValue). What we do in this patch is if we know that a value to be interesting and we know that had been constrained by an argument constraint, then we attach a note that describes this fact.

martong: Guys, the thing is, we should have our discussion about "transitive interestingness" somewhere…

}));

} }

if (NewState && NewState != State)

C.addTransition(NewState);

} }

void StdLibraryFunctionsChecker::checkPostCall(const CallEvent &Call, void StdLibraryFunctionsChecker::checkPostCall(const CallEvent &Call,

CheckerContext &C) const { CheckerContext &C) const {

Optional<Summary> FoundSummary = findFunctionSummary(Call, C); Optional<Summary> FoundSummary = findFunctionSummary(Call, C);

if (!FoundSummary) if (!FoundSummary)

return; return;

▲ Show 20 Lines • Show All 1,920 Lines • ▼ Show 20 Lines

// Functions for testing. // Functions for testing.

if (ChecksEnabled[CK_StdCLibraryFunctionsTesterChecker]) { if (ChecksEnabled[CK_StdCLibraryFunctionsTesterChecker]) {

addToFunctionSummaryMap( addToFunctionSummaryMap(

"__not_null", Signature(ArgTypes{IntPtrTy}, RetType{IntTy}), "__not_null", Signature(ArgTypes{IntPtrTy}, RetType{IntTy}),

Summary(EvalCallAsPure).ArgConstraint(NotNull(ArgNo(0)))); Summary(EvalCallAsPure).ArgConstraint(NotNull(ArgNo(0))));

// Test range values. // Test range values.

addToFunctionSummaryMap( addToFunctionSummaryMap(

"__single_val_0", Signature(ArgTypes{IntTy}, RetType{IntTy}),

Summary(EvalCallAsPure)

.ArgConstraint(ArgumentCondition(0U, WithinRange, SingleValue(0))));

addToFunctionSummaryMap(

"__single_val_1", Signature(ArgTypes{IntTy}, RetType{IntTy}), "__single_val_1", Signature(ArgTypes{IntTy}, RetType{IntTy}),

Summary(EvalCallAsPure) Summary(EvalCallAsPure)

.ArgConstraint(ArgumentCondition(0U, WithinRange, SingleValue(1)))); .ArgConstraint(ArgumentCondition(0U, WithinRange, SingleValue(1))));

addToFunctionSummaryMap( addToFunctionSummaryMap(

"__range_1_2", Signature(ArgTypes{IntTy}, RetType{IntTy}), "__range_1_2", Signature(ArgTypes{IntTy}, RetType{IntTy}),

Summary(EvalCallAsPure) Summary(EvalCallAsPure)

.ArgConstraint(ArgumentCondition(0U, WithinRange, Range(1, 2)))); .ArgConstraint(ArgumentCondition(0U, WithinRange, Range(1, 2))));

addToFunctionSummaryMap("__range_1_2__4_5", addToFunctionSummaryMap("__range_1_2__4_5",

▲ Show 20 Lines • Show All 92 Lines • Show Last 20 Lines

clang/test/Analysis/std-c-library-functions-arg-constraints-note-tags.cpp

This file was added.

				// RUN: %clang_analyze_cc1 %s \
				// RUN: -analyzer-checker=core \
				// RUN: -analyzer-checker=apiModeling.StdCLibraryFunctions \
				// RUN: -analyzer-checker=alpha.unix.StdCLibraryFunctionArgs \
				// RUN: -analyzer-checker=debug.StdCLibraryFunctionsTester \
				// RUN: -analyzer-config apiModeling.StdCLibraryFunctions:DisplayLoadedSummaries=true \
				// RUN: -analyzer-checker=debug.ExprInspection \
				// RUN: -analyzer-config eagerly-assume=false \
				// RUN: -triple i686-unknown-linux \
				// RUN: -analyzer-output=text \
				// RUN: -verify

				template <typename T>
				void clang_analyzer_express(T x);
				void clang_analyzer_eval(bool);
				int clang_analyzer_getExtent(void *);

				NoQUnsubmitted Done Reply Inline Actions This has to be a user-friendly message. "Constraints" is compiler jargon. We cannot afford shortening "argument" to "arg". Generally, the less machine-generated it looks the better (":" is definitely robotic). NoQ: This has to be a user-friendly message. * "Constraints" is compiler jargon. * We cannot afford…
				martongAuthorUnsubmitted Done Reply Inline Actions Okay, thanks for your comment. I can make it to be more similar to the other notes we already have. What about this? Assuming the 1st argument is within the range [1, 1] We cannot afford shortening "argument" to "arg". I'd like to address this in another following patch if you don't mind. martong: Okay, thanks for your comment. I can make it to be more similar to the other notes we already…
				NoQUnsubmitted Done Reply Inline Actions This sounds good for a generic message. I still think that most of the time these messages should be part of the summary. Eg., Assuming the 1st argument is within range [33, 47] U [58, 64] U [91, 96] U [123, 125] ideally should be rephrased as Assuming the argument is a punctuation character in the summary of `ispunct()`. NoQ: This sounds good for a generic message. I still think that most of the time these messages…
				martongAuthorUnsubmitted Done Reply Inline Actions Yes, absolutely, good idea. It makes sense to provide another member for the `Summary` that could specifically describe the function specific assumptions (or violations). However, before we would be able to go through all functions manually to create these specific messages we need a generic solution to have something that is more descriptive than the current solution. martong: Yes, absolutely, good idea. It makes sense to provide another member for the `Summary` that…

				// Check NotNullConstraint assumption notes.
				int __not_null(int *);
				int test_not_null_note(int *x, int y) {
				__not_null(x); // expected-note{{Assuming the 1st argument is not NULL}}
				if (x) // expected-note{{'x' is non-null}} \
				// expected-note{{Taking true branch}}
				if (!y) // expected-note{{Assuming 'y' is 0}} \
				// expected-note{{Taking true branch}}
				return 1 / y; // expected-warning{{Division by zero}} \
				// expected-note{{Division by zero}}

				return 0;
				}

				// Check the RangeConstraint assumption notes.
				int __single_val_0(int); // [0, 0]
				int test_range_constraint_note(int x, int y) {
				__single_val_0(x); // expected-note{{Assuming the 1st argument is within the range [0, 0]}}
				return y / x; // expected-warning{{Division by zero}} \
				// expected-note{{Division by zero}}
				}

				// Check the BufferSizeConstraint assumption notes.
				int __buf_size_arg_constraint_concrete(const void *buf); // size of buf must be >= 10
				void test_buffer_size_note(char *buf, int y) {
				__buf_size_arg_constraint_concrete(buf); // expected-note {{Assuming the size of the 1st argument is equal to or greater than the value of 10}}
				clang_analyzer_eval(clang_analyzer_getExtent(buf) >= 10); // expected-warning{{TRUE}} \
				// expected-note{{TRUE}}

				// clang_analyzer_express marks the argument as interesting.
				clang_analyzer_express(buf); // expected-warning {{}} // the message does not really matter \
				// expected-note {{}}
				}

clang/test/Analysis/std-c-library-functions-arg-constraints-notes.cpp

	Show All 9 Lines
	// RUN: -verify			// RUN: -verify

	// In this test we verify that each argument constraints are described properly.			// In this test we verify that each argument constraints are described properly.

	// Check NotNullConstraint violation notes.			// Check NotNullConstraint violation notes.
	int __not_null(int *);			int __not_null(int *);
	void test_not_null(int *x) {			void test_not_null(int *x) {
	__not_null(nullptr); // \			__not_null(nullptr); // \
	// expected-note{{The 1st arg should not be NULL}} \			// expected-note{{The 1st argument should not be NULL}} \
	// expected-warning{{}}			// expected-warning{{}}
	}			}

	// Check the BufferSizeConstraint violation notes.			// Check the BufferSizeConstraint violation notes.
	using size_t = decltype(sizeof(int));			using size_t = decltype(sizeof(int));
	int __buf_size_arg_constraint_concrete(const void *); // size <= 10			int __buf_size_arg_constraint_concrete(const void *); // size <= 10
	int __buf_size_arg_constraint(const void *, size_t); // size <= Arg1			int __buf_size_arg_constraint(const void *, size_t); // size <= Arg1
	int __buf_size_arg_constraint_mul(const void , size_t, size_t); // size <= Arg1 Arg2			int __buf_size_arg_constraint_mul(const void , size_t, size_t); // size <= Arg1 Arg2
	void test_buffer_size(int x) {			void test_buffer_size(int x) {
	switch (x) {			switch (x) {
	case 1: {			case 1: {
	char buf[9];			char buf[9];
	__buf_size_arg_constraint_concrete(buf); // \			__buf_size_arg_constraint_concrete(buf); // \
	// expected-note{{The size of the 1st arg should be equal to or less than the value of 10}} \			// expected-note{{The size of the 1st argument should be equal to or greater than the value of 10}} \
	// expected-warning{{}}			// expected-warning{{}}
				NoQUnsubmitted Done Reply Inline Actions The warning is the same as the note here right? Our warnings traditionally describe the problem (the 1st argument is less than 10, and this is bad because...), not how things "should" be. I guess we can think more about that later. NoQ: The warning is the same as the note here right? Our warnings traditionally describe the…
				martongAuthorUnsubmitted Done Reply Inline Actions No, actually, the warning is different, it does not contain the text "should be". In this case this is it: Line 31: Function argument constraint is not satisfied, constraint: BufferSize [alpha.unix.StdCLibraryFunctionArgs] And then the notes basically further explain how the constraint is not satisfied. I did not put the check for the warnings here because this test file is responsible for checking the notes only, hence it has the name `std-c-library-functions-arg-constraints-notes.cpp`. The warnings are directly tested in `std-c-library-functions-arg-constraints.c`, however, I have to admit, probably we should have even more specific checks for the warning messages there. martong: No, actually, the warning is different, it does not contain the text "should be". In this case…
	break;			break;
	}			}
	case 2: {			case 2: {
	char buf[3];			char buf[3];
	__buf_size_arg_constraint(buf, 4); // \			__buf_size_arg_constraint(buf, 4); // \
	// expected-note{{The size of the 1st arg should be equal to or less than the value of the 2nd arg}} \			// expected-note{{The size of the 1st argument should be equal to or greater than the value of the 2nd arg}} \
	// expected-warning{{}}			// expected-warning{{}}
	break;			break;
	}			}
	case 3: {			case 3: {
	char buf[3];			char buf[3];
	__buf_size_arg_constraint_mul(buf, 4, 2); // \			__buf_size_arg_constraint_mul(buf, 4, 2); // \
	// expected-note{{The size of the 1st arg should be equal to or less than the value of the 2nd arg times the 3rd arg}} \			// expected-note{{The size of the 1st argument should be equal to or greater than the value of the 2nd argument times the 3rd argument}} \
	// expected-warning{{}}			// expected-warning{{}}
	break;			break;
	}			}
	}			}
	}			}

	// Check the RangeConstraint violation notes.			// Check the RangeConstraint violation notes.
	int __single_val_1(int); // [1, 1]			int __single_val_1(int); // [1, 1]
	int __range_1_2(int); // [1, 2]			int __range_1_2(int); // [1, 2]
	int __range_1_2__4_5(int); // [1, 2], [4, 5]			int __range_1_2__4_5(int); // [1, 2], [4, 5]
	void test_range(int x) {			void test_range(int x) {
	__single_val_1(2); // \			__single_val_1(2); // \
	// expected-note{{The 1st arg should be within the range [1, 1]}} \			// expected-note{{The 1st argument should be within the range [1, 1]}} \
	// expected-warning{{}}			// expected-warning{{}}
	}			}
	// Do more specific check against the range strings.			// Do more specific check against the range strings.
	void test_range_values(int x) {			void test_range_values(int x) {
	switch (x) {			switch (x) {
	case 1:			case 1:
	__single_val_1(2); // expected-note{{[1, 1]}} \			__single_val_1(2); // expected-note{{[1, 1]}} \
	// expected-warning{{}}			// expected-warning{{}}
	Show All 27 Lines

clang/test/Analysis/std-c-library-functions-arg-constraints.c

Show All 33 Lines void test_alnum_concrete(int v) {

// bugpath-warning{{Function argument constraint is not satisfied}} \ // bugpath-warning{{Function argument constraint is not satisfied}} \

// bugpath-note{{}} \ // bugpath-note{{}} \

// bugpath-note{{Function argument constraint is not satisfied}} // bugpath-note{{Function argument constraint is not satisfied}}

(void)ret; (void)ret;

} }

void test_alnum_symbolic(int x) { void test_alnum_symbolic(int x) {

int ret = isalnum(x); // \ int ret = isalnum(x); // \

// bugpath-note{{Assuming the character is non-alphanumeric}} // bugpath-note{{Assuming the character is non-alphanumeric}}

NoQUnsubmitted

Done

This isn't part of this patch but what do you think about {-1} U [0, 255]? Or, you know, [-1, 255].

NoQ: This isn't part of this patch but what do you think about `{-1} U [0, 255]`? Or, you know, `[-1…

martongAuthorUnsubmitted

Done

Yeah, good idea, {-1} U [0, 255] would be indeed nicer and shorter. However, [-1, 255] is hard(er) so I am going to start with the former in a follow up patch.

martong: Yeah, good idea, `{-1} U [0, 255]` would be indeed nicer and shorter. However, `[-1, 255]` is…

(void)ret; (void)ret;

clang_analyzer_eval(EOF <= x && x <= 255); // \ clang_analyzer_eval(EOF <= x && x <= 255); // \

// report-warning{{TRUE}} \ // report-warning{{TRUE}} \

// bugpath-warning{{TRUE}} \ // bugpath-warning{{TRUE}} \

// bugpath-note{{TRUE}} \ // bugpath-note{{TRUE}} \

// bugpath-note{{Left side of '&&' is true}} \ // bugpath-note{{Left side of '&&' is true}} \

// bugpath-note{{'x' is <= 255}} // bugpath-note{{'x' is <= 255}}

▲ Show 20 Lines • Show All 182 Lines • ▼ Show 20 Lines void ARR38_C_F(FILE *file) {

// bugpath-note{{}} \ // bugpath-note{{}} \

// bugpath-note{{Function argument constraint is not satisfied}} // bugpath-note{{Function argument constraint is not satisfied}}

} }

int __two_constrained_args(int, int); int __two_constrained_args(int, int);

void test_constraints_on_multiple_args(int x, int y) { void test_constraints_on_multiple_args(int x, int y) {

// State split should not happen here. I.e. x == 1 should not be evaluated // State split should not happen here. I.e. x == 1 should not be evaluated

// FALSE. // FALSE.

__two_constrained_args(x, y); __two_constrained_args(x, y);

//NOTE! Because of the second `clang_analyzer_eval` call we have two bug

clang_analyzer_eval(x == 1); // \ clang_analyzer_eval(x == 1); // \

steakhalUnsubmitted

Done

// FALSE.

__two_constrained_args(x, y);

- // bugpath-note@-1{{Applied constraint: The 1st arg should be within the range [1, 1]}}

- // bugpath-note@-2{{Applied constraint: The 2nd arg should be within the range [1, 1]}}

- //NOTE! Because of the second `clang_analyzer_eval` call we have two bug

- //reports, thus the 'Applied constraint' notes appear twice.

- // bugpath-note@-5{{Applied constraint: The 1st arg should be within the range [1, 1]}}

- // bugpath-note@-6{{Applied constraint: The 2nd arg should be within the range [1, 1]}}

+ // NOTE! Because of the second `clang_analyzer_eval` call we have two bug

+ // reports, thus the 'Applied constraint' notes appear twice.

+ // bugpath-note@-3 2 {{Applied constraint: The 1st arg should be within the range [1, 1]}}

+ // bugpath-note@-4 2 {{Applied constraint: The 2nd arg should be within the range [1, 1]}}

clang_analyzer_eval(x == 1); // \

// report-warning{{TRUE}} \

nit.
BTW I raised my related concerns about this elsewhere.

steakhal: nit. BTW I raised my related concerns about this elsewhere.

// report-warning{{TRUE}} \ // report-warning{{TRUE}} \

// bugpath-warning{{TRUE}} \ // bugpath-warning{{TRUE}} \

// bugpath-note{{TRUE}} // bugpath-note{{TRUE}}

clang_analyzer_eval(y == 1); // \ clang_analyzer_eval(y == 1); // \

// report-warning{{TRUE}} \ // report-warning{{TRUE}} \

// bugpath-warning{{TRUE}} \ // bugpath-warning{{TRUE}} \

// bugpath-note{{TRUE}} // bugpath-note{{TRUE}}

} }

int __arg_constrained_twice(int); int __arg_constrained_twice(int);

void test_multiple_constraints_on_same_arg(int x) { void test_multiple_constraints_on_same_arg(int x) {

__arg_constrained_twice(x); __arg_constrained_twice(x);

// Check that both constraints are applied and only one branch is there.

clang_analyzer_eval(x < 1 || x > 2); // \ clang_analyzer_eval(x < 1 || x > 2); // \

// report-warning{{TRUE}} \ // report-warning{{TRUE}} \

// bugpath-warning{{TRUE}} \ // bugpath-warning{{TRUE}} \

// bugpath-note{{TRUE}} \ // bugpath-note{{TRUE}} \

// bugpath-note{{Assuming 'x' is < 1}} \ // bugpath-note{{Assuming 'x' is < 1}} \

// bugpath-note{{Left side of '||' is true}} // bugpath-note{{Left side of '||' is true}}

} }

steakhalUnsubmitted

Done

I was puzzled for a moment on why do you have two notes here.
By checking the definition of __arg_constrained_twice(), I can see that it has two ArgConstraints.

Although, it shouldn't be a problem as on the UI we would visualize notes for only a single bugreport. So it is probably clear that both of the notes are valid and correspond to the given statement. It might look clunky in the LIT test, but should be 'somewhat' readable in real life.

You should take no action here. I leave this comment just for the record.

steakhal: I was puzzled for a moment on why do you have two notes here. By checking the definition of…

int __variadic(void *stream, const char *format, ...); int __variadic(void *stream, const char *format, ...);

void test_arg_constraint_on_variadic_fun(void) { void test_arg_constraint_on_variadic_fun(void) {

__variadic(0, "%d%d", 1, 2); // \ __variadic(0, "%d%d", 1, 2); // \

// report-warning{{Function argument constraint is not satisfied}} \ // report-warning{{Function argument constraint is not satisfied}} \

// report-note{{}} \ // report-note{{}} \

// bugpath-warning{{Function argument constraint is not satisfied}} \ // bugpath-warning{{Function argument constraint is not satisfied}} \

// bugpath-note{{}} \ // bugpath-note{{}} \

▲ Show 20 Lines • Show All 70 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[analyzer][StdLibraryFunctionsChecker] Add NoteTags for applied arg constraintsClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 470813

clang/lib/StaticAnalyzer/Checkers/ExprInspectionChecker.cpp

clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp

clang/test/Analysis/std-c-library-functions-arg-constraints-note-tags.cpp

clang/test/Analysis/std-c-library-functions-arg-constraints-notes.cpp

clang/test/Analysis/std-c-library-functions-arg-constraints.c

[analyzer][StdLibraryFunctionsChecker] Add NoteTags for applied arg constraints
ClosedPublic