This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
cfe/trunk/
-
trunk/
-
lib/StaticAnalyzer/Core/
-
StaticAnalyzer/
-
Core/
-
AnalyzerOptions.cpp
-
test/Analysis/
-
Analysis/
-
analyzer-config.c
-
analyzer-config.cpp

Differential D34277

[analyzer] Bump default performance thresholds?
ClosedPublic

Authored by NoQ on Jun 16 2017, 6:47 AM.

Download Raw Diff

Details

Reviewers

zaks.anna
dcoughlin
xazax.hun
a.sidorin

Commits

rG4a084cfde7b3: [analyzer] Bump a few default performance thresholds.
rC305900: [analyzer] Bump a few default performance thresholds.
rL305900: [analyzer] Bump a few default performance thresholds.

Summary

Because we now have faster CPUs and more RAM and stuff, should we now skew the balance to finding more bugs?

We could probably make a few rounds of such changes, observing any delayed feedback from users who use default settings and aren't watching phabricator, and rolling back in case we degrade dramatically on specific smaller projects.

As the first step, i've recently tested the following changes to default -analyzer-options:

max-nodes: 150000 -> 225000 (+50%) - the limit on the size of the exploded graph.
max-inlinable-size: 50 -> 100 (+100%) - the limit on the number of CFG blocks in inlined functions.

Totally, this gives 10% performance degradation and finds 5% more bugs on a large-ish codebase. max-inlinable-size change skews the analyzer to find more IPA-based bugs than before (+/-5% added/lost), and also overally slightly improves the number of bugs found; max-nodes increase brings back some of these positives.

Generally, it would also be good to make the analyzer work in a more obvious manner in terms of why does or doesn't it cover certain paths, inline certain functions, etc.- currently this is a mess of unobvious heuristics, and if we could make it less obvious by lifting some of these heuristics, it may be an additional benefit of this work as well.

Diff Detail

Repository: rL LLVM

Event Timeline

NoQ created this revision.Jun 16 2017, 6:47 AM

Hi Artem,

Could you tell what code bases did you use to collect your statistics? I'll try to check the patch on our code bases. I think we should be careful about default settings. Maybe we should introduce another UMK_* for deeper analysis instead?

Maybe we should introduce another UMK_* for deeper analysis instead?

The difference here is not substantial enough to warrant a new level. The general motivation for bumping these numbers is that we've set the timeouts many years ago and the hardware improved quite a bit since then.

Once Artem gives more details about the codebase he tested on, I think it would be fine to commit this. We can revert/address concerns later if @a.sidorin or anyone else raises concerns based on further testing on their codebases. @a.sidorin, would this work for you?

This revision is now accepted and ready to land.Jun 16 2017, 12:13 PM

This was an mixture of internal apple projects (user apps, drivers, deamons, whatever) with a relatively balanced selection of languages and levels of analyzer adoption. They amounted to ~16 hours of analysis CPU time (i.e. 4 hours on a quad-core machine per run). I've also ran it on LLVM separately, and had similar observations. I'm totally welcoming the feedback from other codebases!

In D34277#782605, @zaks.anna wrote:

Maybe we should introduce another UMK_* for deeper analysis instead?

The difference here is not substantial enough to warrant a new level. The general motivation for bumping these numbers is that we've set the timeouts many years ago and the hardware improved quite a bit since then.

Yeah, the point was mostly about default settings, for people who don't bother to tweak them, and adding more options essentially defeats the purpose.

While I have no objections, I am wondering whether this is the good way to spend the performance budget. In particular, there are patches to generate more symbolic expressions, that could also degrade the performance (but fix some fixmes along the way).

Ok, I hope this will work. Anyway, we can always revert this patch in case of any problems.

Gabor makes such a good point. Maybe we should commit the zombie symbols patch as well (:

In particular, there are patches to generate more symbolic expressions, that could also degrade the performance (but fix some fixmes along the way).

The patch you are talking about might be promising, but needs much more investigation and tuning for performance vs issues found. I do not think we should block this patch until the investigation is done.

Closed by commit rL305900: [analyzer] Bump a few default performance thresholds. (authored by dergachev). · Explain WhyJun 21 2017, 4:30 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

cfe/

trunk/

lib/

StaticAnalyzer/

Core/

AnalyzerOptions.cpp

4 lines

test/

Analysis/

analyzer-config.c

4 lines

analyzer-config.cpp

4 lines

Diff 103349

cfe/trunk/lib/StaticAnalyzer/Core/AnalyzerOptions.cpp

Show First 20 Lines • Show All 287 Lines • ▼ Show 20 Lines	if (!MaxInlinableSize.hasValue()) {
UserModeKind HighLevelMode = getUserMode();		UserModeKind HighLevelMode = getUserMode();
switch (HighLevelMode) {		switch (HighLevelMode) {
default:		default:
llvm_unreachable("Invalid mode.");		llvm_unreachable("Invalid mode.");
case UMK_Shallow:		case UMK_Shallow:
DefaultValue = 4;		DefaultValue = 4;
break;		break;
case UMK_Deep:		case UMK_Deep:
DefaultValue = 50;		DefaultValue = 100;
break;		break;
}		}

MaxInlinableSize = getOptionAsInteger("max-inlinable-size", DefaultValue);		MaxInlinableSize = getOptionAsInteger("max-inlinable-size", DefaultValue);
}		}
return MaxInlinableSize.getValue();		return MaxInlinableSize.getValue();
}		}

Show All 22 Lines	if (!MaxNodesPerTopLevelFunction.hasValue()) {
UserModeKind HighLevelMode = getUserMode();		UserModeKind HighLevelMode = getUserMode();
switch (HighLevelMode) {		switch (HighLevelMode) {
default:		default:
llvm_unreachable("Invalid mode.");		llvm_unreachable("Invalid mode.");
case UMK_Shallow:		case UMK_Shallow:
DefaultValue = 75000;		DefaultValue = 75000;
break;		break;
case UMK_Deep:		case UMK_Deep:
DefaultValue = 150000;		DefaultValue = 225000;
break;		break;
}		}
MaxNodesPerTopLevelFunction = getOptionAsInteger("max-nodes", DefaultValue);		MaxNodesPerTopLevelFunction = getOptionAsInteger("max-nodes", DefaultValue);
}		}
return MaxNodesPerTopLevelFunction.getValue();		return MaxNodesPerTopLevelFunction.getValue();
}		}

bool AnalyzerOptions::shouldSynthesizeBodies() {		bool AnalyzerOptions::shouldSynthesizeBodies() {
Show All 29 Lines

cfe/trunk/test/Analysis/analyzer-config.c

	Show All 13 Lines
	// CHECK-NEXT: cfg-conditional-static-initializers = true			// CHECK-NEXT: cfg-conditional-static-initializers = true
	// CHECK-NEXT: cfg-temporary-dtors = false			// CHECK-NEXT: cfg-temporary-dtors = false
	// CHECK-NEXT: faux-bodies = true			// CHECK-NEXT: faux-bodies = true
	// CHECK-NEXT: graph-trim-interval = 1000			// CHECK-NEXT: graph-trim-interval = 1000
	// CHECK-NEXT: inline-lambdas = true			// CHECK-NEXT: inline-lambdas = true
	// CHECK-NEXT: ipa = dynamic-bifurcate			// CHECK-NEXT: ipa = dynamic-bifurcate
	// CHECK-NEXT: ipa-always-inline-size = 3			// CHECK-NEXT: ipa-always-inline-size = 3
	// CHECK-NEXT: leak-diagnostics-reference-allocation = false			// CHECK-NEXT: leak-diagnostics-reference-allocation = false
	// CHECK-NEXT: max-inlinable-size = 50			// CHECK-NEXT: max-inlinable-size = 100
	// CHECK-NEXT: max-nodes = 150000			// CHECK-NEXT: max-nodes = 225000
	// CHECK-NEXT: max-times-inline-large = 32			// CHECK-NEXT: max-times-inline-large = 32
	// CHECK-NEXT: min-cfg-size-treat-functions-as-large = 14			// CHECK-NEXT: min-cfg-size-treat-functions-as-large = 14
	// CHECK-NEXT: mode = deep			// CHECK-NEXT: mode = deep
	// CHECK-NEXT: region-store-small-struct-limit = 2			// CHECK-NEXT: region-store-small-struct-limit = 2
	// CHECK-NEXT: widen-loops = false			// CHECK-NEXT: widen-loops = false
	// CHECK-NEXT: [stats]			// CHECK-NEXT: [stats]
	// CHECK-NEXT: num-entries = 15			// CHECK-NEXT: num-entries = 15

cfe/trunk/test/Analysis/analyzer-config.cpp

	Show All 24 Lines
	// CHECK-NEXT: cfg-conditional-static-initializers = true			// CHECK-NEXT: cfg-conditional-static-initializers = true
	// CHECK-NEXT: cfg-temporary-dtors = false			// CHECK-NEXT: cfg-temporary-dtors = false
	// CHECK-NEXT: faux-bodies = true			// CHECK-NEXT: faux-bodies = true
	// CHECK-NEXT: graph-trim-interval = 1000			// CHECK-NEXT: graph-trim-interval = 1000
	// CHECK-NEXT: inline-lambdas = true			// CHECK-NEXT: inline-lambdas = true
	// CHECK-NEXT: ipa = dynamic-bifurcate			// CHECK-NEXT: ipa = dynamic-bifurcate
	// CHECK-NEXT: ipa-always-inline-size = 3			// CHECK-NEXT: ipa-always-inline-size = 3
	// CHECK-NEXT: leak-diagnostics-reference-allocation = false			// CHECK-NEXT: leak-diagnostics-reference-allocation = false
	// CHECK-NEXT: max-inlinable-size = 50			// CHECK-NEXT: max-inlinable-size = 100
	// CHECK-NEXT: max-nodes = 150000			// CHECK-NEXT: max-nodes = 225000
	// CHECK-NEXT: max-times-inline-large = 32			// CHECK-NEXT: max-times-inline-large = 32
	// CHECK-NEXT: min-cfg-size-treat-functions-as-large = 14			// CHECK-NEXT: min-cfg-size-treat-functions-as-large = 14
	// CHECK-NEXT: mode = deep			// CHECK-NEXT: mode = deep
	// CHECK-NEXT: region-store-small-struct-limit = 2			// CHECK-NEXT: region-store-small-struct-limit = 2
	// CHECK-NEXT: widen-loops = false			// CHECK-NEXT: widen-loops = false
	// CHECK-NEXT: [stats]			// CHECK-NEXT: [stats]
	// CHECK-NEXT: num-entries = 20			// CHECK-NEXT: num-entries = 20

This is an archive of the discontinued LLVM Phabricator instance.

[analyzer] Bump default performance thresholds?ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 103349

cfe/trunk/lib/StaticAnalyzer/Core/AnalyzerOptions.cpp

cfe/trunk/test/Analysis/analyzer-config.c

cfe/trunk/test/Analysis/analyzer-config.cpp

[analyzer] Bump default performance thresholds?
ClosedPublic