This is an archive of the discontinued LLVM Phabricator instance.

[sancov] fixing too aggressive instrumentation elimination
AbandonedPublic

Authored by vitalybuka on Mar 22 2017, 4:13 PM.

Download Raw Diff

Details

Reviewers

eugenis
aizatsky

Diff Detail

Build Status

Buildable 5064
Build 5064: arc lint + arc unit

Event Timeline

aizatsky created this revision.Mar 22 2017, 4:13 PM

Harbormaster completed remote builds in B4989: Diff 92733.Mar 22 2017, 4:13 PM

aizatsky added a subscriber: llvm-commits.Mar 22 2017, 4:13 PM

LGTM

This revision is now accepted and ready to land.Mar 22 2017, 4:25 PM

I don't like an algorithm were we determine the blocks to instrument while traversing and instrumenting other blocks.
It's much harder to reasons about.

kcc added inline comments.Mar 23 2017, 4:38 PM

lib/Transforms/Instrumentation/SanitizerCoverage.cpp
423	what if you add here something like this: for (const BasicBlock *SUCC : make_range(succ_begin(BB), succ_end(BB))) { if (PDT->dominates(SUCC, BB)) return false; }

checking for post-dominators

aizatsky added inline comments.Mar 24 2017, 9:12 AM

lib/Transforms/Instrumentation/SanitizerCoverage.cpp
423	Done. Also rearranged comments. PTAL.

Do we need a test this big?
We may just have a test with two basic blocks, don't we?

vitalybuka requested changes to this revision.Apr 24 2017, 4:21 PM

This revision now requires changes to proceed.Apr 24 2017, 4:21 PM

vitalybuka commandeered this revision.Apr 24 2017, 4:22 PM

vitalybuka edited reviewers, added: aizatsky; removed: vitalybuka.

"commandeered" here means only that I need to finish this.

m.ostapenko added subscribers: m.ostapenko, m.guseva, d.nikiforov.May 18 2017, 12:29 AM

I think we can make a much simpler change: what if we just skip the optimization of not instrumenting post-dominators?

We apply optimizations in order to instrument less blocks without loss of precision, thus we don't instrument blocks
the path through which is uniquely identifiable using other blocks.
The problems start once this reasoning gets circular, and to me it's not obvious how this patch would solve this problem
in general (it very well may, and maybe I'm not looking close enough, but at least it's not obvious).

However, if we just apply the optimization for full dominators (not instrumenting these nodes)
we'll never get circular arguments, as domination relation is never circular.
From a limited set of examples I've tried, the post-dominator optimization produced a very tiny benefit
after the dominator optimization was applied.
Moreover, I could only see the benefit for a corner case of nodes with no successors (and if not instrumenting them
does speed fuzzing up, it might be easier to just check the is-strong-postdominator condition on those).

In D31266#759155, @george.karpenkov wrote:

I think we can make a much simpler change: what if we just skip the optimization of not instrumenting post-dominators?

It's clearly very simple.
The question is whether it's too pessimistic.

We apply optimizations in order to instrument less blocks without loss of precision, thus we don't instrument blocks
the path through which is uniquely identifiable using other blocks.
The problems start once this reasoning gets circular, and to me it's not obvious how this patch would solve this problem
in general (it very well may, and maybe I'm not looking close enough, but at least it's not obvious).

However, if we just apply the optimization for full dominators (not instrumenting these nodes)
we'll never get circular arguments, as domination relation is never circular.
From a limited set of examples I've tried, the post-dominator optimization produced a very tiny benefit
after the dominator optimization was applied.

Can you get some exact numbers on real code?
E.g. take https://github.com/google/fuzzer-test-suite/blob/master/sqlite-2016-11-14/sqlite3.c
(a single-file large chunk of code)

The numbers I remember were like DOM gives 30% saving, PDOM gives 20% more, which is a lot.

Moreover, I could only see the benefit for a corner case of nodes with no successors (and if not instrumenting them
does speed fuzzing up, it might be easier to just check the is-strong-postdominator condition on those).

The numbers I remember were like DOM gives 30% saving, PDOM gives 20% more, which is a lot.

But how would we know whether those numbers are good?
E.g. is it saving 20% of unneeded instrumentation, or missing 20% of code which actually needs to be instrumented?
LibFuzzer would find lots of bugs regardless, right?

With programs that large it would be hard to simply look at IR, and check whether instrumentation is spurious.

On your example the difference is 16k vs 22k, but it's not clear whether those extra calls are spurious.

In D31266#759656, @george.karpenkov wrote:

The numbers I remember were like DOM gives 30% saving, PDOM gives 20% more, which is a lot.

But how would we know whether those numbers are good?

Good question...
I was thinking about using libFuzzer itself to decide.
E.g. take a large corpus for some target (sqlite would be fine) and minimize it w/ and w/o optimization.

E.g. is it saving 20% of unneeded instrumentation, or missing 20% of code which actually needs to be instrumented?

My expectation is most of those 20% are really redundant.
But I did not invest much time into this problem.

LibFuzzer would find lots of bugs regardless, right?

Maybe not. When Mike implemented this optimization I compared a couple of targets and did not see any difference.

With programs that large it would be hard to simply look at IR, and check whether instrumentation is spurious.

After an offline discussion...
Looks like me and Vitaly both are busy with other stuff and this thing seems to be blocking you and a few others (and maybe prevents us from finding more bugs).
So let's just delete the PDOM part as George suggests and then later come up with a better strategy (and a way to test it).

George, want to send a separate CL?

Somewhat unrelated, just want to mention it here: I hope to get to implementing yet another way of coverage instrumentation which will not involve callbacks, just a single increment.
It's easy to do just that, but I also want to preserve the association bettween the counter and the BB, which is a bit trickier (and requires http://llvm.org/docs/LangRef.html#addresses-of-basic-blocks to work reliably).
This task is orthogonal to the optimization discussed here.

@kcc: sure, great!

In the spirit of disclosure, I'm now pretty certain I'm actually hitting a separate bug where the critical edges are not being split (even though SanitizerCoverage pass calls the proper function. Probably another after instrumentation adds them?.. ).

(but my stuff actually does work if I disable PDT optimization; it's the case of two bugs interacting)

In D31266#759809, @george.karpenkov wrote:

@kcc: sure, great!

In the spirit of disclosure, I'm now pretty certain I'm actually hitting a separate bug where the critical edges are not being split (even though SanitizerCoverage pass calls the proper function. Probably another after instrumentation adds them?.. ).

This might be a problem specific to the SWIFT driver...

@kcc does not seem to be, cf. https://github.com/google/sanitizers/issues/809

george.karpenkov mentioned this in D33472: A simple coverage optimization.May 23 2017, 6:38 PM

(https://github.com/google/sanitizers/issues/783 fixed with r303698

vitalybuka abandoned this revision.May 26 2017, 5:22 PM

Revision Contents

Path

Size

lib/

Transforms/

Instrumentation/

SanitizerCoverage.cpp

33 lines

test/

Instrumentation/

SanitizerCoverage/

prune-blocks.ll

130 lines

Diff 92956

lib/Transforms/Instrumentation/SanitizerCoverage.cpp

Show First 20 Lines • Show All 402 Lines • ▼ Show 20 Lines	std::tie(CtorFunc, std::ignore) = createSanitizerCtorAndInitFunctions(
IRB.CreatePointerCast(ModuleName, Int8PtrTy)});		IRB.CreatePointerCast(ModuleName, Int8PtrTy)});

appendToGlobalCtors(M, CtorFunc, SanCtorAndDtorPriority);		appendToGlobalCtors(M, CtorFunc, SanCtorAndDtorPriority);
}		}

return true;		return true;
}		}

// True if block has successors and it dominates all of them.		// True if block has predecessors and it postdominates all of them.
static bool isFullDominator(const BasicBlock BB, const DominatorTree DT) {		// If a block is full post dominator, then all paths through any of its
if (succ_begin(BB) == succ_end(BB))		// preds pass through the block.
		static bool isFullPostDominator(const BasicBlock *BB,
		const PostDominatorTree *PDT) {
		if (pred_begin(BB) == pred_end(BB))
return false;		return false;

for (const BasicBlock *SUCC : make_range(succ_begin(BB), succ_end(BB))) {		for (const BasicBlock *PRED : make_range(pred_begin(BB), pred_end(BB))) {
if (!DT->dominates(BB, SUCC))		if (!PDT->dominates(BB, PRED))
return false;		return false;
}		}

		kccUnsubmitted Not Done Reply Inline Actions what if you add here something like this: for (const BasicBlock SUCC : make_range(succ_begin(BB), succ_end(BB))) { if (PDT->dominates(SUCC, BB)) return false; } kcc:* what if you add here something like this: for (const BasicBlock *SUCC : make_range(succ_begin…
		aizatskyUnsubmitted Not Done Reply Inline Actions Done. Also rearranged comments. PTAL. aizatsky: Done. Also rearranged comments. PTAL.
return true;		return true;
}		}

// True if block has predecessors and it postdominates all of them.		// True if block has successors and it dominates all of them AND
static bool isFullPostDominator(const BasicBlock *BB,		// none of successors post-dominate the block.
		// It is tempting to skip all pre dominators as well, but the argument
		// doesn't work anymore, because some successors might already be skipped.
		// Linear sequence of blocks is a good example.
		static bool shouldSkipDominator(const BasicBlock BB, const DominatorTree DT,
const PostDominatorTree *PDT) {		const PostDominatorTree *PDT) {
if (pred_begin(BB) == pred_end(BB))		if (succ_begin(BB) == succ_end(BB))
return false;		return false;

for (const BasicBlock *PRED : make_range(pred_begin(BB), pred_end(BB))) {		for (const BasicBlock *SUCC : make_range(succ_begin(BB), succ_end(BB))) {
if (!PDT->dominates(BB, PRED))		if (!DT->dominates(BB, SUCC) \|\| PDT->dominates(SUCC, BB))
return false;		return false;
}		}

return true;		return true;
}		}

static bool shouldInstrumentBlock(const Function& F, const BasicBlock BB, const DominatorTree DT,		static bool shouldInstrumentBlock(const Function &F, const BasicBlock *BB,
		const DominatorTree *DT,
const PostDominatorTree *PDT) {		const PostDominatorTree *PDT) {
// Don't insert coverage for unreachable blocks: we will never call		// Don't insert coverage for unreachable blocks: we will never call
// __sanitizer_cov() for them, so counting them in		// __sanitizer_cov() for them, so counting them in
// NumberOfInstrumentedBlocks() might complicate calculation of code coverage		// NumberOfInstrumentedBlocks() might complicate calculation of code coverage
// percentage. Also, unreachable instructions frequently have no debug		// percentage. Also, unreachable instructions frequently have no debug
// locations.		// locations.
if (isa<UnreachableInst>(BB->getTerminator()))		if (isa<UnreachableInst>(BB->getTerminator()))
return false;		return false;

if (!ClPruneBlocks \|\| &F.getEntryBlock() == BB)		if (!ClPruneBlocks \|\| &F.getEntryBlock() == BB)
return true;		return true;

return !(isFullDominator(BB, DT) \|\| isFullPostDominator(BB, PDT));		return !(isFullPostDominator(BB, PDT) \|\| shouldSkipDominator(BB, DT, PDT));
}		}

bool SanitizerCoverageModule::runOnFunction(Function &F) {		bool SanitizerCoverageModule::runOnFunction(Function &F) {
if (F.empty())		if (F.empty())
return false;		return false;
if (F.getName().find(".module_ctor") != std::string::npos)		if (F.getName().find(".module_ctor") != std::string::npos)
return false; // Should not instrument sanitizer init functions.		return false; // Should not instrument sanitizer init functions.
if (F.getName().startswith("__sanitizer_"))		if (F.getName().startswith("__sanitizer_"))
▲ Show 20 Lines • Show All 52 Lines • ▼ Show 20 Lines	bool SanitizerCoverageModule::runOnFunction(Function &F) {
InjectCoverage(F, BlocksToInstrument);		InjectCoverage(F, BlocksToInstrument);
InjectCoverageForIndirectCalls(F, IndirCalls);		InjectCoverageForIndirectCalls(F, IndirCalls);
InjectTraceForCmp(F, CmpTraceTargets);		InjectTraceForCmp(F, CmpTraceTargets);
InjectTraceForSwitch(F, SwitchTraceTargets);		InjectTraceForSwitch(F, SwitchTraceTargets);
InjectTraceForDiv(F, DivTraceTargets);		InjectTraceForDiv(F, DivTraceTargets);
InjectTraceForGep(F, GepTraceTargets);		InjectTraceForGep(F, GepTraceTargets);
return true;		return true;
}		}

void SanitizerCoverageModule::CreateFunctionGuardArray(size_t NumGuards,		void SanitizerCoverageModule::CreateFunctionGuardArray(size_t NumGuards,
Function &F) {		Function &F) {
if (!Options.TracePCGuard) return;		if (!Options.TracePCGuard) return;
HasSancovGuardsSection = true;		HasSancovGuardsSection = true;
ArrayType *ArrayOfInt32Ty = ArrayType::get(Int32Ty, NumGuards);		ArrayType *ArrayOfInt32Ty = ArrayType::get(Int32Ty, NumGuards);
FunctionGuardArray = new GlobalVariable(		FunctionGuardArray = new GlobalVariable(
*CurModule, ArrayOfInt32Ty, false, GlobalVariable::PrivateLinkage,		*CurModule, ArrayOfInt32Ty, false, GlobalVariable::PrivateLinkage,
Constant::getNullValue(ArrayOfInt32Ty), "__sancov_gen_");		Constant::getNullValue(ArrayOfInt32Ty), "__sancov_gen_");
▲ Show 20 Lines • Show All 276 Lines • Show Last 20 Lines

test/Instrumentation/SanitizerCoverage/prune-blocks.ll

This file was added.

				; First example from https://github.com/google/sanitizers/issues/783
				; RUN: opt < %s -sancov -sanitizer-coverage-level=4 -sanitizer-coverage-trace-pc -sanitizer-coverage-prune-blocks=1 -S \| FileCheck %s --check-prefix=CHECK

				source_filename = "reduced.c"
				target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
				target triple = "x86_64-unknown-linux-gnu"

				; Function Attrs: noinline nounwind uwtable
				define i32 @foo(i8 signext %c, i32 %state, i32 %threshold) #0 {
				entry:
				%c.addr = alloca i8, align 1
				%state.addr = alloca i32, align 4
				%threshold.addr = alloca i32, align 4
				store i8 %c, i8* %c.addr, align 1
				store i32 %state, i32* %state.addr, align 4
				store i32 %threshold, i32* %threshold.addr, align 4
				br label %while.cond

				; CHECK-LABEL: entry:
				; CHECK: call void @__sanitizer_cov_trace_pc()

				while.cond: ; preds = %sw.epilog, %entry
				%0 = load i32, i32* %state.addr, align 4
				%1 = load i32, i32* %threshold.addr, align 4
				%cmp = icmp slt i32 %0, %1
				br i1 %cmp, label %while.body, label %while.end

				; CHECK-LABEL: while.cond:
				; CHECK-NOT: call void @__sanitizer_cov_trace_pc()

				while.body: ; preds = %while.cond
				%2 = load i32, i32* %state.addr, align 4
				switch i32 %2, label %sw.epilog [
				i32 1, label %sw.bb
				i32 2, label %sw.bb9
				]

				; CHECK-LABEL: while.body:
				; CHECK-NOT: call void @__sanitizer_cov_trace_pc()
				; CHECK-LABEL: while.body.sw.epilog_crit_edge:
				; CHECK: call void @__sanitizer_cov_trace_pc()

				sw.bb: ; preds = %while.body
				call void @clobber1(i32* %state.addr)
				%3 = load i8, i8* %c.addr, align 1
				%conv = sext i8 %3 to i32
				%cmp1 = icmp eq i32 %conv, 42
				br i1 %cmp1, label %if.then, label %if.else

				; CHECK-LABEL: sw.bb:
				; CHECK-NOT: call void @__sanitizer_cov_trace_pc()

				if.then: ; preds = %sw.bb
				%4 = load i32, i32* %state.addr, align 4
				%inc = add nsw i32 %4, 1
				store i32 %inc, i32* %state.addr, align 4
				br label %if.end8

				; CHECK-LABEL: if.then:
				; CHECK: call void @__sanitizer_cov_trace_pc()

				if.else: ; preds = %sw.bb
				%5 = load i8, i8* %c.addr, align 1
				%conv3 = sext i8 %5 to i32
				%cmp4 = icmp eq i32 %conv3, 47
				br i1 %cmp4, label %if.then6, label %if.else7

				; CHECK-LABEL: if.else:
				; CHECK-NOT: call void @__sanitizer_cov_trace_pc()

				if.then6: ; preds = %if.else
				%6 = load i32, i32* %state.addr, align 4
				%dec = add nsw i32 %6, -1
				store i32 %dec, i32* %state.addr, align 4
				br label %if.end

				; CHECK-LABEL: if.then6:
				; CHECK: call void @__sanitizer_cov_trace_pc()

				if.else7: ; preds = %if.else
				call void @clobber1(i32* %state.addr)
				br label %out

				; CHECK-LABEL: if.else7:
				; CHECK: call void @__sanitizer_cov_trace_pc()

				if.end: ; preds = %if.then6
				br label %if.end8

				; CHECK-LABEL: if.end:
				; CHECK-NOT: call void @__sanitizer_cov_trace_pc()

				if.end8: ; preds = %if.end, %if.then
				call void @clobber2(i32* %state.addr)
				br label %sw.epilog

				; CHECK-LABEL: if.end8:
				; CHECK-NOT: call void @__sanitizer_cov_trace_pc()

				sw.bb9: ; preds = %while.body
				call void @clobber3(i32* %state.addr)
				br label %sw.epilog

				; CHECK-LABEL: sw.bb9:
				; CHECK: call void @__sanitizer_cov_trace_pc()

				sw.epilog: ; preds = %while.body, %sw.bb9, %if.end8
				br label %while.cond

				; CHECK-LABEL: sw.epilog:
				; CHECK-NOT: call void @__sanitizer_cov_trace_pc()

				while.end: ; preds = %while.cond
				br label %out

				; CHECK-LABEL: while.end:
				; CHECK: call void @__sanitizer_cov_trace_pc()

				out: ; preds = %while.end, %if.else7
				%7 = load i32, i32* %state.addr, align 4
				ret i32 %7

				; CHECK-LABEL: out:
				; CHECK-NOT: call void @__sanitizer_cov_trace_pc()
				}

				declare void @clobber1(i32*) #1
				declare void @clobber2(i32*) #1
				declare void @clobber3(i32*) #1