This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
compiler-rt/lib/fuzzer/
-
lib/
-
fuzzer/
5/5
FuzzerCorpus.h
-
FuzzerDriver.cpp
5/5
FuzzerFlags.def
3/3
FuzzerFork.cpp
2/2
FuzzerInternal.h
12/13
FuzzerLoop.cpp
-
FuzzerOptions.h
-
tests/
4/4
FuzzerUnittest.cpp

Differential D86577

[libFuzzer] Add an option to keep initial seed inputs around.
ClosedPublic

Authored by dokyungs on Aug 25 2020, 2:32 PM.

Download Raw Diff

Details

Reviewers

morehouse
hctim
kcc

Commits

rG62673c430de4: [libFuzzer] Add an option to keep initial seed inputs around.

Summary

This patch adds an option "keep_seed" to keep all initial seed inputs. Previously, only the initial seed inputs that find new coverage were added to the corpus, and all the other initial inputs were discarded. We observed in some circumstances that useful initial seed inputs are discarded as they find no new coverage, even though they contain useful fragments in them (e.g., SQLITE3 FuzzBench benchmark). This newly added option provides a way to keeping seed inputs for those circumstances. With this patch, and with -keep_seed=1, all initial seed inputs are kept regardless of whether they find new coverage or not. Further, these seed inputs are not replaced with smaller inputs even if -reduce_inputs=1.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

dokyungs created this revision.Aug 25 2020, 2:32 PM

Herald added a project: Restricted Project. · View Herald TranscriptAug 25 2020, 2:32 PM

Herald added a subscriber: Restricted Project. · View Herald Transcript

dokyungs requested review of this revision.Aug 25 2020, 2:32 PM

Add missing code.

Harbormaster completed remote builds in B69512: Diff 287778.Aug 25 2020, 2:57 PM

Harbormaster completed remote builds in B69513: Diff 287779.Aug 25 2020, 3:20 PM

Is it possible to fork the cross_over_uniformdist changes into a supplementary patchset easily? Seems like an related but indepentent change (prefer one patchset per feature if possible please -- reviewing is O(n^3) in lines-of-code-per-patch and it's also nice to have different commits for different features).

compiler-rt/lib/fuzzer/FuzzerFlags.def
27	`When used with \|reduce_inputs==1\|, the seed inputs will never be reduced.`
compiler-rt/lib/fuzzer/FuzzerFork.cpp
316	nit: spaces not tabs (and throughout)
compiler-rt/lib/fuzzer/FuzzerLoop.cpp
490	Can't this be derived by `II->SeedInput` (as elsewhere)? It would be much cleaner to not have to maintain this variable in the class just for initialization. If it can't be derived by that, can you please make it a function paramater (you can even default it to `false` to avoid having to change all the other instances)
799	nit: `number of corpus inputs = %d\n` (just because it's absolutely 100% clear that it's number-of-units, not number-of-bytes)
compiler-rt/lib/fuzzer/tests/FuzzerUnittest.cpp
1061–1062	`std::unique_ptr<InputCorpus> C(new InputCorpus("", Entropic, /* KeepSeed */ false));` (and elsewhere)

Addressed comments. Added keep-seed.test to support/test usefulness of the -keep_seed=1 flag.

dokyungs marked 4 inline comments as done.Aug 26 2020, 9:49 AM

dokyungs marked an inline comment as done.

dokyungs added inline comments.

compiler-rt/lib/fuzzer/FuzzerFork.cpp
316	I just checked that these are actually spaces.

In D86577#2237648, @hctim wrote:

Is it possible to fork the cross_over_uniformdist changes into a supplementary patchset easily? Seems like an related but indepentent change (prefer one patchset per feature if possible please -- reviewing is O(n^3) in lines-of-code-per-patch and it's also nice to have different commits for different features).

Yes, it is definitely possible. I will do in the next upload though, because I'd like to see if we can have some discussion here about how to make a good use the kept seed inputs.

Harbormaster completed remote builds in B69625: Diff 288016.Aug 26 2020, 10:14 AM

hctim added inline comments.Aug 26 2020, 10:42 AM

compiler-rt/lib/fuzzer/FuzzerCorpus.h
284	nit: `ChooseUnweightedInputToMutate` or something like that. Make it clear that this is a uniform selection, not a bias-weighted selection.
compiler-rt/lib/fuzzer/FuzzerFlags.def
29	nit: `cross_over_uniform_dist`
compiler-rt/lib/fuzzer/FuzzerFork.cpp
316	Ah - I see, it's just the way that phabricator displays that the indentation level was increased. Interesting...
compiler-rt/lib/fuzzer/FuzzerLoop.cpp
780	nit: please also give these unnamed constants the `/* ArgName */` treatment
compiler-rt/lib/fuzzer/tests/FuzzerUnittest.cpp
602	While you're here can you give these the `/* ArgName */` treatment as well, thanks
compiler-rt/test/fuzzer/keep-seed.test
8 ↗	(On Diff #288016)	Hmm - maybe a better test would be to run with and without `keep_seed` and verify that `keep_seed` can find it in less iterations? Can you also add a quick comment with the amount of runs required on your machine to trigger this bug with and without this patch? Thanks.

Addressed the comments except for the test comment (will follow up on the next upload), and removed uniform dist changes which will be uploaded as a separate patch.

dokyungs edited the summary of this revision. (Show Details)Aug 31 2020, 9:57 AM

dokyungs marked 4 inline comments as done.

dokyungs added inline comments.

compiler-rt/lib/fuzzer/FuzzerCorpus.h
284	Will reflect this comment in a separate patch.
compiler-rt/lib/fuzzer/FuzzerFlags.def
29	Reflect this comment in a separate patch.

Harbormaster completed remote builds in B70112: Diff 288973.Aug 31 2020, 10:23 AM

Addressed the comment on the test.

With -keep_seed=0, it takes 5,150,128 execs to find the crash.
With -keep_seed=1, it takes 1,049,074 execs to find the crash. (because the seed input is not reduced.)

dokyungs marked an inline comment as done.Aug 31 2020, 10:24 AM

dokyungs added inline comments.

compiler-rt/test/fuzzer/keep-seed.test
8 ↗	(On Diff #288016)	Done. With -keep_seed=0, it takes 5,150,128 execs to find the crash. With -keep_seed=1, it takes 1,049,074 execs to find the crash. (because the seed input is not reduced.)

Fix comment in KeepSeedTest.cpp

Harbormaster completed remote builds in B70119: Diff 288982.Aug 31 2020, 10:51 AM

Harbormaster completed remote builds in B70120: Diff 288983.Aug 31 2020, 11:15 AM

morehouse added inline comments.Aug 31 2020, 5:13 PM

compiler-rt/lib/fuzzer/FuzzerCorpus.h
186	Feel free to disregard, but I'd like to protest the length of this parameter list. I would happily review a separate patch that cleaned it up.
compiler-rt/lib/fuzzer/FuzzerFlags.def
28	Please also document the intended use case (i.e. when seeds are not properly formed for the fuzz target but still have useful snippets.
compiler-rt/lib/fuzzer/FuzzerInternal.h
71	Nit: Input params should come before output params
compiler-rt/lib/fuzzer/FuzzerLoop.cpp
481	I think it's cleaner if we avoid checking `Options.KeepSeed` in this function, and instead check it in the caller. Then we could rename `SeedInput` to `ForceAddToCorpus` for clarity.
799	Isn't this information already printed above?
compiler-rt/lib/fuzzer/tests/FuzzerUnittest.cpp
597	Side note, feel free to ignore: Now that LLVM uses C++14, all these `unique_ptr` patterns can be simplified with `std::make_unique`: auto C = std::make_unique<InputCorpus>("", Entropic, false); (Separate patch is welcome)
compiler-rt/test/fuzzer/KeepSeedTest.cpp
34 ↗	(On Diff #288983)	Please clang-format this test.

Partially addressed comments.

dokyungs marked 6 inline comments as done.Sep 1 2020, 8:50 AM

dokyungs added inline comments.

compiler-rt/lib/fuzzer/FuzzerCorpus.h
186	Thanks! will send a separate patch!
compiler-rt/lib/fuzzer/FuzzerFlags.def
28	Done!
compiler-rt/lib/fuzzer/FuzzerInternal.h
71	Done.
compiler-rt/lib/fuzzer/FuzzerLoop.cpp
481	I tried to do this, but wasn't sure the scope of change you were thinking of. Do you want the `SeedInput` member variable of `InputInfo` to be changed to `ForceAddToCorpus` too? We need to keep this information in `InputInfo` to prevent it being reduced to a smaller input here.
799	Removed. Printing wasn't necessary here.
compiler-rt/lib/fuzzer/tests/FuzzerUnittest.cpp
597	Thanks for the suggestion! Will send a separate patch.
compiler-rt/test/fuzzer/KeepSeedTest.cpp
34 ↗	(On Diff #288983)	Done. `compiler-rt/test/.clang-format` has `ColumnLimit: 0` so I had to temporarily remove it to get 80 chars in each line.

Harbormaster completed remote builds in B70248: Diff 289187.Sep 1 2020, 9:26 AM

morehouse added inline comments.Sep 1 2020, 10:37 AM

compiler-rt/lib/fuzzer/FuzzerLoop.cpp
481	I just meant we could simplify slightly in this function by changing the param to `ForceAddToCorpus` and then unconditionally adding to corpus if its true. We can check the `KeepSeed` flag in the caller to determine if we want to force add the input or not.

dokyungs marked 6 inline comments as done.Sep 1 2020, 11:18 AM

dokyungs added inline comments.

compiler-rt/lib/fuzzer/FuzzerLoop.cpp
481	I am not sure if I can simplify this that way, because `SeedInput` needs to propagate; it's stored in the `InputInfo`s of all seed inputs when `InputInfo`s are created by `AddToCorpus`. Storing this in a member variable `SeedInput` of `InputInfo` is needed to prevent them from being replaced. The above line prevents them from being replaced with new, non-seed inputs that's based on the seed input. It seems to me that, if we change the argument `SeedInput` to `ForceAddToCorpus`, `SeedInput` of `InputInfo` needs changing too. Please correct me if I am still misled :)

morehouse added inline comments.Sep 1 2020, 11:31 AM

compiler-rt/lib/fuzzer/FuzzerLoop.cpp
481	Ok, I think I was confused because we have both `SeedInput` and `II->SeedInput` used in this function. Is there any way we can make this simpler? Having to check 3 things that seem similar (`Options.KeepSeed`, `SeedInput`, `II->SeedInput`) in this single function is tricky.

dokyungs added inline comments.Sep 2 2020, 9:11 AM

compiler-rt/lib/fuzzer/FuzzerLoop.cpp
481	Yes, I share the sense that this is confusing. What do you think of adding a new flag called `reduce_seed_inputs`, and check `!Options.ReduceSeedInputs && II->SeedInput` here rather than `Options.KeepSeed && II->SeedInput`? This is probably not simplifying things since it's more code, but adding a new flag could probably reduce possible confusion because it's more explanatory.

hctim added inline comments.Sep 2 2020, 2:27 PM

compiler-rt/lib/fuzzer/FuzzerCorpus.h
135	Vestigal - please delete.
compiler-rt/test/fuzzer/keep-seed.test
8 ↗	(On Diff #288016)	Can you also explicitly check that the number of runs with `-keep_seed=1` uses less iterations? You can get the data by providing `-print_final_stats=1`, you can grep the number by `stat::number_of_executed_units`

morehouse added inline comments.Sep 2 2020, 2:40 PM

compiler-rt/lib/fuzzer/FuzzerLoop.cpp
481	One idea: Rename `II->KeepSeed` to `II->NeverReduce`, and only set it to true if `Options.KeepSeed == true`. Rename the parameter `KeepSeed` to `ForceAddToCorpus`. This seems more understandable to me, and we don't need to check `Options.KeepSeed` at all in this function.

Removed unused var.

compiler-rt/test/fuzzer/keep-seed.test
8 ↗	(On Diff #288016)	Currently it's implicitly implied that -keep_seed=1 uses less iterations (i) with `not` and 2 million runs above, and (ii) without `not` and 4 million runs below. Do you have any idea how I can explicitly check this in a lit test? Can I use bash's `-lt`?

hctim added inline comments.Sep 2 2020, 3:33 PM

compiler-rt/test/fuzzer/keep-seed.test
8 ↗	(On Diff #288016)	Actually - what you have currently seems reasonable to me - but would you mind adding a comment to clarify? I missed the "works with 2m runs with this flag, doesn't work with 4m runs without it" context at first glance :(.

Give more general but descriptive variable names as suggested.

Options.KeepSeed -> ForceAddToCorpus -> InputInfo::NeverReduce

Add comment in keep-seed.test

dokyungs marked 6 inline comments as done.Sep 2 2020, 3:42 PM

dokyungs added inline comments.

compiler-rt/lib/fuzzer/FuzzerLoop.cpp
481	Thanks for the idea. This seems a lot more descriptive, which is now reflected in the code.
compiler-rt/test/fuzzer/keep-seed.test
8 ↗	(On Diff #288016)	Done.

Reflect variable name change in FuzzerInternal.h

Harbormaster completed remote builds in B70469: Diff 289579.Sep 2 2020, 3:49 PM

Harbormaster completed remote builds in B70477: Diff 289592.Sep 2 2020, 4:53 PM

Harbormaster completed remote builds in B70478: Diff 289593.Sep 2 2020, 5:01 PM

LGTM

This revision is now accepted and ready to land.Sep 2 2020, 5:20 PM

Harbormaster completed remote builds in B70482: Diff 289598.Sep 2 2020, 5:38 PM

This revision was landed with ongoing or failed builds.Sep 3 2020, 8:55 AM

Closed by commit rG62673c430de4: [libFuzzer] Add an option to keep initial seed inputs around. (authored by dokyungs). · Explain Why

This revision was automatically updated to reflect the committed changes.

dokyungs added a commit: rG62673c430de4: [libFuzzer] Add an option to keep initial seed inputs around..

Revision Contents

Path

Size

compiler-rt/

lib/

fuzzer/

16 lines

5 lines

4 lines

14 lines

2 lines

27 lines

2 lines

tests/

FuzzerUnittest.cpp

9 lines

Diff 287779

compiler-rt/lib/fuzzer/FuzzerCorpus.h

Show All 27 Lines	struct InputInfo {
Unit U; // The actual input data.		Unit U; // The actual input data.
uint8_t Sha1[kSHA1NumBytes]; // Checksum.		uint8_t Sha1[kSHA1NumBytes]; // Checksum.
// Number of features that this input has and no smaller input has.		// Number of features that this input has and no smaller input has.
size_t NumFeatures = 0;		size_t NumFeatures = 0;
size_t Tmp = 0; // Used by ValidateFeatureSet.		size_t Tmp = 0; // Used by ValidateFeatureSet.
// Stats.		// Stats.
size_t NumExecutedMutations = 0;		size_t NumExecutedMutations = 0;
size_t NumSuccessfullMutations = 0;		size_t NumSuccessfullMutations = 0;
		bool SeedInput = false;
bool MayDeleteFile = false;		bool MayDeleteFile = false;
bool Reduced = false;		bool Reduced = false;
bool HasFocusFunction = false;		bool HasFocusFunction = false;
Vector<uint32_t> UniqFeatureSet;		Vector<uint32_t> UniqFeatureSet;
Vector<uint8_t> DataFlowTraceForFocusFunction;		Vector<uint8_t> DataFlowTraceForFocusFunction;
// Power schedule.		// Power schedule.
bool NeedsEnergyUpdate = false;		bool NeedsEnergyUpdate = false;
double Energy = 0.0;		double Energy = 0.0;
▲ Show 20 Lines • Show All 82 Lines • ▼ Show 20 Lines	class InputCorpus {
static const uint32_t kFeatureSetSize = 1 << 21;		static const uint32_t kFeatureSetSize = 1 << 21;
static const uint8_t kMaxMutationFactor = 20;		static const uint8_t kMaxMutationFactor = 20;
static const size_t kSparseEnergyUpdates = 100;		static const size_t kSparseEnergyUpdates = 100;

size_t NumExecutedMutations = 0;		size_t NumExecutedMutations = 0;

EntropicOptions Entropic;		EntropicOptions Entropic;

		bool KeepSeed = false;
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: private field 'KeepSeed' is not used [clang-diagnostic-unused-private-field] not useful Lint: Pre-merge checks: clang-tidy: warning: private field 'KeepSeed' is not used [clang-diagnostic-unused-private…
		hctimUnsubmitted Done Reply Inline Actions Vestigal - please delete. hctim: Vestigal - please delete.

public:		public:
InputCorpus(const std::string &OutputCorpus, EntropicOptions Entropic)		InputCorpus(const std::string &OutputCorpus, EntropicOptions Entropic,
: Entropic(Entropic), OutputCorpus(OutputCorpus) {		bool KeepSeed)
		: Entropic(Entropic), OutputCorpus(OutputCorpus), KeepSeed(KeepSeed) {
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: field 'OutputCorpus' will be initialized after field 'KeepSeed' [clang-diagnostic-reorder-ctor] not useful Lint: Pre-merge checks: clang-tidy: warning: field 'OutputCorpus' will be initialized after field 'KeepSeed' [clang…
memset(InputSizesPerFeature, 0, sizeof(InputSizesPerFeature));		memset(InputSizesPerFeature, 0, sizeof(InputSizesPerFeature));
memset(SmallestElementPerFeature, 0, sizeof(SmallestElementPerFeature));		memset(SmallestElementPerFeature, 0, sizeof(SmallestElementPerFeature));
}		}
~InputCorpus() {		~InputCorpus() {
for (auto II : Inputs)		for (auto II : Inputs)
delete II;		delete II;
}		}
size_t size() const { return Inputs.size(); }		size_t size() const { return Inputs.size(); }
Show All 27 Lines	size_t NumInputsWithDataFlowTrace() {
return std::count_if(Inputs.begin(), Inputs.end(), [](const InputInfo *II) {		return std::count_if(Inputs.begin(), Inputs.end(), [](const InputInfo *II) {
return !II->DataFlowTraceForFocusFunction.empty();		return !II->DataFlowTraceForFocusFunction.empty();
});		});
}		}

bool empty() const { return Inputs.empty(); }		bool empty() const { return Inputs.empty(); }
const Unit &operator[] (size_t Idx) const { return Inputs[Idx]->U; }		const Unit &operator[] (size_t Idx) const { return Inputs[Idx]->U; }
InputInfo *AddToCorpus(const Unit &U, size_t NumFeatures, bool MayDeleteFile,		InputInfo *AddToCorpus(const Unit &U, size_t NumFeatures, bool MayDeleteFile,
bool HasFocusFunction,		bool HasFocusFunction, bool SeedInput,
const Vector<uint32_t> &FeatureSet,		const Vector<uint32_t> &FeatureSet,
const DataFlowTrace &DFT, const InputInfo *BaseII) {		const DataFlowTrace &DFT, const InputInfo *BaseII) {
		morehouseUnsubmitted Done Reply Inline Actions Feel free to disregard, but I'd like to protest the length of this parameter list. I would happily review a separate patch that cleaned it up. morehouse: Feel free to disregard, but I'd like to protest the length of this parameter list. I would…
		dokyungsAuthorUnsubmitted Done Reply Inline Actions Thanks! will send a separate patch! dokyungs: Thanks! will send a separate patch!
assert(!U.empty());		assert(!U.empty());
if (FeatureDebug)		if (FeatureDebug)
Printf("ADD_TO_CORPUS %zd NF %zd\n", Inputs.size(), NumFeatures);		Printf("ADD_TO_CORPUS %zd NF %zd\n", Inputs.size(), NumFeatures);
Inputs.push_back(new InputInfo());		Inputs.push_back(new InputInfo());
InputInfo &II = *Inputs.back();		InputInfo &II = *Inputs.back();
II.U = U;		II.U = U;
II.NumFeatures = NumFeatures;		II.NumFeatures = NumFeatures;
		II.SeedInput = SeedInput;
II.MayDeleteFile = MayDeleteFile;		II.MayDeleteFile = MayDeleteFile;
II.UniqFeatureSet = FeatureSet;		II.UniqFeatureSet = FeatureSet;
II.HasFocusFunction = HasFocusFunction;		II.HasFocusFunction = HasFocusFunction;
// Assign maximal energy to the new seed.		// Assign maximal energy to the new seed.
II.Energy = RareFeatures.empty() ? 1.0 : log(RareFeatures.size());		II.Energy = RareFeatures.empty() ? 1.0 : log(RareFeatures.size());
II.SumIncidence = RareFeatures.size();		II.SumIncidence = RareFeatures.size();
II.NeedsEnergyUpdate = false;		II.NeedsEnergyUpdate = false;
std::sort(II.UniqFeatureSet.begin(), II.UniqFeatureSet.end());		std::sort(II.UniqFeatureSet.begin(), II.UniqFeatureSet.end());
▲ Show 20 Lines • Show All 73 Lines • ▼ Show 20 Lines	public:
// Returns an index of random unit from the corpus to mutate.		// Returns an index of random unit from the corpus to mutate.
size_t ChooseUnitIdxToMutate(Random &Rand) {		size_t ChooseUnitIdxToMutate(Random &Rand) {
UpdateCorpusDistribution(Rand);		UpdateCorpusDistribution(Rand);
size_t Idx = static_cast<size_t>(CorpusDistribution(Rand));		size_t Idx = static_cast<size_t>(CorpusDistribution(Rand));
assert(Idx < Inputs.size());		assert(Idx < Inputs.size());
return Idx;		return Idx;
}		}

		InputInfo &ChooseUnitToCrossOverWith(Random &Rand) {
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for function 'ChooseUnitToCrossOverWith' [readability-identifier-naming] not useful Lint: Pre-merge checks: clang-tidy: warning: invalid case style for function 'ChooseUnitToCrossOverWith' [readability…
		hctimUnsubmitted Done Reply Inline Actions nit: `ChooseUnweightedInputToMutate` or something like that. Make it clear that this is a uniform selection, not a bias-weighted selection. hctim: nit: `ChooseUnweightedInputToMutate` or something like that. Make it clear that this is a…
		dokyungsAuthorUnsubmitted Done Reply Inline Actions Will reflect this comment in a separate patch. dokyungs: Will reflect this comment in a separate patch.
		InputInfo &II = *Inputs[Rand(Inputs.size())];
		return II;
		}

void PrintStats() {		void PrintStats() {
for (size_t i = 0; i < Inputs.size(); i++) {		for (size_t i = 0; i < Inputs.size(); i++) {
const auto &II = *Inputs[i];		const auto &II = *Inputs[i];
Printf(" [% 3zd %s] sz: % 5zd runs: % 5zd succ: % 5zd focus: %d\n", i,		Printf(" [% 3zd %s] sz: % 5zd runs: % 5zd succ: % 5zd focus: %d\n", i,
Sha1ToString(II.Sha1).c_str(), II.U.size(),		Sha1ToString(II.Sha1).c_str(), II.U.size(),
II.NumExecutedMutations, II.NumSuccessfullMutations, II.HasFocusFunction);		II.NumExecutedMutations, II.NumSuccessfullMutations, II.HasFocusFunction);
}		}
}		}
▲ Show 20 Lines • Show All 247 Lines • Show Last 20 Lines

compiler-rt/lib/fuzzer/FuzzerDriver.cpp

Show First 20 Lines • Show All 643 Lines • ▼ Show 20 Lines	int FuzzerDriver(int argc, char **argv, UserCallback Callback) {

if (Flags.workers > 0 && Flags.jobs > 0)		if (Flags.workers > 0 && Flags.jobs > 0)
return RunInMultipleProcesses(Args, Flags.workers, Flags.jobs);		return RunInMultipleProcesses(Args, Flags.workers, Flags.jobs);

FuzzingOptions Options;		FuzzingOptions Options;
Options.Verbosity = Flags.verbosity;		Options.Verbosity = Flags.verbosity;
Options.MaxLen = Flags.max_len;		Options.MaxLen = Flags.max_len;
Options.LenControl = Flags.len_control;		Options.LenControl = Flags.len_control;
		Options.KeepSeed = Flags.keep_seed;
Options.UnitTimeoutSec = Flags.timeout;		Options.UnitTimeoutSec = Flags.timeout;
Options.ErrorExitCode = Flags.error_exitcode;		Options.ErrorExitCode = Flags.error_exitcode;
Options.TimeoutExitCode = Flags.timeout_exitcode;		Options.TimeoutExitCode = Flags.timeout_exitcode;
Options.IgnoreTimeouts = Flags.ignore_timeouts;		Options.IgnoreTimeouts = Flags.ignore_timeouts;
Options.IgnoreOOMs = Flags.ignore_ooms;		Options.IgnoreOOMs = Flags.ignore_ooms;
Options.IgnoreCrashes = Flags.ignore_crashes;		Options.IgnoreCrashes = Flags.ignore_crashes;
Options.MaxTotalTimeSec = Flags.max_total_time;		Options.MaxTotalTimeSec = Flags.max_total_time;
Options.DoCrossOver = Flags.cross_over;		Options.DoCrossOver = Flags.cross_over;
		Options.CrossOverUniformDist = Flags.cross_over_uniformdist;
Options.MutateDepth = Flags.mutate_depth;		Options.MutateDepth = Flags.mutate_depth;
Options.ReduceDepth = Flags.reduce_depth;		Options.ReduceDepth = Flags.reduce_depth;
Options.UseCounters = Flags.use_counters;		Options.UseCounters = Flags.use_counters;
Options.UseMemmem = Flags.use_memmem;		Options.UseMemmem = Flags.use_memmem;
Options.UseCmp = Flags.use_cmp;		Options.UseCmp = Flags.use_cmp;
Options.UseValueProfile = Flags.use_value_profile;		Options.UseValueProfile = Flags.use_value_profile;
Options.Shrink = Flags.shrink;		Options.Shrink = Flags.shrink;
Options.ReduceInputs = Flags.reduce_inputs;		Options.ReduceInputs = Flags.reduce_inputs;
▲ Show 20 Lines • Show All 80 Lines • ▼ Show 20 Lines	if (RunIndividualFiles)
ReadCorpora({}, *Inputs));		ReadCorpora({}, *Inputs));
else		else
return CollectDataFlow(Flags.collect_data_flow, Flags.data_flow_trace,		return CollectDataFlow(Flags.collect_data_flow, Flags.data_flow_trace,
ReadCorpora(*Inputs, {}));		ReadCorpora(*Inputs, {}));
}		}

Random Rand(Seed);		Random Rand(Seed);
auto *MD = new MutationDispatcher(Rand, Options);		auto *MD = new MutationDispatcher(Rand, Options);
auto *Corpus = new InputCorpus(Options.OutputCorpus, Entropic);		auto *Corpus =
		new InputCorpus(Options.OutputCorpus, Entropic, Options.KeepSeed);
auto F = new Fuzzer(Callback, Corpus, *MD, Options);		auto F = new Fuzzer(Callback, Corpus, *MD, Options);

for (auto &U: Dictionary)		for (auto &U: Dictionary)
if (U.size() <= Word::GetMaxSize())		if (U.size() <= Word::GetMaxSize())
MD->AddWordToManualDictionary(Word(U.data(), U.size()));		MD->AddWordToManualDictionary(Word(U.data(), U.size()));

// Threads are only supported by Chrome. Don't use them with emscripten		// Threads are only supported by Chrome. Don't use them with emscripten
// for now.		// for now.
▲ Show 20 Lines • Show All 107 Lines • Show Last 20 Lines

compiler-rt/lib/fuzzer/FuzzerFlags.def

Show All 17 Lines	FUZZER_FLAG_INT(max_len, 0, "Maximum length of the test input. "
"and reports it. ")		"and reports it. ")
FUZZER_FLAG_INT(len_control, 100, "Try generating small inputs first, "		FUZZER_FLAG_INT(len_control, 100, "Try generating small inputs first, "
"then try larger inputs over time. Specifies the rate at which the length "		"then try larger inputs over time. Specifies the rate at which the length "
"limit is increased (smaller == faster). If 0, immediately try inputs with "		"limit is increased (smaller == faster). If 0, immediately try inputs with "
"size up to max_len. Default value is 0, if LLVMFuzzerCustomMutator is used.")		"size up to max_len. Default value is 0, if LLVMFuzzerCustomMutator is used.")
FUZZER_FLAG_STRING(seed_inputs, "A comma-separated list of input files "		FUZZER_FLAG_STRING(seed_inputs, "A comma-separated list of input files "
"to use as an additional seed corpus. Alternatively, an \"@\" followed by "		"to use as an additional seed corpus. Alternatively, an \"@\" followed by "
"the name of a file containing the comma-separated list.")		"the name of a file containing the comma-separated list.")
		FUZZER_FLAG_INT(keep_seed, 0, "If 1, keep seed inputs for mutation even if "
		"they do not produce new coverage.")
		hctimUnsubmitted Done Reply Inline Actions `When used with \|reduce_inputs==1\|, the seed inputs will never be reduced.` hctim: `When used with \|reduce_inputs==1\|, the seed inputs will never be reduced.`
FUZZER_FLAG_INT(cross_over, 1, "If 1, cross over inputs.")		FUZZER_FLAG_INT(cross_over, 1, "If 1, cross over inputs.")
		morehouseUnsubmitted Done Reply Inline Actions Please also document the intended use case (i.e. when seeds are not properly formed for the fuzz target but still have useful snippets. morehouse: Please also document the intended use case (i.e. when seeds are not properly formed for the…
		dokyungsAuthorUnsubmitted Done Reply Inline Actions Done! dokyungs: Done!
		FUZZER_FLAG_INT(cross_over_uniformdist, 0, "Experimental. If 1, use a uniform "
		hctimUnsubmitted Done Reply Inline Actions nit: `cross_over_uniform_dist` hctim: nit: `cross_over_uniform_dist`
		dokyungsAuthorUnsubmitted Done Reply Inline Actions Reflect this comment in a separate patch. dokyungs: Reflect this comment in a separate patch.
		"probability distribution when choosing inputs to cross over with.")
FUZZER_FLAG_INT(mutate_depth, 5,		FUZZER_FLAG_INT(mutate_depth, 5,
"Apply this number of consecutive mutations to each input.")		"Apply this number of consecutive mutations to each input.")
FUZZER_FLAG_INT(reduce_depth, 0, "Experimental/internal. "		FUZZER_FLAG_INT(reduce_depth, 0, "Experimental/internal. "
"Reduce depth if mutations lose unique features")		"Reduce depth if mutations lose unique features")
FUZZER_FLAG_INT(shuffle, 1, "Shuffle inputs at startup")		FUZZER_FLAG_INT(shuffle, 1, "Shuffle inputs at startup")
FUZZER_FLAG_INT(prefer_small, 1,		FUZZER_FLAG_INT(prefer_small, 1,
"If 1, always prefer smaller inputs during the corpus shuffle.")		"If 1, always prefer smaller inputs during the corpus shuffle.")
FUZZER_FLAG_INT(		FUZZER_FLAG_INT(
▲ Show 20 Lines • Show All 135 Lines • Show Last 20 Lines

compiler-rt/lib/fuzzer/FuzzerFork.cpp

Show First 20 Lines • Show All 303 Lines • ▼ Show 20 Lines	void FuzzWithFork(Random &Rand, const FuzzingOptions &Options,
MkDir(Env.DFTDir);		MkDir(Env.DFTDir);


if (CorpusDirs.empty())		if (CorpusDirs.empty())
MkDir(Env.MainCorpusDir = DirPlusFile(Env.TempDir, "C"));		MkDir(Env.MainCorpusDir = DirPlusFile(Env.TempDir, "C"));
else		else
Env.MainCorpusDir = CorpusDirs[0];		Env.MainCorpusDir = CorpusDirs[0];

		if (Options.KeepSeed) {
		for (auto &File : SeedFiles)
		Env.Files.push_back(File.File);
		} else {
auto CFPath = DirPlusFile(Env.TempDir, "merge.txt");		auto CFPath = DirPlusFile(Env.TempDir, "merge.txt");
		hctimUnsubmitted Done Reply Inline Actions nit: spaces not tabs (and throughout) hctim: nit: spaces not tabs (and throughout)
		dokyungsAuthorUnsubmitted Done Reply Inline Actions I just checked that these are actually spaces. dokyungs: I just checked that these are actually spaces.
		hctimUnsubmitted Done Reply Inline Actions Ah - I see, it's just the way that phabricator displays that the indentation level was increased. Interesting... hctim: Ah - I see, it's just the way that phabricator displays that the indentation level was…
CrashResistantMerge(Env.Args, {}, SeedFiles, &Env.Files, {}, &Env.Features,		CrashResistantMerge(Env.Args, {}, SeedFiles, &Env.Files, {}, &Env.Features,
{}, &Env.Cov,		{}, &Env.Cov, CFPath, false);
CFPath, false);
RemoveFile(CFPath);		RemoveFile(CFPath);
		}
Printf("INFO: -fork=%d: %zd seed inputs, starting to fuzz in %s\n", NumJobs,		Printf("INFO: -fork=%d: %zd seed inputs, starting to fuzz in %s\n", NumJobs,
Env.Files.size(), Env.TempDir.c_str());		Env.Files.size(), Env.TempDir.c_str());

int ExitCode = 0;		int ExitCode = 0;

JobQueue FuzzQ, MergeQ;		JobQueue FuzzQ, MergeQ;

auto StopJobs = [&]() {		auto StopJobs = [&]() {
▲ Show 20 Lines • Show All 85 Lines • Show Last 20 Lines

compiler-rt/lib/fuzzer/FuzzerInternal.h

Show First 20 Lines • Show All 62 Lines • ▼ Show 20 Lines	public:
static void StaticExitCallback();		static void StaticExitCallback();
static void StaticInterruptCallback();		static void StaticInterruptCallback();
static void StaticFileSizeExceedCallback();		static void StaticFileSizeExceedCallback();
static void StaticGracefulExitCallback();		static void StaticGracefulExitCallback();

void ExecuteCallback(const uint8_t *Data, size_t Size);		void ExecuteCallback(const uint8_t *Data, size_t Size);
bool RunOne(const uint8_t *Data, size_t Size, bool MayDeleteFile = false,		bool RunOne(const uint8_t *Data, size_t Size, bool MayDeleteFile = false,
InputInfo II = nullptr, bool FoundUniqFeatures = nullptr);		InputInfo II = nullptr, bool FoundUniqFeatures = nullptr);

		morehouseUnsubmitted Done Reply Inline Actions Nit: Input params should come before output params morehouse: Nit: Input params should come before output params
		dokyungsAuthorUnsubmitted Done Reply Inline Actions Done. dokyungs: Done.
// Merge Corpora[1:] into Corpora[0].		// Merge Corpora[1:] into Corpora[0].
void Merge(const Vector<std::string> &Corpora);		void Merge(const Vector<std::string> &Corpora);
void CrashResistantMergeInternalStep(const std::string &ControlFilePath);		void CrashResistantMergeInternalStep(const std::string &ControlFilePath);
MutationDispatcher &GetMD() { return MD; }		MutationDispatcher &GetMD() { return MD; }
void PrintFinalStats();		void PrintFinalStats();
void SetMaxInputLen(size_t MaxInputLen);		void SetMaxInputLen(size_t MaxInputLen);
void SetMaxMutationLen(size_t MaxMutationLen);		void SetMaxMutationLen(size_t MaxMutationLen);
void RssLimitCallback();		void RssLimitCallback();
Show All 34 Lines	private:

bool GracefulExitRequested = false;		bool GracefulExitRequested = false;

size_t TotalNumberOfRuns = 0;		size_t TotalNumberOfRuns = 0;
size_t NumberOfNewUnitsAdded = 0;		size_t NumberOfNewUnitsAdded = 0;

size_t LastCorpusUpdateRun = 0;		size_t LastCorpusUpdateRun = 0;

		bool IsExecutingSeedCorpora = false;

bool HasMoreMallocsThanFrees = false;		bool HasMoreMallocsThanFrees = false;
size_t NumberOfLeakDetectionAttempts = 0;		size_t NumberOfLeakDetectionAttempts = 0;

system_clock::time_point LastAllocatorPurgeAttemptTime = system_clock::now();		system_clock::time_point LastAllocatorPurgeAttemptTime = system_clock::now();

UserCallback CB;		UserCallback CB;
InputCorpus &Corpus;		InputCorpus &Corpus;
MutationDispatcher &MD;		MutationDispatcher &MD;
▲ Show 20 Lines • Show All 43 Lines • Show Last 20 Lines

compiler-rt/lib/fuzzer/FuzzerLoop.cpp

Show First 20 Lines • Show All 472 Lines • ▼ Show 20 Lines	bool Fuzzer::RunOne(const uint8_t *Data, size_t Size, bool MayDeleteFile,
UniqFeatureSetTmp.clear();		UniqFeatureSetTmp.clear();
size_t FoundUniqFeaturesOfII = 0;		size_t FoundUniqFeaturesOfII = 0;
size_t NumUpdatesBefore = Corpus.NumFeatureUpdates();		size_t NumUpdatesBefore = Corpus.NumFeatureUpdates();
TPC.CollectFeatures([&](size_t Feature) {		TPC.CollectFeatures([&](size_t Feature) {
if (Corpus.AddFeature(Feature, Size, Options.Shrink))		if (Corpus.AddFeature(Feature, Size, Options.Shrink))
UniqFeatureSetTmp.push_back(Feature);		UniqFeatureSetTmp.push_back(Feature);
if (Options.Entropic)		if (Options.Entropic)
Corpus.UpdateFeatureFrequency(II, Feature);		Corpus.UpdateFeatureFrequency(II, Feature);
if (Options.ReduceInputs && II)		if (Options.ReduceInputs && II && !(Options.KeepSeed && II->SeedInput))
		morehouseUnsubmitted Done Reply Inline Actions I think it's cleaner if we avoid checking `Options.KeepSeed` in this function, and instead check it in the caller. Then we could rename `SeedInput` to `ForceAddToCorpus` for clarity. morehouse: I think it's cleaner if we avoid checking `Options.KeepSeed` in this function, and instead…
		dokyungsAuthorUnsubmitted Done Reply Inline Actions I tried to do this, but wasn't sure the scope of change you were thinking of. Do you want the `SeedInput` member variable of `InputInfo` to be changed to `ForceAddToCorpus` too? We need to keep this information in `InputInfo` to prevent it being reduced to a smaller input here. dokyungs: I tried to do this, but wasn't sure the scope of change you were thinking of. Do you want the…
		morehouseUnsubmitted Done Reply Inline Actions I just meant we could simplify slightly in this function by changing the param to `ForceAddToCorpus` and then unconditionally adding to corpus if its true. We can check the `KeepSeed` flag in the caller to determine if we want to force add the input or not. morehouse: I just meant we could simplify slightly in this function by changing the param to…
		dokyungsAuthorUnsubmitted Done Reply Inline Actions I am not sure if I can simplify this that way, because `SeedInput` needs to propagate; it's stored in the `InputInfo`s of all seed inputs when `InputInfo`s are created by `AddToCorpus`. Storing this in a member variable `SeedInput` of `InputInfo` is needed to prevent them from being replaced. The above line prevents them from being replaced with new, non-seed inputs that's based on the seed input. It seems to me that, if we change the argument `SeedInput` to `ForceAddToCorpus`, `SeedInput` of `InputInfo` needs changing too. Please correct me if I am still misled :) dokyungs: I am not sure if I can simplify this that way, because `SeedInput` needs to propagate; it's…
		morehouseUnsubmitted Done Reply Inline Actions Ok, I think I was confused because we have both `SeedInput` and `II->SeedInput` used in this function. Is there any way we can make this simpler? Having to check 3 things that seem similar (`Options.KeepSeed`, `SeedInput`, `II->SeedInput`) in this single function is tricky. morehouse: Ok, I think I was confused because we have both `SeedInput` and `II->SeedInput` used in this…
		dokyungsAuthorUnsubmitted Done Reply Inline Actions Yes, I share the sense that this is confusing. What do you think of adding a new flag called `reduce_seed_inputs`, and check `!Options.ReduceSeedInputs && II->SeedInput` here rather than `Options.KeepSeed && II->SeedInput`? This is probably not simplifying things since it's more code, but adding a new flag could probably reduce possible confusion because it's more explanatory. dokyungs: Yes, I share the sense that this is confusing. What do you think of adding a new flag called…
		morehouseUnsubmitted Done Reply Inline Actions One idea: Rename `II->KeepSeed` to `II->NeverReduce`, and only set it to true if `Options.KeepSeed == true`. Rename the parameter `KeepSeed` to `ForceAddToCorpus`. This seems more understandable to me, and we don't need to check `Options.KeepSeed` at all in this function. morehouse: One idea: - Rename `II->KeepSeed` to `II->NeverReduce`, and only set it to true if `Options.
		dokyungsAuthorUnsubmitted Not Done Reply Inline Actions Thanks for the idea. This seems a lot more descriptive, which is now reflected in the code. dokyungs: Thanks for the idea. This seems a lot more descriptive, which is now reflected in the code.
if (std::binary_search(II->UniqFeatureSet.begin(),		if (std::binary_search(II->UniqFeatureSet.begin(),
II->UniqFeatureSet.end(), Feature))		II->UniqFeatureSet.end(), Feature))
FoundUniqFeaturesOfII++;		FoundUniqFeaturesOfII++;
});		});
if (FoundUniqFeatures)		if (FoundUniqFeatures)
*FoundUniqFeatures = FoundUniqFeaturesOfII;		*FoundUniqFeatures = FoundUniqFeaturesOfII;
PrintPulseAndReportSlowInput(Data, Size);		PrintPulseAndReportSlowInput(Data, Size);
size_t NumNewFeatures = Corpus.NumFeatureUpdates() - NumUpdatesBefore;		size_t NumNewFeatures = Corpus.NumFeatureUpdates() - NumUpdatesBefore;
if (NumNewFeatures) {		if (NumNewFeatures \|\| (Options.KeepSeed && IsExecutingSeedCorpora)) {
		hctimUnsubmitted Done Reply Inline Actions Can't this be derived by `II->SeedInput` (as elsewhere)? It would be much cleaner to not have to maintain this variable in the class just for initialization. If it can't be derived by that, can you please make it a function paramater (you can even default it to `false` to avoid having to change all the other instances) hctim: Can't this be derived by `II->SeedInput` (as elsewhere)? It would be much cleaner to not have…
TPC.UpdateObservedPCs();		TPC.UpdateObservedPCs();
auto NewII = Corpus.AddToCorpus({Data, Data + Size}, NumNewFeatures,		auto NewII =
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: 'auto NewII' can be declared as 'auto NewII' [llvm-qualified-auto] not useful Lint: Pre-merge checks:* clang-tidy: warning: 'auto NewII' can be declared as 'auto *NewII' [llvm-qualified-auto]…
MayDeleteFile, TPC.ObservedFocusFunction(),		Corpus.AddToCorpus({Data, Data + Size}, NumNewFeatures, MayDeleteFile,
		TPC.ObservedFocusFunction(), IsExecutingSeedCorpora,
UniqFeatureSetTmp, DFT, II);		UniqFeatureSetTmp, DFT, II);
WriteFeatureSetToFile(Options.FeaturesDir, Sha1ToString(NewII->Sha1),		WriteFeatureSetToFile(Options.FeaturesDir, Sha1ToString(NewII->Sha1),
NewII->UniqFeatureSet);		NewII->UniqFeatureSet);
return true;		return true;
}		}
if (II && FoundUniqFeaturesOfII &&		if (II && FoundUniqFeaturesOfII &&
II->DataFlowTraceForFocusFunction.empty() &&		II->DataFlowTraceForFocusFunction.empty() &&
FoundUniqFeaturesOfII == II->UniqFeatureSet.size() &&		FoundUniqFeaturesOfII == II->UniqFeatureSet.size() &&
II->U.size() > Size) {		II->U.size() > Size) {
▲ Show 20 Lines • Show All 156 Lines • ▼ Show 20 Lines	if (EF->__lsan_do_recoverable_leak_check()) { // Leak is found, report it.
_Exit(Options.ErrorExitCode); // not exit() to disable lsan further on.		_Exit(Options.ErrorExitCode); // not exit() to disable lsan further on.
}		}
}		}

void Fuzzer::MutateAndTestOne() {		void Fuzzer::MutateAndTestOne() {
MD.StartMutationSequence();		MD.StartMutationSequence();

auto &II = Corpus.ChooseUnitToMutate(MD.GetRand());		auto &II = Corpus.ChooseUnitToMutate(MD.GetRand());
if (Options.DoCrossOver)		if (Options.DoCrossOver) {
		if (Options.CrossOverUniformDist) {
		MD.SetCrossOverWith(&Corpus.ChooseUnitToCrossOverWith(MD.GetRand()).U);
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - MD.SetCrossOverWith(&Corpus.ChooseUnitToCrossOverWith(MD.GetRand()).U); - } - else { - MD.SetCrossOverWith(&Corpus.ChooseUnitToMutate(MD.GetRand()).U); + MD.SetCrossOverWith(&Corpus.ChooseUnitToCrossOverWith(MD.GetRand()).U); + } else { + MD.SetCrossOverWith(&Corpus.ChooseUnitToMutate(MD.GetRand()).U); Lint: Pre-merge checks: clang-format: please reformat the code ``` - MD.SetCrossOverWith(&Corpus.
		}
		else {
MD.SetCrossOverWith(&Corpus.ChooseUnitToMutate(MD.GetRand()).U);		MD.SetCrossOverWith(&Corpus.ChooseUnitToMutate(MD.GetRand()).U);
		}
		}
const auto &U = II.U;		const auto &U = II.U;
memcpy(BaseSha1, II.Sha1, sizeof(BaseSha1));		memcpy(BaseSha1, II.Sha1, sizeof(BaseSha1));
assert(CurrentUnitData);		assert(CurrentUnitData);
size_t Size = U.size();		size_t Size = U.size();
assert(Size <= MaxInputLen && "Oversized Unit");		assert(Size <= MaxInputLen && "Oversized Unit");
memcpy(CurrentUnitData, U.data(), Size);		memcpy(CurrentUnitData, U.data(), Size);

assert(MaxMutationLen > 0);		assert(MaxMutationLen > 0);
▲ Show 20 Lines • Show All 82 Lines • ▼ Show 20 Lines	if (CorporaFiles.empty()) {
if (Options.ShuffleAtStartUp)		if (Options.ShuffleAtStartUp)
std::shuffle(CorporaFiles.begin(), CorporaFiles.end(), MD.GetRand());		std::shuffle(CorporaFiles.begin(), CorporaFiles.end(), MD.GetRand());

if (Options.PreferSmall) {		if (Options.PreferSmall) {
std::stable_sort(CorporaFiles.begin(), CorporaFiles.end());		std::stable_sort(CorporaFiles.begin(), CorporaFiles.end());
assert(CorporaFiles.front().Size <= CorporaFiles.back().Size);		assert(CorporaFiles.front().Size <= CorporaFiles.back().Size);
}		}

		IsExecutingSeedCorpora = true;

// Load and execute inputs one by one.		// Load and execute inputs one by one.
for (auto &SF : CorporaFiles) {		for (auto &SF : CorporaFiles) {
auto U = FileToVector(SF.File, MaxInputLen, /ExitOnError=/false);		auto U = FileToVector(SF.File, MaxInputLen, /ExitOnError=/false);
assert(U.size() <= MaxInputLen);		assert(U.size() <= MaxInputLen);
RunOne(U.data(), U.size());		RunOne(U.data(), U.size());
		hctimUnsubmitted Done Reply Inline Actions nit: please also give these unnamed constants the `/* ArgName /` treatment hctim:* nit: please also give these unnamed constants the `/* ArgName */` treatment
CheckExitOnSrcPosOrItem();		CheckExitOnSrcPosOrItem();
TryDetectingAMemoryLeak(U.data(), U.size(),		TryDetectingAMemoryLeak(U.data(), U.size(),
/DuringInitialCorpusExecution/ true);		/DuringInitialCorpusExecution/ true);
}		}

		IsExecutingSeedCorpora = false;
}		}

PrintStats("INITED");		PrintStats("INITED");
if (!Options.FocusFunction.empty()) {		if (!Options.FocusFunction.empty()) {
Printf("INFO: %zd/%zd inputs touch the focus function\n",		Printf("INFO: %zd/%zd inputs touch the focus function\n",
Corpus.NumInputsThatTouchFocusFunction(), Corpus.size());		Corpus.NumInputsThatTouchFocusFunction(), Corpus.size());
if (!Options.DataFlowTrace.empty())		if (!Options.DataFlowTrace.empty())
Printf("INFO: %zd/%zd inputs have the Data Flow Trace\n",		Printf("INFO: %zd/%zd inputs have the Data Flow Trace\n",
Corpus.NumInputsWithDataFlowTrace(),		Corpus.NumInputsWithDataFlowTrace(),
Corpus.NumInputsThatTouchFocusFunction());		Corpus.NumInputsThatTouchFocusFunction());
}		}

		Printf("INFO: corpus size = %d\n", Corpus.size());
		hctimUnsubmitted Done Reply Inline Actions nit: `number of corpus inputs = %d\n` (just because it's absolutely 100% clear that it's number-of-units, not number-of-bytes) hctim: nit: `number of corpus inputs = %d\n` (just because it's absolutely 100% clear that it's number…
		morehouseUnsubmitted Done Reply Inline Actions Isn't this information already printed above? morehouse: Isn't this information already printed above?
		dokyungsAuthorUnsubmitted Done Reply Inline Actions Removed. Printing wasn't necessary here. dokyungs: Removed. Printing wasn't necessary here.

if (Corpus.empty() && Options.MaxNumberOfRuns) {		if (Corpus.empty() && Options.MaxNumberOfRuns) {
Printf("ERROR: no interesting inputs were found. "		Printf("ERROR: no interesting inputs were found. "
"Is the code instrumented for coverage? Exiting.\n");		"Is the code instrumented for coverage? Exiting.\n");
exit(1);		exit(1);
}		}
}		}

void Fuzzer::Loop(Vector<SizedFile> &CorporaFiles) {		void Fuzzer::Loop(Vector<SizedFile> &CorporaFiles) {
▲ Show 20 Lines • Show All 79 Lines • Show Last 20 Lines

compiler-rt/lib/fuzzer/FuzzerOptions.h

	Show All 12 Lines
	#include "FuzzerDefs.h"			#include "FuzzerDefs.h"

	namespace fuzzer {			namespace fuzzer {

	struct FuzzingOptions {			struct FuzzingOptions {
	int Verbosity = 1;			int Verbosity = 1;
	size_t MaxLen = 0;			size_t MaxLen = 0;
	size_t LenControl = 1000;			size_t LenControl = 1000;
				bool KeepSeed = false;
	int UnitTimeoutSec = 300;			int UnitTimeoutSec = 300;
	int TimeoutExitCode = 70;			int TimeoutExitCode = 70;
	int OOMExitCode = 71;			int OOMExitCode = 71;
	int InterruptExitCode = 72;			int InterruptExitCode = 72;
	int ErrorExitCode = 77;			int ErrorExitCode = 77;
	bool IgnoreTimeouts = true;			bool IgnoreTimeouts = true;
	bool IgnoreOOMs = true;			bool IgnoreOOMs = true;
	bool IgnoreCrashes = false;			bool IgnoreCrashes = false;
	int MaxTotalTimeSec = 0;			int MaxTotalTimeSec = 0;
	int RssLimitMb = 0;			int RssLimitMb = 0;
	int MallocLimitMb = 0;			int MallocLimitMb = 0;
	bool DoCrossOver = true;			bool DoCrossOver = true;
				bool CrossOverUniformDist = false;
	int MutateDepth = 5;			int MutateDepth = 5;
	bool ReduceDepth = false;			bool ReduceDepth = false;
	bool UseCounters = false;			bool UseCounters = false;
	bool UseMemmem = true;			bool UseMemmem = true;
	bool UseCmp = false;			bool UseCmp = false;
	int UseValueProfile = false;			int UseValueProfile = false;
	bool Shrink = false;			bool Shrink = false;
	bool ReduceInputs = false;			bool ReduceInputs = false;
	▲ Show 20 Lines • Show All 46 Lines • Show Last 20 Lines

compiler-rt/lib/fuzzer/tests/FuzzerUnittest.cpp

Show All 10 Lines

#include "FuzzerCorpus.h"		#include "FuzzerCorpus.h"
#include "FuzzerDictionary.h"		#include "FuzzerDictionary.h"
#include "FuzzerInternal.h"		#include "FuzzerInternal.h"
#include "FuzzerMerge.h"		#include "FuzzerMerge.h"
#include "FuzzerMutate.h"		#include "FuzzerMutate.h"
#include "FuzzerRandom.h"		#include "FuzzerRandom.h"
#include "FuzzerTracePC.h"		#include "FuzzerTracePC.h"
#include "gtest/gtest.h"		#include "gtest/gtest.h"
		Lint: Pre-merge checks Inline Actions clang-tidy: error: 'gtest/gtest.h' file not found [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: 'gtest/gtest.h' file not found [clang-diagnostic-error] [[https://github.
#include <memory>		#include <memory>
#include <set>		#include <set>
#include <sstream>		#include <sstream>

using namespace fuzzer;		using namespace fuzzer;

// For now, have LLVMFuzzerTestOneInput just to make it link.		// For now, have LLVMFuzzerTestOneInput just to make it link.
// Later we may want to make unittests that actually call LLVMFuzzerTestOneInput.		// Later we may want to make unittests that actually call LLVMFuzzerTestOneInput.
▲ Show 20 Lines • Show All 560 Lines • ▼ Show 20 Lines	TEST(FuzzerUtil, Base64) {
EXPECT_EQ("YWJjeHk=", Base64({'a', 'b', 'c', 'x', 'y'}));		EXPECT_EQ("YWJjeHk=", Base64({'a', 'b', 'c', 'x', 'y'}));
EXPECT_EQ("YWJjeHl6", Base64({'a', 'b', 'c', 'x', 'y', 'z'}));		EXPECT_EQ("YWJjeHl6", Base64({'a', 'b', 'c', 'x', 'y', 'z'}));
}		}

TEST(Corpus, Distribution) {		TEST(Corpus, Distribution) {
DataFlowTrace DFT;		DataFlowTrace DFT;
Random Rand(0);		Random Rand(0);
struct EntropicOptions Entropic = {false, 0xFF, 100};		struct EntropicOptions Entropic = {false, 0xFF, 100};
std::unique_ptr<InputCorpus> C(new InputCorpus("", Entropic));		bool KeepSeed = false;
		std::unique_ptr<InputCorpus> C(new InputCorpus("", Entropic, KeepSeed));
		morehouseUnsubmitted Done Reply Inline Actions Side note, feel free to ignore: Now that LLVM uses C++14, all these `unique_ptr` patterns can be simplified with `std::make_unique`: auto C = std::make_unique<InputCorpus>("", Entropic, false); (Separate patch is welcome) morehouse: Side note, feel free to ignore: Now that LLVM uses C++14, all these `unique_ptr` patterns can…
		dokyungsAuthorUnsubmitted Done Reply Inline Actions Thanks for the suggestion! Will send a separate patch. dokyungs: Thanks for the suggestion! Will send a separate patch.
size_t N = 10;		size_t N = 10;
size_t TriesPerUnit = 1<<16;		size_t TriesPerUnit = 1<<16;
for (size_t i = 0; i < N; i++)		for (size_t i = 0; i < N; i++)
C->AddToCorpus(Unit{static_cast<uint8_t>(i)}, 1, false, false, {}, DFT,		C->AddToCorpus(Unit{static_cast<uint8_t>(i)}, 1, false, false, {}, DFT,
nullptr);		nullptr);
		hctimUnsubmitted Done Reply Inline Actions While you're here can you give these the `/* ArgName /` treatment as well, thanks hctim:* While you're here can you give these the `/* ArgName */` treatment as well, thanks

Vector<size_t> Hist(N);		Vector<size_t> Hist(N);
for (size_t i = 0; i < N * TriesPerUnit; i++) {		for (size_t i = 0; i < N * TriesPerUnit; i++) {
Hist[C->ChooseUnitIdxToMutate(Rand)]++;		Hist[C->ChooseUnitIdxToMutate(Rand)]++;
}		}
for (size_t i = 0; i < N; i++) {		for (size_t i = 0; i < N; i++) {
// A weak sanity check that every unit gets invoked.		// A weak sanity check that every unit gets invoked.
EXPECT_GT(Hist[i], TriesPerUnit / N / 3);		EXPECT_GT(Hist[i], TriesPerUnit / N / 3);
▲ Show 20 Lines • Show All 442 Lines • ▼ Show 20 Lines
}		}

TEST(Entropic, UpdateFrequency) {		TEST(Entropic, UpdateFrequency) {
const size_t One = 1, Two = 2;		const size_t One = 1, Two = 2;
const size_t FeatIdx1 = 0, FeatIdx2 = 42, FeatIdx3 = 12, FeatIdx4 = 26;		const size_t FeatIdx1 = 0, FeatIdx2 = 42, FeatIdx3 = 12, FeatIdx4 = 26;
size_t Index;		size_t Index;
// Create input corpus with default entropic configuration		// Create input corpus with default entropic configuration
struct EntropicOptions Entropic = {true, 0xFF, 100};		struct EntropicOptions Entropic = {true, 0xFF, 100};
std::unique_ptr<InputCorpus> C(new InputCorpus("", Entropic));		bool KeepSeed = false;
		std::unique_ptr<InputCorpus> C(new InputCorpus("", Entropic, KeepSeed));
		hctimUnsubmitted Done Reply Inline Actions `std::unique_ptr<InputCorpus> C(new InputCorpus("", Entropic, /* KeepSeed / false));` (and elsewhere) hctim:* `std::unique_ptr<InputCorpus> C(new InputCorpus("", Entropic, /* KeepSeed */ false));` (and…
std::unique_ptr<InputInfo> II(new InputInfo());		std::unique_ptr<InputInfo> II(new InputInfo());

C->AddRareFeature(FeatIdx1);		C->AddRareFeature(FeatIdx1);
C->UpdateFeatureFrequency(II.get(), FeatIdx1);		C->UpdateFeatureFrequency(II.get(), FeatIdx1);
EXPECT_EQ(II->FeatureFreqs.size(), One);		EXPECT_EQ(II->FeatureFreqs.size(), One);
C->AddRareFeature(FeatIdx2);		C->AddRareFeature(FeatIdx2);
C->UpdateFeatureFrequency(II.get(), FeatIdx1);		C->UpdateFeatureFrequency(II.get(), FeatIdx1);
C->UpdateFeatureFrequency(II.get(), FeatIdx2);		C->UpdateFeatureFrequency(II.get(), FeatIdx2);
Show All 20 Lines	double SubAndSquare(double X, double Y) {
double R = X - Y;		double R = X - Y;
R = R * R;		R = R * R;
return R;		return R;
}		}

TEST(Entropic, ComputeEnergy) {		TEST(Entropic, ComputeEnergy) {
const double Precision = 0.01;		const double Precision = 0.01;
struct EntropicOptions Entropic = {true, 0xFF, 100};		struct EntropicOptions Entropic = {true, 0xFF, 100};
std::unique_ptr<InputCorpus> C(new InputCorpus("", Entropic));		bool KeepSeed = false;
		std::unique_ptr<InputCorpus> C(new InputCorpus("", Entropic, KeepSeed));
std::unique_ptr<InputInfo> II(new InputInfo());		std::unique_ptr<InputInfo> II(new InputInfo());
Vector<std::pair<uint32_t, uint16_t>> FeatureFreqs = {{1, 3}, {2, 3}, {3, 3}};		Vector<std::pair<uint32_t, uint16_t>> FeatureFreqs = {{1, 3}, {2, 3}, {3, 3}};
II->FeatureFreqs = FeatureFreqs;		II->FeatureFreqs = FeatureFreqs;
II->NumExecutedMutations = 0;		II->NumExecutedMutations = 0;
II->UpdateEnergy(4);		II->UpdateEnergy(4);
EXPECT_LT(SubAndSquare(II->Energy, 1.450805), Precision);		EXPECT_LT(SubAndSquare(II->Energy, 1.450805), Precision);

II->NumExecutedMutations = 9;		II->NumExecutedMutations = 9;
Show All 14 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[libFuzzer] Add an option to keep initial seed inputs around.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 287779

compiler-rt/lib/fuzzer/FuzzerCorpus.h

compiler-rt/lib/fuzzer/FuzzerDriver.cpp

compiler-rt/lib/fuzzer/FuzzerFlags.def

compiler-rt/lib/fuzzer/FuzzerFork.cpp

compiler-rt/lib/fuzzer/FuzzerInternal.h

compiler-rt/lib/fuzzer/FuzzerLoop.cpp

compiler-rt/lib/fuzzer/FuzzerOptions.h

compiler-rt/lib/fuzzer/tests/FuzzerUnittest.cpp

[libFuzzer] Add an option to keep initial seed inputs around.
ClosedPublic