This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
compiler-rt/
-
lib/fuzzer/
-
fuzzer/
12/14
FuzzerMutate.cpp
-
test/fuzzer/
-
fuzzer/
3/3
OverwriteBytesTest.cpp
8/9
overwrite-bytes.test

Differential D86358

[libFuzzer] Extend ChangeBinaryInteger mutator to support overwriting selected input with predefined integers.
Needs ReviewPublic

Authored by dokyungs on Aug 21 2020, 10:23 AM.

Download Raw Diff

Details

Reviewers

morehouse
hctim
kcc

Summary

(Experimental - Uploading this to get early feedback before a large-scale experiment.)

This patch extends the ChangeBinaryInteger mutator to support overwriting the selected input with predefined integers. The rationale for this heuristic is that certain byte (word, qword, or qword) overwrite at a specific location (with "magic" integers) in a large input may make an invalid input valid, potentially triggering new neighbor code paths.

Currently, triggering such an overwrite is costly in libFuzzer. ChangeBinaryInteger mutator may do the same, but only with a low probability, because the chosen byte (word, dword, or qword) must already be an integer ranging from -10 to 10.

CopyPart/CrossOver mutator may also effectively do the same, but only if these predefined integers are found in any of the corpus inputs; even if the corpus inputs do contain the predefined integers, the chances are much narrower because a specific location and a specific width have to be selected.

InsertRepeatedBytes combined with EraseBytes mutators (or other combinations of existing mutators) may eventually trigger the desired change, but still the probability is low, as the probabilities of different mutators multiply.

This patch allows to find the desired input in a single mutation (as tested by the accompanying test - overwrite-bytes.test), effectively increasing the probability of finding the desired input given a corpus input.

Diff Detail

Repository: rG LLVM Github Monorepo

Unit TestsFailed

	Time	Test
	1,460 ms	linux > libFuzzer.libFuzzer::print-func.test

Event Timeline

dokyungs created this revision.Aug 21 2020, 10:23 AM

Herald added a project: Restricted Project. · View Herald TranscriptAug 21 2020, 10:23 AM

Herald added a subscriber: Restricted Project. · View Herald Transcript

dokyungs requested review of this revision.Aug 21 2020, 10:23 AM

Harbormaster completed remote builds in B69153: Diff 287061.Aug 21 2020, 11:01 AM

morehouse added inline comments.Aug 21 2020, 5:29 PM

compiler-rt/lib/fuzzer/FuzzerMutate.cpp
463	I think we should add more. What does honggfuzz use?
compiler-rt/test/fuzzer/OverwriteBytesTest.cpp
26	Probably the large input defeats our memcmp hook, but we should be careful that it isn't the reason we can pass this test.
compiler-rt/test/fuzzer/OverwriteBytesTest.h
1 ↗	(On Diff #287061)	Why do we need such a large sequence of bytes, and how did you generate it?
compiler-rt/test/fuzzer/overwrite-bytes.test
3	Can we hard code the magics in the C++?
7	100M runs seems high. How long does it take if we hit that limit?

dokyungs added inline comments.Aug 24 2020, 3:19 PM

compiler-rt/lib/fuzzer/FuzzerMutate.cpp
463	afl uses the following magic values. #define INTERESTING_8 \ -128, /* Overflow signed 8-bit when decremented / \ -1, / / \ 0, / / \ 1, / / \ 16, / One-off with common buffer size / \ 32, / One-off with common buffer size / \ 64, / One-off with common buffer size / \ 100, / One-off with common buffer size / \ 127 / Overflow signed 8-bit when incremented / #define INTERESTING_16 \ -32768, / Overflow signed 16-bit when decremented / \ -129, / Overflow signed 8-bit / \ 128, / Overflow signed 8-bit / \ 255, / Overflow unsig 8-bit when incremented / \ 256, / Overflow unsig 8-bit / \ 512, / One-off with common buffer size / \ 1000, / One-off with common buffer size / \ 1024, / One-off with common buffer size / \ 4096, / One-off with common buffer size / \ 32767 / Overflow signed 16-bit when incremented / #define INTERESTING_32 \ -2147483648LL, / Overflow signed 32-bit when decremented / \ -100663046, / Large negative number (endian-agnostic) / \ -32769, / Overflow signed 16-bit / \ 32768, / Overflow signed 16-bit / \ 65535, / Overflow unsig 16-bit when incremented / \ 65536, / Overflow unsig 16 bit / \ 100663045, / Large positive number (endian-agnostic) / \ 2147483647 / Overflow signed 32-bit when incremented / honggfuzz uses the following values for 8B as well as similar values for 1, 2, 4B too: / 8B - LE */ {"\x00\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x01\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x02\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x03\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x04\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x05\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x06\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x07\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x08\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x09\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x0A\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x0B\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x0C\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x0D\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x0E\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x0F\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x10\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x20\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x40\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x7E\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x7F\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x80\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x81\x00\x00\x00\x00\x00\x00\x00", 8}, {"\xC0\x00\x00\x00\x00\x00\x00\x00", 8}, {"\xFE\x00\x00\x00\x00\x00\x00\x00", 8}, {"\xFF\x00\x00\x00\x00\x00\x00\x00", 8}, {"\xFF\xFF\xFF\xFF\xFF\xFF\xFF\x7E", 8}, {"\xFF\xFF\xFF\xFF\xFF\xFF\xFF\x7F", 8}, {"\x00\x00\x00\x00\x00\x00\x00\x80", 8}, {"\x01\x00\x00\x00\x00\x00\x00\x80", 8}, {"\xFE\xFF\xFF\xFF\xFF\xFF\xFF\xFF", 8}, We can probably start with well-described magic values in afl?
compiler-rt/test/fuzzer/OverwriteBytesTest.cpp
26	Right, we need to make sure that. I attempted to do that by using FileCheck that looks for the mutation sequence printed when the test aborts.
compiler-rt/test/fuzzer/OverwriteBytesTest.h
1 ↗	(On Diff #287061)	This input is the base input used for magic byte mutation that lead to a big subtree that libFuzzer missed in my short experiment. The file was generated by `xxd -i`. We do not need this large sequence of bytes for testing purposes. I will trim it down to a lot more compact version in the next upload.
compiler-rt/test/fuzzer/overwrite-bytes.test
3	Will do in the next upload.
7	I will adjust this value after trimming down the base input.

Add magic values used in AFL.

Harbormaster completed remote builds in B69500: Diff 287761.Aug 25 2020, 1:20 PM

Addressed comments.

dokyungs marked 4 inline comments as done.Aug 25 2020, 1:52 PM

dokyungs added inline comments.

compiler-rt/test/fuzzer/OverwriteBytesTest.cpp
26	I now use memmem here and invoke fuzzing with -use_memmem=0 to eliminate that possibility.
compiler-rt/test/fuzzer/OverwriteBytesTest.h
1 ↗	(On Diff #287061)	Reduced to 36 bytes.
compiler-rt/test/fuzzer/overwrite-bytes.test
3	Done.
7	Reduced to 10M.

clang-format

dokyungs edited the summary of this revision. (Show Details)Aug 25 2020, 1:54 PM

Harbormaster completed remote builds in B69507: Diff 287771.Aug 25 2020, 2:24 PM

Harbormaster completed remote builds in B69508: Diff 287772.Aug 25 2020, 2:29 PM

morehouse added inline comments.Aug 26 2020, 9:49 AM

compiler-rt/lib/fuzzer/FuzzerMutate.cpp
382	Please add a comment saying these are borrowed from AFL.
413	The indentation seems weird. Can we fix it?
441	The templates here seem like overkill. I'd rather have a runtime check if it simplifies the code. For example, a simple function should suffice: template<typename T> static T GetRandomMagic(Random &Rand) { switch (sizeof(T)) { case 1: { constexpr T Magics[] = {INTERESTING_8}; return Magics[Rand(sizeof(Magics) / sizeof(T))]; } case 2: ... } Actually, this probably gets optimized to similar code since `sizeof(T)` is a compile-time constant.
compiler-rt/test/fuzzer/OverwriteBytesTest.h
1 ↗	(On Diff #287061)	Now it's probably small enough to be included in the test itself instead of a header.
compiler-rt/test/fuzzer/overwrite-bytes.test
7	How many runs does it take with and without this mutation?

Addressed comments.

dokyungs marked 2 inline comments as done.Aug 26 2020, 3:14 PM

dokyungs added inline comments.

compiler-rt/lib/fuzzer/FuzzerMutate.cpp
413	This is actually what clang-format gives us. Is it better to manually fix the indentation?
441	It's been simplified now as you suggested. Yes, it's more concise!
compiler-rt/test/fuzzer/overwrite-bytes.test
7	Without this mutation it takes 172,094,572 execs to find it the crash. With this mutation, it takes 531,941 execs.

dokyungs marked an inline comment as done.Aug 26 2020, 3:14 PM

morehouse added inline comments.Aug 26 2020, 3:22 PM

compiler-rt/lib/fuzzer/FuzzerMutate.cpp
413	Yes, I would prefer manual formatting.
423	Can we avoid the memcpy by just returning `SignedVal`, possibly with a cast?
429	There's no breaks, so all of these are fallthroughs...

Addressed comments.

dokyungs marked 3 inline comments as done.Aug 26 2020, 4:05 PM

dokyungs added inline comments.

compiler-rt/lib/fuzzer/FuzzerMutate.cpp
423	I am not sure what cast to use - static_cast or repinterpret_cast? To figure this out I should probably go read the C++ standard. I thought that we want the exact machine representation of signed magic values here. Based on this thought, I thought memcpy could be better here.
429	Thanks. I still make this mistake :(

New data with switch breakthrough fix.

compiler-rt/test/fuzzer/overwrite-bytes.test
7	-mutate_depth=1: With magic mutation: 34,024 Without magic mutation: more than 134,217,728 -mutate_depth=5: With magic mutation: 537,111 Without magic mutation: more than 67,108,864

Harbormaster completed remote builds in B69683: Diff 288120.Aug 26 2020, 4:18 PM

morehouse added inline comments.Aug 26 2020, 4:20 PM

compiler-rt/lib/fuzzer/FuzzerMutate.cpp
423	`static_cast` should work as long as `T` isn't some kind of pointer. In that case we would need `reinterpret_cast`. Assuming `T` is some kind of integer, casting is equivalent to memcpy. So if the compiler lets you, I'd prefer to just assume integer types. We can add a `static_assert(std::is_integral<T>::value)` to ensure this isn't abused in the future.

Harbormaster completed remote builds in B69686: Diff 288128.Aug 26 2020, 4:24 PM

Addressed comment - use static_cast instead of memcpy.

LGTM for an experiment.

compiler-rt/lib/fuzzer/FuzzerMutate.cpp
446	Nit: we can save some lines of code by directly returning instead of assigning to `Val`. That also lets us remove the breaks.

Harbormaster completed remote builds in B69693: Diff 288136.Aug 26 2020, 5:19 PM

Revision Contents

Path

Size

compiler-rt/

lib/

fuzzer/

FuzzerMutate.cpp

75 lines

test/

fuzzer/

OverwriteBytesTest.cpp

81 lines

overwrite-bytes.test

9 lines

Diff 288136

compiler-rt/lib/fuzzer/FuzzerMutate.cpp

//===- FuzzerMutate.cpp - Mutate a test input -----------------------------===//		//===- FuzzerMutate.cpp - Mutate a test input -----------------------------===//
		Lint: Lint Inline Actions clang-format suggested style edits found: Lint: Lint: clang-format suggested style edits found:
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Mutate a test input.		// Mutate a test input.
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
▲ Show 20 Lines • Show All 364 Lines • ▼ Show 20 Lines	for (size_t i = B; i < E; i++) {
size_t Idx = E + B - i - 1;		size_t Idx = E + B - i - 1;
assert(Idx >= B && Idx < E);		assert(Idx >= B && Idx < E);
Data[Idx] = (Val % 10) + '0';		Data[Idx] = (Val % 10) + '0';
Val /= 10;		Val /= 10;
}		}
return Size;		return Size;
}		}

		// The following magic constants were borrowed from AFL.
		morehouseUnsubmitted Done Reply Inline Actions Please add a comment saying these are borrowed from AFL. morehouse: Please add a comment saying these are borrowed from AFL.

		#define INTERESTING_8 \
		-128, /* Overflow signed 8-bit when decremented */ \
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - -128, /* Overflow signed 8-bit when decremented / \ - -1, / / \ - 0, / / \ - 1, / / \ - 16, / One-off with common buffer size / \ - 32, / One-off with common buffer size / \ - 64, / One-off with common buffer size / \ - 100, / One-off with common buffer size / \ - 127 / Overflow signed 8-bit when incremented / + -128, / Overflow signed 8-bit when decremented / \ 8 diff lines are omitted. See full path. Lint: Pre-merge checks:* clang-format: please reformat the code ``` - -128, /* Overflow signed 8-bit when…
		-1, /* */ \
		0, /* */ \
		1, /* */ \
		16, /* One-off with common buffer size */ \
		32, /* One-off with common buffer size */ \
		64, /* One-off with common buffer size */ \
		100, /* One-off with common buffer size */ \
		127 /* Overflow signed 8-bit when incremented */

		#define INTERESTING_16 \
		-32768, /* Overflow signed 16-bit when decremented */ \
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - -32768, /* Overflow signed 16-bit when decremented / \ - -129, / Overflow signed 8-bit / \ - 128, / Overflow signed 8-bit / \ - 255, / Overflow unsig 8-bit when incremented / \ - 256, / Overflow unsig 8-bit / \ - 512, / One-off with common buffer size / \ - 1000, / One-off with common buffer size / \ - 1024, / One-off with common buffer size / \ - 4096, / One-off with common buffer size / \ - 32767 / Overflow signed 16-bit when incremented / 10 diff lines are omitted. See full path. Lint: Pre-merge checks:* clang-format: please reformat the code ``` - -32768, /* Overflow signed 16-bit when…
		-129, /* Overflow signed 8-bit */ \
		128, /* Overflow signed 8-bit */ \
		255, /* Overflow unsig 8-bit when incremented */ \
		256, /* Overflow unsig 8-bit */ \
		512, /* One-off with common buffer size */ \
		1000, /* One-off with common buffer size */ \
		1024, /* One-off with common buffer size */ \
		4096, /* One-off with common buffer size */ \
		32767 /* Overflow signed 16-bit when incremented */

		#define INTERESTING_32 \
		-2147483648LL, /* Overflow signed 32-bit when decremented */ \
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - -2147483648LL, /* Overflow signed 32-bit when decremented / \ - -100663046, / Large negative number (endian-agnostic) / \ - -32769, / Overflow signed 16-bit / \ - 32768, / Overflow signed 16-bit / \ - 65535, / Overflow unsig 16-bit when incremented / \ - 65536, / Overflow unsig 16 bit / \ - 100663045, / Large positive number (endian-agnostic) / \ - 2147483647 / Overflow signed 32-bit when incremented / + -2147483648LL, / Overflow signed 32-bit when decremented / \ + -100663046, / Large negative number (endian-agnostic) / \ 6 diff lines are omitted. See full path. Lint: Pre-merge checks:* clang-format: please reformat the code ``` - -2147483648LL, /* Overflow signed 32-bit when…
		-100663046, /* Large negative number (endian-agnostic) */ \
		-32769, /* Overflow signed 16-bit */ \
		32768, /* Overflow signed 16-bit */ \
		65535, /* Overflow unsig 16-bit when incremented */ \
		65536, /* Overflow unsig 16 bit */ \
		morehouseUnsubmitted Done Reply Inline Actions The indentation seems weird. Can we fix it? morehouse: The indentation seems weird. Can we fix it?
		dokyungsAuthorUnsubmitted Done Reply Inline Actions This is actually what clang-format gives us. Is it better to manually fix the indentation? dokyungs: This is actually what clang-format gives us. Is it better to manually fix the indentation?
		morehouseUnsubmitted Done Reply Inline Actions Yes, I would prefer manual formatting. morehouse: Yes, I would prefer manual formatting.
		100663045, /* Large positive number (endian-agnostic) */ \
		2147483647 /* Overflow signed 32-bit when incremented */

		template <class T> static T GetRandomMagic(Random &Rand) {
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for function 'GetRandomMagic' [readability-identifier-naming] not useful Lint: Pre-merge checks: clang-tidy: warning: invalid case style for function 'GetRandomMagic' [readability-identifier…
		static_assert(std::is_integral<T>::value, "Invalid type for magic values");

		T Val = 0;
		switch (sizeof(T)) {
		case 1: {
		constexpr int8_t Magics[] = {INTERESTING_8};
		morehouseUnsubmitted Not Done Reply Inline Actions Can we avoid the memcpy by just returning `SignedVal`, possibly with a cast? morehouse: Can we avoid the memcpy by just returning `SignedVal`, possibly with a cast?
		dokyungsAuthorUnsubmitted Done Reply Inline Actions I am not sure what cast to use - static_cast or repinterpret_cast? To figure this out I should probably go read the C++ standard. I thought that we want the exact machine representation of signed magic values here. Based on this thought, I thought memcpy could be better here. dokyungs: I am not sure what cast to use - static_cast or repinterpret_cast? To figure this out I should…
		morehouseUnsubmitted Not Done Reply Inline Actions `static_cast` should work as long as `T` isn't some kind of pointer. In that case we would need `reinterpret_cast`. Assuming `T` is some kind of integer, casting is equivalent to memcpy. So if the compiler lets you, I'd prefer to just assume integer types. We can add a `static_assert(std::is_integral<T>::value)` to ensure this isn't abused in the future. morehouse: `static_cast` should work as long as `T` isn't some kind of pointer. In that case we would…
		int8_t SignedVal = Magics[Rand(sizeof(Magics) / sizeof(int8_t))];
		Val = static_cast<T>(SignedVal);
		break;
		}
		case 2: {
		constexpr int16_t Magics[] = {INTERESTING_8, INTERESTING_16};
		morehouseUnsubmitted Done Reply Inline Actions There's no breaks, so all of these are fallthroughs... morehouse: There's no breaks, so all of these are fallthroughs...
		dokyungsAuthorUnsubmitted Done Reply Inline Actions Thanks. I still make this mistake :( dokyungs: Thanks. I still make this mistake :(
		int16_t SignedVal = Magics[Rand(sizeof(Magics) / sizeof(int16_t))];
		Val = static_cast<T>(SignedVal);
		break;
		}
		case 4: {
		constexpr int32_t Magics[] = {INTERESTING_8, INTERESTING_16,
		INTERESTING_32};
		int32_t SignedVal = Magics[Rand(sizeof(Magics) / sizeof(int32_t))];
		Val = static_cast<T>(SignedVal);
		break;
		}
		case 8: {
		morehouseUnsubmitted Done Reply Inline Actions The templates here seem like overkill. I'd rather have a runtime check if it simplifies the code. For example, a simple function should suffice: template<typename T> static T GetRandomMagic(Random &Rand) { switch (sizeof(T)) { case 1: { constexpr T Magics[] = {INTERESTING_8}; return Magics[Rand(sizeof(Magics) / sizeof(T))]; } case 2: ... } Actually, this probably gets optimized to similar code since `sizeof(T)` is a compile-time constant. morehouse: The templates here seem like overkill. I'd rather have a runtime check if it simplifies the…
		dokyungsAuthorUnsubmitted Done Reply Inline Actions It's been simplified now as you suggested. Yes, it's more concise! dokyungs: It's been simplified now as you suggested. Yes, it's more concise!
		constexpr int64_t Magics[] = {INTERESTING_8, INTERESTING_16,
		INTERESTING_32};
		int64_t SignedVal = Magics[Rand(sizeof(Magics) / sizeof(int64_t))];
		Val = static_cast<T>(SignedVal);
		break;
		morehouseUnsubmitted Not Done Reply Inline Actions Nit: we can save some lines of code by directly returning instead of assigning to `Val`. That also lets us remove the breaks. morehouse: Nit: we can save some lines of code by directly returning instead of assigning to `Val`. That…
		}
		}
		return Val;
		}

template<class T>		template<class T>
size_t ChangeBinaryInteger(uint8_t *Data, size_t Size, Random &Rand) {		size_t ChangeBinaryInteger(uint8_t *Data, size_t Size, Random &Rand) {
if (Size < sizeof(T)) return 0;		if (Size < sizeof(T)) return 0;
size_t Off = Rand(Size - sizeof(T) + 1);		size_t Off = Rand(Size - sizeof(T) + 1);
assert(Off + sizeof(T) <= Size);		assert(Off + sizeof(T) <= Size);
T Val;		T Val;
if (Off < 64 && !Rand(4)) {		if (Off < 64 && !Rand(4)) {
Val = Size;		Val = Size;
if (Rand.RandBool())		if (Rand.RandBool())
Val = Bswap(Val);		Val = Bswap(Val);
		} else if (Rand.RandBool()) {
		Val = GetRandomMagic<T>(Rand);
		morehouseUnsubmitted Not Done Reply Inline Actions I think we should add more. What does honggfuzz use? morehouse: I think we should add more. What does honggfuzz use?
		dokyungsAuthorUnsubmitted Done Reply Inline Actions afl uses the following magic values. #define INTERESTING_8 \ -128, /* Overflow signed 8-bit when decremented / \ -1, / / \ 0, / / \ 1, / / \ 16, / One-off with common buffer size / \ 32, / One-off with common buffer size / \ 64, / One-off with common buffer size / \ 100, / One-off with common buffer size / \ 127 / Overflow signed 8-bit when incremented / #define INTERESTING_16 \ -32768, / Overflow signed 16-bit when decremented / \ -129, / Overflow signed 8-bit / \ 128, / Overflow signed 8-bit / \ 255, / Overflow unsig 8-bit when incremented / \ 256, / Overflow unsig 8-bit / \ 512, / One-off with common buffer size / \ 1000, / One-off with common buffer size / \ 1024, / One-off with common buffer size / \ 4096, / One-off with common buffer size / \ 32767 / Overflow signed 16-bit when incremented / #define INTERESTING_32 \ -2147483648LL, / Overflow signed 32-bit when decremented / \ -100663046, / Large negative number (endian-agnostic) / \ -32769, / Overflow signed 16-bit / \ 32768, / Overflow signed 16-bit / \ 65535, / Overflow unsig 16-bit when incremented / \ 65536, / Overflow unsig 16 bit / \ 100663045, / Large positive number (endian-agnostic) / \ 2147483647 / Overflow signed 32-bit when incremented / honggfuzz uses the following values for 8B as well as similar values for 1, 2, 4B too: / 8B - LE / {"\x00\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x01\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x02\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x03\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x04\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x05\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x06\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x07\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x08\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x09\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x0A\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x0B\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x0C\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x0D\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x0E\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x0F\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x10\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x20\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x40\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x7E\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x7F\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x80\x00\x00\x00\x00\x00\x00\x00", 8}, {"\x81\x00\x00\x00\x00\x00\x00\x00", 8}, {"\xC0\x00\x00\x00\x00\x00\x00\x00", 8}, {"\xFE\x00\x00\x00\x00\x00\x00\x00", 8}, {"\xFF\x00\x00\x00\x00\x00\x00\x00", 8}, {"\xFF\xFF\xFF\xFF\xFF\xFF\xFF\x7E", 8}, {"\xFF\xFF\xFF\xFF\xFF\xFF\xFF\x7F", 8}, {"\x00\x00\x00\x00\x00\x00\x00\x80", 8}, {"\x01\x00\x00\x00\x00\x00\x00\x80", 8}, {"\xFE\xFF\xFF\xFF\xFF\xFF\xFF\xFF", 8}, We can probably start with well-described magic values in afl? dokyungs:* afl uses the following magic values. ``` #define INTERESTING_8 \ -128, /* Overflow…
		if (Rand.RandBool()) {
		Val = Bswap(Val);
		}
} else {		} else {
memcpy(&Val, Data + Off, sizeof(Val));		memcpy(&Val, Data + Off, sizeof(Val));
T Add = Rand(21);		T Add = Rand(21);
Add -= 10;		Add -= 10;
if (Rand.RandBool())		if (Rand.RandBool())
Val = Bswap(T(Bswap(Val) + Add)); // Add assuming different endiannes.		Val = Bswap(T(Bswap(Val) + Add)); // Add assuming different endiannes.
else		else
Val = Val + Add; // Add assuming current endiannes.		Val = Val + Add; // Add assuming current endiannes.
▲ Show 20 Lines • Show All 163 Lines • Show Last 20 Lines

compiler-rt/test/fuzzer/OverwriteBytesTest.cpp

This file was added.

				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception

				#include <cassert>
				#include <cstdint>
				#include <cstdio>
				#include <cstdlib>
				#include <cstring>

				#include <algorithm>
				#include <vector>

				uint8_t SeedInput[] = {
				0xba,
				0xe3,
				0x92,
				0x7c,
				0x80,
				0x86,
				0x73,
				0x0f,
				0xf2,
				0x83,
				0x23,
				0x0f,
				morehouseUnsubmitted Done Reply Inline Actions Probably the large input defeats our memcmp hook, but we should be careful that it isn't the reason we can pass this test. morehouse: Probably the large input defeats our memcmp hook, but we should be careful that it isn't the…
				dokyungsAuthorUnsubmitted Done Reply Inline Actions Right, we need to make sure that. I attempted to do that by using FileCheck that looks for the mutation sequence printed when the test aborts. dokyungs: Right, we need to make sure that. I attempted to do that by using FileCheck that looks for the…
				dokyungsAuthorUnsubmitted Done Reply Inline Actions I now use memmem here and invoke fuzzing with -use_memmem=0 to eliminate that possibility. dokyungs: I now use memmem here and invoke fuzzing with -use_memmem=0 to eliminate that possibility.
				0xf5,
				0x17,
				0x4c,
				0x08,
				0xf2,
				0x83,
				0x23,
				0x0f,
				0xd8,
				0x71,
				0x58,
				0x1c,
				0xb9,
				0x8d,
				0xf1,
				0x0e,
				0x80,
				0x86,
				0x73,
				0x0f,
				0xf0,
				0x83,
				0x23,
				0x0f,
				};

				#define MAGIC_BYTE_VALUE 0x1
				#define MAGIC_BYTE_OFFSET 0xf

				static volatile int *Nil = nullptr;

				#ifdef PRINT_SEED_INPUT

				int main(int argc, char **argv) {
				fwrite(SeedInput, sizeof(SeedInput[0]), sizeof(SeedInput), stdout);
				return 0;
				}

				#else

				extern "C" int LLVMFuzzerTestOneInput(const uint8_t *Data, size_t Size) {
				if (Size != sizeof(SeedInput)) {
				return 0;
				}

				(uint64_t )(SeedInput + MAGIC_BYTE_OFFSET) = MAGIC_BYTE_VALUE;

				if (memmem(Data, Size, SeedInput, Size) == Data) {
				*Nil = 42; // crash.
				}

				return 0;
				}

				#endif

compiler-rt/test/fuzzer/overwrite-bytes.test

This file was added.

				REQUIRES: linux, x86_64
				RUN: %cpp_compiler %S/OverwriteBytesTest.cpp -o %t-OverwriteBytesTest
				RUN: %cpp_compiler -fno-sanitize=fuzzer -DPRINT_SEED_INPUT=1 %S/OverwriteBytesTest.cpp -o %t-OverwriteBytesTestPrintSeed
				morehouseUnsubmitted Done Reply Inline Actions Can we hard code the magics in the C++? morehouse: Can we hard code the magics in the C++?
				dokyungsAuthorUnsubmitted Done Reply Inline Actions Will do in the next upload. dokyungs: Will do in the next upload.
				dokyungsAuthorUnsubmitted Done Reply Inline Actions Done. dokyungs: Done.
				RUN: %t-OverwriteBytesTestPrintSeed > %t-OverwriteBytesTest.seed

				RUN: not %run %t-OverwriteBytesTest -seed=1 -use_memmem=0 -mutate_depth=1 -reduce_inputs=0 -runs=10000000 -seed_inputs=%t-OverwriteBytesTest.seed 2>&1 \| FileCheck %s

				morehouseUnsubmitted Done Reply Inline Actions 100M runs seems high. How long does it take if we hit that limit? morehouse: 100M runs seems high. How long does it take if we hit that limit?
				dokyungsAuthorUnsubmitted Done Reply Inline Actions I will adjust this value after trimming down the base input. dokyungs: I will adjust this value after trimming down the base input.
				dokyungsAuthorUnsubmitted Done Reply Inline Actions Reduced to 10M. dokyungs: Reduced to 10M.
				morehouseUnsubmitted Not Done Reply Inline Actions How many runs does it take with and without this mutation? morehouse: How many runs does it take with and without this mutation?
				dokyungsAuthorUnsubmitted Done Reply Inline Actions Without this mutation it takes 172,094,572 execs to find it the crash. With this mutation, it takes 531,941 execs. dokyungs: Without this mutation it takes 172,094,572 execs to find it the crash. With this mutation, it…
				dokyungsAuthorUnsubmitted Done Reply Inline Actions -mutate_depth=1: With magic mutation: 34,024 Without magic mutation: more than 134,217,728 -mutate_depth=5: With magic mutation: 537,111 Without magic mutation: more than 67,108,864 dokyungs: -mutate_depth=1: With magic mutation: 34,024 Without magic mutation: more than 134,217,728…
				CHECK: ABORTING
				CHECK-NEXT: MS: 1 ChangeBinInt-;

This is an archive of the discontinued LLVM Phabricator instance.

[libFuzzer] Extend ChangeBinaryInteger mutator to support overwriting selected input with predefined integers.Needs ReviewPublic

Details

Diff Detail

Unit TestsFailed

Event Timeline

Revision Contents

Diff 288136

compiler-rt/lib/fuzzer/FuzzerMutate.cpp

compiler-rt/test/fuzzer/OverwriteBytesTest.cpp

compiler-rt/test/fuzzer/overwrite-bytes.test

[libFuzzer] Extend ChangeBinaryInteger mutator to support overwriting selected input with predefined integers.
Needs ReviewPublic