This is an archive of the discontinued LLVM Phabricator instance.

[RFC] Introduce support for multiple sanitizer blacklists (LLVM side).
Needs Review, Public

Authored by pcc on Jul 16 2014, 2:31 PM.

Summary

This adds support for multiple -fsanitize-blacklist flags, which cause
clang to load a blacklist from each of the specified paths. This could be
useful for sanitizers such as DFSan, where we may have multiple ABI lists,
each describing the ABI of a specific library.

This changes the semantics of the -fsanitize-blacklist flag, so I wanted to
make sure that we were comfortable with that. Now, the flag is additive, so a
-fsanitize-blacklist flag on the command line loads the specified blacklist
as well as the one in the resource directory. The -fno-sanitize-blacklist
flag works as before in that it prevents the blacklist in the resource
directory from being loaded, but it can also be used in combination with
-fsanitize-blacklist to load only user-specified blacklists.
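
As a rough illustration of the semantics described above, the following sketch models which blacklist files would be loaded for a given set of flags. It is not Clang's driver code: `BlacklistArgs`, `blacklistsToLoad`, and the choice to list the resource-directory file first are assumptions made for the example, and it does not model the relative ordering of flags on the command line.

```cpp
#include <string>
#include <vector>

// Hypothetical model of the parsed flags (not a Clang data structure).
struct BlacklistArgs {
  std::vector<std::string> UserPaths; // one per -fsanitize-blacklist=<path>
  bool NoDefault = false;             // true if -fno-sanitize-blacklist was given
};

// Returns the blacklist files that would be loaded under the additive
// semantics: the resource-directory list unless suppressed, plus every
// user-specified list. The ordering is an assumption of this sketch.
std::vector<std::string> blacklistsToLoad(const BlacklistArgs &Args,
                                          const std::string &ResourceDirList) {
  std::vector<std::string> Result;
  if (!Args.NoDefault && !ResourceDirList.empty())
    Result.push_back(ResourceDirList);
  Result.insert(Result.end(), Args.UserPaths.begin(), Args.UserPaths.end());
  return Result;
}
```

With two user paths and no -fno-sanitize-blacklist, this model yields three files; adding -fno-sanitize-blacklist leaves only the two user-specified ones.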

Tests to come. I've removed the one test which fails as a result of the
changed semantics.

Event Timeline

pcc updated this revision to Diff 11537. Jul 16 2014, 2:31 PM

pcc retitled this revision to [RFC] Introduce support for multiple sanitizer blacklists (LLVM side).

pcc updated this object.

pcc edited the test plan for this revision.

pcc added a reviewer: samsonov.

pcc added a subscriber: Unknown Object (MLST).

samsonov added a comment.

I'm fine with the general direction of this patch. Having multiple blacklists for different parts of the code base looks like a legit use case.

I don't like the proliferation of std::unique_ptr across interfaces, though. I would prefer to keep them more lightweight. Can we add one more factory:

llvm::SpecialCaseList::createOrDie(const std::vector<std::string> &Paths);

In this case Clang's BackendUtil won't have to bother with creating a SpecialCaseList - it will just pass CGOpts.SanitizerBlacklistFiles to the DFSan instrumentation pass.
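
A minimal sketch of what such a multi-path factory could look like follows. It uses a stand-in class, `MiniCaseList`, rather than llvm::SpecialCaseList; the return type, the `contains` helper, and the abort-on-first-failure behaviour are assumptions for illustration, since the comment above gives only the parameter list.

```cpp
#include <cstdlib>
#include <fstream>
#include <iostream>
#include <memory>
#include <set>
#include <string>
#include <vector>

// Stand-in for llvm::SpecialCaseList: it only stores raw non-comment lines.
class MiniCaseList {
  std::set<std::string> Lines;

public:
  // Appends the entries of one file to this list. Returns false and sets
  // Error if the file cannot be opened.
  bool loadFromFile(const std::string &Path, std::string &Error) {
    std::ifstream In(Path);
    if (!In) {
      Error = "Can't open file '" + Path + "'";
      return false;
    }
    std::string Line;
    while (std::getline(In, Line))
      if (!Line.empty() && Line[0] != '#')
        Lines.insert(Line);
    return true;
  }

  bool contains(const std::string &Line) const {
    return Lines.count(Line) != 0;
  }

  // Hypothetical factory: merges every file in Paths into a single list and
  // terminates the process on the first failure.
  static std::unique_ptr<MiniCaseList>
  createOrDie(const std::vector<std::string> &Paths) {
    std::unique_ptr<MiniCaseList> SCL(new MiniCaseList());
    std::string Error;
    for (const std::string &Path : Paths) {
      if (!SCL->loadFromFile(Path, Error)) {
        std::cerr << Error << '\n';
        std::abort();
      }
    }
    return SCL;
  }
};
```

With a factory of this shape, a caller would hand over the list of paths and never construct the list object itself, which is the simplification the comment is asking for.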

lib/Support/SpecialCaseList.cpp
135–136	You're now using a vector of Regexps. Either use "|" (as done here) to merge them into a single Regexp, or use push_back everywhere.

pcc added a comment.

> I don't like the proliferation of std::unique_ptr across interfaces, though. I would prefer to keep them more lightweight. Can we add one more factory:
> llvm::SpecialCaseList::createOrDie(const std::vector<std::string> &Paths);
> In this case Clang's BackendUtil won't have to bother with creating a SpecialCaseList - it will just pass CGOpts.SanitizerBlacklistFiles to the DFSan instrumentation pass.

I would prefer not to. It seems to me that passing SpecialCaseLists around would give us a little more flexibility. For example, it would make it easier to accept ad-hoc entries on the command line, or do things like dynamically load ABI lists out of object files.

lib/Support/SpecialCaseList.cpp
135–136	Okay, I'll change this to use one regexp per entry. It doesn't seem like we were getting much benefit out of having a single regexp for all entries, given the poor performance I was seeing before.
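
The "one regexp per entry" approach discussed here can be sketched outside LLVM as follows. This is an illustration only: `MiniEntry` is a hypothetical stand-in for SpecialCaseList::Entry, and it uses std::regex (ECMAScript syntax) in place of llvm::Regex, so pattern syntax and error handling differ from the real class (an invalid pattern here throws std::regex_error).

```cpp
#include <regex>
#include <string>
#include <vector>

// Hypothetical stand-in for SpecialCaseList::Entry.
struct MiniEntry {
  std::vector<std::regex> RegExps; // one compiled regex per list line

  // Translates the list's "*" wildcard to ".*", anchors the pattern so it
  // must match the whole query, and stores it as its own regex.
  void add(std::string Pattern) {
    for (size_t Pos = 0; (Pos = Pattern.find('*', Pos)) != std::string::npos;
         Pos += 2)
      Pattern.replace(Pos, 1, ".*");
    RegExps.emplace_back("^" + Pattern + "$");
  }

  // Returns true if any stored regex matches the query.
  bool match(const std::string &Query) const {
    for (const std::regex &RE : RegExps)
      if (std::regex_search(Query, RE))
        return true;
    return false;
  }
};
```

Because each line is compiled on its own, entries loaded from a second list can simply be appended, with no need to rebuild a merged pipe-separated expression.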

Add tests and address reviewer comments.

Revision Contents

Path	Size
include/llvm/Support/SpecialCaseList.h	25 lines
include/llvm/Transforms/Instrumentation.h	12 lines
lib/Support/SpecialCaseList.cpp	70 lines
lib/Transforms/Instrumentation/DataFlowSanitizer.cpp	21 lines
unittests/Support/SpecialCaseListTest.cpp	76 lines

Diff 12251

include/llvm/Support/SpecialCaseList.h

	[51 lines not shown]

	namespace llvm {			namespace llvm {
	class MemoryBuffer;			class MemoryBuffer;
	class Regex;			class Regex;
	class StringRef;			class StringRef;

	class SpecialCaseList {			class SpecialCaseList {
	public:			public:
	/// Parses the special case list from a file. If Path is empty, returns			/// Parses the special case list from a file. If Path is empty, returns an
	/// an empty special case list. On failure, returns 0 and writes an error			/// empty special case list. On failure, reports a fatal error.
	/// message to string.			static std::unique_ptr<SpecialCaseList> createOrDie(const StringRef Path);
	static SpecialCaseList *create(const StringRef Path, std::string &Error);
	/// Parses the special case list from a memory buffer. On failure, returns
	/// 0 and writes an error message to string.
	static SpecialCaseList *create(const MemoryBuffer *MB, std::string &Error);
	/// Parses the special case list from a file. On failure, reports a fatal
	/// error.
	static SpecialCaseList *createOrDie(const StringRef Path);

				SpecialCaseList();
	~SpecialCaseList();			~SpecialCaseList();

				/// Parses the special case list from a file. Returns true if successful. On
				/// failure, writes an error message to \param Error.
				bool loadFromFile(StringRef Path, std::string &Error);
				/// Parses the special case list from a memory buffer. Returns true if
				/// successful. On failure, writes an error message to \param Error.
				bool loadFromBuffer(const MemoryBuffer *MB, std::string &Error);

	/// Returns true, if special case list contains a line			/// Returns true, if special case list contains a line
	/// \code			/// \code
	/// @Section:<E>=@Category			/// @Section:<E>=@Category
	/// \endcode			/// \endcode
	/// and @Query satisfies a wildcard expression <E>.			/// and @Query satisfies a wildcard expression <E>.
	bool inSection(const StringRef Section, const StringRef Query,			bool inSection(const StringRef Section, const StringRef Query,
	const StringRef Category = StringRef()) const;			const StringRef Category = StringRef()) const;

	private:			private:
	SpecialCaseList(SpecialCaseList const &) LLVM_DELETED_FUNCTION;			SpecialCaseList(SpecialCaseList const &) LLVM_DELETED_FUNCTION;
	SpecialCaseList &operator=(SpecialCaseList const &) LLVM_DELETED_FUNCTION;			SpecialCaseList &operator=(SpecialCaseList const &) LLVM_DELETED_FUNCTION;

	struct Entry;			struct Entry;
	StringMap<StringMap<Entry> > Entries;			StringMap<StringMap<Entry> > Entries;

	SpecialCaseList();
	/// Parses just-constructed SpecialCaseList entries from a memory buffer.
	bool parse(const MemoryBuffer *MB, std::string &Error);
	};			};

	} // namespace llvm			} // namespace llvm

	#endif // LLVM_SUPPORT_SPECIALCASELIST_H			#endif // LLVM_SUPPORT_SPECIALCASELIST_H

include/llvm/Transforms/Instrumentation.h

	[9 lines not shown]
	// This file defines constructor functions for instrumentation passes.			// This file defines constructor functions for instrumentation passes.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef LLVM_TRANSFORMS_INSTRUMENTATION_H			#ifndef LLVM_TRANSFORMS_INSTRUMENTATION_H
	#define LLVM_TRANSFORMS_INSTRUMENTATION_H			#define LLVM_TRANSFORMS_INSTRUMENTATION_H

	#include "llvm/ADT/StringRef.h"			#include "llvm/ADT/StringRef.h"
				#include "llvm/Support/SpecialCaseList.h"
				#include <memory>

	#if defined(__GNUC__) && defined(__linux__) && !defined(ANDROID)			#if defined(__GNUC__) && defined(__linux__) && !defined(ANDROID)
	inline void *getDFSanArgTLSPtrForJIT() {			inline void *getDFSanArgTLSPtrForJIT() {
	extern __thread __attribute__((tls_model("initial-exec")))			extern __thread __attribute__((tls_model("initial-exec")))
	void *__dfsan_arg_tls;			void *__dfsan_arg_tls;
	return (void *)&__dfsan_arg_tls;			return (void *)&__dfsan_arg_tls;
	}			}

	[43 lines not shown]

	// Insert MemorySanitizer instrumentation (detection of uninitialized reads)			// Insert MemorySanitizer instrumentation (detection of uninitialized reads)
	FunctionPass *createMemorySanitizerPass(int TrackOrigins = 0);			FunctionPass *createMemorySanitizerPass(int TrackOrigins = 0);

	// Insert ThreadSanitizer (race detection) instrumentation			// Insert ThreadSanitizer (race detection) instrumentation
	FunctionPass *createThreadSanitizerPass();			FunctionPass *createThreadSanitizerPass();

	// Insert DataFlowSanitizer (dynamic data flow analysis) instrumentation			// Insert DataFlowSanitizer (dynamic data flow analysis) instrumentation
	ModulePass *createDataFlowSanitizerPass(StringRef ABIListFile = StringRef(),			ModulePass *createDataFlowSanitizerPass(std::unique_ptr<SpecialCaseList> ABIL,
	void *(*getArgTLS)() = nullptr,			void *(*getArgTLS)() = nullptr,
	void *(*getRetValTLS)() = nullptr);			void *(*getRetValTLS)() = nullptr);

	#if defined(__GNUC__) && defined(__linux__) && !defined(ANDROID)			#if defined(__GNUC__) && defined(__linux__) && !defined(ANDROID)
	inline ModulePass *createDataFlowSanitizerPassForJIT(StringRef ABIListFile =			inline ModulePass *
	StringRef()) {			createDataFlowSanitizerPassForJIT(std::unique_ptr<SpecialCaseList> ABIList) {
	return createDataFlowSanitizerPass(ABIListFile, getDFSanArgTLSPtrForJIT,			return createDataFlowSanitizerPass(
	getDFSanRetValTLSPtrForJIT);			std::move(ABIList), getDFSanArgTLSPtrForJIT, getDFSanRetValTLSPtrForJIT);
	}			}
	#endif			#endif

	// BoundsChecking - This pass instruments the code to perform run-time bounds			// BoundsChecking - This pass instruments the code to perform run-time bounds
	// checking on loads, stores, and other memory intrinsics.			// checking on loads, stores, and other memory intrinsics.
	FunctionPass *createBoundsCheckingPass();			FunctionPass *createBoundsCheckingPass();

	/// createDebugIRPass - Enable interactive stepping through LLVM IR in LLDB (or			/// createDebugIRPass - Enable interactive stepping through LLVM IR in LLDB (or
	[33 lines not shown]

lib/Support/SpecialCaseList.cpp

[30 lines not shown]
/// Represents a set of regular expressions. Regular expressions which are		/// Represents a set of regular expressions. Regular expressions which are
/// "literal" (i.e. no regex metacharacters) are stored in Strings, while all		/// "literal" (i.e. no regex metacharacters) are stored in Strings, while all
/// others are represented as a single pipe-separated regex in RegEx. The		/// others are represented as a single pipe-separated regex in RegEx. The
/// reason for doing so is efficiency; StringSet is much faster at matching		/// reason for doing so is efficiency; StringSet is much faster at matching
/// literal strings than Regex.		/// literal strings than Regex.
struct SpecialCaseList::Entry {		struct SpecialCaseList::Entry {
Entry() {}		Entry() {}
Entry(Entry &&Other)		Entry(Entry &&Other)
: Strings(std::move(Other.Strings)), RegEx(std::move(Other.RegEx)) {}		: Strings(std::move(Other.Strings)), RegExps(std::move(Other.RegExps)) {}

StringSet<> Strings;		StringSet<> Strings;
std::unique_ptr<Regex> RegEx;		std::vector<std::unique_ptr<Regex>> RegExps;

bool match(StringRef Query) const {		bool match(StringRef Query) const {
return Strings.count(Query) || (RegEx && RegEx->match(Query));		if (Strings.count(Query))
		return true;
		for (auto &&RE : RegExps) {
		if (RE->match(Query))
		return true;
		}
		return false;
}		}
};		};

SpecialCaseList::SpecialCaseList() : Entries() {}		SpecialCaseList::SpecialCaseList() {}

SpecialCaseList *SpecialCaseList::create(		std::unique_ptr<SpecialCaseList>
const StringRef Path, std::string &Error) {		SpecialCaseList::createOrDie(const StringRef Path) {
		auto SCL = make_unique<SpecialCaseList>();
if (Path.empty())		if (Path.empty())
return new SpecialCaseList();		return std::move(SCL);
		std::string Error;
		if (!SCL->loadFromFile(Path, Error))
		report_fatal_error(Error);
		return std::move(SCL);
		}

		bool SpecialCaseList::loadFromFile(StringRef Path, std::string &Error) {
ErrorOr<std::unique_ptr<MemoryBuffer>> FileOrErr =		ErrorOr<std::unique_ptr<MemoryBuffer>> FileOrErr =
MemoryBuffer::getFile(Path);		MemoryBuffer::getFile(Path);
if (std::error_code EC = FileOrErr.getError()) {		if (std::error_code EC = FileOrErr.getError()) {
Error = (Twine("Can't open file '") + Path + "': " + EC.message()).str();		Error = (Twine("Can't open file '") + Path + "': " + EC.message()).str();
return nullptr;		return nullptr;
}		}
return create(FileOrErr.get().get(), Error);		return loadFromBuffer(FileOrErr.get().get(), Error);
}		}

SpecialCaseList *SpecialCaseList::create(		bool SpecialCaseList::loadFromBuffer(const MemoryBuffer *MB, std::string &Error) {
const MemoryBuffer *MB, std::string &Error) {
std::unique_ptr<SpecialCaseList> SCL(new SpecialCaseList());
if (!SCL->parse(MB, Error))
return nullptr;
return SCL.release();
}

SpecialCaseList *SpecialCaseList::createOrDie(const StringRef Path) {
std::string Error;
if (SpecialCaseList *SCL = create(Path, Error))
return SCL;
report_fatal_error(Error);
}

bool SpecialCaseList::parse(const MemoryBuffer *MB, std::string &Error) {
// Iterate through each line in the blacklist file.		// Iterate through each line in the blacklist file.
SmallVector<StringRef, 16> Lines;		SmallVector<StringRef, 16> Lines;
SplitString(MB->getBuffer(), Lines, "\n\r");		SplitString(MB->getBuffer(), Lines, "\n\r");
StringMap<StringMap<std::string> > Regexps;
assert(Entries.empty() &&
"parse() should be called on an empty SpecialCaseList");
int LineNo = 1;		int LineNo = 1;
for (SmallVectorImpl<StringRef>::iterator I = Lines.begin(), E = Lines.end();		for (SmallVectorImpl<StringRef>::iterator I = Lines.begin(), E = Lines.end();
I != E; ++I, ++LineNo) {		I != E; ++I, ++LineNo) {
// Ignore empty lines and lines starting with "#"		// Ignore empty lines and lines starting with "#"
if (I->empty() || I->startswith("#"))		if (I->empty() || I->startswith("#"))
continue;		continue;
// Get our prefix and unparsed regexp.		// Get our prefix and unparsed regexp.
std::pair<StringRef, StringRef> SplitLine = I->split(":");		std::pair<StringRef, StringRef> SplitLine = I->split(":");
[28 lines not shown; still inside the for loop over Lines]
}		}

// Replace * with .*		// Replace * with .*
for (size_t pos = 0; (pos = Regexp.find("*", pos)) != std::string::npos;		for (size_t pos = 0; (pos = Regexp.find("*", pos)) != std::string::npos;
pos += strlen(".*")) {		pos += strlen(".*")) {
Regexp.replace(pos, strlen("*"), ".*");		Regexp.replace(pos, strlen("*"), ".*");
}		}

// Check that the regexp is valid.		auto RE = make_unique<Regex>("^" + Regexp + "$");
Regex CheckRE(Regexp);
std::string REError;		std::string REError;
if (!CheckRE.isValid(REError)) {		// Check that the regexp is valid.
		if (!RE->isValid(REError)) {
Error = (Twine("Malformed regex in line ") + Twine(LineNo) + ": '" +		Error = (Twine("Malformed regex in line ") + Twine(LineNo) + ": '" +
SplitLine.second + "': " + REError).str();		SplitLine.second + "': " + REError).str();
return false;		return false;
}		}

// Add this regexp into the proper group by its prefix.		// Add this regexp into the proper group by its prefix.
if (!Regexps[Prefix][Category].empty())		Entries[Prefix][Category].RegExps.push_back(std::move(RE));
		samsonov (inline comment): You're now using a vector of Regexps. Either use "|" (as done here) to merge them into a single Regexp, or use push_back everywhere.
		pcc (inline comment): Okay, I'll change this to use one regexp per entry. It doesn't seem like we were getting much benefit out of having a single regexp for all entries, given the poor performance I was seeing before.
Regexps[Prefix][Category] += "|";
Regexps[Prefix][Category] += "^" + Regexp + "$";
}		}

// Iterate through each of the prefixes, and create Regexs for them.
for (StringMap<StringMap<std::string> >::const_iterator I = Regexps.begin(),
E = Regexps.end();
I != E; ++I) {
for (StringMap<std::string>::const_iterator II = I->second.begin(),
IE = I->second.end();
II != IE; ++II) {
Entries[I->getKey()][II->getKey()].RegEx.reset(new Regex(II->getValue()));
}
}
return true;		return true;
}		}

SpecialCaseList::~SpecialCaseList() {}		SpecialCaseList::~SpecialCaseList() {}

bool SpecialCaseList::inSection(const StringRef Section, const StringRef Query,		bool SpecialCaseList::inSection(const StringRef Section, const StringRef Query,
const StringRef Category) const {		const StringRef Category) const {
StringMap<StringMap<Entry> >::const_iterator I = Entries.find(Section);		StringMap<StringMap<Entry> >::const_iterator I = Entries.find(Section);
if (I == Entries.end()) return false;		if (I == Entries.end()) return false;
StringMap<Entry>::const_iterator II = I->second.find(Category);		StringMap<Entry>::const_iterator II = I->second.find(Category);
if (II == I->second.end()) return false;		if (II == I->second.end()) return false;

return II->getValue().match(Query);		return II->getValue().match(Query);
}		}

} // namespace llvm		} // namespace llvm

lib/Transforms/Instrumentation/DataFlowSanitizer.cpp

[133 lines not shown; inside StringRef GetGlobalTypeString(const GlobalValue &G)]
}		}
return "<unknown type>";		return "<unknown type>";
}		}

class DFSanABIList {		class DFSanABIList {
std::unique_ptr<SpecialCaseList> SCL;		std::unique_ptr<SpecialCaseList> SCL;

public:		public:
DFSanABIList(SpecialCaseList *SCL) : SCL(SCL) {}		DFSanABIList(std::unique_ptr<SpecialCaseList> SCL) : SCL(std::move(SCL)) {}

/// Returns whether either this function or its source file are listed in the		/// Returns whether either this function or its source file are listed in the
/// given category.		/// given category.
bool isIn(const Function &F, const StringRef Category) const {		bool isIn(const Function &F, const StringRef Category) const {
return isIn(*F.getParent(), Category) ||		return isIn(*F.getParent(), Category) ||
SCL->inSection("fun", F.getName(), Category);		SCL->inSection("fun", F.getName(), Category);
}		}

[103 lines not shown; inside class DataFlowSanitizer : public ModulePass]
WrapperKind getWrapperKind(Function *F);		WrapperKind getWrapperKind(Function *F);
void addGlobalNamePrefix(GlobalValue *GV);		void addGlobalNamePrefix(GlobalValue *GV);
Function *buildWrapperFunction(Function *F, StringRef NewFName,		Function *buildWrapperFunction(Function *F, StringRef NewFName,
GlobalValue::LinkageTypes NewFLink,		GlobalValue::LinkageTypes NewFLink,
FunctionType *NewFT);		FunctionType *NewFT);
Constant *getOrBuildTrampolineFunction(FunctionType *FT, StringRef FName);		Constant *getOrBuildTrampolineFunction(FunctionType *FT, StringRef FName);

public:		public:
DataFlowSanitizer(StringRef ABIListFile = StringRef(),		DataFlowSanitizer(std::unique_ptr<SpecialCaseList> ABIList =
		std::unique_ptr<SpecialCaseList>(),
void *(*getArgTLS)() = nullptr,		void *(*getArgTLS)() = nullptr,
void *(*getRetValTLS)() = nullptr);		void *(*getRetValTLS)() = nullptr);
static char ID;		static char ID;
bool doInitialization(Module &M) override;		bool doInitialization(Module &M) override;
bool runOnModule(Module &M) override;		bool runOnModule(Module &M) override;
};		};

struct DFSanFunction {		struct DFSanFunction {
[70 lines not shown]
};		};

}		}

char DataFlowSanitizer::ID;		char DataFlowSanitizer::ID;
INITIALIZE_PASS(DataFlowSanitizer, "dfsan",		INITIALIZE_PASS(DataFlowSanitizer, "dfsan",
"DataFlowSanitizer: dynamic data flow analysis.", false, false)		"DataFlowSanitizer: dynamic data flow analysis.", false, false)

ModulePass *llvm::createDataFlowSanitizerPass(StringRef ABIListFile,		ModulePass *
		llvm::createDataFlowSanitizerPass(std::unique_ptr<SpecialCaseList> ABIList,
void *(*getArgTLS)(),		void *(*getArgTLS)(),
void *(*getRetValTLS)()) {		void *(*getRetValTLS)()) {
return new DataFlowSanitizer(ABIListFile, getArgTLS, getRetValTLS);		return new DataFlowSanitizer(std::move(ABIList), getArgTLS, getRetValTLS);
}		}

DataFlowSanitizer::DataFlowSanitizer(StringRef ABIListFile,		DataFlowSanitizer::DataFlowSanitizer(std::unique_ptr<SpecialCaseList> ABIList,
void *(*getArgTLS)(),		void *(*getArgTLS)(),
void *(*getRetValTLS)())		void *(*getRetValTLS)())
: ModulePass(ID), GetArgTLSPtr(getArgTLS), GetRetvalTLSPtr(getRetValTLS),		: ModulePass(ID), GetArgTLSPtr(getArgTLS), GetRetvalTLSPtr(getRetValTLS),
ABIList(SpecialCaseList::createOrDie(ABIListFile.empty() ? ClABIListFile		ABIList(ABIList ? std::move(ABIList)
: ABIListFile)) {		: SpecialCaseList::createOrDie(ClABIListFile)) {}
}

FunctionType *DataFlowSanitizer::getArgsFunctionType(FunctionType *T) {		FunctionType *DataFlowSanitizer::getArgsFunctionType(FunctionType *T) {
llvm::SmallVector<Type *, 4> ArgTypes;		llvm::SmallVector<Type *, 4> ArgTypes;
std::copy(T->param_begin(), T->param_end(), std::back_inserter(ArgTypes));		std::copy(T->param_begin(), T->param_end(), std::back_inserter(ArgTypes));
for (unsigned i = 0, e = T->getNumParams(); i != e; ++i)		for (unsigned i = 0, e = T->getNumParams(); i != e; ++i)
ArgTypes.push_back(ShadowTy);		ArgTypes.push_back(ShadowTy);
if (T->isVarArg())		if (T->isVarArg())
ArgTypes.push_back(ShadowPtrTy);		ArgTypes.push_back(ShadowPtrTy);
[1,190 lines not shown]

unittests/Support/SpecialCaseListTest.cpp

	//===- SpecialCaseListTest.cpp - Unit tests for SpecialCaseList -----------===//			//===- SpecialCaseListTest.cpp - Unit tests for SpecialCaseList -----------===//
	//			//
	// The LLVM Compiler Infrastructure			// The LLVM Compiler Infrastructure
	//			//
	// This file is distributed under the University of Illinois Open Source			// This file is distributed under the University of Illinois Open Source
	// License. See LICENSE.TXT for details.			// License. See LICENSE.TXT for details.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

				#include "llvm/ADT/STLExtras.h"
	#include "llvm/Support/MemoryBuffer.h"			#include "llvm/Support/MemoryBuffer.h"
	#include "llvm/Support/SpecialCaseList.h"			#include "llvm/Support/SpecialCaseList.h"
	#include "gtest/gtest.h"			#include "gtest/gtest.h"

	using namespace llvm;			using namespace llvm;

	namespace {			namespace {

	class SpecialCaseListTest : public ::testing::Test {			class SpecialCaseListTest : public ::testing::Test {
	protected:			protected:
	SpecialCaseList *makeSpecialCaseList(StringRef List, std::string &Error) {			bool loadFromStr(SpecialCaseList *SCL, StringRef List, std::string &Error) {
	std::unique_ptr<MemoryBuffer> MB(MemoryBuffer::getMemBuffer(List));			std::unique_ptr<MemoryBuffer> MB(MemoryBuffer::getMemBuffer(List));
	return SpecialCaseList::create(MB.get(), Error);			return SCL->loadFromBuffer(MB.get(), Error);
	}			}

	SpecialCaseList *makeSpecialCaseList(StringRef List) {			std::unique_ptr<SpecialCaseList> makeSpecialCaseList(StringRef List, std::string &Error) {
				auto SCL = make_unique<SpecialCaseList>();
				if (!loadFromStr(SCL.get(), List, Error))
				return nullptr;
				return SCL;
				}

				std::unique_ptr<SpecialCaseList> makeSpecialCaseList(StringRef List) {
	std::string Error;			std::string Error;
	SpecialCaseList *SCL = makeSpecialCaseList(List, Error);			auto SCL = makeSpecialCaseList(List, Error);
	assert(SCL);			assert(SCL);
	assert(Error == "");			assert(Error == "");
	return SCL;			return SCL;
	}			}
	};			};

	TEST_F(SpecialCaseListTest, Basic) {			TEST_F(SpecialCaseListTest, Basic) {
	std::unique_ptr<SpecialCaseList> SCL(			auto SCL =
	makeSpecialCaseList("# This is a comment.\n"			makeSpecialCaseList("# This is a comment.\n"
	"\n"			"\n"
	"src:hello\n"			"src:hello\n"
	"src:bye\n"			"src:bye\n"
	"src:hi=category\n"			"src:hi=category\n"
	"src:z*=category\n"));			"src:z*=category\n");
	EXPECT_TRUE(SCL->inSection("src", "hello"));			EXPECT_TRUE(SCL->inSection("src", "hello"));
	EXPECT_TRUE(SCL->inSection("src", "bye"));			EXPECT_TRUE(SCL->inSection("src", "bye"));
	EXPECT_TRUE(SCL->inSection("src", "hi", "category"));			EXPECT_TRUE(SCL->inSection("src", "hi", "category"));
	EXPECT_TRUE(SCL->inSection("src", "zzzz", "category"));			EXPECT_TRUE(SCL->inSection("src", "zzzz", "category"));
	EXPECT_FALSE(SCL->inSection("src", "hi"));			EXPECT_FALSE(SCL->inSection("src", "hi"));
	EXPECT_FALSE(SCL->inSection("fun", "hello"));			EXPECT_FALSE(SCL->inSection("fun", "hello"));
	EXPECT_FALSE(SCL->inSection("src", "hello", "category"));			EXPECT_FALSE(SCL->inSection("src", "hello", "category"));
	}			}

	TEST_F(SpecialCaseListTest, GlobalInitCompat) {			TEST_F(SpecialCaseListTest, GlobalInitCompat) {
	std::unique_ptr<SpecialCaseList> SCL(			auto SCL = makeSpecialCaseList("global:foo=init\n");
	makeSpecialCaseList("global:foo=init\n"));
	EXPECT_FALSE(SCL->inSection("global", "foo"));			EXPECT_FALSE(SCL->inSection("global", "foo"));
	EXPECT_FALSE(SCL->inSection("global", "bar"));			EXPECT_FALSE(SCL->inSection("global", "bar"));
	EXPECT_TRUE(SCL->inSection("global", "foo", "init"));			EXPECT_TRUE(SCL->inSection("global", "foo", "init"));
	EXPECT_FALSE(SCL->inSection("global", "bar", "init"));			EXPECT_FALSE(SCL->inSection("global", "bar", "init"));

	SCL.reset(makeSpecialCaseList("global-init:foo\n"));			SCL = makeSpecialCaseList("global-init:foo\n");
	EXPECT_FALSE(SCL->inSection("global", "foo"));			EXPECT_FALSE(SCL->inSection("global", "foo"));
	EXPECT_FALSE(SCL->inSection("global", "bar"));			EXPECT_FALSE(SCL->inSection("global", "bar"));
	EXPECT_TRUE(SCL->inSection("global", "foo", "init"));			EXPECT_TRUE(SCL->inSection("global", "foo", "init"));
	EXPECT_FALSE(SCL->inSection("global", "bar", "init"));			EXPECT_FALSE(SCL->inSection("global", "bar", "init"));

	SCL.reset(makeSpecialCaseList("type:t2=init\n"));			SCL = makeSpecialCaseList("type:t2=init\n");
	EXPECT_FALSE(SCL->inSection("type", "t1"));			EXPECT_FALSE(SCL->inSection("type", "t1"));
	EXPECT_FALSE(SCL->inSection("type", "t2"));			EXPECT_FALSE(SCL->inSection("type", "t2"));
	EXPECT_FALSE(SCL->inSection("type", "t1", "init"));			EXPECT_FALSE(SCL->inSection("type", "t1", "init"));
	EXPECT_TRUE(SCL->inSection("type", "t2", "init"));			EXPECT_TRUE(SCL->inSection("type", "t2", "init"));

	SCL.reset(makeSpecialCaseList("global-init-type:t2\n"));			SCL = makeSpecialCaseList("global-init-type:t2\n");
	EXPECT_FALSE(SCL->inSection("type", "t1"));			EXPECT_FALSE(SCL->inSection("type", "t1"));
	EXPECT_FALSE(SCL->inSection("type", "t2"));			EXPECT_FALSE(SCL->inSection("type", "t2"));
	EXPECT_FALSE(SCL->inSection("type", "t1", "init"));			EXPECT_FALSE(SCL->inSection("type", "t1", "init"));
	EXPECT_TRUE(SCL->inSection("type", "t2", "init"));			EXPECT_TRUE(SCL->inSection("type", "t2", "init"));

	SCL.reset(makeSpecialCaseList("src:hello=init\n"));			SCL = makeSpecialCaseList("src:hello=init\n");
	EXPECT_FALSE(SCL->inSection("src", "hello"));			EXPECT_FALSE(SCL->inSection("src", "hello"));
	EXPECT_FALSE(SCL->inSection("src", "bye"));			EXPECT_FALSE(SCL->inSection("src", "bye"));
	EXPECT_TRUE(SCL->inSection("src", "hello", "init"));			EXPECT_TRUE(SCL->inSection("src", "hello", "init"));
	EXPECT_FALSE(SCL->inSection("src", "bye", "init"));			EXPECT_FALSE(SCL->inSection("src", "bye", "init"));

	SCL.reset(makeSpecialCaseList("global-init-src:hello\n"));			SCL = makeSpecialCaseList("global-init-src:hello\n");
	EXPECT_FALSE(SCL->inSection("src", "hello"));			EXPECT_FALSE(SCL->inSection("src", "hello"));
	EXPECT_FALSE(SCL->inSection("src", "bye"));			EXPECT_FALSE(SCL->inSection("src", "bye"));
	EXPECT_TRUE(SCL->inSection("src", "hello", "init"));			EXPECT_TRUE(SCL->inSection("src", "hello", "init"));
	EXPECT_FALSE(SCL->inSection("src", "bye", "init"));			EXPECT_FALSE(SCL->inSection("src", "bye", "init"));
	}			}

	TEST_F(SpecialCaseListTest, Substring) {			TEST_F(SpecialCaseListTest, Substring) {
	std::unique_ptr<SpecialCaseList> SCL(makeSpecialCaseList("src:hello\n"			auto SCL = makeSpecialCaseList("src:hello\n"
	"fun:foo\n"			"fun:foo\n"
	"global:bar\n"));			"global:bar\n");
	EXPECT_FALSE(SCL->inSection("src", "othello"));			EXPECT_FALSE(SCL->inSection("src", "othello"));
	EXPECT_FALSE(SCL->inSection("fun", "tomfoolery"));			EXPECT_FALSE(SCL->inSection("fun", "tomfoolery"));
	EXPECT_FALSE(SCL->inSection("global", "bartender"));			EXPECT_FALSE(SCL->inSection("global", "bartender"));

	SCL.reset(makeSpecialCaseList("fun:foo\n"));			SCL = makeSpecialCaseList("fun:foo\n");
	EXPECT_TRUE(SCL->inSection("fun", "tomfoolery"));			EXPECT_TRUE(SCL->inSection("fun", "tomfoolery"));
	EXPECT_TRUE(SCL->inSection("fun", "foobar"));			EXPECT_TRUE(SCL->inSection("fun", "foobar"));
	}			}

	TEST_F(SpecialCaseListTest, InvalidSpecialCaseList) {			TEST_F(SpecialCaseListTest, InvalidSpecialCaseList) {
	std::string Error;			std::string Error;
	EXPECT_EQ(nullptr, makeSpecialCaseList("badline", Error));			EXPECT_EQ(nullptr, makeSpecialCaseList("badline", Error));
	EXPECT_EQ("Malformed line 1: 'badline'", Error);			EXPECT_EQ("Malformed line 1: 'badline'", Error);
	EXPECT_EQ(nullptr, makeSpecialCaseList("src:bad[a-", Error));			EXPECT_EQ(nullptr, makeSpecialCaseList("src:bad[a-", Error));
	EXPECT_EQ("Malformed regex in line 1: 'bad[a-': invalid character range",			EXPECT_EQ("Malformed regex in line 1: 'bad[a-': invalid character range",
	Error);			Error);
	EXPECT_EQ(nullptr, makeSpecialCaseList("src:a.c\n"			EXPECT_EQ(nullptr, makeSpecialCaseList("src:a.c\n"
	"fun:fun(a\n",			"fun:fun(a\n",
	Error));			Error));
	EXPECT_EQ("Malformed regex in line 2: 'fun(a': parentheses not balanced",			EXPECT_EQ("Malformed regex in line 2: 'fun(a': parentheses not balanced",
	Error);			Error);
	EXPECT_EQ(nullptr, SpecialCaseList::create("unexisting", Error));
				SpecialCaseList SCL;
				EXPECT_FALSE(SCL.loadFromFile("unexisting", Error));
	EXPECT_EQ(0U, Error.find("Can't open file 'unexisting':"));			EXPECT_EQ(0U, Error.find("Can't open file 'unexisting':"));
	}			}

	TEST_F(SpecialCaseListTest, EmptySpecialCaseList) {			TEST_F(SpecialCaseListTest, EmptySpecialCaseList) {
	std::unique_ptr<SpecialCaseList> SCL(makeSpecialCaseList(""));			auto SCL = makeSpecialCaseList("");
	EXPECT_FALSE(SCL->inSection("foo", "bar"));			EXPECT_FALSE(SCL->inSection("foo", "bar"));
	}			}

	}			TEST_F(SpecialCaseListTest, MultipleLists) {
				SpecialCaseList SCL;
				std::string Error;

				EXPECT_FALSE(SCL.inSection("fun", "foo"));
				EXPECT_FALSE(SCL.inSection("fun", "bar"));

				ASSERT_TRUE(loadFromStr(&SCL, "fun:foo\n", Error));

				EXPECT_TRUE(SCL.inSection("fun", "foo"));
				EXPECT_FALSE(SCL.inSection("fun", "bar"));

				ASSERT_TRUE(loadFromStr(&SCL, "fun:bar*\n", Error));

				EXPECT_TRUE(SCL.inSection("fun", "foo"));
				EXPECT_TRUE(SCL.inSection("fun", "bar"));
				EXPECT_TRUE(SCL.inSection("fun", "bartender"));
				EXPECT_FALSE(SCL.inSection("fun", "foobar"));
				EXPECT_FALSE(SCL.inSection("fun", "baz"));

				ASSERT_TRUE(loadFromStr(&SCL, "fun:baz\n", Error));

				EXPECT_TRUE(SCL.inSection("fun", "foo"));
				EXPECT_TRUE(SCL.inSection("fun", "bar"));
				EXPECT_TRUE(SCL.inSection("fun", "bartender"));
				EXPECT_FALSE(SCL.inSection("fun", "foobar"));
				EXPECT_TRUE(SCL.inSection("fun", "baz"));
				}

				}
