This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
compiler-rt/test/msan/
-
test/
-
msan/
-
getaddrinfo.cpp
-
qsort.cpp
-
llvm/
-
lib/Transforms/Instrumentation/
-
Transforms/
-
Instrumentation/
11/15
MemorySanitizer.cpp
-
test/Instrumentation/MemorySanitizer/
-
Instrumentation/
-
MemorySanitizer/
-
alloca-store.ll
-
alloca.ll
-
msan_x86_bts_asm.ll

Differential D83595

[Draft][MSAN] Optimize away poisoning allocas that are always written before load
Changes PlannedPublic

Authored by vitalybuka on Jul 10 2020, 4:10 PM.

Download Raw Diff

Details

Reviewers

eugenis
guiand

Summary

If we know that every path from an alloca leads to a store, we can optimize away poisoning its shadow (it'll be overwritten anyway).

I'm wondering if there's a better approach for finding all these def-uses. I investigated the DominatorTree, but I'm not sure how I would use that to check *all* the uses without blowing up the runtime for instrumenting an alloca to O(N^2) (N=# uses) or so.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

guiand created this revision.Jul 10 2020, 4:10 PM

Herald added a project: Restricted Project. · View Herald TranscriptJul 10 2020, 4:10 PM

Herald added subscribers: llvm-commits, hiraditya. · View Herald Transcript

Perhaps MemSSA-based DCE can be taught about it?

guiand marked an inline comment as done.Jul 10 2020, 4:14 PM

guiand added inline comments.

llvm/lib/Transforms/Instrumentation/MemorySanitizer.cpp
3848	TODO: get rid of `Const` in this function name

vitalybuka added inline comments.Jul 10 2020, 4:18 PM

llvm/lib/Transforms/Instrumentation/MemorySanitizer.cpp
3871	Store may write only part of alloca. How useful (binary size) the patch as is?

Allocas are often used through bitcast or GEP, we should handle them as well.

Harbormaster failed remote builds in B63823: Diff 277159!Jul 10 2020, 4:39 PM

I figured if it's using GEP then it's likely not going to be storing to the entire shadow. Bitcast is overwhelmingly common though (especially bitcast->lifetime.start, which I realize I need to handle).

In D83595#2145447, @guiand wrote:

I figured if it's using GEP then it's likely not going to be storing to the entire shadow. Bitcast is overwhelmingly common though (especially bitcast->lifetime.start, which I realize I need to handle).

Right. From what I've seen, small aggregates on stack are quite common as well, and they are always initialized piecemeal. This can be seen as an improvement of this change, but it would be a pretty bit change to the algorithm, so consider including that from the start.

In D83595#2145347, @lebedev.ri wrote:

Perhaps MemSSA-based DCE can be taught about it?

That's an option, but it could be harder to pull off when poisoning is outlined. The code will look something like this:

%p = alloca i32
call __msan_poison_and_set_origin(%p, 4)
...
%s_p = inttoptr(xor(ptrtoint(%p), 0x50..00)))  ; shadow address for %p
store i32 zeroinitializer, %s_p
store i32 <value>, %p

To eliminate the dead call to __msan_poison, DCE would need to know the shadow mapping.

I've expanded this to be able to find chained uses like through a bitcast.

Mostly a proof-of-concept; this still needs a check to make sure the final store is over the whole size of the initial type. And, of course, the main issue is that this really slows down compilation. So I think using MemSSA might be necessary for performance reasons, unless there's some algorithm tweak I can make here that would reduce the complexity.

Herald added a project: Restricted Project. · View Herald TranscriptJul 13 2020, 11:02 AM

Herald added a subscriber: Restricted Project. · View Herald Transcript

Harbormaster failed remote builds in B64005: Diff 277491!Jul 13 2020, 11:02 AM

If we know that every path from an alloca leads to a store, we can optimize away poisoning its shadow (it'll be overwritten anyway).

! In D83595#2145347, @lebedev.ri wrote:

Perhaps MemSSA-based DCE can be taught about it?

MemorySSA-based DSE has similar logic (check that a store is either overwritten or never read on all paths to a function exit), but it uses a MemorySSA traversal to do so. I don't think alloca are modeled directly in MemorySSA though, so the approach cannot be directly applied I think. It might be possible to add a pseudo-memorydef writing to the alloca directly after its definition. Then the MemorySSA based logic may apply.

Added ability to detect piecemeal initialization.

Harbormaster failed remote builds in B64190: Diff 277900!Jul 14 2020, 10:55 AM

This actually has very significant effects on some, but not all, benchmarks.

Running grep I observed ~8% decrease in binary with this patch. But clang sees little effect: <1%.

On a few different benchmarks from a benchmark suite, this also decreased runtime overhead by a significant amount: 8% for sha512, and 13% for qsort.

In D83595#2185554, @guiand wrote:

This actually has very significant effects on some, but not all, benchmarks.

Running grep I observed ~8% decrease in binary with this patch. But clang sees little effect: <1%.

On a few different benchmarks from a benchmark suite, this also decreased runtime overhead by a significant amount: 8% for sha512, and 13% for qsort.

This sounds worthwhile.

llvm/lib/Transforms/Instrumentation/MemorySanitizer.cpp
3940	I don't think this is entirely correct. Consider this: entry { alloca 32; store 0..16 } -> A, B A { store 16..32 } -> B B { load } traversing entry->A->B will mark B as done even though there is a path where control flow arrives at B with alloca not entirely initialized. What happens to the benchmarks if this algorithm is limited to a single basic block?

guiand added inline comments.Jul 30 2020, 3:15 PM

llvm/lib/Transforms/Instrumentation/MemorySanitizer.cpp
3940	I tried to write the code so that it would look to see if all paths are initialized before a load. So in your example, it would look at the path `entry -> A -> B`, and it would see that it's fully stored, and it would continue looking at the next path. For `entry -> B`, `if (!firstUsesAreStore) return false` would come into effect, and we won't optimize away the poisoning.

guiand added inline comments.Jul 30 2020, 3:21 PM

llvm/lib/Transforms/Instrumentation/MemorySanitizer.cpp
3940	Oh, I see where the issue is here: the `if TraversedSet.count` check. I needed some way to prevent cycles in the graph, so that was the first thing I reached for, but you're right, this approach breaks the algorithm.

eugenis added inline comments.Jul 30 2020, 3:31 PM

llvm/lib/Transforms/Instrumentation/MemorySanitizer.cpp
3940	Right. And it does not even require a cycle. This is a dataflow problem - assign labels to graph nodes showing which bytes are always initialized at entry, and update them based on predecessor's labels until the fixed point is reached. But in this case, I suspect, most of the optimization opportunities are within a single BB.

guiand added inline comments.Jul 30 2020, 3:47 PM

llvm/lib/Transforms/Instrumentation/MemorySanitizer.cpp
3940	I think this could be fixed by removing the target node from the set after the recursive call is finished. But I'll measure the impact of removing the inter-block stuff entirely to see how much value it adds.

I've cut it down to only within a basic block as @eugenis and @vitalybuka suggested.

Harbormaster completed remote builds in B66488: Diff 282099.Jul 30 2020, 7:16 PM

In general, this implementation looks pretty complex and easy to get wrong. I'd prefer something along the lines of AArch64StackTagging::collectInitializers - directly calculate the offset for each store/load instruction. It might do some extra work with unrelated memory instructions, but probably not too much.

How do you handle this case?

a = alloca
b = bitcast a
lifetime_start b
store b

When scanning from lifetime_start, this code will never encounter any direct use of a, and would miss the transitive use.

Missing diff context.

llvm/lib/Transforms/Instrumentation/MemorySanitizer.cpp
3895	prefer early exits / continue(s)
3915	This function is never called with StoreOffs != 0, which seems necessary to handle a chain of GEPs.
3920	Only if same BB.

Flattened some control flow, updated to properly use StoreOffs, and updated tests to cover chained GEPs

In D83595#2188021, @eugenis wrote:

In general, this implementation looks pretty complex and easy to get wrong. I'd prefer something along the lines of AArch64StackTagging::collectInitializers - directly calculate the offset for each store/load instruction. It might do some extra work with unrelated memory instructions, but probably not too much.

I'll take a look at collectInitializers. As for the current implementation -- yeah, I always figured there would be a better way. But I tried to be pretty conservative with how I implemented it, so while we might miss some stores, we should never "forget" to poison an alloca.

How do you handle this case?
a = alloca
b = bitcast a
lifetime_start b
store b
When scanning from lifetime_start, this code will never encounter any direct use of a, and would miss the transitive use.

The code currently scans from the alloca, rather than from the lifetime_start. This might make only searching in single BB pretty limiting, since afaict an alloca can be detached from its lifetime region.

Harbormaster completed remote builds in B66611: Diff 282317.Jul 31 2020, 3:34 PM

In D83595#2188282, @guiand wrote:

The code currently scans from the alloca, rather than from the lifetime_start. This might make only searching in single BB pretty limiting, since afaict an alloca can be detached from its lifetime region.

Oh right. That's actually pretty weak, it's very common for lifetime to start in a basic block other than entry.

You want to start scanning at the same point where poisoning is going to be inserted.

It seems like collectInitializers leans heavily on the isPointerOffset function, which returns an offset if two pointers have a constant difference, nullopt if they don't. The problem here is that we can't distinguish isPointerOffset == nullopt happening because the offset is determined at runtime, or because the two pointers are completely unrelated.

It's a pretty big difference, because we don't want to poison a sequence like this:

%x = alloca [ i32, i32 }
%y = alloca i32
%z = load i32, i32* %y ; isPointerOffset == false
%x0 = getelementptr { i32, i32 }, { i32, i32 }* %x, i32 0, i32 0
%x1 = getelementptr { i32, i32 }, { i32, i32 }* %x, i32 0, i32 1
store i32 0, i32* %x0
store i32 0, i32* %x1

But we want to poison a sequence like this:

%x = alloca [ i32, i32 }
%y = getelementptr { i32, i32 }, { i32, i32 }* %x, i32 0, i32 %dynamic_offs
%z = load i32, i32* %y ; isPointerOffset == false
%x0 = getelementptr { i32, i32 }, { i32, i32 }* %x, i32 0, i32 0
%x1 = getelementptr { i32, i32 }, { i32, i32 }* %x, i32 0, i32 1
store i32 0, i32* %x0
store i32 0, i32* %x1

Right, that's what the isNoModRef check is for.

Integrated with Alias Analyzer, uses simpler mechanism for walking through BB and determining stores to alloca

Harbormaster completed remote builds in B66973: Diff 282982.Aug 4 2020, 11:47 AM

guiand added inline comments.Aug 4 2020, 11:53 AM

llvm/lib/Transforms/Instrumentation/MemorySanitizer.cpp
3848	No longer needed
3850	No longer needed
3871	It depends a lot on the program. Grep saw like a 8% decrease in binary size, while clang saw 0.5% to 1% decrease.

vitalybuka added inline comments.Aug 18 2020, 7:32 PM

llvm/lib/Transforms/Instrumentation/MemorySanitizer.cpp
3927	not sure why StoredBytes is a parameter and not just a local var in firstUsesAreStore

guiand added inline comments.Aug 22 2020, 12:32 PM

llvm/lib/Transforms/Instrumentation/MemorySanitizer.cpp
3927	True, that was leftover from a previous version of this patch.

FYI @kda @kstoimenov

Herald added a project: Restricted Project. · View Herald TranscriptMay 9 2022, 11:43 AM

vitalybuka commandeered this revision.Dec 7 2022, 1:31 PM

vitalybuka edited reviewers, added: guiand; removed: vitalybuka.

Herald added a subscriber: Enna1. · View Herald TranscriptDec 7 2022, 1:31 PM

stashing indefinitely

Not any time soon.

Revision Contents

Path

Size

compiler-rt/

test/

msan/

getaddrinfo.cpp

1 line

qsort.cpp

1 line

llvm/

lib/

Transforms/

Instrumentation/

MemorySanitizer.cpp

103 lines

test/

Instrumentation/

MemorySanitizer/

alloca-store.ll

68 lines

alloca.ll

7 lines

msan_x86_bts_asm.ll

6 lines

Diff 282982

compiler-rt/test/msan/getaddrinfo.cpp

	// RUN: %clangxx_msan -O0 %s -o %t && %run %t			// RUN: %clangxx_msan -O0 %s -o %t && %run %t

	#include <sys/types.h>			#include <sys/types.h>
	#include <sys/socket.h>			#include <sys/socket.h>
	#include <netdb.h>			#include <netdb.h>
	#include <stdlib.h>			#include <stdlib.h>

	void poison_stack_ahead() {			void poison_stack_ahead() {
	char buf[100000];			char buf[100000];
				__asm__ volatile("" ::"r"(buf));
	// With -O0 this poisons a large chunk of stack.			// With -O0 this poisons a large chunk of stack.
	}			}

	int main(void) {			int main(void) {
	poison_stack_ahead();			poison_stack_ahead();

	struct addrinfo *ai;			struct addrinfo *ai;

	// This should trigger loading of libnss_dns and friends.			// This should trigger loading of libnss_dns and friends.
	// Those libraries are typically uninstrumented.They will call strlen() on a			// Those libraries are typically uninstrumented.They will call strlen() on a
	// stack-allocated buffer, which is very likely to be poisoned. Test that we			// stack-allocated buffer, which is very likely to be poisoned. Test that we
	// don't report this as an UMR.			// don't report this as an UMR.
	int res = getaddrinfo("not-in-etc-hosts", NULL, NULL, &ai);			int res = getaddrinfo("not-in-etc-hosts", NULL, NULL, &ai);
	return 0;			return 0;
	}			}

compiler-rt/test/msan/qsort.cpp

	Show All 13 Lines
	constexpr size_t kSize2 = 7;			constexpr size_t kSize2 = 7;

	bool seen2;			bool seen2;

	void dummy(long a, long b, long c, long d, long e) {}			void dummy(long a, long b, long c, long d, long e) {}

	void poison_stack_and_param() {			void poison_stack_and_param() {
	char x[10000];			char x[10000];
				__asm__ volatile("" ::"r"(x));
	int y;			int y;
	dummy(y, y, y, y, y);			dummy(y, y, y, y, y);
	}			}

	__attribute__((always_inline)) int cmp(long a, long b) {			__attribute__((always_inline)) int cmp(long a, long b) {
	if (a < b)			if (a < b)
	return -1;			return -1;
	else if (a > b)			else if (a > b)
	▲ Show 20 Lines • Show All 56 Lines • Show Last 20 Lines

llvm/lib/Transforms/Instrumentation/MemorySanitizer.cpp

Show First 20 Lines • Show All 139 Lines • ▼ Show 20 Lines
//		//
// FIXME: This sanitizer does not yet handle scalable vectors		// FIXME: This sanitizer does not yet handle scalable vectors
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "llvm/Transforms/Instrumentation/MemorySanitizer.h"		#include "llvm/Transforms/Instrumentation/MemorySanitizer.h"
#include "llvm/ADT/APInt.h"		#include "llvm/ADT/APInt.h"
#include "llvm/ADT/ArrayRef.h"		#include "llvm/ADT/ArrayRef.h"
		#include "llvm/ADT/BitVector.h"
#include "llvm/ADT/DepthFirstIterator.h"		#include "llvm/ADT/DepthFirstIterator.h"
#include "llvm/ADT/SmallSet.h"		#include "llvm/ADT/SmallSet.h"
#include "llvm/ADT/SmallString.h"		#include "llvm/ADT/SmallString.h"
#include "llvm/ADT/SmallVector.h"		#include "llvm/ADT/SmallVector.h"
#include "llvm/ADT/StringExtras.h"		#include "llvm/ADT/StringExtras.h"
#include "llvm/ADT/StringRef.h"		#include "llvm/ADT/StringRef.h"
#include "llvm/ADT/Triple.h"		#include "llvm/ADT/Triple.h"
		#include "llvm/Analysis/AliasAnalysis.h"
		#include "llvm/Analysis/MemoryLocation.h"
#include "llvm/Analysis/TargetLibraryInfo.h"		#include "llvm/Analysis/TargetLibraryInfo.h"
		#include "llvm/Analysis/ValueTracking.h"
#include "llvm/IR/Argument.h"		#include "llvm/IR/Argument.h"
#include "llvm/IR/Attributes.h"		#include "llvm/IR/Attributes.h"
#include "llvm/IR/BasicBlock.h"		#include "llvm/IR/BasicBlock.h"
#include "llvm/IR/CallingConv.h"		#include "llvm/IR/CallingConv.h"
#include "llvm/IR/Constant.h"		#include "llvm/IR/Constant.h"
#include "llvm/IR/Constants.h"		#include "llvm/IR/Constants.h"
#include "llvm/IR/DataLayout.h"		#include "llvm/IR/DataLayout.h"
#include "llvm/IR/DerivedTypes.h"		#include "llvm/IR/DerivedTypes.h"
▲ Show 20 Lines • Show All 327 Lines • ▼ Show 20 Lines	public:
}		}

// MSan cannot be moved or copied because of MapParams.		// MSan cannot be moved or copied because of MapParams.
MemorySanitizer(MemorySanitizer &&) = delete;		MemorySanitizer(MemorySanitizer &&) = delete;
MemorySanitizer &operator=(MemorySanitizer &&) = delete;		MemorySanitizer &operator=(MemorySanitizer &&) = delete;
MemorySanitizer(const MemorySanitizer &) = delete;		MemorySanitizer(const MemorySanitizer &) = delete;
MemorySanitizer &operator=(const MemorySanitizer &) = delete;		MemorySanitizer &operator=(const MemorySanitizer &) = delete;

bool sanitizeFunction(Function &F, TargetLibraryInfo &TLI);		bool sanitizeFunction(Function &F, TargetLibraryInfo &TLI, AAResults *AA);

private:		private:
friend struct MemorySanitizerVisitor;		friend struct MemorySanitizerVisitor;
friend struct VarArgAMD64Helper;		friend struct VarArgAMD64Helper;
friend struct VarArgMIPS64Helper;		friend struct VarArgMIPS64Helper;
friend struct VarArgAArch64Helper;		friend struct VarArgAArch64Helper;
friend struct VarArgPowerPC64Helper;		friend struct VarArgPowerPC64Helper;
friend struct VarArgSystemZHelper;		friend struct VarArgSystemZHelper;
▲ Show 20 Lines • Show All 128 Lines • ▼ Show 20 Lines	struct MemorySanitizerLegacyPass : public FunctionPass {
MemorySanitizerLegacyPass(MemorySanitizerOptions Options = {})		MemorySanitizerLegacyPass(MemorySanitizerOptions Options = {})
: FunctionPass(ID), Options(Options) {		: FunctionPass(ID), Options(Options) {
initializeMemorySanitizerLegacyPassPass(*PassRegistry::getPassRegistry());		initializeMemorySanitizerLegacyPassPass(*PassRegistry::getPassRegistry());
}		}
StringRef getPassName() const override { return "MemorySanitizerLegacyPass"; }		StringRef getPassName() const override { return "MemorySanitizerLegacyPass"; }

void getAnalysisUsage(AnalysisUsage &AU) const override {		void getAnalysisUsage(AnalysisUsage &AU) const override {
AU.addRequired<TargetLibraryInfoWrapperPass>();		AU.addRequired<TargetLibraryInfoWrapperPass>();
		AU.addRequired<AAResultsWrapperPass>();
}		}

bool runOnFunction(Function &F) override {		bool runOnFunction(Function &F) override {
		AAResults *AA = &getAnalysis<AAResultsWrapperPass>().getAAResults();
return MSan->sanitizeFunction(		return MSan->sanitizeFunction(
F, getAnalysis<TargetLibraryInfoWrapperPass>().getTLI(F));		F, getAnalysis<TargetLibraryInfoWrapperPass>().getTLI(F), AA);
}		}
bool doInitialization(Module &M) override;		bool doInitialization(Module &M) override;

Optional<MemorySanitizer> MSan;		Optional<MemorySanitizer> MSan;
MemorySanitizerOptions Options;		MemorySanitizerOptions Options;
};		};

template <class T> T getOptOrDefault(const cl::opt<T> &Opt, T Default) {		template <class T> T getOptOrDefault(const cl::opt<T> &Opt, T Default) {
return (Opt.getNumOccurrences() > 0) ? Opt : Default;		return (Opt.getNumOccurrences() > 0) ? Opt : Default;
}		}

} // end anonymous namespace		} // end anonymous namespace

MemorySanitizerOptions::MemorySanitizerOptions(int TO, bool R, bool K)		MemorySanitizerOptions::MemorySanitizerOptions(int TO, bool R, bool K)
: Kernel(getOptOrDefault(ClEnableKmsan, K)),		: Kernel(getOptOrDefault(ClEnableKmsan, K)),
TrackOrigins(getOptOrDefault(ClTrackOrigins, Kernel ? 2 : TO)),		TrackOrigins(getOptOrDefault(ClTrackOrigins, Kernel ? 2 : TO)),
Recover(getOptOrDefault(ClKeepGoing, Kernel \|\| R)) {}		Recover(getOptOrDefault(ClKeepGoing, Kernel \|\| R)) {}

PreservedAnalyses MemorySanitizerPass::run(Function &F,		PreservedAnalyses MemorySanitizerPass::run(Function &F,
FunctionAnalysisManager &FAM) {		FunctionAnalysisManager &FAM) {
MemorySanitizer Msan(*F.getParent(), Options);		MemorySanitizer Msan(*F.getParent(), Options);
if (Msan.sanitizeFunction(F, FAM.getResult<TargetLibraryAnalysis>(F)))		AAResults &AA = FAM.getResult<AAManager>(F);
		if (Msan.sanitizeFunction(F, FAM.getResult<TargetLibraryAnalysis>(F), &AA))
return PreservedAnalyses::none();		return PreservedAnalyses::none();
return PreservedAnalyses::all();		return PreservedAnalyses::all();
}		}

PreservedAnalyses MemorySanitizerPass::run(Module &M,		PreservedAnalyses MemorySanitizerPass::run(Module &M,
ModuleAnalysisManager &AM) {		ModuleAnalysisManager &AM) {
if (Options.Kernel)		if (Options.Kernel)
return PreservedAnalyses::all();		return PreservedAnalyses::all();
insertModuleCtor(M);		insertModuleCtor(M);
return PreservedAnalyses::none();		return PreservedAnalyses::none();
}		}

char MemorySanitizerLegacyPass::ID = 0;		char MemorySanitizerLegacyPass::ID = 0;

INITIALIZE_PASS_BEGIN(MemorySanitizerLegacyPass, "msan",		INITIALIZE_PASS_BEGIN(MemorySanitizerLegacyPass, "msan",
"MemorySanitizer: detects uninitialized reads.", false,		"MemorySanitizer: detects uninitialized reads.", false,
false)		false)
INITIALIZE_PASS_DEPENDENCY(TargetLibraryInfoWrapperPass)		INITIALIZE_PASS_DEPENDENCY(TargetLibraryInfoWrapperPass)
		INITIALIZE_PASS_DEPENDENCY(AAResultsWrapperPass)
INITIALIZE_PASS_END(MemorySanitizerLegacyPass, "msan",		INITIALIZE_PASS_END(MemorySanitizerLegacyPass, "msan",
"MemorySanitizer: detects uninitialized reads.", false,		"MemorySanitizer: detects uninitialized reads.", false,
false)		false)

FunctionPass *		FunctionPass *
llvm::createMemorySanitizerLegacyPassPass(MemorySanitizerOptions Options) {		llvm::createMemorySanitizerLegacyPassPass(MemorySanitizerOptions Options) {
return new MemorySanitizerLegacyPass(Options);		return new MemorySanitizerLegacyPass(Options);
}		}
▲ Show 20 Lines • Show All 354 Lines • ▼ Show 20 Lines
struct MemorySanitizerVisitor : public InstVisitor<MemorySanitizerVisitor> {		struct MemorySanitizerVisitor : public InstVisitor<MemorySanitizerVisitor> {
Function &F;		Function &F;
MemorySanitizer &MS;		MemorySanitizer &MS;
SmallVector<PHINode *, 16> ShadowPHINodes, OriginPHINodes;		SmallVector<PHINode *, 16> ShadowPHINodes, OriginPHINodes;
ValueMap<Value, Value> ShadowMap, OriginMap;		ValueMap<Value, Value> ShadowMap, OriginMap;
std::unique_ptr<VarArgHelper> VAHelper;		std::unique_ptr<VarArgHelper> VAHelper;
const TargetLibraryInfo *TLI;		const TargetLibraryInfo *TLI;
BasicBlock *ActualFnStart;		BasicBlock *ActualFnStart;
		AAResults *AA;

// The following flags disable parts of MSan instrumentation based on		// The following flags disable parts of MSan instrumentation based on
// exclusion list contents and command-line options.		// exclusion list contents and command-line options.
bool InsertChecks;		bool InsertChecks;
bool PropagateShadow;		bool PropagateShadow;
bool PoisonStack;		bool PoisonStack;
bool PoisonUndef;		bool PoisonUndef;

struct ShadowOriginAndInsertPoint {		struct ShadowOriginAndInsertPoint {
Value *Shadow;		Value *Shadow;
Value *Origin;		Value *Origin;
Instruction *OrigIns;		Instruction *OrigIns;

ShadowOriginAndInsertPoint(Value S, Value O, Instruction *I)		ShadowOriginAndInsertPoint(Value S, Value O, Instruction *I)
: Shadow(S), Origin(O), OrigIns(I) {}		: Shadow(S), Origin(O), OrigIns(I) {}
};		};
SmallVector<ShadowOriginAndInsertPoint, 16> InstrumentationList;		SmallVector<ShadowOriginAndInsertPoint, 16> InstrumentationList;
bool InstrumentLifetimeStart = ClHandleLifetimeIntrinsics;		bool InstrumentLifetimeStart = ClHandleLifetimeIntrinsics;
SmallSet<AllocaInst *, 16> AllocaSet;		SmallSet<AllocaInst *, 16> AllocaSet;
SmallVector<std::pair<IntrinsicInst , AllocaInst >, 16> LifetimeStartList;		SmallVector<std::pair<IntrinsicInst , AllocaInst >, 16> LifetimeStartList;
SmallVector<StoreInst *, 16> StoreList;		SmallVector<StoreInst *, 16> StoreList;

MemorySanitizerVisitor(Function &F, MemorySanitizer &MS,		MemorySanitizerVisitor(Function &F, MemorySanitizer &MS,
const TargetLibraryInfo &TLI)		const TargetLibraryInfo &TLI, AAResults *AA)
: F(F), MS(MS), VAHelper(CreateVarArgHelper(F, MS, *this)), TLI(&TLI) {		: F(F), MS(MS), VAHelper(CreateVarArgHelper(F, MS, *this)), TLI(&TLI),
		AA(AA) {
bool SanitizeFunction = F.hasFnAttribute(Attribute::SanitizeMemory);		bool SanitizeFunction = F.hasFnAttribute(Attribute::SanitizeMemory);
InsertChecks = SanitizeFunction;		InsertChecks = SanitizeFunction;
PropagateShadow = SanitizeFunction;		PropagateShadow = SanitizeFunction;
PoisonStack = SanitizeFunction && ClPoisonStack;		PoisonStack = SanitizeFunction && ClPoisonStack;
PoisonUndef = SanitizeFunction && ClPoisonUndef;		PoisonUndef = SanitizeFunction && ClPoisonUndef;

MS.initializeCallbacks(*F.getParent());		MS.initializeCallbacks(*F.getParent());
if (MS.CompileKernel)		if (MS.CompileKernel)
▲ Show 20 Lines • Show All 2,735 Lines • ▼ Show 20 Lines	if (PoisonStack) {
{IRB.CreatePointerCast(&I, IRB.getInt8PtrTy()), Len,		{IRB.CreatePointerCast(&I, IRB.getInt8PtrTy()), Len,
IRB.CreatePointerCast(Descr, IRB.getInt8PtrTy())});		IRB.CreatePointerCast(Descr, IRB.getInt8PtrTy())});
} else {		} else {
IRB.CreateCall(MS.MsanUnpoisonAllocaFn,		IRB.CreateCall(MS.MsanUnpoisonAllocaFn,
{IRB.CreatePointerCast(&I, IRB.getInt8PtrTy()), Len});		{IRB.CreatePointerCast(&I, IRB.getInt8PtrTy()), Len});
}		}
}		}

		// Bytes in an alloca already stored to
		using StoredSet = BitVector;
		// Mapping of Value modifying a pointer => Offset it's pointing to
		using RedefOffsets = DenseMap<Value *, uint64_t>;
		guiandUnsubmitted Done Reply Inline Actions TODO: get rid of `Const` in this function name guiand: TODO: get rid of `Const` in this function name
		guiandUnsubmitted Done Reply Inline Actions No longer needed guiand: No longer needed
		// Set of instructions consuming a given alloca
		using UserSet = SmallPtrSet<Value *, 10>;
		guiandUnsubmitted Done Reply Inline Actions No longer needed guiand: No longer needed

		// Determine whether all paths from this alloca lead to storing into it.
		// If true, we can omit poisoning the alloca because it'll just be
		// overwritten anyway.
		bool firstUsesAreStore(Value Alloca, Instruction CurrInst,
		StoredSet StoredBytes) {
		MemoryLocation AllocaLoc{Alloca, StoredBytes.size()};
		const DataLayout &DL = F.getParent()->getDataLayout();

		for (; CurrInst && !StoredBytes.all();
		CurrInst = CurrInst->getNextNonDebugInstruction()) {
		if (isNoModRef(AA->getModRefInfo(CurrInst, AllocaLoc)))
		continue;

		if (StoreInst *Store = dyn_cast<StoreInst>(CurrInst)) {
		uint64_t StoreSize =
		DL.getTypeStoreSize(Store->getValueOperand()->getType());
		// The store could be a use if it's storing the alloca
		// pointer somewhere. But we don't want that.
		Value *Ptr = Store->getPointerOperand();
		if (auto Offset = isPointerOffset(Alloca, Ptr, DL))
		vitalybukaAuthorUnsubmitted Not Done Reply Inline Actions Store may write only part of alloca. How useful (binary size) the patch as is? vitalybuka: Store may write only part of alloca. How useful (binary size) the patch as is?
		guiandUnsubmitted Done Reply Inline Actions It depends a lot on the program. Grep saw like a 8% decrease in binary size, while clang saw 0.5% to 1% decrease. guiand: It depends a lot on the program. Grep saw like a 8% decrease in binary size, while clang saw 0.
		StoredBytes.set(Offset, Offset + StoreSize);
		continue;
		}

		if (IntrinsicInst *Intrinsic = dyn_cast<IntrinsicInst>(CurrInst)) {
		// Ignore lifetime intrinsics, count others as a use.
		Intrinsic::ID ID = Intrinsic->getIntrinsicID();
		if (ID != Intrinsic::lifetime_start && ID != Intrinsic::lifetime_end)
		return false;
		continue;
		}

		if (CallInst *Call = dyn_cast<CallInst>(CurrInst)) {
		if (Call->getCalledFunction() != MS.MemcpyFn.getCallee() &&
		Call->getCalledFunction() != MS.MemsetFn.getCallee() &&
		Call->getCalledFunction() != MS.MemmoveFn.getCallee())
		// We want to consider instrumented memory writing functions
		// as stores here. Anything else is considered a use.
		return false;

		Value *StoreSizeVal = Call->getArgOperand(2);
		Value *Dest = Call->getArgOperand(0);
		auto Offset = isPointerOffset(Alloca, Dest, DL);
		if (!Offset)
		eugenisUnsubmitted Done Reply Inline Actions prefer early exits / continue(s) eugenis: prefer early exits / continue(s)
		return false;

		ConstantInt *StoreSizeConst = dyn_cast<ConstantInt>(StoreSizeVal);
		if (!StoreSizeConst)
		// Can only decide this covers the whole shadow if we can
		// check the constant store size.
		return false;
		uint64_t StoreSize = StoreSizeConst->getLimitedValue();
		StoredBytes.set(Offset, Offset + StoreSize);
		continue;
		}

		return false;
		}

		// Reached end of basic block
		// TODO: Check successive BBs
		return StoredBytes.all();
		};

		eugenisUnsubmitted Done Reply Inline Actions This function is never called with StoreOffs != 0, which seems necessary to handle a chain of GEPs. eugenis: This function is never called with StoreOffs != 0, which seems necessary to handle a chain of…
void instrumentAlloca(AllocaInst &I, Instruction *InsPoint = nullptr) {		void instrumentAlloca(AllocaInst &I, Instruction *InsPoint = nullptr) {
if (!InsPoint)		if (!InsPoint)
InsPoint = &I;		InsPoint = &I;
IRBuilder<> IRB(InsPoint->getNextNode());		IRBuilder<> IRB(InsPoint->getNextNode());
const DataLayout &DL = F.getParent()->getDataLayout();		const DataLayout &DL = F.getParent()->getDataLayout();
		eugenisUnsubmitted Done Reply Inline Actions Only if same BB. eugenis: Only if same BB.
uint64_t TypeSize = DL.getTypeAllocSize(I.getAllocatedType());		uint64_t TypeSize = DL.getTypeAllocSize(I.getAllocatedType());
Value *Len = ConstantInt::get(MS.IntptrTy, TypeSize);		Value *Len = ConstantInt::get(MS.IntptrTy, TypeSize);
if (I.isArrayAllocation())		if (I.isArrayAllocation()) {
Len = IRB.CreateMul(Len, I.getArraySize());		Len = IRB.CreateMul(Len, I.getArraySize());
		} else {
		StoredSet StoredBytes(TypeSize, false);
		if (firstUsesAreStore(&I, InsPoint, StoredBytes))
		vitalybukaAuthorUnsubmitted Not Done Reply Inline Actions not sure why StoredBytes is a parameter and not just a local var in firstUsesAreStore vitalybuka: not sure why StoredBytes is a parameter and not just a local var in firstUsesAreStore
		guiandUnsubmitted Done Reply Inline Actions True, that was leftover from a previous version of this patch. guiand: True, that was leftover from a previous version of this patch.
		return;
		}

if (MS.CompileKernel)		if (MS.CompileKernel)
poisonAllocaKmsan(I, IRB, Len);		poisonAllocaKmsan(I, IRB, Len);
else		else
poisonAllocaUserspace(I, IRB, Len);		poisonAllocaUserspace(I, IRB, Len);
}		}

void visitAllocaInst(AllocaInst &I) {		void visitAllocaInst(AllocaInst &I) {
setShadow(&I, getCleanShadow(&I));		setShadow(&I, getCleanShadow(&I));
setOrigin(&I, getCleanOrigin());		setOrigin(&I, getCleanOrigin());
// We'll get to this alloca later unless it's poisoned at the corresponding		// We'll get to this alloca later unless it's poisoned at the corresponding
		eugenisUnsubmitted Not Done Reply Inline Actions I don't think this is entirely correct. Consider this: entry { alloca 32; store 0..16 } -> A, B A { store 16..32 } -> B B { load } traversing entry->A->B will mark B as done even though there is a path where control flow arrives at B with alloca not entirely initialized. What happens to the benchmarks if this algorithm is limited to a single basic block? eugenis: I don't think this is entirely correct. Consider this: ``` entry { alloca 32; store 0..16 } ->…
		guiandUnsubmitted Done Reply Inline Actions I tried to write the code so that it would look to see if all paths are initialized before a load. So in your example, it would look at the path `entry -> A -> B`, and it would see that it's fully stored, and it would continue looking at the next path. For `entry -> B`, `if (!firstUsesAreStore) return false` would come into effect, and we won't optimize away the poisoning. guiand: I tried to write the code so that it would look to see if all paths are initialized before a…
		guiandUnsubmitted Done Reply Inline Actions Oh, I see where the issue is here: the `if TraversedSet.count` check. I needed some way to prevent cycles in the graph, so that was the first thing I reached for, but you're right, this approach breaks the algorithm. guiand: Oh, I see where the issue is here: the `if TraversedSet.count` check. I needed some way to…
		eugenisUnsubmitted Not Done Reply Inline Actions Right. And it does not even require a cycle. This is a dataflow problem - assign labels to graph nodes showing which bytes are always initialized at entry, and update them based on predecessor's labels until the fixed point is reached. But in this case, I suspect, most of the optimization opportunities are within a single BB. eugenis: Right. And it does not even require a cycle. This is a dataflow problem - assign labels to…
		guiandUnsubmitted Done Reply Inline Actions I think this could be fixed by removing the target node from the set after the recursive call is finished. But I'll measure the impact of removing the inter-block stuff entirely to see how much value it adds. guiand: I think this could be fixed by removing the target node from the set after the recursive call…
// llvm.lifetime.start.		// llvm.lifetime.start.
AllocaSet.insert(&I);		AllocaSet.insert(&I);
}		}

void visitSelectInst(SelectInst& I) {		void visitSelectInst(SelectInst& I) {
IRBuilder<> IRB(&I);		IRBuilder<> IRB(&I);
// a = select b, c, d		// a = select b, c, d
Value *B = I.getCondition();		Value *B = I.getCondition();
▲ Show 20 Lines • Show All 1,391 Lines • ▼ Show 20 Lines	else if (TargetTriple.getArch() == Triple::ppc64 \|\|
TargetTriple.getArch() == Triple::ppc64le)		TargetTriple.getArch() == Triple::ppc64le)
return new VarArgPowerPC64Helper(Func, Msan, Visitor);		return new VarArgPowerPC64Helper(Func, Msan, Visitor);
else if (TargetTriple.getArch() == Triple::systemz)		else if (TargetTriple.getArch() == Triple::systemz)
return new VarArgSystemZHelper(Func, Msan, Visitor);		return new VarArgSystemZHelper(Func, Msan, Visitor);
else		else
return new VarArgNoOpHelper(Func, Msan, Visitor);		return new VarArgNoOpHelper(Func, Msan, Visitor);
}		}

bool MemorySanitizer::sanitizeFunction(Function &F, TargetLibraryInfo &TLI) {		bool MemorySanitizer::sanitizeFunction(Function &F, TargetLibraryInfo &TLI,
		AAResults *AA) {
if (!CompileKernel && F.getName() == kMsanModuleCtorName)		if (!CompileKernel && F.getName() == kMsanModuleCtorName)
return false;		return false;

MemorySanitizerVisitor Visitor(F, *this, TLI);		MemorySanitizerVisitor Visitor(F, *this, TLI, AA);

// Clear out readonly/readnone attributes.		// Clear out readonly/readnone attributes.
AttrBuilder B;		AttrBuilder B;
B.addAttribute(Attribute::ReadOnly)		B.addAttribute(Attribute::ReadOnly)
.addAttribute(Attribute::ReadNone)		.addAttribute(Attribute::ReadNone)
.addAttribute(Attribute::WriteOnly)		.addAttribute(Attribute::WriteOnly)
.addAttribute(Attribute::ArgMemOnly)		.addAttribute(Attribute::ArgMemOnly)
.addAttribute(Attribute::Speculatable);		.addAttribute(Attribute::Speculatable);
F.removeAttributes(AttributeList::FunctionIndex, B);		F.removeAttributes(AttributeList::FunctionIndex, B);

return Visitor.runOnFunction();		return Visitor.runOnFunction();
}		}

llvm/test/Instrumentation/MemorySanitizer/alloca-store.ll

This file was added.

				; Tests if MSAN can optimize away dead-code alloca poisons

				; RUN: opt < %s -msan-track-origins=2 -msan-check-access-address=0 -msan-poison-stack-with-call=1 -S -passes=msan 2>&1 \| FileCheck \
				; RUN: %s "--check-prefixes=CHECK"

				target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"
				target triple = "x86_64-unknown-linux-gnu"

				define i32* @simple(i32 %val) sanitize_memory {
				; CHECK: define {{.*}} @simple
				%p = alloca i32
				store i32 %val, i32* %p
				; CHECK-NOT: call void @__msan_poison_stack
				ret i32* %p
				}

				define i32* @bitcast({ i32 } %val) sanitize_memory {
				; CHECK: define {{.*}} @bitcast
				%p = alloca i32
				%ps = bitcast i32* %p to { i32 }*
				store { i32 } %val, { i32 }* %ps
				; CHECK-NOT: call void @__msan_poison_stack
				ret i32* %p
				}

				define i32* @lifetime(i32 %val) sanitize_memory {
				; CHECK: define {{.*}} @lifetime
				%p = alloca i32
				%p8 = bitcast i32* %p to i8*
				call void @llvm.lifetime.start(i64 4, i8* %p8)
				store i32 %val, i32* %p
				; CHECK-NOT: call void @__msan_poison_stack
				call void @llvm.lifetime.end(i64 4, i8* %p8)
				ret i32* %p
				}

				define i32* @with_memset(i8 %val) sanitize_memory {
				; CHECK: define {{.*}} @with_memset
				%p = alloca i32
				%p8 = bitcast i32* %p to i8*
				call void @llvm.memset(i8* %p8, i8 0, i64 4, i1 false)
				; CHECK-NOT: call void @__msan_poison_stack
				ret i32* %p
				}

				%struct.multi = type { i32, { i8, i8 }, i8, i8 }

				define i32* @piecemeal(i8 %val) sanitize_memory {
				; CHECK: define {{.*}} @piecemeal
				%p = alloca %struct.multi
				%p0 = getelementptr %struct.multi, %struct.multi* %p, i32 0, i32 0
				%p1 = getelementptr %struct.multi, %struct.multi* %p, i32 0, i32 1
				%p2 = getelementptr %struct.multi, %struct.multi* %p, i32 0, i32 2
				%p3 = getelementptr %struct.multi, %struct.multi* %p, i32 0, i32 3
				%p1_0 = getelementptr { i8, i8 }, { i8, i8 }* %p1, i32 0, i32 0
				%p1_1 = getelementptr { i8, i8 }, { i8, i8 }* %p1, i32 0, i32 1
				store i32 0, i32* %p0
				store i8 0, i8* %p1_0
				store i8 0, i8* %p1_1
				store i8 0, i8* %p2
				store i8 0, i8* %p3
				; CHECK-NOT: call void @__msan_poison_stack
				ret i32* %p0
				}

				declare void @llvm.memset(i8*, i8, i64, i1)
				declare void @llvm.lifetime.start(i64, i8* nocapture)
				declare void @llvm.lifetime.end(i64, i8* nocapture)

llvm/test/Instrumentation/MemorySanitizer/alloca.ll

	Show All 16 Lines
	; RUN: opt < %s -msan -msan-kernel=1 -S \| FileCheck %s --check-prefixes=CHECK,KMSAN			; RUN: opt < %s -msan -msan-kernel=1 -S \| FileCheck %s --check-prefixes=CHECK,KMSAN

	target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"			target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	define void @static() sanitize_memory {			define void @static() sanitize_memory {
	entry:			entry:
	%x = alloca i32, align 4			%x = alloca i32, align 4
				%y = ptrtoint i32* %x to i32
	ret void			ret void
	}			}

	; CHECK-LABEL: define void @static(			; CHECK-LABEL: define void @static(
	; INLINE: call void @llvm.memset.p0i8.i64(i8* align 4 {{.*}}, i8 -1, i64 4, i1 false)			; INLINE: call void @llvm.memset.p0i8.i64(i8* align 4 {{.*}}, i8 -1, i64 4, i1 false)
	; CALL: call void @__msan_poison_stack(i8* {{.*}}, i64 4)			; CALL: call void @__msan_poison_stack(i8* {{.*}}, i64 4)
	; ORIGIN: call void @__msan_set_alloca_origin4(i8* {{.*}}, i64 4,			; ORIGIN: call void @__msan_set_alloca_origin4(i8* {{.*}}, i64 4,
	; KMSAN: call void @__msan_poison_alloca(i8* {{.*}}, i64 4,			; KMSAN: call void @__msan_poison_alloca(i8* {{.*}}, i64 4,
	; CHECK: ret void			; CHECK: ret void


	define void @dynamic() sanitize_memory {			define void @dynamic() sanitize_memory {
	entry:			entry:
	br label %l			br label %l
	l:			l:
	%x = alloca i32, align 4			%x = alloca i32, align 4
				%y = ptrtoint i32* %x to i32
	ret void			ret void
	}			}

	; CHECK-LABEL: define void @dynamic(			; CHECK-LABEL: define void @dynamic(
	; INLINE: call void @llvm.memset.p0i8.i64(i8* align 4 {{.*}}, i8 -1, i64 4, i1 false)			; INLINE: call void @llvm.memset.p0i8.i64(i8* align 4 {{.*}}, i8 -1, i64 4, i1 false)
	; CALL: call void @__msan_poison_stack(i8* {{.*}}, i64 4)			; CALL: call void @__msan_poison_stack(i8* {{.*}}, i64 4)
	; ORIGIN: call void @__msan_set_alloca_origin4(i8* {{.*}}, i64 4,			; ORIGIN: call void @__msan_set_alloca_origin4(i8* {{.*}}, i64 4,
	; KMSAN: call void @__msan_poison_alloca(i8* {{.*}}, i64 4,			; KMSAN: call void @__msan_poison_alloca(i8* {{.*}}, i64 4,
	; CHECK: ret void			; CHECK: ret void

	define void @array() sanitize_memory {			define void @array() sanitize_memory {
	entry:			entry:
	%x = alloca i32, i64 5, align 4			%x = alloca i32, i64 5, align 4
				%y = ptrtoint i32* %x to i32
	ret void			ret void
	}			}

	; CHECK-LABEL: define void @array(			; CHECK-LABEL: define void @array(
	; INLINE: call void @llvm.memset.p0i8.i64(i8* align 4 {{.*}}, i8 -1, i64 20, i1 false)			; INLINE: call void @llvm.memset.p0i8.i64(i8* align 4 {{.*}}, i8 -1, i64 20, i1 false)
	; CALL: call void @__msan_poison_stack(i8* {{.*}}, i64 20)			; CALL: call void @__msan_poison_stack(i8* {{.*}}, i64 20)
	; ORIGIN: call void @__msan_set_alloca_origin4(i8* {{.*}}, i64 20,			; ORIGIN: call void @__msan_set_alloca_origin4(i8* {{.*}}, i64 20,
	; KMSAN: call void @__msan_poison_alloca(i8* {{.*}}, i64 20,			; KMSAN: call void @__msan_poison_alloca(i8* {{.*}}, i64 20,
	; CHECK: ret void			; CHECK: ret void

	define void @array_non_const(i64 %cnt) sanitize_memory {			define void @array_non_const(i64 %cnt) sanitize_memory {
	entry:			entry:
	%x = alloca i32, i64 %cnt, align 4			%x = alloca i32, i64 %cnt, align 4
				%y = ptrtoint i32* %x to i32
	ret void			ret void
	}			}

	; CHECK-LABEL: define void @array_non_const(			; CHECK-LABEL: define void @array_non_const(
	; CHECK: %[[A:.*]] = mul i64 4, %cnt			; CHECK: %[[A:.*]] = mul i64 4, %cnt
	; INLINE: call void @llvm.memset.p0i8.i64(i8* align 4 {{.*}}, i8 -1, i64 %[[A]], i1 false)			; INLINE: call void @llvm.memset.p0i8.i64(i8* align 4 {{.*}}, i8 -1, i64 %[[A]], i1 false)
	; CALL: call void @__msan_poison_stack(i8* {{.*}}, i64 %[[A]])			; CALL: call void @__msan_poison_stack(i8* {{.*}}, i64 %[[A]])
	; ORIGIN: call void @__msan_set_alloca_origin4(i8* {{.*}}, i64 %[[A]],			; ORIGIN: call void @__msan_set_alloca_origin4(i8* {{.*}}, i64 %[[A]],
	; KMSAN: call void @__msan_poison_alloca(i8* {{.*}}, i64 %[[A]],			; KMSAN: call void @__msan_poison_alloca(i8* {{.*}}, i64 %[[A]],
	; CHECK: ret void			; CHECK: ret void

	; Check that the local is unpoisoned in the absence of sanitize_memory			; Check that the local is unpoisoned in the absence of sanitize_memory
	define void @unpoison_local() {			define void @unpoison_local() {
	entry:			entry:
	%x = alloca i32, i64 5, align 4			%x = alloca i32, i64 5, align 4
				%y = ptrtoint i32* %x to i32
	ret void			ret void
	}			}

	; CHECK-LABEL: define void @unpoison_local(			; CHECK-LABEL: define void @unpoison_local(
	; INLINE: call void @llvm.memset.p0i8.i64(i8* align 4 {{.*}}, i8 0, i64 20, i1 false)			; INLINE: call void @llvm.memset.p0i8.i64(i8* align 4 {{.*}}, i8 0, i64 20, i1 false)
	; CALL: call void @llvm.memset.p0i8.i64(i8* align 4 {{.*}}, i8 0, i64 20, i1 false)			; CALL: call void @llvm.memset.p0i8.i64(i8* align 4 {{.*}}, i8 0, i64 20, i1 false)
	; ORIGIN-NOT: call void @__msan_set_alloca_origin4(i8* {{.*}}, i64 20,			; ORIGIN-NOT: call void @__msan_set_alloca_origin4(i8* {{.*}}, i64 20,
	; KMSAN: call void @__msan_unpoison_alloca(i8* {{.*}}, i64 20)			; KMSAN: call void @__msan_unpoison_alloca(i8* {{.*}}, i64 20)
	; CHECK: ret void			; CHECK: ret void

	; Check that every llvm.lifetime.start() causes poisoning of locals.			; Check that every llvm.lifetime.start() causes poisoning of locals.
	define void @lifetime_start() sanitize_memory {			define void @lifetime_start() sanitize_memory {
	entry:			entry:
	%x = alloca i32, align 4			%x = alloca i32, align 4
	%c = bitcast i32* %x to i8*			%c = bitcast i32* %x to i8*
	br label %another_bb			br label %another_bb

	another_bb:			another_bb:
	call void @llvm.lifetime.start.p0i8(i64 4, i8* nonnull %c)			call void @llvm.lifetime.start.p0i8(i64 4, i8* nonnull %c)
				%y = load i32, i32* %x
	store i32 7, i32* %x			store i32 7, i32* %x
	call void @llvm.lifetime.end.p0i8(i64 4, i8* nonnull %c)			call void @llvm.lifetime.end.p0i8(i64 4, i8* nonnull %c)
	call void @llvm.lifetime.start.p0i8(i64 4, i8* nonnull %c)			call void @llvm.lifetime.start.p0i8(i64 4, i8* nonnull %c)
				%z = load i32, i32* %x
	store i32 8, i32* %x			store i32 8, i32* %x
	call void @llvm.lifetime.end.p0i8(i64 4, i8* nonnull %c)			call void @llvm.lifetime.end.p0i8(i64 4, i8* nonnull %c)
	ret void			ret void
	}			}

	; CHECK-LABEL: define void @lifetime_start(			; CHECK-LABEL: define void @lifetime_start(
	; CHECK-LABEL: entry:			; CHECK-LABEL: entry:
	; CHECK: %x = alloca i32			; CHECK: %x = alloca i32
	▲ Show 20 Lines • Show All 109 Lines • Show Last 20 Lines

llvm/test/Instrumentation/MemorySanitizer/msan_x86_bts_asm.ll

	Show First 20 Lines • Show All 52 Lines • ▼ Show 20 Lines

	if.then: ; preds = %entry			if.then: ; preds = %entry
	ret i32 0			ret i32 0

	if.else: ; preds = %entry			if.else: ; preds = %entry
	ret i32 1			ret i32 1
	}			}

	; %nr is first poisoned, then unpoisoned (written to). Need to optimize this in the future.			; %nr is first poisoned, then unpoisoned (written to). No need to poison.
	; CHECK: [[NRC1:%.]] = bitcast i64 %nr to i8*			; CHECK-NOT: [[NRC1:%.]] = bitcast i64 %nr to i8*
	; CHECK: call void @__msan_poison_alloca(i8* [[NRC1]]{{.*}})			; CHECK-NOT: call void @__msan_poison_alloca(i8* [[NRC1]]{{.*}})
	; CHECK: [[NRC2:%.]] = bitcast i64 %nr to i8*			; CHECK: [[NRC2:%.]] = bitcast i64 %nr to i8*
	; CHECK: call { i8, i32 } @__msan_metadata_ptr_for_store_8(i8* [[NRC2]])			; CHECK: call { i8, i32 } @__msan_metadata_ptr_for_store_8(i8* [[NRC2]])

	; Hooks for inputs usually go before the assembly statement. But here we have none,			; Hooks for inputs usually go before the assembly statement. But here we have none,
	; because %nr is passed by value. However we check %nr for being initialized.			; because %nr is passed by value. However we check %nr for being initialized.
	; CHECK-CONS: [[NRC3:%.]] = bitcast i64 %nr to i8*			; CHECK-CONS: [[NRC3:%.]] = bitcast i64 %nr to i8*
	; CHECK-CONS: call { i8, i32 } @__msan_metadata_ptr_for_load_8(i8* [[NRC3]])			; CHECK-CONS: call { i8, i32 } @__msan_metadata_ptr_for_load_8(i8* [[NRC3]])

	Show All 29 Lines