This is an archive of the discontinued LLVM Phabricator instance.

[asan] Speed up interesting alloca checks
ClosedPublic

Authored by zaks.anna on Mar 26 2015, 10:16 AM.

Download Raw Diff

Details

Reviewers

Summary

We make many redundant calls to isInterestingAlloca in the AddressSanitzier pass. This is especially inefficient for allocas that have many uses. Let's cache the results to speed up compilation.

The compile time improvements depend on the input. I did not see much difference on benchmarks; however, I have a test case where compile time goes from minutes to under a second.

Diff Detail

Event Timeline

zaks.anna updated this revision to Diff 22731.Mar 26 2015, 10:16 AM

zaks.anna retitled this revision from to [asan] Speed up interesting alloca checks.

zaks.anna updated this object.

zaks.anna edited the test plan for this revision. (Show Details)

zaks.anna added subscribers: Unknown Object (MLST), kubamracek, kcc and 2 others.

This looks fine, but I would slightly prefer to compute the map beforehand instead of caching, if possible.
I.e. instead of a map of ProcessedAllocas have a set of InterestingAllocas that is computed in visitAllocaInst

lib/Transforms/Instrumentation/AddressSanitizer.cpp
811	we tend to not write {} in such cases

This looks fine, but I would slightly prefer to compute the map beforehand instead of caching, if possible.
I.e. instead of a map of ProcessedAllocas have a set of InterestingAllocas that is computed in visitAllocaInst

In the future, determining if an alloca is interesting or not might depend on visiting all of its uses. For example, if an alloca is promotable but all of it's uses are accessing memory, which is provably in range, it should be marked as non-interesting. The current approach seems to fit that model better than visiting allocas and building a list.

In D8639#147540, @zaks.anna wrote:

This looks fine, but I would slightly prefer to compute the map beforehand instead of caching, if possible.
I.e. instead of a map of ProcessedAllocas have a set of InterestingAllocas that is computed in visitAllocaInst

In the future, determining if an alloca is interesting or not might depend on visiting all of its uses. For example, if an alloca is promotable but all of it's uses are accessing memory, which is provably in range, it should be marked as non-interesting. The current approach seems to fit that model better than visiting allocas and building a list.

But you can also visit all uses in a pre-compute step too.
And having this pre-computed instead of cached will make the code simpler to understand.

Each walk of the use list of the alloca is relatively expensive -- it is a linked list and cache hostile.

But won't we need it anyway regardless of caching-cs-precomputing?

LGTM (with a nit about {})

We currently do not run an instruction visitor in the very beginning of the pass

Yes, indeed.

This revision is now accepted and ready to land.Mar 26 2015, 6:51 PM

Thanks for the feedback everyone! Committed in r233397.

zaks.anna closed this revision.Jun 25 2015, 6:14 PM

Revision Contents

Path

Size

lib/

Transforms/

Instrumentation/

AddressSanitizer.cpp

30 lines

Diff 22731

lib/Transforms/Instrumentation/AddressSanitizer.cpp

Show First 20 Lines • Show All 395 Lines • ▼ Show 20 Lines	struct AddressSanitizer : public FunctionPass {
}		}
uint64_t getAllocaSizeInBytes(AllocaInst *AI) const {		uint64_t getAllocaSizeInBytes(AllocaInst *AI) const {
Type *Ty = AI->getAllocatedType();		Type *Ty = AI->getAllocatedType();
uint64_t SizeInBytes =		uint64_t SizeInBytes =
AI->getModule()->getDataLayout().getTypeAllocSize(Ty);		AI->getModule()->getDataLayout().getTypeAllocSize(Ty);
return SizeInBytes;		return SizeInBytes;
}		}
/// Check if we want (and can) handle this alloca.		/// Check if we want (and can) handle this alloca.
bool isInterestingAlloca(AllocaInst &AI) const;		bool isInterestingAlloca(AllocaInst &AI);
/// If it is an interesting memory access, return the PointerOperand		/// If it is an interesting memory access, return the PointerOperand
/// and set IsWrite/Alignment. Otherwise return nullptr.		/// and set IsWrite/Alignment. Otherwise return nullptr.
Value isInterestingMemoryAccess(Instruction I, bool *IsWrite,		Value isInterestingMemoryAccess(Instruction I, bool *IsWrite,
uint64_t *TypeSize,		uint64_t *TypeSize,
unsigned *Alignment) const;		unsigned *Alignment);
void instrumentMop(ObjectSizeOffsetVisitor &ObjSizeVis, Instruction *I,		void instrumentMop(ObjectSizeOffsetVisitor &ObjSizeVis, Instruction *I,
bool UseCalls, const DataLayout &DL);		bool UseCalls, const DataLayout &DL);
void instrumentPointerComparisonOrSubtraction(Instruction *I);		void instrumentPointerComparisonOrSubtraction(Instruction *I);
void instrumentAddress(Instruction OrigIns, Instruction InsertBefore,		void instrumentAddress(Instruction OrigIns, Instruction InsertBefore,
Value *Addr, uint32_t TypeSize, bool IsWrite,		Value *Addr, uint32_t TypeSize, bool IsWrite,
Value *SizeArgument, bool UseCalls, uint32_t Exp);		Value *SizeArgument, bool UseCalls, uint32_t Exp);
void instrumentUnusualSizeOrAlignment(Instruction I, Value Addr,		void instrumentUnusualSizeOrAlignment(Instruction I, Value Addr,
uint32_t TypeSize, bool IsWrite,		uint32_t TypeSize, bool IsWrite,
Show All 35 Lines	private:
Function *AsanErrorCallback[2][2][kNumberOfAccessSizes];		Function *AsanErrorCallback[2][2][kNumberOfAccessSizes];
Function *AsanMemoryAccessCallback[2][2][kNumberOfAccessSizes];		Function *AsanMemoryAccessCallback[2][2][kNumberOfAccessSizes];
// This array is indexed by AccessIsWrite and Experiment.		// This array is indexed by AccessIsWrite and Experiment.
Function *AsanErrorCallbackSized[2][2];		Function *AsanErrorCallbackSized[2][2];
Function *AsanMemoryAccessCallbackSized[2][2];		Function *AsanMemoryAccessCallbackSized[2][2];
Function AsanMemmove, AsanMemcpy, *AsanMemset;		Function AsanMemmove, AsanMemcpy, *AsanMemset;
InlineAsm *EmptyAsm;		InlineAsm *EmptyAsm;
GlobalsMetadata GlobalsMD;		GlobalsMetadata GlobalsMD;
		DenseMap<AllocaInst *, bool> ProcessedAllocas;

friend struct FunctionStackPoisoner;		friend struct FunctionStackPoisoner;
};		};

class AddressSanitizerModule : public ModulePass {		class AddressSanitizerModule : public ModulePass {
public:		public:
AddressSanitizerModule() : ModulePass(ID) {}		AddressSanitizerModule() : ModulePass(ID) {}
bool runOnModule(Module &M) override;		bool runOnModule(Module &M) override;
▲ Show 20 Lines • Show All 330 Lines • ▼ Show 20 Lines	IRB.CreateCall3(
IRB.CreatePointerCast(MI->getOperand(0), IRB.getInt8PtrTy()),		IRB.CreatePointerCast(MI->getOperand(0), IRB.getInt8PtrTy()),
IRB.CreateIntCast(MI->getOperand(1), IRB.getInt32Ty(), false),		IRB.CreateIntCast(MI->getOperand(1), IRB.getInt32Ty(), false),
IRB.CreateIntCast(MI->getOperand(2), IntptrTy, false));		IRB.CreateIntCast(MI->getOperand(2), IntptrTy, false));
}		}
MI->eraseFromParent();		MI->eraseFromParent();
}		}

/// Check if we want (and can) handle this alloca.		/// Check if we want (and can) handle this alloca.
bool AddressSanitizer::isInterestingAlloca(AllocaInst &AI) const {		bool AddressSanitizer::isInterestingAlloca(AllocaInst &AI) {
return (AI.getAllocatedType()->isSized() &&		auto PreviouslySeenAllocaInfo = ProcessedAllocas.find(&AI);

		if (PreviouslySeenAllocaInfo != ProcessedAllocas.end()) {
		kccUnsubmitted Not Done Reply Inline Actions we tend to not write {} in such cases kcc: we tend to not write {} in such cases
		return PreviouslySeenAllocaInfo->getSecond();
		}

		bool IsInteresting = (AI.getAllocatedType()->isSized() &&
// alloca() may be called with 0 size, ignore it.		// alloca() may be called with 0 size, ignore it.
getAllocaSizeInBytes(&AI) > 0 &&		getAllocaSizeInBytes(&AI) > 0 &&
// We are only interested in allocas not promotable to registers.		// We are only interested in allocas not promotable to registers.
// Promotable allocas are common under -O0.		// Promotable allocas are common under -O0.
(!ClSkipPromotableAllocas \|\| !isAllocaPromotable(&AI)));		(!ClSkipPromotableAllocas \|\| !isAllocaPromotable(&AI)));

		ProcessedAllocas[&AI] = IsInteresting;
		return IsInteresting;
}		}

/// If I is an interesting memory access, return the PointerOperand		/// If I is an interesting memory access, return the PointerOperand
/// and set IsWrite/Alignment. Otherwise return nullptr.		/// and set IsWrite/Alignment. Otherwise return nullptr.
Value AddressSanitizer::isInterestingMemoryAccess(Instruction I,		Value AddressSanitizer::isInterestingMemoryAccess(Instruction I,
bool *IsWrite,		bool *IsWrite,
uint64_t *TypeSize,		uint64_t *TypeSize,
unsigned *Alignment) const {		unsigned *Alignment) {
// Skip memory accesses inserted by another instrumentation.		// Skip memory accesses inserted by another instrumentation.
if (I->getMetadata("nosanitize")) return nullptr;		if (I->getMetadata("nosanitize")) return nullptr;

Value *PtrOperand = nullptr;		Value *PtrOperand = nullptr;
const DataLayout &DL = I->getModule()->getDataLayout();		const DataLayout &DL = I->getModule()->getDataLayout();
if (LoadInst *LI = dyn_cast<LoadInst>(I)) {		if (LoadInst *LI = dyn_cast<LoadInst>(I)) {
if (!ClInstrumentReads) return nullptr;		if (!ClInstrumentReads) return nullptr;
*IsWrite = false;		*IsWrite = false;
▲ Show 20 Lines • Show All 1,288 Lines • Show Last 20 Lines