This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/Analysis/
-
llvm/
-
Analysis/
-
MemoryBuiltins.h
-
lib/
-
Analysis/
-
MemoryBuiltins.cpp
-
Target/AMDGPU/
-
AMDGPU/
2
AMDGPUPromoteAlloca.cpp
-
Transforms/
-
Scalar/
-
GVN.cpp
-
NewGVN.cpp
-
Utils/
-
PromoteMemoryToRegister.cpp

Differential D155773

[llvm][MemoryBuiltins] Add alloca support to getInitialValueOfAllocation
Needs ReviewPublic

Authored by jmciver on Jul 19 2023, 7:51 PM.

Download Raw Diff

Details

Reviewers

nikic
efriedma
fhahn

Summary

This commit is in support of future uninitialized memory handling and adds
alloca instruction support to getInitialValueOfAllocation. This unifies initial
memory state querying (both stack and heap) to a single utility function.

Mem2Reg, GVN, and NewGVN optimizations are refactored to take advantage of
alloca support in getInitialValueOfAllocation.

To support uninitialized memory as poison we are proposing that in the future
load instructions support an attribute !freeze (or alternative) which will allow
freeze poison insertions where applicable. Thus the constant emitted by
getInitialValueOfAllocation will be dependent on the allocation
function/instruction and the load instruction attributes.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

jmciver created this revision.Jul 19 2023, 7:51 PM

Herald added a project: Restricted Project. · View Herald TranscriptJul 19 2023, 7:51 PM

Herald added subscribers: kmitropoulou, ormris, ChuanqiXu and 4 others. · View Herald Transcript

jmciver published this revision for review.Jul 19 2023, 10:15 PM

Herald added projects: Restricted Project, Restricted Project. · View Herald TranscriptJul 19 2023, 10:15 PM

Herald added subscribers: llvm-commits, cfe-commits. · View Herald Transcript

This commit is in support of future uninitialized memory handling and adds
alloca instruction support to getInitialValueOfAllocation. This unifies initial
memory state querying (both stack and heap) to a single utility function.

Could you explain in more detail what future work you have based on this? Is this about the promotion of malloc (and similar) memory?

In any case, I don't think it's acceptable to add TLI as a required dependency for mem2reg. Alloca promotion doesn't need it, and that's what all the existing code is doing. This should at least be an optional dependency.

However, I think it would be better to directly pass the initial value to mem2reg instead (defaulting to UndefValue), rather than making it query it.

This revision now requires changes to proceed.Jul 20 2023, 1:02 AM

@nikic as per my GSoC project I am trying, under the guidance of @nlopes, a load attribute based approach to migrating uninitialized load to poison. The first attribute that I am working on is !freeze which effectively inserts a freeze poison if the load is uninitialized. Obviously this can only be done in
optimizations that allow instruction creation. In the future I am looking to add a load instruction parameter to getInitialValueOfAllocation to allow modification of the returned constant based on the allocation function/instruction and the presence of a load attribute.

So why pack the alloca test into getInitialValueOfAllocation? The reasoning is that attribute is on a per-load instruction, but allocation could be alloca or function. I agree if the application of the load attribute were universal passing in the default value to mem2reg would work, but as we are modulating the returned constant based on individual load instruction attributes this would not work. Hence the query based approach.

As for the TLI I agree that making it required is probably not ideal. @nlopes can vouch that I thought about this :-). Two options off the top of my head are:

Make the TLI parameter of getInitialValueOfAllocation a pointer with defaults to nullptr.

Add a separated query function for alloca instructions.

The advantage to option 1 is all allocation initial state queries are handled in one place. The detractor is we have a parameter that has a default nullptr.

The advantage to option 2 is allocation functions vs alloca instruction are handled by functions that have only the required parameterizations exposed. The detractor is there will be some combinatorial overlap between the two functions.

Thoughts?

Remove reliance on TLI objects where only alloca instructions are processed.

jmciver edited the summary of this revision. (Show Details)Jul 20 2023, 12:01 PM

Harbormaster completed remote builds in B246987: Diff 542618.Jul 20 2023, 5:23 PM

Refactor AMDGPUPromoteAlloca to use getInitialValueOfAllocation.

Herald added subscribers: foad, kerbowa, jvesely, arsenm. · View Herald TranscriptJul 28 2023, 3:25 PM

Harbormaster completed remote builds in B248942: Diff 545294.Jul 28 2023, 4:39 PM

Ping

arsenm added inline comments.Aug 11 2023, 9:54 AM

llvm/lib/Target/AMDGPU/AMDGPUPromoteAlloca.cpp
809–811	This is very specifically handling alloca, not any random allocation like function

jmciver added inline comments.Aug 11 2023, 11:03 AM

llvm/lib/Target/AMDGPU/AMDGPUPromoteAlloca.cpp
809–811	@arsenm thanks for the feedback. I added functionality to `getInitalValueOfAllocation` to handle `alloca` instructions specifically. This is being done as preliminary to some possible refactorizations allowing uninitialized memory to move to poison semantics. The behavior for these changes would be the same for `alloca` and allocation like functions.

@jmciver Thanks for the context. I would recommend you to put up the patch you have based on top of this, because it's pretty hard to tell whether this API design makes sense without seeing the use. I have some doubts about that, for two reasons:

If I understand correctly, for your use case getInitialValueOfAllocation() can't just return a constant anymore, it may have to insert an instruction. Additionally, there is no longer a single "initial value", but it may wary between different loads from the same allocation. This means that the API is going to change to the point that getInitialValueOfAllocation() is no longer recognizable (and probably no longer correctly named -- it would be more materializeInitialValueOfAllocation()).
As some of the changes in this patch show, we also have to give lifetime.start intrinsics similar treatment to allocas, but this doesn't quite fit the current API.

I think supporting allocas in getInitialValueOfAllocation() is perfectly fine, but I'm not sure this really brings you closer to what you want to do.

@nikic Thanks for responding. I will get a "work in progress" patch up in the next three days.

In the API adaptation, instruction insertion is still being handled in the caller as some passes are only allowed removal and not insertion.

jmciver added a child revision: D158352: WIP: [llvm][MemoryBuiltins] Add initialization category to getInitialValueOfAllocation.Aug 19 2023, 3:34 PM

@nikic I added this patch to a work in progress (WIP) "stack" of patches. D158352 and D158353 show what the intended progression looks like for mem2reg. I will add WIP SROA changes in the next few days.

Revision Contents

Path

Size

llvm/

include/

llvm/

Analysis/

MemoryBuiltins.h

4 lines

lib/

Analysis/

MemoryBuiltins.cpp

3 lines

Target/

AMDGPU/

AMDGPUPromoteAlloca.cpp

5 lines

Transforms/

Scalar/

GVN.cpp

5 lines

NewGVN.cpp

19 lines

Utils/

PromoteMemoryToRegister.cpp

3 lines

Diff 545294

llvm/include/llvm/Analysis/MemoryBuiltins.h

	Show First 20 Lines • Show All 113 Lines • ▼ Show 20 Lines
	/// doing abstract interpretation.			/// doing abstract interpretation.
	std::optional<APInt> getAllocSize(			std::optional<APInt> getAllocSize(
	const CallBase CB, const TargetLibraryInfo TLI,			const CallBase CB, const TargetLibraryInfo TLI,
	function_ref<const Value (const Value )> Mapper = [](const Value *V) {			function_ref<const Value (const Value )> Mapper = [](const Value *V) {
	return V;			return V;
	});			});

	/// If this is a call to an allocation function that initializes memory to a			/// If this is a call to an allocation function that initializes memory to a
	/// fixed value, return said value in the requested type. Otherwise, return			/// fixed value, return said value in the requested type. If this is a call to
	/// nullptr.			/// alloca instruction the returned value is undef. Otherwise, return nullptr.
	Constant getInitialValueOfAllocation(const Value V,			Constant getInitialValueOfAllocation(const Value V,
	const TargetLibraryInfo *TLI,			const TargetLibraryInfo *TLI,
	Type *Ty);			Type *Ty);

	/// If a function is part of an allocation family (e.g.			/// If a function is part of an allocation family (e.g.
	/// malloc/realloc/calloc/free), return the identifier for its family			/// malloc/realloc/calloc/free), return the identifier for its family
	/// of functions.			/// of functions.
	std::optional<StringRef> getAllocationFamily(const Value *I,			std::optional<StringRef> getAllocationFamily(const Value *I,
	▲ Show 20 Lines • Show All 188 Lines • Show Last 20 Lines

llvm/lib/Analysis/MemoryBuiltins.cpp

Show First 20 Lines • Show All 430 Lines • ▼ Show 20 Lines	llvm::getAllocSize(const CallBase CB, const TargetLibraryInfo TLI,
if (Overflow)		if (Overflow)
return std::nullopt;		return std::nullopt;
return Size;		return Size;
}		}

Constant llvm::getInitialValueOfAllocation(const Value V,		Constant llvm::getInitialValueOfAllocation(const Value V,
const TargetLibraryInfo *TLI,		const TargetLibraryInfo *TLI,
Type *Ty) {		Type *Ty) {
		if (isa<AllocaInst>(V))
		return UndefValue::get(Ty);

auto *Alloc = dyn_cast<CallBase>(V);		auto *Alloc = dyn_cast<CallBase>(V);
if (!Alloc)		if (!Alloc)
return nullptr;		return nullptr;

// malloc are uninitialized (undef)		// malloc are uninitialized (undef)
if (getAllocationData(Alloc, MallocOrOpNewLike, TLI).has_value())		if (getAllocationData(Alloc, MallocOrOpNewLike, TLI).has_value())
return UndefValue::get(Ty);		return UndefValue::get(Ty);

▲ Show 20 Lines • Show All 800 Lines • Show Last 20 Lines

llvm/lib/Target/AMDGPU/AMDGPUPromoteAlloca.cpp

Show All 26 Lines

#include "AMDGPU.h"		#include "AMDGPU.h"
#include "GCNSubtarget.h"		#include "GCNSubtarget.h"
#include "Utils/AMDGPUBaseInfo.h"		#include "Utils/AMDGPUBaseInfo.h"
#include "llvm/ADT/STLExtras.h"		#include "llvm/ADT/STLExtras.h"
#include "llvm/Analysis/CaptureTracking.h"		#include "llvm/Analysis/CaptureTracking.h"
#include "llvm/Analysis/InstSimplifyFolder.h"		#include "llvm/Analysis/InstSimplifyFolder.h"
#include "llvm/Analysis/InstructionSimplify.h"		#include "llvm/Analysis/InstructionSimplify.h"
		#include "llvm/Analysis/MemoryBuiltins.h"
#include "llvm/Analysis/ValueTracking.h"		#include "llvm/Analysis/ValueTracking.h"
#include "llvm/CodeGen/TargetPassConfig.h"		#include "llvm/CodeGen/TargetPassConfig.h"
#include "llvm/IR/IRBuilder.h"		#include "llvm/IR/IRBuilder.h"
#include "llvm/IR/IntrinsicInst.h"		#include "llvm/IR/IntrinsicInst.h"
#include "llvm/IR/IntrinsicsAMDGPU.h"		#include "llvm/IR/IntrinsicsAMDGPU.h"
#include "llvm/IR/IntrinsicsR600.h"		#include "llvm/IR/IntrinsicsR600.h"
#include "llvm/IR/PatternMatch.h"		#include "llvm/IR/PatternMatch.h"
#include "llvm/Pass.h"		#include "llvm/Pass.h"
▲ Show 20 Lines • Show All 757 Lines • ▼ Show 20 Lines	bool AMDGPUPromoteAllocaImpl::tryPromoteAllocaToVector(AllocaInst &Alloca) {
LLVM_DEBUG(dbgs() << " Converting alloca to vector " << *AllocaTy << " -> "		LLVM_DEBUG(dbgs() << " Converting alloca to vector " << *AllocaTy << " -> "
<< *VectorTy << '\n');		<< *VectorTy << '\n');
const unsigned VecStoreSize = DL->getTypeStoreSize(VectorTy);		const unsigned VecStoreSize = DL->getTypeStoreSize(VectorTy);

// Alloca is uninitialized memory. Imitate that by making the first value		// Alloca is uninitialized memory. Imitate that by making the first value
// undef.		// undef.
SSAUpdater Updater;		SSAUpdater Updater;
Updater.Initialize(VectorTy, "promotealloca");		Updater.Initialize(VectorTy, "promotealloca");
Updater.AddAvailableValue(Alloca.getParent(), UndefValue::get(VectorTy));		Updater.AddAvailableValue(
		Alloca.getParent(),
		getInitialValueOfAllocation(&Alloca, nullptr, VectorTy));
		arsenmUnsubmitted Not Done Reply Inline Actions This is very specifically handling alloca, not any random allocation like function arsenm: This is very specifically handling alloca, not any random allocation like function
		jmciverAuthorUnsubmitted Not Done Reply Inline Actions @arsenm thanks for the feedback. I added functionality to `getInitalValueOfAllocation` to handle `alloca` instructions specifically. This is being done as preliminary to some possible refactorizations allowing uninitialized memory to move to poison semantics. The behavior for these changes would be the same for `alloca` and allocation like functions. jmciver: @arsenm thanks for the feedback. I added functionality to `getInitalValueOfAllocation` to…

// First handle the initial worklist.		// First handle the initial worklist.
SmallVector<LoadInst *, 4> DeferredLoads;		SmallVector<LoadInst *, 4> DeferredLoads;
forEachWorkListItem(WorkList, [&](Instruction *I) {		forEachWorkListItem(WorkList, [&](Instruction *I) {
BasicBlock *BB = I->getParent();		BasicBlock *BB = I->getParent();
// On the first pass, we only take values that are trivially known, i.e.		// On the first pass, we only take values that are trivially known, i.e.
// where AddAvailableValue was already called in this block.		// where AddAvailableValue was already called in this block.
Value *Result = promoteAllocaUserToVector(		Value *Result = promoteAllocaUserToVector(
▲ Show 20 Lines • Show All 684 Lines • Show Last 20 Lines

llvm/lib/Transforms/Scalar/GVN.cpp

Show First 20 Lines • Show All 1,233 Lines • ▼ Show 20 Lines	LLVM_DEBUG(
dbgs() << " is clobbered by " << *DepInst << '\n';);		dbgs() << " is clobbered by " << *DepInst << '\n';);
if (ORE->allowExtraAnalysis(DEBUG_TYPE))		if (ORE->allowExtraAnalysis(DEBUG_TYPE))
reportMayClobberedLoad(Load, DepInfo, DT, ORE);		reportMayClobberedLoad(Load, DepInfo, DT, ORE);

return std::nullopt;		return std::nullopt;
}		}
assert(DepInfo.isDef() && "follows from above");		assert(DepInfo.isDef() && "follows from above");

// Loading the alloca -> undef.
// Loading immediately after lifetime begin -> undef.		// Loading immediately after lifetime begin -> undef.
if (isa<AllocaInst>(DepInst) \|\| isLifetimeStart(DepInst))		if (isLifetimeStart(DepInst))
return AvailableValue::get(UndefValue::get(Load->getType()));		return AvailableValue::get(UndefValue::get(Load->getType()));

		// In addition to allocator function calls this includes loading the alloca ->
		// undef.
if (Constant *InitVal =		if (Constant *InitVal =
getInitialValueOfAllocation(DepInst, TLI, Load->getType()))		getInitialValueOfAllocation(DepInst, TLI, Load->getType()))
return AvailableValue::get(InitVal);		return AvailableValue::get(InitVal);

if (StoreInst *S = dyn_cast<StoreInst>(DepInst)) {		if (StoreInst *S = dyn_cast<StoreInst>(DepInst)) {
// Reject loads and stores that are to the same address but are of		// Reject loads and stores that are to the same address but are of
// different types if we have to. If the stored value is convertable to		// different types if we have to. If the stored value is convertable to
// the loaded value, we can reuse it.		// the loaded value, we can reuse it.
▲ Show 20 Lines • Show All 2,090 Lines • Show Last 20 Lines

llvm/lib/Transforms/Scalar/NewGVN.cpp

Show First 20 Lines • Show All 1,490 Lines • ▼ Show 20 Lines	if (auto *DepSI = dyn_cast<StoreInst>(DepInst)) {
}		}
}		}

// All of the below are only true if the loaded pointer is produced		// All of the below are only true if the loaded pointer is produced
// by the dependent instruction.		// by the dependent instruction.
if (LoadPtr != lookupOperandLeader(DepInst) &&		if (LoadPtr != lookupOperandLeader(DepInst) &&
!AA->isMustAlias(LoadPtr, DepInst))		!AA->isMustAlias(LoadPtr, DepInst))
return nullptr;		return nullptr;
// If this load really doesn't depend on anything, then we must be loading an
// undef value. This can happen when loading for a fresh allocation with no
// intervening stores, for example. Note that this is only true in the case
// that the result of the allocation is pointer equal to the load ptr.
if (isa<AllocaInst>(DepInst)) {
return createConstantExpression(UndefValue::get(LoadType));
}
// If this load occurs either right after a lifetime begin,		// If this load occurs either right after a lifetime begin,
// then the loaded value is undefined.		// then the loaded value is undefined.
else if (auto *II = dyn_cast<IntrinsicInst>(DepInst)) {		if (auto *II = dyn_cast<IntrinsicInst>(DepInst)) {
if (II->getIntrinsicID() == Intrinsic::lifetime_start)		if (II->getIntrinsicID() == Intrinsic::lifetime_start)
return createConstantExpression(UndefValue::get(LoadType));		return createConstantExpression(UndefValue::get(LoadType));
} else if (auto *InitVal =		}
getInitialValueOfAllocation(DepInst, TLI, LoadType))		// If this load really doesn't depend on anything, then we must be loading an
		// undef value. This can happen when loading for a fresh allocation with no
		// intervening stores, for example. Note that this is only true in the case
		// that the result of the allocation is pointer equal to the load ptr.
		else if (auto *InitVal = getInitialValueOfAllocation(DepInst, TLI, LoadType))
return createConstantExpression(InitVal);		return createConstantExpression(InitVal);

return nullptr;		return nullptr;
}		}

const Expression NewGVN::performSymbolicLoadEvaluation(Instruction I) const {		const Expression NewGVN::performSymbolicLoadEvaluation(Instruction I) const {
auto *LI = cast<LoadInst>(I);		auto *LI = cast<LoadInst>(I);

// We can eliminate in favor of non-simple loads, but we won't be able to		// We can eliminate in favor of non-simple loads, but we won't be able to
▲ Show 20 Lines • Show All 2,718 Lines • Show Last 20 Lines

llvm/lib/Transforms/Utils/PromoteMemoryToRegister.cpp

Show All 18 Lines
#include "llvm/ADT/STLExtras.h"		#include "llvm/ADT/STLExtras.h"
#include "llvm/ADT/SmallPtrSet.h"		#include "llvm/ADT/SmallPtrSet.h"
#include "llvm/ADT/SmallVector.h"		#include "llvm/ADT/SmallVector.h"
#include "llvm/ADT/Statistic.h"		#include "llvm/ADT/Statistic.h"
#include "llvm/ADT/Twine.h"		#include "llvm/ADT/Twine.h"
#include "llvm/Analysis/AssumptionCache.h"		#include "llvm/Analysis/AssumptionCache.h"
#include "llvm/Analysis/InstructionSimplify.h"		#include "llvm/Analysis/InstructionSimplify.h"
#include "llvm/Analysis/IteratedDominanceFrontier.h"		#include "llvm/Analysis/IteratedDominanceFrontier.h"
		#include "llvm/Analysis/MemoryBuiltins.h"
#include "llvm/Analysis/ValueTracking.h"		#include "llvm/Analysis/ValueTracking.h"
#include "llvm/IR/BasicBlock.h"		#include "llvm/IR/BasicBlock.h"
#include "llvm/IR/CFG.h"		#include "llvm/IR/CFG.h"
#include "llvm/IR/Constant.h"		#include "llvm/IR/Constant.h"
#include "llvm/IR/Constants.h"		#include "llvm/IR/Constants.h"
#include "llvm/IR/DIBuilder.h"		#include "llvm/IR/DIBuilder.h"
#include "llvm/IR/DebugInfo.h"		#include "llvm/IR/DebugInfo.h"
#include "llvm/IR/Dominators.h"		#include "llvm/IR/Dominators.h"
▲ Show 20 Lines • Show All 560 Lines • ▼ Show 20 Lines	for (User *U : make_early_inc_range(AI->users())) {
StoresByIndexTy::iterator I = llvm::lower_bound(		StoresByIndexTy::iterator I = llvm::lower_bound(
StoresByIndex,		StoresByIndex,
std::make_pair(LoadIdx, static_cast<StoreInst *>(nullptr)),		std::make_pair(LoadIdx, static_cast<StoreInst *>(nullptr)),
less_first());		less_first());
Value *ReplVal;		Value *ReplVal;
if (I == StoresByIndex.begin()) {		if (I == StoresByIndex.begin()) {
if (StoresByIndex.empty())		if (StoresByIndex.empty())
// If there are no stores, the load takes the undef value.		// If there are no stores, the load takes the undef value.
ReplVal = UndefValue::get(LI->getType());		ReplVal = getInitialValueOfAllocation(AI, nullptr, LI->getType());
else		else
// There is no store before this load, bail out (load may be affected		// There is no store before this load, bail out (load may be affected
// by the following stores - see main comment).		// by the following stores - see main comment).
return false;		return false;
} else {		} else {
// Otherwise, there was a store before this load, the load takes its		// Otherwise, there was a store before this load, the load takes its
// value.		// value.
ReplVal = std::prev(I)->second->getOperand(0);		ReplVal = std::prev(I)->second->getOperand(0);
▲ Show 20 Lines • Show All 525 Lines • Show Last 20 Lines