Details

Reviewers

nikic
anna
fhahn

Commits

rG44d23d5345a6: [DSE] Remove calls with known writes to dead memory
rGa8a51fe55649: [DSE] Remove calls with known writes to dead memory

Summary

The majority of this change is sinking logic from instcombine into MemoryLocation such that it can be generically reused. If we have a call with a single analyzable write to an argument, we can treat that as-if it were a store of unknown size.

Merging the code in this way unblocks DSE in the store-to-dead-memory code paths. In theory, it should also enable classic DSE of such calls, but the code does not yet appear to know how to use object sizes to refine unknown access bounds.

In addition, this does make the isAllocRemovable path slightly stronger by reusing the libfunc and additional intrinsics bits which are already in getForDest.
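The "single analyzable write" rule described above can be sketched as a small standalone model. This is only an illustration under stated assumptions: `ToyArg`, `ToyDest`, and `findSingleDest` are hypothetical names invented here, not LLVM's `CallBase` or `MemoryLocation` API, and the model assumes the call has already been shown to access only argument memory.

```cpp
#include <optional>
#include <vector>

// Hypothetical stand-in for one call argument; not LLVM's API.
struct ToyArg {
  int ValueId;    // identity of the value passed in this slot
  bool IsPointer; // non-pointer arguments cannot be written through
  bool NoCapture; // the callee does not retain the address
  bool ReadOnly;  // the callee never writes through this argument
};

// The written value and, when unambiguous, the argument slot carrying it.
struct ToyDest {
  int ValueId;
  std::optional<unsigned> ArgIdx;
};

// Returns the single written destination, or nullopt when the call cannot be
// modelled as one store of unknown size.
std::optional<ToyDest> findSingleDest(const std::vector<ToyArg> &Args) {
  std::optional<ToyDest> Dest;
  for (unsigned I = 0; I < Args.size(); ++I) {
    const ToyArg &A = Args[I];
    if (!A.IsPointer)
      continue;
    if (!A.NoCapture)
      return std::nullopt; // a captured address could be read back later
    if (A.ReadOnly)
      continue;
    if (!Dest) {
      Dest = ToyDest{A.ValueId, I}; // first potentially written argument
      continue;
    }
    Dest->ArgIdx.reset(); // same value seen in two slots: keep only the value
    if (Dest->ValueId != A.ValueId)
      return std::nullopt; // two distinct written locations
  }
  return Dest; // nullopt here also covers "writes nothing"
}
```

A caller would treat a non-empty result as a store of unknown size to `ValueId`, falling back to a more precise per-argument query only when `ArgIdx` is set.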

A couple ideas for follow up (which I don't plan to do):

The more I look at this, the more I'm starting to think the capture handling is conservative. It feels like the write and no-return handling should be enough to disallow capture in the problematic cases. Maybe we can relax this? Or adjust inference rules to simply infer?
We should be able to use known object size info in DSE per the added test cases.
We should be able to remove empty lifetime start/end ranges in DSE, and thus kill off remaining alloca uses. Today, this will take another instcombine run. Adding the DSE support is most useful when we have two distinct live ranges, one live and one dead. DSE can recognize the dead one cheaply (even if large), where instcombine really can't.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

reames created this revision. Dec 16 2021, 1:44 PM

Herald added subscribers: bollu, hiraditya, mcrosier. Dec 16 2021, 1:44 PM

reames requested review of this revision. Dec 16 2021, 1:44 PM

Herald added a project: Restricted Project. Dec 16 2021, 1:44 PM

reames added a reviewer: fhahn. Dec 16 2021, 1:45 PM

Harbormaster completed remote builds in B139743: Diff 394980. Dec 16 2021, 2:26 PM

nikic added inline comments. Dec 17 2021, 12:21 AM

llvm/include/llvm/Analysis/MemoryLocation.h
260 ↗	(On Diff #394980)	This might want to highlight that the function can still read other memory.
llvm/lib/Analysis/MemoryLocation.cpp
154	I'm not sure the willreturn and nounwind checks really belong in here. We should try to separate the modelling of different effects, and MemoryLocation should only be modelling memory effects. The willreturn/nounwind checks can be done in the caller. After all, the statement that only this one location is written remains true regardless of whether the call unwinds or diverges; those properties only affect whether it can be removed.

Please check failing ASAN tests

In D115904#3199421, @xbolva00 wrote:

Please check failing ASAN tests

These are almost certainly spurious. The pre-commit CI picks a random base commit when doing builds. This change has nothing to do with compiler-rt, so this breakage is probably in the base commit chosen.

llvm/include/llvm/Analysis/MemoryLocation.h
260 ↗	(On Diff #394980)	I thought the wording about side effects covered this, but will do so if desired.
llvm/lib/Analysis/MemoryLocation.cpp
154	I'm sympathetic to this concern, but this appears to already be the implicit contract of this method. In particular, DSE does not otherwise check these properties for any of the intrinsics and libfuncs. I propose that we go with this, and if desired, try to annotate all the intrinsics/libfuncs separately so that these checks can be lifted out to the two callers.
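The separation being debated here (a memory-effects query on one side, removability checks in the caller on the other) can be sketched with a toy model. Everything below is an assumption for illustration: `ToyCall`, `writtenObject`, and `isRemovableWriteTo` are hypothetical names, not functions from LLVM.

```cpp
#include <optional>

// Hypothetical summary of a call site; the field names are illustrative only.
struct ToyCall {
  bool WillReturn;                  // the call is known to return
  bool NoUnwind;                    // the call is known not to unwind
  bool ResultUnused;                // nothing uses the call's return value
  std::optional<int> WrittenObject; // the sole object written, if known
};

// Memory-effects query: answers only "which single object is written?".
// It stays true whether or not the call unwinds or diverges.
std::optional<int> writtenObject(const ToyCall &C) { return C.WrittenObject; }

// Removability is decided by the caller, which layers the control-flow
// checks on top of the memory-effects answer.
bool isRemovableWriteTo(const ToyCall &C, int DeadObject) {
  if (!C.ResultUnused)
    return false;
  if (!C.WillReturn || !C.NoUnwind)
    return false;
  std::optional<int> Dest = writtenObject(C);
  return Dest && *Dest == DeadObject;
}
```

Keeping `writtenObject` free of the `WillReturn`/`NoUnwind` checks means other clients can reuse the location answer even for calls that may not be deleted.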

nikic added inline comments. Dec 17 2021, 9:41 AM

llvm/lib/Analysis/MemoryLocation.cpp
154	This code was only recently moved from DSE into here, and it makes sense that DSE did not check this while it was still working on a hardcoded list of functions. We should already be specifying/inferring the necessary attributes (though possibly the tests don't have the full set of inferred attributes).
186	Thinking about libcalls, I just realized that this should probably not be returning getBeforeOrAfter(UsedV), but rather remembering which argument the write is on and then calling MemoryLocation::getForArgument() on that. That would likely subsume the libcall cases by returning a more accurate location if getForArgument() implements something more specific. This would lose the case where the same argument is passed to the function twice, but I don't think we particularly care about that (as it would have to be literally the same argument as implemented).

@nikic I don't disagree with either of your recent suggestions, but the reorg you're asking for is a huge amount of churn. Do you mind if this goes in, and then we try to clean up the attribute handling in the direction you've indicated? I'm happy to do some of the work, but frankly, I suspect that's a lot more involved than you're making it out to be and I don't want to go boil the ocean. :)

On the single argument vs multiple arguments, I want to preserve the functionality of returning a conservative result if passed twice, but supporting both the precise and less precise results would not be hard to do. My strong preference would be for a follow up though.

nikic mentioned this in D115962: [DSE] Make isRemovable() for calls more robust (NFCI). Dec 17 2021, 11:46 AM

@nikic Looks like you proved my concern about scoping wrong with D115962. Once that lands, I'll rebase to include the restructure you requested on the attribute checking side.

nikic mentioned this in rGeb2cad8329b0: [DSE] Make isRemovable() for calls more robust (NFCI). Dec 17 2021, 11:53 AM

Incorporate review comments.

Note: I tried removing the libfunc block, but that caused a test failure. I suspect there are just some attributes in the test which need updating, but I'd prefer to do that as a follow up.

reames mentioned this in rGd9d6e6a0483e: [tests] Precommit tests from D115904. Dec 17 2021, 12:43 PM

re-rebase - I managed to drop tests last time.

LGTM

This revision is now accepted and ready to land. Dec 17 2021, 12:55 PM

Harbormaster completed remote builds in B139901: Diff 395201. Dec 17 2021, 1:32 PM

This revision was landed with ongoing or failed builds. Dec 17 2021, 1:42 PM

Closed by commit rGa8a51fe55649: [DSE] Remove calls with known writes to dead memory (authored by reames).

This revision was automatically updated to reflect the committed changes.

reames added a commit: rGa8a51fe55649: [DSE] Remove calls with known writes to dead memory.

nikic added a reverting change: rG1ba99eaf7095: Revert "[DSE] Remove calls with known writes to dead memory". Dec 18 2021, 12:25 AM

Reverted this because it does break the strncpy-overflow.cpp test case in asan (https://github.com/llvm/llvm-project/blob/main/compiler-rt/test/asan/TestCases/strncpy-overflow.cpp). The strncpy is now getting optimized away (correctly). Not sure what the policy in this case is, but I assume it's updating the test case to be more robust rather than trying to avoid the optimization in sanitized functions?

I found this change a bit surprising because I thought we'd already do this based on the previous attribute code in InstCombine. The reason we didn't is that strncpy returns the first argument, which means that it's not nocapture. But we do provide getForDest() information for this libcall, and that got exposed to InstCombine now.

I guess this is also why you saw failures when dropping the libcall handling: We do still need it for libcalls that return the argument.

This revision is now accepted and ready to land. Dec 18 2021, 12:32 AM

In D115904#3199421, @xbolva00 wrote:

Please check failing ASAN tests

Well I told you so…

Use -fno-builtin-strncpy in RUN commands

lkail added a subscriber: lkail. Dec 18 2021, 2:06 AM

In D115904#3201217, @nikic wrote:

Reverted this because it does break the strncpy-overflow.cpp test case in asan (https://github.com/llvm/llvm-project/blob/main/compiler-rt/test/asan/TestCases/strncpy-overflow.cpp). The strncpy is now getting optimized away (correctly). Not sure what the policy in this case is, but I assume it's updating the test case to be more robust rather than trying to avoid the optimization in sanitized functions?

First, thank you for reverting. I'd seen the build failures, but since they all came from bots I'd mentally marked as unstable (as in, they fail often enough that there's no signal in the messages), I didn't see the real failure here.

Second, yeah, that test is simply wrong/unstable. I will take a shot at trying to stabilize it, but I see at least two other transform paths which can break it as written. I'll probably just end up deferring the problem to the next person, but hey, that's how this sometimes works.

I found this change a bit surprising because I thought we'd already do this based on the previous attribute code in InstCombine. The reason we didn't is that strncpy returns the first argument, which means that it's not nocapture. But we do provide getForDest() information for this libcall, and that got exposed to InstCombine now.

Well, not quite. So long as the return value is unused, the strncpy is in fact nocapture on both arguments.

This is confirming my suspicion that nocapture is too conservative for this case. If the only capture can be a write to the dead memory, then the callee is effectively nocapture.

Given DSE wasn't previously checking for the use on the return value, I suspect we may have had a latent miscompile before your recent change
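The argument above, that a pointer which escapes only through the return value is effectively nocapture when that return value is unused, can be written down as a tiny predicate. This is a sketch of the reasoning in this thread only, not of LLVM's capture tracking; `ToyParam`, `EscapesOnlyViaReturn`, and `effectivelyNoCapture` are hypothetical names.

```cpp
// Hypothetical per-parameter facts; not LLVM attributes.
struct ToyParam {
  bool NoCapture;            // declared nocapture
  bool EscapesOnlyViaReturn; // assumed: the only capture is returning the pointer
};

// A parameter behaves as nocapture if it is declared so, or if its only
// escape route is the return value and nobody looks at that value.
bool effectivelyNoCapture(const ToyParam &P, bool CallResultUnused) {
  if (P.NoCapture)
    return true;
  return P.EscapesOnlyViaReturn && CallResultUnused;
}
```

Under this model the destination of a strncpy-like call (which returns its first argument) counts as uncaptured only while the call's result has no users, which is why a check on the uses of the return value matters.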

I guess this is also why you saw failures when dropping the libcall handling: We do still need it for libcalls that return the argument.

I went back and skimmed the tests in libcalls.ll (the ones which failed without the libcall handling). None of them cover the case where a strncpy actually captures by returning. It's possible I'd have noticed that while updating tests, but it's equally likely I'd have missed it.

reames mentioned this in rG9b955f77a18a: Attempt to stablize compiler-rt/test/asan/TestCases/strncpy-overflow.cpp. Dec 20 2021, 5:53 PM

Closed by commit rG44d23d5345a6: [DSE] Remove calls with known writes to dead memory (authored by reames). Dec 20 2021, 6:15 PM

This revision was automatically updated to reflect the committed changes.

reames added a commit: rG44d23d5345a6: [DSE] Remove calls with known writes to dead memory.

Diff 395564

llvm/lib/Analysis/MemoryLocation.cpp

@@ [141 lines not shown] @@ if (TLI.getLibFunc(*CB, LF) && TLI.has(LF)) {
     case LibFunc_strcat:
     case LibFunc_strncat:
       return getForArgument(CB, 0, &TLI);
     default:
       break;
     }
   }

+  if (!CB->onlyAccessesArgMemory())
     return None;
+
+  if (CB->hasOperandBundles())
+    // TODO: remove implementation restriction
+    return None;
+
+  Value *UsedV = nullptr;
+  Optional<unsigned> UsedIdx;
+  for (unsigned i = 0; i < CB->arg_size(); i++) {
+    if (!CB->getArgOperand(i)->getType()->isPointerTy())
+      continue;
+    if (!CB->doesNotCapture(i))
+      // capture would allow the address to be read back in an untracked manner
+      return None;
+    if (CB->onlyReadsMemory(i))
+      continue;
+    if (!UsedV) {
+      // First potentially writing parameter
+      UsedV = CB->getArgOperand(i);
+      UsedIdx = i;
+      continue;
+    }
+    UsedIdx = None;
+    if (UsedV != CB->getArgOperand(i))
+      // Can't describe writing to two distinct locations.
+      // TODO: This results in an inprecision when two values derived from the
+      // same object are passed as arguments to the same function.
+      return None;
+  }
+  if (!UsedV)
+    // We don't currently have a way to represent a "does not write" result
+    // and thus have to be conservative and return unknown.
+    return None;
+
+  if (UsedIdx)
+    return getForArgument(CB, *UsedIdx, &TLI);
+  return MemoryLocation::getBeforeOrAfter(UsedV, CB->getAAMetadata());
 }

 MemoryLocation MemoryLocation::getForArgument(const CallBase *Call,
                                               unsigned ArgIdx,
                                               const TargetLibraryInfo *TLI) {
   AAMDNodes AATags = Call->getAAMetadata();
   const Value *Arg = Call->getArgOperand(ArgIdx);

@@ [164 lines not shown] @@

llvm/lib/Transforms/InstCombine/InstructionCombining.cpp

@@ [2,556 lines not shown] @@ static bool isNeverEqualToUnescapedAlloc(Value *V, const TargetLibraryInfo &TLI,
   // through bitcasts of V can cause
   // the result statement below to be true, even when AI and V (ex:
   // i8* ->i32* ->i8* of AI) are the same allocations.
   return isAllocLikeFn(V, &TLI) && V != AI;
 }

 /// Given a call CB which uses an address UsedV, return true if we can prove the
 /// call's only possible effect is storing to V.
-static bool isRemovableWrite(CallBase &CB, Value *UsedV) {
+static bool isRemovableWrite(CallBase &CB, Value *UsedV,
+                             const TargetLibraryInfo &TLI) {
   if (!CB.use_empty())
     // TODO: add recursion if returned attribute is present
     return false;

-  if (!CB.willReturn() || !CB.doesNotThrow() || !CB.onlyAccessesArgMemory() ||
-      CB.isTerminator())
+  if (CB.isTerminator())
+    // TODO: remove implementation restriction
     return false;

-  if (CB.hasOperandBundles())
+  if (!CB.willReturn() || !CB.doesNotThrow())
     return false;

-  for (unsigned i = 0; i < CB.arg_size(); i++) {
-    if (!CB.getArgOperand(i)->getType()->isPointerTy())
-      continue;
-    if (!CB.doesNotCapture(i))
-      // capture would allow the address to be read back in an untracked manner
-      return false;
-    if (UsedV != CB.getArgOperand(i) && !CB.onlyReadsMemory(i))
-      // A write to another memory location keeps the call live, and thus we
-      // must keep the alloca so that the call has somewhere to write to.
-      // TODO: This results in an inprecision when two values derived from the
-      // same alloca are passed as arguments to the same function.
-      return false;
-    // Note: Both reads from and writes to the alloca are fine. Since the
-    // result is unused nothing can observe the values read from the alloca
-    // without writing it to some other observable location (checked above).
-  }
-  return true;
+  // If the only possible side effect of the call is writing to the alloca,
+  // and the result isn't used, we can safely remove any reads implied by the
+  // call including those which might read the alloca itself.
+  Optional<MemoryLocation> Dest = MemoryLocation::getForDest(&CB, TLI);
+  return Dest && Dest->Ptr == UsedV;
 }

 static bool isAllocSiteRemovable(Instruction *AI,
                                  SmallVectorImpl<WeakTrackingVH> &Users,
                                  const TargetLibraryInfo &TLI) {
   SmallVector<Instruction*, 4> Worklist;
   Worklist.push_back(AI);

@@ [53 lines not shown] @@ for (User *U : PI->users()) {
           case Intrinsic::launder_invariant_group:
           case Intrinsic::strip_invariant_group:
             Users.emplace_back(I);
             Worklist.push_back(I);
             continue;
           }
         }

-        if (isRemovableWrite(*cast<CallBase>(I), PI)) {
+        if (isRemovableWrite(*cast<CallBase>(I), PI, TLI)) {
           Users.emplace_back(I);
           continue;
         }

         if (isFreeCall(I, &TLI)) {
           Users.emplace_back(I);
           continue;
         }
@@ [1,705 lines not shown] @@

llvm/test/Transforms/DeadStoreElimination/trivial-dse-calls.ll

 ; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
 ; RUN: opt -dse -S < %s | FileCheck %s

 declare void @llvm.lifetime.start.p0i8(i64 immarg, i8* nocapture)
 declare void @llvm.lifetime.end.p0i8(i64 immarg, i8* nocapture)

 declare void @unknown()
 declare void @f(i8*)
 declare void @f2(i8*, i8*)

 ; Basic case for DSEing a trivially dead writing call
 define void @test_dead() {
 ; CHECK-LABEL: @test_dead(
-; CHECK-NEXT: [[A:%.*]] = alloca i32, align 4
-; CHECK-NEXT: [[BITCAST:%.*]] = bitcast i32* [[A]] to i8*
-; CHECK-NEXT: call void @f(i8* nocapture writeonly [[BITCAST]]) #[[ATTR1:[0-9]+]]
 ; CHECK-NEXT: ret void
 ;
   %a = alloca i32, align 4
   %bitcast = bitcast i32* %a to i8*
   call void @f(i8* writeonly nocapture %bitcast) argmemonly nounwind willreturn
   ret void
 }

 ; Add in canonical lifetime intrinsics
 define void @test_lifetime() {
 ; CHECK-LABEL: @test_lifetime(
 ; CHECK-NEXT: [[A:%.*]] = alloca i32, align 4
 ; CHECK-NEXT: [[BITCAST:%.*]] = bitcast i32* [[A]] to i8*
 ; CHECK-NEXT: call void @llvm.lifetime.start.p0i8(i64 4, i8* [[BITCAST]])
-; CHECK-NEXT: call void @f(i8* nocapture writeonly [[BITCAST]]) #[[ATTR1]]
 ; CHECK-NEXT: call void @llvm.lifetime.end.p0i8(i64 4, i8* [[BITCAST]])
 ; CHECK-NEXT: ret void
 ;
   %a = alloca i32, align 4
   %bitcast = bitcast i32* %a to i8*
   call void @llvm.lifetime.start.p0i8(i64 4, i8* %bitcast)
   call void @f(i8* writeonly nocapture %bitcast) argmemonly nounwind willreturn
   call void @llvm.lifetime.end.p0i8(i64 4, i8* %bitcast)
   ret void
 }

 ; Add some unknown calls just to point out that this is use based, not
 ; instruction order sensitive
 define void @test_lifetime2() {
 ; CHECK-LABEL: @test_lifetime2(
 ; CHECK-NEXT: [[A:%.*]] = alloca i32, align 4
 ; CHECK-NEXT: [[BITCAST:%.*]] = bitcast i32* [[A]] to i8*
 ; CHECK-NEXT: call void @llvm.lifetime.start.p0i8(i64 4, i8* [[BITCAST]])
 ; CHECK-NEXT: call void @unknown()
-; CHECK-NEXT: call void @f(i8* nocapture writeonly [[BITCAST]]) #[[ATTR1]]
 ; CHECK-NEXT: call void @unknown()
 ; CHECK-NEXT: call void @llvm.lifetime.end.p0i8(i64 4, i8* [[BITCAST]])
 ; CHECK-NEXT: ret void
 ;
   %a = alloca i32, align 4
   %bitcast = bitcast i32* %a to i8*
   call void @llvm.lifetime.start.p0i8(i64 4, i8* %bitcast)
   call void @unknown()
   call void @f(i8* writeonly nocapture %bitcast) argmemonly nounwind willreturn
   call void @unknown()
   call void @llvm.lifetime.end.p0i8(i64 4, i8* %bitcast)
   ret void
 }

 ; As long as the result is unused, we can even remove reads of the alloca
 ; itself since the write will be dropped.
 define void @test_dead_readwrite() {
 ; CHECK-LABEL: @test_dead_readwrite(
-; CHECK-NEXT: [[A:%.*]] = alloca i32, align 4
-; CHECK-NEXT: [[BITCAST:%.*]] = bitcast i32* [[A]] to i8*
-; CHECK-NEXT: call void @f(i8* nocapture [[BITCAST]]) #[[ATTR1]]
 ; CHECK-NEXT: ret void
 ;
   %a = alloca i32, align 4
   %bitcast = bitcast i32* %a to i8*
   call void @f(i8* nocapture %bitcast) argmemonly nounwind willreturn
   ret void
 }

 define i32 @test_neg_read_after() {
 ; CHECK-LABEL: @test_neg_read_after(
 ; CHECK-NEXT: [[A:%.*]] = alloca i32, align 4
 ; CHECK-NEXT: [[BITCAST:%.*]] = bitcast i32* [[A]] to i8*
-; CHECK-NEXT: call void @f(i8* nocapture writeonly [[BITCAST]]) #[[ATTR1]]
+; CHECK-NEXT: call void @f(i8* nocapture writeonly [[BITCAST]]) #[[ATTR1:[0-9]+]]
 ; CHECK-NEXT: [[RES:%.*]] = load i32, i32* [[A]], align 4
 ; CHECK-NEXT: ret i32 [[RES]]
 ;
   %a = alloca i32, align 4
   %bitcast = bitcast i32* %a to i8*
   call void @f(i8* writeonly nocapture %bitcast) argmemonly nounwind willreturn
   %res = load i32, i32* %a
   ret i32 %res
@@ [104 lines not shown] @@ ;
   %a_copy = bitcast i8* %a_copy_cast to i32*
   %res = load i32, i32* %a_copy
   ret i32 %res
 }

 ; Show that reading from unrelated memory is okay
 define void @test_unreleated_read() {
 ; CHECK-LABEL: @test_unreleated_read(
-; CHECK-NEXT: [[A:%.*]] = alloca i32, align 4
-; CHECK-NEXT: [[A2:%.*]] = alloca i32, align 4
-; CHECK-NEXT: [[BITCAST:%.*]] = bitcast i32* [[A]] to i8*
-; CHECK-NEXT: [[BITCAST2:%.*]] = bitcast i32* [[A2]] to i8*
-; CHECK-NEXT: call void @f2(i8* nocapture writeonly [[BITCAST]], i8* nocapture readonly [[BITCAST2]]) #[[ATTR1]]
 ; CHECK-NEXT: ret void
 ;
   %a = alloca i32, align 4
   %a2 = alloca i32, align 4
   %bitcast = bitcast i32* %a to i8*
   %bitcast2 = bitcast i32* %a2 to i8*
   call void @f2(i8* nocapture writeonly %bitcast, i8* nocapture readonly %bitcast2) argmemonly nounwind willreturn
   ret void
@@ [16 lines not shown] @@ ;
   call void @f2(i8* nocapture writeonly %bitcast, i8* readonly %bitcast2) argmemonly nounwind willreturn
   ret void
 }

 ; As long as the result is unused, we can even remove reads of the alloca
 ; itself since the write will be dropped.
 define void @test_self_read() {
 ; CHECK-LABEL: @test_self_read(
-; CHECK-NEXT: [[A:%.*]] = alloca i32, align 4
-; CHECK-NEXT: [[BITCAST:%.*]] = bitcast i32* [[A]] to i8*
-; CHECK-NEXT: call void @f2(i8* nocapture writeonly [[BITCAST]], i8* nocapture readonly [[BITCAST]]) #[[ATTR1]]
 ; CHECK-NEXT: ret void
 ;
   %a = alloca i32, align 4
   %bitcast = bitcast i32* %a to i8*
   call void @f2(i8* nocapture writeonly %bitcast, i8* nocapture readonly %bitcast) argmemonly nounwind willreturn
   ret void
 }

@@ [71 lines not shown] @@

This is an archive of the discontinued LLVM Phabricator instance.

[DSE] Remove calls with known writes to dead memory
ClosedPublic
