This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contents

llvm/
  docs/
    LangRef.rst
  lib/Transforms/Utils/
    PromoteMemoryToRegister.cpp
  test/
    CodeGen/AMDGPU/
      vector-alloca-bitcast.ll
    Transforms/
      Mem2Reg/
        debug-alloca-vla-1.ll
        debug-alloca-vla-2.ll
        pr24179.ll
        undef-order.ll
      PhaseOrdering/X86/
        nancvt.ll
      SROA/
        addrspacecast.ll
        basictest.ll
        phi-and-select.ll
        phi-gep.ll
        phi-with-duplicate-pred.ll
        pr37267.ll
        scalable-vectors.ll
        slice-width.ll
        vector-promotion.ll
Differential D104648

[Mem2Reg] Use poison instead of undef for read of uninitialized memory
Changes Planned · Public

Authored by nikic on Jun 21 2021, 8:02 AM.


Details

Reviewers

aqjune
nlopes
spatel

Summary

During mem2reg, make loads from uninitialized memory return a poison value rather than an undef value. Also clarify the alloca documentation in this regard.

Diff Detail

Unit Tests: Failed

	Time	Test
	30 ms	x64 debian > Clang.CodeGen::2009-03-22-increment-bitfield.c
	80 ms	x64 debian > Clang.CodeGen::aapcs64-align.cpp
	230 ms	x64 debian > Clang.CodeGen::bitfield.c
	140 ms	x64 debian > Clang.CodeGen::cmse-clear-return.c
	60 ms	x64 debian > Clang.CodeGenCXX::bitfield-layout.cpp
		View Full Test Results (24 Failed)

Event Timeline

nikic created this revision. · Jun 21 2021, 8:02 AM

Herald added subscribers: jdoerfert, kerbowa, hiraditya and 2 others. · Jun 21 2021, 8:02 AM

nikic requested review of this revision. · Jun 21 2021, 8:02 AM

Herald added a project: Restricted Project. · Jun 21 2021, 8:02 AM

Herald added a subscriber: llvm-commits.

Harbormaster completed remote builds in B110206: Diff 353370. · Jun 21 2021, 8:42 AM

This is a big change because AFAIK clang intentionally encodes the value of uninitialized variables in a benign way.
For example: https://godbolt.org/z/EYjasxM1M is optimized to 0 even if it is undef ^ undef = undef. If poison is used, this won't work.
I think a reasonable performance benefit should be shown before we switch to poison. I have a patch that makes poison propagate better, which will help this change.
BTW, do you have any idea why the clang unit tests are failing?

I really look forward to the day that we can commit this patch! 😀 That's the goal, to get rid of undef.

However, there's still one issue we need to solve first: bit-fields. The lowering of bitfield stores in clang is something like:

v = load word
v' = mask v + combine with bitfield value to be stored
store v'

On the first bit-field store the memory is uninitialized but it is still combined with the value to be stored. If we change uninitialized memory to be poison, as this patch does, clang's lowering becomes wrong.
Two fixes are possible:

For bitfields on the stack, initialize the value to zero [possible optimization?]
For other bitfields, freeze the value after loading. This is sufficient to tame the initial poison from uninitialized memory. Subsequent loads don't need freeze (if clang can prove they touch a bitfield that has been initialized before) as even if some value becomes poison it's not problematic. In C/C++ if a value becomes poison then the program has already executed UB, so we can taint the whole word without a problem.

TL;DR: we need to add a freeze after load in clang's bitfield lowering before this patch can go in. @nikic do you have bandwidth for this work?
After that, I'm all for it! But we should probably send an email to the ML so that folks working on other frontends notice the IR semantics change.

In D104648#2831212, @aqjune wrote:

This is a big change because AFAIK clang intentionally encodes the value of uninitialized variables in a benign way.
For example: https://godbolt.org/z/EYjasxM1M is optimized to 0 even if it is undef ^ undef = undef. If poison is used, this won't work.

I'm not sure what you're getting at here. This isn't a difference between undef and poison. It's perfectly legal to fold "poison ^ poison -> 0". And we probably should, just to avoid the hassle involved in diagnosing the issues that would result from changing this fold.

@aqjune I don't think performance is really relevant here: It's a necessary step to remove undef in the future. I think this is one of the main ways left in which we introduce undef in IR.

@nlopes Ugh, this is ugly. The fact that clang is using this also means that we can't just ignore bitcode autoupgrade. Adding a freeze on every load seems like a really big hammer, but I'm not sure what else could be done.

@nlopes Ugh, this is ugly. The fact that clang is using this also means that we can't just ignore bitcode autoupgrade. Adding a freeze on every load seems like a really big hammer, but I'm not sure what else could be done.

It would be just loads for bitfields. They are not common, so I guess it's ok.

There are technically better solutions, but they require more functionality in the IR. I don't think it's worth it for such an uncommon feature.
One better solution comes to mind:
Introduce "really packed" structs, where we push all the ABI work to the frontend. LLVM's packed structs still go through the ABI code and may get extra padding, whereas we would need something that bypasses the padding so we could encode bitfields precisely.
This solution doesn't require any freezing, but it's more complicated: it requires additional IR features, backend work, etc. So the load+freeze for bitfields sounds better to me.

aqjune mentioned this in D104661: [InstSimplify] Add more poison folding optimizations. · Jun 22 2021, 7:55 AM

Or, can we temporarily mark an alloca with !has_bitfield if it had bitfields in C & make mem2reg use poison only when uninitialized memory is read, for a while?

In D104648#2836258, @aqjune wrote:

Or, can we temporarily mark an alloca with !has_bitfield if it had bitfields in C & make mem2reg use poison only when uninitialized memory is read, for a while?

Well, that doesn't help much. It's just a band-aid.
I would rather initialize all alloca'd bitfields with zero instead. Zero folds trivially with the bit-masking operations from bit-fields, so it's unlikely to disrupt anything.

Removing this from the review queue -- this change by itself is clearly not sufficient.

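Revision Contents

Path                                                       Size
llvm/docs/LangRef.rst                                      6 lines
llvm/lib/Transforms/Utils/PromoteMemoryToRegister.cpp      14 lines
llvm/test/CodeGen/AMDGPU/vector-alloca-bitcast.ll          6 lines
llvm/test/Transforms/Mem2Reg/debug-alloca-vla-1.ll         2 lines
llvm/test/Transforms/Mem2Reg/debug-alloca-vla-2.ll         2 lines
llvm/test/Transforms/Mem2Reg/pr24179.ll                    4 lines
llvm/test/Transforms/Mem2Reg/undef-order.ll                2 lines
llvm/test/Transforms/PhaseOrdering/X86/nancvt.ll           2 lines
llvm/test/Transforms/SROA/addrspacecast.ll                 26 lines
llvm/test/Transforms/SROA/basictest.ll                     14 lines
llvm/test/Transforms/SROA/phi-and-select.ll                4 lines
llvm/test/Transforms/SROA/phi-gep.ll                       2 lines
llvm/test/Transforms/SROA/phi-with-duplicate-pred.ll       2 lines
llvm/test/Transforms/SROA/pr37267.ll                       6 lines
llvm/test/Transforms/SROA/scalable-vectors.ll              2 lines
llvm/test/Transforms/SROA/slice-width.ll                   2 lines
llvm/test/Transforms/SROA/vector-promotion.ll              12 lines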

Diff 353370

llvm/docs/LangRef.rst


	Show First 20 Lines • Show All 9,605 Lines • ▼ Show 20 Lines
	boundary compatible with the type.			boundary compatible with the type.

	'``type``' may be any sized type.			'``type``' may be any sized type.

	Semantics:			Semantics:
	""""""""""			""""""""""

	Memory is allocated; a pointer is returned. The allocated memory is			Memory is allocated; a pointer is returned. The allocated memory is
	uninitialized, and loading from uninitialized memory produces an undefined			uninitialized, and loading from uninitialized memory produces a :ref:`poison
	value. The operation itself is undefined if there is insufficient stack			value <poisonvalue>`. The operation itself is undefined if there is insufficient
	space for the allocation.'``alloca``'d memory is automatically released			stack space for the allocation.'``alloca``'d memory is automatically released
	when the function returns. The '``alloca``' instruction is commonly used			when the function returns. The '``alloca``' instruction is commonly used
	to represent automatic variables that must have an address available. When			to represent automatic variables that must have an address available. When
	the function returns (either with the ``ret`` or ``resume`` instructions),			the function returns (either with the ``ret`` or ``resume`` instructions),
	the memory is reclaimed. Allocating zero bytes is legal, but the returned			the memory is reclaimed. Allocating zero bytes is legal, but the returned
	pointer may not be unique. The order in which memory is allocated (ie.,			pointer may not be unique. The order in which memory is allocated (ie.,
	which way the stack grows) is not specified.			which way the stack grows) is not specified.

	Note that '``alloca``' outside of the alloca address space from the			Note that '``alloca``' outside of the alloca address space from the
	▲ Show 20 Lines • Show All 12,895 Lines • Show Last 20 Lines

llvm/lib/Transforms/Utils/PromoteMemoryToRegister.cpp

Show First 20 Lines • Show All 484 Lines • ▼ Show 20 Lines	for (User *U : make_early_inc_range(AI->users())) {

// Find the nearest store that has a lower index than this load.		// Find the nearest store that has a lower index than this load.
StoresByIndexTy::iterator I = llvm::lower_bound(		StoresByIndexTy::iterator I = llvm::lower_bound(
StoresByIndex,		StoresByIndex,
std::make_pair(LoadIdx, static_cast<StoreInst *>(nullptr)),		std::make_pair(LoadIdx, static_cast<StoreInst *>(nullptr)),
less_first());		less_first());
if (I == StoresByIndex.begin()) {		if (I == StoresByIndex.begin()) {
if (StoresByIndex.empty())		if (StoresByIndex.empty())
// If there are no stores, the load takes the undef value.		// If there are no stores, the load takes the poison value.
LI->replaceAllUsesWith(UndefValue::get(LI->getType()));		LI->replaceAllUsesWith(PoisonValue::get(LI->getType()));
else		else
// There is no store before this load, bail out (load may be affected		// There is no store before this load, bail out (load may be affected
// by the following stores - see main comment).		// by the following stores - see main comment).
return false;		return false;
} else {		} else {
// Otherwise, there was a store before this load, the load takes its value.		// Otherwise, there was a store before this load, the load takes its value.
// Note, if the load was marked as nonnull we don't want to lose that		// Note, if the load was marked as nonnull we don't want to lose that
// information when we erase it. So we preserve it with an assume.		// information when we erase it. So we preserve it with an assume.
▲ Show 20 Lines • Show All 132 Lines • ▼ Show 20 Lines	for (BasicBlock *BB : PHIBlocks)
QueuePhiNode(BB, AllocaNum, CurrentVersion);		QueuePhiNode(BB, AllocaNum, CurrentVersion);
}		}

if (Allocas.empty())		if (Allocas.empty())
return; // All of the allocas must have been trivial!		return; // All of the allocas must have been trivial!

LBI.clear();		LBI.clear();

// Set the incoming values for the basic block to be null values for all of		// Set the incoming values for the basic block to be poison values for all of
// the alloca's. We do this in case there is a load of a value that has not		// the alloca's. We do this in case there is a load of a value that has not
// been stored yet. In this case, it will get this null value.		// been stored yet. In this case, it will get this poison value.
RenamePassData::ValVector Values(Allocas.size());		RenamePassData::ValVector Values(Allocas.size());
for (unsigned i = 0, e = Allocas.size(); i != e; ++i)		for (unsigned i = 0, e = Allocas.size(); i != e; ++i)
Values[i] = UndefValue::get(Allocas[i]->getAllocatedType());		Values[i] = PoisonValue::get(Allocas[i]->getAllocatedType());

// When handling debug info, treat all incoming values as if they have unknown		// When handling debug info, treat all incoming values as if they have unknown
// locations until proven otherwise.		// locations until proven otherwise.
RenamePassData::LocationVector Locations(Allocas.size());		RenamePassData::LocationVector Locations(Allocas.size());

// Walks all basic blocks in the function performing the SSA rename algorithm		// Walks all basic blocks in the function performing the SSA rename algorithm
// and inserting the phi nodes we marked as necessary		// and inserting the phi nodes we marked as necessary
std::vector<RenamePassData> RenamePassWorkList;		std::vector<RenamePassData> RenamePassWorkList;
▲ Show 20 Lines • Show All 105 Lines • ▼ Show 20 Lines	for (DenseMap<std::pair<unsigned, unsigned>, PHINode *>::iterator
// At this point, the blocks left in the preds list must have dummy		// At this point, the blocks left in the preds list must have dummy
// entries inserted into every PHI nodes for the block. Update all the phi		// entries inserted into every PHI nodes for the block. Update all the phi
// nodes in this block that we are inserting (there could be phis before		// nodes in this block that we are inserting (there could be phis before
// mem2reg runs).		// mem2reg runs).
unsigned NumBadPreds = SomePHI->getNumIncomingValues();		unsigned NumBadPreds = SomePHI->getNumIncomingValues();
BasicBlock::iterator BBI = BB->begin();		BasicBlock::iterator BBI = BB->begin();
while ((SomePHI = dyn_cast<PHINode>(BBI++)) &&		while ((SomePHI = dyn_cast<PHINode>(BBI++)) &&
SomePHI->getNumIncomingValues() == NumBadPreds) {		SomePHI->getNumIncomingValues() == NumBadPreds) {
Value *UndefVal = UndefValue::get(SomePHI->getType());		Value *PoisonVal = PoisonValue::get(SomePHI->getType());
for (BasicBlock *Pred : Preds)		for (BasicBlock *Pred : Preds)
SomePHI->addIncoming(UndefVal, Pred);		SomePHI->addIncoming(PoisonVal, Pred);
}		}
}		}

NewPhiNodes.clear();		NewPhiNodes.clear();
}		}

/// Determine which blocks the value is live in.		/// Determine which blocks the value is live in.
///		///
▲ Show 20 Lines • Show All 240 Lines • Show Last 20 Lines

llvm/test/CodeGen/AMDGPU/vector-alloca-bitcast.ll

Show First 20 Lines • Show All 70 Lines • ▼ Show 20 Lines	entry:
%tmp3 = load i32, i32 addrspace(5)* %tmp2		%tmp3 = load i32, i32 addrspace(5)* %tmp2
store i32 %tmp3, i32 addrspace(1)* %out		store i32 %tmp3, i32 addrspace(1)* %out
ret void		ret void
}		}

; OPT-LABEL: @vector_write_read_bitcast_to_float(		; OPT-LABEL: @vector_write_read_bitcast_to_float(
; OPT-NOT: alloca		; OPT-NOT: alloca
; OPT: bb2:		; OPT: bb2:
; OPT: %tmp.sroa.0.0 = phi <6 x float> [ undef, %bb ], [ %0, %bb2 ]		; OPT: %tmp.sroa.0.0 = phi <6 x float> [ poison, %bb ], [ %0, %bb2 ]
; OPT: %0 = insertelement <6 x float> %tmp.sroa.0.0, float %tmp73, i32 %tmp10		; OPT: %0 = insertelement <6 x float> %tmp.sroa.0.0, float %tmp73, i32 %tmp10
; OPT: .preheader:		; OPT: .preheader:
; OPT: %bc = bitcast <6 x float> %0 to <6 x i32>		; OPT: %bc = bitcast <6 x float> %0 to <6 x i32>
; OPT: %1 = extractelement <6 x i32> %bc, i32 %tmp20		; OPT: %1 = extractelement <6 x i32> %bc, i32 %tmp20

; GCN-LABEL: {{^}}vector_write_read_bitcast_to_float:		; GCN-LABEL: {{^}}vector_write_read_bitcast_to_float:
; GCN-ALLOCA: buffer_store_dword		; GCN-ALLOCA: buffer_store_dword

▲ Show 20 Lines • Show All 60 Lines • ▼ Show 20 Lines	.preheader: ; preds = %.preheader, %bb2
%tmp27 = add nuw nsw i32 %tmp16, 1		%tmp27 = add nuw nsw i32 %tmp16, 1
%tmp28 = icmp eq i32 %tmp27, 1000		%tmp28 = icmp eq i32 %tmp27, 1000
br i1 %tmp28, label %bb15, label %.preheader		br i1 %tmp28, label %bb15, label %.preheader
}		}

; OPT-LABEL: @vector_write_read_bitcast_to_double(		; OPT-LABEL: @vector_write_read_bitcast_to_double(
; OPT-NOT: alloca		; OPT-NOT: alloca
; OPT: bb2:		; OPT: bb2:
; OPT: %tmp.sroa.0.0 = phi <6 x double> [ undef, %bb ], [ %0, %bb2 ]		; OPT: %tmp.sroa.0.0 = phi <6 x double> [ poison, %bb ], [ %0, %bb2 ]
; OPT: %0 = insertelement <6 x double> %tmp.sroa.0.0, double %tmp73, i32 %tmp10		; OPT: %0 = insertelement <6 x double> %tmp.sroa.0.0, double %tmp73, i32 %tmp10
; OPT: .preheader:		; OPT: .preheader:
; OPT: %bc = bitcast <6 x double> %0 to <6 x i64>		; OPT: %bc = bitcast <6 x double> %0 to <6 x i64>
; OPT: %1 = extractelement <6 x i64> %bc, i32 %tmp20		; OPT: %1 = extractelement <6 x i64> %bc, i32 %tmp20

; GCN-LABEL: {{^}}vector_write_read_bitcast_to_double:		; GCN-LABEL: {{^}}vector_write_read_bitcast_to_double:

; GCN-ALLOCA-COUNT-2: buffer_store_dword		; GCN-ALLOCA-COUNT-2: buffer_store_dword
▲ Show 20 Lines • Show All 49 Lines • ▼ Show 20 Lines	.preheader: ; preds = %.preheader, %bb2
%tmp27 = add nuw nsw i32 %tmp16, 1		%tmp27 = add nuw nsw i32 %tmp16, 1
%tmp28 = icmp eq i32 %tmp27, 1000		%tmp28 = icmp eq i32 %tmp27, 1000
br i1 %tmp28, label %bb15, label %.preheader		br i1 %tmp28, label %bb15, label %.preheader
}		}

; OPT-LABEL: @vector_write_read_bitcast_to_i64(		; OPT-LABEL: @vector_write_read_bitcast_to_i64(
; OPT-NOT: alloca		; OPT-NOT: alloca
; OPT: bb2:		; OPT: bb2:
; OPT: %tmp.sroa.0.0 = phi <6 x i64> [ undef, %bb ], [ %0, %bb2 ]		; OPT: %tmp.sroa.0.0 = phi <6 x i64> [ poison, %bb ], [ %0, %bb2 ]
; OPT: %0 = insertelement <6 x i64> %tmp.sroa.0.0, i64 %tmp6, i32 %tmp9		; OPT: %0 = insertelement <6 x i64> %tmp.sroa.0.0, i64 %tmp6, i32 %tmp9
; OPT: .preheader:		; OPT: .preheader:
; OPT: %1 = extractelement <6 x i64> %0, i32 %tmp18		; OPT: %1 = extractelement <6 x i64> %0, i32 %tmp18

; GCN-LABEL: {{^}}vector_write_read_bitcast_to_i64:		; GCN-LABEL: {{^}}vector_write_read_bitcast_to_i64:

; GCN-ALLOCA-COUNT-2: buffer_store_dword		; GCN-ALLOCA-COUNT-2: buffer_store_dword
; GCN-PROMOTE-COUNT-2: v_movreld_b32_e32		; GCN-PROMOTE-COUNT-2: v_movreld_b32_e32
▲ Show 20 Lines • Show All 235 Lines • Show Last 20 Lines

llvm/test/Transforms/Mem2Reg/debug-alloca-vla-1.ll

	Show All 13 Lines
	target triple = "x86_64-apple-macosx10.12.0"			target triple = "x86_64-apple-macosx10.12.0"

	; Function Attrs: nounwind ssp uwtable			; Function Attrs: nounwind ssp uwtable
	define void @scan() #0 !dbg !4 {			define void @scan() #0 !dbg !4 {
	; CHECK-LABEL: @scan(			; CHECK-LABEL: @scan(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: br label [[FOR_COND:%.*]], !dbg [[DBG10:![0-9]+]]			; CHECK-NEXT: br label [[FOR_COND:%.*]], !dbg [[DBG10:![0-9]+]]
	; CHECK: for.cond:			; CHECK: for.cond:
	; CHECK-NEXT: [[VLA1_0:%.*]] = phi i32 [ undef, [[ENTRY:%.*]] ], [ [[T0:%.*]], [[FOR_COND]] ]			; CHECK-NEXT: [[VLA1_0:%.*]] = phi i32 [ poison, [[ENTRY:%.*]] ], [ [[T0:%.*]], [[FOR_COND]] ]
	; CHECK-NEXT: call void @llvm.dbg.value(metadata i32 [[VLA1_0]], metadata [[META11:![0-9]+]], metadata !DIExpression()), !dbg [[DBG19:![0-9]+]]			; CHECK-NEXT: call void @llvm.dbg.value(metadata i32 [[VLA1_0]], metadata [[META11:![0-9]+]], metadata !DIExpression()), !dbg [[DBG19:![0-9]+]]
	; CHECK-NEXT: [[T0]] = add i32 [[VLA1_0]], 1			; CHECK-NEXT: [[T0]] = add i32 [[VLA1_0]], 1
	; CHECK-NEXT: call void @llvm.dbg.value(metadata i32 [[T0]], metadata [[META11]], metadata !DIExpression()), !dbg [[DBG19]]			; CHECK-NEXT: call void @llvm.dbg.value(metadata i32 [[T0]], metadata [[META11]], metadata !DIExpression()), !dbg [[DBG19]]
	; CHECK-NEXT: br label [[FOR_COND]], !dbg [[DBG10]]			; CHECK-NEXT: br label [[FOR_COND]], !dbg [[DBG10]]
	;			;
	entry:			entry:
	%vla1 = alloca i32, i32 1, align 8			%vla1 = alloca i32, i32 1, align 8
	call void @llvm.dbg.declare(metadata i32* %vla1, metadata !10, metadata !DIExpression()), !dbg !18			call void @llvm.dbg.declare(metadata i32* %vla1, metadata !10, metadata !DIExpression()), !dbg !18
	Show All 37 Lines

llvm/test/Transforms/Mem2Reg/debug-alloca-vla-2.ll

	Show All 16 Lines
	target triple = "x86_64-apple-macosx10.12.0"			target triple = "x86_64-apple-macosx10.12.0"

	; Function Attrs: nounwind ssp uwtable			; Function Attrs: nounwind ssp uwtable
	define void @scan(i32 %n) #0 !dbg !4 {			define void @scan(i32 %n) #0 !dbg !4 {
	; CHECK-LABEL: @scan(			; CHECK-LABEL: @scan(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: br label [[FOR_COND:%.*]], !dbg [[DBG10:![0-9]+]]			; CHECK-NEXT: br label [[FOR_COND:%.*]], !dbg [[DBG10:![0-9]+]]
	; CHECK: for.cond:			; CHECK: for.cond:
	; CHECK-NEXT: [[VLA1_0:%.*]] = phi i32 [ undef, [[ENTRY:%.*]] ], [ [[T0:%.*]], [[FOR_COND]] ]			; CHECK-NEXT: [[VLA1_0:%.*]] = phi i32 [ poison, [[ENTRY:%.*]] ], [ [[T0:%.*]], [[FOR_COND]] ]
	; CHECK-NEXT: [[T0]] = add i32 [[VLA1_0]], 1			; CHECK-NEXT: [[T0]] = add i32 [[VLA1_0]], 1
	; CHECK-NEXT: call void @llvm.dbg.value(metadata i32 undef, metadata [[META11:![0-9]+]], metadata !DIExpression()), !dbg [[DBG19:![0-9]+]]			; CHECK-NEXT: call void @llvm.dbg.value(metadata i32 undef, metadata [[META11:![0-9]+]], metadata !DIExpression()), !dbg [[DBG19:![0-9]+]]
	; CHECK-NEXT: br label [[FOR_COND]], !dbg [[DBG10]]			; CHECK-NEXT: br label [[FOR_COND]], !dbg [[DBG10]]
	;			;
	entry:			entry:
	%vla1 = alloca i32, i32 %n, align 8			%vla1 = alloca i32, i32 %n, align 8
	call void @llvm.dbg.declare(metadata i32* %vla1, metadata !10, metadata !DIExpression()), !dbg !18			call void @llvm.dbg.declare(metadata i32* %vla1, metadata !10, metadata !DIExpression()), !dbg !18
	br label %for.cond, !dbg !18			br label %for.cond, !dbg !18
	Show All 36 Lines

llvm/test/Transforms/Mem2Reg/pr24179.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt -mem2reg < %s -S | FileCheck %s			; RUN: opt -mem2reg < %s -S | FileCheck %s
	; RUN: opt -passes=mem2reg < %s -S | FileCheck %s			; RUN: opt -passes=mem2reg < %s -S | FileCheck %s

	declare i32 @def(i32)			declare i32 @def(i32)
	declare i1 @use(i32)			declare i1 @use(i32)

	; Special case of a single-BB alloca does not apply here since the load			; Special case of a single-BB alloca does not apply here since the load
	; is affected by the following store. Expect this case to be identified			; is affected by the following store. Expect this case to be identified
	; and a PHI node to be created.			; and a PHI node to be created.
	define void @test1() {			define void @test1() {
	; CHECK-LABEL: @test1(			; CHECK-LABEL: @test1(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: br label [[LOOP:%.*]]			; CHECK-NEXT: br label [[LOOP:%.*]]
	; CHECK: loop:			; CHECK: loop:
	; CHECK-NEXT: [[T_0:%.*]] = phi i32 [ undef, [[ENTRY:%.*]] ], [ [[N:%.*]], [[LOOP]] ]			; CHECK-NEXT: [[T_0:%.*]] = phi i32 [ poison, [[ENTRY:%.*]] ], [ [[N:%.*]], [[LOOP]] ]
	; CHECK-NEXT: [[C:%.*]] = call i1 @use(i32 [[T_0]])			; CHECK-NEXT: [[C:%.*]] = call i1 @use(i32 [[T_0]])
	; CHECK-NEXT: [[N]] = call i32 @def(i32 7)			; CHECK-NEXT: [[N]] = call i32 @def(i32 7)
	; CHECK-NEXT: br i1 [[C]], label [[LOOP]], label [[EXIT:%.*]]			; CHECK-NEXT: br i1 [[C]], label [[LOOP]], label [[EXIT:%.*]]
	; CHECK: exit:			; CHECK: exit:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	%t = alloca i32			%t = alloca i32
	Show All 12 Lines

	; Same as above, except there is no following store. The alloca should just be			; Same as above, except there is no following store. The alloca should just be
	; replaced with an undef			; replaced with an undef
	define void @test2() {			define void @test2() {
	; CHECK-LABEL: @test2(			; CHECK-LABEL: @test2(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: br label [[LOOP:%.*]]			; CHECK-NEXT: br label [[LOOP:%.*]]
	; CHECK: loop:			; CHECK: loop:
	; CHECK-NEXT: [[C:%.*]] = call i1 @use(i32 undef)			; CHECK-NEXT: [[C:%.*]] = call i1 @use(i32 poison)
	; CHECK-NEXT: br i1 [[C]], label [[LOOP]], label [[EXIT:%.*]]			; CHECK-NEXT: br i1 [[C]], label [[LOOP]], label [[EXIT:%.*]]
	; CHECK: exit:			; CHECK: exit:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	%t = alloca i32			%t = alloca i32
	br label %loop			br label %loop

	loop:			loop:
	%v = load i32, i32* %t			%v = load i32, i32* %t
	%c = call i1 @use(i32 %v)			%c = call i1 @use(i32 %v)
	br i1 %c, label %loop, label %exit			br i1 %c, label %loop, label %exit

	exit:			exit:
	ret void			ret void
	}			}

llvm/test/Transforms/Mem2Reg/undef-order.ll

	Show All 39 Lines
	; CHECK-NEXT: br label [[JOIN]]			; CHECK-NEXT: br label [[JOIN]]
	; CHECK: Block14:			; CHECK: Block14:
	; CHECK-NEXT: br label [[JOIN]]			; CHECK-NEXT: br label [[JOIN]]
	; CHECK: Block15:			; CHECK: Block15:
	; CHECK-NEXT: br label [[JOIN]]			; CHECK-NEXT: br label [[JOIN]]
	; CHECK: Block16:			; CHECK: Block16:
	; CHECK-NEXT: br label [[JOIN]]			; CHECK-NEXT: br label [[JOIN]]
	; CHECK: Join:			; CHECK: Join:
	; CHECK-NEXT: [[VAL_0:%.*]] = phi i32 [ 1, [[STORE1]] ], [ 2, [[STORE2]] ], [ undef, [[BLOCK1:%.*]] ], [ undef, [[BLOCK2:%.*]] ], [ undef, [[BLOCK3:%.*]] ], [ undef, [[BLOCK4:%.*]] ], [ undef, [[BLOCK5:%.*]] ], [ undef, [[BLOCK6:%.*]] ], [ undef, [[BLOCK7:%.*]] ], [ undef, [[BLOCK8:%.*]] ], [ undef, [[BLOCK9:%.*]] ], [ undef, [[BLOCK10:%.*]] ], [ undef, [[BLOCK11:%.*]] ], [ undef, [[BLOCK12:%.*]] ], [ undef, [[BLOCK13:%.*]] ], [ undef, [[BLOCK14:%.*]] ], [ undef, [[BLOCK15:%.*]] ], [ undef, [[BLOCK16:%.*]] ]			; CHECK-NEXT: [[VAL_0:%.*]] = phi i32 [ 1, [[STORE1]] ], [ 2, [[STORE2]] ], [ poison, [[BLOCK1:%.*]] ], [ poison, [[BLOCK2:%.*]] ], [ poison, [[BLOCK3:%.*]] ], [ poison, [[BLOCK4:%.*]] ], [ poison, [[BLOCK5:%.*]] ], [ poison, [[BLOCK6:%.*]] ], [ poison, [[BLOCK7:%.*]] ], [ poison, [[BLOCK8:%.*]] ], [ poison, [[BLOCK9:%.*]] ], [ poison, [[BLOCK10:%.*]] ], [ poison, [[BLOCK11:%.*]] ], [ poison, [[BLOCK12:%.*]] ], [ poison, [[BLOCK13:%.*]] ], [ poison, [[BLOCK14:%.*]] ], [ poison, [[BLOCK15:%.*]] ], [ poison, [[BLOCK16:%.*]] ]
	; CHECK-NEXT: ret i32 [[VAL_0]]			; CHECK-NEXT: ret i32 [[VAL_0]]
	;			;
	Entry:			Entry:
	%val = alloca i32			%val = alloca i32
	%c1 = call i1 @cond()			%c1 = call i1 @cond()
	br i1 %c1, label %Store1, label %Store2			br i1 %c1, label %Store1, label %Store2
	Block1:			Block1:
	br label %Join			br label %Join
	▲ Show 20 Lines • Show All 41 Lines • Show Last 20 Lines

llvm/test/Transforms/PhaseOrdering/X86/nancvt.ll

	Show All 36 Lines
	; CHECK-NEXT: store volatile i32 -1610612736, i32* @var, align 4			; CHECK-NEXT: store volatile i32 -1610612736, i32* @var, align 4
	; CHECK-NEXT: store volatile i32 2147027116, i32* @var, align 4			; CHECK-NEXT: store volatile i32 2147027116, i32* @var, align 4
	; CHECK-NEXT: store volatile i32 -2147483648, i32* @var, align 4			; CHECK-NEXT: store volatile i32 -2147483648, i32* @var, align 4
	; CHECK-NEXT: store volatile i32 2147027116, i32* @var, align 4			; CHECK-NEXT: store volatile i32 2147027116, i32* @var, align 4
	; CHECK-NEXT: store volatile i32 -1073741824, i32* @var, align 4			; CHECK-NEXT: store volatile i32 -1073741824, i32* @var, align 4
	; CHECK-NEXT: store volatile i32 2147228864, i32* @var, align 4			; CHECK-NEXT: store volatile i32 2147228864, i32* @var, align 4
	; CHECK-NEXT: store volatile i32 2147228864, i32* @var, align 4			; CHECK-NEXT: store volatile i32 2147228864, i32* @var, align 4
	; CHECK-NEXT: store volatile i32 2147228864, i32* @var, align 4			; CHECK-NEXT: store volatile i32 2147228864, i32* @var, align 4
	; CHECK-NEXT: ret i32 undef			; CHECK-NEXT: ret i32 poison
	;			;
	entry:			entry:
	%retval = alloca i32, align 4			%retval = alloca i32, align 4
	%i = alloca i32, align 4			%i = alloca i32, align 4
	%uf = alloca %struct..0anon, align 4			%uf = alloca %struct..0anon, align 4
	%ud = alloca %struct..1anon, align 8			%ud = alloca %struct..1anon, align 8
	%"alloca point" = bitcast i32 0 to i32			%"alloca point" = bitcast i32 0 to i32
	store i32 0, i32* %i, align 4			store i32 0, i32* %i, align 4
	▲ Show 20 Lines • Show All 155 Lines • Show Last 20 Lines

llvm/test/Transforms/SROA/addrspacecast.ll

Show First 20 Lines • Show All 166 Lines • ▼ Show 20 Lines	entry:
%val = load i64, i64* %gep.bitcast		%val = load i64, i64* %gep.bitcast
ret i64 %val		ret i64 %val
}		}

; Don't change the address space of a volatile operation		; Don't change the address space of a volatile operation
define i64 @alloca_addrspacecast_bitcast_volatile_store(i64 %X) {		define i64 @alloca_addrspacecast_bitcast_volatile_store(i64 %X) {
; CHECK-LABEL: @alloca_addrspacecast_bitcast_volatile_store(		; CHECK-LABEL: @alloca_addrspacecast_bitcast_volatile_store(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[A:%.*]] = alloca [8 x i8]		; CHECK-NEXT: [[A:%.*]] = alloca [8 x i8], align 1
; CHECK-NEXT: [[A_CAST:%.*]] = addrspacecast [8 x i8]* [[A]] to [8 x i8] addrspace(1)*		; CHECK-NEXT: [[A_CAST:%.*]] = addrspacecast [8 x i8]* [[A]] to [8 x i8] addrspace(1)*
; CHECK-NEXT: [[B:%.*]] = bitcast [8 x i8] addrspace(1)* [[A_CAST]] to i64 addrspace(1)*		; CHECK-NEXT: [[B:%.*]] = bitcast [8 x i8] addrspace(1)* [[A_CAST]] to i64 addrspace(1)*
; CHECK-NEXT: store volatile i64 [[X:%.*]], i64 addrspace(1)* [[B]]		; CHECK-NEXT: store volatile i64 [[X:%.*]], i64 addrspace(1)* [[B]], align 4
; CHECK-NEXT: [[Z:%.*]] = load i64, i64 addrspace(1)* [[B]]		; CHECK-NEXT: [[Z:%.*]] = load i64, i64 addrspace(1)* [[B]], align 4
; CHECK-NEXT: ret i64 [[Z]]		; CHECK-NEXT: ret i64 [[Z]]
;		;
entry:		entry:
%A = alloca [8 x i8]		%A = alloca [8 x i8]
%A.cast = addrspacecast [8 x i8]* %A to [8 x i8] addrspace(1)*		%A.cast = addrspacecast [8 x i8]* %A to [8 x i8] addrspace(1)*
%B = bitcast [8 x i8] addrspace(1)* %A.cast to i64 addrspace(1)*		%B = bitcast [8 x i8] addrspace(1)* %A.cast to i64 addrspace(1)*
store volatile i64 %X, i64 addrspace(1)* %B		store volatile i64 %X, i64 addrspace(1)* %B
%Z = load i64, i64 addrspace(1)* %B		%Z = load i64, i64 addrspace(1)* %B
ret i64 %Z		ret i64 %Z
}		}

; Don't change the address space of a volatile operation		; Don't change the address space of a volatile operation
define i64 @alloca_addrspacecast_bitcast_volatile_load(i64 %X) {		define i64 @alloca_addrspacecast_bitcast_volatile_load(i64 %X) {
; CHECK-LABEL: @alloca_addrspacecast_bitcast_volatile_load(		; CHECK-LABEL: @alloca_addrspacecast_bitcast_volatile_load(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[A:%.*]] = alloca [8 x i8]		; CHECK-NEXT: [[A:%.*]] = alloca [8 x i8], align 1
; CHECK-NEXT: [[A_CAST:%.*]] = addrspacecast [8 x i8]* [[A]] to [8 x i8] addrspace(1)*		; CHECK-NEXT: [[A_CAST:%.*]] = addrspacecast [8 x i8]* [[A]] to [8 x i8] addrspace(1)*
; CHECK-NEXT: [[B:%.*]] = bitcast [8 x i8] addrspace(1)* [[A_CAST]] to i64 addrspace(1)*		; CHECK-NEXT: [[B:%.*]] = bitcast [8 x i8] addrspace(1)* [[A_CAST]] to i64 addrspace(1)*
; CHECK-NEXT: store i64 [[X:%.*]], i64 addrspace(1)* [[B]]		; CHECK-NEXT: store i64 [[X:%.*]], i64 addrspace(1)* [[B]], align 4
; CHECK-NEXT: [[Z:%.*]] = load volatile i64, i64 addrspace(1)* [[B]]		; CHECK-NEXT: [[Z:%.*]] = load volatile i64, i64 addrspace(1)* [[B]], align 4
; CHECK-NEXT: ret i64 [[Z]]		; CHECK-NEXT: ret i64 [[Z]]
;		;
entry:		entry:
%A = alloca [8 x i8]		%A = alloca [8 x i8]
%A.cast = addrspacecast [8 x i8]* %A to [8 x i8] addrspace(1)*		%A.cast = addrspacecast [8 x i8]* %A to [8 x i8] addrspace(1)*
%B = bitcast [8 x i8] addrspace(1)* %A.cast to i64 addrspace(1)*		%B = bitcast [8 x i8] addrspace(1)* %A.cast to i64 addrspace(1)*
store i64 %X, i64 addrspace(1)* %B		store i64 %X, i64 addrspace(1)* %B
%Z = load volatile i64, i64 addrspace(1)* %B		%Z = load volatile i64, i64 addrspace(1)* %B
ret i64 %Z		ret i64 %Z
}		}

declare void @llvm.memset.p1i8.i32(i8 addrspace(1)* nocapture, i8, i32, i1) nounwind		declare void @llvm.memset.p1i8.i32(i8 addrspace(1)* nocapture, i8, i32, i1) nounwind

; Don't change the address space of a volatile operation		; Don't change the address space of a volatile operation
define i32 @volatile_memset() {		define i32 @volatile_memset() {
; CHECK-LABEL: @volatile_memset(		; CHECK-LABEL: @volatile_memset(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[A:%.*]] = alloca [4 x i8]		; CHECK-NEXT: [[A:%.*]] = alloca [4 x i8], align 1
; CHECK-NEXT: [[PTR:%.*]] = getelementptr [4 x i8], [4 x i8]* [[A]], i32 0, i32 0		; CHECK-NEXT: [[PTR:%.*]] = getelementptr [4 x i8], [4 x i8]* [[A]], i32 0, i32 0
; CHECK-NEXT: [[ASC:%.*]] = addrspacecast i8* [[PTR]] to i8 addrspace(1)*		; CHECK-NEXT: [[ASC:%.*]] = addrspacecast i8* [[PTR]] to i8 addrspace(1)*
; CHECK-NEXT: call void @llvm.memset.p1i8.i32(i8 addrspace(1)* [[ASC]], i8 42, i32 4, i1 true)		; CHECK-NEXT: call void @llvm.memset.p1i8.i32(i8 addrspace(1)* [[ASC]], i8 42, i32 4, i1 true)
; CHECK-NEXT: [[IPTR:%.*]] = bitcast i8* [[PTR]] to i32*		; CHECK-NEXT: [[IPTR:%.*]] = bitcast i8* [[PTR]] to i32*
; CHECK-NEXT: [[VAL:%.*]] = load i32, i32* [[IPTR]]		; CHECK-NEXT: [[VAL:%.*]] = load i32, i32* [[IPTR]], align 4
; CHECK-NEXT: ret i32 [[VAL]]		; CHECK-NEXT: ret i32 [[VAL]]
;		;
entry:		entry:
%a = alloca [4 x i8]		%a = alloca [4 x i8]
%ptr = getelementptr [4 x i8], [4 x i8]* %a, i32 0, i32 0		%ptr = getelementptr [4 x i8], [4 x i8]* %a, i32 0, i32 0
%asc = addrspacecast i8* %ptr to i8 addrspace(1)*		%asc = addrspacecast i8* %ptr to i8 addrspace(1)*
call void @llvm.memset.p1i8.i32(i8 addrspace(1)* %asc, i8 42, i32 4, i1 true)		call void @llvm.memset.p1i8.i32(i8 addrspace(1)* %asc, i8 42, i32 4, i1 true)
%iptr = bitcast i8* %ptr to i32*		%iptr = bitcast i8* %ptr to i32*
%val = load i32, i32* %iptr		%val = load i32, i32* %iptr
ret i32 %val		ret i32 %val
}		}

; Don't change the address space of a volatile operation		; Don't change the address space of a volatile operation
define void @volatile_memcpy(i8* %src, i8* %dst) {		define void @volatile_memcpy(i8* %src, i8* %dst) {
; CHECK-LABEL: @volatile_memcpy(		; CHECK-LABEL: @volatile_memcpy(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[A:%.*]] = alloca [4 x i8]		; CHECK-NEXT: [[A:%.*]] = alloca [4 x i8], align 1
; CHECK-NEXT: [[PTR:%.*]] = getelementptr [4 x i8], [4 x i8]* [[A]], i32 0, i32 0		; CHECK-NEXT: [[PTR:%.*]] = getelementptr [4 x i8], [4 x i8]* [[A]], i32 0, i32 0
; CHECK-NEXT: [[ASC:%.*]] = addrspacecast i8* [[PTR]] to i8 addrspace(1)*		; CHECK-NEXT: [[ASC:%.*]] = addrspacecast i8* [[PTR]] to i8 addrspace(1)*
; CHECK-NEXT: call void @llvm.memcpy.p1i8.p0i8.i32(i8 addrspace(1)* [[ASC]], i8* [[SRC:%.*]], i32 4, i1 true), !tbaa !0		; CHECK-NEXT: call void @llvm.memcpy.p1i8.p0i8.i32(i8 addrspace(1)* [[ASC]], i8* [[SRC:%.*]], i32 4, i1 true), !tbaa [[TBAA0:![0-9]+]]
; CHECK-NEXT: call void @llvm.memcpy.p0i8.p1i8.i32(i8* [[DST:%.*]], i8 addrspace(1)* [[ASC]], i32 4, i1 true), !tbaa !3		; CHECK-NEXT: call void @llvm.memcpy.p0i8.p1i8.i32(i8* [[DST:%.*]], i8 addrspace(1)* [[ASC]], i32 4, i1 true), !tbaa [[TBAA3:![0-9]+]]
; CHECK-NEXT: ret void		; CHECK-NEXT: ret void
;		;
entry:		entry:
%a = alloca [4 x i8]		%a = alloca [4 x i8]
%ptr = getelementptr [4 x i8], [4 x i8]* %a, i32 0, i32 0		%ptr = getelementptr [4 x i8], [4 x i8]* %a, i32 0, i32 0
%asc = addrspacecast i8* %ptr to i8 addrspace(1)*		%asc = addrspacecast i8* %ptr to i8 addrspace(1)*
call void @llvm.memcpy.p1i8.p0i8.i32(i8 addrspace(1)* %asc, i8* %src, i32 4, i1 true), !tbaa !0		call void @llvm.memcpy.p1i8.p0i8.i32(i8 addrspace(1)* %asc, i8* %src, i32 4, i1 true), !tbaa !0
call void @llvm.memcpy.p0i8.p1i8.i32(i8* %dst, i8 addrspace(1)* %asc, i32 4, i1 true), !tbaa !3		call void @llvm.memcpy.p0i8.p1i8.i32(i8* %dst, i8 addrspace(1)* %asc, i32 4, i1 true), !tbaa !3
Show All 32 Lines

;; If this was external, we wouldn't be able to prove dereferenceability		;; If this was external, we wouldn't be able to prove dereferenceability
;; of the location.		;; of the location.
@gv = addrspace(1) global i64 zeroinitializer		@gv = addrspace(1) global i64 zeroinitializer

define void @select_addrspacecast_gv(i1 %a, i1 %b) {		define void @select_addrspacecast_gv(i1 %a, i1 %b) {
; CHECK-LABEL: @select_addrspacecast_gv(		; CHECK-LABEL: @select_addrspacecast_gv(
; CHECK-NEXT: [[COND_SROA_SPECULATE_LOAD_FALSE:%.*]] = load i64, i64 addrspace(1)* @gv, align 8		; CHECK-NEXT: [[COND_SROA_SPECULATE_LOAD_FALSE:%.*]] = load i64, i64 addrspace(1)* @gv, align 8
; CHECK-NEXT: [[COND_SROA_SPECULATED:%.*]] = select i1 undef, i64 undef, i64 [[COND_SROA_SPECULATE_LOAD_FALSE]]		; CHECK-NEXT: [[COND_SROA_SPECULATED:%.*]] = select i1 undef, i64 poison, i64 [[COND_SROA_SPECULATE_LOAD_FALSE]]
; CHECK-NEXT: ret void		; CHECK-NEXT: ret void
;		;
%c = alloca i64, align 8		%c = alloca i64, align 8
%p.0.c = select i1 undef, i64* %c, i64* %c		%p.0.c = select i1 undef, i64* %c, i64* %c
%asc = addrspacecast i64* %p.0.c to i64 addrspace(1)*		%asc = addrspacecast i64* %p.0.c to i64 addrspace(1)*

%cond.in = select i1 undef, i64 addrspace(1)* %asc, i64 addrspace(1)* @gv		%cond.in = select i1 undef, i64 addrspace(1)* %asc, i64 addrspace(1)* @gv
%cond = load i64, i64 addrspace(1)* %cond.in, align 8		%cond = load i64, i64 addrspace(1)* %cond.in, align 8
ret void		ret void
}		}

define i8 @select_addrspacecast_i8() {		define i8 @select_addrspacecast_i8() {
; CHECK-LABEL: @select_addrspacecast_i8(		; CHECK-LABEL: @select_addrspacecast_i8(
; CHECK-NEXT: [[RET_SROA_SPECULATED:%.*]] = select i1 undef, i8 undef, i8 undef		; CHECK-NEXT: [[RET_SROA_SPECULATED:%.*]] = select i1 undef, i8 poison, i8 poison
; CHECK-NEXT: ret i8 [[RET_SROA_SPECULATED]]		; CHECK-NEXT: ret i8 [[RET_SROA_SPECULATED]]
;		;
%a = alloca i8		%a = alloca i8
%b = alloca i8		%b = alloca i8

%a.ptr = addrspacecast i8* %a to i8 addrspace(1)*		%a.ptr = addrspacecast i8* %a to i8 addrspace(1)*
%b.ptr = addrspacecast i8* %b to i8 addrspace(1)*		%b.ptr = addrspacecast i8* %b to i8 addrspace(1)*

Show All 10 Lines

llvm/test/Transforms/SROA/basictest.ll

	Show First 20 Lines • Show All 1,245 Lines • ▼ Show 20 Lines
	; registers. This in turn was missed as an optimization by SROA due to the			; registers. This in turn was missed as an optimization by SROA due to the
	; partial loads and stores of integers to the double alloca we were trying to			; partial loads and stores of integers to the double alloca we were trying to
	; form and promote. The solution is to widen the integer operations to be			; form and promote. The solution is to widen the integer operations to be
	; whole-alloca operations, and perform the appropriate bitcasting on the			; whole-alloca operations, and perform the appropriate bitcasting on the
	; values rather than the pointers. When this works, partial reads and writes			; values rather than the pointers. When this works, partial reads and writes
	; via integers can be promoted away.			; via integers can be promoted away.
	; CHECK-LABEL: @PR14059.1(			; CHECK-LABEL: @PR14059.1(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[TMP0:%.*]] = bitcast double undef to i64			; CHECK-NEXT: [[TMP0:%.*]] = bitcast double poison to i64
	; CHECK-NEXT: [[X_SROA_0_I_0_INSERT_MASK:%.*]] = and i64 [[TMP0]], -4294967296			; CHECK-NEXT: [[X_SROA_0_I_0_INSERT_MASK:%.*]] = and i64 [[TMP0]], -4294967296
	; CHECK-NEXT: [[TMP1:%.*]] = bitcast i64 [[X_SROA_0_I_0_INSERT_MASK]] to double			; CHECK-NEXT: [[TMP1:%.*]] = bitcast i64 [[X_SROA_0_I_0_INSERT_MASK]] to double
	; CHECK-NEXT: [[TMP2:%.*]] = bitcast double [[TMP1]] to i64			; CHECK-NEXT: [[TMP2:%.*]] = bitcast double [[TMP1]] to i64
	; CHECK-NEXT: [[X_SROA_0_I_2_INSERT_MASK:%.*]] = and i64 [[TMP2]], -281474976645121			; CHECK-NEXT: [[X_SROA_0_I_2_INSERT_MASK:%.*]] = and i64 [[TMP2]], -281474976645121
	; CHECK-NEXT: [[TMP3:%.*]] = bitcast i64 [[X_SROA_0_I_2_INSERT_MASK]] to double			; CHECK-NEXT: [[TMP3:%.*]] = bitcast i64 [[X_SROA_0_I_2_INSERT_MASK]] to double
	; CHECK-NEXT: [[TMP4:%.*]] = bitcast double [[TMP3]] to i64			; CHECK-NEXT: [[TMP4:%.*]] = bitcast double [[TMP3]] to i64
	; CHECK-NEXT: [[X_SROA_0_I_4_D_RAW_SROA_CAST:%.*]] = bitcast double* [[D:%.*]] to i32*			; CHECK-NEXT: [[X_SROA_0_I_4_D_RAW_SROA_CAST:%.*]] = bitcast double* [[D:%.*]] to i32*
	; CHECK-NEXT: [[X_SROA_0_I_4_COPYLOAD:%.*]] = load i32, i32* [[X_SROA_0_I_4_D_RAW_SROA_CAST]], align 1			; CHECK-NEXT: [[X_SROA_0_I_4_COPYLOAD:%.*]] = load i32, i32* [[X_SROA_0_I_4_D_RAW_SROA_CAST]], align 1
	▲ Show 20 Lines • Show All 213 Lines • ▼ Show 20 Lines
	}			}

	define i32 @PR14572.2(<3 x i8> %x) {			define i32 @PR14572.2(<3 x i8> %x) {
	; Ensure that a split integer load which is wider than the type size of the			; Ensure that a split integer load which is wider than the type size of the
	; alloca (relying on the alloc size padding) doesn't trigger an assert.			; alloca (relying on the alloc size padding) doesn't trigger an assert.
	; CHECK-LABEL: @PR14572.2(			; CHECK-LABEL: @PR14572.2(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[TMP0:%.*]] = bitcast <3 x i8> [[X:%.*]] to i24			; CHECK-NEXT: [[TMP0:%.*]] = bitcast <3 x i8> [[X:%.*]] to i24
	; CHECK-NEXT: [[A_SROA_2_0_INSERT_EXT:%.*]] = zext i8 undef to i32			; CHECK-NEXT: [[A_SROA_2_0_INSERT_EXT:%.*]] = zext i8 poison to i32
	; CHECK-NEXT: [[A_SROA_2_0_INSERT_SHIFT:%.*]] = shl i32 [[A_SROA_2_0_INSERT_EXT]], 24			; CHECK-NEXT: [[A_SROA_2_0_INSERT_SHIFT:%.*]] = shl i32 [[A_SROA_2_0_INSERT_EXT]], 24
	; CHECK-NEXT: [[A_SROA_2_0_INSERT_MASK:%.*]] = and i32 undef, 16777215			; CHECK-NEXT: [[A_SROA_2_0_INSERT_MASK:%.*]] = and i32 undef, 16777215
	; CHECK-NEXT: [[A_SROA_2_0_INSERT_INSERT:%.*]] = or i32 [[A_SROA_2_0_INSERT_MASK]], [[A_SROA_2_0_INSERT_SHIFT]]			; CHECK-NEXT: [[A_SROA_2_0_INSERT_INSERT:%.*]] = or i32 [[A_SROA_2_0_INSERT_MASK]], [[A_SROA_2_0_INSERT_SHIFT]]
	; CHECK-NEXT: [[A_0_INSERT_EXT:%.*]] = zext i24 [[TMP0]] to i32			; CHECK-NEXT: [[A_0_INSERT_EXT:%.*]] = zext i24 [[TMP0]] to i32
	; CHECK-NEXT: [[A_0_INSERT_MASK:%.*]] = and i32 [[A_SROA_2_0_INSERT_INSERT]], -16777216			; CHECK-NEXT: [[A_0_INSERT_MASK:%.*]] = and i32 [[A_SROA_2_0_INSERT_INSERT]], -16777216
	; CHECK-NEXT: [[A_0_INSERT_INSERT:%.*]] = or i32 [[A_0_INSERT_MASK]], [[A_0_INSERT_EXT]]			; CHECK-NEXT: [[A_0_INSERT_INSERT:%.*]] = or i32 [[A_0_INSERT_MASK]], [[A_0_INSERT_EXT]]
	; CHECK-NEXT: ret i32 [[A_0_INSERT_INSERT]]			; CHECK-NEXT: ret i32 [[A_0_INSERT_INSERT]]
	;			;
	▲ Show 20 Lines • Show All 121 Lines • ▼ Show 20 Lines
	end:			end:
	%tmp.raw = bitcast [4 x i8]* %tmp to i8*			%tmp.raw = bitcast [4 x i8]* %tmp to i8*
	call void @llvm.memcpy.p0i8.p0i8.i32(i8* %data, i8* %tmp.raw, i32 %size, i1 false)			call void @llvm.memcpy.p0i8.p0i8.i32(i8* %data, i8* %tmp.raw, i32 %size, i1 false)
	ret void			ret void
	}			}

	define void @PR15805(i1 %a, i1 %b) {			define void @PR15805(i1 %a, i1 %b) {
	; CHECK-LABEL: @PR15805(			; CHECK-LABEL: @PR15805(
	; CHECK-NEXT: [[COND_SROA_SPECULATED:%.*]] = select i1 undef, i64 undef, i64 undef			; CHECK-NEXT: [[COND_SROA_SPECULATED:%.*]] = select i1 undef, i64 poison, i64 poison
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;

	%c = alloca i64, align 8			%c = alloca i64, align 8
	%p.0.c = select i1 undef, i64* %c, i64* %c			%p.0.c = select i1 undef, i64* %c, i64* %c
	%cond.in = select i1 undef, i64* %p.0.c, i64* %c			%cond.in = select i1 undef, i64* %p.0.c, i64* %c
	%cond = load i64, i64* %cond.in, align 8			%cond = load i64, i64* %cond.in, align 8
	ret void			ret void
	}			}

	define void @PR15805.1(i1 %a, i1 %b) {			define void @PR15805.1(i1 %a, i1 %b) {
	; Same as the normal PR15805, but rigged to place the use before the def inside			; Same as the normal PR15805, but rigged to place the use before the def inside
	; of looping unreachable code. This helps ensure that we aren't sensitive to the			; of looping unreachable code. This helps ensure that we aren't sensitive to the
	; order in which the uses of the alloca are visited.			; order in which the uses of the alloca are visited.
	;			;
	; CHECK-LABEL: @PR15805.1(			; CHECK-LABEL: @PR15805.1(
	; CHECK-NEXT: br label [[EXIT:%.*]]			; CHECK-NEXT: br label [[EXIT:%.*]]
	; CHECK: loop:			; CHECK: loop:
	; CHECK-NEXT: [[COND_SROA_SPECULATED:%.*]] = select i1 undef, i64 undef, i64 undef			; CHECK-NEXT: [[COND_SROA_SPECULATED:%.*]] = select i1 undef, i64 poison, i64 poison
	; CHECK-NEXT: br i1 undef, label [[LOOP:%.*]], label [[EXIT]]			; CHECK-NEXT: br i1 undef, label [[LOOP:%.*]], label [[EXIT]]
	; CHECK: exit:			; CHECK: exit:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;

	%c = alloca i64, align 8			%c = alloca i64, align 8
	br label %exit			br label %exit

	▲ Show 20 Lines • Show All 227 Lines • ▼ Show 20 Lines
	; this alloca we need to handle it specially to ensure the splits line up			; this alloca we need to handle it specially to ensure the splits line up
	; properly for rewriting.			; properly for rewriting.
	;			;
	; CHECK-LABEL: @PR22093(			; CHECK-LABEL: @PR22093(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[A_SROA_0:%.*]] = alloca i16, align 4			; CHECK-NEXT: [[A_SROA_0:%.*]] = alloca i16, align 4
	; CHECK-NEXT: store volatile i16 42, i16* [[A_SROA_0]], align 4			; CHECK-NEXT: store volatile i16 42, i16* [[A_SROA_0]], align 4
	; CHECK-NEXT: [[A_SROA_0_0_A_SROA_0_0_LOAD:%.*]] = load i16, i16* [[A_SROA_0]], align 4			; CHECK-NEXT: [[A_SROA_0_0_A_SROA_0_0_LOAD:%.*]] = load i16, i16* [[A_SROA_0]], align 4
	; CHECK-NEXT: [[A_SROA_3_0_INSERT_EXT:%.*]] = zext i16 undef to i32			; CHECK-NEXT: [[A_SROA_3_0_INSERT_EXT:%.*]] = zext i16 poison to i32
	; CHECK-NEXT: [[A_SROA_3_0_INSERT_SHIFT:%.*]] = shl i32 [[A_SROA_3_0_INSERT_EXT]], 16			; CHECK-NEXT: [[A_SROA_3_0_INSERT_SHIFT:%.*]] = shl i32 [[A_SROA_3_0_INSERT_EXT]], 16
	; CHECK-NEXT: [[A_SROA_3_0_INSERT_MASK:%.*]] = and i32 undef, 65535			; CHECK-NEXT: [[A_SROA_3_0_INSERT_MASK:%.*]] = and i32 undef, 65535
	; CHECK-NEXT: [[A_SROA_3_0_INSERT_INSERT:%.*]] = or i32 [[A_SROA_3_0_INSERT_MASK]], [[A_SROA_3_0_INSERT_SHIFT]]			; CHECK-NEXT: [[A_SROA_3_0_INSERT_INSERT:%.*]] = or i32 [[A_SROA_3_0_INSERT_MASK]], [[A_SROA_3_0_INSERT_SHIFT]]
	; CHECK-NEXT: [[A_SROA_0_0_INSERT_EXT:%.*]] = zext i16 [[A_SROA_0_0_A_SROA_0_0_LOAD]] to i32			; CHECK-NEXT: [[A_SROA_0_0_INSERT_EXT:%.*]] = zext i16 [[A_SROA_0_0_A_SROA_0_0_LOAD]] to i32
	; CHECK-NEXT: [[A_SROA_0_0_INSERT_MASK:%.*]] = and i32 [[A_SROA_3_0_INSERT_INSERT]], -65536			; CHECK-NEXT: [[A_SROA_0_0_INSERT_MASK:%.*]] = and i32 [[A_SROA_3_0_INSERT_INSERT]], -65536
	; CHECK-NEXT: [[A_SROA_0_0_INSERT_INSERT:%.*]] = or i32 [[A_SROA_0_0_INSERT_MASK]], [[A_SROA_0_0_INSERT_EXT]]			; CHECK-NEXT: [[A_SROA_0_0_INSERT_INSERT:%.*]] = or i32 [[A_SROA_0_0_INSERT_MASK]], [[A_SROA_0_0_INSERT_EXT]]
	; CHECK-NEXT: [[A_SROA_0_0_EXTRACT_TRUNC:%.*]] = trunc i32 [[A_SROA_0_0_INSERT_INSERT]] to i16			; CHECK-NEXT: [[A_SROA_0_0_EXTRACT_TRUNC:%.*]] = trunc i32 [[A_SROA_0_0_INSERT_INSERT]] to i16
	; CHECK-NEXT: store i16 [[A_SROA_0_0_EXTRACT_TRUNC]], i16* [[A_SROA_0]], align 4			; CHECK-NEXT: store i16 [[A_SROA_0_0_EXTRACT_TRUNC]], i16* [[A_SROA_0]], align 4
	Show All 21 Lines
	; presplit.			; presplit.
	;			;
	; CHECK-LABEL: @PR22093.2(			; CHECK-LABEL: @PR22093.2(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[A_SROA_0:%.*]] = alloca i16, align 8			; CHECK-NEXT: [[A_SROA_0:%.*]] = alloca i16, align 8
	; CHECK-NEXT: [[A_SROA_31:%.*]] = alloca i8, align 4			; CHECK-NEXT: [[A_SROA_31:%.*]] = alloca i8, align 4
	; CHECK-NEXT: store volatile i16 42, i16* [[A_SROA_0]], align 8			; CHECK-NEXT: store volatile i16 42, i16* [[A_SROA_0]], align 8
	; CHECK-NEXT: [[A_SROA_0_0_A_SROA_0_0_LOAD:%.*]] = load i16, i16* [[A_SROA_0]], align 8			; CHECK-NEXT: [[A_SROA_0_0_A_SROA_0_0_LOAD:%.*]] = load i16, i16* [[A_SROA_0]], align 8
	; CHECK-NEXT: [[A_SROA_3_0_INSERT_EXT:%.*]] = zext i16 undef to i32			; CHECK-NEXT: [[A_SROA_3_0_INSERT_EXT:%.*]] = zext i16 poison to i32
	; CHECK-NEXT: [[A_SROA_3_0_INSERT_SHIFT:%.*]] = shl i32 [[A_SROA_3_0_INSERT_EXT]], 16			; CHECK-NEXT: [[A_SROA_3_0_INSERT_SHIFT:%.*]] = shl i32 [[A_SROA_3_0_INSERT_EXT]], 16
	; CHECK-NEXT: [[A_SROA_3_0_INSERT_MASK:%.*]] = and i32 undef, 65535			; CHECK-NEXT: [[A_SROA_3_0_INSERT_MASK:%.*]] = and i32 undef, 65535
	; CHECK-NEXT: [[A_SROA_3_0_INSERT_INSERT:%.*]] = or i32 [[A_SROA_3_0_INSERT_MASK]], [[A_SROA_3_0_INSERT_SHIFT]]			; CHECK-NEXT: [[A_SROA_3_0_INSERT_INSERT:%.*]] = or i32 [[A_SROA_3_0_INSERT_MASK]], [[A_SROA_3_0_INSERT_SHIFT]]
	; CHECK-NEXT: [[A_SROA_0_0_INSERT_EXT:%.*]] = zext i16 [[A_SROA_0_0_A_SROA_0_0_LOAD]] to i32			; CHECK-NEXT: [[A_SROA_0_0_INSERT_EXT:%.*]] = zext i16 [[A_SROA_0_0_A_SROA_0_0_LOAD]] to i32
	; CHECK-NEXT: [[A_SROA_0_0_INSERT_MASK:%.*]] = and i32 [[A_SROA_3_0_INSERT_INSERT]], -65536			; CHECK-NEXT: [[A_SROA_0_0_INSERT_MASK:%.*]] = and i32 [[A_SROA_3_0_INSERT_INSERT]], -65536
	; CHECK-NEXT: [[A_SROA_0_0_INSERT_INSERT:%.*]] = or i32 [[A_SROA_0_0_INSERT_MASK]], [[A_SROA_0_0_INSERT_EXT]]			; CHECK-NEXT: [[A_SROA_0_0_INSERT_INSERT:%.*]] = or i32 [[A_SROA_0_0_INSERT_MASK]], [[A_SROA_0_0_INSERT_EXT]]
	; CHECK-NEXT: [[A_SROA_0_0_EXTRACT_TRUNC:%.*]] = trunc i32 [[A_SROA_0_0_INSERT_INSERT]] to i16			; CHECK-NEXT: [[A_SROA_0_0_EXTRACT_TRUNC:%.*]] = trunc i32 [[A_SROA_0_0_INSERT_INSERT]] to i16
	; CHECK-NEXT: store i16 [[A_SROA_0_0_EXTRACT_TRUNC]], i16* [[A_SROA_0]], align 8			; CHECK-NEXT: store i16 [[A_SROA_0_0_EXTRACT_TRUNC]], i16* [[A_SROA_0]], align 8
	▲ Show 20 Lines • Show All 199 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: [[TMP0:%.*]] = bitcast float* [[ARRAYIDX5]] to i32*			; CHECK-NEXT: [[TMP0:%.*]] = bitcast float* [[ARRAYIDX5]] to i32*
	; CHECK-NEXT: br label [[BB2:%.*]]			; CHECK-NEXT: br label [[BB2:%.*]]
	; CHECK: bb2:			; CHECK: bb2:
	; CHECK-NEXT: [[I_02:%.*]] = phi i32 [ [[NUM]], [[BB1]] ], [ [[SUB:%.*]], [[BB5:%.*]] ]			; CHECK-NEXT: [[I_02:%.*]] = phi i32 [ [[NUM]], [[BB1]] ], [ [[SUB:%.*]], [[BB5:%.*]] ]
	; CHECK-NEXT: br i1 [[TOBOOL]], label [[BB3:%.*]], label [[BB4:%.*]]			; CHECK-NEXT: br i1 [[TOBOOL]], label [[BB3:%.*]], label [[BB4:%.*]]
	; CHECK: bb3:			; CHECK: bb3:
	; CHECK-NEXT: br label [[BB5]]			; CHECK-NEXT: br label [[BB5]]
	; CHECK: bb4:			; CHECK: bb4:
	; CHECK-NEXT: store i32 undef, i32* [[TMP0]], align 4			; CHECK-NEXT: store i32 poison, i32* [[TMP0]], align 4
	; CHECK-NEXT: br label [[BB5]]			; CHECK-NEXT: br label [[BB5]]
	; CHECK: bb5:			; CHECK: bb5:
	; CHECK-NEXT: [[SUB]] = add i32 [[I_02]], -1			; CHECK-NEXT: [[SUB]] = add i32 [[I_02]], -1
	; CHECK-NEXT: [[CMP:%.*]] = icmp sgt i32 [[SUB]], 0			; CHECK-NEXT: [[CMP:%.*]] = icmp sgt i32 [[SUB]], 0
	; CHECK-NEXT: br i1 [[CMP]], label [[BB2]], label [[BB6:%.*]]			; CHECK-NEXT: br i1 [[CMP]], label [[BB2]], label [[BB6:%.*]]
	; CHECK: bb6:			; CHECK: bb6:
	; CHECK-NEXT: br label [[BB7]]			; CHECK-NEXT: br label [[BB7]]
	; CHECK: bb7:			; CHECK: bb7:
	▲ Show 20 Lines • Show All 114 Lines • Show Last 20 Lines

llvm/test/Transforms/SROA/phi-and-select.ll

	Show First 20 Lines • Show All 315 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: [[TEST:%.*]] = icmp ne i32 [[B:%.*]], 0			; CHECK-NEXT: [[TEST:%.*]] = icmp ne i32 [[B:%.*]], 0
	; CHECK-NEXT: br i1 [[TEST]], label [[THEN:%.*]], label [[ELSE:%.*]]			; CHECK-NEXT: br i1 [[TEST]], label [[THEN:%.*]], label [[ELSE:%.*]]
	; CHECK: then:			; CHECK: then:
	; CHECK-NEXT: [[PHI_SROA_SPECULATE_LOAD_THEN:%.*]] = load i32, i32* [[PTR:%.*]], align 4			; CHECK-NEXT: [[PHI_SROA_SPECULATE_LOAD_THEN:%.*]] = load i32, i32* [[PTR:%.*]], align 4
	; CHECK-NEXT: br label [[EXIT:%.*]]			; CHECK-NEXT: br label [[EXIT:%.*]]
	; CHECK: else:			; CHECK: else:
	; CHECK-NEXT: br label [[EXIT]]			; CHECK-NEXT: br label [[EXIT]]
	; CHECK: exit:			; CHECK: exit:
	; CHECK-NEXT: [[PHI_SROA_SPECULATED:%.*]] = phi i32 [ undef, [[ELSE]] ], [ [[PHI_SROA_SPECULATE_LOAD_THEN]], [[THEN]] ]			; CHECK-NEXT: [[PHI_SROA_SPECULATED:%.*]] = phi i32 [ poison, [[ELSE]] ], [ [[PHI_SROA_SPECULATE_LOAD_THEN]], [[THEN]] ]
	; CHECK-NEXT: ret i32 [[PHI_SROA_SPECULATED]]			; CHECK-NEXT: ret i32 [[PHI_SROA_SPECULATED]]
	;			;

	entry:			entry:
	%f = alloca float			%f = alloca float
	%test = icmp ne i32 %b, 0			%test = icmp ne i32 %b, 0
	br i1 %test, label %then, label %else			br i1 %test, label %then, label %else

	Show All 12 Lines

	define i32 @test9(i32 %b, i32* %ptr) {			define i32 @test9(i32 %b, i32* %ptr) {
	; Same as @test8 but for a select rather than a PHI node.			; Same as @test8 but for a select rather than a PHI node.
	; CHECK-LABEL: @test9(			; CHECK-LABEL: @test9(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: store i32 0, i32* [[PTR:%.*]], align 4			; CHECK-NEXT: store i32 0, i32* [[PTR:%.*]], align 4
	; CHECK-NEXT: [[TEST:%.*]] = icmp ne i32 [[B:%.*]], 0			; CHECK-NEXT: [[TEST:%.*]] = icmp ne i32 [[B:%.*]], 0
	; CHECK-NEXT: [[LOADED_SROA_SPECULATE_LOAD_FALSE:%.*]] = load i32, i32* [[PTR]], align 4			; CHECK-NEXT: [[LOADED_SROA_SPECULATE_LOAD_FALSE:%.*]] = load i32, i32* [[PTR]], align 4
	; CHECK-NEXT: [[LOADED_SROA_SPECULATED:%.*]] = select i1 [[TEST]], i32 undef, i32 [[LOADED_SROA_SPECULATE_LOAD_FALSE]]			; CHECK-NEXT: [[LOADED_SROA_SPECULATED:%.*]] = select i1 [[TEST]], i32 poison, i32 [[LOADED_SROA_SPECULATE_LOAD_FALSE]]
	; CHECK-NEXT: ret i32 [[LOADED_SROA_SPECULATED]]			; CHECK-NEXT: ret i32 [[LOADED_SROA_SPECULATED]]
	;			;

	entry:			entry:
	%f = alloca float			%f = alloca float
	store i32 0, i32* %ptr			store i32 0, i32* %ptr
	%test = icmp ne i32 %b, 0			%test = icmp ne i32 %b, 0
	%bitcast = bitcast float* %f to i32*			%bitcast = bitcast float* %f to i32*
	▲ Show 20 Lines • Show All 504 Lines • Show Last 20 Lines

llvm/test/Transforms/SROA/phi-gep.ll

	Show First 20 Lines • Show All 487 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: i32 3, label [[BB_4:%.*]]			; CHECK-NEXT: i32 3, label [[BB_4:%.*]]
	; CHECK-NEXT: i32 4, label [[BB_4]]			; CHECK-NEXT: i32 4, label [[BB_4]]
	; CHECK-NEXT: ]			; CHECK-NEXT: ]
	; CHECK: bb.2:			; CHECK: bb.2:
	; CHECK-NEXT: br label [[BB_4]]			; CHECK-NEXT: br label [[BB_4]]
	; CHECK: bb.3:			; CHECK: bb.3:
	; CHECK-NEXT: br label [[BB_4]]			; CHECK-NEXT: br label [[BB_4]]
	; CHECK: bb.4:			; CHECK: bb.4:
	; CHECK-NEXT: [[PHI_SROA_PHI_SROA_SPECULATED:%.*]] = phi i32 [ undef, [[BB_3]] ], [ undef, [[BB_2]] ], [ undef, [[BB_1:%.*]] ], [ undef, [[BB_1]] ]			; CHECK-NEXT: [[PHI_SROA_PHI_SROA_SPECULATED:%.*]] = phi i32 [ poison, [[BB_3]] ], [ poison, [[BB_2]] ], [ poison, [[BB_1:%.*]] ], [ poison, [[BB_1]] ]
	; CHECK-NEXT: ret i32 [[PHI_SROA_PHI_SROA_SPECULATED]]			; CHECK-NEXT: ret i32 [[PHI_SROA_PHI_SROA_SPECULATED]]
	;			;
	bb.1:			bb.1:
	%a = alloca %pair, align 4			%a = alloca %pair, align 4
	%b = alloca %pair, align 4			%b = alloca %pair, align 4
	switch i32 %arg, label %bb.3 [			switch i32 %arg, label %bb.3 [
	i32 1, label %bb.2			i32 1, label %bb.2
	i32 2, label %bb.2			i32 2, label %bb.2
	Show All 22 Lines

llvm/test/Transforms/SROA/phi-with-duplicate-pred.ll

	Show All 15 Lines
	; CHECK-NEXT: [[G_0_SROA_SPECULATE_LOAD_CLEANUP:%.*]] = load i16, i16* @a, align 1			; CHECK-NEXT: [[G_0_SROA_SPECULATE_LOAD_CLEANUP:%.*]] = load i16, i16* @a, align 1
	; CHECK-NEXT: switch i32 2, label [[CLEANUP7:%.*]] [			; CHECK-NEXT: switch i32 2, label [[CLEANUP7:%.*]] [
	; CHECK-NEXT: i32 0, label [[LBL1:%.*]]			; CHECK-NEXT: i32 0, label [[LBL1:%.*]]
	; CHECK-NEXT: i32 2, label [[LBL1]]			; CHECK-NEXT: i32 2, label [[LBL1]]
	; CHECK-NEXT: ]			; CHECK-NEXT: ]
	; CHECK: if.else:			; CHECK: if.else:
	; CHECK-NEXT: br label [[LBL1]]			; CHECK-NEXT: br label [[LBL1]]
	; CHECK: lbl1:			; CHECK: lbl1:
	; CHECK-NEXT: [[G_0_SROA_SPECULATED:%.*]] = phi i16 [ [[G_0_SROA_SPECULATE_LOAD_CLEANUP]], [[CLEANUP]] ], [ [[G_0_SROA_SPECULATE_LOAD_CLEANUP]], [[CLEANUP]] ], [ undef, [[IF_ELSE]] ]			; CHECK-NEXT: [[G_0_SROA_SPECULATED:%.*]] = phi i16 [ [[G_0_SROA_SPECULATE_LOAD_CLEANUP]], [[CLEANUP]] ], [ [[G_0_SROA_SPECULATE_LOAD_CLEANUP]], [[CLEANUP]] ], [ poison, [[IF_ELSE]] ]
	; CHECK-NEXT: unreachable			; CHECK-NEXT: unreachable
	; CHECK: cleanup7:			; CHECK: cleanup7:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	%e = alloca i16, align 1			%e = alloca i16, align 1
	br i1 undef, label %if.then, label %if.else			br i1 undef, label %if.then, label %if.else

	▲ Show 20 Lines • Show All 233 Lines • Show Last 20 Lines

llvm/test/Transforms/SROA/pr37267.ll

Show All 39 Lines	; slice 4: [2,4)

%rc = add i16 %_tmp13, %_tmp16		%rc = add i16 %_tmp13, %_tmp16
ret i16 %rc		ret i16 %rc
}		}

define i16 @f2() {		define i16 @f2() {
; CHECK-LABEL: @f2(		; CHECK-LABEL: @f2(
; CHECK-NEXT: bb1:		; CHECK-NEXT: bb1:
; CHECK-NEXT: [[A_3_SROA_2_2_INSERT_EXT:%.*]] = zext i16 undef to i32		; CHECK-NEXT: [[A_3_SROA_2_2_INSERT_EXT:%.*]] = zext i16 poison to i32
; CHECK-NEXT: [[A_3_SROA_2_2_INSERT_MASK:%.*]] = and i32 undef, -65536		; CHECK-NEXT: [[A_3_SROA_2_2_INSERT_MASK:%.*]] = and i32 undef, -65536
; CHECK-NEXT: [[A_3_SROA_2_2_INSERT_INSERT:%.*]] = or i32 [[A_3_SROA_2_2_INSERT_MASK]], [[A_3_SROA_2_2_INSERT_EXT]]		; CHECK-NEXT: [[A_3_SROA_2_2_INSERT_INSERT:%.*]] = or i32 [[A_3_SROA_2_2_INSERT_MASK]], [[A_3_SROA_2_2_INSERT_EXT]]
; CHECK-NEXT: [[A_3_SROA_0_2_INSERT_EXT:%.*]] = zext i16 undef to i32		; CHECK-NEXT: [[A_3_SROA_0_2_INSERT_EXT:%.*]] = zext i16 poison to i32
; CHECK-NEXT: [[A_3_SROA_0_2_INSERT_SHIFT:%.*]] = shl i32 [[A_3_SROA_0_2_INSERT_EXT]], 16		; CHECK-NEXT: [[A_3_SROA_0_2_INSERT_SHIFT:%.*]] = shl i32 [[A_3_SROA_0_2_INSERT_EXT]], 16
; CHECK-NEXT: [[A_3_SROA_0_2_INSERT_MASK:%.*]] = and i32 [[A_3_SROA_2_2_INSERT_INSERT]], 65535		; CHECK-NEXT: [[A_3_SROA_0_2_INSERT_MASK:%.*]] = and i32 [[A_3_SROA_2_2_INSERT_INSERT]], 65535
; CHECK-NEXT: [[A_3_SROA_0_2_INSERT_INSERT:%.*]] = or i32 [[A_3_SROA_0_2_INSERT_MASK]], [[A_3_SROA_0_2_INSERT_SHIFT]]		; CHECK-NEXT: [[A_3_SROA_0_2_INSERT_INSERT:%.*]] = or i32 [[A_3_SROA_0_2_INSERT_MASK]], [[A_3_SROA_0_2_INSERT_SHIFT]]
; CHECK-NEXT: [[RC:%.*]] = add i16 2, undef		; CHECK-NEXT: [[RC:%.*]] = add i16 2, poison
; CHECK-NEXT: ret i16 [[RC]]		; CHECK-NEXT: ret i16 [[RC]]
;		;

bb1:		bb1:
; This 12-byte alloca is split into partitions as [0,2), [2,4), [4,8), [8,10), [10, 12).		; This 12-byte alloca is split into partitions as [0,2), [2,4), [4,8), [8,10), [10, 12).
; The reported error happened when visitLoadInst rewrites a split tail of slice 1 for [4, 8) partition.		; The reported error happened when visitLoadInst rewrites a split tail of slice 1 for [4, 8) partition.
; alloca 012345678901		; alloca 012345678901
; slice 1: RRRR		; slice 1: RRRR
Show All 23 Lines

llvm/test/Transforms/SROA/scalable-vectors.ll

	Show First 20 Lines • Show All 67 Lines • ▼ Show 20 Lines

	; When casting from VLA to VLS via memory check we bail out when producing a			; When casting from VLA to VLS via memory check we bail out when producing a
	; GEP where the element type is a scalable vector.			; GEP where the element type is a scalable vector.
	define <vscale x 4 x i32> @cast_alloca_from_svint32_t() {			define <vscale x 4 x i32> @cast_alloca_from_svint32_t() {
	; CHECK-LABEL: @cast_alloca_from_svint32_t(			; CHECK-LABEL: @cast_alloca_from_svint32_t(
	; CHECK-NEXT: [[RETVAL_COERCE:%.*]] = alloca <vscale x 4 x i32>, align 16			; CHECK-NEXT: [[RETVAL_COERCE:%.*]] = alloca <vscale x 4 x i32>, align 16
	; CHECK-NEXT: [[TMP1:%.*]] = bitcast <vscale x 4 x i32>* [[RETVAL_COERCE]] to i8*			; CHECK-NEXT: [[TMP1:%.*]] = bitcast <vscale x 4 x i32>* [[RETVAL_COERCE]] to i8*
	; CHECK-NEXT: [[RETVAL_0__SROA_CAST:%.*]] = bitcast i8* [[TMP1]] to <16 x i32>*			; CHECK-NEXT: [[RETVAL_0__SROA_CAST:%.*]] = bitcast i8* [[TMP1]] to <16 x i32>*
	; CHECK-NEXT: store <16 x i32> undef, <16 x i32>* [[RETVAL_0__SROA_CAST]], align 16			; CHECK-NEXT: store <16 x i32> poison, <16 x i32>* [[RETVAL_0__SROA_CAST]], align 16
	; CHECK-NEXT: [[TMP2:%.*]] = load <vscale x 4 x i32>, <vscale x 4 x i32>* [[RETVAL_COERCE]], align 16			; CHECK-NEXT: [[TMP2:%.*]] = load <vscale x 4 x i32>, <vscale x 4 x i32>* [[RETVAL_COERCE]], align 16
	; CHECK-NEXT: ret <vscale x 4 x i32> [[TMP2]]			; CHECK-NEXT: ret <vscale x 4 x i32> [[TMP2]]
	;			;
	%retval = alloca <16 x i32>			%retval = alloca <16 x i32>
	%retval.coerce = alloca <vscale x 4 x i32>			%retval.coerce = alloca <vscale x 4 x i32>
	%1 = bitcast <vscale x 4 x i32>* %retval.coerce to i8*			%1 = bitcast <vscale x 4 x i32>* %retval.coerce to i8*
	%2 = bitcast <16 x i32>* %retval to i8*			%2 = bitcast <16 x i32>* %retval to i8*
	call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %1, i8* align 16 %2, i64 64, i1 false)			call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %1, i8* align 16 %2, i64 64, i1 false)
	%3 = load <vscale x 4 x i32>, <vscale x 4 x i32>* %retval.coerce			%3 = load <vscale x 4 x i32>, <vscale x 4 x i32>* %retval.coerce
	ret <vscale x 4 x i32> %3			ret <vscale x 4 x i32> %3
	}			}

	declare void @llvm.memcpy.p0i8.p0i8.i64(i8* nocapture, i8* nocapture, i64, i1) nounwind			declare void @llvm.memcpy.p0i8.p0i8.i64(i8* nocapture, i8* nocapture, i64, i1) nounwind

llvm/test/Transforms/SROA/slice-width.ll

	Show First 20 Lines • Show All 96 Lines • ▼ Show 20 Lines
	; PR18726: Check that SROA does not rewrite a 12-byte memcpy into a 16-byte			; PR18726: Check that SROA does not rewrite a 12-byte memcpy into a 16-byte
	; vector store, hence accidentally putting gibberish onto the stack.			; vector store, hence accidentally putting gibberish onto the stack.
	define i32 @memcpy_vec3float_widening(%S.vec3float* %x) {			define i32 @memcpy_vec3float_widening(%S.vec3float* %x) {
	; CHECK-LABEL: @memcpy_vec3float_widening(			; CHECK-LABEL: @memcpy_vec3float_widening(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[TMP1_SROA_0_0_TMP1_SROA_0_0__SROA_CAST_SROA_CAST:%.*]] = bitcast %S.vec3float* [[X:%.*]] to <3 x float>*			; CHECK-NEXT: [[TMP1_SROA_0_0_TMP1_SROA_0_0__SROA_CAST_SROA_CAST:%.*]] = bitcast %S.vec3float* [[X:%.*]] to <3 x float>*
	; CHECK-NEXT: [[TMP1_SROA_0_0_COPYLOAD:%.*]] = load <3 x float>, <3 x float>* [[TMP1_SROA_0_0_TMP1_SROA_0_0__SROA_CAST_SROA_CAST]], align 4			; CHECK-NEXT: [[TMP1_SROA_0_0_COPYLOAD:%.*]] = load <3 x float>, <3 x float>* [[TMP1_SROA_0_0_TMP1_SROA_0_0__SROA_CAST_SROA_CAST]], align 4
	; CHECK-NEXT: [[TMP1_SROA_0_0_VEC_EXPAND:%.*]] = shufflevector <3 x float> [[TMP1_SROA_0_0_COPYLOAD]], <3 x float> poison, <4 x i32> <i32 0, i32 1, i32 2, i32 undef>			; CHECK-NEXT: [[TMP1_SROA_0_0_VEC_EXPAND:%.*]] = shufflevector <3 x float> [[TMP1_SROA_0_0_COPYLOAD]], <3 x float> poison, <4 x i32> <i32 0, i32 1, i32 2, i32 undef>
	; CHECK-NEXT: [[TMP1_SROA_0_0_VECBLEND:%.*]] = select <4 x i1> <i1 true, i1 true, i1 true, i1 false>, <4 x float> [[TMP1_SROA_0_0_VEC_EXPAND]], <4 x float> undef			; CHECK-NEXT: [[TMP1_SROA_0_0_VECBLEND:%.*]] = select <4 x i1> <i1 true, i1 true, i1 true, i1 false>, <4 x float> [[TMP1_SROA_0_0_VEC_EXPAND]], <4 x float> poison
	; CHECK-NEXT: [[TMP2:%.*]] = alloca [[S_VEC3FLOAT:%.*]], align 4			; CHECK-NEXT: [[TMP2:%.*]] = alloca [[S_VEC3FLOAT:%.*]], align 4
	; CHECK-NEXT: [[TMP1_SROA_0_0_TMP1_SROA_0_0__SROA_CAST2_SROA_CAST:%.*]] = bitcast %S.vec3float* [[TMP2]] to <3 x float>*			; CHECK-NEXT: [[TMP1_SROA_0_0_TMP1_SROA_0_0__SROA_CAST2_SROA_CAST:%.*]] = bitcast %S.vec3float* [[TMP2]] to <3 x float>*
	; CHECK-NEXT: [[TMP1_SROA_0_0_VEC_EXTRACT:%.*]] = shufflevector <4 x float> [[TMP1_SROA_0_0_VECBLEND]], <4 x float> poison, <3 x i32> <i32 0, i32 1, i32 2>			; CHECK-NEXT: [[TMP1_SROA_0_0_VEC_EXTRACT:%.*]] = shufflevector <4 x float> [[TMP1_SROA_0_0_VECBLEND]], <4 x float> poison, <3 x i32> <i32 0, i32 1, i32 2>
	; CHECK-NEXT: store <3 x float> [[TMP1_SROA_0_0_VEC_EXTRACT]], <3 x float>* [[TMP1_SROA_0_0_TMP1_SROA_0_0__SROA_CAST2_SROA_CAST]], align 4			; CHECK-NEXT: store <3 x float> [[TMP1_SROA_0_0_VEC_EXTRACT]], <3 x float>* [[TMP1_SROA_0_0_TMP1_SROA_0_0__SROA_CAST2_SROA_CAST]], align 4
	; CHECK-NEXT: [[RESULT:%.*]] = call i32 @memcpy_vec3float_helper(%S.vec3float* [[TMP2]])			; CHECK-NEXT: [[RESULT:%.*]] = call i32 @memcpy_vec3float_helper(%S.vec3float* [[TMP2]])
	; CHECK-NEXT: ret i32 [[RESULT]]			; CHECK-NEXT: ret i32 [[RESULT]]
	;			;
	entry:			entry:
	Show All 20 Lines

llvm/test/Transforms/SROA/vector-promotion.ll

Show First 20 Lines • Show All 256 Lines • ▼ Show 20 Lines	; PR13254
%addr = getelementptr inbounds { <4 x i64>, <4 x i64> }, { <4 x i64>, <4 x i64> }* %tmp, i32 0, i32 0, i64 %n		%addr = getelementptr inbounds { <4 x i64>, <4 x i64> }, { <4 x i64>, <4 x i64> }* %tmp, i32 0, i32 0, i64 %n
%res = load i64, i64* %addr, align 4		%res = load i64, i64* %addr, align 4
ret i64 %res		ret i64 %res
}		}

define <4 x i32> @test_subvec_store() {		define <4 x i32> @test_subvec_store() {
; CHECK-LABEL: @test_subvec_store(		; CHECK-LABEL: @test_subvec_store(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[A_0_VECBLEND:%.*]] = select <4 x i1> <i1 true, i1 true, i1 false, i1 false>, <4 x i32> <i32 0, i32 0, i32 undef, i32 undef>, <4 x i32> undef		; CHECK-NEXT: [[A_0_VECBLEND:%.*]] = select <4 x i1> <i1 true, i1 true, i1 false, i1 false>, <4 x i32> <i32 0, i32 0, i32 undef, i32 undef>, <4 x i32> poison
; CHECK-NEXT: [[A_4_VECBLEND:%.*]] = select <4 x i1> <i1 false, i1 true, i1 true, i1 false>, <4 x i32> <i32 undef, i32 1, i32 1, i32 undef>, <4 x i32> [[A_0_VECBLEND]]		; CHECK-NEXT: [[A_4_VECBLEND:%.*]] = select <4 x i1> <i1 false, i1 true, i1 true, i1 false>, <4 x i32> <i32 undef, i32 1, i32 1, i32 undef>, <4 x i32> [[A_0_VECBLEND]]
; CHECK-NEXT: [[A_8_VECBLEND:%.*]] = select <4 x i1> <i1 false, i1 false, i1 true, i1 true>, <4 x i32> <i32 undef, i32 undef, i32 2, i32 2>, <4 x i32> [[A_4_VECBLEND]]		; CHECK-NEXT: [[A_8_VECBLEND:%.*]] = select <4 x i1> <i1 false, i1 false, i1 true, i1 true>, <4 x i32> <i32 undef, i32 undef, i32 2, i32 2>, <4 x i32> [[A_4_VECBLEND]]
; CHECK-NEXT: [[A_12_VEC_INSERT:%.*]] = insertelement <4 x i32> [[A_8_VECBLEND]], i32 3, i32 3		; CHECK-NEXT: [[A_12_VEC_INSERT:%.*]] = insertelement <4 x i32> [[A_8_VECBLEND]], i32 3, i32 3
; CHECK-NEXT: ret <4 x i32> [[A_12_VEC_INSERT]]		; CHECK-NEXT: ret <4 x i32> [[A_12_VEC_INSERT]]
;		;
entry:		entry:
%a = alloca <4 x i32>		%a = alloca <4 x i32>

(49 lines hidden)	entry:
ret <4 x i32> %ret		ret <4 x i32> %ret
}		}

declare void @llvm.memset.p0i32.i32(i32* nocapture, i32, i32, i1) nounwind		declare void @llvm.memset.p0i32.i32(i32* nocapture, i32, i32, i1) nounwind

define <4 x float> @test_subvec_memset() {		define <4 x float> @test_subvec_memset() {
; CHECK-LABEL: @test_subvec_memset(		; CHECK-LABEL: @test_subvec_memset(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[A_0_VECBLEND:%.*]] = select <4 x i1> <i1 true, i1 true, i1 false, i1 false>, <4 x float> <float 0.000000e+00, float 0.000000e+00, float undef, float undef>, <4 x float> undef		; CHECK-NEXT: [[A_0_VECBLEND:%.*]] = select <4 x i1> <i1 true, i1 true, i1 false, i1 false>, <4 x float> <float 0.000000e+00, float 0.000000e+00, float undef, float undef>, <4 x float> poison
; CHECK-NEXT: [[A_4_VECBLEND:%.*]] = select <4 x i1> <i1 false, i1 true, i1 true, i1 false>, <4 x float> <float undef, float 0x3820202020000000, float 0x3820202020000000, float undef>, <4 x float> [[A_0_VECBLEND]]		; CHECK-NEXT: [[A_4_VECBLEND:%.*]] = select <4 x i1> <i1 false, i1 true, i1 true, i1 false>, <4 x float> <float undef, float 0x3820202020000000, float 0x3820202020000000, float undef>, <4 x float> [[A_0_VECBLEND]]
; CHECK-NEXT: [[A_8_VECBLEND:%.*]] = select <4 x i1> <i1 false, i1 false, i1 true, i1 true>, <4 x float> <float undef, float undef, float 0x3860606060000000, float 0x3860606060000000>, <4 x float> [[A_4_VECBLEND]]		; CHECK-NEXT: [[A_8_VECBLEND:%.*]] = select <4 x i1> <i1 false, i1 false, i1 true, i1 true>, <4 x float> <float undef, float undef, float 0x3860606060000000, float 0x3860606060000000>, <4 x float> [[A_4_VECBLEND]]
; CHECK-NEXT: [[A_12_VEC_INSERT:%.*]] = insertelement <4 x float> [[A_8_VECBLEND]], float 0x38E0E0E0E0000000, i32 3		; CHECK-NEXT: [[A_12_VEC_INSERT:%.*]] = insertelement <4 x float> [[A_8_VECBLEND]], float 0x38E0E0E0E0000000, i32 3
; CHECK-NEXT: ret <4 x float> [[A_12_VEC_INSERT]]		; CHECK-NEXT: ret <4 x float> [[A_12_VEC_INSERT]]
;		;
entry:		entry:
%a = alloca <4 x float>		%a = alloca <4 x float>

(19 lines hidden)
}		}

define <4 x float> @test_subvec_memcpy(i8* %x, i8* %y, i8* %z, i8* %f, i8* %out) {		define <4 x float> @test_subvec_memcpy(i8* %x, i8* %y, i8* %z, i8* %f, i8* %out) {
; CHECK-LABEL: @test_subvec_memcpy(		; CHECK-LABEL: @test_subvec_memcpy(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[A_0_X_SROA_CAST:%.*]] = bitcast i8* [[X:%.*]] to <2 x float>*		; CHECK-NEXT: [[A_0_X_SROA_CAST:%.*]] = bitcast i8* [[X:%.*]] to <2 x float>*
; CHECK-NEXT: [[A_0_COPYLOAD:%.*]] = load <2 x float>, <2 x float>* [[A_0_X_SROA_CAST]], align 1		; CHECK-NEXT: [[A_0_COPYLOAD:%.*]] = load <2 x float>, <2 x float>* [[A_0_X_SROA_CAST]], align 1
; CHECK-NEXT: [[A_0_VEC_EXPAND:%.*]] = shufflevector <2 x float> [[A_0_COPYLOAD]], <2 x float> poison, <4 x i32> <i32 0, i32 1, i32 undef, i32 undef>		; CHECK-NEXT: [[A_0_VEC_EXPAND:%.*]] = shufflevector <2 x float> [[A_0_COPYLOAD]], <2 x float> poison, <4 x i32> <i32 0, i32 1, i32 undef, i32 undef>
; CHECK-NEXT: [[A_0_VECBLEND:%.*]] = select <4 x i1> <i1 true, i1 true, i1 false, i1 false>, <4 x float> [[A_0_VEC_EXPAND]], <4 x float> undef		; CHECK-NEXT: [[A_0_VECBLEND:%.*]] = select <4 x i1> <i1 true, i1 true, i1 false, i1 false>, <4 x float> [[A_0_VEC_EXPAND]], <4 x float> poison
; CHECK-NEXT: [[A_4_Y_SROA_CAST:%.*]] = bitcast i8* [[Y:%.*]] to <2 x float>*		; CHECK-NEXT: [[A_4_Y_SROA_CAST:%.*]] = bitcast i8* [[Y:%.*]] to <2 x float>*
; CHECK-NEXT: [[A_4_COPYLOAD:%.*]] = load <2 x float>, <2 x float>* [[A_4_Y_SROA_CAST]], align 1		; CHECK-NEXT: [[A_4_COPYLOAD:%.*]] = load <2 x float>, <2 x float>* [[A_4_Y_SROA_CAST]], align 1
; CHECK-NEXT: [[A_4_VEC_EXPAND:%.*]] = shufflevector <2 x float> [[A_4_COPYLOAD]], <2 x float> poison, <4 x i32> <i32 undef, i32 0, i32 1, i32 undef>		; CHECK-NEXT: [[A_4_VEC_EXPAND:%.*]] = shufflevector <2 x float> [[A_4_COPYLOAD]], <2 x float> poison, <4 x i32> <i32 undef, i32 0, i32 1, i32 undef>
; CHECK-NEXT: [[A_4_VECBLEND:%.*]] = select <4 x i1> <i1 false, i1 true, i1 true, i1 false>, <4 x float> [[A_4_VEC_EXPAND]], <4 x float> [[A_0_VECBLEND]]		; CHECK-NEXT: [[A_4_VECBLEND:%.*]] = select <4 x i1> <i1 false, i1 true, i1 true, i1 false>, <4 x float> [[A_4_VEC_EXPAND]], <4 x float> [[A_0_VECBLEND]]
; CHECK-NEXT: [[A_8_Z_SROA_CAST:%.*]] = bitcast i8* [[Z:%.*]] to <2 x float>*		; CHECK-NEXT: [[A_8_Z_SROA_CAST:%.*]] = bitcast i8* [[Z:%.*]] to <2 x float>*
; CHECK-NEXT: [[A_8_COPYLOAD:%.*]] = load <2 x float>, <2 x float>* [[A_8_Z_SROA_CAST]], align 1		; CHECK-NEXT: [[A_8_COPYLOAD:%.*]] = load <2 x float>, <2 x float>* [[A_8_Z_SROA_CAST]], align 1
; CHECK-NEXT: [[A_8_VEC_EXPAND:%.*]] = shufflevector <2 x float> [[A_8_COPYLOAD]], <2 x float> poison, <4 x i32> <i32 undef, i32 undef, i32 0, i32 1>		; CHECK-NEXT: [[A_8_VEC_EXPAND:%.*]] = shufflevector <2 x float> [[A_8_COPYLOAD]], <2 x float> poison, <4 x i32> <i32 undef, i32 undef, i32 0, i32 1>
; CHECK-NEXT: [[A_8_VECBLEND:%.*]] = select <4 x i1> <i1 false, i1 false, i1 true, i1 true>, <4 x float> [[A_8_VEC_EXPAND]], <4 x float> [[A_4_VECBLEND]]		; CHECK-NEXT: [[A_8_VECBLEND:%.*]] = select <4 x i1> <i1 false, i1 false, i1 true, i1 true>, <4 x float> [[A_8_VEC_EXPAND]], <4 x float> [[A_4_VECBLEND]]
(30 lines hidden)	entry:

ret <4 x float> %ret		ret <4 x float> %ret
}		}

define i32 @PR14212() {		define i32 @PR14212() {
; CHECK-LABEL: @PR14212(		; CHECK-LABEL: @PR14212(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[TMP0:%.*]] = bitcast <3 x i8> undef to i24		; CHECK-NEXT: [[TMP0:%.*]] = bitcast <3 x i8> undef to i24
; CHECK-NEXT: [[RETVAL_SROA_2_0_INSERT_EXT:%.*]] = zext i8 undef to i32		; CHECK-NEXT: [[RETVAL_SROA_2_0_INSERT_EXT:%.*]] = zext i8 poison to i32
; CHECK-NEXT: [[RETVAL_SROA_2_0_INSERT_SHIFT:%.*]] = shl i32 [[RETVAL_SROA_2_0_INSERT_EXT]], 24		; CHECK-NEXT: [[RETVAL_SROA_2_0_INSERT_SHIFT:%.*]] = shl i32 [[RETVAL_SROA_2_0_INSERT_EXT]], 24
; CHECK-NEXT: [[RETVAL_SROA_2_0_INSERT_MASK:%.*]] = and i32 undef, 16777215		; CHECK-NEXT: [[RETVAL_SROA_2_0_INSERT_MASK:%.*]] = and i32 undef, 16777215
; CHECK-NEXT: [[RETVAL_SROA_2_0_INSERT_INSERT:%.*]] = or i32 [[RETVAL_SROA_2_0_INSERT_MASK]], [[RETVAL_SROA_2_0_INSERT_SHIFT]]		; CHECK-NEXT: [[RETVAL_SROA_2_0_INSERT_INSERT:%.*]] = or i32 [[RETVAL_SROA_2_0_INSERT_MASK]], [[RETVAL_SROA_2_0_INSERT_SHIFT]]
; CHECK-NEXT: [[RETVAL_0_INSERT_EXT:%.*]] = zext i24 [[TMP0]] to i32		; CHECK-NEXT: [[RETVAL_0_INSERT_EXT:%.*]] = zext i24 [[TMP0]] to i32
; CHECK-NEXT: [[RETVAL_0_INSERT_MASK:%.*]] = and i32 [[RETVAL_SROA_2_0_INSERT_INSERT]], -16777216		; CHECK-NEXT: [[RETVAL_0_INSERT_MASK:%.*]] = and i32 [[RETVAL_SROA_2_0_INSERT_INSERT]], -16777216
; CHECK-NEXT: [[RETVAL_0_INSERT_INSERT:%.*]] = or i32 [[RETVAL_0_INSERT_MASK]], [[RETVAL_0_INSERT_EXT]]		; CHECK-NEXT: [[RETVAL_0_INSERT_INSERT:%.*]] = or i32 [[RETVAL_0_INSERT_MASK]], [[RETVAL_0_INSERT_EXT]]
; CHECK-NEXT: ret i32 [[RETVAL_0_INSERT_INSERT]]		; CHECK-NEXT: ret i32 [[RETVAL_0_INSERT_INSERT]]
;		;
(30 lines hidden)	entry:

ret <2 x i8> %vec		ret <2 x i8> %vec
}		}

define i32 @PR14349.2(<2 x i8> %x) {		define i32 @PR14349.2(<2 x i8> %x) {
; CHECK-LABEL: @PR14349.2(		; CHECK-LABEL: @PR14349.2(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[TMP0:%.*]] = bitcast <2 x i8> [[X:%.*]] to i16		; CHECK-NEXT: [[TMP0:%.*]] = bitcast <2 x i8> [[X:%.*]] to i16
; CHECK-NEXT: [[A_SROA_2_0_INSERT_EXT:%.*]] = zext i16 undef to i32		; CHECK-NEXT: [[A_SROA_2_0_INSERT_EXT:%.*]] = zext i16 poison to i32
; CHECK-NEXT: [[A_SROA_2_0_INSERT_SHIFT:%.*]] = shl i32 [[A_SROA_2_0_INSERT_EXT]], 16		; CHECK-NEXT: [[A_SROA_2_0_INSERT_SHIFT:%.*]] = shl i32 [[A_SROA_2_0_INSERT_EXT]], 16
; CHECK-NEXT: [[A_SROA_2_0_INSERT_MASK:%.*]] = and i32 undef, 65535		; CHECK-NEXT: [[A_SROA_2_0_INSERT_MASK:%.*]] = and i32 undef, 65535
; CHECK-NEXT: [[A_SROA_2_0_INSERT_INSERT:%.*]] = or i32 [[A_SROA_2_0_INSERT_MASK]], [[A_SROA_2_0_INSERT_SHIFT]]		; CHECK-NEXT: [[A_SROA_2_0_INSERT_INSERT:%.*]] = or i32 [[A_SROA_2_0_INSERT_MASK]], [[A_SROA_2_0_INSERT_SHIFT]]
; CHECK-NEXT: [[A_SROA_0_0_INSERT_EXT:%.*]] = zext i16 [[TMP0]] to i32		; CHECK-NEXT: [[A_SROA_0_0_INSERT_EXT:%.*]] = zext i16 [[TMP0]] to i32
; CHECK-NEXT: [[A_SROA_0_0_INSERT_MASK:%.*]] = and i32 [[A_SROA_2_0_INSERT_INSERT]], -65536		; CHECK-NEXT: [[A_SROA_0_0_INSERT_MASK:%.*]] = and i32 [[A_SROA_2_0_INSERT_INSERT]], -65536
; CHECK-NEXT: [[A_SROA_0_0_INSERT_INSERT:%.*]] = or i32 [[A_SROA_0_0_INSERT_MASK]], [[A_SROA_0_0_INSERT_EXT]]		; CHECK-NEXT: [[A_SROA_0_0_INSERT_INSERT:%.*]] = or i32 [[A_SROA_0_0_INSERT_MASK]], [[A_SROA_0_0_INSERT_EXT]]
; CHECK-NEXT: ret i32 [[A_SROA_0_0_INSERT_INSERT]]		; CHECK-NEXT: ret i32 [[A_SROA_0_0_INSERT_INSERT]]
;		;
(68 lines hidden)	entry:
ret i32 %tmp4		ret i32 %tmp4
}		}

define <2 x i32> @test9(i32 %x, i32 %y) {		define <2 x i32> @test9(i32 %x, i32 %y) {
; Ensure that we can promote an alloca that doesn't mention a vector type based		; Ensure that we can promote an alloca that doesn't mention a vector type based
; on a single load with a vector type.		; on a single load with a vector type.
; CHECK-LABEL: @test9(		; CHECK-LABEL: @test9(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[A_SROA_0_0_VEC_INSERT:%.*]] = insertelement <2 x i32> undef, i32 [[X:%.*]], i32 0		; CHECK-NEXT: [[A_SROA_0_0_VEC_INSERT:%.*]] = insertelement <2 x i32> poison, i32 [[X:%.*]], i32 0
; CHECK-NEXT: [[A_SROA_0_4_VEC_INSERT:%.*]] = insertelement <2 x i32> [[A_SROA_0_0_VEC_INSERT]], i32 [[Y:%.*]], i32 1		; CHECK-NEXT: [[A_SROA_0_4_VEC_INSERT:%.*]] = insertelement <2 x i32> [[A_SROA_0_0_VEC_INSERT]], i32 [[Y:%.*]], i32 1
; CHECK-NEXT: ret <2 x i32> [[A_SROA_0_4_VEC_INSERT]]		; CHECK-NEXT: ret <2 x i32> [[A_SROA_0_4_VEC_INSERT]]
;		;
entry:		entry:
%a = alloca i64		%a = alloca i64
%a.vec = bitcast i64* %a to <2 x i32>*		%a.vec = bitcast i64* %a to <2 x i32>*
%a.i32 = bitcast i64* %a to i32*		%a.i32 = bitcast i64* %a to i32*

(76 lines hidden)
