This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/CodeGen/SelectionDAG/
-
CodeGen/
-
SelectionDAG/
1
FunctionLoweringInfo.cpp
-
test/
-
CodeGen/
-
AArch64/
1/2
preferred-alignment.ll
-
seh-finally.ll
-
AMDGPU/
-
call-argument-types.ll
-
frame-index-elimination.ll
-
spill-scavenge-offset.ll
-
ARM/
-
ssp-data-layout.ll
-
BPF/
-
undef.ll
-
Mips/
-
Fast-ISel/
-
fastalloca.ll
-
atomic64.ll
-
cconv/
-
byval.ll
-
return-struct.ll
-
largeimmprinting.ll
-
o32_cc_byval.ll
-
NVPTX/
-
lower-byval-args.ll
-
PowerPC/
-
aix-cc-byval.ll
-
aix-sret-param.ll
-
byval.ll
-
structsinregs.ll
-
varargs-struct-float.ll
-
RISCV/
-
calling-conv-ilp32-ilp32f-ilp32d-common.ll
-
frame.ll
-
mem64.ll
-
vararg.ll
-
Thumb2/
-
mve-stack.ll
-
VE/Scalar/
-
Scalar/
-
atomic_cmp_swap.ll
-
atomic_load.ll
-
atomic_swap.ll
-
WebAssembly/
-
PR40172.ll
-
X86/
-
dbg-changes-codegen-branch-folding.ll
-
fast-isel-call.ll
-
load-local-v3i129.ll
-
pr44140.ll
-
ssp-data-layout.ll
-
win-cleanuppad.ll
-
x86-mixed-alignment-dagcombine.ll
-
DebugInfo/
-
AArch64/
-
frameindices.ll
-
NVPTX/
-
dbg-declare-alloca.ll
-
X86/
-
dbg-addr.ll
-
dbg-declare-alloca.ll
-
sret.ll

Differential D135462

[SelectionDAG] Do not second-guess alignment for alloca
ClosedPublic

Authored by asavonic on Oct 7 2022, 10:14 AM.

Download Raw Diff

Details

Reviewers

efriedma
sdesmalen

Commits

rGc65b4d64d4b0: [SelectionDAG] Do not second-guess alignment for alloca
rGffedf47d8b79: [SelectionDAG] Do not second-guess alignment for alloca

Summary

Alignment of an alloca in IR can be lower than the preferred alignment
on purpose, but this override essentially treats the preferred
alignment as the minimum alignment.

The patch changes this behavior to always use the specified
alignment. If alignment is not set explicitly in LLVM IR, it is set to
DL.getPrefTypeAlign(Ty) in computeAllocaDefaultAlign.

Tests are changed as well: explicit alignment is increased to match
the preferred alignment if it changes output, or omitted when it is
hard to determine the right value (e.g. for pointers, some structs, or
weird types).

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

asavonic created this revision.Oct 7 2022, 10:14 AM

Herald added a project: Restricted Project. · View Herald TranscriptOct 7 2022, 10:14 AM

Herald added subscribers: kosarev, mattd, gchakrabarti and 36 others. · View Herald Transcript

asavonic requested review of this revision.Oct 7 2022, 10:14 AM

Herald added a project: Restricted Project. · View Herald TranscriptOct 7 2022, 10:14 AM

Herald added subscribers: llvm-commits, • pcwang-thead, MaskRay and 2 others. · View Herald Transcript

A similar issue was discussed and fixed in D79532, where promotion of alignment caused stack realignment.
This patch now removes promotion of alignment, so we always use the specified alignment.

Harbormaster completed remote builds in B190969: Diff 466107.Oct 7 2022, 11:04 AM

Given we're de-emphasizing the type of allocas, this probably makes sense? We should be careful that this doesn't have unexpected effects, though; decreasing the alignment of an alloca could affect code generation. (For example, see D134282.)

llvm/lib/CodeGen/SelectionDAG/FunctionLoweringInfo.cpp
134–135	Outdated comment?
llvm/test/CodeGen/AArch64/preferred-alignment.ll
30	Please explicitly specify alignment where possible. I though the preferred alignment of i8 is 1? I can't see how this change has any practical effect.

Removed an outdated comment
Added another testcase to AArch64/preferred-alignment.ll

In D135462#3843425, @efriedma wrote:

Given we're de-emphasizing the type of allocas, this probably makes sense? We should be careful that this doesn't have unexpected effects, though; decreasing the alignment of an alloca could affect code generation. (For example, see D134282.)

Yes, this can cause some issues (as indicated by test changes), because alignment on alloca suddenly becomes more important.
Unfortunately, I don't see any other way how we can support alignment that is lower than the preferred alignment.

llvm/test/CodeGen/AArch64/preferred-alignment.ll
30	Apparently all 3 allocas have preferred alignment of 4. It sound like the test is supposed to verify the preferred alignment. If we keep explicit alignment, it will take precedence over the preferred alignment. So I suggest to have two tests: one where alignment is implicit, and another where alignment is set to whatever value we expect the preferred alignment to be.

This makes sense to me. InstCombine will still raise the alignment based on load/stores so long as it does not require stack realignment.

Harbormaster completed remote builds in B191242: Diff 466460.Oct 10 2022, 4:07 AM

Rebased.

@efriedma, can you please check this patch again?

LGTM

This revision is now accepted and ready to land.Dec 12 2022, 9:28 AM

Harbormaster completed remote builds in B202605: Diff 482160.Dec 12 2022, 10:24 AM

This revision was landed with ongoing or failed builds.Dec 15 2022, 7:18 AM

Closed by commit rGffedf47d8b79: [SelectionDAG] Do not second-guess alignment for alloca (authored by asavonic). · Explain Why

This revision was automatically updated to reflect the committed changes.

asavonic added a commit: rGffedf47d8b79: [SelectionDAG] Do not second-guess alignment for alloca.

hello , this commit is breaking our buildbot : could you address please ? https://lab.llvm.org/buildbot/#/builders/193

ronlieb added a reverting change: rG38f1abef8604: Revert "[SelectionDAG] Do not second-guess alignment for alloca".Dec 15 2022, 8:55 AM

In D135462#3998037, @ronlieb wrote:

hello , this commit is breaking our buildbot : could you address please ? https://lab.llvm.org/buildbot/#/builders/193

It sounds like the frontend does not set the right alignment in IR:

Libomptarget message: explicit extension not allowed: host address specified is 0x00007fff01df78a0 (12 bytes), but device allocation maps to host at 0x00007fff01df78a4 (8 bytes)

@ronlieb, do you know who can help with this on OpenMP side?

asavonic reopened this revision.Dec 15 2022, 11:15 AM

This revision is now accepted and ready to land.Dec 15 2022, 11:15 AM

@jhuber6 please help with this one ? thanks

Thanks, I've wanted to fix this for a long time. I have alignment decreasing optimizations I would like to perform

lkail added a subscriber: lkail.Dec 16 2022, 7:17 AM

I checked OpenMP tests, and the difference is pretty much what was expected. This is what we have as an input (the patch does not change the IR):

// x86_64-pc-linux-gnu :: mapping/ompx_hold/struct.c
struct S {
  int i;
  int j;
} s;
// CHECK: presence of s, s.i, s.j: 0, 0, 0
CHECK_PRESENCE(s, s.i, s.j);

IR:

%s = alloca %struct.S, align 4
%call = call i32 @omp_get_default_device()
%call1 = call i32 @omp_target_is_present(ptr noundef %s, i32 noundef %call)

The assembly is different: we keep alignment of 4 and do not increase it to the preferred value of 8.
Before the patch:

leaq    -16(%rbp), %rdi
callq   omp_target_is_present@PLT

After the patch:

leaq    -12(%rbp), %rdi
callq   omp_target_is_present@PLT

Libomptarget message: explicit extension not allowed: host address specified is 0x00007fff03109220 (12 bytes), but device allocation maps to host at 0x00007fff03109224 (8 bytes)

If I explicitly set __attribute__((aligned(8))) to the declaration of s, assembly before and after the patch becomes equivalent, so the problem should go away. Apparently there is some discrepancy in alignment that the frontend (or the OpenMP runtime) expects, but it is not obvious where we need to fix this.

In D135462#4006016, @asavonic wrote:
I checked OpenMP tests, and the difference is pretty much what was expected. This is what we have as an input (the patch does not change the IR):
// x86_64-pc-linux-gnu :: mapping/ompx_hold/struct.c
struct S {
  int i;
  int j;
} s;
// CHECK: presence of s, s.i, s.j: 0, 0, 0
CHECK_PRESENCE(s, s.i, s.j);
IR:
%s = alloca %struct.S, align 4
%call = call i32 @omp_get_default_device()
%call1 = call i32 @omp_target_is_present(ptr noundef %s, i32 noundef %call)
The assembly is different: we keep alignment of 4 and do not increase it to the preferred value of 8.
Before the patch:
leaq    -16(%rbp), %rdi
callq   omp_target_is_present@PLT
After the patch:
leaq    -12(%rbp), %rdi
callq   omp_target_is_present@PLT

Libomptarget message: explicit extension not allowed: host address specified is 0x00007fff03109220 (12 bytes), but device allocation maps to host at 0x00007fff03109224 (8 bytes)
If I explicitly set __attribute__((aligned(8))) to the declaration of s, assembly before and after the patch becomes equivalent, so the problem should go away. Apparently there is some discrepancy in alignment that the frontend (or the OpenMP runtime) expects, but it is not obvious where we need to fix this.

The error you get comes from the associated pointer we find extending before or after where we expect it to. My guess is that we think the device-side address is also four bytes, but is actually eight causing the discrepancy. Device in this case being the x86_64 code we generated for offloading, you should be able to see both using flags like here https://godbolt.org/z/EGzq7Tef4. I'm not sure why OpenMP seems to have a different preferred alignment. Total spitballing, but all the arguments to a target kernel in OpenMP are just treated as uintptr_t, might be always rounding up to 8 on the device side and this patch makes us not respect that anymore.

It appears that the issue is caused by libomptarget mapping code that adds padding if the begin address does not have alignment of 8 (search for "alignment" in omptarget.cpp). The problem is that the padding is not applied consistently, so we may have a mapping without the padding (size == 8), and then get a request for a mapping with the padding (size == 12). Notice that we don't have Using a padding of 4 bytes for begin address for the first mapping, and we have it for the same exact pointer for the second mapping.

check: parent DynRefCount=1 is not sufficient for transfer
Libomptarget --> Entering data begin region for device -1 with 1 mappings
[...]
Libomptarget --> Entry  0: Base=0x00007ffc9b2581e4, Begin=0x00007ffc9b2581e4, Size=8, Type=0x2003, Name=unknown
Libomptarget --> Looking up mapping(HstPtrBegin=0x00007ffc9b2581e4, Size=8)...
Libomptarget --> Creating new map entry with HstPtrBase=0x00007ffc9b2581e4, HstPtrBegin=0x00007ffc9b2581e4, TgtPtrBegin=0x0000561d0739b390, Size=8, DynRefCount=0, HoldRefCount=1, Name=unknown
Libomptarget --> Moving 8 bytes (hst:0x00007ffc9b2581e4) -> (tgt:0x0000561d0739b390)
[...]
Libomptarget --> Entering target region for device -1 with entry point 0x0000561d059ca298
[...]
Libomptarget --> Entry  0: Base=0x00007ffc9b2581e4, Begin=0x00007ffc9b2581e4, Size=8, Type=0x20, Name=unknown
Libomptarget --> Entry  1: Base=0x00007ffc9b2581e4, Begin=0x00007ffc9b2581e4, Size=4, Type=0x1000000000002, Name=unknown
Libomptarget --> Entry  2: Base=0x00007ffc9b2581e4, Begin=0x00007ffc9b2581e8, Size=4, Type=0x1000000000002, Name=unknown
Libomptarget --> loop trip count is 0.
Libomptarget --> Using a padding of 4 bytes for begin address 0x00007ffc9b2581e4
Libomptarget --> Looking up mapping(HstPtrBegin=0x00007ffc9b2581e0, Size=12)...
Libomptarget --> WARNING: Pointer is not mapped but section extends into already mapped data
Libomptarget message: explicit extension not allowed: host address specified is 0x00007ffc9b2581e0 (12 bytes), but device allocation maps to host at 0x00007ffc9b2581e4 (8 bytes)

I'm not sure how to fix this issue. The obvious solution is to pass the padding value to DeviceTy::getTargetPointer, and ignore it if we already have a mapping without it (since alignment requirements are already satisfied, right?). Another option is to apply padding more consistently, but I don't know enough about OpenMP to figure out what exactly is wrong with the current implementation.

@grokos, @Hahnfeld, do you have any suggestions how to fix this issue? The handling of alignment was added in D44186.

Hi @asavonic, after reading through the comments here and peeking into D44186, I think your analysis is correct: libomptarget assumes that it can pad the start address of every member to 8 bytes and stay within the requested size, which isn't true anymore. I'm not sure if it would be possible to "fix up" this mistake by increasing the padding requirements in the IR generated by the Clang frontend (do we control all allocas and can they also be passed in by other parts of code?). I guess the better approach would indeed be to correct the implementation in the runtime library. That said, I haven't looked into the OpenMP offloading code for quite some time, more active members include @jdoerfert @jhuber6 @ronlieb (all of which have been pinged already...)

From the outside, and remembering many discussions and changes around alignment of struct members, a viable approach may be to disable the padding code in libomptarget and see which cases actually break. I wouldn't be super surprised if some / most of the complexity isn't actually needed anymore and is dealt with (more correctly) in the frontend.

pavelkopyl mentioned this in D142508: [OpenMP][libomptarget] Fix alignment calculation for mapping struct members..Jan 24 2023, 2:30 PM

pavelkopyl mentioned this in D142586: [OpenMP][FIX] Do not overalign mapped structures.Jan 26 2023, 10:56 AM

Is there any progress with this patch? This patch can fix our downstream test failure which is caused by insistent alignment between IR and backend, thus affects alias analysis.

Herald added subscribers: jobnoorman, luke. · View Herald TranscriptFeb 7 2023, 11:01 PM

In D135462#4112048, @lkail wrote:

Is there any progress with this patch? This patch can fix our downstream test failure which is caused by insistent alignment between IR and backend, thus affects alias analysis.

Yes, the regression in OpenMP was fixed, and I plan to land this patch today.

Closed by commit rGc65b4d64d4b0: [SelectionDAG] Do not second-guess alignment for alloca (authored by asavonic). · Explain WhyFeb 9 2023, 7:45 AM

This revision was automatically updated to reflect the committed changes.

asavonic added a commit: rGc65b4d64d4b0: [SelectionDAG] Do not second-guess alignment for alloca.

The patch is merged (again). Please let me know if it causes any regressions for in-tree targets or tests.

In D135462#4115559, @asavonic wrote:

The patch is merged (again). Please let me know if it causes any regressions for in-tree targets or tests.

An early heads up: we started seeing some test failures after this commit, but I can't yet tell whether the commit or the code is at fault.

In D135462#4125805, @alexfh wrote:

In D135462#4115559, @asavonic wrote:

The patch is merged (again). Please let me know if it causes any regressions for in-tree targets or tests.

An early heads up: we started seeing some test failures after this commit, but I can't yet tell whether the commit or the code is at fault.

The only data point so far is that failures are happening on AArch64, which is alignment-sensitive.

bgraur added a subscriber: bgraur.Feb 14 2023, 6:33 AM

In D135462#4125826, @alexfh wrote:

In D135462#4125805, @alexfh wrote:

In D135462#4115559, @asavonic wrote:

The patch is merged (again). Please let me know if it causes any regressions for in-tree targets or tests.

An early heads up: we started seeing some test failures after this commit, but I can't yet tell whether the commit or the code is at fault.

The only data point so far is that failures are happening on AArch64, which is alignment-sensitive.

False alarm. All the problems we've seen so far are due to UB in the code.

Hello this commit is causing a compiler crash for the following example:
https://llvm.godbolt.org/z/xajYWoa8K

In D135462#4347294, @zjaffal wrote:

Hello this commit is causing a compiler crash for the following example:
https://llvm.godbolt.org/z/xajYWoa8K

That seems to generate MIR where there are adjacent scaled and unscaled 32-bit stores of wzr, but AArch64LoadStoreOpt::mergeNarrowZeroStores looks pretty broken if you mix scaled-ness, as it determines IsScaled based on just one of the instructions and then uses that to determine what the immediate means. Either those need to be skipped over or the function needs to learn how to handle that mix.

I've bisected a broken test in libcxx for the mingw/x86_64 target down to this commit - see https://github.com/llvm/llvm-project/issues/64253 for a full bug report including a reduced reproducer.

Herald added a subscriber: wangpc. · View Herald TranscriptJul 31 2023, 4:12 AM

I've bisected a broken test in libcxx for the mingw/x86_64 target down to this commit - see https://github.com/llvm/llvm-project/issues/64253 for a full bug report including a reduced reproducer.

Most likely exposing an issue elsewhere. All this patch does is change the alignment of allocas, so most likely there's some unrelated problem related to or exposed by the stack layout. (Or maybe some other optimization is making bad assumptions about alloca alignment, but that's unlikely given the way the APIs in question work.) Probably using opt-bisect-limit to bisect the exact optimization in question would help narrow down the issue.

In D135462#4548436, @efriedma wrote:

I've bisected a broken test in libcxx for the mingw/x86_64 target down to this commit - see https://github.com/llvm/llvm-project/issues/64253 for a full bug report including a reduced reproducer.

Most likely exposing an issue elsewhere. All this patch does is change the alignment of allocas, so most likely there's some unrelated problem related to or exposed by the stack layout. (Or maybe some other optimization is making bad assumptions about alloca alignment, but that's unlikely given the way the APIs in question work.) Probably using opt-bisect-limit to bisect the exact optimization in question would help narrow down the issue.

Thanks, I'll try to look into that and see if I can pinpoint anything. The surprising thing here though, is that the error occurs in unoptimized builds, while when building with optimizations enabled, the code turns out to work just right.

Revision Contents

Path

Size

llvm/

lib/

CodeGen/

SelectionDAG/

FunctionLoweringInfo.cpp

15 lines

test/

CodeGen/

AArch64/

preferred-alignment.ll

28 lines

seh-finally.ll

2 lines

AMDGPU/

call-argument-types.ll

6 lines

frame-index-elimination.ll

2 lines

spill-scavenge-offset.ll

2 lines

ARM/

ssp-data-layout.ll

4 lines

BPF/

undef.ll

2 lines

Mips/

Fast-ISel/

fastalloca.ll

4 lines

atomic64.ll

4 lines

cconv/

10 lines

2 lines

2 lines

2 lines

NVPTX/

lower-byval-args.ll

2 lines

PowerPC/

4 lines

2 lines

2 lines

28 lines

varargs-struct-float.ll

2 lines

RISCV/

calling-conv-ilp32-ilp32f-ilp32d-common.ll

2 lines

frame.ll

2 lines

mem64.ll

2 lines

vararg.ll

18 lines

Thumb2/

mve-stack.ll

4 lines

VE/

Scalar/

atomic_cmp_swap.ll

14 lines

atomic_load.ll

14 lines

atomic_swap.ll

14 lines

WebAssembly/

PR40172.ll

2 lines

X86/

dbg-changes-codegen-branch-folding.ll

4 lines

2 lines

2 lines

2 lines

34 lines

4 lines

x86-mixed-alignment-dagcombine.ll

8 lines

DebugInfo/

AArch64/

frameindices.ll

2 lines

NVPTX/

dbg-declare-alloca.ll

2 lines

X86/

dbg-addr.ll

4 lines

dbg-declare-alloca.ll

2 lines

sret.ll

14 lines

Diff 466460

llvm/lib/CodeGen/SelectionDAG/FunctionLoweringInfo.cpp

Show First 20 Lines • Show All 125 Lines • ▼ Show 20 Lines	void FunctionLoweringInfo::set(const Function &fn, MachineFunction &mf,

// Initialize the mapping of values to registers. This is only set up for		// Initialize the mapping of values to registers. This is only set up for
// instruction values that are used outside of the block that defines		// instruction values that are used outside of the block that defines
// them.		// them.
const Align StackAlign = TFI->getStackAlign();		const Align StackAlign = TFI->getStackAlign();
for (const BasicBlock &BB : *Fn) {		for (const BasicBlock &BB : *Fn) {
for (const Instruction &I : BB) {		for (const Instruction &I : BB) {
if (const AllocaInst *AI = dyn_cast<AllocaInst>(&I)) {		if (const AllocaInst *AI = dyn_cast<AllocaInst>(&I)) {
Type *Ty = AI->getAllocatedType();		Type *Ty = AI->getAllocatedType();
Align TyPrefAlign = MF->getDataLayout().getPrefTypeAlign(Ty);		Align Alignment = AI->getAlign();
		efriedmaUnsubmitted Not Done Reply Inline Actions Outdated comment? efriedma: Outdated comment?
// The "specified" alignment is the alignment written on the alloca,
// or the preferred alignment of the type if none is specified.
//
// (Unspecified alignment on allocas will be going away soon.)
Align SpecifiedAlign = AI->getAlign();

// If the preferred alignment of the type is higher than the specified
// alignment of the alloca, promote the alignment, as long as it doesn't
// require realigning the stack.
//
// FIXME: Do we really want to second-guess the IR in isel?
Align Alignment =
std::max(std::min(TyPrefAlign, StackAlign), SpecifiedAlign);

// Static allocas can be folded into the initial stack frame		// Static allocas can be folded into the initial stack frame
// adjustment. For targets that don't realign the stack, don't		// adjustment. For targets that don't realign the stack, don't
// do this if there is an extra alignment requirement.		// do this if there is an extra alignment requirement.
if (AI->isStaticAlloca() &&		if (AI->isStaticAlloca() &&
(TFI->isStackRealignable() \|\| (Alignment <= StackAlign))) {		(TFI->isStackRealignable() \|\| (Alignment <= StackAlign))) {
const ConstantInt *CUI = cast<ConstantInt>(AI->getArraySize());		const ConstantInt *CUI = cast<ConstantInt>(AI->getArraySize());
uint64_t TySize =		uint64_t TySize =
▲ Show 20 Lines • Show All 422 Lines • Show Last 20 Lines

llvm/test/CodeGen/AArch64/preferred-alignment.ll

	; RUN: llc -mtriple=aarch64 -O0 -fast-isel < %s \| FileCheck %s			; RUN: llc -mtriple=aarch64 -O0 -fast-isel < %s \| FileCheck %s

	; Function Attrs: nounwind			; Function Attrs: nounwind
	define i32 @foo() #0 {			define i32 @foo() #0 {
	entry:			entry:
	%c = alloca i8, align 1			%c = alloca i8
	; CHECK: add x0, sp, #12			; CHECK: add x0, sp, #12
	%s = alloca i16, align 2			%s = alloca i16
				; CHECK-NEXT: add x1, sp, #8
				%i = alloca i32
				; CHECK-NEXT: add x2, sp, #4
				%call = call i32 @baz(i8* %c, i16* %s, i32* %i)
				%0 = load i8, i8* %c, align 1
				%conv = zext i8 %0 to i32
				%add = add nsw i32 %call, %conv
				%1 = load i16, i16* %s, align 2
				%conv1 = sext i16 %1 to i32
				%add2 = add nsw i32 %add, %conv1
				%2 = load i32, i32* %i, align 4
				%add3 = add nsw i32 %add2, %2
				ret i32 %add3
				}

				define i32 @bar() #0 {
				entry:
				%c = alloca i8, align 4
				; CHECK: add x0, sp, #12
				%s = alloca i16, align 4
	; CHECK-NEXT: add x1, sp, #8			; CHECK-NEXT: add x1, sp, #8
	%i = alloca i32, align 4			%i = alloca i32, align 4
				efriedmaUnsubmitted Not Done Reply Inline Actions Please explicitly specify alignment where possible. I though the preferred alignment of i8 is 1? I can't see how this change has any practical effect. efriedma: Please explicitly specify alignment where possible. I though the preferred alignment of i8 is…
				asavonicAuthorUnsubmitted Done Reply Inline Actions Apparently all 3 allocas have preferred alignment of 4. It sound like the test is supposed to verify the preferred alignment. If we keep explicit alignment, it will take precedence over the preferred alignment. So I suggest to have two tests: one where alignment is implicit, and another where alignment is set to whatever value we expect the preferred alignment to be. asavonic: Apparently all 3 allocas have preferred alignment of 4. It sound like the test is supposed to…
	; CHECK-NEXT: add x2, sp, #4			; CHECK-NEXT: add x2, sp, #4
	%call = call i32 @bar(i8* %c, i16* %s, i32* %i)			%call = call i32 @baz(i8* %c, i16* %s, i32* %i)
	%0 = load i8, i8* %c, align 1			%0 = load i8, i8* %c, align 1
	%conv = zext i8 %0 to i32			%conv = zext i8 %0 to i32
	%add = add nsw i32 %call, %conv			%add = add nsw i32 %call, %conv
	%1 = load i16, i16* %s, align 2			%1 = load i16, i16* %s, align 2
	%conv1 = sext i16 %1 to i32			%conv1 = sext i16 %1 to i32
	%add2 = add nsw i32 %add, %conv1			%add2 = add nsw i32 %add, %conv1
	%2 = load i32, i32* %i, align 4			%2 = load i32, i32* %i, align 4
	%add3 = add nsw i32 %add2, %2			%add3 = add nsw i32 %add2, %2
	ret i32 %add3			ret i32 %add3
	}			}

	declare i32 @bar(i8, i16, i32*) #1			declare i32 @baz(i8, i16, i32*) #1

	attributes #0 = { nounwind "frame-pointer"="none" }			attributes #0 = { nounwind "frame-pointer"="none" }
	attributes #1 = { "frame-pointer"="none" }			attributes #1 = { "frame-pointer"="none" }

llvm/test/CodeGen/AArch64/seh-finally.ll

	Show All 36 Lines
	; CHECK-LABEL: simple_seh			; CHECK-LABEL: simple_seh
	; CHECK: add x29, sp, #16			; CHECK: add x29, sp, #16
	; CHECK: mov x0, #-2			; CHECK: mov x0, #-2
	; CHECK: stur x0, [x29, #16]			; CHECK: stur x0, [x29, #16]
	; CHECK: .set .Lsimple_seh$frame_escape_0, -8			; CHECK: .set .Lsimple_seh$frame_escape_0, -8
	; CHECK: ldur w0, [x29, #-8]			; CHECK: ldur w0, [x29, #-8]
	; CHECK: bl foo			; CHECK: bl foo

	%o = alloca %struct.S, align 4			%o = alloca %struct.S, align 8
	call void (...) @llvm.localescape(%struct.S* %o)			call void (...) @llvm.localescape(%struct.S* %o)
	%x = getelementptr inbounds %struct.S, %struct.S* %o, i32 0, i32 0			%x = getelementptr inbounds %struct.S, %struct.S* %o, i32 0, i32 0
	%0 = load i32, i32* %x, align 4			%0 = load i32, i32* %x, align 4
	invoke void @foo(i32 %0) #5			invoke void @foo(i32 %0) #5
	to label %invoke.cont unwind label %ehcleanup			to label %invoke.cont unwind label %ehcleanup

	invoke.cont: ; preds = %entry			invoke.cont: ; preds = %entry
	%1 = call i8* @llvm.localaddress()			%1 = call i8* @llvm.localaddress()
	▲ Show 20 Lines • Show All 230 Lines • Show Last 20 Lines

llvm/test/CodeGen/AMDGPU/call-argument-types.ll

	Show First 20 Lines • Show All 665 Lines • ▼ Show 20 Lines
	; HSA-DAG: buffer_store_dword [[RELOAD_VAL1]], off, s[0:3], [[SP]] offset:4			; HSA-DAG: buffer_store_dword [[RELOAD_VAL1]], off, s[0:3], [[SP]] offset:4

	; MESA-DAG: buffer_store_dword [[RELOAD_VAL0]], off, s[36:39], [[SP]]{{$}}			; MESA-DAG: buffer_store_dword [[RELOAD_VAL0]], off, s[36:39], [[SP]]{{$}}
	; MESA-DAG: buffer_store_dword [[RELOAD_VAL1]], off, s[36:39], [[SP]] offset:4			; MESA-DAG: buffer_store_dword [[RELOAD_VAL1]], off, s[36:39], [[SP]] offset:4

	; GCN-NEXT: s_swappc_b64			; GCN-NEXT: s_swappc_b64
	; GCN-NOT: [[SP]]			; GCN-NOT: [[SP]]
	define amdgpu_kernel void @test_call_external_void_func_byval_struct_i8_i32() #0 {			define amdgpu_kernel void @test_call_external_void_func_byval_struct_i8_i32() #0 {
	%val = alloca { i8, i32 }, align 4, addrspace(5)			%val = alloca { i8, i32 }, align 8, addrspace(5)
	%gep0 = getelementptr inbounds { i8, i32 }, { i8, i32 } addrspace(5)* %val, i32 0, i32 0			%gep0 = getelementptr inbounds { i8, i32 }, { i8, i32 } addrspace(5)* %val, i32 0, i32 0
	%gep1 = getelementptr inbounds { i8, i32 }, { i8, i32 } addrspace(5)* %val, i32 0, i32 1			%gep1 = getelementptr inbounds { i8, i32 }, { i8, i32 } addrspace(5)* %val, i32 0, i32 1
	store i8 3, i8 addrspace(5)* %gep0			store i8 3, i8 addrspace(5)* %gep0
	store i32 8, i32 addrspace(5)* %gep1			store i32 8, i32 addrspace(5)* %gep1
	call void @external_void_func_byval_struct_i8_i32({ i8, i32 } addrspace(5)* byval({ i8, i32 }) %val)			call void @external_void_func_byval_struct_i8_i32({ i8, i32 } addrspace(5)* byval({ i8, i32 }) %val)
	ret void			ret void
	}			}

	Show All 14 Lines
	; GCN: s_swappc_b64			; GCN: s_swappc_b64
	; GCN-DAG: buffer_load_ubyte [[LOAD_OUT_VAL0:v[0-9]+]], off, s{{\[[0-9]+:[0-9]+\]}}, 0 offset:16			; GCN-DAG: buffer_load_ubyte [[LOAD_OUT_VAL0:v[0-9]+]], off, s{{\[[0-9]+:[0-9]+\]}}, 0 offset:16
	; GCN-DAG: buffer_load_dword [[LOAD_OUT_VAL1:v[0-9]+]], off, s{{\[[0-9]+:[0-9]+\]}}, 0 offset:20			; GCN-DAG: buffer_load_dword [[LOAD_OUT_VAL1:v[0-9]+]], off, s{{\[[0-9]+:[0-9]+\]}}, 0 offset:20
	; GCN-NOT: s_sub_u32 [[SP]]			; GCN-NOT: s_sub_u32 [[SP]]

	; GCN: buffer_store_byte [[LOAD_OUT_VAL0]], off			; GCN: buffer_store_byte [[LOAD_OUT_VAL0]], off
	; GCN: buffer_store_dword [[LOAD_OUT_VAL1]], off			; GCN: buffer_store_dword [[LOAD_OUT_VAL1]], off
	define amdgpu_kernel void @test_call_external_void_func_sret_struct_i8_i32_byval_struct_i8_i32(i32) #0 {			define amdgpu_kernel void @test_call_external_void_func_sret_struct_i8_i32_byval_struct_i8_i32(i32) #0 {
	%in.val = alloca { i8, i32 }, align 4, addrspace(5)			%in.val = alloca { i8, i32 }, align 8, addrspace(5)
	%out.val = alloca { i8, i32 }, align 4, addrspace(5)			%out.val = alloca { i8, i32 }, align 8, addrspace(5)
	%in.gep0 = getelementptr inbounds { i8, i32 }, { i8, i32 } addrspace(5)* %in.val, i32 0, i32 0			%in.gep0 = getelementptr inbounds { i8, i32 }, { i8, i32 } addrspace(5)* %in.val, i32 0, i32 0
	%in.gep1 = getelementptr inbounds { i8, i32 }, { i8, i32 } addrspace(5)* %in.val, i32 0, i32 1			%in.gep1 = getelementptr inbounds { i8, i32 }, { i8, i32 } addrspace(5)* %in.val, i32 0, i32 1
	store i8 3, i8 addrspace(5)* %in.gep0			store i8 3, i8 addrspace(5)* %in.gep0
	store i32 8, i32 addrspace(5)* %in.gep1			store i32 8, i32 addrspace(5)* %in.gep1
	call void @external_void_func_sret_struct_i8_i32_byval_struct_i8_i32({ i8, i32 } addrspace(5)* %out.val, { i8, i32 } addrspace(5)* byval({ i8, i32 }) %in.val)			call void @external_void_func_sret_struct_i8_i32_byval_struct_i8_i32({ i8, i32 } addrspace(5)* %out.val, { i8, i32 } addrspace(5)* byval({ i8, i32 }) %in.val)
	%out.gep0 = getelementptr inbounds { i8, i32 }, { i8, i32 } addrspace(5)* %out.val, i32 0, i32 0			%out.gep0 = getelementptr inbounds { i8, i32 }, { i8, i32 } addrspace(5)* %out.val, i32 0, i32 0
	%out.gep1 = getelementptr inbounds { i8, i32 }, { i8, i32 } addrspace(5)* %out.val, i32 0, i32 1			%out.gep1 = getelementptr inbounds { i8, i32 }, { i8, i32 } addrspace(5)* %out.val, i32 0, i32 1
	%out.val0 = load i8, i8 addrspace(5)* %out.gep0			%out.val0 = load i8, i8 addrspace(5)* %out.gep0
	▲ Show 20 Lines • Show All 206 Lines • Show Last 20 Lines

llvm/test/CodeGen/AMDGPU/frame-index-elimination.ll

	Show First 20 Lines • Show All 285 Lines • ▼ Show 20 Lines
	; GFX9-MUBUF: v_lshrrev_b32_e64 [[SHIFT:v[0-9]+]], 6, s32			; GFX9-MUBUF: v_lshrrev_b32_e64 [[SHIFT:v[0-9]+]], 6, s32
	; GFX9-MUBUF-NEXT: v_or_b32_e32 [[PTR:v[0-9]+]], 4, [[SHIFT]]			; GFX9-MUBUF-NEXT: v_or_b32_e32 [[PTR:v[0-9]+]], 4, [[SHIFT]]

	; GFX9-FLATSCR: v_mov_b32_e32 [[SP:v[0-9]+]], s32			; GFX9-FLATSCR: v_mov_b32_e32 [[SP:v[0-9]+]], s32
	; GFX9-FLATSCR-NEXT: v_or_b32_e32 [[PTR:v[0-9]+]], 4, [[SP]]			; GFX9-FLATSCR-NEXT: v_or_b32_e32 [[PTR:v[0-9]+]], 4, [[SP]]

	; GCN: ds_write_b32 v{{[0-9]+}}, [[PTR]]			; GCN: ds_write_b32 v{{[0-9]+}}, [[PTR]]
	define void @alloca_ptr_nonentry_block(i32 %arg0) #0 {			define void @alloca_ptr_nonentry_block(i32 %arg0) #0 {
	%alloca0 = alloca { i8, i32 }, align 4, addrspace(5)			%alloca0 = alloca { i8, i32 }, align 8, addrspace(5)
	%cmp = icmp eq i32 %arg0, 0			%cmp = icmp eq i32 %arg0, 0
	br i1 %cmp, label %bb, label %ret			br i1 %cmp, label %bb, label %ret

	bb:			bb:
	%gep0 = getelementptr inbounds { i8, i32 }, { i8, i32 } addrspace(5)* %alloca0, i32 0, i32 0			%gep0 = getelementptr inbounds { i8, i32 }, { i8, i32 } addrspace(5)* %alloca0, i32 0, i32 0
	%gep1 = getelementptr inbounds { i8, i32 }, { i8, i32 } addrspace(5)* %alloca0, i32 0, i32 1			%gep1 = getelementptr inbounds { i8, i32 }, { i8, i32 } addrspace(5)* %alloca0, i32 0, i32 1
	%load1 = load volatile i32, i32 addrspace(5)* %gep1			%load1 = load volatile i32, i32 addrspace(5)* %gep1
	store volatile i32 addrspace(5)* %gep1, i32 addrspace(5)* addrspace(3)* undef			store volatile i32 addrspace(5)* %gep1, i32 addrspace(5)* addrspace(3)* undef
	Show All 34 Lines

llvm/test/CodeGen/AMDGPU/spill-scavenge-offset.ll

	Show First 20 Lines • Show All 117 Lines • ▼ Show 20 Lines
	; FLATSCR: s_movk_i32 [[SOFF2:s[0-9]+]], 0x			; FLATSCR: s_movk_i32 [[SOFF2:s[0-9]+]], 0x
	; FLATSCR: scratch_load_dwordx4 v[{{[0-9:]+}}], off, [[SOFF2]] ; 16-byte Folded Reload			; FLATSCR: scratch_load_dwordx4 v[{{[0-9:]+}}], off, [[SOFF2]] ; 16-byte Folded Reload
	define amdgpu_kernel void @test_limited_sgpr(<64 x i32> addrspace(1)* %out, <64 x i32> addrspace(1)* %in) #0 {			define amdgpu_kernel void @test_limited_sgpr(<64 x i32> addrspace(1)* %out, <64 x i32> addrspace(1)* %in) #0 {
	entry:			entry:
	%lo = call i32 @llvm.amdgcn.mbcnt.lo(i32 -1, i32 0)			%lo = call i32 @llvm.amdgcn.mbcnt.lo(i32 -1, i32 0)
	%tid = call i32 @llvm.amdgcn.mbcnt.hi(i32 -1, i32 %lo)			%tid = call i32 @llvm.amdgcn.mbcnt.hi(i32 -1, i32 %lo)

	; allocate enough scratch to go beyond 2^12 addressing			; allocate enough scratch to go beyond 2^12 addressing
	%scratch = alloca <1280 x i32>, align 8, addrspace(5)			%scratch = alloca <1280 x i32>, align 16, addrspace(5)

	; load VGPR data			; load VGPR data
	%aptr = getelementptr <64 x i32>, <64 x i32> addrspace(1)* %in, i32 %tid			%aptr = getelementptr <64 x i32>, <64 x i32> addrspace(1)* %in, i32 %tid
	%a = load <64 x i32>, <64 x i32> addrspace(1)* %aptr			%a = load <64 x i32>, <64 x i32> addrspace(1)* %aptr

	; make sure scratch is used			; make sure scratch is used
	%x = extractelement <64 x i32> %a, i32 0			%x = extractelement <64 x i32> %a, i32 0
	%sptr0 = getelementptr <1280 x i32>, <1280 x i32> addrspace(5)* %scratch, i32 %x, i32 0			%sptr0 = getelementptr <1280 x i32>, <1280 x i32> addrspace(5)* %scratch, i32 %x, i32 0
	▲ Show 20 Lines • Show All 41 Lines • Show Last 20 Lines

llvm/test/CodeGen/ARM/ssp-data-layout.ll

	Show First 20 Lines • Show All 446 Lines • ▼ Show 20 Lines
	; large array is assigned to the stack properly as a large object.			; large array is assigned to the stack properly as a large object.
	; CHECK: struct_with_protectable_arrays:			; CHECK: struct_with_protectable_arrays:
	; CHECK: bl get_struct_small_char			; CHECK: bl get_struct_small_char
	; CHECK: strb r0, [sp, #68]			; CHECK: strb r0, [sp, #68]
	; CHECK: bl end_struct_small_char			; CHECK: bl end_struct_small_char
	; CHECK: bl get_struct_large_char2			; CHECK: bl get_struct_large_char2
	; CHECK: strb r0, [sp, #106]			; CHECK: strb r0, [sp, #106]
	; CHECK: bl end_struct_large_char2			; CHECK: bl end_struct_large_char2
	%a = alloca %struct.struct_small_char, align 1			%a = alloca %struct.struct_small_char, align 4
	%b = alloca %struct.struct_large_char2, align 1			%b = alloca %struct.struct_large_char2, align 4
	%d1 = alloca %struct.struct_large_nonchar, align 8			%d1 = alloca %struct.struct_large_nonchar, align 8
	%d2 = alloca %struct.struct_small_nonchar, align 2			%d2 = alloca %struct.struct_small_nonchar, align 2
	%call = call signext i8 @get_struct_small_char()			%call = call signext i8 @get_struct_small_char()
	%foo = getelementptr inbounds %struct.struct_small_char, %struct.struct_small_char* %a, i32 0, i32 0			%foo = getelementptr inbounds %struct.struct_small_char, %struct.struct_small_char* %a, i32 0, i32 0
	%arrayidx = getelementptr inbounds [2 x i8], [2 x i8]* %foo, i32 0, i64 0			%arrayidx = getelementptr inbounds [2 x i8], [2 x i8]* %foo, i32 0, i64 0
	store i8 %call, i8* %arrayidx, align 1			store i8 %call, i8* %arrayidx, align 1
	call void @end_struct_small_char()			call void @end_struct_small_char()
	%call1 = call signext i8 @get_struct_large_char2()			%call1 = call signext i8 @get_struct_large_char2()
	▲ Show 20 Lines • Show All 58 Lines • Show Last 20 Lines

llvm/test/CodeGen/BPF/undef.ll

	Show All 34 Lines
	; CHECK-DAG: (u16 )(r10 + 0) = r1			; CHECK-DAG: (u16 )(r10 + 0) = r1
	; CHECK-DAG: (u16 )(r10 + 26) = r1			; CHECK-DAG: (u16 )(r10 + 26) = r1

	; CHECK: r2 = r10			; CHECK: r2 = r10
	; CHECK: r2 += -8			; CHECK: r2 += -8
	; CHECK: r1 = routing			; CHECK: r1 = routing
	; CHECK: call bpf_map_lookup_elem			; CHECK: call bpf_map_lookup_elem
	; CHECK: exit			; CHECK: exit
	%key = alloca %struct.routing_key_2, align 1			%key = alloca %struct.routing_key_2, align 8
	%1 = getelementptr inbounds %struct.routing_key_2, %struct.routing_key_2* %key, i64 0, i32 0, i64 0			%1 = getelementptr inbounds %struct.routing_key_2, %struct.routing_key_2* %key, i64 0, i32 0, i64 0
	store i8 5, i8* %1, align 1			store i8 5, i8* %1, align 1
	%2 = getelementptr inbounds %struct.routing_key_2, %struct.routing_key_2* %key, i64 0, i32 0, i64 1			%2 = getelementptr inbounds %struct.routing_key_2, %struct.routing_key_2* %key, i64 0, i32 0, i64 1
	store i8 6, i8* %2, align 1			store i8 6, i8* %2, align 1
	%3 = getelementptr inbounds %struct.routing_key_2, %struct.routing_key_2* %key, i64 0, i32 0, i64 2			%3 = getelementptr inbounds %struct.routing_key_2, %struct.routing_key_2* %key, i64 0, i32 0, i64 2
	store i8 7, i8* %3, align 1			store i8 7, i8* %3, align 1
	%4 = getelementptr inbounds %struct.routing_key_2, %struct.routing_key_2* %key, i64 0, i32 0, i64 3			%4 = getelementptr inbounds %struct.routing_key_2, %struct.routing_key_2* %key, i64 0, i32 0, i64 3
	store i8 8, i8* %4, align 1			store i8 8, i8* %4, align 1
	Show All 14 Lines

llvm/test/CodeGen/Mips/Fast-ISel/fastalloca.ll

	; RUN: llc -march=mipsel -relocation-model=pic -O0 -fast-isel-abort=3 -mcpu=mips32r2 \			; RUN: llc -march=mipsel -relocation-model=pic -O0 -fast-isel-abort=3 -mcpu=mips32r2 \
	; RUN: < %s -verify-machineinstrs \| FileCheck %s			; RUN: < %s -verify-machineinstrs \| FileCheck %s

	%struct.x = type { i32 }			%struct.x = type { i32 }

	@i = common global i32 0, align 4			@i = common global i32 0, align 4

	define i32 @foobar(i32 signext %x) {			define i32 @foobar(i32 signext %x) {
	entry:			entry:
	; CHECK-LABEL: foobar:			; CHECK-LABEL: foobar:
	%retval = alloca i32, align 4			%retval = alloca i32, align 4
	%x.addr = alloca i32, align 4			%x.addr = alloca i32, align 4
	%a = alloca %struct.x, align 4			%a = alloca %struct.x, align 8
	%c = alloca %struct.x*, align 4			%c = alloca %struct.x*, align 8
	store i32 %x, i32* %x.addr, align 4			store i32 %x, i32* %x.addr, align 4
	%x1 = getelementptr inbounds %struct.x, %struct.x* %a, i32 0, i32 0			%x1 = getelementptr inbounds %struct.x, %struct.x* %a, i32 0, i32 0
	%0 = load i32, i32* %x.addr, align 4			%0 = load i32, i32* %x.addr, align 4
	store i32 %0, i32* %x1, align 4			store i32 %0, i32* %x1, align 4
	store %struct.x* %a, %struct.x** %c, align 4			store %struct.x* %a, %struct.x** %c, align 4
	%1 = load %struct.x, %struct.x* %c, align 4			%1 = load %struct.x, %struct.x* %c, align 4
	%x2 = getelementptr inbounds %struct.x, %struct.x* %1, i32 0, i32 0			%x2 = getelementptr inbounds %struct.x, %struct.x* %1, i32 0, i32 0
	%2 = load i32, i32* %x2, align 4			%2 = load i32, i32* %x2, align 4
	Show All 10 Lines

llvm/test/CodeGen/Mips/atomic64.ll

	Show First 20 Lines • Show All 1,139 Lines • ▼ Show 20 Lines
	; MIPS64EB-NEXT: move $3, $4			; MIPS64EB-NEXT: move $3, $4
	; MIPS64EB-NEXT: scd $3, 0($1)			; MIPS64EB-NEXT: scd $3, 0($1)
	; MIPS64EB-NEXT: beqz $3, .LBB6_1			; MIPS64EB-NEXT: beqz $3, .LBB6_1
	; MIPS64EB-NEXT: nop			; MIPS64EB-NEXT: nop
	; MIPS64EB-NEXT: # %bb.2: # %entry			; MIPS64EB-NEXT: # %bb.2: # %entry
	; MIPS64EB-NEXT: jr $ra			; MIPS64EB-NEXT: jr $ra
	; MIPS64EB-NEXT: daddiu $sp, $sp, 16			; MIPS64EB-NEXT: daddiu $sp, $sp, 16
	entry:			entry:
	%newval.addr = alloca i64, align 4			%newval.addr = alloca i64, align 8
	store i64 %newval, i64* %newval.addr, align 4			store i64 %newval, i64* %newval.addr, align 4
	%tmp = load i64, i64* %newval.addr, align 4			%tmp = load i64, i64* %newval.addr, align 4
	%0 = atomicrmw xchg i64* @x, i64 %tmp monotonic			%0 = atomicrmw xchg i64* @x, i64 %tmp monotonic
	ret i64 %0			ret i64 %0

	}			}

	define i64 @AtomicCmpSwap64(i64 signext %oldval, i64 signext %newval) nounwind {			define i64 @AtomicCmpSwap64(i64 signext %oldval, i64 signext %newval) nounwind {
	▲ Show 20 Lines • Show All 197 Lines • ▼ Show 20 Lines
	; MIPS64EB-NEXT: move $3, $5			; MIPS64EB-NEXT: move $3, $5
	; MIPS64EB-NEXT: scd $3, 0($1)			; MIPS64EB-NEXT: scd $3, 0($1)
	; MIPS64EB-NEXT: beqz $3, .LBB7_1			; MIPS64EB-NEXT: beqz $3, .LBB7_1
	; MIPS64EB-NEXT: nop			; MIPS64EB-NEXT: nop
	; MIPS64EB-NEXT: .LBB7_3: # %entry			; MIPS64EB-NEXT: .LBB7_3: # %entry
	; MIPS64EB-NEXT: jr $ra			; MIPS64EB-NEXT: jr $ra
	; MIPS64EB-NEXT: daddiu $sp, $sp, 16			; MIPS64EB-NEXT: daddiu $sp, $sp, 16
	entry:			entry:
	%newval.addr = alloca i64, align 4			%newval.addr = alloca i64, align 8
	store i64 %newval, i64* %newval.addr, align 4			store i64 %newval, i64* %newval.addr, align 4
	%tmp = load i64, i64* %newval.addr, align 4			%tmp = load i64, i64* %newval.addr, align 4
	%0 = cmpxchg i64* @x, i64 %oldval, i64 %tmp monotonic monotonic			%0 = cmpxchg i64* @x, i64 %oldval, i64 %tmp monotonic monotonic
	%1 = extractvalue { i64, i1 } %0, 0			%1 = extractvalue { i64, i1 } %0, 0
	ret i64 %1			ret i64 %1

	}			}

llvm/test/CodeGen/Mips/cconv/byval.ll

	Show First 20 Lines • Show All 145 Lines • ▼ Show 20 Lines
	; N64-NEXT: daddu $sp, $sp, $1			; N64-NEXT: daddu $sp, $sp, $1
	; N64-NEXT: lui $1, 1			; N64-NEXT: lui $1, 1
	; N64-NEXT: daddu $1, $sp, $1			; N64-NEXT: daddu $1, $sp, $1
	; N64-NEXT: ld $ra, -8($1) # 8-byte Folded Reload			; N64-NEXT: ld $ra, -8($1) # 8-byte Folded Reload
	; N64-NEXT: lui $1, 1			; N64-NEXT: lui $1, 1
	; N64-NEXT: jr $ra			; N64-NEXT: jr $ra
	; N64-NEXT: daddu $sp, $sp, $1			; N64-NEXT: daddu $sp, $sp, $1
	entry:			entry:
	%a = alloca %struct.S1, align 4			%a = alloca %struct.S1, align 8
	call void @f2(%struct.S1* byval(%struct.S1) align 4 %a)			call void @f2(%struct.S1* byval(%struct.S1) align 4 %a)
	ret void			ret void
	}			}

	declare dso_local void @f2(%struct.S1* byval(%struct.S1) align 4) #1			declare dso_local void @f2(%struct.S1* byval(%struct.S1) align 4) #1

	; O32-SDAG-LABEL: Initial selection DAG: %bb.0 'g2:entry'			; O32-SDAG-LABEL: Initial selection DAG: %bb.0 'g2:entry'
	; O32-SDAG: t{{.}}: ch,glue = callseq_start t{{.}}, TargetConstant:i32<{{.*}}>			; O32-SDAG: t{{.}}: ch,glue = callseq_start t{{.}}, TargetConstant:i32<{{.*}}>
	▲ Show 20 Lines • Show All 172 Lines • ▼ Show 20 Lines
	; N64-NEXT: lui $1, 1			; N64-NEXT: lui $1, 1
	; N64-NEXT: daddu $1, $sp, $1			; N64-NEXT: daddu $1, $sp, $1
	; N64-NEXT: ld $ra, 8($1) # 8-byte Folded Reload			; N64-NEXT: ld $ra, 8($1) # 8-byte Folded Reload
	; N64-NEXT: lui $1, 1			; N64-NEXT: lui $1, 1
	; N64-NEXT: daddiu $1, $1, 16			; N64-NEXT: daddiu $1, $1, 16
	; N64-NEXT: jr $ra			; N64-NEXT: jr $ra
	; N64-NEXT: daddu $sp, $sp, $1			; N64-NEXT: daddu $sp, $sp, $1
	entry:			entry:
	%a.addr = alloca %struct.S1*, align 4			%a.addr = alloca %struct.S1*
	%byval-temp = alloca %struct.S1, align 4			%byval-temp = alloca %struct.S1, align 8
	store %struct.S1* %a, %struct.S1** %a.addr, align 4			store %struct.S1* %a, %struct.S1** %a.addr, align 4
	%0 = load %struct.S1, %struct.S1* %a.addr, align 4			%0 = load %struct.S1, %struct.S1* %a.addr, align 4
	%1 = bitcast %struct.S1* %byval-temp to i8*			%1 = bitcast %struct.S1* %byval-temp to i8*
	%2 = bitcast %struct.S1* %0 to i8*			%2 = bitcast %struct.S1* %0 to i8*
	call void @llvm.memcpy.p0i8.p0i8.i32(i8* align 4 %1, i8* align 1 %2, i32 65520, i1 false)			call void @llvm.memcpy.p0i8.p0i8.i32(i8* align 4 %1, i8* align 1 %2, i32 65520, i1 false)
	call void @f2(%struct.S1* byval(%struct.S1) align 4 %byval-temp)			call void @f2(%struct.S1* byval(%struct.S1) align 4 %byval-temp)
	ret void			ret void
	}			}
	▲ Show 20 Lines • Show All 54 Lines • ▼ Show 20 Lines
	; N64-NEXT: sd $4, 16($sp)			; N64-NEXT: sd $4, 16($sp)
	; N64-NEXT: jal memcpy			; N64-NEXT: jal memcpy
	; N64-NEXT: ori $6, $zero, 65520			; N64-NEXT: ori $6, $zero, 65520
	; N64-NEXT: addiu $2, $zero, 4			; N64-NEXT: addiu $2, $zero, 4
	; N64-NEXT: ld $ra, 24($sp) # 8-byte Folded Reload			; N64-NEXT: ld $ra, 24($sp) # 8-byte Folded Reload
	; N64-NEXT: jr $ra			; N64-NEXT: jr $ra
	; N64-NEXT: daddiu $sp, $sp, 32			; N64-NEXT: daddiu $sp, $sp, 32
	entry:			entry:
	%a.addr = alloca %struct.S1*, align 4			%a.addr = alloca %struct.S1*
	%b.addr = alloca %struct.S1*, align 4			%b.addr = alloca %struct.S1*
	store %struct.S1* %a, %struct.S1** %a.addr, align 4			store %struct.S1* %a, %struct.S1** %a.addr, align 4
	store %struct.S1* %b, %struct.S1** %b.addr, align 4			store %struct.S1* %b, %struct.S1** %b.addr, align 4
	%0 = load %struct.S1, %struct.S1* %a.addr, align 4			%0 = load %struct.S1, %struct.S1* %a.addr, align 4
	%1 = bitcast %struct.S1* %0 to i8*			%1 = bitcast %struct.S1* %0 to i8*
	%2 = load %struct.S1, %struct.S1* %b.addr, align 4			%2 = load %struct.S1, %struct.S1* %b.addr, align 4
	%3 = bitcast %struct.S1* %2 to i8*			%3 = bitcast %struct.S1* %2 to i8*
	call void @llvm.memcpy.p0i8.p0i8.i32(i8* align 1 %1, i8* align 1 %3, i32 65520, i1 false)			call void @llvm.memcpy.p0i8.p0i8.i32(i8* align 1 %1, i8* align 1 %3, i32 65520, i1 false)
	ret i32 4			ret i32 4
	}			}

	declare void @llvm.memcpy.p0i8.p0i8.i32(i8* nocapture writeonly, i8* nocapture readonly, i32, i1) #2			declare void @llvm.memcpy.p0i8.p0i8.i32(i8* nocapture writeonly, i8* nocapture readonly, i32, i1) #2

llvm/test/CodeGen/Mips/cconv/return-struct.ll

	Show First 20 Lines • Show All 50 Lines • ▼ Show 20 Lines
	; N64-BE-DAG: dsll $2, [[R1]], 56			; N64-BE-DAG: dsll $2, [[R1]], 56

	; This test is based on the way clang currently lowers {i8,i8} to {i16}.			; This test is based on the way clang currently lowers {i8,i8} to {i16}.
	; FIXME: It should probably work for without any lowering too but this doesn't			; FIXME: It should probably work for without any lowering too but this doesn't
	; work as expected. Each member gets mapped to a register rather than			; work as expected. Each member gets mapped to a register rather than
	; packed into a single register.			; packed into a single register.
	define inreg {i16} @ret_struct_i16() nounwind {			define inreg {i16} @ret_struct_i16() nounwind {
	entry:			entry:
	%retval = alloca {i8,i8}, align 1			%retval = alloca {i8,i8}, align 8
	%0 = bitcast {i8,i8}* %retval to i8*			%0 = bitcast {i8,i8}* %retval to i8*
	call void @llvm.memcpy.p0i8.p0i8.i64(i8* %0, i8* getelementptr inbounds ({i8,i8}, {i8,i8}* @struct_2byte, i32 0, i32 0), i64 2, i1 false)			call void @llvm.memcpy.p0i8.p0i8.i64(i8* %0, i8* getelementptr inbounds ({i8,i8}, {i8,i8}* @struct_2byte, i32 0, i32 0), i64 2, i1 false)
	%1 = bitcast {i8,i8}* %retval to {i16}*			%1 = bitcast {i8,i8}* %retval to {i16}*
	%2 = load volatile {i16}, {i16}* %1			%2 = load volatile {i16}, {i16}* %1
	ret {i16} %2			ret {i16} %2
	}			}

	; ALL-LABEL: ret_struct_i16:			; ALL-LABEL: ret_struct_i16:
	▲ Show 20 Lines • Show All 170 Lines • Show Last 20 Lines

llvm/test/CodeGen/Mips/largeimmprinting.ll

	Show All 18 Lines

	; 64: lui $[[R0:[0-9]+]], 1			; 64: lui $[[R0:[0-9]+]], 1
	; 64: daddiu $[[R0]], $[[R0]], 32			; 64: daddiu $[[R0]], $[[R0]], 32
	; 64: dsubu $sp, $sp, $[[R0]]			; 64: dsubu $sp, $sp, $[[R0]]
	; 64: lui $[[R1:[0-9]+]], 1			; 64: lui $[[R1:[0-9]+]], 1
	; 64: daddu $[[R1]], $sp, $[[R1]]			; 64: daddu $[[R1]], $sp, $[[R1]]
	; 64: sd $ra, 24($[[R1]])			; 64: sd $ra, 24($[[R1]])

	%agg.tmp = alloca %struct.S1, align 1			%agg.tmp = alloca %struct.S1, align 8
	%tmp = getelementptr inbounds %struct.S1, %struct.S1* %agg.tmp, i32 0, i32 0, i32 0			%tmp = getelementptr inbounds %struct.S1, %struct.S1* %agg.tmp, i32 0, i32 0, i32 0
	call void @llvm.memcpy.p0i8.p0i8.i32(i8* align 1 %tmp, i8* align 1 getelementptr inbounds (%struct.S1, %struct.S1* @s1, i32 0, i32 0, i32 0), i32 65536, i1 false)			call void @llvm.memcpy.p0i8.p0i8.i32(i8* align 1 %tmp, i8* align 1 getelementptr inbounds (%struct.S1, %struct.S1* @s1, i32 0, i32 0, i32 0), i32 65536, i1 false)
	call void @f2(%struct.S1* byval(%struct.S1) %agg.tmp) nounwind			call void @f2(%struct.S1* byval(%struct.S1) %agg.tmp) nounwind
	ret void			ret void
	}			}

	declare void @f2(%struct.S1* byval(%struct.S1))			declare void @f2(%struct.S1* byval(%struct.S1))

	declare void @llvm.memcpy.p0i8.p0i8.i32(i8* nocapture, i8* nocapture, i32, i1) nounwind			declare void @llvm.memcpy.p0i8.p0i8.i32(i8* nocapture, i8* nocapture, i32, i1) nounwind

llvm/test/CodeGen/Mips/o32_cc_byval.ll

	Show First 20 Lines • Show All 74 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: move $gp, $16			; CHECK-NEXT: move $gp, $16
	; CHECK-NEXT: lw $16, 48($sp) # 4-byte Folded Reload			; CHECK-NEXT: lw $16, 48($sp) # 4-byte Folded Reload
	; CHECK-NEXT: lw $17, 52($sp) # 4-byte Folded Reload			; CHECK-NEXT: lw $17, 52($sp) # 4-byte Folded Reload
	; CHECK-NEXT: lw $18, 56($sp) # 4-byte Folded Reload			; CHECK-NEXT: lw $18, 56($sp) # 4-byte Folded Reload
	; CHECK-NEXT: lw $ra, 60($sp) # 4-byte Folded Reload			; CHECK-NEXT: lw $ra, 60($sp) # 4-byte Folded Reload
	; CHECK-NEXT: jr $ra			; CHECK-NEXT: jr $ra
	; CHECK-NEXT: addiu $sp, $sp, 64			; CHECK-NEXT: addiu $sp, $sp, 64
	entry:			entry:
	%agg.tmp10 = alloca %struct.S3, align 4			%agg.tmp10 = alloca %struct.S3, align 8
	call void @callee1(float 2.000000e+01, %struct.S1* byval(%struct.S1) bitcast (%0* @f1.s1 to %struct.S1*)) nounwind			call void @callee1(float 2.000000e+01, %struct.S1* byval(%struct.S1) bitcast (%0* @f1.s1 to %struct.S1*)) nounwind
	call void @callee2(%struct.S2* byval(%struct.S2) @f1.s2) nounwind			call void @callee2(%struct.S2* byval(%struct.S2) @f1.s2) nounwind
	%tmp11 = getelementptr inbounds %struct.S3, %struct.S3* %agg.tmp10, i32 0, i32 0			%tmp11 = getelementptr inbounds %struct.S3, %struct.S3* %agg.tmp10, i32 0, i32 0
	store i8 11, i8* %tmp11, align 4			store i8 11, i8* %tmp11, align 4
	call void @callee3(float 2.100000e+01, %struct.S3* byval(%struct.S3) %agg.tmp10, %struct.S1* byval(%struct.S1) bitcast (%0* @f1.s1 to %struct.S1*)) nounwind			call void @callee3(float 2.100000e+01, %struct.S3* byval(%struct.S3) %agg.tmp10, %struct.S1* byval(%struct.S1) bitcast (%0* @f1.s1 to %struct.S1*)) nounwind
	ret void			ret void
	}			}

	▲ Show 20 Lines • Show All 168 Lines • Show Last 20 Lines

llvm/test/CodeGen/NVPTX/lower-byval-args.ll

Show First 20 Lines • Show All 114 Lines • ▼ Show 20 Lines	bb:
%load = load i8, i8 addrspace(101)* %asc, align 4		%load = load i8, i8 addrspace(101)* %asc, align 4
store i8 %load, i8* %out, align 4		store i8 %load, i8* %out, align 4
ret void		ret void
}		}


; Verify that if the pointer escapes, then we do fall back onto using a temp copy.		; Verify that if the pointer escapes, then we do fall back onto using a temp copy.
; CHECK-LABEL: .visible .entry pointer_escapes		; CHECK-LABEL: .visible .entry pointer_escapes
; CHECK: .local .align 8 .b8 __local_depot{{.*}}		; CHECK: .local .align 4 .b8 __local_depot{{.*}}
; CHECK64: ld.param.u64 [[result_addr:%rd[0-9]+]], [{{.*}}_param_0]		; CHECK64: ld.param.u64 [[result_addr:%rd[0-9]+]], [{{.*}}_param_0]
; CHECK64: add.u64 %[[copy_addr:rd[0-9]+]], %SPL, 0;		; CHECK64: add.u64 %[[copy_addr:rd[0-9]+]], %SPL, 0;
; CHECK32: ld.param.u32 [[result_addr:%r[0-9]+]], [{{.*}}_param_0]		; CHECK32: ld.param.u32 [[result_addr:%r[0-9]+]], [{{.*}}_param_0]
; CHECK32: add.u32 %[[copy_addr:r[0-9]+]], %SPL, 0;		; CHECK32: add.u32 %[[copy_addr:r[0-9]+]], %SPL, 0;
; CHECK-DAG: ld.param.u32 %{{.*}}, [pointer_escapes_param_1+12];		; CHECK-DAG: ld.param.u32 %{{.*}}, [pointer_escapes_param_1+12];
; CHECK-DAG: ld.param.u32 %{{.*}}, [pointer_escapes_param_1+8];		; CHECK-DAG: ld.param.u32 %{{.*}}, [pointer_escapes_param_1+8];
; CHECK-DAG: ld.param.u32 %{{.*}}, [pointer_escapes_param_1+4];		; CHECK-DAG: ld.param.u32 %{{.*}}, [pointer_escapes_param_1+4];
; CHECK-DAG: ld.param.u32 %{{.*}}, [pointer_escapes_param_1];		; CHECK-DAG: ld.param.u32 %{{.*}}, [pointer_escapes_param_1];
Show All 37 Lines

llvm/test/CodeGen/PowerPC/aix-cc-byval.ll

	Show First 20 Lines • Show All 347 Lines • ▼ Show 20 Lines
	%struct.S4 = type { [4 x i8] }			%struct.S4 = type { [4 x i8] }
	%struct.S4A = type { i32 }			%struct.S4A = type { i32 }

	@gS4 = external global %struct.S4, align 1			@gS4 = external global %struct.S4, align 1

	define void @call_test_byval_4Byte() {			define void @call_test_byval_4Byte() {
	entry:			entry:
	%s0 = alloca %struct.S0, align 8			%s0 = alloca %struct.S0, align 8
	%s4a = alloca %struct.S4A, align 4			%s4a = alloca %struct.S4A, align 8
	%call = call signext i32 @test_byval_4Byte(%struct.S4* byval(%struct.S4) align 1 @gS4, %struct.S0* byval(%struct.S0) align 1 %s0, %struct.S4A* byval(%struct.S4A) align 4 %s4a)			%call = call signext i32 @test_byval_4Byte(%struct.S4* byval(%struct.S4) align 1 @gS4, %struct.S0* byval(%struct.S0) align 1 %s0, %struct.S4A* byval(%struct.S4A) align 4 %s4a)
	ret void			ret void
	}			}

	; CHECK-LABEL: name: call_test_byval_4Byte{{.*}}			; CHECK-LABEL: name: call_test_byval_4Byte{{.*}}

	; 32BIT: ADJCALLSTACKDOWN 56, 0, implicit-def dead $r1, implicit $r1			; 32BIT: ADJCALLSTACKDOWN 56, 0, implicit-def dead $r1, implicit $r1
	; 32BIT-NEXT: renamable $r[[REG:[0-9]+]] = LWZtoc @gS4, $r2 :: (load (s32) from got)			; 32BIT-NEXT: renamable $r[[REG:[0-9]+]] = LWZtoc @gS4, $r2 :: (load (s32) from got)
	▲ Show 20 Lines • Show All 575 Lines • ▼ Show 20 Lines
	; ASM64-DAG: std 4, 56(1)			; ASM64-DAG: std 4, 56(1)
	; ASM64-DAG: std 6, 72(1)			; ASM64-DAG: std 6, 72(1)
	; ASM64-NEXT: blr			; ASM64-NEXT: blr

	%struct.F = type { float, float, float }			%struct.F = type { float, float, float }

	define i32 @call_test_byval_homogeneous_float_struct() {			define i32 @call_test_byval_homogeneous_float_struct() {
	entry:			entry:
	%s = alloca %struct.F, align 4			%s = alloca %struct.F, align 8
	%0 = bitcast %struct.F* %s to i8*			%0 = bitcast %struct.F* %s to i8*
	call void @llvm.memset.p0i8.i32(i8* align 4 %0, i8 0, i32 12, i1 false)			call void @llvm.memset.p0i8.i32(i8* align 4 %0, i8 0, i32 12, i1 false)
	%call = call i32 @test_byval_homogeneous_float_struct(%struct.F* byval(%struct.F) align 4 %s)			%call = call i32 @test_byval_homogeneous_float_struct(%struct.F* byval(%struct.F) align 4 %s)
	ret i32 %call			ret i32 %call
	}			}

	declare void @llvm.memset.p0i8.i32(i8* nocapture writeonly, i8, i32, i1 immarg)			declare void @llvm.memset.p0i8.i32(i8* nocapture writeonly, i8, i32, i1 immarg)

	Show All 35 Lines

llvm/test/CodeGen/PowerPC/aix-sret-param.ll

	Show All 11 Lines
	; RUN: llc -mtriple powerpc64-ibm-aix-xcoff -mcpu=pwr4 -mattr=-altivec \			; RUN: llc -mtriple powerpc64-ibm-aix-xcoff -mcpu=pwr4 -mattr=-altivec \
	; RUN: --verify-machineinstrs < %s \| FileCheck --check-prefixes=ASM,ASM64 %s			; RUN: --verify-machineinstrs < %s \| FileCheck --check-prefixes=ASM,ASM64 %s

	%struct.S = type { i8 }			%struct.S = type { i8 }
	%struct.T = type { double, i32, i32, i32, float }			%struct.T = type { double, i32, i32, i32, float }

	define void @test1() {			define void @test1() {
	entry:			entry:
	%s = alloca %struct.S, align 4			%s = alloca %struct.S, align 8
	call void @foo(%struct.S* sret(%struct.S) %s)			call void @foo(%struct.S* sret(%struct.S) %s)
	ret void			ret void
	}			}

	define void @test2() {			define void @test2() {
	entry:			entry:
	%t = alloca %struct.T, align 8			%t = alloca %struct.T, align 8
	call void @bar(%struct.T* sret(%struct.T) %t)			call void @bar(%struct.T* sret(%struct.T) %t)
	▲ Show 20 Lines • Show All 70 Lines • Show Last 20 Lines

llvm/test/CodeGen/PowerPC/byval.ll

	Show All 28 Lines
	; CHECK-NEXT: ld 3, 40(1)			; CHECK-NEXT: ld 3, 40(1)
	; CHECK-NEXT: bl foo1			; CHECK-NEXT: bl foo1
	; CHECK-NEXT: nop			; CHECK-NEXT: nop
	; CHECK-NEXT: addi 1, 1, 80			; CHECK-NEXT: addi 1, 1, 80
	; CHECK-NEXT: ld 0, 16(1)			; CHECK-NEXT: ld 0, 16(1)
	; CHECK-NEXT: mtlr 0			; CHECK-NEXT: mtlr 0
	; CHECK-NEXT: blr			; CHECK-NEXT: blr
	entry:			entry:
	%x = alloca %struct, align 4			%x = alloca %struct, align 8
	call void @foo(%struct* %x)			call void @foo(%struct* %x)
	%r = call i32 @foo1(%struct* byval(%struct) %x)			%r = call i32 @foo1(%struct* byval(%struct) %x)
	ret i32 %r			ret i32 %r
	}			}

llvm/test/CodeGen/PowerPC/structsinregs.ll

	Show All 29 Lines
	@caller2.p3 = private unnamed_addr constant %struct.t3 <{ i16 4, i8 8 }>, align 1			@caller2.p3 = private unnamed_addr constant %struct.t3 <{ i16 4, i8 8 }>, align 1
	@caller2.p4 = private unnamed_addr constant { i32 } { i32 16 }, align 1			@caller2.p4 = private unnamed_addr constant { i32 } { i32 16 }, align 1
	@caller2.p5 = private unnamed_addr constant %struct.t5 <{ i32 32, i8 64 }>, align 1			@caller2.p5 = private unnamed_addr constant %struct.t5 <{ i32 32, i8 64 }>, align 1
	@caller2.p6 = private unnamed_addr constant %struct.t6 <{ i32 128, i16 256 }>, align 1			@caller2.p6 = private unnamed_addr constant %struct.t6 <{ i32 128, i16 256 }>, align 1
	@caller2.p7 = private unnamed_addr constant %struct.t7 <{ i32 512, i16 1024, i8 -3 }>, align 1			@caller2.p7 = private unnamed_addr constant %struct.t7 <{ i32 512, i16 1024, i8 -3 }>, align 1

	define i32 @caller1() nounwind {			define i32 @caller1() nounwind {
	entry:			entry:
	%p1 = alloca %struct.s1, align 1			%p1 = alloca %struct.s1
	%p2 = alloca %struct.s2, align 2			%p2 = alloca %struct.s2
	%p3 = alloca %struct.s3, align 2			%p3 = alloca %struct.s3
	%p4 = alloca %struct.s4, align 4			%p4 = alloca %struct.s4
	%p5 = alloca %struct.s5, align 4			%p5 = alloca %struct.s5
	%p6 = alloca %struct.s6, align 4			%p6 = alloca %struct.s6
	%p7 = alloca %struct.s7, align 4			%p7 = alloca %struct.s7
	%0 = bitcast %struct.s1* %p1 to i8*			%0 = bitcast %struct.s1* %p1 to i8*
	call void @llvm.memcpy.p0i8.p0i8.i64(i8* %0, i8* getelementptr inbounds (%struct.s1, %struct.s1* @caller1.p1, i32 0, i32 0), i64 1, i1 false)			call void @llvm.memcpy.p0i8.p0i8.i64(i8* %0, i8* getelementptr inbounds (%struct.s1, %struct.s1* @caller1.p1, i32 0, i32 0), i64 1, i1 false)
	%1 = bitcast %struct.s2* %p2 to i8*			%1 = bitcast %struct.s2* %p2 to i8*
	call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 2 %1, i8* align 2 bitcast (%struct.s2* @caller1.p2 to i8*), i64 2, i1 false)			call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 2 %1, i8* align 2 bitcast (%struct.s2* @caller1.p2 to i8*), i64 2, i1 false)
	%2 = bitcast %struct.s3* %p3 to i8*			%2 = bitcast %struct.s3* %p3 to i8*
	call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 2 %2, i8* align 2 bitcast ({ i16, i8, i8 }* @caller1.p3 to i8*), i64 4, i1 false)			call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 2 %2, i8* align 2 bitcast ({ i16, i8, i8 }* @caller1.p3 to i8*), i64 4, i1 false)
	%3 = bitcast %struct.s4* %p4 to i8*			%3 = bitcast %struct.s4* %p4 to i8*
	call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 4 %3, i8* align 4 bitcast (%struct.s4* @caller1.p4 to i8*), i64 4, i1 false)			call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 4 %3, i8* align 4 bitcast (%struct.s4* @caller1.p4 to i8*), i64 4, i1 false)
	▲ Show 20 Lines • Show All 59 Lines • ▼ Show 20 Lines
	; CHECK-DAG: lwz {{[0-9]+}}, 76(1)			; CHECK-DAG: lwz {{[0-9]+}}, 76(1)
	; CHECK-DAG: lwz {{[0-9]+}}, 80(1)			; CHECK-DAG: lwz {{[0-9]+}}, 80(1)
	; CHECK-DAG: lwz {{[0-9]+}}, 88(1)			; CHECK-DAG: lwz {{[0-9]+}}, 88(1)
	; CHECK-DAG: lwz {{[0-9]+}}, 96(1)			; CHECK-DAG: lwz {{[0-9]+}}, 96(1)
	}			}

	define i32 @caller2() nounwind {			define i32 @caller2() nounwind {
	entry:			entry:
	%p1 = alloca %struct.t1, align 1			%p1 = alloca %struct.t1
	%p2 = alloca %struct.t2, align 1			%p2 = alloca %struct.t2
	%p3 = alloca %struct.t3, align 1			%p3 = alloca %struct.t3
	%p4 = alloca %struct.t4, align 1			%p4 = alloca %struct.t4
	%p5 = alloca %struct.t5, align 1			%p5 = alloca %struct.t5
	%p6 = alloca %struct.t6, align 1			%p6 = alloca %struct.t6
	%p7 = alloca %struct.t7, align 1			%p7 = alloca %struct.t7
	%0 = bitcast %struct.t1* %p1 to i8*			%0 = bitcast %struct.t1* %p1 to i8*
	call void @llvm.memcpy.p0i8.p0i8.i64(i8* %0, i8* getelementptr inbounds (%struct.t1, %struct.t1* @caller2.p1, i32 0, i32 0), i64 1, i1 false)			call void @llvm.memcpy.p0i8.p0i8.i64(i8* %0, i8* getelementptr inbounds (%struct.t1, %struct.t1* @caller2.p1, i32 0, i32 0), i64 1, i1 false)
	%1 = bitcast %struct.t2* %p2 to i8*			%1 = bitcast %struct.t2* %p2 to i8*
	call void @llvm.memcpy.p0i8.p0i8.i64(i8* %1, i8* bitcast ({ i16 }* @caller2.p2 to i8*), i64 2, i1 false)			call void @llvm.memcpy.p0i8.p0i8.i64(i8* %1, i8* bitcast ({ i16 }* @caller2.p2 to i8*), i64 2, i1 false)
	%2 = bitcast %struct.t3* %p3 to i8*			%2 = bitcast %struct.t3* %p3 to i8*
	call void @llvm.memcpy.p0i8.p0i8.i64(i8* %2, i8* bitcast (%struct.t3* @caller2.p3 to i8*), i64 3, i1 false)			call void @llvm.memcpy.p0i8.p0i8.i64(i8* %2, i8* bitcast (%struct.t3* @caller2.p3 to i8*), i64 3, i1 false)
	%3 = bitcast %struct.t4* %p4 to i8*			%3 = bitcast %struct.t4* %p4 to i8*
	call void @llvm.memcpy.p0i8.p0i8.i64(i8* %3, i8* bitcast ({ i32 }* @caller2.p4 to i8*), i64 4, i1 false)			call void @llvm.memcpy.p0i8.p0i8.i64(i8* %3, i8* bitcast ({ i32 }* @caller2.p4 to i8*), i64 4, i1 false)
	▲ Show 20 Lines • Show All 80 Lines • Show Last 20 Lines

llvm/test/CodeGen/PowerPC/varargs-struct-float.ll

	; RUN: llc -verify-machineinstrs -mcpu=pwr7 -O0 < %s \| FileCheck %s			; RUN: llc -verify-machineinstrs -mcpu=pwr7 -O0 < %s \| FileCheck %s

	target datalayout = "E-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-f128:128:128-v128:128:128-n32:64"			target datalayout = "E-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-f128:128:128-v128:128:128-n32:64"
	target triple = "powerpc64-unknown-linux-gnu"			target triple = "powerpc64-unknown-linux-gnu"

	%struct.Sf1 = type { float }			%struct.Sf1 = type { float }

	define void @foo(float inreg %s.coerce) nounwind {			define void @foo(float inreg %s.coerce) nounwind {
	entry:			entry:
	%s = alloca %struct.Sf1, align 4			%s = alloca %struct.Sf1, align 8
	%coerce.dive = getelementptr %struct.Sf1, %struct.Sf1* %s, i32 0, i32 0			%coerce.dive = getelementptr %struct.Sf1, %struct.Sf1* %s, i32 0, i32 0
	store float %s.coerce, float* %coerce.dive, align 1			store float %s.coerce, float* %coerce.dive, align 1
	%coerce.dive1 = getelementptr %struct.Sf1, %struct.Sf1* %s, i32 0, i32 0			%coerce.dive1 = getelementptr %struct.Sf1, %struct.Sf1* %s, i32 0, i32 0
	%0 = load float, float* %coerce.dive1, align 1			%0 = load float, float* %coerce.dive1, align 1
	call void (i32, ...) @testvaSf1(i32 1, float inreg %0)			call void (i32, ...) @testvaSf1(i32 1, float inreg %0)
	ret void			ret void
	}			}

	; CHECK: stfs {{[0-9]+}}, 116(1)			; CHECK: stfs {{[0-9]+}}, 116(1)
	; CHECK: lwz 4, 116(1)			; CHECK: lwz 4, 116(1)
	; CHECK: bl			; CHECK: bl

	declare void @testvaSf1(i32, ...)			declare void @testvaSf1(i32, ...)

llvm/test/CodeGen/RISCV/calling-conv-ilp32-ilp32f-ilp32d-common.ll

	Show First 20 Lines • Show All 589 Lines • ▼ Show 20 Lines
	; RV32I-WITHFP-NEXT: sw a2, -32(s0)			; RV32I-WITHFP-NEXT: sw a2, -32(s0)
	; RV32I-WITHFP-NEXT: sw a3, -28(s0)			; RV32I-WITHFP-NEXT: sw a3, -28(s0)
	; RV32I-WITHFP-NEXT: addi a0, s0, -40			; RV32I-WITHFP-NEXT: addi a0, s0, -40
	; RV32I-WITHFP-NEXT: call callee_large_struct@plt			; RV32I-WITHFP-NEXT: call callee_large_struct@plt
	; RV32I-WITHFP-NEXT: lw ra, 44(sp) # 4-byte Folded Reload			; RV32I-WITHFP-NEXT: lw ra, 44(sp) # 4-byte Folded Reload
	; RV32I-WITHFP-NEXT: lw s0, 40(sp) # 4-byte Folded Reload			; RV32I-WITHFP-NEXT: lw s0, 40(sp) # 4-byte Folded Reload
	; RV32I-WITHFP-NEXT: addi sp, sp, 48			; RV32I-WITHFP-NEXT: addi sp, sp, 48
	; RV32I-WITHFP-NEXT: ret			; RV32I-WITHFP-NEXT: ret
	%ls = alloca %struct.large, align 4			%ls = alloca %struct.large, align 8
	%1 = bitcast %struct.large* %ls to i8*			%1 = bitcast %struct.large* %ls to i8*
	%a = getelementptr inbounds %struct.large, %struct.large* %ls, i32 0, i32 0			%a = getelementptr inbounds %struct.large, %struct.large* %ls, i32 0, i32 0
	store i32 1, i32* %a			store i32 1, i32* %a
	%b = getelementptr inbounds %struct.large, %struct.large* %ls, i32 0, i32 1			%b = getelementptr inbounds %struct.large, %struct.large* %ls, i32 0, i32 1
	store i32 2, i32* %b			store i32 2, i32* %b
	%c = getelementptr inbounds %struct.large, %struct.large* %ls, i32 0, i32 2			%c = getelementptr inbounds %struct.large, %struct.large* %ls, i32 0, i32 2
	store i32 3, i32* %c			store i32 3, i32* %c
	%d = getelementptr inbounds %struct.large, %struct.large* %ls, i32 0, i32 3			%d = getelementptr inbounds %struct.large, %struct.large* %ls, i32 0, i32 3
	▲ Show 20 Lines • Show All 413 Lines • Show Last 20 Lines

llvm/test/CodeGen/RISCV/frame.ll

	Show All 35 Lines
	; RV32I-WITHFP-NEXT: sw zero, -32(s0)			; RV32I-WITHFP-NEXT: sw zero, -32(s0)
	; RV32I-WITHFP-NEXT: addi a0, s0, -28			; RV32I-WITHFP-NEXT: addi a0, s0, -28
	; RV32I-WITHFP-NEXT: call test1@plt			; RV32I-WITHFP-NEXT: call test1@plt
	; RV32I-WITHFP-NEXT: li a0, 0			; RV32I-WITHFP-NEXT: li a0, 0
	; RV32I-WITHFP-NEXT: lw ra, 28(sp) # 4-byte Folded Reload			; RV32I-WITHFP-NEXT: lw ra, 28(sp) # 4-byte Folded Reload
	; RV32I-WITHFP-NEXT: lw s0, 24(sp) # 4-byte Folded Reload			; RV32I-WITHFP-NEXT: lw s0, 24(sp) # 4-byte Folded Reload
	; RV32I-WITHFP-NEXT: addi sp, sp, 32			; RV32I-WITHFP-NEXT: addi sp, sp, 32
	; RV32I-WITHFP-NEXT: ret			; RV32I-WITHFP-NEXT: ret
	%key = alloca %struct.key_t, align 4			%key = alloca %struct.key_t, align 8
	%1 = bitcast %struct.key_t* %key to i8*			%1 = bitcast %struct.key_t* %key to i8*
	call void @llvm.memset.p0i8.i64(i8* align 4 %1, i8 0, i64 20, i1 false)			call void @llvm.memset.p0i8.i64(i8* align 4 %1, i8 0, i64 20, i1 false)
	%2 = getelementptr inbounds %struct.key_t, %struct.key_t* %key, i64 0, i32 1, i64 0			%2 = getelementptr inbounds %struct.key_t, %struct.key_t* %key, i64 0, i32 1, i64 0
	call void @test1(i8* %2)			call void @test1(i8* %2)
	ret i32 0			ret i32 0
	}			}

	declare void @llvm.memset.p0i8.i64(i8* nocapture, i8, i64, i1)			declare void @llvm.memset.p0i8.i64(i8* nocapture, i8, i64, i1)

	declare void @test1(i8*)			declare void @test1(i8*)

llvm/test/CodeGen/RISCV/mem64.ll

	Show First 20 Lines • Show All 362 Lines • ▼ Show 20 Lines
	; RV64I-NEXT: add a0, a1, a0			; RV64I-NEXT: add a0, a1, a0
	; RV64I-NEXT: sb zero, 0(a0)			; RV64I-NEXT: sb zero, 0(a0)
	; RV64I-NEXT: mv a0, a1			; RV64I-NEXT: mv a0, a1
	; RV64I-NEXT: call snork@plt			; RV64I-NEXT: call snork@plt
	; RV64I-NEXT: ld ra, 8(sp) # 8-byte Folded Reload			; RV64I-NEXT: ld ra, 8(sp) # 8-byte Folded Reload
	; RV64I-NEXT: addi sp, sp, 16			; RV64I-NEXT: addi sp, sp, 16
	; RV64I-NEXT: ret			; RV64I-NEXT: ret
	bb:			bb:
	%tmp = alloca %struct.quux, align 4			%tmp = alloca %struct.quux, align 8
	%tmp1 = getelementptr inbounds %struct.quux, %struct.quux* %tmp, i64 0, i32 1			%tmp1 = getelementptr inbounds %struct.quux, %struct.quux* %tmp, i64 0, i32 1
	%tmp2 = getelementptr inbounds %struct.quux, %struct.quux* %tmp, i64 0, i32 1, i64 %arg			%tmp2 = getelementptr inbounds %struct.quux, %struct.quux* %tmp, i64 0, i32 1, i64 %arg
	store i8 0, i8* %tmp2, align 1			store i8 0, i8* %tmp2, align 1
	call void @snork([0 x i8]* %tmp1)			call void @snork([0 x i8]* %tmp1)
	ret void			ret void
	}			}

	declare void @snork([0 x i8]*)			declare void @snork([0 x i8]*)

llvm/test/CodeGen/RISCV/vararg.ll

	Show First 20 Lines • Show All 132 Lines • ▼ Show 20 Lines
	; LP64-LP64F-LP64D-WITHFP-NEXT: sd a2, 16(s0)			; LP64-LP64F-LP64D-WITHFP-NEXT: sd a2, 16(s0)
	; LP64-LP64F-LP64D-WITHFP-NEXT: addi a0, s0, 12			; LP64-LP64F-LP64D-WITHFP-NEXT: addi a0, s0, 12
	; LP64-LP64F-LP64D-WITHFP-NEXT: sd a0, -24(s0)			; LP64-LP64F-LP64D-WITHFP-NEXT: sd a0, -24(s0)
	; LP64-LP64F-LP64D-WITHFP-NEXT: lw a0, 8(s0)			; LP64-LP64F-LP64D-WITHFP-NEXT: lw a0, 8(s0)
	; LP64-LP64F-LP64D-WITHFP-NEXT: ld ra, 24(sp) # 8-byte Folded Reload			; LP64-LP64F-LP64D-WITHFP-NEXT: ld ra, 24(sp) # 8-byte Folded Reload
	; LP64-LP64F-LP64D-WITHFP-NEXT: ld s0, 16(sp) # 8-byte Folded Reload			; LP64-LP64F-LP64D-WITHFP-NEXT: ld s0, 16(sp) # 8-byte Folded Reload
	; LP64-LP64F-LP64D-WITHFP-NEXT: addi sp, sp, 96			; LP64-LP64F-LP64D-WITHFP-NEXT: addi sp, sp, 96
	; LP64-LP64F-LP64D-WITHFP-NEXT: ret			; LP64-LP64F-LP64D-WITHFP-NEXT: ret
	%va = alloca i8*, align 4			%va = alloca i8*
	%1 = bitcast i8** %va to i8*			%1 = bitcast i8** %va to i8*
	call void @llvm.va_start(i8* %1)			call void @llvm.va_start(i8* %1)
	%argp.cur = load i8, i8* %va, align 4			%argp.cur = load i8, i8* %va, align 4
	%argp.next = getelementptr inbounds i8, i8* %argp.cur, i32 4			%argp.next = getelementptr inbounds i8, i8* %argp.cur, i32 4
	store i8* %argp.next, i8** %va, align 4			store i8* %argp.next, i8** %va, align 4
	%2 = bitcast i8* %argp.cur to i32*			%2 = bitcast i8* %argp.cur to i32*
	%3 = load i32, i32* %2, align 4			%3 = load i32, i32* %2, align 4
	call void @llvm.va_end(i8* %1)			call void @llvm.va_end(i8* %1)
	▲ Show 20 Lines • Show All 448 Lines • ▼ Show 20 Lines
	; LP64-LP64F-LP64D-WITHFP-NEXT: sd a1, -24(s0)			; LP64-LP64F-LP64D-WITHFP-NEXT: sd a1, -24(s0)
	; LP64-LP64F-LP64D-WITHFP-NEXT: srliw a0, a0, 3			; LP64-LP64F-LP64D-WITHFP-NEXT: srliw a0, a0, 3
	; LP64-LP64F-LP64D-WITHFP-NEXT: slli a0, a0, 3			; LP64-LP64F-LP64D-WITHFP-NEXT: slli a0, a0, 3
	; LP64-LP64F-LP64D-WITHFP-NEXT: ld a0, 0(a0)			; LP64-LP64F-LP64D-WITHFP-NEXT: ld a0, 0(a0)
	; LP64-LP64F-LP64D-WITHFP-NEXT: ld ra, 24(sp) # 8-byte Folded Reload			; LP64-LP64F-LP64D-WITHFP-NEXT: ld ra, 24(sp) # 8-byte Folded Reload
	; LP64-LP64F-LP64D-WITHFP-NEXT: ld s0, 16(sp) # 8-byte Folded Reload			; LP64-LP64F-LP64D-WITHFP-NEXT: ld s0, 16(sp) # 8-byte Folded Reload
	; LP64-LP64F-LP64D-WITHFP-NEXT: addi sp, sp, 96			; LP64-LP64F-LP64D-WITHFP-NEXT: addi sp, sp, 96
	; LP64-LP64F-LP64D-WITHFP-NEXT: ret			; LP64-LP64F-LP64D-WITHFP-NEXT: ret
	%va = alloca i8*, align 4			%va = alloca i8*
	%1 = bitcast i8** %va to i8*			%1 = bitcast i8** %va to i8*
	call void @llvm.va_start(i8* %1)			call void @llvm.va_start(i8* %1)
	%2 = bitcast i8** %va to i32*			%2 = bitcast i8** %va to i32*
	%argp.cur = load i32, i32* %2, align 4			%argp.cur = load i32, i32* %2, align 4
	%3 = add i32 %argp.cur, 7			%3 = add i32 %argp.cur, 7
	%4 = and i32 %3, -8			%4 = and i32 %3, -8
	%argp.cur.aligned = inttoptr i32 %3 to i8*			%argp.cur.aligned = inttoptr i32 %3 to i8*
	%argp.next = getelementptr inbounds i8, i8* %argp.cur.aligned, i32 8			%argp.next = getelementptr inbounds i8, i8* %argp.cur.aligned, i32 8
	▲ Show 20 Lines • Show All 105 Lines • ▼ Show 20 Lines
	; LP64-LP64F-LP64D-WITHFP-NEXT: sd a2, 16(s0)			; LP64-LP64F-LP64D-WITHFP-NEXT: sd a2, 16(s0)
	; LP64-LP64F-LP64D-WITHFP-NEXT: sd a1, 8(s0)			; LP64-LP64F-LP64D-WITHFP-NEXT: sd a1, 8(s0)
	; LP64-LP64F-LP64D-WITHFP-NEXT: addi a1, s0, 16			; LP64-LP64F-LP64D-WITHFP-NEXT: addi a1, s0, 16
	; LP64-LP64F-LP64D-WITHFP-NEXT: sd a1, -24(s0)			; LP64-LP64F-LP64D-WITHFP-NEXT: sd a1, -24(s0)
	; LP64-LP64F-LP64D-WITHFP-NEXT: ld ra, 24(sp) # 8-byte Folded Reload			; LP64-LP64F-LP64D-WITHFP-NEXT: ld ra, 24(sp) # 8-byte Folded Reload
	; LP64-LP64F-LP64D-WITHFP-NEXT: ld s0, 16(sp) # 8-byte Folded Reload			; LP64-LP64F-LP64D-WITHFP-NEXT: ld s0, 16(sp) # 8-byte Folded Reload
	; LP64-LP64F-LP64D-WITHFP-NEXT: addi sp, sp, 96			; LP64-LP64F-LP64D-WITHFP-NEXT: addi sp, sp, 96
	; LP64-LP64F-LP64D-WITHFP-NEXT: ret			; LP64-LP64F-LP64D-WITHFP-NEXT: ret
	%va = alloca i8*, align 4			%va = alloca i8*
	%1 = bitcast i8** %va to i8*			%1 = bitcast i8** %va to i8*
	call void @llvm.va_start(i8* %1)			call void @llvm.va_start(i8* %1)
	%2 = va_arg i8** %va, double			%2 = va_arg i8** %va, double
	call void @llvm.va_end(i8* %1)			call void @llvm.va_end(i8* %1)
	%3 = bitcast double %2 to i64			%3 = bitcast double %2 to i64
	ret i64 %3			ret i64 %3
	}			}

	▲ Show 20 Lines • Show All 181 Lines • ▼ Show 20 Lines
	; LP64-LP64F-LP64D-WITHFP-NEXT: srliw a0, a0, 3			; LP64-LP64F-LP64D-WITHFP-NEXT: srliw a0, a0, 3
	; LP64-LP64F-LP64D-WITHFP-NEXT: slli a0, a0, 3			; LP64-LP64F-LP64D-WITHFP-NEXT: slli a0, a0, 3
	; LP64-LP64F-LP64D-WITHFP-NEXT: ld a0, 0(a0)			; LP64-LP64F-LP64D-WITHFP-NEXT: ld a0, 0(a0)
	; LP64-LP64F-LP64D-WITHFP-NEXT: add a0, a1, a0			; LP64-LP64F-LP64D-WITHFP-NEXT: add a0, a1, a0
	; LP64-LP64F-LP64D-WITHFP-NEXT: ld ra, 24(sp) # 8-byte Folded Reload			; LP64-LP64F-LP64D-WITHFP-NEXT: ld ra, 24(sp) # 8-byte Folded Reload
	; LP64-LP64F-LP64D-WITHFP-NEXT: ld s0, 16(sp) # 8-byte Folded Reload			; LP64-LP64F-LP64D-WITHFP-NEXT: ld s0, 16(sp) # 8-byte Folded Reload
	; LP64-LP64F-LP64D-WITHFP-NEXT: addi sp, sp, 80			; LP64-LP64F-LP64D-WITHFP-NEXT: addi sp, sp, 80
	; LP64-LP64F-LP64D-WITHFP-NEXT: ret			; LP64-LP64F-LP64D-WITHFP-NEXT: ret
	%va = alloca i8*, align 4			%va = alloca i8*
	%1 = bitcast i8** %va to i8*			%1 = bitcast i8** %va to i8*
	call void @llvm.va_start(i8* %1)			call void @llvm.va_start(i8* %1)
	%2 = bitcast i8** %va to i32*			%2 = bitcast i8** %va to i32*
	%argp.cur = load i32, i32* %2, align 4			%argp.cur = load i32, i32* %2, align 4
	%3 = add i32 %argp.cur, 7			%3 = add i32 %argp.cur, 7
	%4 = and i32 %3, -8			%4 = and i32 %3, -8
	%argp.cur.aligned = inttoptr i32 %3 to i8*			%argp.cur.aligned = inttoptr i32 %3 to i8*
	%argp.next = getelementptr inbounds i8, i8* %argp.cur.aligned, i32 8			%argp.next = getelementptr inbounds i8, i8* %argp.cur.aligned, i32 8
	▲ Show 20 Lines • Show All 110 Lines • ▼ Show 20 Lines
	; LP64-LP64F-LP64D-WITHFP-NEXT: sd a2, 0(s0)			; LP64-LP64F-LP64D-WITHFP-NEXT: sd a2, 0(s0)
	; LP64-LP64F-LP64D-WITHFP-NEXT: addi a3, s0, 8			; LP64-LP64F-LP64D-WITHFP-NEXT: addi a3, s0, 8
	; LP64-LP64F-LP64D-WITHFP-NEXT: add a0, a1, a2			; LP64-LP64F-LP64D-WITHFP-NEXT: add a0, a1, a2
	; LP64-LP64F-LP64D-WITHFP-NEXT: sd a3, -24(s0)			; LP64-LP64F-LP64D-WITHFP-NEXT: sd a3, -24(s0)
	; LP64-LP64F-LP64D-WITHFP-NEXT: ld ra, 24(sp) # 8-byte Folded Reload			; LP64-LP64F-LP64D-WITHFP-NEXT: ld ra, 24(sp) # 8-byte Folded Reload
	; LP64-LP64F-LP64D-WITHFP-NEXT: ld s0, 16(sp) # 8-byte Folded Reload			; LP64-LP64F-LP64D-WITHFP-NEXT: ld s0, 16(sp) # 8-byte Folded Reload
	; LP64-LP64F-LP64D-WITHFP-NEXT: addi sp, sp, 80			; LP64-LP64F-LP64D-WITHFP-NEXT: addi sp, sp, 80
	; LP64-LP64F-LP64D-WITHFP-NEXT: ret			; LP64-LP64F-LP64D-WITHFP-NEXT: ret
	%va = alloca i8*, align 4			%va = alloca i8*
	%1 = bitcast i8** %va to i8*			%1 = bitcast i8** %va to i8*
	call void @llvm.va_start(i8* %1)			call void @llvm.va_start(i8* %1)
	%2 = va_arg i8** %va, double			%2 = va_arg i8** %va, double
	call void @llvm.va_end(i8* %1)			call void @llvm.va_end(i8* %1)
	%3 = bitcast double %2 to i64			%3 = bitcast double %2 to i64
	%4 = add i64 %b, %3			%4 = add i64 %b, %3
	ret i64 %4			ret i64 %4
	}			}
	▲ Show 20 Lines • Show All 284 Lines • ▼ Show 20 Lines
	; LP64-LP64F-LP64D-WITHFP-NEXT: addw a1, a1, s1			; LP64-LP64F-LP64D-WITHFP-NEXT: addw a1, a1, s1
	; LP64-LP64F-LP64D-WITHFP-NEXT: addw a1, a1, a2			; LP64-LP64F-LP64D-WITHFP-NEXT: addw a1, a1, a2
	; LP64-LP64F-LP64D-WITHFP-NEXT: addw a0, a1, a0			; LP64-LP64F-LP64D-WITHFP-NEXT: addw a0, a1, a0
	; LP64-LP64F-LP64D-WITHFP-NEXT: ld ra, 40(sp) # 8-byte Folded Reload			; LP64-LP64F-LP64D-WITHFP-NEXT: ld ra, 40(sp) # 8-byte Folded Reload
	; LP64-LP64F-LP64D-WITHFP-NEXT: ld s0, 32(sp) # 8-byte Folded Reload			; LP64-LP64F-LP64D-WITHFP-NEXT: ld s0, 32(sp) # 8-byte Folded Reload
	; LP64-LP64F-LP64D-WITHFP-NEXT: ld s1, 24(sp) # 8-byte Folded Reload			; LP64-LP64F-LP64D-WITHFP-NEXT: ld s1, 24(sp) # 8-byte Folded Reload
	; LP64-LP64F-LP64D-WITHFP-NEXT: addi sp, sp, 112			; LP64-LP64F-LP64D-WITHFP-NEXT: addi sp, sp, 112
	; LP64-LP64F-LP64D-WITHFP-NEXT: ret			; LP64-LP64F-LP64D-WITHFP-NEXT: ret
	%vargs = alloca i8*, align 4			%vargs = alloca i8*
	%wargs = alloca i8*, align 4			%wargs = alloca i8*
	%1 = bitcast i8** %vargs to i8*			%1 = bitcast i8** %vargs to i8*
	%2 = bitcast i8** %wargs to i8*			%2 = bitcast i8** %wargs to i8*
	call void @llvm.va_start(i8* %1)			call void @llvm.va_start(i8* %1)
	%3 = va_arg i8** %vargs, i32			%3 = va_arg i8** %vargs, i32
	call void @llvm.va_copy(i8* %2, i8* %1)			call void @llvm.va_copy(i8* %2, i8* %1)
	%4 = load i8, i8* %wargs, align 4			%4 = load i8, i8* %wargs, align 4
	call void @notdead(i8* %4)			call void @notdead(i8* %4)
	%5 = va_arg i8** %vargs, i32			%5 = va_arg i8** %vargs, i32
	▲ Show 20 Lines • Show All 303 Lines • ▼ Show 20 Lines
	; LP64-LP64F-LP64D-WITHFP-NEXT: sd a1, 8(s0)			; LP64-LP64F-LP64D-WITHFP-NEXT: sd a1, 8(s0)
	; LP64-LP64F-LP64D-WITHFP-NEXT: sd a0, 0(s0)			; LP64-LP64F-LP64D-WITHFP-NEXT: sd a0, 0(s0)
	; LP64-LP64F-LP64D-WITHFP-NEXT: addi a1, s0, 8			; LP64-LP64F-LP64D-WITHFP-NEXT: addi a1, s0, 8
	; LP64-LP64F-LP64D-WITHFP-NEXT: sd a1, -24(s0)			; LP64-LP64F-LP64D-WITHFP-NEXT: sd a1, -24(s0)
	; LP64-LP64F-LP64D-WITHFP-NEXT: ld ra, 24(sp) # 8-byte Folded Reload			; LP64-LP64F-LP64D-WITHFP-NEXT: ld ra, 24(sp) # 8-byte Folded Reload
	; LP64-LP64F-LP64D-WITHFP-NEXT: ld s0, 16(sp) # 8-byte Folded Reload			; LP64-LP64F-LP64D-WITHFP-NEXT: ld s0, 16(sp) # 8-byte Folded Reload
	; LP64-LP64F-LP64D-WITHFP-NEXT: addi sp, sp, 96			; LP64-LP64F-LP64D-WITHFP-NEXT: addi sp, sp, 96
	; LP64-LP64F-LP64D-WITHFP-NEXT: ret			; LP64-LP64F-LP64D-WITHFP-NEXT: ret
	%va = alloca i8*, align 4			%va = alloca i8*
	%1 = bitcast i8** %va to i8*			%1 = bitcast i8** %va to i8*
	call void @llvm.va_start(i8* %1)			call void @llvm.va_start(i8* %1)
	%2 = va_arg i8** %va, i32			%2 = va_arg i8** %va, i32
	call void @llvm.va_end(i8* %1)			call void @llvm.va_end(i8* %1)
	ret i32 %2			ret i32 %2
	}			}

	; TODO: improve constant materialization of stack addresses			; TODO: improve constant materialization of stack addresses
	▲ Show 20 Lines • Show All 197 Lines • ▼ Show 20 Lines
	; LP64-LP64F-LP64D-WITHFP-NEXT: lui a1, 24414			; LP64-LP64F-LP64D-WITHFP-NEXT: lui a1, 24414
	; LP64-LP64F-LP64D-WITHFP-NEXT: addiw a1, a1, -1680			; LP64-LP64F-LP64D-WITHFP-NEXT: addiw a1, a1, -1680
	; LP64-LP64F-LP64D-WITHFP-NEXT: add sp, sp, a1			; LP64-LP64F-LP64D-WITHFP-NEXT: add sp, sp, a1
	; LP64-LP64F-LP64D-WITHFP-NEXT: ld ra, 1960(sp) # 8-byte Folded Reload			; LP64-LP64F-LP64D-WITHFP-NEXT: ld ra, 1960(sp) # 8-byte Folded Reload
	; LP64-LP64F-LP64D-WITHFP-NEXT: ld s0, 1952(sp) # 8-byte Folded Reload			; LP64-LP64F-LP64D-WITHFP-NEXT: ld s0, 1952(sp) # 8-byte Folded Reload
	; LP64-LP64F-LP64D-WITHFP-NEXT: addi sp, sp, 2032			; LP64-LP64F-LP64D-WITHFP-NEXT: addi sp, sp, 2032
	; LP64-LP64F-LP64D-WITHFP-NEXT: ret			; LP64-LP64F-LP64D-WITHFP-NEXT: ret
	%large = alloca [ 100000000 x i8 ]			%large = alloca [ 100000000 x i8 ]
	%va = alloca i8*, align 4			%va = alloca i8*
	%1 = bitcast i8** %va to i8*			%1 = bitcast i8** %va to i8*
	call void @llvm.va_start(i8* %1)			call void @llvm.va_start(i8* %1)
	%argp.cur = load i8, i8* %va, align 4			%argp.cur = load i8, i8* %va, align 4
	%argp.next = getelementptr inbounds i8, i8* %argp.cur, i32 4			%argp.next = getelementptr inbounds i8, i8* %argp.cur, i32 4
	store i8* %argp.next, i8** %va, align 4			store i8* %argp.next, i8** %va, align 4
	%2 = bitcast i8* %argp.cur to i32*			%2 = bitcast i8* %argp.cur to i32*
	%3 = load i32, i32* %2, align 4			%3 = load i32, i32* %2, align 4
	call void @llvm.va_end(i8* %1)			call void @llvm.va_end(i8* %1)
	ret i32 %3			ret i32 %3
	}			}

llvm/test/CodeGen/Thumb2/mve-stack.ll

	Show All 9 Lines
	; CHECK-NEXT: sub sp, #16			; CHECK-NEXT: sub sp, #16
	; CHECK-NEXT: vmov.i32 q0, #0x0			; CHECK-NEXT: vmov.i32 q0, #0x0
	; CHECK-NEXT: mov r0, sp			; CHECK-NEXT: mov r0, sp
	; CHECK-NEXT: vstrw.32 q0, [sp, #8]			; CHECK-NEXT: vstrw.32 q0, [sp, #8]
	; CHECK-NEXT: bl func			; CHECK-NEXT: bl func
	; CHECK-NEXT: add sp, #16			; CHECK-NEXT: add sp, #16
	; CHECK-NEXT: pop {r7, pc}			; CHECK-NEXT: pop {r7, pc}
	entry:			entry:
	%d = alloca [4 x i32], align 2			%d = alloca [4 x i32], align 4
	%g = getelementptr inbounds [4 x i32], [4 x i32]* %d, i32 0, i32 2			%g = getelementptr inbounds [4 x i32], [4 x i32]* %d, i32 0, i32 2
	%b = bitcast i32* %g to <4 x i32>*			%b = bitcast i32* %g to <4 x i32>*
	store <4 x i32> zeroinitializer, <4 x i32>* %b, align 2			store <4 x i32> zeroinitializer, <4 x i32>* %b, align 2
	%arraydecay = getelementptr inbounds [4 x i32], [4 x i32]* %d, i32 0, i32 0			%arraydecay = getelementptr inbounds [4 x i32], [4 x i32]* %d, i32 0, i32 0
	call arm_aapcs_vfpcc void bitcast (void (...)* @func to void (i32))(i32* %arraydecay)			call arm_aapcs_vfpcc void bitcast (void (...)* @func to void (i32))(i32* %arraydecay)
	ret void			ret void
	}			}

	▲ Show 20 Lines • Show All 121 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: .pad #16			; CHECK-NEXT: .pad #16
	; CHECK-NEXT: sub sp, #16			; CHECK-NEXT: sub sp, #16
	; CHECK-NEXT: mov r0, sp			; CHECK-NEXT: mov r0, sp
	; CHECK-NEXT: bl func			; CHECK-NEXT: bl func
	; CHECK-NEXT: vldrw.u32 q0, [sp, #8]			; CHECK-NEXT: vldrw.u32 q0, [sp, #8]
	; CHECK-NEXT: add sp, #16			; CHECK-NEXT: add sp, #16
	; CHECK-NEXT: pop {r7, pc}			; CHECK-NEXT: pop {r7, pc}
	entry:			entry:
	%d = alloca [4 x i32], align 2			%d = alloca [4 x i32], align 4
	%arraydecay = getelementptr inbounds [4 x i32], [4 x i32]* %d, i32 0, i32 0			%arraydecay = getelementptr inbounds [4 x i32], [4 x i32]* %d, i32 0, i32 0
	call arm_aapcs_vfpcc void bitcast (void (...)* @func to void (i32))(i32* %arraydecay)			call arm_aapcs_vfpcc void bitcast (void (...)* @func to void (i32))(i32* %arraydecay)
	%g = getelementptr inbounds [4 x i32], [4 x i32]* %d, i32 0, i32 2			%g = getelementptr inbounds [4 x i32], [4 x i32]* %d, i32 0, i32 2
	%b = bitcast i32* %g to <4 x i32>*			%b = bitcast i32* %g to <4 x i32>*
	%l = load <4 x i32>, <4 x i32>* %b, align 2			%l = load <4 x i32>, <4 x i32>* %b, align 2
	ret <4 x i32> %l			ret <4 x i32> %l
	}			}

	▲ Show 20 Lines • Show All 114 Lines • Show Last 20 Lines

llvm/test/CodeGen/VE/Scalar/atomic_cmp_swap.ll

	Show First 20 Lines • Show All 1,449 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: cmov.w.eq %s2, (63)0, %s4			; CHECK-NEXT: cmov.w.eq %s2, (63)0, %s4
	; CHECK-NEXT: breq.w %s1, %s3, .LBB33_2			; CHECK-NEXT: breq.w %s1, %s3, .LBB33_2
	; CHECK-NEXT: # %bb.1:			; CHECK-NEXT: # %bb.1:
	; CHECK-NEXT: st1b %s1, (, %s0)			; CHECK-NEXT: st1b %s1, (, %s0)
	; CHECK-NEXT: .LBB33_2:			; CHECK-NEXT: .LBB33_2:
	; CHECK-NEXT: adds.w.zx %s0, %s2, (0)1			; CHECK-NEXT: adds.w.zx %s0, %s2, (0)1
	; CHECK-NEXT: adds.l %s11, 16, %s11			; CHECK-NEXT: adds.l %s11, 16, %s11
	; CHECK-NEXT: b.l.t (, %s10)			; CHECK-NEXT: b.l.t (, %s10)
	%3 = alloca %"struct.std::__1::atomic", align 1			%3 = alloca %"struct.std::__1::atomic", align 8
	%4 = getelementptr inbounds %"struct.std::__1::atomic", %"struct.std::__1::atomic"* %3, i64 0, i32 0, i32 0, i32 0, i32 0			%4 = getelementptr inbounds %"struct.std::__1::atomic", %"struct.std::__1::atomic"* %3, i64 0, i32 0, i32 0, i32 0, i32 0
	call void @llvm.lifetime.start.p0i8(i64 1, i8* nonnull %4)			call void @llvm.lifetime.start.p0i8(i64 1, i8* nonnull %4)
	%5 = zext i1 %1 to i8			%5 = zext i1 %1 to i8
	%6 = load i8, i8* %0, align 1			%6 = load i8, i8* %0, align 1
	%7 = cmpxchg weak volatile i8* %4, i8 %6, i8 %5 monotonic monotonic			%7 = cmpxchg weak volatile i8* %4, i8 %6, i8 %5 monotonic monotonic
	%8 = extractvalue { i8, i1 } %7, 1			%8 = extractvalue { i8, i1 } %7, 1
	br i1 %8, label %11, label %9			br i1 %8, label %11, label %9

	▲ Show 20 Lines • Show All 43 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: cmov.w.eq %s2, (63)0, %s4			; CHECK-NEXT: cmov.w.eq %s2, (63)0, %s4
	; CHECK-NEXT: breq.w %s1, %s3, .LBB34_2			; CHECK-NEXT: breq.w %s1, %s3, .LBB34_2
	; CHECK-NEXT: # %bb.1:			; CHECK-NEXT: # %bb.1:
	; CHECK-NEXT: st1b %s1, (, %s0)			; CHECK-NEXT: st1b %s1, (, %s0)
	; CHECK-NEXT: .LBB34_2:			; CHECK-NEXT: .LBB34_2:
	; CHECK-NEXT: adds.w.zx %s0, %s2, (0)1			; CHECK-NEXT: adds.w.zx %s0, %s2, (0)1
	; CHECK-NEXT: adds.l %s11, 16, %s11			; CHECK-NEXT: adds.l %s11, 16, %s11
	; CHECK-NEXT: b.l.t (, %s10)			; CHECK-NEXT: b.l.t (, %s10)
	%3 = alloca %"struct.std::__1::atomic.0", align 1			%3 = alloca %"struct.std::__1::atomic.0", align 8
	%4 = getelementptr inbounds %"struct.std::__1::atomic.0", %"struct.std::__1::atomic.0"* %3, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0			%4 = getelementptr inbounds %"struct.std::__1::atomic.0", %"struct.std::__1::atomic.0"* %3, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0
	call void @llvm.lifetime.start.p0i8(i64 1, i8* nonnull %4)			call void @llvm.lifetime.start.p0i8(i64 1, i8* nonnull %4)
	%5 = load i8, i8* %0, align 1			%5 = load i8, i8* %0, align 1
	%6 = cmpxchg weak volatile i8* %4, i8 %5, i8 %1 monotonic monotonic			%6 = cmpxchg weak volatile i8* %4, i8 %5, i8 %1 monotonic monotonic
	%7 = extractvalue { i8, i1 } %6, 1			%7 = extractvalue { i8, i1 } %6, 1
	br i1 %7, label %10, label %8			br i1 %7, label %10, label %8

	8: ; preds = %2			8: ; preds = %2
	Show All 36 Lines
	; CHECK-NEXT: cmov.w.eq %s2, (63)0, %s4			; CHECK-NEXT: cmov.w.eq %s2, (63)0, %s4
	; CHECK-NEXT: breq.w %s1, %s3, .LBB35_2			; CHECK-NEXT: breq.w %s1, %s3, .LBB35_2
	; CHECK-NEXT: # %bb.1:			; CHECK-NEXT: # %bb.1:
	; CHECK-NEXT: st1b %s1, (, %s0)			; CHECK-NEXT: st1b %s1, (, %s0)
	; CHECK-NEXT: .LBB35_2:			; CHECK-NEXT: .LBB35_2:
	; CHECK-NEXT: adds.w.zx %s0, %s2, (0)1			; CHECK-NEXT: adds.w.zx %s0, %s2, (0)1
	; CHECK-NEXT: adds.l %s11, 16, %s11			; CHECK-NEXT: adds.l %s11, 16, %s11
	; CHECK-NEXT: b.l.t (, %s10)			; CHECK-NEXT: b.l.t (, %s10)
	%3 = alloca %"struct.std::__1::atomic.5", align 1			%3 = alloca %"struct.std::__1::atomic.5", align 8
	%4 = getelementptr inbounds %"struct.std::__1::atomic.5", %"struct.std::__1::atomic.5"* %3, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0			%4 = getelementptr inbounds %"struct.std::__1::atomic.5", %"struct.std::__1::atomic.5"* %3, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0
	call void @llvm.lifetime.start.p0i8(i64 1, i8* nonnull %4)			call void @llvm.lifetime.start.p0i8(i64 1, i8* nonnull %4)
	%5 = load i8, i8* %0, align 1			%5 = load i8, i8* %0, align 1
	%6 = cmpxchg weak volatile i8* %4, i8 %5, i8 %1 monotonic monotonic			%6 = cmpxchg weak volatile i8* %4, i8 %5, i8 %1 monotonic monotonic
	%7 = extractvalue { i8, i1 } %6, 1			%7 = extractvalue { i8, i1 } %6, 1
	br i1 %7, label %10, label %8			br i1 %7, label %10, label %8

	8: ; preds = %2			8: ; preds = %2
	Show All 37 Lines
	; CHECK-NEXT: cmov.w.eq %s2, (63)0, %s4			; CHECK-NEXT: cmov.w.eq %s2, (63)0, %s4
	; CHECK-NEXT: breq.w %s1, %s3, .LBB36_2			; CHECK-NEXT: breq.w %s1, %s3, .LBB36_2
	; CHECK-NEXT: # %bb.1:			; CHECK-NEXT: # %bb.1:
	; CHECK-NEXT: st2b %s1, (, %s0)			; CHECK-NEXT: st2b %s1, (, %s0)
	; CHECK-NEXT: .LBB36_2:			; CHECK-NEXT: .LBB36_2:
	; CHECK-NEXT: adds.w.zx %s0, %s2, (0)1			; CHECK-NEXT: adds.w.zx %s0, %s2, (0)1
	; CHECK-NEXT: adds.l %s11, 16, %s11			; CHECK-NEXT: adds.l %s11, 16, %s11
	; CHECK-NEXT: b.l.t (, %s10)			; CHECK-NEXT: b.l.t (, %s10)
	%3 = alloca %"struct.std::__1::atomic.10", align 2			%3 = alloca %"struct.std::__1::atomic.10", align 8
	%4 = bitcast %"struct.std::__1::atomic.10"* %3 to i8*			%4 = bitcast %"struct.std::__1::atomic.10"* %3 to i8*
	call void @llvm.lifetime.start.p0i8(i64 2, i8* nonnull %4)			call void @llvm.lifetime.start.p0i8(i64 2, i8* nonnull %4)
	%5 = getelementptr inbounds %"struct.std::__1::atomic.10", %"struct.std::__1::atomic.10"* %3, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0			%5 = getelementptr inbounds %"struct.std::__1::atomic.10", %"struct.std::__1::atomic.10"* %3, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0
	%6 = load i16, i16* %0, align 2			%6 = load i16, i16* %0, align 2
	%7 = cmpxchg weak volatile i16* %5, i16 %6, i16 %1 monotonic monotonic			%7 = cmpxchg weak volatile i16* %5, i16 %6, i16 %1 monotonic monotonic
	%8 = extractvalue { i16, i1 } %7, 1			%8 = extractvalue { i16, i1 } %7, 1
	br i1 %8, label %11, label %9			br i1 %8, label %11, label %9

	Show All 37 Lines
	; CHECK-NEXT: cmov.w.eq %s2, (63)0, %s4			; CHECK-NEXT: cmov.w.eq %s2, (63)0, %s4
	; CHECK-NEXT: breq.w %s1, %s3, .LBB37_2			; CHECK-NEXT: breq.w %s1, %s3, .LBB37_2
	; CHECK-NEXT: # %bb.1:			; CHECK-NEXT: # %bb.1:
	; CHECK-NEXT: st2b %s1, (, %s0)			; CHECK-NEXT: st2b %s1, (, %s0)
	; CHECK-NEXT: .LBB37_2:			; CHECK-NEXT: .LBB37_2:
	; CHECK-NEXT: adds.w.zx %s0, %s2, (0)1			; CHECK-NEXT: adds.w.zx %s0, %s2, (0)1
	; CHECK-NEXT: adds.l %s11, 16, %s11			; CHECK-NEXT: adds.l %s11, 16, %s11
	; CHECK-NEXT: b.l.t (, %s10)			; CHECK-NEXT: b.l.t (, %s10)
	%3 = alloca %"struct.std::__1::atomic.15", align 2			%3 = alloca %"struct.std::__1::atomic.15", align 8
	%4 = bitcast %"struct.std::__1::atomic.15"* %3 to i8*			%4 = bitcast %"struct.std::__1::atomic.15"* %3 to i8*
	call void @llvm.lifetime.start.p0i8(i64 2, i8* nonnull %4)			call void @llvm.lifetime.start.p0i8(i64 2, i8* nonnull %4)
	%5 = getelementptr inbounds %"struct.std::__1::atomic.15", %"struct.std::__1::atomic.15"* %3, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0			%5 = getelementptr inbounds %"struct.std::__1::atomic.15", %"struct.std::__1::atomic.15"* %3, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0
	%6 = load i16, i16* %0, align 2			%6 = load i16, i16* %0, align 2
	%7 = cmpxchg weak volatile i16* %5, i16 %6, i16 %1 monotonic monotonic			%7 = cmpxchg weak volatile i16* %5, i16 %6, i16 %1 monotonic monotonic
	%8 = extractvalue { i16, i1 } %7, 1			%8 = extractvalue { i16, i1 } %7, 1
	br i1 %8, label %11, label %9			br i1 %8, label %11, label %9

	Show All 31 Lines
	; CHECK-NEXT: cmov.w.eq %s2, (63)0, %s4			; CHECK-NEXT: cmov.w.eq %s2, (63)0, %s4
	; CHECK-NEXT: breq.w %s1, %s3, .LBB38_2			; CHECK-NEXT: breq.w %s1, %s3, .LBB38_2
	; CHECK-NEXT: # %bb.1:			; CHECK-NEXT: # %bb.1:
	; CHECK-NEXT: stl %s1, (, %s0)			; CHECK-NEXT: stl %s1, (, %s0)
	; CHECK-NEXT: .LBB38_2:			; CHECK-NEXT: .LBB38_2:
	; CHECK-NEXT: adds.w.zx %s0, %s2, (0)1			; CHECK-NEXT: adds.w.zx %s0, %s2, (0)1
	; CHECK-NEXT: adds.l %s11, 16, %s11			; CHECK-NEXT: adds.l %s11, 16, %s11
	; CHECK-NEXT: b.l.t (, %s10)			; CHECK-NEXT: b.l.t (, %s10)
	%3 = alloca %"struct.std::__1::atomic.20", align 4			%3 = alloca %"struct.std::__1::atomic.20", align 8
	%4 = bitcast %"struct.std::__1::atomic.20"* %3 to i8*			%4 = bitcast %"struct.std::__1::atomic.20"* %3 to i8*
	call void @llvm.lifetime.start.p0i8(i64 4, i8* nonnull %4)			call void @llvm.lifetime.start.p0i8(i64 4, i8* nonnull %4)
	%5 = getelementptr inbounds %"struct.std::__1::atomic.20", %"struct.std::__1::atomic.20"* %3, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0			%5 = getelementptr inbounds %"struct.std::__1::atomic.20", %"struct.std::__1::atomic.20"* %3, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0
	%6 = load i32, i32* %0, align 4			%6 = load i32, i32* %0, align 4
	%7 = cmpxchg weak volatile i32* %5, i32 %6, i32 %1 monotonic monotonic			%7 = cmpxchg weak volatile i32* %5, i32 %6, i32 %1 monotonic monotonic
	%8 = extractvalue { i32, i1 } %7, 1			%8 = extractvalue { i32, i1 } %7, 1
	br i1 %8, label %11, label %9			br i1 %8, label %11, label %9

	Show All 31 Lines
	; CHECK-NEXT: cmov.w.eq %s2, (63)0, %s4			; CHECK-NEXT: cmov.w.eq %s2, (63)0, %s4
	; CHECK-NEXT: breq.w %s1, %s3, .LBB39_2			; CHECK-NEXT: breq.w %s1, %s3, .LBB39_2
	; CHECK-NEXT: # %bb.1:			; CHECK-NEXT: # %bb.1:
	; CHECK-NEXT: stl %s1, (, %s0)			; CHECK-NEXT: stl %s1, (, %s0)
	; CHECK-NEXT: .LBB39_2:			; CHECK-NEXT: .LBB39_2:
	; CHECK-NEXT: adds.w.zx %s0, %s2, (0)1			; CHECK-NEXT: adds.w.zx %s0, %s2, (0)1
	; CHECK-NEXT: adds.l %s11, 16, %s11			; CHECK-NEXT: adds.l %s11, 16, %s11
	; CHECK-NEXT: b.l.t (, %s10)			; CHECK-NEXT: b.l.t (, %s10)
	%3 = alloca %"struct.std::__1::atomic.25", align 4			%3 = alloca %"struct.std::__1::atomic.25", align 8
	%4 = bitcast %"struct.std::__1::atomic.25"* %3 to i8*			%4 = bitcast %"struct.std::__1::atomic.25"* %3 to i8*
	call void @llvm.lifetime.start.p0i8(i64 4, i8* nonnull %4)			call void @llvm.lifetime.start.p0i8(i64 4, i8* nonnull %4)
	%5 = getelementptr inbounds %"struct.std::__1::atomic.25", %"struct.std::__1::atomic.25"* %3, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0			%5 = getelementptr inbounds %"struct.std::__1::atomic.25", %"struct.std::__1::atomic.25"* %3, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0
	%6 = load i32, i32* %0, align 4			%6 = load i32, i32* %0, align 4
	%7 = cmpxchg weak volatile i32* %5, i32 %6, i32 %1 monotonic monotonic			%7 = cmpxchg weak volatile i32* %5, i32 %6, i32 %1 monotonic monotonic
	%8 = extractvalue { i32, i1 } %7, 1			%8 = extractvalue { i32, i1 } %7, 1
	br i1 %8, label %11, label %9			br i1 %8, label %11, label %9

	▲ Show 20 Lines • Show All 648 Lines • Show Last 20 Lines

llvm/test/CodeGen/VE/Scalar/atomic_load.ll

	Show First 20 Lines • Show All 554 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: lea %s0, _Z6fun_i1RNSt3__16atomicIbEE@lo			; CHECK-NEXT: lea %s0, _Z6fun_i1RNSt3__16atomicIbEE@lo
	; CHECK-NEXT: and %s0, %s0, (32)0			; CHECK-NEXT: and %s0, %s0, (32)0
	; CHECK-NEXT: lea.sl %s12, _Z6fun_i1RNSt3__16atomicIbEE@hi(, %s0)			; CHECK-NEXT: lea.sl %s12, _Z6fun_i1RNSt3__16atomicIbEE@hi(, %s0)
	; CHECK-NEXT: lea %s0, 248(, %s11)			; CHECK-NEXT: lea %s0, 248(, %s11)
	; CHECK-NEXT: bsic %s10, (, %s12)			; CHECK-NEXT: bsic %s10, (, %s12)
	; CHECK-NEXT: ld1b.zx %s0, 248(, %s11)			; CHECK-NEXT: ld1b.zx %s0, 248(, %s11)
	; CHECK-NEXT: and %s0, 1, %s0			; CHECK-NEXT: and %s0, 1, %s0
	; CHECK-NEXT: or %s11, 0, %s9			; CHECK-NEXT: or %s11, 0, %s9
	%1 = alloca %"struct.std::__1::atomic", align 1			%1 = alloca %"struct.std::__1::atomic", align 8
	%2 = getelementptr inbounds %"struct.std::__1::atomic", %"struct.std::__1::atomic"* %1, i64 0, i32 0, i32 0, i32 0, i32 0			%2 = getelementptr inbounds %"struct.std::__1::atomic", %"struct.std::__1::atomic"* %1, i64 0, i32 0, i32 0, i32 0, i32 0
	call void @llvm.lifetime.start.p0i8(i64 1, i8* nonnull %2)			call void @llvm.lifetime.start.p0i8(i64 1, i8* nonnull %2)
	call void @_Z6fun_i1RNSt3__16atomicIbEE(%"struct.std::__1::atomic"* nonnull align 1 dereferenceable(1) %1)			call void @_Z6fun_i1RNSt3__16atomicIbEE(%"struct.std::__1::atomic"* nonnull align 1 dereferenceable(1) %1)
	%3 = load atomic i8, i8* %2 monotonic, align 1			%3 = load atomic i8, i8* %2 monotonic, align 1
	%4 = and i8 %3, 1			%4 = and i8 %3, 1
	%5 = icmp ne i8 %4, 0			%5 = icmp ne i8 %4, 0
	call void @llvm.lifetime.end.p0i8(i64 1, i8* nonnull %2)			call void @llvm.lifetime.end.p0i8(i64 1, i8* nonnull %2)
	ret i1 %5			ret i1 %5
	Show All 13 Lines
	; CHECK: .LBB{{[0-9]+}}_2:			; CHECK: .LBB{{[0-9]+}}_2:
	; CHECK-NEXT: lea %s0, _Z6fun_i8RNSt3__16atomicIcEE@lo			; CHECK-NEXT: lea %s0, _Z6fun_i8RNSt3__16atomicIcEE@lo
	; CHECK-NEXT: and %s0, %s0, (32)0			; CHECK-NEXT: and %s0, %s0, (32)0
	; CHECK-NEXT: lea.sl %s12, _Z6fun_i8RNSt3__16atomicIcEE@hi(, %s0)			; CHECK-NEXT: lea.sl %s12, _Z6fun_i8RNSt3__16atomicIcEE@hi(, %s0)
	; CHECK-NEXT: lea %s0, 248(, %s11)			; CHECK-NEXT: lea %s0, 248(, %s11)
	; CHECK-NEXT: bsic %s10, (, %s12)			; CHECK-NEXT: bsic %s10, (, %s12)
	; CHECK-NEXT: ld1b.sx %s0, 248(, %s11)			; CHECK-NEXT: ld1b.sx %s0, 248(, %s11)
	; CHECK-NEXT: or %s11, 0, %s9			; CHECK-NEXT: or %s11, 0, %s9
	%1 = alloca %"struct.std::__1::atomic.0", align 1			%1 = alloca %"struct.std::__1::atomic.0", align 8
	%2 = getelementptr inbounds %"struct.std::__1::atomic.0", %"struct.std::__1::atomic.0"* %1, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0			%2 = getelementptr inbounds %"struct.std::__1::atomic.0", %"struct.std::__1::atomic.0"* %1, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0
	call void @llvm.lifetime.start.p0i8(i64 1, i8* nonnull %2)			call void @llvm.lifetime.start.p0i8(i64 1, i8* nonnull %2)
	call void @_Z6fun_i8RNSt3__16atomicIcEE(%"struct.std::__1::atomic.0"* nonnull align 1 dereferenceable(1) %1)			call void @_Z6fun_i8RNSt3__16atomicIcEE(%"struct.std::__1::atomic.0"* nonnull align 1 dereferenceable(1) %1)
	%3 = load atomic i8, i8* %2 monotonic, align 1			%3 = load atomic i8, i8* %2 monotonic, align 1
	call void @llvm.lifetime.end.p0i8(i64 1, i8* nonnull %2)			call void @llvm.lifetime.end.p0i8(i64 1, i8* nonnull %2)
	ret i8 %3			ret i8 %3
	}			}

	declare void @_Z6fun_i8RNSt3__16atomicIcEE(%"struct.std::__1::atomic.0"* nonnull align 1 dereferenceable(1))			declare void @_Z6fun_i8RNSt3__16atomicIcEE(%"struct.std::__1::atomic.0"* nonnull align 1 dereferenceable(1))

	; Function Attrs: mustprogress			; Function Attrs: mustprogress
	define zeroext i8 @_Z26atomic_load_relaxed_stk_u8v() {			define zeroext i8 @_Z26atomic_load_relaxed_stk_u8v() {
	; CHECK-LABEL: _Z26atomic_load_relaxed_stk_u8v:			; CHECK-LABEL: _Z26atomic_load_relaxed_stk_u8v:
	; CHECK: .LBB{{[0-9]+}}_2:			; CHECK: .LBB{{[0-9]+}}_2:
	; CHECK-NEXT: lea %s0, _Z6fun_u8RNSt3__16atomicIhEE@lo			; CHECK-NEXT: lea %s0, _Z6fun_u8RNSt3__16atomicIhEE@lo
	; CHECK-NEXT: and %s0, %s0, (32)0			; CHECK-NEXT: and %s0, %s0, (32)0
	; CHECK-NEXT: lea.sl %s12, _Z6fun_u8RNSt3__16atomicIhEE@hi(, %s0)			; CHECK-NEXT: lea.sl %s12, _Z6fun_u8RNSt3__16atomicIhEE@hi(, %s0)
	; CHECK-NEXT: lea %s0, 248(, %s11)			; CHECK-NEXT: lea %s0, 248(, %s11)
	; CHECK-NEXT: bsic %s10, (, %s12)			; CHECK-NEXT: bsic %s10, (, %s12)
	; CHECK-NEXT: ld1b.zx %s0, 248(, %s11)			; CHECK-NEXT: ld1b.zx %s0, 248(, %s11)
	; CHECK-NEXT: or %s11, 0, %s9			; CHECK-NEXT: or %s11, 0, %s9
	%1 = alloca %"struct.std::__1::atomic.5", align 1			%1 = alloca %"struct.std::__1::atomic.5", align 8
	%2 = getelementptr inbounds %"struct.std::__1::atomic.5", %"struct.std::__1::atomic.5"* %1, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0			%2 = getelementptr inbounds %"struct.std::__1::atomic.5", %"struct.std::__1::atomic.5"* %1, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0
	call void @llvm.lifetime.start.p0i8(i64 1, i8* nonnull %2)			call void @llvm.lifetime.start.p0i8(i64 1, i8* nonnull %2)
	call void @_Z6fun_u8RNSt3__16atomicIhEE(%"struct.std::__1::atomic.5"* nonnull align 1 dereferenceable(1) %1)			call void @_Z6fun_u8RNSt3__16atomicIhEE(%"struct.std::__1::atomic.5"* nonnull align 1 dereferenceable(1) %1)
	%3 = load atomic i8, i8* %2 monotonic, align 1			%3 = load atomic i8, i8* %2 monotonic, align 1
	call void @llvm.lifetime.end.p0i8(i64 1, i8* nonnull %2)			call void @llvm.lifetime.end.p0i8(i64 1, i8* nonnull %2)
	ret i8 %3			ret i8 %3
	}			}

	declare void @_Z6fun_u8RNSt3__16atomicIhEE(%"struct.std::__1::atomic.5"* nonnull align 1 dereferenceable(1))			declare void @_Z6fun_u8RNSt3__16atomicIhEE(%"struct.std::__1::atomic.5"* nonnull align 1 dereferenceable(1))

	; Function Attrs: mustprogress			; Function Attrs: mustprogress
	define signext i16 @_Z27atomic_load_relaxed_stk_i16v() {			define signext i16 @_Z27atomic_load_relaxed_stk_i16v() {
	; CHECK-LABEL: _Z27atomic_load_relaxed_stk_i16v:			; CHECK-LABEL: _Z27atomic_load_relaxed_stk_i16v:
	; CHECK: .LBB{{[0-9]+}}_2:			; CHECK: .LBB{{[0-9]+}}_2:
	; CHECK-NEXT: lea %s0, _Z7fun_i16RNSt3__16atomicIsEE@lo			; CHECK-NEXT: lea %s0, _Z7fun_i16RNSt3__16atomicIsEE@lo
	; CHECK-NEXT: and %s0, %s0, (32)0			; CHECK-NEXT: and %s0, %s0, (32)0
	; CHECK-NEXT: lea.sl %s12, _Z7fun_i16RNSt3__16atomicIsEE@hi(, %s0)			; CHECK-NEXT: lea.sl %s12, _Z7fun_i16RNSt3__16atomicIsEE@hi(, %s0)
	; CHECK-NEXT: lea %s0, 248(, %s11)			; CHECK-NEXT: lea %s0, 248(, %s11)
	; CHECK-NEXT: bsic %s10, (, %s12)			; CHECK-NEXT: bsic %s10, (, %s12)
	; CHECK-NEXT: ld2b.sx %s0, 248(, %s11)			; CHECK-NEXT: ld2b.sx %s0, 248(, %s11)
	; CHECK-NEXT: or %s11, 0, %s9			; CHECK-NEXT: or %s11, 0, %s9
	%1 = alloca %"struct.std::__1::atomic.10", align 2			%1 = alloca %"struct.std::__1::atomic.10", align 8
	%2 = bitcast %"struct.std::__1::atomic.10"* %1 to i8*			%2 = bitcast %"struct.std::__1::atomic.10"* %1 to i8*
	call void @llvm.lifetime.start.p0i8(i64 2, i8* nonnull %2)			call void @llvm.lifetime.start.p0i8(i64 2, i8* nonnull %2)
	call void @_Z7fun_i16RNSt3__16atomicIsEE(%"struct.std::__1::atomic.10"* nonnull align 2 dereferenceable(2) %1)			call void @_Z7fun_i16RNSt3__16atomicIsEE(%"struct.std::__1::atomic.10"* nonnull align 2 dereferenceable(2) %1)
	%3 = getelementptr inbounds %"struct.std::__1::atomic.10", %"struct.std::__1::atomic.10"* %1, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0			%3 = getelementptr inbounds %"struct.std::__1::atomic.10", %"struct.std::__1::atomic.10"* %1, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0
	%4 = load atomic i16, i16* %3 monotonic, align 2			%4 = load atomic i16, i16* %3 monotonic, align 2
	call void @llvm.lifetime.end.p0i8(i64 2, i8* nonnull %2)			call void @llvm.lifetime.end.p0i8(i64 2, i8* nonnull %2)
	ret i16 %4			ret i16 %4
	}			}

	declare void @_Z7fun_i16RNSt3__16atomicIsEE(%"struct.std::__1::atomic.10"* nonnull align 2 dereferenceable(2))			declare void @_Z7fun_i16RNSt3__16atomicIsEE(%"struct.std::__1::atomic.10"* nonnull align 2 dereferenceable(2))

	; Function Attrs: mustprogress			; Function Attrs: mustprogress
	define zeroext i16 @_Z27atomic_load_relaxed_stk_u16v() {			define zeroext i16 @_Z27atomic_load_relaxed_stk_u16v() {
	; CHECK-LABEL: _Z27atomic_load_relaxed_stk_u16v:			; CHECK-LABEL: _Z27atomic_load_relaxed_stk_u16v:
	; CHECK: .LBB{{[0-9]+}}_2:			; CHECK: .LBB{{[0-9]+}}_2:
	; CHECK-NEXT: lea %s0, _Z7fun_u16RNSt3__16atomicItEE@lo			; CHECK-NEXT: lea %s0, _Z7fun_u16RNSt3__16atomicItEE@lo
	; CHECK-NEXT: and %s0, %s0, (32)0			; CHECK-NEXT: and %s0, %s0, (32)0
	; CHECK-NEXT: lea.sl %s12, _Z7fun_u16RNSt3__16atomicItEE@hi(, %s0)			; CHECK-NEXT: lea.sl %s12, _Z7fun_u16RNSt3__16atomicItEE@hi(, %s0)
	; CHECK-NEXT: lea %s0, 248(, %s11)			; CHECK-NEXT: lea %s0, 248(, %s11)
	; CHECK-NEXT: bsic %s10, (, %s12)			; CHECK-NEXT: bsic %s10, (, %s12)
	; CHECK-NEXT: ld2b.zx %s0, 248(, %s11)			; CHECK-NEXT: ld2b.zx %s0, 248(, %s11)
	; CHECK-NEXT: or %s11, 0, %s9			; CHECK-NEXT: or %s11, 0, %s9
	%1 = alloca %"struct.std::__1::atomic.15", align 2			%1 = alloca %"struct.std::__1::atomic.15", align 8
	%2 = bitcast %"struct.std::__1::atomic.15"* %1 to i8*			%2 = bitcast %"struct.std::__1::atomic.15"* %1 to i8*
	call void @llvm.lifetime.start.p0i8(i64 2, i8* nonnull %2)			call void @llvm.lifetime.start.p0i8(i64 2, i8* nonnull %2)
	call void @_Z7fun_u16RNSt3__16atomicItEE(%"struct.std::__1::atomic.15"* nonnull align 2 dereferenceable(2) %1)			call void @_Z7fun_u16RNSt3__16atomicItEE(%"struct.std::__1::atomic.15"* nonnull align 2 dereferenceable(2) %1)
	%3 = getelementptr inbounds %"struct.std::__1::atomic.15", %"struct.std::__1::atomic.15"* %1, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0			%3 = getelementptr inbounds %"struct.std::__1::atomic.15", %"struct.std::__1::atomic.15"* %1, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0
	%4 = load atomic i16, i16* %3 monotonic, align 2			%4 = load atomic i16, i16* %3 monotonic, align 2
	call void @llvm.lifetime.end.p0i8(i64 2, i8* nonnull %2)			call void @llvm.lifetime.end.p0i8(i64 2, i8* nonnull %2)
	ret i16 %4			ret i16 %4
	}			}

	declare void @_Z7fun_u16RNSt3__16atomicItEE(%"struct.std::__1::atomic.15"* nonnull align 2 dereferenceable(2))			declare void @_Z7fun_u16RNSt3__16atomicItEE(%"struct.std::__1::atomic.15"* nonnull align 2 dereferenceable(2))

	; Function Attrs: mustprogress			; Function Attrs: mustprogress
	define signext i32 @_Z27atomic_load_relaxed_stk_i32v() {			define signext i32 @_Z27atomic_load_relaxed_stk_i32v() {
	; CHECK-LABEL: _Z27atomic_load_relaxed_stk_i32v:			; CHECK-LABEL: _Z27atomic_load_relaxed_stk_i32v:
	; CHECK: .LBB{{[0-9]+}}_2:			; CHECK: .LBB{{[0-9]+}}_2:
	; CHECK-NEXT: lea %s0, _Z7fun_i32RNSt3__16atomicIiEE@lo			; CHECK-NEXT: lea %s0, _Z7fun_i32RNSt3__16atomicIiEE@lo
	; CHECK-NEXT: and %s0, %s0, (32)0			; CHECK-NEXT: and %s0, %s0, (32)0
	; CHECK-NEXT: lea.sl %s12, _Z7fun_i32RNSt3__16atomicIiEE@hi(, %s0)			; CHECK-NEXT: lea.sl %s12, _Z7fun_i32RNSt3__16atomicIiEE@hi(, %s0)
	; CHECK-NEXT: lea %s0, 248(, %s11)			; CHECK-NEXT: lea %s0, 248(, %s11)
	; CHECK-NEXT: bsic %s10, (, %s12)			; CHECK-NEXT: bsic %s10, (, %s12)
	; CHECK-NEXT: ldl.sx %s0, 248(, %s11)			; CHECK-NEXT: ldl.sx %s0, 248(, %s11)
	; CHECK-NEXT: or %s11, 0, %s9			; CHECK-NEXT: or %s11, 0, %s9
	%1 = alloca %"struct.std::__1::atomic.20", align 4			%1 = alloca %"struct.std::__1::atomic.20", align 8
	%2 = bitcast %"struct.std::__1::atomic.20"* %1 to i8*			%2 = bitcast %"struct.std::__1::atomic.20"* %1 to i8*
	call void @llvm.lifetime.start.p0i8(i64 4, i8* nonnull %2)			call void @llvm.lifetime.start.p0i8(i64 4, i8* nonnull %2)
	call void @_Z7fun_i32RNSt3__16atomicIiEE(%"struct.std::__1::atomic.20"* nonnull align 4 dereferenceable(4) %1)			call void @_Z7fun_i32RNSt3__16atomicIiEE(%"struct.std::__1::atomic.20"* nonnull align 4 dereferenceable(4) %1)
	%3 = getelementptr inbounds %"struct.std::__1::atomic.20", %"struct.std::__1::atomic.20"* %1, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0			%3 = getelementptr inbounds %"struct.std::__1::atomic.20", %"struct.std::__1::atomic.20"* %1, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0
	%4 = load atomic i32, i32* %3 monotonic, align 4			%4 = load atomic i32, i32* %3 monotonic, align 4
	call void @llvm.lifetime.end.p0i8(i64 4, i8* nonnull %2)			call void @llvm.lifetime.end.p0i8(i64 4, i8* nonnull %2)
	ret i32 %4			ret i32 %4
	}			}

	declare void @_Z7fun_i32RNSt3__16atomicIiEE(%"struct.std::__1::atomic.20"* nonnull align 4 dereferenceable(4))			declare void @_Z7fun_i32RNSt3__16atomicIiEE(%"struct.std::__1::atomic.20"* nonnull align 4 dereferenceable(4))

	; Function Attrs: mustprogress			; Function Attrs: mustprogress
	define zeroext i32 @_Z27atomic_load_relaxed_stk_u32v() {			define zeroext i32 @_Z27atomic_load_relaxed_stk_u32v() {
	; CHECK-LABEL: _Z27atomic_load_relaxed_stk_u32v:			; CHECK-LABEL: _Z27atomic_load_relaxed_stk_u32v:
	; CHECK: .LBB{{[0-9]+}}_2:			; CHECK: .LBB{{[0-9]+}}_2:
	; CHECK-NEXT: lea %s0, _Z7fun_u32RNSt3__16atomicIjEE@lo			; CHECK-NEXT: lea %s0, _Z7fun_u32RNSt3__16atomicIjEE@lo
	; CHECK-NEXT: and %s0, %s0, (32)0			; CHECK-NEXT: and %s0, %s0, (32)0
	; CHECK-NEXT: lea.sl %s12, _Z7fun_u32RNSt3__16atomicIjEE@hi(, %s0)			; CHECK-NEXT: lea.sl %s12, _Z7fun_u32RNSt3__16atomicIjEE@hi(, %s0)
	; CHECK-NEXT: lea %s0, 248(, %s11)			; CHECK-NEXT: lea %s0, 248(, %s11)
	; CHECK-NEXT: bsic %s10, (, %s12)			; CHECK-NEXT: bsic %s10, (, %s12)
	; CHECK-NEXT: ldl.zx %s0, 248(, %s11)			; CHECK-NEXT: ldl.zx %s0, 248(, %s11)
	; CHECK-NEXT: or %s11, 0, %s9			; CHECK-NEXT: or %s11, 0, %s9
	%1 = alloca %"struct.std::__1::atomic.25", align 4			%1 = alloca %"struct.std::__1::atomic.25", align 8
	%2 = bitcast %"struct.std::__1::atomic.25"* %1 to i8*			%2 = bitcast %"struct.std::__1::atomic.25"* %1 to i8*
	call void @llvm.lifetime.start.p0i8(i64 4, i8* nonnull %2)			call void @llvm.lifetime.start.p0i8(i64 4, i8* nonnull %2)
	call void @_Z7fun_u32RNSt3__16atomicIjEE(%"struct.std::__1::atomic.25"* nonnull align 4 dereferenceable(4) %1)			call void @_Z7fun_u32RNSt3__16atomicIjEE(%"struct.std::__1::atomic.25"* nonnull align 4 dereferenceable(4) %1)
	%3 = getelementptr inbounds %"struct.std::__1::atomic.25", %"struct.std::__1::atomic.25"* %1, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0			%3 = getelementptr inbounds %"struct.std::__1::atomic.25", %"struct.std::__1::atomic.25"* %1, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0
	%4 = load atomic i32, i32* %3 monotonic, align 4			%4 = load atomic i32, i32* %3 monotonic, align 4
	call void @llvm.lifetime.end.p0i8(i64 4, i8* nonnull %2)			call void @llvm.lifetime.end.p0i8(i64 4, i8* nonnull %2)
	ret i32 %4			ret i32 %4
	}			}
	▲ Show 20 Lines • Show All 300 Lines • Show Last 20 Lines

llvm/test/CodeGen/VE/Scalar/atomic_swap.ll

	Show First 20 Lines • Show All 762 Lines • ▼ Show 20 Lines
	; CHECK: .LBB{{[0-9]+}}_2:			; CHECK: .LBB{{[0-9]+}}_2:
	; CHECK-NEXT: and %s0, %s0, (32)0			; CHECK-NEXT: and %s0, %s0, (32)0
	; CHECK-NEXT: or %s1, 1, (0)1			; CHECK-NEXT: or %s1, 1, (0)1
	; CHECK-NEXT: lea %s2, 8(, %s11)			; CHECK-NEXT: lea %s2, 8(, %s11)
	; CHECK-NEXT: ts1am.w %s0, (%s2), %s1			; CHECK-NEXT: ts1am.w %s0, (%s2), %s1
	; CHECK-NEXT: and %s0, 1, %s0			; CHECK-NEXT: and %s0, 1, %s0
	; CHECK-NEXT: adds.l %s11, 16, %s11			; CHECK-NEXT: adds.l %s11, 16, %s11
	; CHECK-NEXT: b.l.t (, %s10)			; CHECK-NEXT: b.l.t (, %s10)
	%2 = alloca %"struct.std::__1::atomic", align 1			%2 = alloca %"struct.std::__1::atomic", align 8
	%3 = getelementptr inbounds %"struct.std::__1::atomic", %"struct.std::__1::atomic"* %2, i64 0, i32 0, i32 0, i32 0, i32 0			%3 = getelementptr inbounds %"struct.std::__1::atomic", %"struct.std::__1::atomic"* %2, i64 0, i32 0, i32 0, i32 0, i32 0
	call void @llvm.lifetime.start.p0i8(i64 1, i8* nonnull %3)			call void @llvm.lifetime.start.p0i8(i64 1, i8* nonnull %3)
	%4 = zext i1 %0 to i8			%4 = zext i1 %0 to i8
	%5 = atomicrmw volatile xchg i8* %3, i8 %4 monotonic			%5 = atomicrmw volatile xchg i8* %3, i8 %4 monotonic
	%6 = and i8 %5, 1			%6 = and i8 %5, 1
	%7 = icmp ne i8 %6, 0			%7 = icmp ne i8 %6, 0
	call void @llvm.lifetime.end.p0i8(i64 1, i8* nonnull %3)			call void @llvm.lifetime.end.p0i8(i64 1, i8* nonnull %3)
	ret i1 %7			ret i1 %7
	Show All 12 Lines
	; CHECK-NEXT: and %s0, %s0, (32)0			; CHECK-NEXT: and %s0, %s0, (32)0
	; CHECK-NEXT: or %s1, 1, (0)1			; CHECK-NEXT: or %s1, 1, (0)1
	; CHECK-NEXT: lea %s2, 8(, %s11)			; CHECK-NEXT: lea %s2, 8(, %s11)
	; CHECK-NEXT: ts1am.w %s0, (%s2), %s1			; CHECK-NEXT: ts1am.w %s0, (%s2), %s1
	; CHECK-NEXT: sll %s0, %s0, 56			; CHECK-NEXT: sll %s0, %s0, 56
	; CHECK-NEXT: sra.l %s0, %s0, 56			; CHECK-NEXT: sra.l %s0, %s0, 56
	; CHECK-NEXT: adds.l %s11, 16, %s11			; CHECK-NEXT: adds.l %s11, 16, %s11
	; CHECK-NEXT: b.l.t (, %s10)			; CHECK-NEXT: b.l.t (, %s10)
	%2 = alloca %"struct.std::__1::atomic.0", align 1			%2 = alloca %"struct.std::__1::atomic.0", align 8
	%3 = getelementptr inbounds %"struct.std::__1::atomic.0", %"struct.std::__1::atomic.0"* %2, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0			%3 = getelementptr inbounds %"struct.std::__1::atomic.0", %"struct.std::__1::atomic.0"* %2, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0
	call void @llvm.lifetime.start.p0i8(i64 1, i8* nonnull %3)			call void @llvm.lifetime.start.p0i8(i64 1, i8* nonnull %3)
	%4 = atomicrmw volatile xchg i8* %3, i8 %0 monotonic			%4 = atomicrmw volatile xchg i8* %3, i8 %0 monotonic
	call void @llvm.lifetime.end.p0i8(i64 1, i8* nonnull %3)			call void @llvm.lifetime.end.p0i8(i64 1, i8* nonnull %3)
	ret i8 %4			ret i8 %4
	}			}

	; Function Attrs: nofree nounwind mustprogress			; Function Attrs: nofree nounwind mustprogress
	define zeroext i8 @_Z26atomic_swap_relaxed_stk_u8h(i8 zeroext %0) {			define zeroext i8 @_Z26atomic_swap_relaxed_stk_u8h(i8 zeroext %0) {
	; CHECK-LABEL: _Z26atomic_swap_relaxed_stk_u8h:			; CHECK-LABEL: _Z26atomic_swap_relaxed_stk_u8h:
	; CHECK: .LBB{{[0-9]+}}_2:			; CHECK: .LBB{{[0-9]+}}_2:
	; CHECK-NEXT: and %s0, %s0, (32)0			; CHECK-NEXT: and %s0, %s0, (32)0
	; CHECK-NEXT: or %s1, 1, (0)1			; CHECK-NEXT: or %s1, 1, (0)1
	; CHECK-NEXT: lea %s2, 8(, %s11)			; CHECK-NEXT: lea %s2, 8(, %s11)
	; CHECK-NEXT: ts1am.w %s0, (%s2), %s1			; CHECK-NEXT: ts1am.w %s0, (%s2), %s1
	; CHECK-NEXT: and %s0, %s0, (56)0			; CHECK-NEXT: and %s0, %s0, (56)0
	; CHECK-NEXT: adds.l %s11, 16, %s11			; CHECK-NEXT: adds.l %s11, 16, %s11
	; CHECK-NEXT: b.l.t (, %s10)			; CHECK-NEXT: b.l.t (, %s10)
	%2 = alloca %"struct.std::__1::atomic.5", align 1			%2 = alloca %"struct.std::__1::atomic.5", align 8
	%3 = getelementptr inbounds %"struct.std::__1::atomic.5", %"struct.std::__1::atomic.5"* %2, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0			%3 = getelementptr inbounds %"struct.std::__1::atomic.5", %"struct.std::__1::atomic.5"* %2, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0
	call void @llvm.lifetime.start.p0i8(i64 1, i8* nonnull %3)			call void @llvm.lifetime.start.p0i8(i64 1, i8* nonnull %3)
	%4 = atomicrmw volatile xchg i8* %3, i8 %0 monotonic			%4 = atomicrmw volatile xchg i8* %3, i8 %0 monotonic
	call void @llvm.lifetime.end.p0i8(i64 1, i8* nonnull %3)			call void @llvm.lifetime.end.p0i8(i64 1, i8* nonnull %3)
	ret i8 %4			ret i8 %4
	}			}

	; Function Attrs: nofree nounwind mustprogress			; Function Attrs: nofree nounwind mustprogress
	define signext i16 @_Z27atomic_swap_relaxed_stk_i16s(i16 signext %0) {			define signext i16 @_Z27atomic_swap_relaxed_stk_i16s(i16 signext %0) {
	; CHECK-LABEL: _Z27atomic_swap_relaxed_stk_i16s:			; CHECK-LABEL: _Z27atomic_swap_relaxed_stk_i16s:
	; CHECK: .LBB{{[0-9]+}}_2:			; CHECK: .LBB{{[0-9]+}}_2:
	; CHECK-NEXT: and %s0, %s0, (32)0			; CHECK-NEXT: and %s0, %s0, (32)0
	; CHECK-NEXT: or %s1, 3, (0)1			; CHECK-NEXT: or %s1, 3, (0)1
	; CHECK-NEXT: lea %s2, 8(, %s11)			; CHECK-NEXT: lea %s2, 8(, %s11)
	; CHECK-NEXT: ts1am.w %s0, (%s2), %s1			; CHECK-NEXT: ts1am.w %s0, (%s2), %s1
	; CHECK-NEXT: sll %s0, %s0, 48			; CHECK-NEXT: sll %s0, %s0, 48
	; CHECK-NEXT: sra.l %s0, %s0, 48			; CHECK-NEXT: sra.l %s0, %s0, 48
	; CHECK-NEXT: adds.l %s11, 16, %s11			; CHECK-NEXT: adds.l %s11, 16, %s11
	; CHECK-NEXT: b.l.t (, %s10)			; CHECK-NEXT: b.l.t (, %s10)
	%2 = alloca %"struct.std::__1::atomic.10", align 2			%2 = alloca %"struct.std::__1::atomic.10", align 8
	%3 = bitcast %"struct.std::__1::atomic.10"* %2 to i8*			%3 = bitcast %"struct.std::__1::atomic.10"* %2 to i8*
	call void @llvm.lifetime.start.p0i8(i64 2, i8* nonnull %3)			call void @llvm.lifetime.start.p0i8(i64 2, i8* nonnull %3)
	%4 = getelementptr inbounds %"struct.std::__1::atomic.10", %"struct.std::__1::atomic.10"* %2, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0			%4 = getelementptr inbounds %"struct.std::__1::atomic.10", %"struct.std::__1::atomic.10"* %2, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0
	%5 = atomicrmw volatile xchg i16* %4, i16 %0 monotonic			%5 = atomicrmw volatile xchg i16* %4, i16 %0 monotonic
	call void @llvm.lifetime.end.p0i8(i64 2, i8* nonnull %3)			call void @llvm.lifetime.end.p0i8(i64 2, i8* nonnull %3)
	ret i16 %5			ret i16 %5
	}			}

	; Function Attrs: nofree nounwind mustprogress			; Function Attrs: nofree nounwind mustprogress
	define zeroext i16 @_Z27atomic_swap_relaxed_stk_u16t(i16 zeroext %0) {			define zeroext i16 @_Z27atomic_swap_relaxed_stk_u16t(i16 zeroext %0) {
	; CHECK-LABEL: _Z27atomic_swap_relaxed_stk_u16t:			; CHECK-LABEL: _Z27atomic_swap_relaxed_stk_u16t:
	; CHECK: .LBB{{[0-9]+}}_2:			; CHECK: .LBB{{[0-9]+}}_2:
	; CHECK-NEXT: and %s0, %s0, (32)0			; CHECK-NEXT: and %s0, %s0, (32)0
	; CHECK-NEXT: or %s1, 3, (0)1			; CHECK-NEXT: or %s1, 3, (0)1
	; CHECK-NEXT: lea %s2, 8(, %s11)			; CHECK-NEXT: lea %s2, 8(, %s11)
	; CHECK-NEXT: ts1am.w %s0, (%s2), %s1			; CHECK-NEXT: ts1am.w %s0, (%s2), %s1
	; CHECK-NEXT: and %s0, %s0, (48)0			; CHECK-NEXT: and %s0, %s0, (48)0
	; CHECK-NEXT: adds.l %s11, 16, %s11			; CHECK-NEXT: adds.l %s11, 16, %s11
	; CHECK-NEXT: b.l.t (, %s10)			; CHECK-NEXT: b.l.t (, %s10)
	%2 = alloca %"struct.std::__1::atomic.15", align 2			%2 = alloca %"struct.std::__1::atomic.15", align 8
	%3 = bitcast %"struct.std::__1::atomic.15"* %2 to i8*			%3 = bitcast %"struct.std::__1::atomic.15"* %2 to i8*
	call void @llvm.lifetime.start.p0i8(i64 2, i8* nonnull %3)			call void @llvm.lifetime.start.p0i8(i64 2, i8* nonnull %3)
	%4 = getelementptr inbounds %"struct.std::__1::atomic.15", %"struct.std::__1::atomic.15"* %2, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0			%4 = getelementptr inbounds %"struct.std::__1::atomic.15", %"struct.std::__1::atomic.15"* %2, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0
	%5 = atomicrmw volatile xchg i16* %4, i16 %0 monotonic			%5 = atomicrmw volatile xchg i16* %4, i16 %0 monotonic
	call void @llvm.lifetime.end.p0i8(i64 2, i8* nonnull %3)			call void @llvm.lifetime.end.p0i8(i64 2, i8* nonnull %3)
	ret i16 %5			ret i16 %5
	}			}

	; Function Attrs: nofree nounwind mustprogress			; Function Attrs: nofree nounwind mustprogress
	define signext i32 @_Z27atomic_swap_relaxed_stk_i32i(i32 signext %0) {			define signext i32 @_Z27atomic_swap_relaxed_stk_i32i(i32 signext %0) {
	; CHECK-LABEL: _Z27atomic_swap_relaxed_stk_i32i:			; CHECK-LABEL: _Z27atomic_swap_relaxed_stk_i32i:
	; CHECK: .LBB{{[0-9]+}}_2:			; CHECK: .LBB{{[0-9]+}}_2:
	; CHECK-NEXT: ts1am.w %s0, 8(%s11), 15			; CHECK-NEXT: ts1am.w %s0, 8(%s11), 15
	; CHECK-NEXT: adds.w.sx %s0, %s0, (0)1			; CHECK-NEXT: adds.w.sx %s0, %s0, (0)1
	; CHECK-NEXT: adds.l %s11, 16, %s11			; CHECK-NEXT: adds.l %s11, 16, %s11
	; CHECK-NEXT: b.l.t (, %s10)			; CHECK-NEXT: b.l.t (, %s10)
	%2 = alloca %"struct.std::__1::atomic.20", align 4			%2 = alloca %"struct.std::__1::atomic.20", align 8
	%3 = bitcast %"struct.std::__1::atomic.20"* %2 to i8*			%3 = bitcast %"struct.std::__1::atomic.20"* %2 to i8*
	call void @llvm.lifetime.start.p0i8(i64 4, i8* nonnull %3)			call void @llvm.lifetime.start.p0i8(i64 4, i8* nonnull %3)
	%4 = getelementptr inbounds %"struct.std::__1::atomic.20", %"struct.std::__1::atomic.20"* %2, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0			%4 = getelementptr inbounds %"struct.std::__1::atomic.20", %"struct.std::__1::atomic.20"* %2, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0
	%5 = atomicrmw volatile xchg i32* %4, i32 %0 monotonic			%5 = atomicrmw volatile xchg i32* %4, i32 %0 monotonic
	call void @llvm.lifetime.end.p0i8(i64 4, i8* nonnull %3)			call void @llvm.lifetime.end.p0i8(i64 4, i8* nonnull %3)
	ret i32 %5			ret i32 %5
	}			}

	; Function Attrs: nofree nounwind mustprogress			; Function Attrs: nofree nounwind mustprogress
	define zeroext i32 @_Z27atomic_swap_relaxed_stk_u32j(i32 zeroext %0) {			define zeroext i32 @_Z27atomic_swap_relaxed_stk_u32j(i32 zeroext %0) {
	; CHECK-LABEL: _Z27atomic_swap_relaxed_stk_u32j:			; CHECK-LABEL: _Z27atomic_swap_relaxed_stk_u32j:
	; CHECK: .LBB{{[0-9]+}}_2:			; CHECK: .LBB{{[0-9]+}}_2:
	; CHECK-NEXT: ts1am.w %s0, 8(%s11), 15			; CHECK-NEXT: ts1am.w %s0, 8(%s11), 15
	; CHECK-NEXT: adds.w.zx %s0, %s0, (0)1			; CHECK-NEXT: adds.w.zx %s0, %s0, (0)1
	; CHECK-NEXT: adds.l %s11, 16, %s11			; CHECK-NEXT: adds.l %s11, 16, %s11
	; CHECK-NEXT: b.l.t (, %s10)			; CHECK-NEXT: b.l.t (, %s10)
	%2 = alloca %"struct.std::__1::atomic.25", align 4			%2 = alloca %"struct.std::__1::atomic.25", align 8
	%3 = bitcast %"struct.std::__1::atomic.25"* %2 to i8*			%3 = bitcast %"struct.std::__1::atomic.25"* %2 to i8*
	call void @llvm.lifetime.start.p0i8(i64 4, i8* nonnull %3)			call void @llvm.lifetime.start.p0i8(i64 4, i8* nonnull %3)
	%4 = getelementptr inbounds %"struct.std::__1::atomic.25", %"struct.std::__1::atomic.25"* %2, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0			%4 = getelementptr inbounds %"struct.std::__1::atomic.25", %"struct.std::__1::atomic.25"* %2, i64 0, i32 0, i32 0, i32 0, i32 0, i32 0
	%5 = atomicrmw volatile xchg i32* %4, i32 %0 monotonic			%5 = atomicrmw volatile xchg i32* %4, i32 %0 monotonic
	call void @llvm.lifetime.end.p0i8(i64 4, i8* nonnull %3)			call void @llvm.lifetime.end.p0i8(i64 4, i8* nonnull %3)
	ret i32 %5			ret i32 %5
	}			}

	▲ Show 20 Lines • Show All 347 Lines • Show Last 20 Lines

llvm/test/CodeGen/WebAssembly/PR40172.ll

	Show All 9 Lines

	; CHECK: i32.sub $[[BASE:[0-9]+]]=,			; CHECK: i32.sub $[[BASE:[0-9]+]]=,
	; CHECK: local.copy $[[ARG:[0-9]+]]=, $0{{$}}			; CHECK: local.copy $[[ARG:[0-9]+]]=, $0{{$}}
	; CHECK: i32.const $[[A0:[0-9]+]]=, 1{{$}}			; CHECK: i32.const $[[A0:[0-9]+]]=, 1{{$}}
	; CHECK: i32.and $[[A1:[0-9]+]]=, $[[ARG]], $[[A0]]{{$}}			; CHECK: i32.and $[[A1:[0-9]+]]=, $[[ARG]], $[[A0]]{{$}}
	; CHECK: i32.store8 8($[[BASE]]), $[[A1]]{{$}}			; CHECK: i32.store8 8($[[BASE]]), $[[A1]]{{$}}

	define void @test(i8 %byte) {			define void @test(i8 %byte) {
	%t = alloca { i8, i8 }, align 1			%t = alloca { i8, i8 }, align 8
	%x4 = and i8 %byte, 1			%x4 = and i8 %byte, 1
	%x5 = icmp eq i8 %x4, 1			%x5 = icmp eq i8 %x4, 1
	%x6 = and i8 %byte, 2			%x6 = and i8 %byte, 2
	%x7 = icmp eq i8 %x6, 2			%x7 = icmp eq i8 %x6, 2
	%x8 = bitcast { i8, i8 }* %t to i8*			%x8 = bitcast { i8, i8 }* %t to i8*
	%x9 = zext i1 %x5 to i8			%x9 = zext i1 %x5 to i8
	store i8 %x9, i8* %x8, align 1			store i8 %x9, i8* %x8, align 1
	%x10 = getelementptr inbounds { i8, i8 }, { i8, i8 }* %t, i32 0, i32 1			%x10 = getelementptr inbounds { i8, i8 }, { i8, i8 }* %t, i32 0, i32 1
	%x11 = zext i1 %x7 to i8			%x11 = zext i1 %x7 to i8
	store i8 %x11, i8* %x10, align 1			store i8 %x11, i8* %x10, align 1
	ret void			ret void
	}			}

llvm/test/CodeGen/X86/dbg-changes-codegen-branch-folding.ll

	Show First 20 Lines • Show All 44 Lines • ▼ Show 20 Lines

	@.str = private unnamed_addr constant [1 x i8] zeroinitializer, align 1			@.str = private unnamed_addr constant [1 x i8] zeroinitializer, align 1
	@.str.1 = private unnamed_addr constant [2 x i8] c"+\00", align 1			@.str.1 = private unnamed_addr constant [2 x i8] c"+\00", align 1
	@.str.2 = private unnamed_addr constant [2 x i8] c"-\00", align 1			@.str.2 = private unnamed_addr constant [2 x i8] c"-\00", align 1

	; Function Attrs: uwtable			; Function Attrs: uwtable
	define void @_Z3barii(i32 %param1, i32 %param2) #0 !dbg !24 {			define void @_Z3barii(i32 %param1, i32 %param2) #0 !dbg !24 {
	entry:			entry:
	%var1 = alloca %struct.AAA3, align 1			%var1 = alloca %struct.AAA3, align 8
	%var2 = alloca %struct.AAA3, align 1			%var2 = alloca %struct.AAA3, align 8
	tail call void @llvm.dbg.value(metadata i32 %param1, i64 0, metadata !29, metadata !46), !dbg !47			tail call void @llvm.dbg.value(metadata i32 %param1, i64 0, metadata !29, metadata !46), !dbg !47
	tail call void @llvm.dbg.value(metadata i32 %param2, i64 0, metadata !30, metadata !46), !dbg !48			tail call void @llvm.dbg.value(metadata i32 %param2, i64 0, metadata !30, metadata !46), !dbg !48
	tail call void @llvm.dbg.value(metadata ptr null, i64 0, metadata !31, metadata !46), !dbg !49			tail call void @llvm.dbg.value(metadata ptr null, i64 0, metadata !31, metadata !46), !dbg !49
	%tobool = icmp eq i32 %param2, 0, !dbg !50			%tobool = icmp eq i32 %param2, 0, !dbg !50
	br i1 %tobool, label %if.end, label %if.then, !dbg !52			br i1 %tobool, label %if.end, label %if.then, !dbg !52

	if.then: ; preds = %entry			if.then: ; preds = %entry
	%call = tail call ptr @_Z5i2stri(i32 %param2), !dbg !53			%call = tail call ptr @_Z5i2stri(i32 %param2), !dbg !53
	▲ Show 20 Lines • Show All 149 Lines • Show Last 20 Lines

llvm/test/CodeGen/X86/fast-isel-call.ll

	Show First 20 Lines • Show All 53 Lines • ▼ Show 20 Lines
	; CHECK: movl $100, 8(%esp)			; CHECK: movl $100, 8(%esp)
	; CHECK: calll {{.*}}memcpy			; CHECK: calll {{.*}}memcpy
	}			}

	; STDERR-NOT: FastISel missed call: call x86_thiscallcc void @thiscallfun			; STDERR-NOT: FastISel missed call: call x86_thiscallcc void @thiscallfun
	%struct.S = type { i8 }			%struct.S = type { i8 }
	define void @test5() {			define void @test5() {
	entry:			entry:
	%s = alloca %struct.S, align 1			%s = alloca %struct.S, align 8
	; CHECK-LABEL: test5:			; CHECK-LABEL: test5:
	; CHECK: subl $12, %esp			; CHECK: subl $12, %esp
	; CHECK: leal 8(%esp), %ecx			; CHECK: leal 8(%esp), %ecx
	; CHECK: movl $43, (%esp)			; CHECK: movl $43, (%esp)
	; CHECK: calll {{.*}}thiscallfun			; CHECK: calll {{.*}}thiscallfun
	; CHECK: addl $8, %esp			; CHECK: addl $8, %esp
	call x86_thiscallcc void @thiscallfun(ptr %s, i32 43)			call x86_thiscallcc void @thiscallfun(ptr %s, i32 43)
	ret void			ret void
	Show All 15 Lines

llvm/test/CodeGen/X86/load-local-v3i129.ll

	Show All 23 Lines
	; SLOW-SHLD-NEXT: movq -40(%rsp), %rax			; SLOW-SHLD-NEXT: movq -40(%rsp), %rax
	; SLOW-SHLD-NEXT: andq $-4, %rax			; SLOW-SHLD-NEXT: andq $-4, %rax
	; SLOW-SHLD-NEXT: orq $1, %rax			; SLOW-SHLD-NEXT: orq $1, %rax
	; SLOW-SHLD-NEXT: movq %rax, -40(%rsp)			; SLOW-SHLD-NEXT: movq %rax, -40(%rsp)
	; SLOW-SHLD-NEXT: orq $-2, -56(%rsp)			; SLOW-SHLD-NEXT: orq $-2, -56(%rsp)
	; SLOW-SHLD-NEXT: movq $-1, -48(%rsp)			; SLOW-SHLD-NEXT: movq $-1, -48(%rsp)
	; SLOW-SHLD-NEXT: retq			; SLOW-SHLD-NEXT: retq
	Entry:			Entry:
	%y = alloca <3 x i129>, align 4			%y = alloca <3 x i129>, align 16
	%L = load <3 x i129>, ptr %y			%L = load <3 x i129>, ptr %y
	%I1 = insertelement <3 x i129> %L, i129 340282366920938463463374607431768211455, i32 1			%I1 = insertelement <3 x i129> %L, i129 340282366920938463463374607431768211455, i32 1
	store <3 x i129> %I1, ptr %y			store <3 x i129> %I1, ptr %y
	ret void			ret void
	}			}

llvm/test/CodeGen/X86/pr44140.ll

	Show First 20 Lines • Show All 53 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: addq $584, %rsp # imm = 0x248			; CHECK-NEXT: addq $584, %rsp # imm = 0x248
	; CHECK-NEXT: .cfi_def_cfa_offset 8			; CHECK-NEXT: .cfi_def_cfa_offset 8
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	start:			start:
	%dummy0 = alloca [22 x i64], align 8			%dummy0 = alloca [22 x i64], align 8
	%dummy1 = alloca [22 x i64], align 8			%dummy1 = alloca [22 x i64], align 8
	%dummy2 = alloca [22 x i64], align 8			%dummy2 = alloca [22 x i64], align 8

	%data = alloca <2 x i64>, align 8			%data = alloca <2 x i64>, align 16

	br label %fake-loop			br label %fake-loop

	fake-loop: ; preds = %fake-loop, %start			fake-loop: ; preds = %fake-loop, %start
	%dummy0.cast = bitcast [22 x i64]* %dummy0 to i8*			%dummy0.cast = bitcast [22 x i64]* %dummy0 to i8*
	%dummy1.cast = bitcast [22 x i64]* %dummy1 to i8*			%dummy1.cast = bitcast [22 x i64]* %dummy1 to i8*
	call void @llvm.memcpy.p0i8.p0i8.i64(i8* nonnull align 8 %dummy1.cast, i8* nonnull align 8 %dummy0.cast, i64 176, i1 false)			call void @llvm.memcpy.p0i8.p0i8.i64(i8* nonnull align 8 %dummy1.cast, i8* nonnull align 8 %dummy0.cast, i64 176, i1 false)

	Show All 27 Lines

llvm/test/CodeGen/X86/ssp-data-layout.ll

	Show First 20 Lines • Show All 87 Lines • ▼ Show 20 Lines

	; CHECK: call{{l\|q}} get_struct_small_nonchar			; CHECK: call{{l\|q}} get_struct_small_nonchar
	; CHECK: movw %ax, -128(			; CHECK: movw %ax, -128(
	; CHECK: call{{l\|q}} end_struct_small_nonchar			; CHECK: call{{l\|q}} end_struct_small_nonchar
	%x = alloca i32, align 4			%x = alloca i32, align 4
	%y = alloca i32, align 4			%y = alloca i32, align 4
	%z = alloca i32, align 4			%z = alloca i32, align 4
	%ptr = alloca i32, align 4			%ptr = alloca i32, align 4
	%small2 = alloca [2 x i16], align 2			%small2 = alloca [2 x i16], align 4
	%large2 = alloca [8 x i32], align 16			%large2 = alloca [8 x i32], align 16
	%small = alloca [2 x i8], align 1			%small = alloca [2 x i8], align 2
	%large = alloca [8 x i8], align 1			%large = alloca [8 x i8], align 8
	%a = alloca %struct.struct_large_char, align 1			%a = alloca %struct.struct_large_char, align 8
	%b = alloca %struct.struct_small_char, align 1			%b = alloca %struct.struct_small_char, align 8
	%c = alloca %struct.struct_large_nonchar, align 8			%c = alloca %struct.struct_large_nonchar, align 8
	%d = alloca %struct.struct_small_nonchar, align 2			%d = alloca %struct.struct_small_nonchar, align 8
	%call = call i32 @get_scalar1()			%call = call i32 @get_scalar1()
	store i32 %call, ptr %x, align 4			store i32 %call, ptr %x, align 4
	call void @end_scalar1()			call void @end_scalar1()
	%call1 = call i32 @get_scalar2()			%call1 = call i32 @get_scalar2()
	store i32 %call1, ptr %y, align 4			store i32 %call1, ptr %y, align 4
	call void @end_scalar2()			call void @end_scalar2()
	%call2 = call i32 @get_scalar3()			%call2 = call i32 @get_scalar3()
	store i32 %call2, ptr %z, align 4			store i32 %call2, ptr %z, align 4
	▲ Show 20 Lines • Show All 100 Lines • ▼ Show 20 Lines
	; CHECK: movw %ax, -112(			; CHECK: movw %ax, -112(
	; CHECK: call{{l\|q}} end_struct_small_nonchar			; CHECK: call{{l\|q}} end_struct_small_nonchar
	%x = alloca i32, align 4			%x = alloca i32, align 4
	%y = alloca i32, align 4			%y = alloca i32, align 4
	%z = alloca i32, align 4			%z = alloca i32, align 4
	%ptr = alloca i32, align 4			%ptr = alloca i32, align 4
	%small2 = alloca [2 x i16], align 2			%small2 = alloca [2 x i16], align 2
	%large2 = alloca [8 x i32], align 16			%large2 = alloca [8 x i32], align 16
	%small = alloca [2 x i8], align 1			%small = alloca [2 x i8], align 2
	%large = alloca [8 x i8], align 1			%large = alloca [8 x i8], align 8
	%a = alloca %struct.struct_large_char, align 1			%a = alloca %struct.struct_large_char, align 8
	%b = alloca %struct.struct_small_char, align 1			%b = alloca %struct.struct_small_char, align 8
	%c = alloca %struct.struct_large_nonchar, align 8			%c = alloca %struct.struct_large_nonchar, align 8
	%d = alloca %struct.struct_small_nonchar, align 2			%d = alloca %struct.struct_small_nonchar, align 8
	%call = call i32 @get_scalar1()			%call = call i32 @get_scalar1()
	store i32 %call, ptr %x, align 4			store i32 %call, ptr %x, align 4
	call void @end_scalar1()			call void @end_scalar1()
	%call1 = call i32 @get_scalar2()			%call1 = call i32 @get_scalar2()
	store i32 %call1, ptr %y, align 4			store i32 %call1, ptr %y, align 4
	call void @end_scalar2()			call void @end_scalar2()
	%call2 = call i32 @get_scalar3()			%call2 = call i32 @get_scalar3()
	store i32 %call2, ptr %z, align 4			store i32 %call2, ptr %z, align 4
	▲ Show 20 Lines • Show All 86 Lines • ▼ Show 20 Lines

	; CHECK: call{{l\|q}} get_struct_small_nonchar			; CHECK: call{{l\|q}} get_struct_small_nonchar
	; CHECK: movw %ax, -112(			; CHECK: movw %ax, -112(
	; CHECK: call{{l\|q}} end_struct_small_nonchar			; CHECK: call{{l\|q}} end_struct_small_nonchar
	%x = alloca i32, align 4			%x = alloca i32, align 4
	%y = alloca i32, align 4			%y = alloca i32, align 4
	%z = alloca i32, align 4			%z = alloca i32, align 4
	%ptr = alloca i32, align 4			%ptr = alloca i32, align 4
	%small2 = alloca [2 x i16], align 2			%small2 = alloca [2 x i16], align 4
	%large2 = alloca [8 x i32], align 16			%large2 = alloca [8 x i32], align 16
	%small = alloca [2 x i8], align 1			%small = alloca [2 x i8], align 2
	%large = alloca [8 x i8], align 1			%large = alloca [8 x i8], align 8
	%a = alloca %struct.struct_large_char, align 1			%a = alloca %struct.struct_large_char, align 8
	%b = alloca %struct.struct_small_char, align 1			%b = alloca %struct.struct_small_char, align 8
	%c = alloca %struct.struct_large_nonchar, align 8			%c = alloca %struct.struct_large_nonchar, align 8
	%d = alloca %struct.struct_small_nonchar, align 2			%d = alloca %struct.struct_small_nonchar, align 8
	%call = call i32 @get_scalar1()			%call = call i32 @get_scalar1()
	store i32 %call, ptr %x, align 4			store i32 %call, ptr %x, align 4
	call void @end_scalar1()			call void @end_scalar1()
	%call1 = call i32 @get_scalar2()			%call1 = call i32 @get_scalar2()
	store i32 %call1, ptr %y, align 4			store i32 %call1, ptr %y, align 4
	call void @end_scalar2()			call void @end_scalar2()
	%call2 = call i32 @get_scalar3()			%call2 = call i32 @get_scalar3()
	store i32 %call2, ptr %z, align 4			store i32 %call2, ptr %z, align 4
	▲ Show 20 Lines • Show All 99 Lines • Show Last 20 Lines

llvm/test/CodeGen/X86/win-cleanuppad.ll

	Show First 20 Lines • Show All 52 Lines • ▼ Show 20 Lines

	declare i32 @__CxxFrameHandler3(...)			declare i32 @__CxxFrameHandler3(...)

	; Function Attrs: nounwind			; Function Attrs: nounwind
	declare x86_thiscallcc void @"\01??1Dtor@@QAE@XZ"(ptr) #1			declare x86_thiscallcc void @"\01??1Dtor@@QAE@XZ"(ptr) #1

	define void @nested_cleanup() #0 personality ptr @__CxxFrameHandler3 {			define void @nested_cleanup() #0 personality ptr @__CxxFrameHandler3 {
	entry:			entry:
	%o1 = alloca %struct.Dtor, align 1			%o1 = alloca %struct.Dtor, align 8
	%o2 = alloca %struct.Dtor, align 1			%o2 = alloca %struct.Dtor, align 8
	invoke void @f(i32 1)			invoke void @f(i32 1)
	to label %invoke.cont unwind label %cleanup.outer			to label %invoke.cont unwind label %cleanup.outer

	invoke.cont: ; preds = %entry			invoke.cont: ; preds = %entry
	invoke void @f(i32 2)			invoke void @f(i32 2)
	to label %invoke.cont.1 unwind label %cleanup.inner			to label %invoke.cont.1 unwind label %cleanup.inner

	invoke.cont.1: ; preds = %invoke.cont			invoke.cont.1: ; preds = %invoke.cont
	▲ Show 20 Lines • Show All 129 Lines • Show Last 20 Lines

llvm/test/CodeGen/X86/x86-mixed-alignment-dagcombine.ll

	; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
	; RUN: llc -mtriple=x86_64-apple-macosx10.9.0 -mcpu=core2 -mattr=+64bit,+sse2 < %s \| FileCheck %s			; RUN: llc -mtriple=x86_64-apple-macosx10.9.0 -mcpu=core2 -mattr=+64bit,+sse2 < %s \| FileCheck %s

	; DAGCombine may choose to rewrite 2 loads feeding a select as a select of			; DAGCombine may choose to rewrite 2 loads feeding a select as a select of
	; addresses feeding a load. This test ensures that when it does that it creates			; addresses feeding a load. This test ensures that when it does that it creates
	; a load with alignment equivalent to the most restrictive source load.			; a load with alignment equivalent to the most restrictive source load.

	declare void @sink(<2 x double>)			declare void @sink(<2 x double>)

	define void @test1(i1 %cmp) align 2 {			define void @test1(i1 %cmp) align 2 {
	; CHECK-LABEL: test1:			; CHECK-LABEL: test1:
	; CHECK: ## %bb.0:			; CHECK: ## %bb.0:
	; CHECK-NEXT: subq $40, %rsp			; CHECK-NEXT: subq $40, %rsp
	; CHECK-NEXT: .cfi_def_cfa_offset 48			; CHECK-NEXT: .cfi_def_cfa_offset 48
	; CHECK-NEXT: testb $1, %dil			; CHECK-NEXT: testb $1, %dil
	; CHECK-NEXT: leaq {{[0-9]+}}(%rsp), %rax			; CHECK-NEXT: movq %rsp, %rax
	; CHECK-NEXT: movq %rsp, %rcx			; CHECK-NEXT: leaq {{[0-9]+}}(%rsp), %rcx
	; CHECK-NEXT: cmovneq %rax, %rcx			; CHECK-NEXT: cmovneq %rax, %rcx
	; CHECK-NEXT: movups (%rcx), %xmm0			; CHECK-NEXT: movups (%rcx), %xmm0
	; CHECK-NEXT: callq _sink			; CHECK-NEXT: callq _sink
	; CHECK-NEXT: addq $40, %rsp			; CHECK-NEXT: addq $40, %rsp
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%1 = alloca <2 x double>, align 16			%1 = alloca <2 x double>, align 16
	%2 = alloca <2 x double>, align 8			%2 = alloca <2 x double>, align 8

	%val = load <2 x double>, ptr %1, align 16			%val = load <2 x double>, ptr %1, align 16
	%val2 = load <2 x double>, ptr %2, align 8			%val2 = load <2 x double>, ptr %2, align 8
	%val3 = select i1 %cmp, <2 x double> %val, <2 x double> %val2			%val3 = select i1 %cmp, <2 x double> %val, <2 x double> %val2
	call void @sink(<2 x double> %val3)			call void @sink(<2 x double> %val3)
	ret void			ret void
	}			}

	define void @test2(i1 %cmp) align 2 {			define void @test2(i1 %cmp) align 2 {
	; CHECK-LABEL: test2:			; CHECK-LABEL: test2:
	; CHECK: ## %bb.0:			; CHECK: ## %bb.0:
	; CHECK-NEXT: subq $40, %rsp			; CHECK-NEXT: subq $40, %rsp
	; CHECK-NEXT: .cfi_def_cfa_offset 48			; CHECK-NEXT: .cfi_def_cfa_offset 48
	; CHECK-NEXT: testb $1, %dil			; CHECK-NEXT: testb $1, %dil
	; CHECK-NEXT: leaq {{[0-9]+}}(%rsp), %rax			; CHECK-NEXT: movq %rsp, %rax
	; CHECK-NEXT: movq %rsp, %rcx			; CHECK-NEXT: leaq {{[0-9]+}}(%rsp), %rcx
	; CHECK-NEXT: cmovneq %rax, %rcx			; CHECK-NEXT: cmovneq %rax, %rcx
	; CHECK-NEXT: movaps (%rcx), %xmm0			; CHECK-NEXT: movaps (%rcx), %xmm0
	; CHECK-NEXT: callq _sink			; CHECK-NEXT: callq _sink
	; CHECK-NEXT: addq $40, %rsp			; CHECK-NEXT: addq $40, %rsp
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%1 = alloca <2 x double>, align 16			%1 = alloca <2 x double>, align 16
	%2 = alloca <2 x double>, align 8			%2 = alloca <2 x double>, align 8

	%val = load <2 x double>, ptr %1, align 16			%val = load <2 x double>, ptr %1, align 16
	%val2 = load <2 x double>, ptr %2, align 16			%val2 = load <2 x double>, ptr %2, align 16
	%val3 = select i1 %cmp, <2 x double> %val, <2 x double> %val2			%val3 = select i1 %cmp, <2 x double> %val, <2 x double> %val2
	call void @sink(<2 x double> %val3)			call void @sink(<2 x double> %val3)
	ret void			ret void
	}			}

llvm/test/DebugInfo/AArch64/frameindices.ll

Show First 20 Lines • Show All 80 Lines • ▼ Show 20 Lines	entry:
call void @_Z2f91A(%struct.A* %agg.tmp.i), !dbg !65		call void @_Z2f91A(%struct.A* %agg.tmp.i), !dbg !65
call void @llvm.lifetime.end(i64 24, i8* %1), !dbg !66		call void @llvm.lifetime.end(i64 24, i8* %1), !dbg !66
ret void, !dbg !67		ret void, !dbg !67
}		}

define void @_Z3f16v() personality i8* bitcast (i32 (...)* @__gxx_personality_v0 to i8*) !dbg !68 {		define void @_Z3f16v() personality i8* bitcast (i32 (...)* @__gxx_personality_v0 to i8*) !dbg !68 {
entry:		entry:
%agg.tmp.i.i = alloca %struct.A, align 8		%agg.tmp.i.i = alloca %struct.A, align 8
%d = alloca %struct.B, align 1		%d = alloca %struct.B, align 8
%agg.tmp.sroa.2 = alloca [15 x i8], align 1		%agg.tmp.sroa.2 = alloca [15 x i8], align 1
%agg.tmp.sroa.4 = alloca [7 x i8], align 1		%agg.tmp.sroa.4 = alloca [7 x i8], align 1
tail call void @llvm.dbg.declare(metadata [15 x i8]* %agg.tmp.sroa.2, metadata !56, metadata !74), !dbg !75		tail call void @llvm.dbg.declare(metadata [15 x i8]* %agg.tmp.sroa.2, metadata !56, metadata !74), !dbg !75
tail call void @llvm.dbg.declare(metadata [7 x i8]* %agg.tmp.sroa.4, metadata !56, metadata !77), !dbg !75		tail call void @llvm.dbg.declare(metadata [7 x i8]* %agg.tmp.sroa.4, metadata !56, metadata !77), !dbg !75
tail call void @llvm.dbg.declare(metadata %struct.A* undef, metadata !72, metadata !37), !dbg !78		tail call void @llvm.dbg.declare(metadata %struct.A* undef, metadata !72, metadata !37), !dbg !78
%0 = load i64, i64* @a, align 8, !dbg !79, !tbaa !40		%0 = load i64, i64* @a, align 8, !dbg !79, !tbaa !40
tail call void @llvm.dbg.value(metadata %struct.B* %d, metadata !73, metadata !37), !dbg !80		tail call void @llvm.dbg.value(metadata %struct.B* %d, metadata !73, metadata !37), !dbg !80
%call = call %struct.B* @_ZN1BC1El(%struct.B* %d, i64 %0), !dbg !80		%call = call %struct.B* @_ZN1BC1El(%struct.B* %d, i64 %0), !dbg !80
▲ Show 20 Lines • Show All 159 Lines • Show Last 20 Lines

llvm/test/DebugInfo/NVPTX/dbg-declare-alloca.ll

	Show First 20 Lines • Show All 215 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: .b8 0 // End Of Children Mark			; CHECK-NEXT: .b8 0 // End Of Children Mark
	; CHECK-NEXT: }			; CHECK-NEXT: }

	%struct.Foo = type { i32 }			%struct.Foo = type { i32 }

	; Function Attrs: noinline nounwind uwtable			; Function Attrs: noinline nounwind uwtable
	define void @use_dbg_declare() #0 !dbg !7 {			define void @use_dbg_declare() #0 !dbg !7 {
	entry:			entry:
	%o = alloca %struct.Foo, align 4			%o = alloca %struct.Foo, align 8
	call void @llvm.dbg.declare(metadata %struct.Foo* %o, metadata !10, metadata !15), !dbg !16			call void @llvm.dbg.declare(metadata %struct.Foo* %o, metadata !10, metadata !15), !dbg !16
	call void @escape_foo(%struct.Foo* %o), !dbg !17			call void @escape_foo(%struct.Foo* %o), !dbg !17
	ret void, !dbg !18			ret void, !dbg !18
	}			}

	; Function Attrs: nounwind readnone speculatable			; Function Attrs: nounwind readnone speculatable
	declare void @llvm.dbg.declare(metadata, metadata, metadata) #1			declare void @llvm.dbg.declare(metadata, metadata, metadata) #1

	Show All 28 Lines

llvm/test/DebugInfo/X86/dbg-addr.ll

	Show All 38 Lines
	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64--linux"			target triple = "x86_64--linux"

	%struct.Foo = type { i32 }			%struct.Foo = type { i32 }

	; Function Attrs: noinline nounwind uwtable			; Function Attrs: noinline nounwind uwtable
	define void @use_dbg_addr() #0 !dbg !7 {			define void @use_dbg_addr() #0 !dbg !7 {
	entry:			entry:
	%o = alloca %struct.Foo, align 4			%o = alloca %struct.Foo, align 8
	call void @llvm.dbg.addr(metadata %struct.Foo* %o, metadata !10, metadata !15), !dbg !16			call void @llvm.dbg.addr(metadata %struct.Foo* %o, metadata !10, metadata !15), !dbg !16
	call void @escape_foo(%struct.Foo* %o), !dbg !17			call void @escape_foo(%struct.Foo* %o), !dbg !17
	ret void, !dbg !18			ret void, !dbg !18
	}			}

	define void @test_dbg_addr_and_dbg_val_undef() #0 !dbg !117 {			define void @test_dbg_addr_and_dbg_val_undef() #0 !dbg !117 {
	entry:			entry:
	%o = alloca %struct.Foo, align 4			%o = alloca %struct.Foo, align 8
	call void @llvm.dbg.addr(metadata %struct.Foo* %o, metadata !1110, metadata !1115), !dbg !1116			call void @llvm.dbg.addr(metadata %struct.Foo* %o, metadata !1110, metadata !1115), !dbg !1116
	call void @escape_foo(%struct.Foo* %o), !dbg !1117			call void @escape_foo(%struct.Foo* %o), !dbg !1117
	call void @llvm.dbg.value(metadata %struct.Foo* undef, metadata !1110, metadata !1115), !dbg !1116			call void @llvm.dbg.value(metadata %struct.Foo* undef, metadata !1110, metadata !1115), !dbg !1116
	ret void, !dbg !1118			ret void, !dbg !1118
	}			}

	; Function Attrs: nounwind readnone speculatable			; Function Attrs: nounwind readnone speculatable
	declare void @llvm.dbg.addr(metadata, metadata, metadata) #1			declare void @llvm.dbg.addr(metadata, metadata, metadata) #1
	▲ Show 20 Lines • Show All 43 Lines • Show Last 20 Lines

llvm/test/DebugInfo/X86/dbg-declare-alloca.ll

	Show All 17 Lines
	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64--linux"			target triple = "x86_64--linux"

	%struct.Foo = type { i32 }			%struct.Foo = type { i32 }

	; Function Attrs: noinline nounwind uwtable			; Function Attrs: noinline nounwind uwtable
	define void @use_dbg_declare() #0 !dbg !7 {			define void @use_dbg_declare() #0 !dbg !7 {
	entry:			entry:
	%o = alloca %struct.Foo, align 4			%o = alloca %struct.Foo, align 8
	call void @llvm.dbg.declare(metadata %struct.Foo* %o, metadata !10, metadata !15), !dbg !16			call void @llvm.dbg.declare(metadata %struct.Foo* %o, metadata !10, metadata !15), !dbg !16
	call void @escape_foo(%struct.Foo* %o), !dbg !17			call void @escape_foo(%struct.Foo* %o), !dbg !17
	ret void, !dbg !18			ret void, !dbg !18
	}			}

	; Function Attrs: nounwind readnone speculatable			; Function Attrs: nounwind readnone speculatable
	declare void @llvm.dbg.declare(metadata, metadata, metadata) #1			declare void @llvm.dbg.declare(metadata, metadata, metadata) #1

	Show All 28 Lines

llvm/test/DebugInfo/X86/sret.ll

Show First 20 Lines • Show All 98 Lines • ▼ Show 20 Lines	entry:
%0 = load i32, i32* %m_int, align 4, !dbg !88		%0 = load i32, i32* %m_int, align 4, !dbg !88
ret i32 %0, !dbg !88		ret i32 %0, !dbg !88
}		}

; Function Attrs: uwtable		; Function Attrs: uwtable
define void @_ZN1B9AInstanceEv(%class.A* noalias sret(%class.A) %agg.result, %class.B* %this) #2 align 2 !dbg !53 {		define void @_ZN1B9AInstanceEv(%class.A* noalias sret(%class.A) %agg.result, %class.B* %this) #2 align 2 !dbg !53 {
entry:		entry:
%this.addr = alloca %class.B*, align 8		%this.addr = alloca %class.B*, align 8
%nrvo = alloca i1		%nrvo = alloca i1, align 1
%cleanup.dest.slot = alloca i32		%cleanup.dest.slot = alloca i32
store %class.B* %this, %class.B** %this.addr, align 8		store %class.B* %this, %class.B** %this.addr, align 8
call void @llvm.dbg.declare(metadata %class.B** %this.addr, metadata !89, metadata !DIExpression()), !dbg !91		call void @llvm.dbg.declare(metadata %class.B** %this.addr, metadata !89, metadata !DIExpression()), !dbg !91
%this1 = load %class.B, %class.B* %this.addr		%this1 = load %class.B, %class.B* %this.addr
store i1 false, i1* %nrvo, !dbg !92		store i1 false, i1* %nrvo, !dbg !92
call void @llvm.dbg.declare(metadata %class.A* %agg.result, metadata !93, metadata !DIExpression()), !dbg !92		call void @llvm.dbg.declare(metadata %class.A* %agg.result, metadata !93, metadata !DIExpression()), !dbg !92
call void @_ZN1AC1Ei(%class.A* %agg.result, i32 12), !dbg !92		call void @_ZN1AC1Ei(%class.A* %agg.result, i32 12), !dbg !92
store i1 true, i1* %nrvo, !dbg !94		store i1 true, i1* %nrvo, !dbg !94
Show All 20 Lines
}		}

; Function Attrs: uwtable		; Function Attrs: uwtable
define i32 @main(i32 %argc, i8** %argv) #2 personality i8* bitcast (i32 (...)* @__gxx_personality_v0 to i8*) !dbg !54 {		define i32 @main(i32 %argc, i8** %argv) #2 personality i8* bitcast (i32 (...)* @__gxx_personality_v0 to i8*) !dbg !54 {
entry:		entry:
%retval = alloca i32, align 4		%retval = alloca i32, align 4
%argc.addr = alloca i32, align 4		%argc.addr = alloca i32, align 4
%argv.addr = alloca i8**, align 8		%argv.addr = alloca i8**, align 8
%b = alloca %class.B, align 1		%b = alloca %class.B, align 8
%return_val = alloca i32, align 4		%return_val = alloca i32, align 4
%temp.lvalue = alloca %class.A, align 8		%temp.lvalue = alloca %class.A, align 8
%exn.slot = alloca i8*		%exn.slot = alloca i8*, align 8
%ehselector.slot = alloca i32		%ehselector.slot = alloca i32, align 4
%a = alloca %class.A, align 8		%a = alloca %class.A, align 8
%cleanup.dest.slot = alloca i32		%cleanup.dest.slot = alloca i32, align 4
store i32 0, i32* %retval		store i32 0, i32* %retval
store i32 %argc, i32* %argc.addr, align 4		store i32 %argc, i32* %argc.addr, align 4
call void @llvm.dbg.declare(metadata i32* %argc.addr, metadata !104, metadata !DIExpression()), !dbg !105		call void @llvm.dbg.declare(metadata i32* %argc.addr, metadata !104, metadata !DIExpression()), !dbg !105
store i8 %argv, i8* %argv.addr, align 8		store i8 %argv, i8* %argv.addr, align 8
call void @llvm.dbg.declare(metadata i8*** %argv.addr, metadata !106, metadata !DIExpression()), !dbg !105		call void @llvm.dbg.declare(metadata i8*** %argv.addr, metadata !106, metadata !DIExpression()), !dbg !105
call void @llvm.dbg.declare(metadata %class.B* %b, metadata !107, metadata !DIExpression()), !dbg !108		call void @llvm.dbg.declare(metadata %class.B* %b, metadata !107, metadata !DIExpression()), !dbg !108
call void @_ZN1BC2Ev(%class.B* %b), !dbg !108		call void @_ZN1BC2Ev(%class.B* %b), !dbg !108
call void @llvm.dbg.declare(metadata i32* %return_val, metadata !109, metadata !DIExpression()), !dbg !110		call void @llvm.dbg.declare(metadata i32* %return_val, metadata !109, metadata !DIExpression()), !dbg !110
▲ Show 20 Lines • Show All 63 Lines • ▼ Show 20 Lines
declare i8* @__cxa_begin_catch(i8*)		declare i8* @__cxa_begin_catch(i8*)

declare void @_ZSt9terminatev()		declare void @_ZSt9terminatev()

; Function Attrs: uwtable		; Function Attrs: uwtable
define linkonce_odr void @_ZN1AD0Ev(%class.A* %this) unnamed_addr #2 align 2 personality i8* bitcast (i32 (...)* @__gxx_personality_v0 to i8*) !dbg !61 {		define linkonce_odr void @_ZN1AD0Ev(%class.A* %this) unnamed_addr #2 align 2 personality i8* bitcast (i32 (...)* @__gxx_personality_v0 to i8*) !dbg !61 {
entry:		entry:
%this.addr = alloca %class.A*, align 8		%this.addr = alloca %class.A*, align 8
%exn.slot = alloca i8*		%exn.slot = alloca i8*, align 8
%ehselector.slot = alloca i32		%ehselector.slot = alloca i32, align 4
store %class.A* %this, %class.A** %this.addr, align 8		store %class.A* %this, %class.A** %this.addr, align 8
call void @llvm.dbg.declare(metadata %class.A** %this.addr, metadata !126, metadata !DIExpression()), !dbg !127		call void @llvm.dbg.declare(metadata %class.A** %this.addr, metadata !126, metadata !DIExpression()), !dbg !127
%this1 = load %class.A, %class.A* %this.addr		%this1 = load %class.A, %class.A* %this.addr
invoke void @_ZN1AD2Ev(%class.A* %this1)		invoke void @_ZN1AD2Ev(%class.A* %this1)
to label %invoke.cont unwind label %lpad, !dbg !128		to label %invoke.cont unwind label %lpad, !dbg !128

invoke.cont: ; preds = %entry		invoke.cont: ; preds = %entry
%0 = bitcast %class.A* %this1 to i8*, !dbg !129		%0 = bitcast %class.A* %this1 to i8*, !dbg !129
▲ Show 20 Lines • Show All 165 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[SelectionDAG] Do not second-guess alignment for allocaClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 466460

llvm/lib/CodeGen/SelectionDAG/FunctionLoweringInfo.cpp

llvm/test/CodeGen/AArch64/preferred-alignment.ll

llvm/test/CodeGen/AArch64/seh-finally.ll

llvm/test/CodeGen/AMDGPU/call-argument-types.ll

llvm/test/CodeGen/AMDGPU/frame-index-elimination.ll

llvm/test/CodeGen/AMDGPU/spill-scavenge-offset.ll

llvm/test/CodeGen/ARM/ssp-data-layout.ll

llvm/test/CodeGen/BPF/undef.ll

llvm/test/CodeGen/Mips/Fast-ISel/fastalloca.ll

llvm/test/CodeGen/Mips/atomic64.ll

llvm/test/CodeGen/Mips/cconv/byval.ll

llvm/test/CodeGen/Mips/cconv/return-struct.ll

llvm/test/CodeGen/Mips/largeimmprinting.ll

llvm/test/CodeGen/Mips/o32_cc_byval.ll

llvm/test/CodeGen/NVPTX/lower-byval-args.ll

llvm/test/CodeGen/PowerPC/aix-cc-byval.ll

llvm/test/CodeGen/PowerPC/aix-sret-param.ll

llvm/test/CodeGen/PowerPC/byval.ll

llvm/test/CodeGen/PowerPC/structsinregs.ll

llvm/test/CodeGen/PowerPC/varargs-struct-float.ll

llvm/test/CodeGen/RISCV/calling-conv-ilp32-ilp32f-ilp32d-common.ll

llvm/test/CodeGen/RISCV/frame.ll

llvm/test/CodeGen/RISCV/mem64.ll

llvm/test/CodeGen/RISCV/vararg.ll

llvm/test/CodeGen/Thumb2/mve-stack.ll

llvm/test/CodeGen/VE/Scalar/atomic_cmp_swap.ll

llvm/test/CodeGen/VE/Scalar/atomic_load.ll

llvm/test/CodeGen/VE/Scalar/atomic_swap.ll

llvm/test/CodeGen/WebAssembly/PR40172.ll

llvm/test/CodeGen/X86/dbg-changes-codegen-branch-folding.ll

llvm/test/CodeGen/X86/fast-isel-call.ll

llvm/test/CodeGen/X86/load-local-v3i129.ll

llvm/test/CodeGen/X86/pr44140.ll

llvm/test/CodeGen/X86/ssp-data-layout.ll

llvm/test/CodeGen/X86/win-cleanuppad.ll

llvm/test/CodeGen/X86/x86-mixed-alignment-dagcombine.ll

llvm/test/DebugInfo/AArch64/frameindices.ll

llvm/test/DebugInfo/NVPTX/dbg-declare-alloca.ll

llvm/test/DebugInfo/X86/dbg-addr.ll

llvm/test/DebugInfo/X86/dbg-declare-alloca.ll

llvm/test/DebugInfo/X86/sret.ll

[SelectionDAG] Do not second-guess alignment for alloca
ClosedPublic