This is an archive of the discontinued LLVM Phabricator instance.


Differential D68852

[Attributor] Pointer privatization attribute (argument promotion)
ClosedPublic

Authored by jdoerfert on Oct 10 2019, 7:36 PM.

Details

Reviewers

uenoku
sstefan1
lebedev.ri
hfinkel
vsk
dblaikie
davidxl
tejohnson
tstellar
echristo
chandlerc
efriedma

Commits

rG89c2e733e80e: [Attributor] Pointer privatization attribute (argument promotion)

Summary

A pointer is privatizable if it can be replaced by a new, private one.
Privatizing a pointer reduces its use count and the interaction between
unrelated code parts. This is a first step towards replacing argument promotion.
While we can already handle recursion (unlike argument promotion!), we
are restricted to stack allocations for now because we do not analyze
the uses in the callee.

All argument promotion tests now run the Attributor as well.

Diff Detail

Repository

rG LLVM Github Monorepo

Build Status

Buildable 40327
Build 40430: arc lint + arc unit

Event Timeline

jdoerfert created this revision. Oct 10 2019, 7:36 PM

Herald added a project: Restricted Project. Oct 10 2019, 7:36 PM

Herald added subscribers: arphaman, bollu, hiraditya.

I went through all the tests, added mem2reg/sroa where appropriate, and sometimes modified the source, mostly to avoid UB. I think the results of the Attributor look good; all problems should have been addressed already.

Harbormaster completed remote builds in B39391: Diff 224526. Oct 10 2019, 7:40 PM

jdoerfert mentioned this in D68766: [NFC][ArgPromo][Tests] Run update_test_checks on all ArgumentPromotion tests. Oct 10 2019, 7:44 PM

lebedev.ri added inline comments. Oct 16 2019, 11:59 AM

llvm/test/Transforms/ArgumentPromotion/control-flow2.ll
39–45	This is why I'm really pushing for using `--check-prefixes=ALL,ARGPROMOTION` from the get-go :[
48	I strongly believe you want to precommit test changes+regeneration first.

jdoerfert added parent revisions: D68766: [NFC][ArgPromo][Tests] Run update_test_checks on all ArgumentPromotion tests, D68851: [Utils] Allow update_test_checks to scrub attribute annotations, D68819: [Utils] Allow update_test_checks to check function arguments, D68850: [Utils] Deal with occasionally deleted functions. Oct 29 2019, 11:09 PM

jdoerfert marked 2 inline comments as done.

jdoerfert added inline comments.

llvm/test/Transforms/ArgumentPromotion/control-flow2.ll
39–45	Agreed.
48	I will update the test lines after the D68766 update. Do you want me to split test changes, e.g., that remove UB, as well?

jdoerfert added a parent revision: D68765: [Attributor] Function signature rewrite infrastructure. Oct 30 2019, 10:41 PM

Update the tests

Harbormaster completed remote builds in B40327: Diff 227230. Oct 31 2019, 12:02 AM

Update tests

Harbormaster completed remote builds in B40365: Diff 227335. Oct 31 2019, 1:45 PM

I'll copy the tests into the new Attributor test folder. Any other comments? @uenoku @sstefan1

Looks generally fine.
At first I couldn't picture what pointer privatization is. Could you add an example of the result of pointer privatization? Like,

int f(int* ptr){
 ...
}
=>
int f(int p){
 int* ptr = &p;
 ...
}

And I guess the current implementation always privatizes if possible. I think the cost may increase in some cases, right? What do you think?

llvm/include/llvm/Transforms/IPO/Attributor.h
2086	Could you add comments on the conditions under which the pointer can be replaced by a private one?
2098	nit: choose
llvm/test/Transforms/ArgumentPromotion/chained.ll
18	Please add FIXME here for `AAValueSimplify`.

In D68852#1781869, @uenoku wrote:
Looks generally fine.
At first I couldn't picture what pointer privatization is. Could you add an example of the result of pointer privatization? Like,
int f(int* ptr){
 ...
}
=>
int f(int p){
 int* ptr = &p;
 ...
}

I will add the example to the class comment in the header file.

For the record:
Privatization, at least the part implemented so far, is roughly argument promotion: instead of passing a pointer, pass the values the callee accesses through it.
The existing argument promotion does not do privatization but tries to replace the uses of the pointer with the passed values right away. Privatization is simpler in that
regard and, partially because of this, will later also be more powerful.
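
To make the description above concrete, here is a minimal C++ sketch of the idea, not the Attributor's actual output. The names `Pair`, `sumViaPointer`, and `sumPrivatized` are hypothetical. The callee keeps its pointer-based body and only the source of the pointer changes, from the caller's object to a private copy rebuilt from by-value arguments.

```cpp
#include <cassert>

struct Pair {
  int a;
  int b;
};

// Before: the callee receives a pointer into the caller's stack frame.
static int sumViaPointer(const Pair *p) { return p->a + p->b; }

// After (hypothetical result of privatization): the caller loads the
// accessed values and passes them directly. The callee rebuilds a private
// copy, so its pointer-based accesses stay unchanged.
static int sumPrivatized(int a, int b) {
  Pair priv{a, b};        // private copy, plays the role of the new alloca
  const Pair *p = &priv;  // former pointer uses now refer to the copy
  return p->a + p->b;
}

// The two call-site shapes, before and after the rewrite.
static int callerBefore() {
  Pair s{3, 4};
  return sumViaPointer(&s);
}

static int callerAfter() {
  Pair s{3, 4};
  return sumPrivatized(s.a, s.b);
}
```

Because the private copy is a plain local, a later pass such as SROA can usually dissolve it, which is why the callee's uses do not have to be rewritten up front.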

And I guess the current implementation always privatizes if possible. I think the cost may increase in some cases. What do you think?

That is correct. So far, we build the Attributor to be powerful (=applicable), not to be "smart" about costs. We'll have to write heuristics soon, but before that I want to ask people to test the powerful version in order to get more coverage and sniff out bugs.

In fact, we might always do privatization once another piece of code I have only locally is available. With it, privatization might cause arbitrarily many arguments at the call site, but we can always recover the original call site from it. More on that later though. For comparison: ArgumentPromotion arbitrarily restricts the size of the structs that are expanded to 3 elements, which is beyond me.

LGTM from my side but please make sure that it passes test-suite.

llvm/lib/Transforms/IPO/ArgumentPromotion.cpp
845	I think you can put this function into the `llvm` namespace and split the commit.

This revision is now accepted and ready to land. Dec 13 2019, 3:46 AM

In D68852#1783313, @uenoku wrote:

LGTM from my side but please make sure that it passes test-suite.

Will do.

I have to wait for D68765 first but not necessarily for the update script patches (they are blocked by the update test infrastructure patch).

rebase

clang-format: pass.

Build artifacts: console-log.txt, diff.json

Addressed comments

Harbormaster failed remote builds in B42507: Diff 233921! Dec 14 2019, 12:21 AM

clang-format: pass.

Build artifacts: console-log.txt, diff.json

jdoerfert added inline comments. Dec 14 2019, 12:24 AM

llvm/include/llvm/Transforms/IPO/Attributor.h
2086	Done.
llvm/lib/Transforms/IPO/ArgumentPromotion.cpp
845	I would like to get rid of this, the Attributor should directly use a more specialized form of areFunctionArgsABICompatible. I'll add a TODO.

Harbormaster failed remote builds in B42508: Diff 233922! Dec 14 2019, 12:30 AM

efriedma added inline comments. Dec 16 2019, 6:38 PM

llvm/lib/Transforms/IPO/Attributor.cpp
4367	No alignment set on loads?
4402	No alignment set on alloca?
4466	isArrayAllocation()

jdoerfert marked 2 inline comments as done. Dec 16 2019, 7:01 PM

jdoerfert added inline comments.

llvm/lib/Transforms/IPO/Attributor.cpp
4367	I'll add the alignment for the alloca below based on the alignment of the pointer it replaces. The loads and stores will be annotated by the next run of the Attributor automatically. We can also consider not emitting it if we can prove it is not needed, though that will not always be the case and it will require an analysis we do not yet have (something like AAAccessTracker). Finally, SROA should, in the good case, eliminate the alloca completely.
4466	Will do.

efriedma added inline comments. Dec 17 2019, 8:58 AM

llvm/lib/Transforms/IPO/Attributor.cpp
4367	"The loads and stores will be annotated by the next run of the Attributor automatically."
The default alignment of a load is the alignment of the load's type, as computed by the datalayout. This might be too high, depending on the pointer.

jdoerfert marked an inline comment as done. Dec 17 2019, 9:21 AM

jdoerfert added inline comments.

llvm/lib/Transforms/IPO/Attributor.cpp
4367	"The default alignment of a load is the alignment of the load's type, as computed by the datalayout."
Sure.
"This might be too high, depending on the pointer."
How could that be? We create the pointer with a proper type (the alloca) below. Shouldn't the alloca take the default alignment into account when the memory is allocated?

efriedma added inline comments. Dec 17 2019, 10:12 AM

llvm/lib/Transforms/IPO/Attributor.cpp
4367	If you create an alloca, then load/store to that pointer, the default alignments will work, yes. But that isn't what's happening here, is it? The alloca is in the callee, and this load is in the caller.

jdoerfert marked an inline comment as done. Dec 17 2019, 11:57 AM

jdoerfert added inline comments.

llvm/lib/Transforms/IPO/Attributor.cpp
4367	With this patch we always have an alloca or an argument with some pointer to (struct) type which we only access through proper gep addressing. I don't think this can create an alignment issue. I get that the alloca needs to be aligned with a higher value if the pointer was marked as such, but I already said that will be fixed.

efriedma added inline comments. Dec 17 2019, 12:42 PM

llvm/lib/Transforms/IPO/Attributor.cpp
4367	Pointers can be misaligned, generally. For example:

define void @f() {
entry:
  %a = alloca i32, align 1
  call void @g(i32* %a)
  ret void
}

define internal void @g(i32* %a) {
  %aa = load i32, i32* %a, align 1
  call void @z(i32 %aa)
  ret void
}

declare void @z(i32)

As far as I can tell, your patch will introduce a misaligned load into `@f()`. (C generally provides additional guarantees based on the pointee type of a pointer, but there isn't any corresponding rule for IR pointers.)
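
The alignment concern can also be seen from the C++ side. The following is only an analogy under stated assumptions, not code from the patch: `loadAlign1` and `roundTripAtOffset` are hypothetical helpers. `std::memcpy` plays the role of an `align 1` access, the only kind that is safe when nothing is known about the address. Dereferencing an `int32_t *` that points at the same address would instead assume the natural alignment of the type.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>

// Reads a 32-bit value from an address with no alignment guarantee.
// std::memcpy is the portable way to express an "align 1" load.
static std::int32_t loadAlign1(const unsigned char *addr) {
  std::int32_t v;
  std::memcpy(&v, addr, sizeof v);
  return v;
}

// Stores `value` at byte offset `off` inside a small buffer and reads it
// back. For off == 1 the address is not a multiple of alignof(int32_t),
// which mirrors the `alloca i32, align 1` in the IR example.
static std::int32_t roundTripAtOffset(std::size_t off, std::int32_t value) {
  alignas(std::int32_t) unsigned char buf[16] = {};
  std::memcpy(buf + off, &value, sizeof value);  // "align 1" store
  return loadAlign1(buf + off);
}
```

A load that is moved from the callee into the caller must not claim more alignment than was known at the original access, which is why defaulting to the type's natural alignment is unsafe here.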

jdoerfert marked an inline comment as done. Dec 17 2019, 3:25 PM

jdoerfert added inline comments.

llvm/lib/Transforms/IPO/Attributor.cpp
4367	I finally understand your concern, sorry that it took so long. I played around a bit to see what we currently do and I found this interesting: https://godbolt.org/z/2q_oqH We basically align the alloca naturally at some point. I would for now just set the alignment to 1 and add a TODO. For these loads, the Attributor can find a better alignment in the next run anyway and this allows me to not amend this patch too much. The TODO will explain the situation and we can work on a better solution from then. Maybe, if it is very simple, I'll directly use the AAAlign logic to get a lower bound instead. Long story short, I'll make sure these loads are properly aligned and we test for this.

jdoerfert mentioned this in D72382: [ArgPromotion] Extend search for SafeToUnconditionallyLoad indices to the blocks that must be executed upon entry into the function. Jan 9 2020, 8:24 AM

jdoerfert mentioned this in D71989: [OpenMP][IRBuilder] `omp task` support. Jan 14 2020, 5:54 PM

Closed by commit rG89c2e733e80e: [Attributor] Pointer privatization attribute (argument promotion) (authored by jdoerfert). Jan 29 2020, 7:33 PM

This revision was automatically updated to reflect the committed changes.

I added a test case for the alignment and I tested it on the LLVM Test Suite :)

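Revision Contents

Path	Size
llvm/include/llvm/Transforms/IPO/ArgumentPromotion.h	12 lines
llvm/include/llvm/Transforms/IPO/Attributor.h	51 lines
llvm/lib/Transforms/IPO/ArgumentPromotion.cpp	17 lines
llvm/lib/Transforms/IPO/Attributor.cpp	520 lines
llvm/test/Transforms/ArgumentPromotion/2008-02-01-ReturnAttrs.ll	24 lines
llvm/test/Transforms/ArgumentPromotion/2008-07-02-array-indexing.ll	29 lines
llvm/test/Transforms/ArgumentPromotion/2008-09-07-CGUpdate.ll	1 line
llvm/test/Transforms/ArgumentPromotion/2008-09-08-CGUpdateSelfEdge.ll	1 line
llvm/test/Transforms/ArgumentPromotion/X86/attributes.ll	43 lines
llvm/test/Transforms/ArgumentPromotion/X86/min-legal-vector-width.ll	175 lines
llvm/test/Transforms/ArgumentPromotion/X86/thiscall.ll	23 lines
llvm/test/Transforms/ArgumentPromotion/aggregate-promote.ll	20 lines
llvm/test/Transforms/ArgumentPromotion/attrs.ll	20 lines
llvm/test/Transforms/ArgumentPromotion/basictest.ll	25 lines
llvm/test/Transforms/ArgumentPromotion/byval-2.ll	39 lines
llvm/test/Transforms/ArgumentPromotion/byval.ll	64 lines
llvm/test/Transforms/ArgumentPromotion/chained.ll	17 lines
llvm/test/Transforms/ArgumentPromotion/control-flow.ll	38 lines
llvm/test/Transforms/ArgumentPromotion/control-flow2.ll	30 lines
llvm/test/Transforms/ArgumentPromotion/crash.ll	146 lines
llvm/test/Transforms/ArgumentPromotion/fp80.ll	85 lines
llvm/test/Transforms/ArgumentPromotion/inalloca.ll	42 lines
llvm/test/Transforms/ArgumentPromotion/invalidation.ll	21 lines
llvm/test/Transforms/ArgumentPromotion/naked_functions.ll	20 lines
llvm/test/Transforms/ArgumentPromotion/nonzero-address-spaces.ll	23 lines
llvm/test/Transforms/ArgumentPromotion/pr27568.ll	33 lines
llvm/test/Transforms/ArgumentPromotion/pr3085.ll	1 line
llvm/test/Transforms/ArgumentPromotion/pr32917.ll	17 lines
llvm/test/Transforms/ArgumentPromotion/pr33641_remove_arg_dbgvalue.ll	8 lines
llvm/test/Transforms/ArgumentPromotion/profile.ll	17 lines
llvm/test/Transforms/ArgumentPromotion/reserve-tbaa.ll	26 lines
llvm/test/Transforms/ArgumentPromotion/sret.ll	21 lines
llvm/test/Transforms/ArgumentPromotion/tail.ll	24 lines
llvm/test/Transforms/ArgumentPromotion/variadic.ll	20 lines
llvm/test/Transforms/FunctionAttrs/callbacks.ll	109 lines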

Diff 227230

llvm/include/llvm/Transforms/IPO/ArgumentPromotion.h

	//===- ArgumentPromotion.h - Promote by-reference arguments ------ C++ --===//			//===- ArgumentPromotion.h - Promote by-reference arguments ------ C++ --===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef LLVM_TRANSFORMS_IPO_ARGUMENTPROMOTION_H			#ifndef LLVM_TRANSFORMS_IPO_ARGUMENTPROMOTION_H
	#define LLVM_TRANSFORMS_IPO_ARGUMENTPROMOTION_H			#define LLVM_TRANSFORMS_IPO_ARGUMENTPROMOTION_H

	#include "llvm/Analysis/CGSCCPassManager.h"			#include "llvm/Analysis/CGSCCPassManager.h"
	#include "llvm/Analysis/LazyCallGraph.h"			#include "llvm/Analysis/LazyCallGraph.h"
	#include "llvm/IR/PassManager.h"			#include "llvm/IR/PassManager.h"

	namespace llvm {			namespace llvm {
				class TargetTransformInfo;

	/// Argument promotion pass.			/// Argument promotion pass.
	///			///
	/// This pass walks the functions in each SCC and for each one tries to			/// This pass walks the functions in each SCC and for each one tries to
	/// transform it and all of its callers to replace indirect arguments with			/// transform it and all of its callers to replace indirect arguments with
	/// direct (by-value) arguments.			/// direct (by-value) arguments.
	class ArgumentPromotionPass : public PassInfoMixin<ArgumentPromotionPass> {			class ArgumentPromotionPass : public PassInfoMixin<ArgumentPromotionPass> {
	unsigned MaxElements;			unsigned MaxElements;

	public:			public:
	ArgumentPromotionPass(unsigned MaxElements = 3u) : MaxElements(MaxElements) {}			ArgumentPromotionPass(unsigned MaxElements = 3u) : MaxElements(MaxElements) {}

				/// Check if callers and the callee \p F agree how promoted arguments would be
				/// passed. The ones that they do not agree on are eliminated from the sets but
				/// the return value has to be observed as well.
				static bool areFunctionArgsABICompatible(
				const Function &F, const TargetTransformInfo &TTI,
				SmallPtrSetImpl<Argument *> &ArgsToPromote,
				SmallPtrSetImpl<Argument *> &ByValArgsToTransform);

				/// Checks if a type could have padding bytes.
				static bool isDenselyPacked(Type *type, const DataLayout &DL);

	PreservedAnalyses run(LazyCallGraph::SCC &C, CGSCCAnalysisManager &AM,			PreservedAnalyses run(LazyCallGraph::SCC &C, CGSCCAnalysisManager &AM,
	LazyCallGraph &CG, CGSCCUpdateResult &UR);			LazyCallGraph &CG, CGSCCUpdateResult &UR);
	};			};

	} // end namespace llvm			} // end namespace llvm

	#endif // LLVM_TRANSFORMS_IPO_ARGUMENTPROMOTION_H			#endif // LLVM_TRANSFORMS_IPO_ARGUMENTPROMOTION_H

llvm/include/llvm/Transforms/IPO/Attributor.h

[... 98 lines hidden ...]

#include "llvm/ADT/MapVector.h"		#include "llvm/ADT/MapVector.h"
#include "llvm/ADT/SCCIterator.h"		#include "llvm/ADT/SCCIterator.h"
#include "llvm/ADT/SetVector.h"		#include "llvm/ADT/SetVector.h"
#include "llvm/Analysis/AliasAnalysis.h"		#include "llvm/Analysis/AliasAnalysis.h"
#include "llvm/Analysis/CallGraph.h"		#include "llvm/Analysis/CallGraph.h"
#include "llvm/Analysis/MustExecute.h"		#include "llvm/Analysis/MustExecute.h"
#include "llvm/Analysis/TargetLibraryInfo.h"		#include "llvm/Analysis/TargetLibraryInfo.h"
		#include "llvm/Analysis/TargetTransformInfo.h"
#include "llvm/IR/CallSite.h"		#include "llvm/IR/CallSite.h"
#include "llvm/IR/PassManager.h"		#include "llvm/IR/PassManager.h"

namespace llvm {		namespace llvm {

struct AbstractAttribute;		struct AbstractAttribute;
struct InformationCache;		struct InformationCache;
struct AAIsDead;		struct AAIsDead;
[... 155 lines hidden ...]	Function *getAssociatedFunction() const {
if (isa<Argument>(V))		if (isa<Argument>(V))
return cast<Argument>(V).getParent();		return cast<Argument>(V).getParent();
if (isa<Instruction>(V))		if (isa<Instruction>(V))
return cast<Instruction>(V).getFunction();		return cast<Instruction>(V).getFunction();
return nullptr;		return nullptr;
}		}

/// Return the associated argument, if any.		/// Return the associated argument, if any.
Argument *getAssociatedArgument() const {		Argument *getAssociatedArgument() const;
if (auto *Arg = dyn_cast<Argument>(&getAnchorValue()))
return Arg;
int ArgNo = getArgNo();
if (ArgNo < 0)
return nullptr;
Function *AssociatedFn = getAssociatedFunction();
if (!AssociatedFn || AssociatedFn->arg_size() <= unsigned(ArgNo))
return nullptr;
return AssociatedFn->arg_begin() + ArgNo;
}

/// Return true if the position refers to a function interface, that is the		/// Return true if the position refers to a function interface, that is the
/// function scope, the function return, or an argumnt.		/// function scope, the function return, or an argumnt.
bool isFnInterfaceKind() const {		bool isFnInterfaceKind() const {
switch (getPositionKind()) {		switch (getPositionKind()) {
case IRPosition::IRP_FUNCTION:		case IRPosition::IRP_FUNCTION:
case IRPosition::IRP_RETURNED:		case IRPosition::IRP_RETURNED:
case IRPosition::IRP_ARGUMENT:		case IRPosition::IRP_ARGUMENT:
[... 300 lines hidden ...]	struct InformationCache {
MustBeExecutedContextExplorer &getMustBeExecutedContextExplorer() {		MustBeExecutedContextExplorer &getMustBeExecutedContextExplorer() {
return Explorer;		return Explorer;
}		}

/// Return TargetLibraryInfo for function \p F.		/// Return TargetLibraryInfo for function \p F.
TargetLibraryInfo *getTargetLibraryInfoForFunction(const Function &F) {		TargetLibraryInfo *getTargetLibraryInfoForFunction(const Function &F) {
return AG.getAnalysis<TargetLibraryAnalysis>(F);		return AG.getAnalysis<TargetLibraryAnalysis>(F);
}		}
		const TargetTransformInfo *
		getTargetTransformInfoForFunction(const Function &F) {
		return AG.getAnalysis<TargetIRAnalysis>(F);
		}

/// Return AliasAnalysis Result for function \p F.		/// Return AliasAnalysis Result for function \p F.
AAResults *getAAResultsForFunction(const Function &F) {		AAResults *getAAResultsForFunction(const Function &F) {
return AG.getAnalysis<AAManager>(F);		return AG.getAnalysis<AAManager>(F);
}		}

/// Return SCC size on call graph for function \p F.		/// Return SCC size on call graph for function \p F.
unsigned getSccSize(const Function &F) {		unsigned getSccSize(const Function &F) {
[... 1,467 lines hidden ...]	struct AAHeapToStack : public StateWrapper<BooleanState, AbstractAttribute>,

/// Create an abstract attribute view for the position \p IRP.		/// Create an abstract attribute view for the position \p IRP.
static AAHeapToStack &createForPosition(const IRPosition &IRP, Attributor &A);		static AAHeapToStack &createForPosition(const IRPosition &IRP, Attributor &A);

/// Unique ID (due to the unique address)		/// Unique ID (due to the unique address)
static const char ID;		static const char ID;
};		};

		/// An abstract interface for privatizability.
		///
		/// A pointer is privatizable if it can be replaced by a new, private one.
		/// Privatizing pointer reduces the use count, interaction between unrelated
		/// code parts.
		struct AAPrivatizablePtr : public StateWrapper<BooleanState, AbstractAttribute>,
		public IRPosition {
		AAPrivatizablePtr(const IRPosition &IRP) : IRPosition(IRP) {}

		/// Returns true if pointer privatization is assumed to be possible.
		bool isAssumedPrivatizablePtr() const { return getAssumed(); }

		/// Returns true if pointer privatization is known to be possible.
		bool isKnownPrivatizablePtr() const { return getKnown(); }

		/// Return the type we can chose for a private copy of the underlying
		/// value. None means it is not clear yet, nullptr means there is none.
		virtual Optional<Type *> getPrivatizableType() const = 0;

		/// Return an IR position, see struct IRPosition.
		///
		///{
		IRPosition &getIRPosition() { return *this; }
		const IRPosition &getIRPosition() const { return *this; }
		///}

		/// Create an abstract attribute view for the position \p IRP.
		static AAPrivatizablePtr &createForPosition(const IRPosition &IRP,
		Attributor &A);

		/// Unique ID (due to the unique address)
		static const char ID;
		};

/// An abstract interface for all memory related attributes.		/// An abstract interface for all memory related attributes.
struct AAMemoryBehavior		struct AAMemoryBehavior
: public IRAttribute<		: public IRAttribute<
Attribute::ReadNone,		Attribute::ReadNone,
StateWrapper<BitIntegerState<uint8_t, 3>, AbstractAttribute>> {		StateWrapper<BitIntegerState<uint8_t, 3>, AbstractAttribute>> {
AAMemoryBehavior(const IRPosition &IRP) : IRAttribute(IRP) {}		AAMemoryBehavior(const IRPosition &IRP) : IRAttribute(IRP) {}

/// State encoding bits. A set bit in the state means the property holds.		/// State encoding bits. A set bit in the state means the property holds.
[... 44 lines hidden ...]
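
The comment on `getPrivatizableType()` above describes a three-valued result: "not clear yet", "there is none", or a concrete type. A minimal sketch of how such values could be merged across several candidates follows. It uses `std::optional` instead of `llvm::Optional`, and `Ty`, `PrivTy`, and `combine` are hypothetical names. The merge rule itself is an assumption for illustration, not something this part of the diff shows.

```cpp
#include <cassert>
#include <optional>
#include <string>

// Hypothetical stand-in for llvm::Type; only pointer identity matters here.
struct Ty {
  std::string name;
};

// std::nullopt : no candidate seen yet ("not clear yet")
// nullptr      : candidates disagree ("there is none")
// non-null     : the single type agreed on so far
using PrivTy = std::optional<const Ty *>;

// Assumed merge rule: an unknown side is neutral, a disagreement is final.
static PrivTy combine(PrivTy a, PrivTy b) {
  if (!a)
    return b;  // nothing known on the left: take the right
  if (!b)
    return a;  // nothing known on the right: keep the left
  if (*a == *b)
    return a;  // same type, or both already nullptr
  return PrivTy(static_cast<const Ty *>(nullptr));  // conflict: no type
}
```

Keeping "unknown" distinct from "none" lets an optimistic fixpoint iteration start from no information and only fall back to the pessimistic answer once two candidates actually conflict.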

llvm/lib/Transforms/IPO/ArgumentPromotion.cpp

[... 767 lines hidden ...]	static bool isSafeToPromoteArgument(Argument *Arg, Type *ByValTy, AAResults &AAR,
}		}

// If the path from the entry of the function to each load is free of		// If the path from the entry of the function to each load is free of
// instructions that potentially invalidate the load, we can make the		// instructions that potentially invalidate the load, we can make the
// transformation!		// transformation!
return true;		return true;
}		}

/// Checks if a type could have padding bytes.		bool ArgumentPromotionPass::isDenselyPacked(Type *type, const DataLayout &DL) {
static bool isDenselyPacked(Type *type, const DataLayout &DL) {
// There is no size information, so be conservative.		// There is no size information, so be conservative.
if (!type->isSized())		if (!type->isSized())
return false;		return false;

// If the alloc size is not equal to the storage size, then there are padding		// If the alloc size is not equal to the storage size, then there are padding
// bytes. For x86_fp80 on x86-64, size: 80 alloc size: 128.		// bytes. For x86_fp80 on x86-64, size: 80 alloc size: 128.
if (DL.getTypeSizeInBits(type) != DL.getTypeAllocSizeInBits(type))		if (DL.getTypeSizeInBits(type) != DL.getTypeAllocSizeInBits(type))
return false;		return false;
[... 52 lines hidden ...]	static bool canPaddingBeAccessed(Argument *arg) {
// Check to make sure the pointers aren't captured		// Check to make sure the pointers aren't captured
for (StoreInst *Store : Stores)		for (StoreInst *Store : Stores)
if (PtrValues.count(Store->getValueOperand()))		if (PtrValues.count(Store->getValueOperand()))
return true;		return true;

return false;		return false;
}		}

static bool areFunctionArgsABICompatible(		bool ArgumentPromotionPass::areFunctionArgsABICompatible(
const Function &F, const TargetTransformInfo &TTI,		const Function &F, const TargetTransformInfo &TTI,
SmallPtrSetImpl<Argument *> &ArgsToPromote,		SmallPtrSetImpl<Argument *> &ArgsToPromote,
SmallPtrSetImpl<Argument *> &ByValArgsToTransform) {		SmallPtrSetImpl<Argument *> &ByValArgsToTransform) {
for (const Use &U : F.uses()) {		for (const Use &U : F.uses()) {
CallSite CS(U.getUser());		CallSite CS(U.getUser());
		if (!CS)
		return false;
const Function *Caller = CS.getCaller();		const Function *Caller = CS.getCaller();
const Function *Callee = CS.getCalledFunction();		const Function *Callee = CS.getCalledFunction();
if (!TTI.areFunctionArgsABICompatible(Caller, Callee, ArgsToPromote) ||		if (!TTI.areFunctionArgsABICompatible(Caller, Callee, ArgsToPromote) ||
!TTI.areFunctionArgsABICompatible(Caller, Callee, ByValArgsToTransform))		!TTI.areFunctionArgsABICompatible(Caller, Callee, ByValArgsToTransform))
return false;		return false;
}		}
return true;		return true;
}		}
[... 85 lines hidden ...]	if (PtrArg->hasStructRetAttr()) {
CS.removeParamAttr(ArgNo, Attribute::StructRet);		CS.removeParamAttr(ArgNo, Attribute::StructRet);
CS.addParamAttr(ArgNo, Attribute::NoAlias);		CS.addParamAttr(ArgNo, Attribute::NoAlias);
}		}
}		}

// If this is a byval argument, and if the aggregate type is small, just		// If this is a byval argument, and if the aggregate type is small, just
// pass the elements, which is always safe, if the passed value is densely		// pass the elements, which is always safe, if the passed value is densely
// packed or if we can prove the padding bytes are never accessed.		// packed or if we can prove the padding bytes are never accessed.
bool isSafeToPromote =		bool isSafeToPromote = PtrArg->hasByValAttr() &&
PtrArg->hasByValAttr() &&		(ArgumentPromotionPass::isDenselyPacked(AgTy, DL) ||
(isDenselyPacked(AgTy, DL) || !canPaddingBeAccessed(PtrArg));		!canPaddingBeAccessed(PtrArg));
if (isSafeToPromote) {		if (isSafeToPromote) {
if (StructType *STy = dyn_cast<StructType>(AgTy)) {		if (StructType *STy = dyn_cast<StructType>(AgTy)) {
if (MaxElements > 0 && STy->getNumElements() > MaxElements) {		if (MaxElements > 0 && STy->getNumElements() > MaxElements) {
LLVM_DEBUG(dbgs() << "argpromotion disable promoting argument '"		LLVM_DEBUG(dbgs() << "argpromotion disable promoting argument '"
<< PtrArg->getName()		<< PtrArg->getName()
<< "' because it would require adding more"		<< "' because it would require adding more"
<< " than " << MaxElements		<< " than " << MaxElements
<< " arguments to the function.\n");		<< " arguments to the function.\n");
[... 41 lines hidden ...]	for (Argument *PtrArg : PointerArgs) {
if (isSafeToPromoteArgument(PtrArg, ByValTy, AAR, MaxElements))		if (isSafeToPromoteArgument(PtrArg, ByValTy, AAR, MaxElements))
ArgsToPromote.insert(PtrArg);		ArgsToPromote.insert(PtrArg);
}		}

// No promotable pointer arguments.		// No promotable pointer arguments.
if (ArgsToPromote.empty() && ByValArgsToTransform.empty())		if (ArgsToPromote.empty() && ByValArgsToTransform.empty())
return nullptr;		return nullptr;

if (!areFunctionArgsABICompatible(*F, TTI, ArgsToPromote,		if (!ArgumentPromotionPass::areFunctionArgsABICompatible(
ByValArgsToTransform))		*F, TTI, ArgsToPromote, ByValArgsToTransform))
return nullptr;		return nullptr;

return doPromotion(F, ArgsToPromote, ByValArgsToTransform, ReplaceCallSite);		return doPromotion(F, ArgsToPromote, ByValArgsToTransform, ReplaceCallSite);
}		}

PreservedAnalyses ArgumentPromotionPass::run(LazyCallGraph::SCC &C,		PreservedAnalyses ArgumentPromotionPass::run(LazyCallGraph::SCC &C,
CGSCCAnalysisManager &AM,		CGSCCAnalysisManager &AM,
LazyCallGraph &CG,		LazyCallGraph &CG,
[... 152 lines hidden ...]
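
The visible part of `isDenselyPacked` above rejects unsized types and types whose storage size differs from their alloc size (the `x86_fp80` case: 80 bits of data in a 128-bit slot). The sketch below models that check without LLVM. `TypeInfo` and `isDenselyPackedSketch` are hypothetical, and the recursive check that fields tile the aggregate is an assumption about the part of the function not shown in this view.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical, simplified type description; the real code asks DataLayout.
struct TypeInfo {
  std::uint64_t sizeInBits;       // bits that carry data
  std::uint64_t allocSizeInBits;  // bits reserved per object, incl. padding
  std::vector<TypeInfo> fields;   // empty for scalars
};

static bool isDenselyPackedSketch(const TypeInfo &t) {
  // Storage size differing from alloc size means there are padding bits.
  if (t.sizeInBits != t.allocSizeInBits)
    return false;
  // Assumed aggregate rule: the fields must cover the object without gaps.
  std::uint64_t covered = 0;
  for (const TypeInfo &f : t.fields) {
    if (!isDenselyPackedSketch(f))
      return false;
    covered += f.allocSizeInBits;
  }
  return t.fields.empty() || covered == t.sizeInBits;
}
```

Padding matters for promotion because passing the elements individually drops the padding bytes, which is only safe if the callee can never observe them.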

llvm/lib/Transforms/IPO/Attributor.cpp

[... 23 lines hidden ...]
#include "llvm/Analysis/EHPersonalities.h"		#include "llvm/Analysis/EHPersonalities.h"
#include "llvm/Analysis/GlobalsModRef.h"		#include "llvm/Analysis/GlobalsModRef.h"
#include "llvm/Analysis/Loads.h"		#include "llvm/Analysis/Loads.h"
#include "llvm/Analysis/MemoryBuiltins.h"		#include "llvm/Analysis/MemoryBuiltins.h"
#include "llvm/Analysis/ValueTracking.h"		#include "llvm/Analysis/ValueTracking.h"
#include "llvm/IR/Argument.h"		#include "llvm/IR/Argument.h"
#include "llvm/IR/Attributes.h"		#include "llvm/IR/Attributes.h"
#include "llvm/IR/CFG.h"		#include "llvm/IR/CFG.h"
		#include "llvm/IR/IRBuilder.h"
#include "llvm/IR/InstIterator.h"		#include "llvm/IR/InstIterator.h"
#include "llvm/IR/IntrinsicInst.h"		#include "llvm/IR/IntrinsicInst.h"
		#include "llvm/IR/NoFolder.h"
#include "llvm/Support/CommandLine.h"		#include "llvm/Support/CommandLine.h"
#include "llvm/Support/Debug.h"		#include "llvm/Support/Debug.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"
		#include "llvm/Transforms/IPO/ArgumentPromotion.h"
#include "llvm/Transforms/Utils/BasicBlockUtils.h"		#include "llvm/Transforms/Utils/BasicBlockUtils.h"
#include "llvm/Transforms/Utils/Local.h"		#include "llvm/Transforms/Utils/Local.h"

#include <cassert>		#include <cassert>

using namespace llvm;		using namespace llvm;

#define DEBUG_TYPE "attributor"		#define DEBUG_TYPE "attributor"
[... 97 lines hidden ...]
ChangeStatus llvm::operator|(ChangeStatus l, ChangeStatus r) {		ChangeStatus llvm::operator|(ChangeStatus l, ChangeStatus r) {
return l == ChangeStatus::CHANGED ? l : r;		return l == ChangeStatus::CHANGED ? l : r;
}		}
ChangeStatus llvm::operator&(ChangeStatus l, ChangeStatus r) {		ChangeStatus llvm::operator&(ChangeStatus l, ChangeStatus r) {
return l == ChangeStatus::UNCHANGED ? l : r;		return l == ChangeStatus::UNCHANGED ? l : r;
}		}
///}		///}

		Argument *IRPosition::getAssociatedArgument() const {
		if (getPositionKind() == IRP_ARGUMENT)
		return cast<Argument>(&getAnchorValue());

		// Not an Argument and no argument number means this is not a call site
		// argument, thus we cannot find a callback argument to return.
		int ArgNo = getArgNo();
		if (ArgNo < 0)
		return nullptr;

		const Function *Callee = getAssociatedFunction();
		if (Callee && Callee->arg_size() > unsigned(ArgNo))
		return Callee->getArg(ArgNo);

		return nullptr;
		}

		/// Helper function to create a pointer of type \p ResTy, based on \p Ptr, and
		/// advanced by \p Offset bytes. To aid later analysis the method tries to build
		/// getelement pointer instructions that traverse the natural type of \p Ptr if
		/// possible. If that fails, the remaining offset is adjusted byte-wise, hence
		/// through a cast to i8*.
		///
		/// TODO: This could probably live somewhere more prominantly if it doesn't
		/// already exist.
		static Value *constructPointer(Type *ResTy, Value *Ptr, int64_t Offset,
		IRBuilder<NoFolder> &IRB, const DataLayout &DL) {
		assert(Offset >= 0 && "Negative offset not supported yet!");
		LLVM_DEBUG(dbgs() << "Construct pointer: " << *Ptr << " + " << Offset
		<< "-bytes as " << *ResTy << "\n");

		// The initial type we are trying to traverse to get nice GEPs.
		Type *Ty = Ptr->getType();

		SmallVector<Value *, 4> Indices;
		std::string GEPName = Ptr->getName();
		while (Offset) {
		uint64_t Idx, Rem;

		if (auto *STy = dyn_cast<StructType>(Ty)) {
		const StructLayout *SL = DL.getStructLayout(STy);
		if (int64_t(SL->getSizeInBytes()) < Offset)
		break;
		Idx = SL->getElementContainingOffset(Offset);
		assert(Idx < STy->getNumElements() && "Offset calculation error!");
		Rem = Offset - SL->getElementOffset(Idx);
		Ty = STy->getElementType(Idx);
		} else if (auto *PTy = dyn_cast<PointerType>(Ty)) {
		Ty = PTy->getElementType();
		if (!Ty->isSized())
		break;
		uint64_t ElementSize = DL.getTypeAllocSize(Ty);
		assert(ElementSize && "Expected type with size!");
		Idx = Offset / ElementSize;
		Rem = Offset % ElementSize;
		} else {
		// Non-aggregate type, we cast and make byte-wise progress now.
		break;
		}

		LLVM_DEBUG(errs() << "Ty: " << *Ty << " Offset: " << Offset
		<< " Idx: " << Idx << " Rem: " << Rem << "\n");

		GEPName += "." + std::to_string(Idx);
		Indices.push_back(ConstantInt::get(IRB.getInt32Ty(), Idx));
		Offset = Rem;
		}

		// Create a GEP if we collected indices above.
		if (Indices.size())
		Ptr = IRB.CreateGEP(Ptr, Indices, GEPName);

		// If an offset is left we use byte-wise adjustment.
		if (Offset) {
		Ptr = IRB.CreateBitCast(Ptr, IRB.getInt8PtrTy());
		Ptr = IRB.CreateGEP(Ptr, IRB.getInt32(Offset),
		GEPName + ".b" + Twine(Offset));
		}

		// Ensure the result has the requested type.
		Ptr = IRB.CreateBitOrPointerCast(Ptr, ResTy, Ptr->getName() + ".cast");

		LLVM_DEBUG(dbgs() << "Constructed pointer: " << *Ptr << "\n");
		return Ptr;
		}
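Editorial sketch (not part of the patch): the loop in constructPointer repeatedly splits the remaining byte offset into an index for the next GEP operand and a remainder that is carried into the selected element. The standalone model below illustrates only that arithmetic. `ToyStructLayout`, `splitStructOffset`, and `splitArrayOffset` are hypothetical names introduced for illustration and are not LLVM APIs.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a struct layout: the byte offset of each field
// (ascending, first entry 0) plus the total size. Not an LLVM type.
struct ToyStructLayout {
  std::vector<uint64_t> FieldOffsets;
  uint64_t SizeInBytes;
};

// One step of the traversal: which index to emit and how many bytes remain.
struct SplitOffset {
  unsigned Idx;
  uint64_t Rem;
};

// Struct case: pick the last field that starts at or before Offset and carry
// the distance into that field as the remainder.
inline SplitOffset splitStructOffset(const ToyStructLayout &SL,
                                     uint64_t Offset) {
  assert(!SL.FieldOffsets.empty() && Offset < SL.SizeInBytes);
  unsigned Idx = 0;
  while (Idx + 1 < SL.FieldOffsets.size() &&
         SL.FieldOffsets[Idx + 1] <= Offset)
    ++Idx;
  return {Idx, Offset - SL.FieldOffsets[Idx]};
}

// Pointer/array-like case: whole elements become the index, the rest is
// carried into the element.
inline SplitOffset splitArrayOffset(uint64_t ElementSize, uint64_t Offset) {
  assert(ElementSize && "Expected type with size!");
  return {static_cast<unsigned>(Offset / ElementSize), Offset % ElementSize};
}
```

For a layout with fields at offsets 0, 8, and 16, an offset of 12 selects field 1 with 4 bytes remaining. If the selected element cannot be traversed further, that remainder is what the patch handles with the byte-wise i8* adjustment.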

/// Recursively visit all values that might become \p IRP at some point. This		/// Recursively visit all values that might become \p IRP at some point. This
/// will be done by looking through cast instructions, selects, phis, and calls		/// will be done by looking through cast instructions, selects, phis, and calls
/// with the "returned" attribute. Once we cannot look through the value any		/// with the "returned" attribute. Once we cannot look through the value any
/// further, the callback \p VisitValueCB is invoked and passed the current		/// further, the callback \p VisitValueCB is invoked and passed the current
/// value, the \p State, and a flag to indicate if we stripped anything. To		/// value, the \p State, and a flag to indicate if we stripped anything. To
/// limit how much effort is invested, we will never visit more values than		/// limit how much effort is invested, we will never visit more values than
/// specified by \p MaxValues.		/// specified by \p MaxValues.
template <typename AAType, typename StateTy>		template <typename AAType, typename StateTy>
[3,882 lines collapsed]
		void trackStatistics() const override {
STATS_DECL(MallocCalls, Function,		STATS_DECL(MallocCalls, Function,
"Number of malloc calls converted to allocas");		"Number of malloc calls converted to allocas");
for (auto *C : MallocCalls)		for (auto *C : MallocCalls)
if (!BadMallocCalls.count(C))		if (!BadMallocCalls.count(C))
++BUILD_STAT_NAME(MallocCalls, Function);		++BUILD_STAT_NAME(MallocCalls, Function);
}		}
};		};

		/// ----------------------- Privatizable Pointers ------------------------------
		struct AAPrivatizablePtrImpl : public AAPrivatizablePtr {
		AAPrivatizablePtrImpl(const IRPosition &IRP)
		: AAPrivatizablePtr(IRP), PrivatizableType(llvm::None) {}

		ChangeStatus indicatePessimisticFixpoint() override {
		AAPrivatizablePtr::indicatePessimisticFixpoint();
		PrivatizableType = nullptr;
		return ChangeStatus::CHANGED;
		}

		/// Identify the type we can choose for a private copy of the underlying
		/// argument. None means it is not clear yet, nullptr means there is none.
		virtual Optional<Type *> identifyPrivatizableType(Attributor &A) = 0;

		/// Return a privatizable type that encloses both T0 and T1.
		/// TODO: This is merely a stub for now as we should manage a mapping as well.
		Optional<Type *> combineTypes(Optional<Type *> T0, Optional<Type *> T1) {
		if (!T0.hasValue())
		return T1;
		if (!T1.hasValue())
		return T0;
		if (T0 == T1)
		return T0;
		return nullptr;
		}

		Optional<Type *> getPrivatizableType() const override {
		return PrivatizableType;
		}

		const std::string getAsStr() const override {
		return isAssumedPrivatizablePtr() ? "[priv]" : "[no-priv]";
		}

		protected:
		Optional<Type *> PrivatizableType;
		};
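Editorial sketch (not part of the patch): the privatizable type is tracked in three states, "not clear yet" (None), "there is none" (nullptr), and a concrete type. The standalone model below mirrors how combineTypes merges two such states and how the call-site check decides whether to keep going. `ToyType`, `TypeState`, `combine`, and `stillViable` are hypothetical names used only for illustration.

```cpp
#include <cassert>

// Hypothetical stand-in for a type; only pointer identity matters here.
struct ToyType {
  const char *Name;
};

// Known == false models llvm::None ("not clear yet").
// Known == true with Ty == nullptr models "no privatizable type".
struct TypeState {
  bool Known;
  const ToyType *Ty;
  static TypeState unknown() { return {false, nullptr}; }
  static TypeState known(const ToyType *T) { return {true, T}; }
};

// Merge two states: an unknown side defers to the other side, equal types
// are kept, and any disagreement collapses to "no privatizable type".
inline TypeState combine(TypeState T0, TypeState T1) {
  if (!T0.Known)
    return T1;
  if (!T1.Known)
    return T0;
  if (T0.Ty == T1.Ty)
    return T0;
  return TypeState::known(nullptr);
}

// Mirrors the call-site callback's return value: continue while the state
// is still undecided or names a concrete type.
inline bool stillViable(TypeState T) { return !T.Known || T.Ty != nullptr; }
```

Under this model, two call sites that agree on a type keep it, while a single disagreeing call site drives the state to "none", which is the point where the patch indicates a pessimistic fixpoint.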

		// TODO: Do this for call site arguments (probably also other values) as well.

		struct AAPrivatizablePtrArgument final : public AAPrivatizablePtrImpl {
		AAPrivatizablePtrArgument(const IRPosition &IRP)
		: AAPrivatizablePtrImpl(IRP) {}

		/// See AAPrivatizablePtrImpl::identifyPrivatizableType(...)
		Optional<Type *> identifyPrivatizableType(Attributor &A) override {
		// If this is a byval argument and we know all the call sites (so we can
		// rewrite them), there is no need to check them explicitly.
		if (getIRPosition().hasAttr(Attribute::ByVal) &&
		A.checkForAllCallSites([](AbstractCallSite ACS) { return true; }, *this,
		true))
		return getAssociatedValue().getType()->getPointerElementType();

		Optional<Type *> Ty;
		unsigned ArgNo = getIRPosition().getArgNo();

		// Make sure the associated call site argument has the same type at all call
		// sites and it is an allocation we know is safe to privatize, for now that
		// means we only allow alloca instructions.
		// TODO: We can additionally analyze the accesses in the callee to create
		// the type from that information instead. That is a little more
		// involved and will be done in a follow up patch.
		auto CallSiteCheck = [&](AbstractCallSite ACS) {
		IRPosition ACSArgPos = IRPosition::callsite_argument(ACS, ArgNo);
		// Check if a corresponding argument was found or if it is one not
		// associated (which can happen for callback calls).
		if (ACSArgPos.getPositionKind() == IRPosition::IRP_INVALID)
		return false;

		// Check that all call sites agree on a type.
		auto &PrivCSArgAA = A.getAAFor<AAPrivatizablePtr>(*this, ACSArgPos);
		Optional<Type *> CSTy = PrivCSArgAA.getPrivatizableType();

		LLVM_DEBUG({
		dbgs() << "[AAPrivatizablePtr] ACSPos: " << ACSArgPos << ", CSTy: ";
		if (CSTy.hasValue() && CSTy.getValue())
		CSTy.getValue()->print(dbgs());
		else if (CSTy.hasValue())
		dbgs() << "<nullptr>";
		else
		dbgs() << "<none>";
		});

		Ty = combineTypes(Ty, CSTy);

		LLVM_DEBUG({
		dbgs() << " : New Type: ";
		if (Ty.hasValue() && Ty.getValue())
		Ty.getValue()->print(dbgs());
		else if (Ty.hasValue())
		dbgs() << "<nullptr>";
		else
		dbgs() << "<none>";
		dbgs() << "\n";
		});

		return !Ty.hasValue() || Ty.getValue();
		};

		if (!A.checkForAllCallSites(CallSiteCheck, *this, true))
		return nullptr;
		return Ty;
		}

		/// See AbstractAttribute::updateImpl(...).
		ChangeStatus updateImpl(Attributor &A) override {
		PrivatizableType = identifyPrivatizableType(A);
		if (!PrivatizableType.hasValue())
		return ChangeStatus::UNCHANGED;
		if (!PrivatizableType.getValue())
		return indicatePessimisticFixpoint();

		// Avoid arguments with padding for now.
		if (!ArgumentPromotionPass::isDenselyPacked(PrivatizableType.getValue(),
		A.getInfoCache().getDL())) {
		LLVM_DEBUG(dbgs() << "[AAPrivatizablePtr] Padding detected\n");
		return indicatePessimisticFixpoint();
		}

		// Verify callee and caller agree on how the promoted argument would be
		// passed.
		// TODO: We should re-implement this and not reuse the ArgumentPromotion
		// version of this.
		Function &Fn = *getIRPosition().getAnchorScope();
		SmallPtrSet<Argument *, 1> ArgsToPromote, Dummy;
		ArgsToPromote.insert(getAssociatedArgument());
		const auto *TTI = A.getInfoCache().getTargetTransformInfoForFunction(Fn);
		if (!TTI ||
		!ArgumentPromotionPass::areFunctionArgsABICompatible(
		Fn, *TTI, ArgsToPromote, Dummy) ||
		ArgsToPromote.empty()) {
		LLVM_DEBUG(dbgs() << "[AAPrivatizablePtr] ABI incompatibility detected\n");
		return indicatePessimisticFixpoint();
		}

		return ChangeStatus::UNCHANGED;
		}

		/// Given a type to privatize \p PrivType, collect the constituents (which are
		/// used) in \p ReplacementTypes.
		static void
		identifyReplacementTypes(Type *PrivType,
		SmallVectorImpl<Type *> &ReplacementTypes) {
		// TODO: For now we expand the privatization type to the fullest which can
		// lead to dead arguments that need to be removed later.
		assert(PrivType && "Expected privatizable type!");

		// Traverse the type, extract constituent types on the outermost level.
		if (auto *PrivStructType = dyn_cast<StructType>(PrivType)) {
		for (unsigned u = 0, e = PrivStructType->getNumElements(); u < e; u++)
		ReplacementTypes.push_back(PrivStructType->getElementType(u));
		} else if (auto *PrivArrayType = dyn_cast<ArrayType>(PrivType)) {
		ReplacementTypes.append(PrivArrayType->getNumElements(),
		PrivArrayType->getElementType());
		} else {
		ReplacementTypes.push_back(PrivType);
		}
		}

		/// Initialize \p Base according to the type \p PrivType at position \p IP.
		/// The values needed are taken from the arguments of \p F starting at
		/// position \p ArgNo.
		static void createInitialization(Type *PrivType, Value &Base, Function &F,
		unsigned ArgNo, Instruction &IP) {
		assert(PrivType && "Expected privatizable type!");

		IRBuilder<NoFolder> IRB(&IP);
		const DataLayout &DL = F.getParent()->getDataLayout();

		// Traverse the type, build GEPs and stores.
		if (auto *PrivStructType = dyn_cast<StructType>(PrivType)) {
		const StructLayout *PrivStructLayout = DL.getStructLayout(PrivStructType);
		for (unsigned u = 0, e = PrivStructType->getNumElements(); u < e; u++) {
		Type *PointeeTy = PrivStructType->getElementType(u)->getPointerTo();
		Value *Ptr = constructPointer(
		PointeeTy, &Base, PrivStructLayout->getElementOffset(u), IRB, DL);
		new StoreInst(F.getArg(ArgNo + u), Ptr, &IP);
		}
		} else if (auto *PrivArrayType = dyn_cast<ArrayType>(PrivType)) {
		Type *PointeePtrTy = PrivArrayType->getElementType()->getPointerTo();
		uint64_t PointeeTySize = DL.getTypeStoreSize(PointeePtrTy);
		for (unsigned u = 0, e = PrivArrayType->getNumElements(); u < e; u++) {
		Value *Ptr =
		constructPointer(PointeePtrTy, &Base, u * PointeeTySize, IRB, DL);
		new StoreInst(F.getArg(ArgNo + u), Ptr, &IP);
		}
		} else {
		new StoreInst(F.getArg(ArgNo), &Base, &IP);
		}
		}

		/// Extract values from \p Base according to the type \p PrivType at the
		/// call position \p ACS. The values are appended to \p ReplacementValues.
		void createReplacementValues(Type *PrivType, AbstractCallSite ACS,
		Value *Base,
		SmallVectorImpl<Value *> &ReplacementValues) {
		assert(Base && "Expected base value!");
		assert(PrivType && "Expected privatizable type!");
		Instruction *IP = ACS.getInstruction();

		IRBuilder<NoFolder> IRB(IP);
		const DataLayout &DL = IP->getModule()->getDataLayout();

		if (Base->getType()->getPointerElementType() != PrivType)
		Base = BitCastInst::CreateBitOrPointerCast(Base, PrivType->getPointerTo(),
		"", ACS.getInstruction());

		// Traverse the type, build GEPs and loads.
		if (auto *PrivStructType = dyn_cast<StructType>(PrivType)) {
		const StructLayout *PrivStructLayout = DL.getStructLayout(PrivStructType);
		for (unsigned u = 0, e = PrivStructType->getNumElements(); u < e; u++) {
		Type *PointeeTy = PrivStructType->getElementType(u);
		Value *Ptr =
		constructPointer(PointeeTy->getPointerTo(), Base,
		PrivStructLayout->getElementOffset(u), IRB, DL);
		ReplacementValues.push_back(new LoadInst(PointeeTy, Ptr, "", IP));
		}
		} else if (auto *PrivArrayType = dyn_cast<ArrayType>(PrivType)) {
		Type *PointeeTy = PrivArrayType->getElementType();
		uint64_t PointeeTySize = DL.getTypeStoreSize(PointeeTy);
		Type *PointeePtrTy = PointeeTy->getPointerTo();
		for (unsigned u = 0, e = PrivArrayType->getNumElements(); u < e; u++) {
		Value *Ptr =
		constructPointer(PointeePtrTy, Base, u * PointeeTySize, IRB, DL);
		ReplacementValues.push_back(new LoadInst(PointeePtrTy, Ptr, "", IP));
		}
		} else {
		ReplacementValues.push_back(new LoadInst(PrivType, Base, "", IP));
		}
		}
		efriedma: No alignment set on loads?

		jdoerfert (author): I'll add the alignment for the alloca below based on the alignment of the pointer it replaces. The loads and stores will be annotated by the next run of the Attributor automatically. We can also consider not emitting it if we can prove it is not needed, though that will not always be the case and it will require an analysis we do not yet have (something like AAAccessTracker). Finally, SROA should, in the good case, eliminate the alloca completely.

		efriedma:
		> The loads and stores will be annotated by the next run of the Attributor automatically.
		The default alignment of a load is the alignment of the load's type, as computed by the datalayout. This might be too high, depending on the pointer.

		jdoerfert (author):
		> The default alignment of a load is the alignment of the load's type, as computed by the datalayout.
		Sure.
		> This might be too high, depending on the pointer.
		How could that be? We create the pointer with a proper type (the alloca) below. Shouldn't the alloca take the default alignment into account when the memory is allocated?

		efriedma: If you create an alloca, then load/store to that pointer, the default alignments will work, yes. But that isn't what's happening here, is it? The alloca is in the callee, and this load is in the caller.

		jdoerfert (author): With this patch we always have an alloca or an argument with some pointer to (struct) type which we only access through proper gep addressing. I don't think this can create an alignment issue. I get that the alloca needs to be aligned with a higher value if the pointer was marked as such, but I already said that will be fixed.

		efriedma: Pointers can be misaligned, generally. For example:

		    define void @f() {
		    entry:
		      %a = alloca i32, align 1
		      call void @g(i32* %a)
		      ret void
		    }
		    define internal void @g(i32* %a) {
		      %aa = load i32, i32* %a, align 1
		      call void @z(aa)
		    }
		    declare void @z(i32)

		As far as I can tell, your patch will introduce a misaligned load into `@f()`. (C generally provides additional guarantees based on the pointee type of a pointer, but there isn't any corresponding rule for IR pointers.)

		jdoerfert (author): I finally understand your concern, sorry that it took so long. I played around a bit to see what we currently do and I found this interesting: https://godbolt.org/z/2q_oqH We basically align the alloca naturally at some point. I would for now just set the alignment to 1 and add a TODO. For these loads, the Attributor can find a better alignment in the next run anyway and this allows me to not amend this patch too much. The TODO will explain the situation and we can work on a better solution from then. Maybe, if it is very simple, I'll directly use the AAAlign logic to get a lower bound instead. Long story short, I'll make sure these loads are properly aligned and we test for this.
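Editorial sketch (not part of the patch or the thread): the point raised above is that a load emitted in the caller may only claim as much alignment as is actually known for the base pointer, adjusted for the element's byte offset. The helper below is a minimal standalone model of that lower bound. `safeLoadAlign` is a hypothetical name, and passing 1 for the known alignment corresponds to the "set the alignment to 1" fallback mentioned in the discussion.

```cpp
#include <cassert>
#include <cstdint>

// Largest power-of-two alignment that is guaranteed for an access at
// Base + FieldOffset when Base is only known to be KnownPtrAlign-aligned.
// KnownPtrAlign must be a power of two; use 1 if nothing is known.
inline uint64_t safeLoadAlign(uint64_t KnownPtrAlign, uint64_t FieldOffset) {
  assert(KnownPtrAlign != 0 &&
         (KnownPtrAlign & (KnownPtrAlign - 1)) == 0 &&
         "Expected a power-of-two alignment!");
  uint64_t Align = KnownPtrAlign;
  // Halve the alignment until it also divides the offset. This terminates
  // at 1 at the latest, since every offset is a multiple of 1.
  while (FieldOffset % Align != 0)
    Align >>= 1;
  return Align;
}
```

In the example from the thread, the alloca in `@f` has `align 1`, so the bound for the promoted load is 1 regardless of the natural alignment of `i32`.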

		/// See AbstractAttribute::manifest(...)
		ChangeStatus manifest(Attributor &A) override {
		if (!PrivatizableType.hasValue())
		return ChangeStatus::UNCHANGED;
		assert(PrivatizableType.getValue() && "Expected privatizable type!");

		// Collect all tail calls in the function as we cannot allow new allocas to
		// escape into tail recursion.
		// TODO: Be smarter about new allocas escaping into tail calls.
		SmallVector<CallInst *, 16> TailCalls;
		if (!A.checkForAllInstructions(
		[&](Instruction &I) {
		CallInst &CI = cast<CallInst>(I);
		if (CI.isTailCall())
		TailCalls.push_back(&CI);
		return true;
		},
		*this, {Instruction::Call}))
		return ChangeStatus::UNCHANGED;

		Argument *Arg = getAssociatedArgument();

		// Callback to repair the associated function. A new alloca is placed at the
		// beginning and initialized with the values passed through arguments. The
		// new alloca replaces the use of the old pointer argument.
		Attributor::ArgumentReplacementInfo::CalleeRepairCBTy FnRepairCB =
		[=](const Attributor::ArgumentReplacementInfo &ARI,
		Function &ReplacementFn, Function::arg_iterator ArgIt) {
		BasicBlock &EntryBB = ReplacementFn.getEntryBlock();
		Instruction *IP = &*EntryBB.getFirstInsertionPt();
		auto *AI = new AllocaInst(PrivatizableType.getValue(), 0,
		Arg->getName() + ".priv", IP);
		createInitialization(PrivatizableType.getValue(), *AI, ReplacementFn,
		ArgIt->getArgNo(), *IP);
		efriedma: No alignment set on alloca?
		Arg->replaceAllUsesWith(AI);

		for (CallInst *CI : TailCalls)
		CI->setTailCall(false);
		};

		// Callback to repair a call site of the associated function. The elements
		// of the privatizable type are loaded prior to the call and passed to the
		// new function version.
		Attributor::ArgumentReplacementInfo::ACSRepairCBTy ACSRepairCB =
		[=](const Attributor::ArgumentReplacementInfo &ARI,
		AbstractCallSite ACS, SmallVectorImpl<Value *> &NewArgOperands) {
		createReplacementValues(
		PrivatizableType.getValue(), ACS,
		ACS.getCallArgOperand(ARI.getReplacedArg().getArgNo()),
		NewArgOperands);
		};

		// Collect the types that will replace the privatizable type in the function
		// signature.
		SmallVector<Type *, 16> ReplacementTypes;
		identifyReplacementTypes(PrivatizableType.getValue(), ReplacementTypes);

		// Register a rewrite of the argument.
		if (A.registerFunctionSignatureRewrite(
		*Arg, ReplacementTypes, std::move(FnRepairCB), std::move(ACSRepairCB)))
		return ChangeStatus::CHANGED;
		return ChangeStatus::UNCHANGED;
		}

		/// See AbstractAttribute::trackStatistics()
		void trackStatistics() const override {
		STATS_DECLTRACK_ARG_ATTR(privatizable_ptr);
		}
		};

		struct AAPrivatizablePtrFloating : public AAPrivatizablePtrImpl {
		AAPrivatizablePtrFloating(const IRPosition &IRP)
		: AAPrivatizablePtrImpl(IRP) {}

		/// See AbstractAttribute::initialize(...).
		virtual void initialize(Attributor &A) override {
		// TODO: We can privatize more than arguments.
		indicatePessimisticFixpoint();
		}

		ChangeStatus updateImpl(Attributor &A) override {
		llvm_unreachable("AAPrivatizablePtr(Floating|Returned|CallSiteReturned)::"
		"updateImpl will not be called");
		}

		/// See AAPrivatizablePtrImpl::identifyPrivatizableType(...)
		Optional<Type *> identifyPrivatizableType(Attributor &A) override {
		Value *Obj =
		GetUnderlyingObject(&getAssociatedValue(), A.getInfoCache().getDL());
		if (!Obj) {
		LLVM_DEBUG(dbgs() << "[AAPrivatizablePtr] No underlying object found!\n");
		return nullptr;
		}

		if (auto *AI = dyn_cast<AllocaInst>(Obj))
		if (auto *CI = dyn_cast<ConstantInt>(AI->getArraySize()))
		if (CI->isOne())
		return Obj->getType()->getPointerElementType();
		efriedma: isArrayAllocation()
		jdoerfert (author): Will do.
		if (auto *Arg = dyn_cast<Argument>(Obj)) {
		auto &PrivArgAA =
		A.getAAFor<AAPrivatizablePtr>(*this, IRPosition::argument(*Arg));
		if (PrivArgAA.isAssumedPrivatizablePtr())
		return Obj->getType()->getPointerElementType();
		}

		LLVM_DEBUG(dbgs() << "[AAPrivatizablePtr] Underlying object neither valid "
		"alloca nor privatizable argument: "
		<< *Obj << "!\n");
		return nullptr;
		}

		/// See AbstractAttribute::trackStatistics()
		void trackStatistics() const override {
		STATS_DECLTRACK_FLOATING_ATTR(privatizable_ptr);
		}
		};

		struct AAPrivatizablePtrCallSiteArgument final
		: public AAPrivatizablePtrFloating {
		AAPrivatizablePtrCallSiteArgument(const IRPosition &IRP)
		: AAPrivatizablePtrFloating(IRP) {}

		/// See AbstractAttribute::initialize(...).
		void initialize(Attributor &A) override {}

		/// See AbstractAttribute::updateImpl(...).
		ChangeStatus updateImpl(Attributor &A) override {
		PrivatizableType = identifyPrivatizableType(A);
		if (!PrivatizableType.hasValue())
		return ChangeStatus::UNCHANGED;
		if (!PrivatizableType.getValue())
		return indicatePessimisticFixpoint();

		const IRPosition &IRP = getIRPosition();
		auto &NoCaptureAA = A.getAAFor<AANoCapture>(*this, IRP);
		if (!NoCaptureAA.isAssumedNoCapture()) {
		LLVM_DEBUG(dbgs() << "[AAPrivatizablePtr] pointer might be captured!\n");
		return indicatePessimisticFixpoint();
		}

		auto &NoAliasAA = A.getAAFor<AANoAlias>(*this, IRP);
		if (!NoAliasAA.isAssumedNoAlias()) {
		LLVM_DEBUG(dbgs() << "[AAPrivatizablePtr] pointer might alias!\n");
		return indicatePessimisticFixpoint();
		}

		const auto &MemBehaviorAA = A.getAAFor<AAMemoryBehavior>(*this, IRP);
		if (!MemBehaviorAA.isAssumedReadOnly()) {
		LLVM_DEBUG(dbgs() << "[AAPrivatizablePtr] pointer is written!\n");
		return indicatePessimisticFixpoint();
		}

		return ChangeStatus::UNCHANGED;
		}

		/// See AbstractAttribute::trackStatistics()
		void trackStatistics() const override {
		STATS_DECLTRACK_CSARG_ATTR(privatizable_ptr);
		}
		};

		struct AAPrivatizablePtrCallSiteReturned final
		: public AAPrivatizablePtrFloating {
		AAPrivatizablePtrCallSiteReturned(const IRPosition &IRP)
		: AAPrivatizablePtrFloating(IRP) {}

		/// See AbstractAttribute::initialize(...).
		void initialize(Attributor &A) override {
		// TODO: We can privatize more than arguments.
		indicatePessimisticFixpoint();
		}

		/// See AbstractAttribute::trackStatistics()
		void trackStatistics() const override {
		STATS_DECLTRACK_CSRET_ATTR(privatizable_ptr);
		}
		};

		struct AAPrivatizablePtrReturned final : public AAPrivatizablePtrFloating {
		AAPrivatizablePtrReturned(const IRPosition &IRP)
		: AAPrivatizablePtrFloating(IRP) {}

		/// See AbstractAttribute::initialize(...).
		void initialize(Attributor &A) override {
		// TODO: We can privatize more than arguments.
		indicatePessimisticFixpoint();
		}

		/// See AbstractAttribute::trackStatistics()
		void trackStatistics() const override {
		STATS_DECLTRACK_FNRET_ATTR(privatizable_ptr);
		}
		};

/// -------------------- Memory Behavior Attributes ----------------------------		/// -------------------- Memory Behavior Attributes ----------------------------
/// Includes read-none, read-only, and write-only.		/// Includes read-none, read-only, and write-only.
/// ----------------------------------------------------------------------------		/// ----------------------------------------------------------------------------
struct AAMemoryBehaviorImpl : public AAMemoryBehavior {		struct AAMemoryBehaviorImpl : public AAMemoryBehavior {
AAMemoryBehaviorImpl(const IRPosition &IRP) : AAMemoryBehavior(IRP) {}		AAMemoryBehaviorImpl(const IRPosition &IRP) : AAMemoryBehavior(IRP) {}

/// See AbstractAttribute::initialize(...).		/// See AbstractAttribute::initialize(...).
void initialize(Attributor &A) override {		void initialize(Attributor &A) override {
[1,384 lines collapsed]
		if (Arg.getType()->isPointerTy()) {
getOrCreateAAFor<AAAlign>(ArgPos);		getOrCreateAAFor<AAAlign>(ArgPos);

// Every argument with pointer type might be marked nocapture.		// Every argument with pointer type might be marked nocapture.
getOrCreateAAFor<AANoCapture>(ArgPos);		getOrCreateAAFor<AANoCapture>(ArgPos);

// Every argument with pointer type might be marked		// Every argument with pointer type might be marked
// "readnone/readonly/writeonly/..."		// "readnone/readonly/writeonly/..."
getOrCreateAAFor<AAMemoryBehavior>(ArgPos);		getOrCreateAAFor<AAMemoryBehavior>(ArgPos);

		// Every argument with pointer type might be privatizable (or promotable)
		getOrCreateAAFor<AAPrivatizablePtr>(ArgPos);
}		}
}		}

auto CallSitePred = [&](Instruction &I) -> bool {		auto CallSitePred = [&](Instruction &I) -> bool {
CallSite CS(&I);		CallSite CS(&I);
if (Function *Callee = CS.getCalledFunction()) {		if (Function *Callee = CS.getCalledFunction()) {
if (!Callee->getReturnType()->isVoidTy()) {		if (!Callee->getReturnType()->isVoidTy()) {
IRPosition CSRetPos = IRPosition::callsite_returned(CS);		IRPosition CSRetPos = IRPosition::callsite_returned(CS);
[205 lines collapsed]
const char AANoAlias::ID = 0;		const char AANoAlias::ID = 0;
const char AANoReturn::ID = 0;		const char AANoReturn::ID = 0;
const char AAIsDead::ID = 0;		const char AAIsDead::ID = 0;
const char AADereferenceable::ID = 0;		const char AADereferenceable::ID = 0;
const char AAAlign::ID = 0;		const char AAAlign::ID = 0;
const char AANoCapture::ID = 0;		const char AANoCapture::ID = 0;
const char AAValueSimplify::ID = 0;		const char AAValueSimplify::ID = 0;
const char AAHeapToStack::ID = 0;		const char AAHeapToStack::ID = 0;
		const char AAPrivatizablePtr::ID = 0;
const char AAMemoryBehavior::ID = 0;		const char AAMemoryBehavior::ID = 0;

// Macro magic to create the static generator function for attributes that		// Macro magic to create the static generator function for attributes that
// follow the naming scheme.		// follow the naming scheme.

#define SWITCH_PK_INV(CLASS, PK, POS_NAME) \		#define SWITCH_PK_INV(CLASS, PK, POS_NAME) \
case IRPosition::PK: \		case IRPosition::PK: \
llvm_unreachable("Cannot create " #CLASS " for a " POS_NAME " position!");		llvm_unreachable("Cannot create " #CLASS " for a " POS_NAME " position!");
[88 lines collapsed]
CREATE_FUNCTION_ABSTRACT_ATTRIBUTE_FOR_POSITION(AANoFree)		CREATE_FUNCTION_ABSTRACT_ATTRIBUTE_FOR_POSITION(AANoFree)
CREATE_FUNCTION_ABSTRACT_ATTRIBUTE_FOR_POSITION(AANoRecurse)		CREATE_FUNCTION_ABSTRACT_ATTRIBUTE_FOR_POSITION(AANoRecurse)
CREATE_FUNCTION_ABSTRACT_ATTRIBUTE_FOR_POSITION(AAWillReturn)		CREATE_FUNCTION_ABSTRACT_ATTRIBUTE_FOR_POSITION(AAWillReturn)
CREATE_FUNCTION_ABSTRACT_ATTRIBUTE_FOR_POSITION(AANoReturn)		CREATE_FUNCTION_ABSTRACT_ATTRIBUTE_FOR_POSITION(AANoReturn)
CREATE_FUNCTION_ABSTRACT_ATTRIBUTE_FOR_POSITION(AAReturnedValues)		CREATE_FUNCTION_ABSTRACT_ATTRIBUTE_FOR_POSITION(AAReturnedValues)

CREATE_VALUE_ABSTRACT_ATTRIBUTE_FOR_POSITION(AANonNull)		CREATE_VALUE_ABSTRACT_ATTRIBUTE_FOR_POSITION(AANonNull)
CREATE_VALUE_ABSTRACT_ATTRIBUTE_FOR_POSITION(AANoAlias)		CREATE_VALUE_ABSTRACT_ATTRIBUTE_FOR_POSITION(AANoAlias)
		CREATE_VALUE_ABSTRACT_ATTRIBUTE_FOR_POSITION(AAPrivatizablePtr)
CREATE_VALUE_ABSTRACT_ATTRIBUTE_FOR_POSITION(AADereferenceable)		CREATE_VALUE_ABSTRACT_ATTRIBUTE_FOR_POSITION(AADereferenceable)
CREATE_VALUE_ABSTRACT_ATTRIBUTE_FOR_POSITION(AAAlign)		CREATE_VALUE_ABSTRACT_ATTRIBUTE_FOR_POSITION(AAAlign)
CREATE_VALUE_ABSTRACT_ATTRIBUTE_FOR_POSITION(AANoCapture)		CREATE_VALUE_ABSTRACT_ATTRIBUTE_FOR_POSITION(AANoCapture)

CREATE_ALL_ABSTRACT_ATTRIBUTE_FOR_POSITION(AAValueSimplify)		CREATE_ALL_ABSTRACT_ATTRIBUTE_FOR_POSITION(AAValueSimplify)
CREATE_ALL_ABSTRACT_ATTRIBUTE_FOR_POSITION(AAIsDead)		CREATE_ALL_ABSTRACT_ATTRIBUTE_FOR_POSITION(AAIsDead)

CREATE_FUNCTION_ONLY_ABSTRACT_ATTRIBUTE_FOR_POSITION(AAHeapToStack)		CREATE_FUNCTION_ONLY_ABSTRACT_ATTRIBUTE_FOR_POSITION(AAHeapToStack)
[16 lines collapsed]

llvm/test/Transforms/ArgumentPromotion/2008-02-01-ReturnAttrs.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -argpromotion -S | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt < %s -argpromotion -mem2reg -S | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
				; RUN: opt -S -passes='attributor,function(mem2reg)' -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=3 < %s | FileCheck %s --check-prefixes=ALL,ATTRIBUTOR

	define internal i32 @deref(i32* %x) nounwind {			define internal i32 @deref(i32* %x) nounwind {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@deref			; ALL-LABEL: define {{[^@]+}}@deref
	; ARGPROMOTION-SAME: (i32 [[X_VAL:%.*]])			; ALL-SAME: (i32 [[TMP0:%.*]])
	; ARGPROMOTION-NEXT: entry:			; ALL-NEXT: entry:
	; ARGPROMOTION-NEXT: ret i32 [[X_VAL]]			; ALL-NEXT: ret i32 [[TMP0]]
	;			;
	entry:			entry:
	%tmp2 = load i32, i32* %x, align 4			%tmp2 = load i32, i32* %x, align 4
	ret i32 %tmp2			ret i32 %tmp2
	}			}

	define i32 @f(i32 %x) {			define i32 @f(i32 %x) {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@f			; ALL-LABEL: define {{[^@]+}}@f
	; ARGPROMOTION-SAME: (i32 [[X:%.*]])			; ALL-SAME: (i32 [[X:%.*]])
	; ARGPROMOTION-NEXT: entry:			; ALL-NEXT: entry:
	; ARGPROMOTION-NEXT: [[X_ADDR:%.*]] = alloca i32			; ALL-NEXT: [[TMP1:%.*]] = call i32 @deref(i32 [[X]])
	; ARGPROMOTION-NEXT: store i32 [[X]], i32* [[X_ADDR]], align 4			; ALL-NEXT: ret i32 [[TMP1]]
	; ARGPROMOTION-NEXT: [[X_ADDR_VAL:%.*]] = load i32, i32* [[X_ADDR]], align 4
	; ARGPROMOTION-NEXT: [[TMP1:%.*]] = call i32 @deref(i32 [[X_ADDR_VAL]])
	; ARGPROMOTION-NEXT: ret i32 [[TMP1]]
	;			;
	entry:			entry:
	%x_addr = alloca i32			%x_addr = alloca i32
	store i32 %x, i32* %x_addr, align 4			store i32 %x, i32* %x_addr, align 4
	%tmp1 = call i32 @deref( i32* %x_addr ) nounwind			%tmp1 = call i32 @deref( i32* %x_addr ) nounwind
	ret i32 %tmp1			ret i32 %tmp1
	}			}

llvm/test/Transforms/ArgumentPromotion/2008-07-02-array-indexing.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -argpromotion -S | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt < %s -argpromotion -S | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
				; RUN: opt -S -passes=attributor -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=5 < %s | FileCheck %s --check-prefixes=ALL,ATTRIBUTOR
	; PR2498			; PR2498

	; This test tries to convince argpromotion about promoting the load from %A + 2,			; This test tries to convince argpromotion about promoting the load from %A + 2,
	; because there is a load of %A in the entry block			; because there is a load of %A in the entry block
	define internal i32 @callee(i1 %C, i32* %A) {			define internal i32 @callee(i1 %C, i32* %A) {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@callee			; ARGPROMOTION-LABEL: define {{[^@]+}}@callee
	; ARGPROMOTION-SAME: (i1 [[C:%.*]], i32* [[A:%.*]])			; ARGPROMOTION-SAME: (i1 [[C:%.*]], i32* [[A:%.*]])
	; ARGPROMOTION-NEXT: entry:			; ARGPROMOTION-NEXT: entry:
	; ARGPROMOTION-NEXT: [[A_0:%.*]] = load i32, i32* [[A]]			; ARGPROMOTION-NEXT: [[A_0:%.*]] = load i32, i32* [[A]]
	; ARGPROMOTION-NEXT: br i1 [[C]], label [[T:%.*]], label [[F:%.*]]			; ARGPROMOTION-NEXT: br i1 [[C]], label [[T:%.*]], label [[F:%.*]]
	; ARGPROMOTION: T:			; ARGPROMOTION: T:
	; ARGPROMOTION-NEXT: ret i32 [[A_0]]			; ARGPROMOTION-NEXT: ret i32 [[A_0]]
	; ARGPROMOTION: F:			; ARGPROMOTION: F:
	; ARGPROMOTION-NEXT: [[A_2:%.*]] = getelementptr i32, i32* [[A]], i32 2			; ARGPROMOTION-NEXT: [[A_2:%.*]] = getelementptr i32, i32* [[A]], i32 2
	; ARGPROMOTION-NEXT: [[R:%.*]] = load i32, i32* [[A_2]]			; ARGPROMOTION-NEXT: [[R:%.*]] = load i32, i32* [[A_2]]
	; ARGPROMOTION-NEXT: ret i32 [[R]]			; ARGPROMOTION-NEXT: ret i32 [[R]]
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@callee
				; ATTRIBUTOR-SAME: (i1 [[C:%.*]], i32* nocapture nonnull readonly dereferenceable(4) [[A:%.*]])
				; ATTRIBUTOR-NEXT: entry:
				; ATTRIBUTOR-NEXT: [[A_0:%.*]] = load i32, i32* [[A]]
				; ATTRIBUTOR-NEXT: br i1 [[C]], label [[T:%.*]], label [[F:%.*]]
				; ATTRIBUTOR: T:
				; ATTRIBUTOR-NEXT: ret i32 [[A_0]]
				; ATTRIBUTOR: F:
				; ATTRIBUTOR-NEXT: [[A_2:%.*]] = getelementptr i32, i32* [[A]], i32 2
				; ATTRIBUTOR-NEXT: [[R:%.*]] = load i32, i32* [[A_2]]
				; ATTRIBUTOR-NEXT: ret i32 [[R]]
				;
	entry:			entry:
	; Unconditionally load the element at %A			; Unconditionally load the element at %A
	%A.0 = load i32, i32* %A			%A.0 = load i32, i32* %A
	br i1 %C, label %T, label %F			br i1 %C, label %T, label %F

	T:			T:
	ret i32 %A.0			ret i32 %A.0

	F:			F:
	; Load the element at offset two from %A. This should not be promoted!			; Load the element at offset two from %A. This should not be promoted!
	%A.2 = getelementptr i32, i32* %A, i32 2			%A.2 = getelementptr i32, i32* %A, i32 2
	%R = load i32, i32* %A.2			%R = load i32, i32* %A.2
	ret i32 %R			ret i32 %R
	}			}

	define i32 @foo() {			define i32 @foo(i1 %c, i32* %A) {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@foo()			; ARGPROMOTION-LABEL: define {{[^@]+}}@foo
	; ARGPROMOTION-NEXT: [[X:%.*]] = call i32 @callee(i1 false, i32* null)			; ARGPROMOTION-SAME: (i1 [[C:%.*]], i32* [[A:%.*]])
				; ARGPROMOTION-NEXT: [[X:%.*]] = call i32 @callee(i1 [[C]], i32* [[A]])
	; ARGPROMOTION-NEXT: ret i32 [[X]]			; ARGPROMOTION-NEXT: ret i32 [[X]]
	;			;
	%X = call i32 @callee(i1 false, i32* null) ; <i32> [#uses=1]			; ATTRIBUTOR-LABEL: define {{[^@]+}}@foo
				; ATTRIBUTOR-SAME: (i1 [[C:%.*]], i32* nocapture readonly [[A:%.*]])
				; ATTRIBUTOR-NEXT: [[X:%.*]] = call i32 @callee(i1 [[C]], i32* nocapture readonly [[A]])
				; ATTRIBUTOR-NEXT: ret i32 [[X]]
				;
				%X = call i32 @callee(i1 %c, i32* %A) ; <i32> [#uses=1]
	ret i32 %X			ret i32 %X
	}			}

llvm/test/Transforms/ArgumentPromotion/2008-09-07-CGUpdate.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -inline -argpromotion -disable-output			; RUN: opt < %s -inline -argpromotion -disable-output
				; RUN: opt -disable-output -passes=attributor -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=3 < %s

	define internal fastcc i32 @hash(i32* %ts, i32 %mod) nounwind {			define internal fastcc i32 @hash(i32* %ts, i32 %mod) nounwind {
	entry:			entry:
	unreachable			unreachable
	}			}

	define void @encode(i32* %m, i32* %ts, i32* %new) nounwind {			define void @encode(i32* %m, i32* %ts, i32* %new) nounwind {
	entry:			entry:
	%0 = call fastcc i32 @hash( i32* %ts, i32 0 ) nounwind ; <i32> [#uses=0]			%0 = call fastcc i32 @hash( i32* %ts, i32 0 ) nounwind ; <i32> [#uses=0]
	unreachable			unreachable
	}			}

llvm/test/Transforms/ArgumentPromotion/2008-09-08-CGUpdateSelfEdge.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -argpromotion -disable-output			; RUN: opt < %s -argpromotion -disable-output
				; RUN: opt -disable-output -passes=attributor -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=2 < %s

	define internal fastcc i32 @term_SharingList(i32* %Term, i32* %List) nounwind {			define internal fastcc i32 @term_SharingList(i32* %Term, i32* %List) nounwind {
	entry:			entry:
	br i1 false, label %bb, label %bb5			br i1 false, label %bb, label %bb5

	bb: ; preds = %entry			bb: ; preds = %entry
	%0 = call fastcc i32 @term_SharingList( i32* null, i32* %List ) nounwind ; <i32> [#uses=0]			%0 = call fastcc i32 @term_SharingList( i32* null, i32* %List ) nounwind ; <i32> [#uses=0]
	unreachable			unreachable
	Show All 16 Lines

llvm/test/Transforms/ArgumentPromotion/X86/attributes.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt -S -argpromotion < %s | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt -S -argpromotion -mem2reg < %s | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
	; RUN: opt -S -passes=argpromotion < %s | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt -S -passes='argpromotion,function(mem2reg)' < %s | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
				; RUN: opt -S -passes='attributor,function(mem2reg)' -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=3 < %s | FileCheck %s --check-prefixes=ALL,ATTRIBUTOR
	; Test that we only promote arguments when the caller/callee have compatible			; Test that we only promote arguments when the caller/callee have compatible
	; function attributes.			; function attributes.

	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	define internal fastcc void @no_promote_avx2(<4 x i64>* %arg, <4 x i64>* readonly %arg1) #0 {			define internal fastcc void @no_promote_avx2(<4 x i64>* %arg, <4 x i64>* readonly %arg1) #0 {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@no_promote_avx2			; ARGPROMOTION-LABEL: define {{[^@]+}}@no_promote_avx2
	; ARGPROMOTION-SAME: (<4 x i64>* [[ARG:%.*]], <4 x i64>* readonly [[ARG1:%.*]])			; ARGPROMOTION-SAME: (<4 x i64>* [[ARG:%.*]], <4 x i64>* readonly [[ARG1:%.*]])
	; ARGPROMOTION-NEXT: bb:			; ARGPROMOTION-NEXT: bb:
	; ARGPROMOTION-NEXT: [[TMP:%.*]] = load <4 x i64>, <4 x i64>* [[ARG1]]			; ARGPROMOTION-NEXT: [[TMP:%.*]] = load <4 x i64>, <4 x i64>* [[ARG1]]
	; ARGPROMOTION-NEXT: store <4 x i64> [[TMP]], <4 x i64>* [[ARG]]			; ARGPROMOTION-NEXT: store <4 x i64> [[TMP]], <4 x i64>* [[ARG]]
	; ARGPROMOTION-NEXT: ret void			; ARGPROMOTION-NEXT: ret void
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@no_promote_avx2
				; ATTRIBUTOR-SAME: (<4 x i64>* noalias nocapture nonnull writeonly align 32 dereferenceable(32) [[ARG:%.*]], <4 x i64>* noalias nocapture nonnull readonly align 32 dereferenceable(32) [[ARG1:%.*]])
				; ATTRIBUTOR-NEXT: bb:
				; ATTRIBUTOR-NEXT: [[TMP:%.*]] = load <4 x i64>, <4 x i64>* [[ARG1]], align 32
				; ATTRIBUTOR-NEXT: store <4 x i64> [[TMP]], <4 x i64>* [[ARG]], align 32
				; ATTRIBUTOR-NEXT: ret void
				;
	bb:			bb:
	%tmp = load <4 x i64>, <4 x i64>* %arg1			%tmp = load <4 x i64>, <4 x i64>* %arg1
	store <4 x i64> %tmp, <4 x i64>* %arg			store <4 x i64> %tmp, <4 x i64>* %arg
	ret void			ret void
	}			}

	define void @no_promote(<4 x i64>* %arg) #1 {			define void @no_promote(<4 x i64>* %arg) #1 {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@no_promote			; ARGPROMOTION-LABEL: define {{[^@]+}}@no_promote
	; ARGPROMOTION-SAME: (<4 x i64>* [[ARG:%.*]])			; ARGPROMOTION-SAME: (<4 x i64>* [[ARG:%.*]])
	; ARGPROMOTION-NEXT: bb:			; ARGPROMOTION-NEXT: bb:
	; ARGPROMOTION-NEXT: [[TMP:%.*]] = alloca <4 x i64>, align 32			; ARGPROMOTION-NEXT: [[TMP:%.*]] = alloca <4 x i64>, align 32
	; ARGPROMOTION-NEXT: [[TMP2:%.*]] = alloca <4 x i64>, align 32			; ARGPROMOTION-NEXT: [[TMP2:%.*]] = alloca <4 x i64>, align 32
	; ARGPROMOTION-NEXT: [[TMP3:%.*]] = bitcast <4 x i64>* [[TMP]] to i8*			; ARGPROMOTION-NEXT: [[TMP3:%.*]] = bitcast <4 x i64>* [[TMP]] to i8*
	; ARGPROMOTION-NEXT: call void @llvm.memset.p0i8.i64(i8* align 32 [[TMP3]], i8 0, i64 32, i1 false)			; ARGPROMOTION-NEXT: call void @llvm.memset.p0i8.i64(i8* align 32 [[TMP3]], i8 0, i64 32, i1 false)
	; ARGPROMOTION-NEXT: call fastcc void @no_promote_avx2(<4 x i64>* [[TMP2]], <4 x i64>* [[TMP]])			; ARGPROMOTION-NEXT: call fastcc void @no_promote_avx2(<4 x i64>* [[TMP2]], <4 x i64>* [[TMP]])
	; ARGPROMOTION-NEXT: [[TMP4:%.*]] = load <4 x i64>, <4 x i64>* [[TMP2]], align 32			; ARGPROMOTION-NEXT: [[TMP4:%.*]] = load <4 x i64>, <4 x i64>* [[TMP2]], align 32
	; ARGPROMOTION-NEXT: store <4 x i64> [[TMP4]], <4 x i64>* [[ARG]], align 2			; ARGPROMOTION-NEXT: store <4 x i64> [[TMP4]], <4 x i64>* [[ARG]], align 2
	; ARGPROMOTION-NEXT: ret void			; ARGPROMOTION-NEXT: ret void
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@no_promote
				; ATTRIBUTOR-SAME: (<4 x i64>* nocapture writeonly [[ARG:%.*]])
				; ATTRIBUTOR-NEXT: bb:
				; ATTRIBUTOR-NEXT: [[TMP:%.*]] = alloca <4 x i64>, align 32
				; ATTRIBUTOR-NEXT: [[TMP2:%.*]] = alloca <4 x i64>, align 32
				; ATTRIBUTOR-NEXT: [[TMP3:%.*]] = bitcast <4 x i64>* [[TMP]] to i8*
				; ATTRIBUTOR-NEXT: call void @llvm.memset.p0i8.i64(i8* nonnull align 32 dereferenceable(32) [[TMP3]], i8 0, i64 32, i1 false)
				; ATTRIBUTOR-NEXT: call fastcc void @no_promote_avx2(<4 x i64>* noalias nocapture nonnull writeonly align 32 dereferenceable(32) [[TMP2]], <4 x i64>* noalias nocapture nonnull readonly align 32 dereferenceable(32) [[TMP]])
				; ATTRIBUTOR-NEXT: [[TMP4:%.*]] = load <4 x i64>, <4 x i64>* [[TMP2]], align 32
				; ATTRIBUTOR-NEXT: store <4 x i64> [[TMP4]], <4 x i64>* [[ARG]], align 2
				; ATTRIBUTOR-NEXT: ret void
				;
	bb:			bb:
	%tmp = alloca <4 x i64>, align 32			%tmp = alloca <4 x i64>, align 32
	%tmp2 = alloca <4 x i64>, align 32			%tmp2 = alloca <4 x i64>, align 32
	%tmp3 = bitcast <4 x i64>* %tmp to i8*			%tmp3 = bitcast <4 x i64>* %tmp to i8*
	call void @llvm.memset.p0i8.i64(i8* align 32 %tmp3, i8 0, i64 32, i1 false)			call void @llvm.memset.p0i8.i64(i8* align 32 %tmp3, i8 0, i64 32, i1 false)
	call fastcc void @no_promote_avx2(<4 x i64>* %tmp2, <4 x i64>* %tmp)			call fastcc void @no_promote_avx2(<4 x i64>* %tmp2, <4 x i64>* %tmp)
	%tmp4 = load <4 x i64>, <4 x i64>* %tmp2, align 32			%tmp4 = load <4 x i64>, <4 x i64>* %tmp2, align 32
	store <4 x i64> %tmp4, <4 x i64>* %arg, align 2			store <4 x i64> %tmp4, <4 x i64>* %arg, align 2
	ret void			ret void
	}			}

	define internal fastcc void @promote_avx2(<4 x i64>* %arg, <4 x i64>* readonly %arg1) #0 {			define internal fastcc void @promote_avx2(<4 x i64>* %arg, <4 x i64>* readonly %arg1) #0 {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@promote_avx2			; ARGPROMOTION-LABEL: define {{[^@]+}}@promote_avx2
	; ARGPROMOTION-SAME: (<4 x i64>* [[ARG:%.*]], <4 x i64> [[ARG1_VAL:%.*]])			; ARGPROMOTION-SAME: (<4 x i64>* [[ARG:%.*]], <4 x i64> [[ARG1_VAL:%.*]])
	; ARGPROMOTION-NEXT: bb:			; ARGPROMOTION-NEXT: bb:
	; ARGPROMOTION-NEXT: store <4 x i64> [[ARG1_VAL]], <4 x i64>* [[ARG]]			; ARGPROMOTION-NEXT: store <4 x i64> [[ARG1_VAL]], <4 x i64>* [[ARG]]
	; ARGPROMOTION-NEXT: ret void			; ARGPROMOTION-NEXT: ret void
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@promote_avx2
				; ATTRIBUTOR-SAME: (<4 x i64>* noalias nocapture nonnull writeonly align 32 dereferenceable(32) [[ARG:%.*]], <4 x i64> [[TMP0:%.*]])
				; ATTRIBUTOR-NEXT: bb:
				; ATTRIBUTOR-NEXT: store <4 x i64> [[TMP0]], <4 x i64>* [[ARG]], align 32
				; ATTRIBUTOR-NEXT: ret void
				;
	bb:			bb:
	%tmp = load <4 x i64>, <4 x i64>* %arg1			%tmp = load <4 x i64>, <4 x i64>* %arg1
	store <4 x i64> %tmp, <4 x i64>* %arg			store <4 x i64> %tmp, <4 x i64>* %arg
	ret void			ret void
	}			}

	define void @promote(<4 x i64>* %arg) #0 {			define void @promote(<4 x i64>* %arg) #0 {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@promote			; ARGPROMOTION-LABEL: define {{[^@]+}}@promote
	; ARGPROMOTION-SAME: (<4 x i64>* [[ARG:%.*]])			; ARGPROMOTION-SAME: (<4 x i64>* [[ARG:%.*]])
	; ARGPROMOTION-NEXT: bb:			; ARGPROMOTION-NEXT: bb:
	; ARGPROMOTION-NEXT: [[TMP:%.*]] = alloca <4 x i64>, align 32			; ARGPROMOTION-NEXT: [[TMP:%.*]] = alloca <4 x i64>, align 32
	; ARGPROMOTION-NEXT: [[TMP2:%.*]] = alloca <4 x i64>, align 32			; ARGPROMOTION-NEXT: [[TMP2:%.*]] = alloca <4 x i64>, align 32
	; ARGPROMOTION-NEXT: [[TMP3:%.*]] = bitcast <4 x i64>* [[TMP]] to i8*			; ARGPROMOTION-NEXT: [[TMP3:%.*]] = bitcast <4 x i64>* [[TMP]] to i8*
	; ARGPROMOTION-NEXT: call void @llvm.memset.p0i8.i64(i8* align 32 [[TMP3]], i8 0, i64 32, i1 false)			; ARGPROMOTION-NEXT: call void @llvm.memset.p0i8.i64(i8* align 32 [[TMP3]], i8 0, i64 32, i1 false)
	; ARGPROMOTION-NEXT: [[TMP_VAL:%.*]] = load <4 x i64>, <4 x i64>* [[TMP]]			; ARGPROMOTION-NEXT: [[TMP_VAL:%.*]] = load <4 x i64>, <4 x i64>* [[TMP]]
	; ARGPROMOTION-NEXT: call fastcc void @promote_avx2(<4 x i64>* [[TMP2]], <4 x i64> [[TMP_VAL]])			; ARGPROMOTION-NEXT: call fastcc void @promote_avx2(<4 x i64>* [[TMP2]], <4 x i64> [[TMP_VAL]])
	; ARGPROMOTION-NEXT: [[TMP4:%.*]] = load <4 x i64>, <4 x i64>* [[TMP2]], align 32			; ARGPROMOTION-NEXT: [[TMP4:%.*]] = load <4 x i64>, <4 x i64>* [[TMP2]], align 32
	; ARGPROMOTION-NEXT: store <4 x i64> [[TMP4]], <4 x i64>* [[ARG]], align 2			; ARGPROMOTION-NEXT: store <4 x i64> [[TMP4]], <4 x i64>* [[ARG]], align 2
	; ARGPROMOTION-NEXT: ret void			; ARGPROMOTION-NEXT: ret void
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@promote
				; ATTRIBUTOR-SAME: (<4 x i64>* nocapture writeonly [[ARG:%.*]])
				; ATTRIBUTOR-NEXT: bb:
				; ATTRIBUTOR-NEXT: [[TMP:%.*]] = alloca <4 x i64>, align 32
				; ATTRIBUTOR-NEXT: [[TMP2:%.*]] = alloca <4 x i64>, align 32
				; ATTRIBUTOR-NEXT: [[TMP3:%.*]] = bitcast <4 x i64>* [[TMP]] to i8*
				; ATTRIBUTOR-NEXT: call void @llvm.memset.p0i8.i64(i8* nonnull align 32 dereferenceable(32) [[TMP3]], i8 0, i64 32, i1 false)
				; ATTRIBUTOR-NEXT: [[TMP0:%.*]] = load <4 x i64>, <4 x i64>* [[TMP]]
				; ATTRIBUTOR-NEXT: call fastcc void @promote_avx2(<4 x i64>* noalias nocapture nonnull writeonly align 32 dereferenceable(32) [[TMP2]], <4 x i64> [[TMP0]])
				; ATTRIBUTOR-NEXT: [[TMP4:%.*]] = load <4 x i64>, <4 x i64>* [[TMP2]], align 32
				; ATTRIBUTOR-NEXT: store <4 x i64> [[TMP4]], <4 x i64>* [[ARG]], align 2
				; ATTRIBUTOR-NEXT: ret void
				;
	bb:			bb:
	%tmp = alloca <4 x i64>, align 32			%tmp = alloca <4 x i64>, align 32
	%tmp2 = alloca <4 x i64>, align 32			%tmp2 = alloca <4 x i64>, align 32
	%tmp3 = bitcast <4 x i64>* %tmp to i8*			%tmp3 = bitcast <4 x i64>* %tmp to i8*
	call void @llvm.memset.p0i8.i64(i8* align 32 %tmp3, i8 0, i64 32, i1 false)			call void @llvm.memset.p0i8.i64(i8* align 32 %tmp3, i8 0, i64 32, i1 false)
	call fastcc void @promote_avx2(<4 x i64>* %tmp2, <4 x i64>* %tmp)			call fastcc void @promote_avx2(<4 x i64>* %tmp2, <4 x i64>* %tmp)
	%tmp4 = load <4 x i64>, <4 x i64>* %tmp2, align 32			%tmp4 = load <4 x i64>, <4 x i64>* %tmp2, align 32
	store <4 x i64> %tmp4, <4 x i64>* %arg, align 2			store <4 x i64> %tmp4, <4 x i64>* %arg, align 2
	Show All 9 Lines

llvm/test/Transforms/ArgumentPromotion/X86/min-legal-vector-width.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt -S -argpromotion < %s | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt -S -argpromotion < %s | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
	; RUN: opt -S -passes=argpromotion < %s | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt -S -passes=argpromotion < %s | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
				; RUN: opt -S -passes=attributor -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=3 < %s | FileCheck %s --check-prefixes=ALL,ATTRIBUTOR
	; Test that we only promote arguments when the caller/callee have compatible			; Test that we only promote arguments when the caller/callee have compatible
	; function attributes.			; function attributes.

	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	; This should promote			; This should promote
	define internal fastcc void @callee_avx512_legal512_prefer512_call_avx512_legal512_prefer512(<8 x i64>* %arg, <8 x i64>* readonly %arg1) #0 {			define internal fastcc void @callee_avx512_legal512_prefer512_call_avx512_legal512_prefer512(<8 x i64>* %arg, <8 x i64>* readonly %arg1) #0 {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@callee_avx512_legal512_prefer512_call_avx512_legal512_prefer512			; ARGPROMOTION-LABEL: define {{[^@]+}}@callee_avx512_legal512_prefer512_call_avx512_legal512_prefer512
	; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]], <8 x i64> [[ARG1_VAL:%.*]])			; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]], <8 x i64> [[ARG1_VAL:%.*]])
	; ARGPROMOTION-NEXT: bb:			; ARGPROMOTION-NEXT: bb:
	; ARGPROMOTION-NEXT: store <8 x i64> [[ARG1_VAL]], <8 x i64>* [[ARG]]			; ARGPROMOTION-NEXT: store <8 x i64> [[ARG1_VAL]], <8 x i64>* [[ARG]]
	; ARGPROMOTION-NEXT: ret void			; ARGPROMOTION-NEXT: ret void
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@callee_avx512_legal512_prefer512_call_avx512_legal512_prefer512
				; ATTRIBUTOR-SAME: (<8 x i64>* noalias nocapture nonnull writeonly align 32 dereferenceable(64) [[ARG:%.*]], <8 x i64> [[TMP0:%.*]])
				; ATTRIBUTOR-NEXT: bb:
				; ATTRIBUTOR-NEXT: [[ARG1_PRIV:%.*]] = alloca <8 x i64>
				; ATTRIBUTOR-NEXT: store <8 x i64> [[TMP0]], <8 x i64>* [[ARG1_PRIV]]
				; ATTRIBUTOR-NEXT: [[TMP:%.*]] = load <8 x i64>, <8 x i64>* [[ARG1_PRIV]], align 32
				; ATTRIBUTOR-NEXT: store <8 x i64> [[TMP]], <8 x i64>* [[ARG]], align 32
				; ATTRIBUTOR-NEXT: ret void
				;
	bb:			bb:
	%tmp = load <8 x i64>, <8 x i64>* %arg1			%tmp = load <8 x i64>, <8 x i64>* %arg1
	store <8 x i64> %tmp, <8 x i64>* %arg			store <8 x i64> %tmp, <8 x i64>* %arg
	ret void			ret void
	}			}

	define void @avx512_legal512_prefer512_call_avx512_legal512_prefer512(<8 x i64>* %arg) #0 {			define void @avx512_legal512_prefer512_call_avx512_legal512_prefer512(<8 x i64>* %arg) #0 {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@avx512_legal512_prefer512_call_avx512_legal512_prefer512			; ARGPROMOTION-LABEL: define {{[^@]+}}@avx512_legal512_prefer512_call_avx512_legal512_prefer512
	; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]])			; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]])
	; ARGPROMOTION-NEXT: bb:			; ARGPROMOTION-NEXT: bb:
	; ARGPROMOTION-NEXT: [[TMP:%.*]] = alloca <8 x i64>, align 32			; ARGPROMOTION-NEXT: [[TMP:%.*]] = alloca <8 x i64>, align 32
	; ARGPROMOTION-NEXT: [[TMP2:%.*]] = alloca <8 x i64>, align 32			; ARGPROMOTION-NEXT: [[TMP2:%.*]] = alloca <8 x i64>, align 32
	; ARGPROMOTION-NEXT: [[TMP3:%.*]] = bitcast <8 x i64>* [[TMP]] to i8*			; ARGPROMOTION-NEXT: [[TMP3:%.*]] = bitcast <8 x i64>* [[TMP]] to i8*
	; ARGPROMOTION-NEXT: call void @llvm.memset.p0i8.i64(i8* align 32 [[TMP3]], i8 0, i64 32, i1 false)			; ARGPROMOTION-NEXT: call void @llvm.memset.p0i8.i64(i8* align 32 [[TMP3]], i8 0, i64 32, i1 false)
	; ARGPROMOTION-NEXT: [[TMP_VAL:%.*]] = load <8 x i64>, <8 x i64>* [[TMP]]			; ARGPROMOTION-NEXT: [[TMP_VAL:%.*]] = load <8 x i64>, <8 x i64>* [[TMP]]
	; ARGPROMOTION-NEXT: call fastcc void @callee_avx512_legal512_prefer512_call_avx512_legal512_prefer512(<8 x i64>* [[TMP2]], <8 x i64> [[TMP_VAL]])			; ARGPROMOTION-NEXT: call fastcc void @callee_avx512_legal512_prefer512_call_avx512_legal512_prefer512(<8 x i64>* [[TMP2]], <8 x i64> [[TMP_VAL]])
	; ARGPROMOTION-NEXT: [[TMP4:%.*]] = load <8 x i64>, <8 x i64>* [[TMP2]], align 32			; ARGPROMOTION-NEXT: [[TMP4:%.*]] = load <8 x i64>, <8 x i64>* [[TMP2]], align 32
	; ARGPROMOTION-NEXT: store <8 x i64> [[TMP4]], <8 x i64>* [[ARG]], align 2			; ARGPROMOTION-NEXT: store <8 x i64> [[TMP4]], <8 x i64>* [[ARG]], align 2
	; ARGPROMOTION-NEXT: ret void			; ARGPROMOTION-NEXT: ret void
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@avx512_legal512_prefer512_call_avx512_legal512_prefer512
				; ATTRIBUTOR-SAME: (<8 x i64>* nocapture writeonly [[ARG:%.*]])
				; ATTRIBUTOR-NEXT: bb:
				; ATTRIBUTOR-NEXT: [[TMP:%.*]] = alloca <8 x i64>, align 32
				; ATTRIBUTOR-NEXT: [[TMP2:%.*]] = alloca <8 x i64>, align 32
				; ATTRIBUTOR-NEXT: [[TMP3:%.*]] = bitcast <8 x i64>* [[TMP]] to i8*
				; ATTRIBUTOR-NEXT: call void @llvm.memset.p0i8.i64(i8* nonnull align 32 dereferenceable(64) [[TMP3]], i8 0, i64 32, i1 false)
				; ATTRIBUTOR-NEXT: [[TMP0:%.*]] = load <8 x i64>, <8 x i64>* [[TMP]]
				; ATTRIBUTOR-NEXT: call fastcc void @callee_avx512_legal512_prefer512_call_avx512_legal512_prefer512(<8 x i64>* noalias nocapture nonnull writeonly align 32 dereferenceable(64) [[TMP2]], <8 x i64> [[TMP0]])
				; ATTRIBUTOR-NEXT: [[TMP4:%.*]] = load <8 x i64>, <8 x i64>* [[TMP2]], align 32
				; ATTRIBUTOR-NEXT: store <8 x i64> [[TMP4]], <8 x i64>* [[ARG]], align 2
				; ATTRIBUTOR-NEXT: ret void
				;
	bb:			bb:
	%tmp = alloca <8 x i64>, align 32			%tmp = alloca <8 x i64>, align 32
	%tmp2 = alloca <8 x i64>, align 32			%tmp2 = alloca <8 x i64>, align 32
	%tmp3 = bitcast <8 x i64>* %tmp to i8*			%tmp3 = bitcast <8 x i64>* %tmp to i8*
	call void @llvm.memset.p0i8.i64(i8* align 32 %tmp3, i8 0, i64 32, i1 false)			call void @llvm.memset.p0i8.i64(i8* align 32 %tmp3, i8 0, i64 32, i1 false)
	call fastcc void @callee_avx512_legal512_prefer512_call_avx512_legal512_prefer512(<8 x i64>* %tmp2, <8 x i64>* %tmp)			call fastcc void @callee_avx512_legal512_prefer512_call_avx512_legal512_prefer512(<8 x i64>* %tmp2, <8 x i64>* %tmp)
	%tmp4 = load <8 x i64>, <8 x i64>* %tmp2, align 32			%tmp4 = load <8 x i64>, <8 x i64>* %tmp2, align 32
	store <8 x i64> %tmp4, <8 x i64>* %arg, align 2			store <8 x i64> %tmp4, <8 x i64>* %arg, align 2
	ret void			ret void
	}			}

	; This should promote			; This should promote
	define internal fastcc void @callee_avx512_legal512_prefer256_call_avx512_legal512_prefer256(<8 x i64>* %arg, <8 x i64>* readonly %arg1) #1 {			define internal fastcc void @callee_avx512_legal512_prefer256_call_avx512_legal512_prefer256(<8 x i64>* %arg, <8 x i64>* readonly %arg1) #1 {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@callee_avx512_legal512_prefer256_call_avx512_legal512_prefer256			; ARGPROMOTION-LABEL: define {{[^@]+}}@callee_avx512_legal512_prefer256_call_avx512_legal512_prefer256
	; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]], <8 x i64> [[ARG1_VAL:%.*]])			; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]], <8 x i64> [[ARG1_VAL:%.*]])
	; ARGPROMOTION-NEXT: bb:			; ARGPROMOTION-NEXT: bb:
	; ARGPROMOTION-NEXT: store <8 x i64> [[ARG1_VAL]], <8 x i64>* [[ARG]]			; ARGPROMOTION-NEXT: store <8 x i64> [[ARG1_VAL]], <8 x i64>* [[ARG]]
	; ARGPROMOTION-NEXT: ret void			; ARGPROMOTION-NEXT: ret void
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@callee_avx512_legal512_prefer256_call_avx512_legal512_prefer256
				; ATTRIBUTOR-SAME: (<8 x i64>* noalias nocapture nonnull writeonly align 32 dereferenceable(64) [[ARG:%.*]], <8 x i64> [[TMP0:%.*]])
				; ATTRIBUTOR-NEXT: bb:
				; ATTRIBUTOR-NEXT: [[ARG1_PRIV:%.*]] = alloca <8 x i64>
				; ATTRIBUTOR-NEXT: store <8 x i64> [[TMP0]], <8 x i64>* [[ARG1_PRIV]]
				; ATTRIBUTOR-NEXT: [[TMP:%.*]] = load <8 x i64>, <8 x i64>* [[ARG1_PRIV]], align 32
				; ATTRIBUTOR-NEXT: store <8 x i64> [[TMP]], <8 x i64>* [[ARG]], align 32
				; ATTRIBUTOR-NEXT: ret void
				;
	bb:			bb:
	%tmp = load <8 x i64>, <8 x i64>* %arg1			%tmp = load <8 x i64>, <8 x i64>* %arg1
	store <8 x i64> %tmp, <8 x i64>* %arg			store <8 x i64> %tmp, <8 x i64>* %arg
	ret void			ret void
	}			}

	define void @avx512_legal512_prefer256_call_avx512_legal512_prefer256(<8 x i64>* %arg) #1 {			define void @avx512_legal512_prefer256_call_avx512_legal512_prefer256(<8 x i64>* %arg) #1 {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@avx512_legal512_prefer256_call_avx512_legal512_prefer256			; ARGPROMOTION-LABEL: define {{[^@]+}}@avx512_legal512_prefer256_call_avx512_legal512_prefer256
	; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]])			; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]])
	; ARGPROMOTION-NEXT: bb:			; ARGPROMOTION-NEXT: bb:
	; ARGPROMOTION-NEXT: [[TMP:%.*]] = alloca <8 x i64>, align 32			; ARGPROMOTION-NEXT: [[TMP:%.*]] = alloca <8 x i64>, align 32
	; ARGPROMOTION-NEXT: [[TMP2:%.*]] = alloca <8 x i64>, align 32			; ARGPROMOTION-NEXT: [[TMP2:%.*]] = alloca <8 x i64>, align 32
	; ARGPROMOTION-NEXT: [[TMP3:%.*]] = bitcast <8 x i64>* [[TMP]] to i8*			; ARGPROMOTION-NEXT: [[TMP3:%.*]] = bitcast <8 x i64>* [[TMP]] to i8*
	; ARGPROMOTION-NEXT: call void @llvm.memset.p0i8.i64(i8* align 32 [[TMP3]], i8 0, i64 32, i1 false)			; ARGPROMOTION-NEXT: call void @llvm.memset.p0i8.i64(i8* align 32 [[TMP3]], i8 0, i64 32, i1 false)
	; ARGPROMOTION-NEXT: [[TMP_VAL:%.*]] = load <8 x i64>, <8 x i64>* [[TMP]]			; ARGPROMOTION-NEXT: [[TMP_VAL:%.*]] = load <8 x i64>, <8 x i64>* [[TMP]]
	; ARGPROMOTION-NEXT: call fastcc void @callee_avx512_legal512_prefer256_call_avx512_legal512_prefer256(<8 x i64>* [[TMP2]], <8 x i64> [[TMP_VAL]])			; ARGPROMOTION-NEXT: call fastcc void @callee_avx512_legal512_prefer256_call_avx512_legal512_prefer256(<8 x i64>* [[TMP2]], <8 x i64> [[TMP_VAL]])
	; ARGPROMOTION-NEXT: [[TMP4:%.*]] = load <8 x i64>, <8 x i64>* [[TMP2]], align 32			; ARGPROMOTION-NEXT: [[TMP4:%.*]] = load <8 x i64>, <8 x i64>* [[TMP2]], align 32
	; ARGPROMOTION-NEXT: store <8 x i64> [[TMP4]], <8 x i64>* [[ARG]], align 2			; ARGPROMOTION-NEXT: store <8 x i64> [[TMP4]], <8 x i64>* [[ARG]], align 2
	; ARGPROMOTION-NEXT: ret void			; ARGPROMOTION-NEXT: ret void
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@avx512_legal512_prefer256_call_avx512_legal512_prefer256
				; ATTRIBUTOR-SAME: (<8 x i64>* nocapture writeonly [[ARG:%.*]])
				; ATTRIBUTOR-NEXT: bb:
				; ATTRIBUTOR-NEXT: [[TMP:%.*]] = alloca <8 x i64>, align 32
				; ATTRIBUTOR-NEXT: [[TMP2:%.*]] = alloca <8 x i64>, align 32
				; ATTRIBUTOR-NEXT: [[TMP3:%.*]] = bitcast <8 x i64>* [[TMP]] to i8*
				; ATTRIBUTOR-NEXT: call void @llvm.memset.p0i8.i64(i8* nonnull align 32 dereferenceable(64) [[TMP3]], i8 0, i64 32, i1 false)
				; ATTRIBUTOR-NEXT: [[TMP0:%.*]] = load <8 x i64>, <8 x i64>* [[TMP]]
				; ATTRIBUTOR-NEXT: call fastcc void @callee_avx512_legal512_prefer256_call_avx512_legal512_prefer256(<8 x i64>* noalias nocapture nonnull writeonly align 32 dereferenceable(64) [[TMP2]], <8 x i64> [[TMP0]])
				; ATTRIBUTOR-NEXT: [[TMP4:%.*]] = load <8 x i64>, <8 x i64>* [[TMP2]], align 32
				; ATTRIBUTOR-NEXT: store <8 x i64> [[TMP4]], <8 x i64>* [[ARG]], align 2
				; ATTRIBUTOR-NEXT: ret void
				;
	bb:			bb:
	%tmp = alloca <8 x i64>, align 32			%tmp = alloca <8 x i64>, align 32
	%tmp2 = alloca <8 x i64>, align 32			%tmp2 = alloca <8 x i64>, align 32
	%tmp3 = bitcast <8 x i64>* %tmp to i8*			%tmp3 = bitcast <8 x i64>* %tmp to i8*
	call void @llvm.memset.p0i8.i64(i8* align 32 %tmp3, i8 0, i64 32, i1 false)			call void @llvm.memset.p0i8.i64(i8* align 32 %tmp3, i8 0, i64 32, i1 false)
	call fastcc void @callee_avx512_legal512_prefer256_call_avx512_legal512_prefer256(<8 x i64>* %tmp2, <8 x i64>* %tmp)			call fastcc void @callee_avx512_legal512_prefer256_call_avx512_legal512_prefer256(<8 x i64>* %tmp2, <8 x i64>* %tmp)
	%tmp4 = load <8 x i64>, <8 x i64>* %tmp2, align 32			%tmp4 = load <8 x i64>, <8 x i64>* %tmp2, align 32
	store <8 x i64> %tmp4, <8 x i64>* %arg, align 2			store <8 x i64> %tmp4, <8 x i64>* %arg, align 2
	ret void			ret void
	}			}

	; This should promote			; This should promote
	define internal fastcc void @callee_avx512_legal512_prefer512_call_avx512_legal512_prefer256(<8 x i64>* %arg, <8 x i64>* readonly %arg1) #1 {			define internal fastcc void @callee_avx512_legal512_prefer512_call_avx512_legal512_prefer256(<8 x i64>* %arg, <8 x i64>* readonly %arg1) #1 {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@callee_avx512_legal512_prefer512_call_avx512_legal512_prefer256			; ARGPROMOTION-LABEL: define {{[^@]+}}@callee_avx512_legal512_prefer512_call_avx512_legal512_prefer256
	; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]], <8 x i64> [[ARG1_VAL:%.*]])			; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]], <8 x i64> [[ARG1_VAL:%.*]])
	; ARGPROMOTION-NEXT: bb:			; ARGPROMOTION-NEXT: bb:
	; ARGPROMOTION-NEXT: store <8 x i64> [[ARG1_VAL]], <8 x i64>* [[ARG]]			; ARGPROMOTION-NEXT: store <8 x i64> [[ARG1_VAL]], <8 x i64>* [[ARG]]
	; ARGPROMOTION-NEXT: ret void			; ARGPROMOTION-NEXT: ret void
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@callee_avx512_legal512_prefer512_call_avx512_legal512_prefer256
				; ATTRIBUTOR-SAME: (<8 x i64>* noalias nocapture nonnull writeonly align 32 dereferenceable(64) [[ARG:%.*]], <8 x i64> [[TMP0:%.*]])
				; ATTRIBUTOR-NEXT: bb:
				; ATTRIBUTOR-NEXT: [[ARG1_PRIV:%.*]] = alloca <8 x i64>
				; ATTRIBUTOR-NEXT: store <8 x i64> [[TMP0]], <8 x i64>* [[ARG1_PRIV]]
				; ATTRIBUTOR-NEXT: [[TMP:%.*]] = load <8 x i64>, <8 x i64>* [[ARG1_PRIV]], align 32
				; ATTRIBUTOR-NEXT: store <8 x i64> [[TMP]], <8 x i64>* [[ARG]], align 32
				; ATTRIBUTOR-NEXT: ret void
				;
	bb:			bb:
	%tmp = load <8 x i64>, <8 x i64>* %arg1			%tmp = load <8 x i64>, <8 x i64>* %arg1
	store <8 x i64> %tmp, <8 x i64>* %arg			store <8 x i64> %tmp, <8 x i64>* %arg
	ret void			ret void
	}			}

	define void @avx512_legal512_prefer512_call_avx512_legal512_prefer256(<8 x i64>* %arg) #0 {			define void @avx512_legal512_prefer512_call_avx512_legal512_prefer256(<8 x i64>* %arg) #0 {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@avx512_legal512_prefer512_call_avx512_legal512_prefer256			; ARGPROMOTION-LABEL: define {{[^@]+}}@avx512_legal512_prefer512_call_avx512_legal512_prefer256
	; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]])			; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]])
	; ARGPROMOTION-NEXT: bb:			; ARGPROMOTION-NEXT: bb:
	; ARGPROMOTION-NEXT: [[TMP:%.*]] = alloca <8 x i64>, align 32			; ARGPROMOTION-NEXT: [[TMP:%.*]] = alloca <8 x i64>, align 32
	; ARGPROMOTION-NEXT: [[TMP2:%.*]] = alloca <8 x i64>, align 32			; ARGPROMOTION-NEXT: [[TMP2:%.*]] = alloca <8 x i64>, align 32
	; ARGPROMOTION-NEXT: [[TMP3:%.*]] = bitcast <8 x i64>* [[TMP]] to i8*			; ARGPROMOTION-NEXT: [[TMP3:%.*]] = bitcast <8 x i64>* [[TMP]] to i8*
	; ARGPROMOTION-NEXT: call void @llvm.memset.p0i8.i64(i8* align 32 [[TMP3]], i8 0, i64 32, i1 false)			; ARGPROMOTION-NEXT: call void @llvm.memset.p0i8.i64(i8* align 32 [[TMP3]], i8 0, i64 32, i1 false)
	; ARGPROMOTION-NEXT: [[TMP_VAL:%.*]] = load <8 x i64>, <8 x i64>* [[TMP]]			; ARGPROMOTION-NEXT: [[TMP_VAL:%.*]] = load <8 x i64>, <8 x i64>* [[TMP]]
	; ARGPROMOTION-NEXT: call fastcc void @callee_avx512_legal512_prefer512_call_avx512_legal512_prefer256(<8 x i64>* [[TMP2]], <8 x i64> [[TMP_VAL]])			; ARGPROMOTION-NEXT: call fastcc void @callee_avx512_legal512_prefer512_call_avx512_legal512_prefer256(<8 x i64>* [[TMP2]], <8 x i64> [[TMP_VAL]])
	; ARGPROMOTION-NEXT: [[TMP4:%.*]] = load <8 x i64>, <8 x i64>* [[TMP2]], align 32			; ARGPROMOTION-NEXT: [[TMP4:%.*]] = load <8 x i64>, <8 x i64>* [[TMP2]], align 32
	; ARGPROMOTION-NEXT: store <8 x i64> [[TMP4]], <8 x i64>* [[ARG]], align 2			; ARGPROMOTION-NEXT: store <8 x i64> [[TMP4]], <8 x i64>* [[ARG]], align 2
	; ARGPROMOTION-NEXT: ret void			; ARGPROMOTION-NEXT: ret void
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@avx512_legal512_prefer512_call_avx512_legal512_prefer256
				; ATTRIBUTOR-SAME: (<8 x i64>* nocapture writeonly [[ARG:%.*]])
				; ATTRIBUTOR-NEXT: bb:
				; ATTRIBUTOR-NEXT: [[TMP:%.*]] = alloca <8 x i64>, align 32
				; ATTRIBUTOR-NEXT: [[TMP2:%.*]] = alloca <8 x i64>, align 32
				; ATTRIBUTOR-NEXT: [[TMP3:%.*]] = bitcast <8 x i64>* [[TMP]] to i8*
				; ATTRIBUTOR-NEXT: call void @llvm.memset.p0i8.i64(i8* nonnull align 32 dereferenceable(64) [[TMP3]], i8 0, i64 32, i1 false)
				; ATTRIBUTOR-NEXT: [[TMP0:%.*]] = load <8 x i64>, <8 x i64>* [[TMP]]
				; ATTRIBUTOR-NEXT: call fastcc void @callee_avx512_legal512_prefer512_call_avx512_legal512_prefer256(<8 x i64>* noalias nocapture nonnull writeonly align 32 dereferenceable(64) [[TMP2]], <8 x i64> [[TMP0]])
				; ATTRIBUTOR-NEXT: [[TMP4:%.*]] = load <8 x i64>, <8 x i64>* [[TMP2]], align 32
				; ATTRIBUTOR-NEXT: store <8 x i64> [[TMP4]], <8 x i64>* [[ARG]], align 2
				; ATTRIBUTOR-NEXT: ret void
				;
	bb:			bb:
	%tmp = alloca <8 x i64>, align 32			%tmp = alloca <8 x i64>, align 32
	%tmp2 = alloca <8 x i64>, align 32			%tmp2 = alloca <8 x i64>, align 32
	%tmp3 = bitcast <8 x i64>* %tmp to i8*			%tmp3 = bitcast <8 x i64>* %tmp to i8*
	call void @llvm.memset.p0i8.i64(i8* align 32 %tmp3, i8 0, i64 32, i1 false)			call void @llvm.memset.p0i8.i64(i8* align 32 %tmp3, i8 0, i64 32, i1 false)
	call fastcc void @callee_avx512_legal512_prefer512_call_avx512_legal512_prefer256(<8 x i64>* %tmp2, <8 x i64>* %tmp)			call fastcc void @callee_avx512_legal512_prefer512_call_avx512_legal512_prefer256(<8 x i64>* %tmp2, <8 x i64>* %tmp)
	%tmp4 = load <8 x i64>, <8 x i64>* %tmp2, align 32			%tmp4 = load <8 x i64>, <8 x i64>* %tmp2, align 32
	store <8 x i64> %tmp4, <8 x i64>* %arg, align 2			store <8 x i64> %tmp4, <8 x i64>* %arg, align 2
	ret void			ret void
	}			}

	; This should promote			; This should promote
	define internal fastcc void @callee_avx512_legal512_prefer256_call_avx512_legal512_prefer512(<8 x i64>* %arg, <8 x i64>* readonly %arg1) #0 {			define internal fastcc void @callee_avx512_legal512_prefer256_call_avx512_legal512_prefer512(<8 x i64>* %arg, <8 x i64>* readonly %arg1) #0 {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@callee_avx512_legal512_prefer256_call_avx512_legal512_prefer512			; ARGPROMOTION-LABEL: define {{[^@]+}}@callee_avx512_legal512_prefer256_call_avx512_legal512_prefer512
	; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]], <8 x i64> [[ARG1_VAL:%.*]])			; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]], <8 x i64> [[ARG1_VAL:%.*]])
	; ARGPROMOTION-NEXT: bb:			; ARGPROMOTION-NEXT: bb:
	; ARGPROMOTION-NEXT: store <8 x i64> [[ARG1_VAL]], <8 x i64>* [[ARG]]			; ARGPROMOTION-NEXT: store <8 x i64> [[ARG1_VAL]], <8 x i64>* [[ARG]]
	; ARGPROMOTION-NEXT: ret void			; ARGPROMOTION-NEXT: ret void
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@callee_avx512_legal512_prefer256_call_avx512_legal512_prefer512
				; ATTRIBUTOR-SAME: (<8 x i64>* noalias nocapture nonnull writeonly align 32 dereferenceable(64) [[ARG:%.*]], <8 x i64> [[TMP0:%.*]])
				; ATTRIBUTOR-NEXT: bb:
				; ATTRIBUTOR-NEXT: [[ARG1_PRIV:%.*]] = alloca <8 x i64>
				; ATTRIBUTOR-NEXT: store <8 x i64> [[TMP0]], <8 x i64>* [[ARG1_PRIV]]
				; ATTRIBUTOR-NEXT: [[TMP:%.*]] = load <8 x i64>, <8 x i64>* [[ARG1_PRIV]], align 32
				; ATTRIBUTOR-NEXT: store <8 x i64> [[TMP]], <8 x i64>* [[ARG]], align 32
				; ATTRIBUTOR-NEXT: ret void
				;
	bb:			bb:
	%tmp = load <8 x i64>, <8 x i64>* %arg1			%tmp = load <8 x i64>, <8 x i64>* %arg1
	store <8 x i64> %tmp, <8 x i64>* %arg			store <8 x i64> %tmp, <8 x i64>* %arg
	ret void			ret void
	}			}

	define void @avx512_legal512_prefer256_call_avx512_legal512_prefer512(<8 x i64>* %arg) #1 {			define void @avx512_legal512_prefer256_call_avx512_legal512_prefer512(<8 x i64>* %arg) #1 {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@avx512_legal512_prefer256_call_avx512_legal512_prefer512			; ARGPROMOTION-LABEL: define {{[^@]+}}@avx512_legal512_prefer256_call_avx512_legal512_prefer512
	; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]])			; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]])
	; ARGPROMOTION-NEXT: bb:			; ARGPROMOTION-NEXT: bb:
	; ARGPROMOTION-NEXT: [[TMP:%.*]] = alloca <8 x i64>, align 32			; ARGPROMOTION-NEXT: [[TMP:%.*]] = alloca <8 x i64>, align 32
	; ARGPROMOTION-NEXT: [[TMP2:%.*]] = alloca <8 x i64>, align 32			; ARGPROMOTION-NEXT: [[TMP2:%.*]] = alloca <8 x i64>, align 32
	; ARGPROMOTION-NEXT: [[TMP3:%.*]] = bitcast <8 x i64>* [[TMP]] to i8*			; ARGPROMOTION-NEXT: [[TMP3:%.*]] = bitcast <8 x i64>* [[TMP]] to i8*
	; ARGPROMOTION-NEXT: call void @llvm.memset.p0i8.i64(i8* align 32 [[TMP3]], i8 0, i64 32, i1 false)			; ARGPROMOTION-NEXT: call void @llvm.memset.p0i8.i64(i8* align 32 [[TMP3]], i8 0, i64 32, i1 false)
	; ARGPROMOTION-NEXT: [[TMP_VAL:%.*]] = load <8 x i64>, <8 x i64>* [[TMP]]			; ARGPROMOTION-NEXT: [[TMP_VAL:%.*]] = load <8 x i64>, <8 x i64>* [[TMP]]
	; ARGPROMOTION-NEXT: call fastcc void @callee_avx512_legal512_prefer256_call_avx512_legal512_prefer512(<8 x i64>* [[TMP2]], <8 x i64> [[TMP_VAL]])			; ARGPROMOTION-NEXT: call fastcc void @callee_avx512_legal512_prefer256_call_avx512_legal512_prefer512(<8 x i64>* [[TMP2]], <8 x i64> [[TMP_VAL]])
	; ARGPROMOTION-NEXT: [[TMP4:%.*]] = load <8 x i64>, <8 x i64>* [[TMP2]], align 32			; ARGPROMOTION-NEXT: [[TMP4:%.*]] = load <8 x i64>, <8 x i64>* [[TMP2]], align 32
	; ARGPROMOTION-NEXT: store <8 x i64> [[TMP4]], <8 x i64>* [[ARG]], align 2			; ARGPROMOTION-NEXT: store <8 x i64> [[TMP4]], <8 x i64>* [[ARG]], align 2
	; ARGPROMOTION-NEXT: ret void			; ARGPROMOTION-NEXT: ret void
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@avx512_legal512_prefer256_call_avx512_legal512_prefer512
				; ATTRIBUTOR-SAME: (<8 x i64>* nocapture writeonly [[ARG:%.*]])
				; ATTRIBUTOR-NEXT: bb:
				; ATTRIBUTOR-NEXT: [[TMP:%.*]] = alloca <8 x i64>, align 32
				; ATTRIBUTOR-NEXT: [[TMP2:%.*]] = alloca <8 x i64>, align 32
				; ATTRIBUTOR-NEXT: [[TMP3:%.*]] = bitcast <8 x i64>* [[TMP]] to i8*
				; ATTRIBUTOR-NEXT: call void @llvm.memset.p0i8.i64(i8* nonnull align 32 dereferenceable(64) [[TMP3]], i8 0, i64 32, i1 false)
				; ATTRIBUTOR-NEXT: [[TMP0:%.*]] = load <8 x i64>, <8 x i64>* [[TMP]]
				; ATTRIBUTOR-NEXT: call fastcc void @callee_avx512_legal512_prefer256_call_avx512_legal512_prefer512(<8 x i64>* noalias nocapture nonnull writeonly align 32 dereferenceable(64) [[TMP2]], <8 x i64> [[TMP0]])
				; ATTRIBUTOR-NEXT: [[TMP4:%.*]] = load <8 x i64>, <8 x i64>* [[TMP2]], align 32
				; ATTRIBUTOR-NEXT: store <8 x i64> [[TMP4]], <8 x i64>* [[ARG]], align 2
				; ATTRIBUTOR-NEXT: ret void
				;
	bb:			bb:
	%tmp = alloca <8 x i64>, align 32			%tmp = alloca <8 x i64>, align 32
	%tmp2 = alloca <8 x i64>, align 32			%tmp2 = alloca <8 x i64>, align 32
	%tmp3 = bitcast <8 x i64>* %tmp to i8*			%tmp3 = bitcast <8 x i64>* %tmp to i8*
	call void @llvm.memset.p0i8.i64(i8* align 32 %tmp3, i8 0, i64 32, i1 false)			call void @llvm.memset.p0i8.i64(i8* align 32 %tmp3, i8 0, i64 32, i1 false)
	call fastcc void @callee_avx512_legal512_prefer256_call_avx512_legal512_prefer512(<8 x i64>* %tmp2, <8 x i64>* %tmp)			call fastcc void @callee_avx512_legal512_prefer256_call_avx512_legal512_prefer512(<8 x i64>* %tmp2, <8 x i64>* %tmp)
	%tmp4 = load <8 x i64>, <8 x i64>* %tmp2, align 32			%tmp4 = load <8 x i64>, <8 x i64>* %tmp2, align 32
	store <8 x i64> %tmp4, <8 x i64>* %arg, align 2			store <8 x i64> %tmp4, <8 x i64>* %arg, align 2
	ret void			ret void
	}			}

	; This should not promote			; This should not promote
	define internal fastcc void @callee_avx512_legal256_prefer256_call_avx512_legal512_prefer256(<8 x i64>* %arg, <8 x i64>* readonly %arg1) #1 {			define internal fastcc void @callee_avx512_legal256_prefer256_call_avx512_legal512_prefer256(<8 x i64>* %arg, <8 x i64>* readonly %arg1) #1 {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@callee_avx512_legal256_prefer256_call_avx512_legal512_prefer256			; ARGPROMOTION-LABEL: define {{[^@]+}}@callee_avx512_legal256_prefer256_call_avx512_legal512_prefer256
	; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]], <8 x i64>* readonly [[ARG1:%.*]])			; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]], <8 x i64>* readonly [[ARG1:%.*]])
	; ARGPROMOTION-NEXT: bb:			; ARGPROMOTION-NEXT: bb:
	; ARGPROMOTION-NEXT: [[TMP:%.*]] = load <8 x i64>, <8 x i64>* [[ARG1]]			; ARGPROMOTION-NEXT: [[TMP:%.*]] = load <8 x i64>, <8 x i64>* [[ARG1]]
	; ARGPROMOTION-NEXT: store <8 x i64> [[TMP]], <8 x i64>* [[ARG]]			; ARGPROMOTION-NEXT: store <8 x i64> [[TMP]], <8 x i64>* [[ARG]]
	; ARGPROMOTION-NEXT: ret void			; ARGPROMOTION-NEXT: ret void
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@callee_avx512_legal256_prefer256_call_avx512_legal512_prefer256
				; ATTRIBUTOR-SAME: (<8 x i64>* noalias nocapture nonnull writeonly align 32 dereferenceable(64) [[ARG:%.*]], <8 x i64>* noalias nocapture nonnull readonly align 32 dereferenceable(64) [[ARG1:%.*]])
				; ATTRIBUTOR-NEXT: bb:
				; ATTRIBUTOR-NEXT: [[TMP:%.*]] = load <8 x i64>, <8 x i64>* [[ARG1]], align 32
				; ATTRIBUTOR-NEXT: store <8 x i64> [[TMP]], <8 x i64>* [[ARG]], align 32
				; ATTRIBUTOR-NEXT: ret void
				;
	bb:			bb:
	%tmp = load <8 x i64>, <8 x i64>* %arg1			%tmp = load <8 x i64>, <8 x i64>* %arg1
	store <8 x i64> %tmp, <8 x i64>* %arg			store <8 x i64> %tmp, <8 x i64>* %arg
	ret void			ret void
	}			}

	define void @avx512_legal256_prefer256_call_avx512_legal512_prefer256(<8 x i64>* %arg) #2 {			define void @avx512_legal256_prefer256_call_avx512_legal512_prefer256(<8 x i64>* %arg) #2 {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@avx512_legal256_prefer256_call_avx512_legal512_prefer256			; ARGPROMOTION-LABEL: define {{[^@]+}}@avx512_legal256_prefer256_call_avx512_legal512_prefer256
	; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]])			; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]])
	; ARGPROMOTION-NEXT: bb:			; ARGPROMOTION-NEXT: bb:
	; ARGPROMOTION-NEXT: [[TMP:%.*]] = alloca <8 x i64>, align 32			; ARGPROMOTION-NEXT: [[TMP:%.*]] = alloca <8 x i64>, align 32
	; ARGPROMOTION-NEXT: [[TMP2:%.*]] = alloca <8 x i64>, align 32			; ARGPROMOTION-NEXT: [[TMP2:%.*]] = alloca <8 x i64>, align 32
	; ARGPROMOTION-NEXT: [[TMP3:%.*]] = bitcast <8 x i64>* [[TMP]] to i8*			; ARGPROMOTION-NEXT: [[TMP3:%.*]] = bitcast <8 x i64>* [[TMP]] to i8*
	; ARGPROMOTION-NEXT: call void @llvm.memset.p0i8.i64(i8* align 32 [[TMP3]], i8 0, i64 32, i1 false)			; ARGPROMOTION-NEXT: call void @llvm.memset.p0i8.i64(i8* align 32 [[TMP3]], i8 0, i64 32, i1 false)
	; ARGPROMOTION-NEXT: call fastcc void @callee_avx512_legal256_prefer256_call_avx512_legal512_prefer256(<8 x i64>* [[TMP2]], <8 x i64>* [[TMP]])			; ARGPROMOTION-NEXT: call fastcc void @callee_avx512_legal256_prefer256_call_avx512_legal512_prefer256(<8 x i64>* [[TMP2]], <8 x i64>* [[TMP]])
	; ARGPROMOTION-NEXT: [[TMP4:%.*]] = load <8 x i64>, <8 x i64>* [[TMP2]], align 32			; ARGPROMOTION-NEXT: [[TMP4:%.*]] = load <8 x i64>, <8 x i64>* [[TMP2]], align 32
	; ARGPROMOTION-NEXT: store <8 x i64> [[TMP4]], <8 x i64>* [[ARG]], align 2			; ARGPROMOTION-NEXT: store <8 x i64> [[TMP4]], <8 x i64>* [[ARG]], align 2
	; ARGPROMOTION-NEXT: ret void			; ARGPROMOTION-NEXT: ret void
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@avx512_legal256_prefer256_call_avx512_legal512_prefer256
				; ATTRIBUTOR-SAME: (<8 x i64>* nocapture writeonly [[ARG:%.*]])
				; ATTRIBUTOR-NEXT: bb:
				; ATTRIBUTOR-NEXT: [[TMP:%.*]] = alloca <8 x i64>, align 32
				; ATTRIBUTOR-NEXT: [[TMP2:%.*]] = alloca <8 x i64>, align 32
				; ATTRIBUTOR-NEXT: [[TMP3:%.*]] = bitcast <8 x i64>* [[TMP]] to i8*
				; ATTRIBUTOR-NEXT: call void @llvm.memset.p0i8.i64(i8* nonnull align 32 dereferenceable(64) [[TMP3]], i8 0, i64 32, i1 false)
				; ATTRIBUTOR-NEXT: call fastcc void @callee_avx512_legal256_prefer256_call_avx512_legal512_prefer256(<8 x i64>* noalias nocapture nonnull writeonly align 32 dereferenceable(64) [[TMP2]], <8 x i64>* noalias nocapture nonnull readonly align 32 dereferenceable(64) [[TMP]])
				; ATTRIBUTOR-NEXT: [[TMP4:%.*]] = load <8 x i64>, <8 x i64>* [[TMP2]], align 32
				; ATTRIBUTOR-NEXT: store <8 x i64> [[TMP4]], <8 x i64>* [[ARG]], align 2
				; ATTRIBUTOR-NEXT: ret void
				;
	bb:			bb:
	%tmp = alloca <8 x i64>, align 32			%tmp = alloca <8 x i64>, align 32
	%tmp2 = alloca <8 x i64>, align 32			%tmp2 = alloca <8 x i64>, align 32
	%tmp3 = bitcast <8 x i64>* %tmp to i8*			%tmp3 = bitcast <8 x i64>* %tmp to i8*
	call void @llvm.memset.p0i8.i64(i8* align 32 %tmp3, i8 0, i64 32, i1 false)			call void @llvm.memset.p0i8.i64(i8* align 32 %tmp3, i8 0, i64 32, i1 false)
	call fastcc void @callee_avx512_legal256_prefer256_call_avx512_legal512_prefer256(<8 x i64>* %tmp2, <8 x i64>* %tmp)			call fastcc void @callee_avx512_legal256_prefer256_call_avx512_legal512_prefer256(<8 x i64>* %tmp2, <8 x i64>* %tmp)
	%tmp4 = load <8 x i64>, <8 x i64>* %tmp2, align 32			%tmp4 = load <8 x i64>, <8 x i64>* %tmp2, align 32
	store <8 x i64> %tmp4, <8 x i64>* %arg, align 2			store <8 x i64> %tmp4, <8 x i64>* %arg, align 2
	ret void			ret void
	}			}

	; This should not promote			; This should not promote
	define internal fastcc void @callee_avx512_legal512_prefer256_call_avx512_legal256_prefer256(<8 x i64>* %arg, <8 x i64>* readonly %arg1) #2 {			define internal fastcc void @callee_avx512_legal512_prefer256_call_avx512_legal256_prefer256(<8 x i64>* %arg, <8 x i64>* readonly %arg1) #2 {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@callee_avx512_legal512_prefer256_call_avx512_legal256_prefer256			; ARGPROMOTION-LABEL: define {{[^@]+}}@callee_avx512_legal512_prefer256_call_avx512_legal256_prefer256
	; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]], <8 x i64>* readonly [[ARG1:%.*]])			; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]], <8 x i64>* readonly [[ARG1:%.*]])
	; ARGPROMOTION-NEXT: bb:			; ARGPROMOTION-NEXT: bb:
	; ARGPROMOTION-NEXT: [[TMP:%.*]] = load <8 x i64>, <8 x i64>* [[ARG1]]			; ARGPROMOTION-NEXT: [[TMP:%.*]] = load <8 x i64>, <8 x i64>* [[ARG1]]
	; ARGPROMOTION-NEXT: store <8 x i64> [[TMP]], <8 x i64>* [[ARG]]			; ARGPROMOTION-NEXT: store <8 x i64> [[TMP]], <8 x i64>* [[ARG]]
	; ARGPROMOTION-NEXT: ret void			; ARGPROMOTION-NEXT: ret void
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@callee_avx512_legal512_prefer256_call_avx512_legal256_prefer256
				; ATTRIBUTOR-SAME: (<8 x i64>* noalias nocapture nonnull writeonly align 32 dereferenceable(64) [[ARG:%.*]], <8 x i64>* noalias nocapture nonnull readonly align 32 dereferenceable(64) [[ARG1:%.*]])
				; ATTRIBUTOR-NEXT: bb:
				; ATTRIBUTOR-NEXT: [[TMP:%.*]] = load <8 x i64>, <8 x i64>* [[ARG1]], align 32
				; ATTRIBUTOR-NEXT: store <8 x i64> [[TMP]], <8 x i64>* [[ARG]], align 32
				; ATTRIBUTOR-NEXT: ret void
				;
	bb:			bb:
	%tmp = load <8 x i64>, <8 x i64>* %arg1			%tmp = load <8 x i64>, <8 x i64>* %arg1
	store <8 x i64> %tmp, <8 x i64>* %arg			store <8 x i64> %tmp, <8 x i64>* %arg
	ret void			ret void
	}			}

	define void @avx512_legal512_prefer256_call_avx512_legal256_prefer256(<8 x i64>* %arg) #1 {			define void @avx512_legal512_prefer256_call_avx512_legal256_prefer256(<8 x i64>* %arg) #1 {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@avx512_legal512_prefer256_call_avx512_legal256_prefer256			; ARGPROMOTION-LABEL: define {{[^@]+}}@avx512_legal512_prefer256_call_avx512_legal256_prefer256
	; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]])			; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]])
	; ARGPROMOTION-NEXT: bb:			; ARGPROMOTION-NEXT: bb:
	; ARGPROMOTION-NEXT: [[TMP:%.*]] = alloca <8 x i64>, align 32			; ARGPROMOTION-NEXT: [[TMP:%.*]] = alloca <8 x i64>, align 32
	; ARGPROMOTION-NEXT: [[TMP2:%.*]] = alloca <8 x i64>, align 32			; ARGPROMOTION-NEXT: [[TMP2:%.*]] = alloca <8 x i64>, align 32
	; ARGPROMOTION-NEXT: [[TMP3:%.*]] = bitcast <8 x i64>* [[TMP]] to i8*			; ARGPROMOTION-NEXT: [[TMP3:%.*]] = bitcast <8 x i64>* [[TMP]] to i8*
	; ARGPROMOTION-NEXT: call void @llvm.memset.p0i8.i64(i8* align 32 [[TMP3]], i8 0, i64 32, i1 false)			; ARGPROMOTION-NEXT: call void @llvm.memset.p0i8.i64(i8* align 32 [[TMP3]], i8 0, i64 32, i1 false)
	; ARGPROMOTION-NEXT: call fastcc void @callee_avx512_legal512_prefer256_call_avx512_legal256_prefer256(<8 x i64>* [[TMP2]], <8 x i64>* [[TMP]])			; ARGPROMOTION-NEXT: call fastcc void @callee_avx512_legal512_prefer256_call_avx512_legal256_prefer256(<8 x i64>* [[TMP2]], <8 x i64>* [[TMP]])
	; ARGPROMOTION-NEXT: [[TMP4:%.*]] = load <8 x i64>, <8 x i64>* [[TMP2]], align 32			; ARGPROMOTION-NEXT: [[TMP4:%.*]] = load <8 x i64>, <8 x i64>* [[TMP2]], align 32
	; ARGPROMOTION-NEXT: store <8 x i64> [[TMP4]], <8 x i64>* [[ARG]], align 2			; ARGPROMOTION-NEXT: store <8 x i64> [[TMP4]], <8 x i64>* [[ARG]], align 2
	; ARGPROMOTION-NEXT: ret void			; ARGPROMOTION-NEXT: ret void
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@avx512_legal512_prefer256_call_avx512_legal256_prefer256
				; ATTRIBUTOR-SAME: (<8 x i64>* nocapture writeonly [[ARG:%.*]])
				; ATTRIBUTOR-NEXT: bb:
				; ATTRIBUTOR-NEXT: [[TMP:%.*]] = alloca <8 x i64>, align 32
				; ATTRIBUTOR-NEXT: [[TMP2:%.*]] = alloca <8 x i64>, align 32
				; ATTRIBUTOR-NEXT: [[TMP3:%.*]] = bitcast <8 x i64>* [[TMP]] to i8*
				; ATTRIBUTOR-NEXT: call void @llvm.memset.p0i8.i64(i8* nonnull align 32 dereferenceable(64) [[TMP3]], i8 0, i64 32, i1 false)
				; ATTRIBUTOR-NEXT: call fastcc void @callee_avx512_legal512_prefer256_call_avx512_legal256_prefer256(<8 x i64>* noalias nocapture nonnull writeonly align 32 dereferenceable(64) [[TMP2]], <8 x i64>* noalias nocapture nonnull readonly align 32 dereferenceable(64) [[TMP]])
				; ATTRIBUTOR-NEXT: [[TMP4:%.*]] = load <8 x i64>, <8 x i64>* [[TMP2]], align 32
				; ATTRIBUTOR-NEXT: store <8 x i64> [[TMP4]], <8 x i64>* [[ARG]], align 2
				; ATTRIBUTOR-NEXT: ret void
				;
	bb:			bb:
	%tmp = alloca <8 x i64>, align 32			%tmp = alloca <8 x i64>, align 32
	%tmp2 = alloca <8 x i64>, align 32			%tmp2 = alloca <8 x i64>, align 32
	%tmp3 = bitcast <8 x i64>* %tmp to i8*			%tmp3 = bitcast <8 x i64>* %tmp to i8*
	call void @llvm.memset.p0i8.i64(i8* align 32 %tmp3, i8 0, i64 32, i1 false)			call void @llvm.memset.p0i8.i64(i8* align 32 %tmp3, i8 0, i64 32, i1 false)
	call fastcc void @callee_avx512_legal512_prefer256_call_avx512_legal256_prefer256(<8 x i64>* %tmp2, <8 x i64>* %tmp)			call fastcc void @callee_avx512_legal512_prefer256_call_avx512_legal256_prefer256(<8 x i64>* %tmp2, <8 x i64>* %tmp)
	%tmp4 = load <8 x i64>, <8 x i64>* %tmp2, align 32			%tmp4 = load <8 x i64>, <8 x i64>* %tmp2, align 32
	store <8 x i64> %tmp4, <8 x i64>* %arg, align 2			store <8 x i64> %tmp4, <8 x i64>* %arg, align 2
	ret void			ret void
	}			}

	; This should promote			; This should promote
	define internal fastcc void @callee_avx2_legal256_prefer256_call_avx2_legal512_prefer256(<8 x i64>* %arg, <8 x i64>* readonly %arg1) #3 {			define internal fastcc void @callee_avx2_legal256_prefer256_call_avx2_legal512_prefer256(<8 x i64>* %arg, <8 x i64>* readonly %arg1) #3 {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@callee_avx2_legal256_prefer256_call_avx2_legal512_prefer256			; ARGPROMOTION-LABEL: define {{[^@]+}}@callee_avx2_legal256_prefer256_call_avx2_legal512_prefer256
	; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]], <8 x i64> [[ARG1_VAL:%.*]])			; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]], <8 x i64> [[ARG1_VAL:%.*]])
	; ARGPROMOTION-NEXT: bb:			; ARGPROMOTION-NEXT: bb:
	; ARGPROMOTION-NEXT: store <8 x i64> [[ARG1_VAL]], <8 x i64>* [[ARG]]			; ARGPROMOTION-NEXT: store <8 x i64> [[ARG1_VAL]], <8 x i64>* [[ARG]]
	; ARGPROMOTION-NEXT: ret void			; ARGPROMOTION-NEXT: ret void
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@callee_avx2_legal256_prefer256_call_avx2_legal512_prefer256
				; ATTRIBUTOR-SAME: (<8 x i64>* noalias nocapture nonnull writeonly align 32 dereferenceable(64) [[ARG:%.*]], <8 x i64> [[TMP0:%.*]])
				; ATTRIBUTOR-NEXT: bb:
				; ATTRIBUTOR-NEXT: [[ARG1_PRIV:%.*]] = alloca <8 x i64>
				; ATTRIBUTOR-NEXT: store <8 x i64> [[TMP0]], <8 x i64>* [[ARG1_PRIV]]
				; ATTRIBUTOR-NEXT: [[TMP:%.*]] = load <8 x i64>, <8 x i64>* [[ARG1_PRIV]], align 32
				; ATTRIBUTOR-NEXT: store <8 x i64> [[TMP]], <8 x i64>* [[ARG]], align 32
				; ATTRIBUTOR-NEXT: ret void
				;
	bb:			bb:
	%tmp = load <8 x i64>, <8 x i64>* %arg1			%tmp = load <8 x i64>, <8 x i64>* %arg1
	store <8 x i64> %tmp, <8 x i64>* %arg			store <8 x i64> %tmp, <8 x i64>* %arg
	ret void			ret void
	}			}

	define void @avx2_legal256_prefer256_call_avx2_legal512_prefer256(<8 x i64>* %arg) #4 {			define void @avx2_legal256_prefer256_call_avx2_legal512_prefer256(<8 x i64>* %arg) #4 {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@avx2_legal256_prefer256_call_avx2_legal512_prefer256			; ARGPROMOTION-LABEL: define {{[^@]+}}@avx2_legal256_prefer256_call_avx2_legal512_prefer256
	; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]])			; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]])
	; ARGPROMOTION-NEXT: bb:			; ARGPROMOTION-NEXT: bb:
	; ARGPROMOTION-NEXT: [[TMP:%.*]] = alloca <8 x i64>, align 32			; ARGPROMOTION-NEXT: [[TMP:%.*]] = alloca <8 x i64>, align 32
	; ARGPROMOTION-NEXT: [[TMP2:%.*]] = alloca <8 x i64>, align 32			; ARGPROMOTION-NEXT: [[TMP2:%.*]] = alloca <8 x i64>, align 32
	; ARGPROMOTION-NEXT: [[TMP3:%.*]] = bitcast <8 x i64>* [[TMP]] to i8*			; ARGPROMOTION-NEXT: [[TMP3:%.*]] = bitcast <8 x i64>* [[TMP]] to i8*
	; ARGPROMOTION-NEXT: call void @llvm.memset.p0i8.i64(i8* align 32 [[TMP3]], i8 0, i64 32, i1 false)			; ARGPROMOTION-NEXT: call void @llvm.memset.p0i8.i64(i8* align 32 [[TMP3]], i8 0, i64 32, i1 false)
	; ARGPROMOTION-NEXT: [[TMP_VAL:%.*]] = load <8 x i64>, <8 x i64>* [[TMP]]			; ARGPROMOTION-NEXT: [[TMP_VAL:%.*]] = load <8 x i64>, <8 x i64>* [[TMP]]
	; ARGPROMOTION-NEXT: call fastcc void @callee_avx2_legal256_prefer256_call_avx2_legal512_prefer256(<8 x i64>* [[TMP2]], <8 x i64> [[TMP_VAL]])			; ARGPROMOTION-NEXT: call fastcc void @callee_avx2_legal256_prefer256_call_avx2_legal512_prefer256(<8 x i64>* [[TMP2]], <8 x i64> [[TMP_VAL]])
	; ARGPROMOTION-NEXT: [[TMP4:%.*]] = load <8 x i64>, <8 x i64>* [[TMP2]], align 32			; ARGPROMOTION-NEXT: [[TMP4:%.*]] = load <8 x i64>, <8 x i64>* [[TMP2]], align 32
	; ARGPROMOTION-NEXT: store <8 x i64> [[TMP4]], <8 x i64>* [[ARG]], align 2			; ARGPROMOTION-NEXT: store <8 x i64> [[TMP4]], <8 x i64>* [[ARG]], align 2
	; ARGPROMOTION-NEXT: ret void			; ARGPROMOTION-NEXT: ret void
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@avx2_legal256_prefer256_call_avx2_legal512_prefer256
				; ATTRIBUTOR-SAME: (<8 x i64>* nocapture writeonly [[ARG:%.*]])
				; ATTRIBUTOR-NEXT: bb:
				; ATTRIBUTOR-NEXT: [[TMP:%.*]] = alloca <8 x i64>, align 32
				; ATTRIBUTOR-NEXT: [[TMP2:%.*]] = alloca <8 x i64>, align 32
				; ATTRIBUTOR-NEXT: [[TMP3:%.*]] = bitcast <8 x i64>* [[TMP]] to i8*
				; ATTRIBUTOR-NEXT: call void @llvm.memset.p0i8.i64(i8* nonnull align 32 dereferenceable(64) [[TMP3]], i8 0, i64 32, i1 false)
				; ATTRIBUTOR-NEXT: [[TMP0:%.*]] = load <8 x i64>, <8 x i64>* [[TMP]]
				; ATTRIBUTOR-NEXT: call fastcc void @callee_avx2_legal256_prefer256_call_avx2_legal512_prefer256(<8 x i64>* noalias nocapture nonnull writeonly align 32 dereferenceable(64) [[TMP2]], <8 x i64> [[TMP0]])
				; ATTRIBUTOR-NEXT: [[TMP4:%.*]] = load <8 x i64>, <8 x i64>* [[TMP2]], align 32
				; ATTRIBUTOR-NEXT: store <8 x i64> [[TMP4]], <8 x i64>* [[ARG]], align 2
				; ATTRIBUTOR-NEXT: ret void
				;
	bb:			bb:
	%tmp = alloca <8 x i64>, align 32			%tmp = alloca <8 x i64>, align 32
	%tmp2 = alloca <8 x i64>, align 32			%tmp2 = alloca <8 x i64>, align 32
	%tmp3 = bitcast <8 x i64>* %tmp to i8*			%tmp3 = bitcast <8 x i64>* %tmp to i8*
	call void @llvm.memset.p0i8.i64(i8* align 32 %tmp3, i8 0, i64 32, i1 false)			call void @llvm.memset.p0i8.i64(i8* align 32 %tmp3, i8 0, i64 32, i1 false)
	call fastcc void @callee_avx2_legal256_prefer256_call_avx2_legal512_prefer256(<8 x i64>* %tmp2, <8 x i64>* %tmp)			call fastcc void @callee_avx2_legal256_prefer256_call_avx2_legal512_prefer256(<8 x i64>* %tmp2, <8 x i64>* %tmp)
	%tmp4 = load <8 x i64>, <8 x i64>* %tmp2, align 32			%tmp4 = load <8 x i64>, <8 x i64>* %tmp2, align 32
	store <8 x i64> %tmp4, <8 x i64>* %arg, align 2			store <8 x i64> %tmp4, <8 x i64>* %arg, align 2
	ret void			ret void
	}			}

	; This should promote			; This should promote
	define internal fastcc void @callee_avx2_legal512_prefer256_call_avx2_legal256_prefer256(<8 x i64>* %arg, <8 x i64>* readonly %arg1) #4 {			define internal fastcc void @callee_avx2_legal512_prefer256_call_avx2_legal256_prefer256(<8 x i64>* %arg, <8 x i64>* readonly %arg1) #4 {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@callee_avx2_legal512_prefer256_call_avx2_legal256_prefer256			; ARGPROMOTION-LABEL: define {{[^@]+}}@callee_avx2_legal512_prefer256_call_avx2_legal256_prefer256
	; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]], <8 x i64> [[ARG1_VAL:%.*]])			; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]], <8 x i64> [[ARG1_VAL:%.*]])
	; ARGPROMOTION-NEXT: bb:			; ARGPROMOTION-NEXT: bb:
	; ARGPROMOTION-NEXT: store <8 x i64> [[ARG1_VAL]], <8 x i64>* [[ARG]]			; ARGPROMOTION-NEXT: store <8 x i64> [[ARG1_VAL]], <8 x i64>* [[ARG]]
	; ARGPROMOTION-NEXT: ret void			; ARGPROMOTION-NEXT: ret void
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@callee_avx2_legal512_prefer256_call_avx2_legal256_prefer256
				; ATTRIBUTOR-SAME: (<8 x i64>* noalias nocapture nonnull writeonly align 32 dereferenceable(64) [[ARG:%.*]], <8 x i64> [[TMP0:%.*]])
				; ATTRIBUTOR-NEXT: bb:
				; ATTRIBUTOR-NEXT: [[ARG1_PRIV:%.*]] = alloca <8 x i64>
				; ATTRIBUTOR-NEXT: store <8 x i64> [[TMP0]], <8 x i64>* [[ARG1_PRIV]]
				; ATTRIBUTOR-NEXT: [[TMP:%.*]] = load <8 x i64>, <8 x i64>* [[ARG1_PRIV]], align 32
				; ATTRIBUTOR-NEXT: store <8 x i64> [[TMP]], <8 x i64>* [[ARG]], align 32
				; ATTRIBUTOR-NEXT: ret void
				;
	bb:			bb:
	%tmp = load <8 x i64>, <8 x i64>* %arg1			%tmp = load <8 x i64>, <8 x i64>* %arg1
	store <8 x i64> %tmp, <8 x i64>* %arg			store <8 x i64> %tmp, <8 x i64>* %arg
	ret void			ret void
	}			}

	define void @avx2_legal512_prefer256_call_avx2_legal256_prefer256(<8 x i64>* %arg) #3 {			define void @avx2_legal512_prefer256_call_avx2_legal256_prefer256(<8 x i64>* %arg) #3 {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@avx2_legal512_prefer256_call_avx2_legal256_prefer256			; ARGPROMOTION-LABEL: define {{[^@]+}}@avx2_legal512_prefer256_call_avx2_legal256_prefer256
	; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]])			; ARGPROMOTION-SAME: (<8 x i64>* [[ARG:%.*]])
	; ARGPROMOTION-NEXT: bb:			; ARGPROMOTION-NEXT: bb:
	; ARGPROMOTION-NEXT: [[TMP:%.*]] = alloca <8 x i64>, align 32			; ARGPROMOTION-NEXT: [[TMP:%.*]] = alloca <8 x i64>, align 32
	; ARGPROMOTION-NEXT: [[TMP2:%.*]] = alloca <8 x i64>, align 32			; ARGPROMOTION-NEXT: [[TMP2:%.*]] = alloca <8 x i64>, align 32
	; ARGPROMOTION-NEXT: [[TMP3:%.*]] = bitcast <8 x i64>* [[TMP]] to i8*			; ARGPROMOTION-NEXT: [[TMP3:%.*]] = bitcast <8 x i64>* [[TMP]] to i8*
	; ARGPROMOTION-NEXT: call void @llvm.memset.p0i8.i64(i8* align 32 [[TMP3]], i8 0, i64 32, i1 false)			; ARGPROMOTION-NEXT: call void @llvm.memset.p0i8.i64(i8* align 32 [[TMP3]], i8 0, i64 32, i1 false)
	; ARGPROMOTION-NEXT: [[TMP_VAL:%.*]] = load <8 x i64>, <8 x i64>* [[TMP]]			; ARGPROMOTION-NEXT: [[TMP_VAL:%.*]] = load <8 x i64>, <8 x i64>* [[TMP]]
	; ARGPROMOTION-NEXT: call fastcc void @callee_avx2_legal512_prefer256_call_avx2_legal256_prefer256(<8 x i64>* [[TMP2]], <8 x i64> [[TMP_VAL]])			; ARGPROMOTION-NEXT: call fastcc void @callee_avx2_legal512_prefer256_call_avx2_legal256_prefer256(<8 x i64>* [[TMP2]], <8 x i64> [[TMP_VAL]])
	; ARGPROMOTION-NEXT: [[TMP4:%.*]] = load <8 x i64>, <8 x i64>* [[TMP2]], align 32			; ARGPROMOTION-NEXT: [[TMP4:%.*]] = load <8 x i64>, <8 x i64>* [[TMP2]], align 32
	; ARGPROMOTION-NEXT: store <8 x i64> [[TMP4]], <8 x i64>* [[ARG]], align 2			; ARGPROMOTION-NEXT: store <8 x i64> [[TMP4]], <8 x i64>* [[ARG]], align 2
	; ARGPROMOTION-NEXT: ret void			; ARGPROMOTION-NEXT: ret void
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@avx2_legal512_prefer256_call_avx2_legal256_prefer256
				; ATTRIBUTOR-SAME: (<8 x i64>* nocapture writeonly [[ARG:%.*]])
				; ATTRIBUTOR-NEXT: bb:
				; ATTRIBUTOR-NEXT: [[TMP:%.*]] = alloca <8 x i64>, align 32
				; ATTRIBUTOR-NEXT: [[TMP2:%.*]] = alloca <8 x i64>, align 32
				; ATTRIBUTOR-NEXT: [[TMP3:%.*]] = bitcast <8 x i64>* [[TMP]] to i8*
				; ATTRIBUTOR-NEXT: call void @llvm.memset.p0i8.i64(i8* nonnull align 32 dereferenceable(64) [[TMP3]], i8 0, i64 32, i1 false)
				; ATTRIBUTOR-NEXT: [[TMP0:%.*]] = load <8 x i64>, <8 x i64>* [[TMP]]
				; ATTRIBUTOR-NEXT: call fastcc void @callee_avx2_legal512_prefer256_call_avx2_legal256_prefer256(<8 x i64>* noalias nocapture nonnull writeonly align 32 dereferenceable(64) [[TMP2]], <8 x i64> [[TMP0]])
				; ATTRIBUTOR-NEXT: [[TMP4:%.*]] = load <8 x i64>, <8 x i64>* [[TMP2]], align 32
				; ATTRIBUTOR-NEXT: store <8 x i64> [[TMP4]], <8 x i64>* [[ARG]], align 2
				; ATTRIBUTOR-NEXT: ret void
				;
	bb:			bb:
	%tmp = alloca <8 x i64>, align 32			%tmp = alloca <8 x i64>, align 32
	%tmp2 = alloca <8 x i64>, align 32			%tmp2 = alloca <8 x i64>, align 32
	%tmp3 = bitcast <8 x i64>* %tmp to i8*			%tmp3 = bitcast <8 x i64>* %tmp to i8*
	call void @llvm.memset.p0i8.i64(i8* align 32 %tmp3, i8 0, i64 32, i1 false)			call void @llvm.memset.p0i8.i64(i8* align 32 %tmp3, i8 0, i64 32, i1 false)
	call fastcc void @callee_avx2_legal512_prefer256_call_avx2_legal256_prefer256(<8 x i64>* %tmp2, <8 x i64>* %tmp)			call fastcc void @callee_avx2_legal512_prefer256_call_avx2_legal256_prefer256(<8 x i64>* %tmp2, <8 x i64>* %tmp)
	%tmp4 = load <8 x i64>, <8 x i64>* %tmp2, align 32			%tmp4 = load <8 x i64>, <8 x i64>* %tmp2, align 32
	store <8 x i64> %tmp4, <8 x i64>* %arg, align 2			store <8 x i64> %tmp4, <8 x i64>* %arg, align 2

llvm/test/Transforms/ArgumentPromotion/X86/thiscall.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; In PR41658, argpromotion put an inalloca in a position that per the			; In PR41658, argpromotion put an inalloca in a position that per the
	; calling convention is passed in a register. This test verifies that			; calling convention is passed in a register. This test verifies that
	; we don't do that anymore. It also verifies that the combination of			; we don't do that anymore. It also verifies that the combination of
	; globalopt and argpromotion is able to optimize the call safely.			; globalopt and argpromotion is able to optimize the call safely.
	;			;
	; RUN: opt -S -argpromotion %s | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt -S -argpromotion %s | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
	; RUN: opt -S -globalopt -argpromotion %s | FileCheck %s --check-prefixes=GLOBALOPT_ARGPROMOTION,ALL			; RUN: opt -S -globalopt -argpromotion %s | FileCheck %s --check-prefixes=ALL,GLOBALOPT_ARGPROMOTION
				; RUN: opt -S -passes=attributor -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=2 < %s | FileCheck %s --check-prefixes=ALL,ATTRIBUTOR

	target datalayout = "e-m:x-p:32:32-i64:64-f80:32-n8:16:32-a:0:32-S32"			target datalayout = "e-m:x-p:32:32-i64:64-f80:32-n8:16:32-a:0:32-S32"
	target triple = "i386-pc-windows-msvc19.11.0"			target triple = "i386-pc-windows-msvc19.11.0"

	%struct.a = type { i8 }			%struct.a = type { i8 }

	define internal x86_thiscallcc void @internalfun(%struct.a* %this, <{ %struct.a }>* inalloca) {			define internal x86_thiscallcc void @internalfun(%struct.a* %this, <{ %struct.a }>* inalloca) {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@internalfun			; ARGPROMOTION-LABEL: define {{[^@]+}}@internalfun
	; GLOBALOPT_ARGPROMOTION-NEXT: entry:			; GLOBALOPT_ARGPROMOTION-NEXT: entry:
	; GLOBALOPT_ARGPROMOTION-NEXT: [[A:%.*]] = getelementptr inbounds <{ [[STRUCT_A]] }>, <{ [[STRUCT_A]] }>* [[TMP0]], i32 0, i32 0			; GLOBALOPT_ARGPROMOTION-NEXT: [[A:%.*]] = getelementptr inbounds <{ [[STRUCT_A]] }>, <{ [[STRUCT_A]] }>* [[TMP0]], i32 0, i32 0
	; GLOBALOPT_ARGPROMOTION-NEXT: [[ARGMEM:%.*]] = alloca inalloca <{ [[STRUCT_A]] }>, align 4			; GLOBALOPT_ARGPROMOTION-NEXT: [[ARGMEM:%.*]] = alloca inalloca <{ [[STRUCT_A]] }>, align 4
	; GLOBALOPT_ARGPROMOTION-NEXT: [[TMP1:%.*]] = getelementptr inbounds <{ [[STRUCT_A]] }>, <{ [[STRUCT_A]] }>* [[ARGMEM]], i32 0, i32 0			; GLOBALOPT_ARGPROMOTION-NEXT: [[TMP1:%.*]] = getelementptr inbounds <{ [[STRUCT_A]] }>, <{ [[STRUCT_A]] }>* [[ARGMEM]], i32 0, i32 0
	; GLOBALOPT_ARGPROMOTION-NEXT: [[CALL:%.*]] = call x86_thiscallcc %struct.a* @copy_ctor(%struct.a* [[TMP1]], %struct.a* dereferenceable(1) [[A]])			; GLOBALOPT_ARGPROMOTION-NEXT: [[CALL:%.*]] = call x86_thiscallcc %struct.a* @copy_ctor(%struct.a* [[TMP1]], %struct.a* dereferenceable(1) [[A]])
	; GLOBALOPT_ARGPROMOTION-NEXT: call void @ext(<{ [[STRUCT_A]] }>* inalloca [[ARGMEM]])			; GLOBALOPT_ARGPROMOTION-NEXT: call void @ext(<{ [[STRUCT_A]] }>* inalloca [[ARGMEM]])
	; GLOBALOPT_ARGPROMOTION-NEXT: ret void			; GLOBALOPT_ARGPROMOTION-NEXT: ret void
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@internalfun
				; ATTRIBUTOR-SAME: (%struct.a* nocapture readnone [[THIS:%.*]], <{ [[STRUCT_A:%.*]] }>* inalloca nonnull align 4 dereferenceable(1) [[TMP0:%.*]])
				; ATTRIBUTOR-NEXT: entry:
				; ATTRIBUTOR-NEXT: [[A:%.*]] = getelementptr inbounds <{ [[STRUCT_A]] }>, <{ [[STRUCT_A]] }>* [[TMP0]], i32 0, i32 0
				; ATTRIBUTOR-NEXT: [[ARGMEM:%.*]] = alloca inalloca <{ [[STRUCT_A]] }>, align 4
				; ATTRIBUTOR-NEXT: [[TMP1:%.*]] = getelementptr inbounds <{ [[STRUCT_A]] }>, <{ [[STRUCT_A]] }>* [[ARGMEM]], i32 0, i32 0
				; ATTRIBUTOR-NEXT: [[CALL:%.*]] = call x86_thiscallcc %struct.a* @copy_ctor(%struct.a* nonnull align 4 dereferenceable(1) [[TMP1]], %struct.a* nonnull align 4 dereferenceable(1) [[A]])
				; ATTRIBUTOR-NEXT: call void @ext(<{ [[STRUCT_A]] }>* inalloca nonnull align 4 dereferenceable(1) [[ARGMEM]])
				; ATTRIBUTOR-NEXT: ret void
				;
	entry:			entry:
	%a = getelementptr inbounds <{ %struct.a }>, <{ %struct.a }>* %0, i32 0, i32 0			%a = getelementptr inbounds <{ %struct.a }>, <{ %struct.a }>* %0, i32 0, i32 0
	%argmem = alloca inalloca <{ %struct.a }>, align 4			%argmem = alloca inalloca <{ %struct.a }>, align 4
	%1 = getelementptr inbounds <{ %struct.a }>, <{ %struct.a }>* %argmem, i32 0, i32 0			%1 = getelementptr inbounds <{ %struct.a }>, <{ %struct.a }>* %argmem, i32 0, i32 0
	%call = call x86_thiscallcc %struct.a* @copy_ctor(%struct.a* %1, %struct.a* dereferenceable(1) %a)			%call = call x86_thiscallcc %struct.a* @copy_ctor(%struct.a* %1, %struct.a* dereferenceable(1) %a)
	call void @ext(<{ %struct.a }>* inalloca %argmem)			call void @ext(<{ %struct.a }>* inalloca %argmem)
	ret void			ret void
	}			}
	[11 lines not shown]
	; GLOBALOPT_ARGPROMOTION-LABEL: define {{[^@]+}}@exportedfun			; GLOBALOPT_ARGPROMOTION-LABEL: define {{[^@]+}}@exportedfun
	; GLOBALOPT_ARGPROMOTION-SAME: (%struct.a* [[A:%.*]]) local_unnamed_addr			; GLOBALOPT_ARGPROMOTION-SAME: (%struct.a* [[A:%.*]]) local_unnamed_addr
	; GLOBALOPT_ARGPROMOTION-NEXT: [[INALLOCA_SAVE:%.*]] = tail call i8* @llvm.stacksave()			; GLOBALOPT_ARGPROMOTION-NEXT: [[INALLOCA_SAVE:%.*]] = tail call i8* @llvm.stacksave()
	; GLOBALOPT_ARGPROMOTION-NEXT: [[ARGMEM:%.*]] = alloca inalloca <{ [[STRUCT_A:%.*]] }>, align 4			; GLOBALOPT_ARGPROMOTION-NEXT: [[ARGMEM:%.*]] = alloca inalloca <{ [[STRUCT_A:%.*]] }>, align 4
	; GLOBALOPT_ARGPROMOTION-NEXT: call fastcc void @internalfun(<{ [[STRUCT_A]] }>* [[ARGMEM]])			; GLOBALOPT_ARGPROMOTION-NEXT: call fastcc void @internalfun(<{ [[STRUCT_A]] }>* [[ARGMEM]])
	; GLOBALOPT_ARGPROMOTION-NEXT: call void @llvm.stackrestore(i8* [[INALLOCA_SAVE]])			; GLOBALOPT_ARGPROMOTION-NEXT: call void @llvm.stackrestore(i8* [[INALLOCA_SAVE]])
	; GLOBALOPT_ARGPROMOTION-NEXT: ret void			; GLOBALOPT_ARGPROMOTION-NEXT: ret void
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@exportedfun
				; ATTRIBUTOR-SAME: (%struct.a* nocapture readnone [[A:%.*]])
				; ATTRIBUTOR-NEXT: [[INALLOCA_SAVE:%.*]] = tail call i8* @llvm.stacksave()
				; ATTRIBUTOR-NEXT: [[ARGMEM:%.*]] = alloca inalloca <{ [[STRUCT_A:%.*]] }>, align 4
				; ATTRIBUTOR-NEXT: call x86_thiscallcc void @internalfun(%struct.a* nocapture readnone undef, <{ [[STRUCT_A]] }>* inalloca nonnull align 4 dereferenceable(1) [[ARGMEM]])
				; ATTRIBUTOR-NEXT: call void @llvm.stackrestore(i8* [[INALLOCA_SAVE]])
				; ATTRIBUTOR-NEXT: ret void
				;
	%inalloca.save = tail call i8* @llvm.stacksave()			%inalloca.save = tail call i8* @llvm.stacksave()
	%argmem = alloca inalloca <{ %struct.a }>, align 4			%argmem = alloca inalloca <{ %struct.a }>, align 4
	call x86_thiscallcc void @internalfun(%struct.a* %a, <{ %struct.a }>* inalloca %argmem)			call x86_thiscallcc void @internalfun(%struct.a* %a, <{ %struct.a }>* inalloca %argmem)
	call void @llvm.stackrestore(i8* %inalloca.save)			call void @llvm.stackrestore(i8* %inalloca.save)
	ret void			ret void
	}			}

	declare x86_thiscallcc %struct.a* @copy_ctor(%struct.a* returned, %struct.a* dereferenceable(1))			declare x86_thiscallcc %struct.a* @copy_ctor(%struct.a* returned, %struct.a* dereferenceable(1))
	declare void @ext(<{ %struct.a }>* inalloca)			declare void @ext(<{ %struct.a }>* inalloca)
	declare i8* @llvm.stacksave()			declare i8* @llvm.stacksave()
	declare void @llvm.stackrestore(i8*)			declare void @llvm.stackrestore(i8*)

llvm/test/Transforms/ArgumentPromotion/aggregate-promote.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -argpromotion -S | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt < %s -argpromotion -S | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
	; RUN: opt < %s -passes=argpromotion -S | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt < %s -passes=argpromotion -S | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
				; RUN: opt -S -passes=attributor -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=5 < %s | FileCheck %s --check-prefixes=ALL,ATTRIBUTOR

	%T = type { i32, i32, i32, i32 }			%T = type { i32, i32, i32, i32 }
	@G = constant %T { i32 0, i32 0, i32 17, i32 25 }			@G = constant %T { i32 0, i32 0, i32 17, i32 25 }

	define internal i32 @test(%T* %p) {			define internal i32 @test(%T* %p) {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@test			; ARGPROMOTION-LABEL: define {{[^@]+}}@test
	; ARGPROMOTION-SAME: (i32 [[P_0_2_VAL:%.*]], i32 [[P_0_3_VAL:%.*]])			; ARGPROMOTION-SAME: (i32 [[P_0_2_VAL:%.*]], i32 [[P_0_3_VAL:%.*]])
	; ARGPROMOTION-NEXT: entry:			; ARGPROMOTION-NEXT: entry:
	; ARGPROMOTION-NEXT: [[V:%.*]] = add i32 [[P_0_3_VAL]], [[P_0_2_VAL]]			; ARGPROMOTION-NEXT: [[V:%.*]] = add i32 [[P_0_3_VAL]], [[P_0_2_VAL]]
	; ARGPROMOTION-NEXT: ret i32 [[V]]			; ARGPROMOTION-NEXT: ret i32 [[V]]
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@test
				; ATTRIBUTOR-SAME: (%T* nocapture nonnull readonly align 8 dereferenceable(16) [[P:%.*]])
				; ATTRIBUTOR-NEXT: entry:
				; ATTRIBUTOR-NEXT: [[A_GEP:%.*]] = getelementptr [[T:%.*]], %T* @G, i64 0, i32 3
				; ATTRIBUTOR-NEXT: [[B_GEP:%.*]] = getelementptr [[T]], %T* @G, i64 0, i32 2
				; ATTRIBUTOR-NEXT: [[A:%.*]] = load i32, i32* [[A_GEP]]
				; ATTRIBUTOR-NEXT: [[B:%.*]] = load i32, i32* [[B_GEP]]
				; ATTRIBUTOR-NEXT: [[V:%.*]] = add i32 [[A]], [[B]]
				; ATTRIBUTOR-NEXT: ret i32 [[V]]
				;
	entry:			entry:
	%a.gep = getelementptr %T, %T* %p, i64 0, i32 3			%a.gep = getelementptr %T, %T* %p, i64 0, i32 3
	%b.gep = getelementptr %T, %T* %p, i64 0, i32 2			%b.gep = getelementptr %T, %T* %p, i64 0, i32 2
	%a = load i32, i32* %a.gep			%a = load i32, i32* %a.gep
	%b = load i32, i32* %b.gep			%b = load i32, i32* %b.gep
	%v = add i32 %a, %b			%v = add i32 %a, %b
	ret i32 %v			ret i32 %v
	}			}

	define i32 @caller() {			define i32 @caller() {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@caller()			; ARGPROMOTION-LABEL: define {{[^@]+}}@caller()
	; ARGPROMOTION-NEXT: entry:			; ARGPROMOTION-NEXT: entry:
	; ARGPROMOTION-NEXT: [[G_IDX:%.*]] = getelementptr [[T:%.*]], %T* @G, i64 0, i32 2			; ARGPROMOTION-NEXT: [[G_IDX:%.*]] = getelementptr [[T:%.*]], %T* @G, i64 0, i32 2
	; ARGPROMOTION-NEXT: [[G_IDX_VAL:%.*]] = load i32, i32* [[G_IDX]]			; ARGPROMOTION-NEXT: [[G_IDX_VAL:%.*]] = load i32, i32* [[G_IDX]]
	; ARGPROMOTION-NEXT: [[G_IDX1:%.*]] = getelementptr [[T]], %T* @G, i64 0, i32 3			; ARGPROMOTION-NEXT: [[G_IDX1:%.*]] = getelementptr [[T]], %T* @G, i64 0, i32 3
	; ARGPROMOTION-NEXT: [[G_IDX1_VAL:%.*]] = load i32, i32* [[G_IDX1]]			; ARGPROMOTION-NEXT: [[G_IDX1_VAL:%.*]] = load i32, i32* [[G_IDX1]]
	; ARGPROMOTION-NEXT: [[V:%.*]] = call i32 @test(i32 [[G_IDX_VAL]], i32 [[G_IDX1_VAL]])			; ARGPROMOTION-NEXT: [[V:%.*]] = call i32 @test(i32 [[G_IDX_VAL]], i32 [[G_IDX1_VAL]])
	; ARGPROMOTION-NEXT: ret i32 [[V]]			; ARGPROMOTION-NEXT: ret i32 [[V]]
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@caller()
				; ATTRIBUTOR-NEXT: entry:
				; ATTRIBUTOR-NEXT: [[V:%.*]] = call i32 @test(%T* nonnull align 8 dereferenceable(16) @G)
				; ATTRIBUTOR-NEXT: ret i32 [[V]]
				;
	entry:			entry:
	%v = call i32 @test(%T* @G)			%v = call i32 @test(%T* @G)
	ret i32 %v			ret i32 %v
	}			}

llvm/test/Transforms/ArgumentPromotion/attrs.ll

	[17 lines not shown]
	; ARGPROMOTION-NEXT: [[TMP:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[B]], i32 0, i32 0			; ARGPROMOTION-NEXT: [[TMP:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[B]], i32 0, i32 0
	; ARGPROMOTION-NEXT: [[TMP1:%.*]] = load i32, i32* [[TMP]], align 4			; ARGPROMOTION-NEXT: [[TMP1:%.*]] = load i32, i32* [[TMP]], align 4
	; ARGPROMOTION-NEXT: [[TMP2:%.*]] = add i32 [[TMP1]], 1			; ARGPROMOTION-NEXT: [[TMP2:%.*]] = add i32 [[TMP1]], 1
	; ARGPROMOTION-NEXT: store i32 [[TMP2]], i32* [[TMP]], align 4			; ARGPROMOTION-NEXT: store i32 [[TMP2]], i32* [[TMP]], align 4
	; ARGPROMOTION-NEXT: store i32 0, i32* [[X]]			; ARGPROMOTION-NEXT: store i32 0, i32* [[X]]
	; ARGPROMOTION-NEXT: ret void			; ARGPROMOTION-NEXT: ret void
	;			;
	; ATTRIBUTOR-LABEL: define {{[^@]+}}@f			; ATTRIBUTOR-LABEL: define {{[^@]+}}@f
	; ATTRIBUTOR-SAME: (%struct.ss* noalias nocapture nonnull byval align 8 dereferenceable(12) [[B:%.*]], i32* nocapture nonnull writeonly byval dereferenceable(4) [[X:%.*]])			; ATTRIBUTOR-SAME: (i32 [[TMP0:%.*]], i64 [[TMP1:%.*]], i32 [[TMP2:%.*]])
	; ATTRIBUTOR-NEXT: entry:			; ATTRIBUTOR-NEXT: entry:
	; ATTRIBUTOR-NEXT: [[TMP:%.*]] = getelementptr [[STRUCT_SS:%.*]], %struct.ss* [[B]], i32 0, i32 0			; ATTRIBUTOR-NEXT: [[X_PRIV:%.*]] = alloca i32
				; ATTRIBUTOR-NEXT: store i32 [[TMP2]], i32* [[X_PRIV]]
				; ATTRIBUTOR-NEXT: [[B_PRIV:%.*]] = alloca [[STRUCT_SS:%.*]]
				; ATTRIBUTOR-NEXT: [[B_PRIV_CAST:%.*]] = bitcast %struct.ss* [[B_PRIV]] to i32*
				; ATTRIBUTOR-NEXT: store i32 [[TMP0]], i32* [[B_PRIV_CAST]]
				; ATTRIBUTOR-NEXT: [[B_PRIV_0_1:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[B_PRIV]], i32 0, i32 1
				; ATTRIBUTOR-NEXT: store i64 [[TMP1]], i64* [[B_PRIV_0_1]]
				; ATTRIBUTOR-NEXT: [[TMP:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[B_PRIV]], i32 0, i32 0
	; ATTRIBUTOR-NEXT: [[TMP1:%.*]] = load i32, i32* [[TMP]], align 8			; ATTRIBUTOR-NEXT: [[TMP1:%.*]] = load i32, i32* [[TMP]], align 8
	; ATTRIBUTOR-NEXT: [[TMP2:%.*]] = add i32 [[TMP1]], 1			; ATTRIBUTOR-NEXT: [[TMP2:%.*]] = add i32 [[TMP1]], 1
	; ATTRIBUTOR-NEXT: store i32 [[TMP2]], i32* [[TMP]], align 8			; ATTRIBUTOR-NEXT: store i32 [[TMP2]], i32* [[TMP]], align 8
	; ATTRIBUTOR-NEXT: store i32 0, i32* [[X]]			; ATTRIBUTOR-NEXT: store i32 0, i32* [[X_PRIV]]
	; ATTRIBUTOR-NEXT: ret void			; ATTRIBUTOR-NEXT: ret void
	;			;
	entry:			entry:

	%tmp = getelementptr %struct.ss, %struct.ss* %b, i32 0, i32 0			%tmp = getelementptr %struct.ss, %struct.ss* %b, i32 0, i32 0
	%tmp1 = load i32, i32* %tmp, align 4			%tmp1 = load i32, i32* %tmp, align 4
	%tmp2 = add i32 %tmp1, 1			%tmp2 = add i32 %tmp1, 1
	store i32 %tmp2, i32* %tmp, align 4			store i32 %tmp2, i32* %tmp, align 4
	[22 lines not shown]
	; ATTRIBUTOR-LABEL: define {{[^@]+}}@test			; ATTRIBUTOR-LABEL: define {{[^@]+}}@test
	; ATTRIBUTOR-SAME: (i32* nocapture writeonly [[X:%.*]])			; ATTRIBUTOR-SAME: (i32* nocapture writeonly [[X:%.*]])
	; ATTRIBUTOR-NEXT: entry:			; ATTRIBUTOR-NEXT: entry:
	; ATTRIBUTOR-NEXT: [[S:%.*]] = alloca [[STRUCT_SS:%.*]]			; ATTRIBUTOR-NEXT: [[S:%.*]] = alloca [[STRUCT_SS:%.*]]
	; ATTRIBUTOR-NEXT: [[TMP1:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[S]], i32 0, i32 0			; ATTRIBUTOR-NEXT: [[TMP1:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[S]], i32 0, i32 0
	; ATTRIBUTOR-NEXT: store i32 1, i32* [[TMP1]], align 8			; ATTRIBUTOR-NEXT: store i32 1, i32* [[TMP1]], align 8
	; ATTRIBUTOR-NEXT: [[TMP4:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[S]], i32 0, i32 1			; ATTRIBUTOR-NEXT: [[TMP4:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[S]], i32 0, i32 1
	; ATTRIBUTOR-NEXT: store i64 2, i64* [[TMP4]], align 4			; ATTRIBUTOR-NEXT: store i64 2, i64* [[TMP4]], align 4
	; ATTRIBUTOR-NEXT: call void @f(%struct.ss* noalias nocapture nonnull byval align 8 dereferenceable(12) [[S]], i32* nocapture writeonly byval [[X]])			; ATTRIBUTOR-NEXT: [[S_CAST:%.*]] = bitcast %struct.ss* [[S]] to i32*
				; ATTRIBUTOR-NEXT: [[TMP0:%.*]] = load i32, i32* [[S_CAST]]
				; ATTRIBUTOR-NEXT: [[S_0_1:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[S]], i32 0, i32 1
				; ATTRIBUTOR-NEXT: [[TMP1:%.*]] = load i64, i64* [[S_0_1]]
				; ATTRIBUTOR-NEXT: [[TMP2:%.*]] = load i32, i32* [[X]]
				; ATTRIBUTOR-NEXT: call void @f(i32 [[TMP0]], i64 [[TMP1]], i32 [[TMP2]])
	; ATTRIBUTOR-NEXT: ret i32 0			; ATTRIBUTOR-NEXT: ret i32 0
	;			;
	entry:			entry:
	%S = alloca %struct.ss			%S = alloca %struct.ss
	%tmp1 = getelementptr %struct.ss, %struct.ss* %S, i32 0, i32 0			%tmp1 = getelementptr %struct.ss, %struct.ss* %S, i32 0, i32 0
	store i32 1, i32* %tmp1, align 8			store i32 1, i32* %tmp1, align 8
	%tmp4 = getelementptr %struct.ss, %struct.ss* %S, i32 0, i32 1			%tmp4 = getelementptr %struct.ss, %struct.ss* %S, i32 0, i32 1
	store i64 2, i64* %tmp4, align 4			store i64 2, i64* %tmp4, align 4

	call void @f( %struct.ss* byval %S, i32* byval %X, i32 zeroext 0)			call void @f( %struct.ss* byval %S, i32* byval %X, i32 zeroext 0)

	ret i32 0			ret i32 0
	}			}

llvm/test/Transforms/ArgumentPromotion/basictest.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -basicaa -argpromotion -mem2reg -S | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt < %s -basicaa -argpromotion -mem2reg -S | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
				; RUN: opt -S -passes='attributor,function(mem2reg)' -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=7 < %s | FileCheck %s --check-prefixes=ALL,ATTRIBUTOR
	target datalayout = "E-p:64:64:64-a0:0:8-f32:32:32-f64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:32:64-v64:64:64-v128:128:128"			target datalayout = "E-p:64:64:64-a0:0:8-f32:32:32-f64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:32:64-v64:64:64-v128:128:128"

	define internal i32 @test(i32* %X, i32* %Y) {			define internal i32 @test(i32* %X, i32* %Y) {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@test			; ALL-LABEL: define {{[^@]+}}@test
	; ARGPROMOTION-SAME: (i32 [[X_VAL:%.*]], i32 [[Y_VAL:%.*]])			; ALL-SAME: (i32 [[TMP0:%.*]], i32 [[TMP1:%.*]])
	; ARGPROMOTION-NEXT: [[C:%.*]] = add i32 [[X_VAL]], [[Y_VAL]]			; ALL-NEXT: [[C:%.*]] = add i32 [[TMP0]], [[TMP1]]
	; ARGPROMOTION-NEXT: ret i32 [[C]]			; ALL-NEXT: ret i32 [[C]]
	;			;
	%A = load i32, i32* %X			%A = load i32, i32* %X
	%B = load i32, i32* %Y			%B = load i32, i32* %Y
	%C = add i32 %A, %B			%C = add i32 %A, %B
	ret i32 %C			ret i32 %C
	}			}

	define internal i32 @caller(i32* %B) {			define internal i32 @caller(i32* %B) {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@caller			; ALL-LABEL: define {{[^@]+}}@caller
	; ARGPROMOTION-SAME: (i32 [[B_VAL1:%.*]])			; ALL-SAME: (i32 [[TMP0:%.*]])
	; ARGPROMOTION-NEXT: [[C:%.*]] = call i32 @test(i32 1, i32 [[B_VAL1]])			; ALL-NEXT: [[C:%.*]] = call i32 @test(i32 1, i32 [[TMP0]])
	; ARGPROMOTION-NEXT: ret i32 [[C]]			; ALL-NEXT: ret i32 [[C]]
	;			;
	%A = alloca i32			%A = alloca i32
	store i32 1, i32* %A			store i32 1, i32* %A
	%C = call i32 @test(i32* %A, i32* %B)			%C = call i32 @test(i32* %A, i32* %B)
	ret i32 %C			ret i32 %C
	}			}

	define i32 @callercaller() {			define i32 @callercaller() {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@callercaller()			; ALL-LABEL: define {{[^@]+}}@callercaller()
	; ARGPROMOTION-NEXT: [[X:%.*]] = call i32 @caller(i32 2)			; ALL-NEXT: [[X:%.*]] = call i32 @caller(i32 2)
	; ARGPROMOTION-NEXT: ret i32 [[X]]			; ALL-NEXT: ret i32 [[X]]
	;			;
	%B = alloca i32			%B = alloca i32
	store i32 2, i32* %B			store i32 2, i32* %B
	%X = call i32 @caller(i32* %B)			%X = call i32 @caller(i32* %B)
	ret i32 %X			ret i32 %X
	}			}

llvm/test/Transforms/ArgumentPromotion/byval-2.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -argpromotion -S | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt < %s -argpromotion -S | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
	; RUN: opt < %s -passes=argpromotion -S | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt < %s -passes=argpromotion -S | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
				; RUN: opt -S -passes=attributor -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=2 < %s | FileCheck %s --check-prefixes=ALL,ATTRIBUTOR

	; Arg promotion eliminates the struct argument.			; Arg promotion eliminates the struct argument.
	; FIXME: Should it eliminate the i32* argument?			; FIXME: Should it eliminate the i32* argument?
				; The attributor eliminates the i32*.

	%struct.ss = type { i32, i64 }			%struct.ss = type { i32, i64 }

	define internal void @f(%struct.ss* byval %b, i32* byval %X) nounwind {			define internal void @f(%struct.ss* byval %b, i32* byval %X) nounwind {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@f			; ARGPROMOTION-LABEL: define {{[^@]+}}@f
	; ARGPROMOTION-SAME: (i32 [[B_0:%.*]], i64 [[B_1:%.*]], i32* byval [[X:%.*]])			; ARGPROMOTION-SAME: (i32 [[B_0:%.*]], i64 [[B_1:%.*]], i32* byval [[X:%.*]])
	; ARGPROMOTION-NEXT: entry:			; ARGPROMOTION-NEXT: entry:
	; ARGPROMOTION-NEXT: [[B:%.*]] = alloca [[STRUCT_SS:%.*]]			; ARGPROMOTION-NEXT: [[B:%.*]] = alloca [[STRUCT_SS:%.*]]
	; ARGPROMOTION-NEXT: [[DOT0:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[B]], i32 0, i32 0			; ARGPROMOTION-NEXT: [[DOT0:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[B]], i32 0, i32 0
	; ARGPROMOTION-NEXT: store i32 [[B_0]], i32* [[DOT0]]			; ARGPROMOTION-NEXT: store i32 [[B_0]], i32* [[DOT0]]
	; ARGPROMOTION-NEXT: [[DOT1:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[B]], i32 0, i32 1			; ARGPROMOTION-NEXT: [[DOT1:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[B]], i32 0, i32 1
	; ARGPROMOTION-NEXT: store i64 [[B_1]], i64* [[DOT1]]			; ARGPROMOTION-NEXT: store i64 [[B_1]], i64* [[DOT1]]
	; ARGPROMOTION-NEXT: [[TMP:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[B]], i32 0, i32 0			; ARGPROMOTION-NEXT: [[TMP:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[B]], i32 0, i32 0
	; ARGPROMOTION-NEXT: [[TMP1:%.*]] = load i32, i32* [[TMP]], align 4			; ARGPROMOTION-NEXT: [[TMP1:%.*]] = load i32, i32* [[TMP]], align 4
	; ARGPROMOTION-NEXT: [[TMP2:%.*]] = add i32 [[TMP1]], 1			; ARGPROMOTION-NEXT: [[TMP2:%.*]] = add i32 [[TMP1]], 1
	; ARGPROMOTION-NEXT: store i32 [[TMP2]], i32* [[TMP]], align 4			; ARGPROMOTION-NEXT: store i32 [[TMP2]], i32* [[TMP]], align 4
	; ARGPROMOTION-NEXT: store i32 0, i32* [[X]]			; ARGPROMOTION-NEXT: store i32 0, i32* [[X]]
	; ARGPROMOTION-NEXT: ret void			; ARGPROMOTION-NEXT: ret void
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@f
				; ATTRIBUTOR-SAME: (i32 [[TMP0:%.*]], i64 [[TMP1:%.*]], i32 [[TMP2:%.*]])
				; ATTRIBUTOR-NEXT: entry:
				; ATTRIBUTOR-NEXT: [[X_PRIV:%.*]] = alloca i32
				; ATTRIBUTOR-NEXT: store i32 [[TMP2]], i32* [[X_PRIV]]
				; ATTRIBUTOR-NEXT: [[B_PRIV:%.*]] = alloca [[STRUCT_SS:%.*]]
				; ATTRIBUTOR-NEXT: [[B_PRIV_CAST:%.*]] = bitcast %struct.ss* [[B_PRIV]] to i32*
				; ATTRIBUTOR-NEXT: store i32 [[TMP0]], i32* [[B_PRIV_CAST]]
				; ATTRIBUTOR-NEXT: [[B_PRIV_0_1:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[B_PRIV]], i32 0, i32 1
				; ATTRIBUTOR-NEXT: store i64 [[TMP1]], i64* [[B_PRIV_0_1]]
				; ATTRIBUTOR-NEXT: [[TMP:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[B_PRIV]], i32 0, i32 0
				; ATTRIBUTOR-NEXT: [[TMP1:%.*]] = load i32, i32* [[TMP]], align 8
				; ATTRIBUTOR-NEXT: [[TMP2:%.*]] = add i32 [[TMP1]], 1
				; ATTRIBUTOR-NEXT: store i32 [[TMP2]], i32* [[TMP]], align 8
				; ATTRIBUTOR-NEXT: store i32 0, i32* [[X_PRIV]]
				; ATTRIBUTOR-NEXT: ret void
				;
	entry:			entry:
	%tmp = getelementptr %struct.ss, %struct.ss* %b, i32 0, i32 0			%tmp = getelementptr %struct.ss, %struct.ss* %b, i32 0, i32 0
	%tmp1 = load i32, i32* %tmp, align 4			%tmp1 = load i32, i32* %tmp, align 4
	%tmp2 = add i32 %tmp1, 1			%tmp2 = add i32 %tmp1, 1
	store i32 %tmp2, i32* %tmp, align 4			store i32 %tmp2, i32* %tmp, align 4

	store i32 0, i32* %X			store i32 0, i32* %X
	ret void			ret void
	[10 lines not shown]
	; ARGPROMOTION-NEXT: store i64 2, i64* [[TMP4]], align 4			; ARGPROMOTION-NEXT: store i64 2, i64* [[TMP4]], align 4
	; ARGPROMOTION-NEXT: [[S_0:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[S]], i32 0, i32 0			; ARGPROMOTION-NEXT: [[S_0:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[S]], i32 0, i32 0
	; ARGPROMOTION-NEXT: [[S_0_VAL:%.*]] = load i32, i32* [[S_0]]			; ARGPROMOTION-NEXT: [[S_0_VAL:%.*]] = load i32, i32* [[S_0]]
	; ARGPROMOTION-NEXT: [[S_1:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[S]], i32 0, i32 1			; ARGPROMOTION-NEXT: [[S_1:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[S]], i32 0, i32 1
	; ARGPROMOTION-NEXT: [[S_1_VAL:%.*]] = load i64, i64* [[S_1]]			; ARGPROMOTION-NEXT: [[S_1_VAL:%.*]] = load i64, i64* [[S_1]]
	; ARGPROMOTION-NEXT: call void @f(i32 [[S_0_VAL]], i64 [[S_1_VAL]], i32* byval [[X]])			; ARGPROMOTION-NEXT: call void @f(i32 [[S_0_VAL]], i64 [[S_1_VAL]], i32* byval [[X]])
	; ARGPROMOTION-NEXT: ret i32 0			; ARGPROMOTION-NEXT: ret i32 0
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@test
				; ATTRIBUTOR-SAME: (i32* nocapture writeonly [[X:%.*]])
				; ATTRIBUTOR-NEXT: entry:
				; ATTRIBUTOR-NEXT: [[S:%.*]] = alloca [[STRUCT_SS:%.*]]
				; ATTRIBUTOR-NEXT: [[TMP1:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[S]], i32 0, i32 0
				; ATTRIBUTOR-NEXT: store i32 1, i32* [[TMP1]], align 8
				; ATTRIBUTOR-NEXT: [[TMP4:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[S]], i32 0, i32 1
				; ATTRIBUTOR-NEXT: store i64 2, i64* [[TMP4]], align 4
				; ATTRIBUTOR-NEXT: [[S_CAST:%.*]] = bitcast %struct.ss* [[S]] to i32*
				; ATTRIBUTOR-NEXT: [[TMP0:%.*]] = load i32, i32* [[S_CAST]]
				; ATTRIBUTOR-NEXT: [[S_0_1:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[S]], i32 0, i32 1
				; ATTRIBUTOR-NEXT: [[TMP1:%.*]] = load i64, i64* [[S_0_1]]
				; ATTRIBUTOR-NEXT: [[TMP2:%.*]] = load i32, i32* [[X]]
				; ATTRIBUTOR-NEXT: call void @f(i32 [[TMP0]], i64 [[TMP1]], i32 [[TMP2]])
				; ATTRIBUTOR-NEXT: ret i32 0
				;
	entry:			entry:
	%S = alloca %struct.ss			%S = alloca %struct.ss
	%tmp1 = getelementptr %struct.ss, %struct.ss* %S, i32 0, i32 0			%tmp1 = getelementptr %struct.ss, %struct.ss* %S, i32 0, i32 0
	store i32 1, i32* %tmp1, align 8			store i32 1, i32* %tmp1, align 8
	%tmp4 = getelementptr %struct.ss, %struct.ss* %S, i32 0, i32 1			%tmp4 = getelementptr %struct.ss, %struct.ss* %S, i32 0, i32 1
	store i64 2, i64* %tmp4, align 4			store i64 2, i64* %tmp4, align 4
	call void @f( %struct.ss* byval %S, i32* byval %X)			call void @f( %struct.ss* byval %S, i32* byval %X)
	ret i32 0			ret i32 0
	}			}

llvm/test/Transforms/ArgumentPromotion/byval.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -argpromotion -S | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt < %s -argpromotion -sroa -S | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
	; RUN: opt < %s -passes=argpromotion -S | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt < %s -passes='argpromotion,function(sroa)' -S | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
				; RUN: opt -S -passes='attributor,function(sroa)' -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=2 < %s | FileCheck %s --check-prefixes=ALL,ATTRIBUTOR

	target datalayout = "E-p:64:64:64-a0:0:8-f32:32:32-f64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:32:64-v64:64:64-v128:128:128"			target datalayout = "E-p:64:64:64-a0:0:8-f32:32:32-f64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:32:64-v64:64:64-v128:128:128"

	%struct.ss = type { i32, i64 }			%struct.ss = type { i32, i64 }

	define internal void @f(%struct.ss* byval %b) nounwind {			define internal void @f(%struct.ss* byval %b) nounwind {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@f			; ALL-LABEL: define {{[^@]+}}@f
	; ARGPROMOTION-SAME: (i32 [[B_0:%.*]], i64 [[B_1:%.*]])			; ALL-SAME: (i32 [[TMP0:%.*]], i64 [[TMP1:%.*]])
	; ARGPROMOTION-NEXT: entry:			; ALL-NEXT: entry:
	; ARGPROMOTION-NEXT: [[B:%.*]] = alloca [[STRUCT_SS:%.*]]			; ALL-NEXT: [[TMP2:%.*]] = add i32 [[TMP0]], 1
	; ARGPROMOTION-NEXT: [[DOT0:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[B]], i32 0, i32 0			; ALL-NEXT: ret void
	; ARGPROMOTION-NEXT: store i32 [[B_0]], i32* [[DOT0]]
	; ARGPROMOTION-NEXT: [[DOT1:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[B]], i32 0, i32 1
	; ARGPROMOTION-NEXT: store i64 [[B_1]], i64* [[DOT1]]
	; ARGPROMOTION-NEXT: [[TMP:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[B]], i32 0, i32 0
	; ARGPROMOTION-NEXT: [[TMP1:%.*]] = load i32, i32* [[TMP]], align 4
	; ARGPROMOTION-NEXT: [[TMP2:%.*]] = add i32 [[TMP1]], 1
	; ARGPROMOTION-NEXT: store i32 [[TMP2]], i32* [[TMP]], align 4
	; ARGPROMOTION-NEXT: ret void
	;			;
	entry:			entry:
	%tmp = getelementptr %struct.ss, %struct.ss* %b, i32 0, i32 0			%tmp = getelementptr %struct.ss, %struct.ss* %b, i32 0, i32 0
	%tmp1 = load i32, i32* %tmp, align 4			%tmp1 = load i32, i32* %tmp, align 4
	%tmp2 = add i32 %tmp1, 1			%tmp2 = add i32 %tmp1, 1
	store i32 %tmp2, i32* %tmp, align 4			store i32 %tmp2, i32* %tmp, align 4
	ret void			ret void
	}			}


	define internal void @g(%struct.ss* byval align 32 %b) nounwind {			define internal void @g(%struct.ss* byval align 32 %b) nounwind {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@g			; ALL-LABEL: define {{[^@]+}}@g
	; ARGPROMOTION-SAME: (i32 [[B_0:%.*]], i64 [[B_1:%.*]])			; ALL-SAME: (i32 [[TMP0:%.*]], i64 [[TMP1:%.*]])
	; ARGPROMOTION-NEXT: entry:			; ALL-NEXT: entry:
	; ARGPROMOTION-NEXT: [[B:%.*]] = alloca [[STRUCT_SS:%.*]], align 32			; ALL-NEXT: [[TMP2:%.*]] = add i32 [[TMP0]], 1
	; ARGPROMOTION-NEXT: [[DOT0:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[B]], i32 0, i32 0			; ALL-NEXT: ret void
	; ARGPROMOTION-NEXT: store i32 [[B_0]], i32* [[DOT0]]
	; ARGPROMOTION-NEXT: [[DOT1:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[B]], i32 0, i32 1
	; ARGPROMOTION-NEXT: store i64 [[B_1]], i64* [[DOT1]]
	; ARGPROMOTION-NEXT: [[TMP:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[B]], i32 0, i32 0
	; ARGPROMOTION-NEXT: [[TMP1:%.*]] = load i32, i32* [[TMP]], align 4
	; ARGPROMOTION-NEXT: [[TMP2:%.*]] = add i32 [[TMP1]], 1
	; ARGPROMOTION-NEXT: store i32 [[TMP2]], i32* [[TMP]], align 4
	; ARGPROMOTION-NEXT: ret void
	;			;
	entry:			entry:
	%tmp = getelementptr %struct.ss, %struct.ss* %b, i32 0, i32 0			%tmp = getelementptr %struct.ss, %struct.ss* %b, i32 0, i32 0
	%tmp1 = load i32, i32* %tmp, align 4			%tmp1 = load i32, i32* %tmp, align 4
	%tmp2 = add i32 %tmp1, 1			%tmp2 = add i32 %tmp1, 1
	store i32 %tmp2, i32* %tmp, align 4			store i32 %tmp2, i32* %tmp, align 4
	ret void			ret void
	}			}


	define i32 @main() nounwind {			define i32 @main() nounwind {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@main()			; ALL-LABEL: define {{[^@]+}}@main()
	; ARGPROMOTION-NEXT: entry:			; ALL-NEXT: entry:
	; ARGPROMOTION-NEXT: [[S:%.*]] = alloca [[STRUCT_SS:%.*]]			; ALL-NEXT: call void @f(i32 1, i64 2)
	; ARGPROMOTION-NEXT: [[TMP1:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[S]], i32 0, i32 0			; ALL-NEXT: call void @g(i32 1, i64 2)
	; ARGPROMOTION-NEXT: store i32 1, i32* [[TMP1]], align 8			; ALL-NEXT: ret i32 0
	; ARGPROMOTION-NEXT: [[TMP4:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[S]], i32 0, i32 1
	; ARGPROMOTION-NEXT: store i64 2, i64* [[TMP4]], align 4
	; ARGPROMOTION-NEXT: [[S_0:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[S]], i32 0, i32 0
	; ARGPROMOTION-NEXT: [[S_0_VAL:%.*]] = load i32, i32* [[S_0]]
	; ARGPROMOTION-NEXT: [[S_1:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[S]], i32 0, i32 1
	; ARGPROMOTION-NEXT: [[S_1_VAL:%.*]] = load i64, i64* [[S_1]]
	; ARGPROMOTION-NEXT: call void @f(i32 [[S_0_VAL]], i64 [[S_1_VAL]])
	; ARGPROMOTION-NEXT: [[S_01:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[S]], i32 0, i32 0
	; ARGPROMOTION-NEXT: [[S_01_VAL:%.*]] = load i32, i32* [[S_01]]
	; ARGPROMOTION-NEXT: [[S_12:%.*]] = getelementptr [[STRUCT_SS]], %struct.ss* [[S]], i32 0, i32 1
	; ARGPROMOTION-NEXT: [[S_12_VAL:%.*]] = load i64, i64* [[S_12]]
	; ARGPROMOTION-NEXT: call void @g(i32 [[S_01_VAL]], i64 [[S_12_VAL]])
	; ARGPROMOTION-NEXT: ret i32 0
	;			;
	entry:			entry:
	%S = alloca %struct.ss			%S = alloca %struct.ss
	%tmp1 = getelementptr %struct.ss, %struct.ss* %S, i32 0, i32 0			%tmp1 = getelementptr %struct.ss, %struct.ss* %S, i32 0, i32 0
	store i32 1, i32* %tmp1, align 8			store i32 1, i32* %tmp1, align 8
	%tmp4 = getelementptr %struct.ss, %struct.ss* %S, i32 0, i32 1			%tmp4 = getelementptr %struct.ss, %struct.ss* %S, i32 0, i32 1
	store i64 2, i64* %tmp4, align 4			store i64 2, i64* %tmp4, align 4
	call void @f(%struct.ss* byval %S) nounwind			call void @f(%struct.ss* byval %S) nounwind
	call void @g(%struct.ss* byval %S) nounwind			call void @g(%struct.ss* byval %S) nounwind
	ret i32 0			ret i32 0
	}			}

llvm/test/Transforms/ArgumentPromotion/chained.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -argpromotion -S | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt < %s -argpromotion -S | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
	; RUN: opt < %s -passes=argpromotion -S | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt < %s -passes=argpromotion -S | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
				; RUN: opt -S -passes=attributor -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=5 < %s | FileCheck %s --check-prefixes=ALL,ATTRIBUTOR

	@G1 = constant i32 0			@G1 = constant i32 0
	@G2 = constant i32* @G1			@G2 = constant i32* @G1

	define internal i32 @test(i32** %x) {			define internal i32 @test(i32** %x) {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@test			; ARGPROMOTION-LABEL: define {{[^@]+}}@test
	; ARGPROMOTION-SAME: (i32 [[X_VAL_VAL:%.*]])			; ARGPROMOTION-SAME: (i32 [[X_VAL_VAL:%.*]])
	; ARGPROMOTION-NEXT: entry:			; ARGPROMOTION-NEXT: entry:
	; ARGPROMOTION-NEXT: ret i32 [[X_VAL_VAL]]			; ARGPROMOTION-NEXT: ret i32 [[X_VAL_VAL]]
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@test
				; ATTRIBUTOR-SAME: (i32** nocapture nonnull readonly align 8 dereferenceable(8) [[X:%.*]])
				; ATTRIBUTOR-NEXT: entry:
				; ATTRIBUTOR-NEXT: [[Y:%.*]] = load i32*, i32** @G2, align 8
				uenoku: Please add FIXME here for `AAValueSimplify`.
				; ATTRIBUTOR-NEXT: [[Z:%.*]] = load i32, i32* [[Y]]
				; ATTRIBUTOR-NEXT: ret i32 [[Z]]
				;
	entry:			entry:
	%y = load i32*, i32** %x			%y = load i32*, i32** %x
	%z = load i32, i32* %y			%z = load i32, i32* %y
	ret i32 %z			ret i32 %z
	}			}

	define i32 @caller() {			define i32 @caller() {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@caller()			; ARGPROMOTION-LABEL: define {{[^@]+}}@caller()
	; ARGPROMOTION-NEXT: entry:			; ARGPROMOTION-NEXT: entry:
	; ARGPROMOTION-NEXT: [[G2_VAL:%.*]] = load i32*, i32** @G2			; ARGPROMOTION-NEXT: [[G2_VAL:%.*]] = load i32*, i32** @G2
	; ARGPROMOTION-NEXT: [[G2_VAL_VAL:%.*]] = load i32, i32* [[G2_VAL]]			; ARGPROMOTION-NEXT: [[G2_VAL_VAL:%.*]] = load i32, i32* [[G2_VAL]]
	; ARGPROMOTION-NEXT: [[X:%.*]] = call i32 @test(i32 [[G2_VAL_VAL]])			; ARGPROMOTION-NEXT: [[X:%.*]] = call i32 @test(i32 [[G2_VAL_VAL]])
	; ARGPROMOTION-NEXT: ret i32 [[X]]			; ARGPROMOTION-NEXT: ret i32 [[X]]
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@caller()
				; ATTRIBUTOR-NEXT: entry:
				; ATTRIBUTOR-NEXT: [[X:%.*]] = call i32 @test(i32** nonnull align 8 dereferenceable(8) @G2)
				; ATTRIBUTOR-NEXT: ret i32 [[X]]
				;
	entry:			entry:
	%x = call i32 @test(i32** @G2)			%x = call i32 @test(i32** @G2)
	ret i32 %x			ret i32 %x
	}			}

llvm/test/Transforms/ArgumentPromotion/control-flow.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -argpromotion -S | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt < %s -argpromotion -S | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
	; RUN: opt < %s -passes=argpromotion -S | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt < %s -passes=argpromotion -S | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
				; RUN: opt -S -passes=attributor -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=5 < %s | FileCheck %s --check-prefixes=ALL,ATTRIBUTOR

	; Don't promote around control flow.			; Don't promote around control flow.
	define internal i32 @callee(i1 %C, i32* %P) {			define internal i32 @callee(i1 %C, i32* nocapture readonly %P) {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@callee			; ALL-LABEL: define {{[^@]+}}@callee
	; ARGPROMOTION-SAME: (i1 [[C:%.*]], i32* [[P:%.*]])			; ALL-SAME: (i1 [[C:%.*]], i32* nocapture readonly [[P:%.*]])
	; ARGPROMOTION-NEXT: entry:			; ALL-NEXT: entry:
	; ARGPROMOTION-NEXT: br i1 [[C]], label [[T:%.*]], label [[F:%.*]]			; ALL-NEXT: br i1 [[C]], label [[T:%.*]], label [[F:%.*]]
	; ARGPROMOTION: T:			; ALL: T:
	; ARGPROMOTION-NEXT: ret i32 17			; ALL-NEXT: ret i32 17
	; ARGPROMOTION: F:			; ALL: F:
	; ARGPROMOTION-NEXT: [[X:%.*]] = load i32, i32* [[P]]			; ALL-NEXT: [[X:%.*]] = load i32, i32* [[P]]
	; ARGPROMOTION-NEXT: ret i32 [[X]]			; ALL-NEXT: ret i32 [[X]]
	;			;
	entry:			entry:
	br i1 %C, label %T, label %F			br i1 %C, label %T, label %F

	T:			T:
	ret i32 17			ret i32 17

	F:			F:
	%X = load i32, i32* %P			%X = load i32, i32* %P
	ret i32 %X			ret i32 %X
	}			}

	define i32 @foo() {			define i32 @foo(i1 %C, i32* nocapture readonly %P) {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@foo()			; ALL-LABEL: define {{[^@]+}}@foo
	; ARGPROMOTION-NEXT: entry:			; ALL-SAME: (i1 [[C:%.*]], i32* nocapture readonly [[P:%.*]])
	; ARGPROMOTION-NEXT: [[X:%.*]] = call i32 @callee(i1 true, i32* null)			; ALL-NEXT: entry:
	; ARGPROMOTION-NEXT: ret i32 [[X]]			; ALL-NEXT: [[X:%.*]] = call i32 @callee(i1 [[C]], i32* nocapture readonly [[P]])
				; ALL-NEXT: ret i32 [[X]]
	;			;
	entry:			entry:
	%X = call i32 @callee(i1 true, i32* null)			%X = call i32 @callee(i1 %C, i32* nocapture readonly %P)
	ret i32 %X			ret i32 %X
	}			}

llvm/test/Transforms/ArgumentPromotion/control-flow2.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -argpromotion -S | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt < %s -argpromotion -sroa -S | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
	; RUN: opt < %s -passes=argpromotion -S | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt < %s -passes='argpromotion,function(sroa)' -S | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
				; RUN: opt -S -passes='attributor,function(sroa)' -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=5 < %s | FileCheck %s --check-prefixes=ALL,ATTRIBUTOR

	target datalayout = "E-p:64:64:64-a0:0:8-f32:32:32-f64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:32:64-v64:64:64-v128:128:128"			target datalayout = "E-p:64:64:64-a0:0:8-f32:32:32-f64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:32:64-v64:64:64-v128:128:128"

	define internal i32 @callee(i1 %C, i32* %P) {			define internal i32 @callee(i1 %C, i32* %P) {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@callee			; ARGPROMOTION-LABEL: define {{[^@]+}}@callee
	; ARGPROMOTION-SAME: (i1 [[C:%.*]], i32 [[P_VAL:%.*]])			; ARGPROMOTION-SAME: (i1 [[C:%.*]], i32 [[P_VAL:%.*]])
	; ARGPROMOTION-NEXT: br i1 [[C]], label [[T:%.*]], label [[F:%.*]]			; ARGPROMOTION-NEXT: br i1 [[C]], label [[T:%.*]], label [[F:%.*]]
	; ARGPROMOTION: T:			; ARGPROMOTION: T:
	; ARGPROMOTION-NEXT: ret i32 17			; ARGPROMOTION-NEXT: ret i32 17
	; ARGPROMOTION: F:			; ARGPROMOTION: F:
	; ARGPROMOTION-NEXT: ret i32 [[P_VAL]]			; ARGPROMOTION-NEXT: ret i32 [[P_VAL]]
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@callee
				; ATTRIBUTOR-SAME: (i1 [[C:%.*]])
				; ATTRIBUTOR-NEXT: br i1 [[C]], label [[T:%.*]], label [[F:%.*]]
				; ATTRIBUTOR: T:
				; ATTRIBUTOR-NEXT: unreachable
				; ATTRIBUTOR: F:
				; ATTRIBUTOR-NEXT: unreachable
				;
	br i1 %C, label %T, label %F			br i1 %C, label %T, label %F

	T: ; preds = %0			T: ; preds = %0
	ret i32 17			ret i32 17

	F: ; preds = %0			F: ; preds = %0
	%X = load i32, i32* %P ; <i32> [#uses=1]			%X = load i32, i32* %P ; <i32> [#uses=1]
	ret i32 %X			ret i32 %X
	}			}

	define i32 @foo() {			define i32 @foo(i1 %C) {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@foo()			; ARGPROMOTION-LABEL: define {{[^@]+}}@foo
	; ARGPROMOTION-NEXT: [[A:%.*]] = alloca i32			; ARGPROMOTION-SAME: (i1 [[C:%.*]])
	; ARGPROMOTION-NEXT: store i32 17, i32* [[A]]			; ARGPROMOTION-NEXT: [[X:%.*]] = call i32 @callee(i1 [[C]], i32 17)
	; ARGPROMOTION-NEXT: [[A_VAL:%.*]] = load i32, i32* [[A]]
	; ARGPROMOTION-NEXT: [[X:%.*]] = call i32 @callee(i1 false, i32 [[A_VAL]])
	; ARGPROMOTION-NEXT: ret i32 [[X]]			; ARGPROMOTION-NEXT: ret i32 [[X]]
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@foo
				; ATTRIBUTOR-SAME: (i1 [[C:%.*]])
				; ATTRIBUTOR-NEXT: [[X:%.*]] = call i32 @callee(i1 [[C]])
				; ATTRIBUTOR-NEXT: ret i32 [[X]]
				;
				lebedev.ri: This is why i'm really pushing on using `--check-prefixes=ALL,ARGPROMOTION` from the getgo :[
				jdoerfert: Agreed.
	%A = alloca i32 ; <i32*> [#uses=2]			%A = alloca i32 ; <i32*> [#uses=2]
	store i32 17, i32* %A			store i32 17, i32* %A
	%X = call i32 @callee( i1 false, i32* %A ) ; <i32> [#uses=1]			%X = call i32 @callee( i1 %C, i32* %A ) ; <i32> [#uses=1]
				lebedev.ri: I strongly believe you want to precommit test changes+regeneration first.
				jdoerfert: I will update the test lines after the D68766 update. Do you want me to split test changes, e.g., that remove UB, as well?
	ret i32 %X			ret i32 %X
	}			}

llvm/test/Transforms/ArgumentPromotion/crash.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt -S < %s -inline -argpromotion | FileCheck %s --check-prefixes=ARGPROMOTION,ALL_OLDPM			; RUN: opt -S < %s -inline -argpromotion | FileCheck %s --check-prefixes=ALL,ARGPROMOTION_OLDPM
	; RUN: opt -S < %s -passes=inline,argpromotion | FileCheck %s --check-prefixes=ARGPROMOTION,ALL_NEWPM			; RUN: opt -S < %s -passes=inline,argpromotion | FileCheck %s --check-prefixes=ALL,ARGPROMOTION_NEWPM
				; RUN: opt -S -passes=attributor -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=4 < %s | FileCheck %s --check-prefixes=ALL,ATTRIBUTOR

	%S = type { %S* }			%S = type { %S* }

	; Inlining should nuke the invoke (and any inlined calls) here even with			; Inlining should nuke the invoke (and any inlined calls) here even with
	; argument promotion running along with it.			; argument promotion running along with it.
	define void @zot() personality i32 (...)* @wibble {			define void @zot() personality i32 (...)* @wibble {
	; ALL-LABEL: define {{[^@]+}}@zot() personality i32 (...)* @wibble			; ARGPROMOTION_OLDPM-LABEL: define {{[^@]+}}@zot() personality i32 (...)* @wibble
	; ALL-NEXT: bb:			; ARGPROMOTION_OLDPM-NEXT: bb:
	; ALL-NEXT: unreachable			; ARGPROMOTION_OLDPM-NEXT: unreachable
	; ALL: hoge.exit:			; ARGPROMOTION_OLDPM: hoge.exit:
	; ALL-NEXT: br label [[BB1:%.*]]			; ARGPROMOTION_OLDPM-NEXT: br label [[BB1:%.*]]
	; ALL: bb1:			; ARGPROMOTION_OLDPM: bb1:
	; ALL-NEXT: unreachable			; ARGPROMOTION_OLDPM-NEXT: unreachable
	; ALL: bb2:			; ARGPROMOTION_OLDPM: bb2:
	; ALL-NEXT: [[TMP:%.*]] = landingpad { i8*, i32 }			; ARGPROMOTION_OLDPM-NEXT: [[TMP:%.*]] = landingpad { i8*, i32 }
	; ALL-NEXT: cleanup			; ARGPROMOTION_OLDPM-NEXT: cleanup
	; ALL-NEXT: unreachable			; ARGPROMOTION_OLDPM-NEXT: unreachable
	;			;
	; ARGPROMOTION-LABEL: define {{[^@]+}}@zot() personality i32 (...)* @wibble			; ARGPROMOTION_NEWPM-LABEL: define {{[^@]+}}@zot() personality i32 (...)* @wibble
	; ARGPROMOTION-NEXT: bb:			; ARGPROMOTION_NEWPM-NEXT: bb:
	; ARGPROMOTION-NEXT: unreachable			; ARGPROMOTION_NEWPM-NEXT: unreachable
	; ARGPROMOTION: hoge.exit:			; ARGPROMOTION_NEWPM: hoge.exit:
	; ARGPROMOTION-NEXT: br label [[BB1:%.*]]			; ARGPROMOTION_NEWPM-NEXT: br label [[BB1:%.*]]
	; ARGPROMOTION: bb1:			; ARGPROMOTION_NEWPM: bb1:
	; ARGPROMOTION-NEXT: unreachable			; ARGPROMOTION_NEWPM-NEXT: unreachable
	; ARGPROMOTION: bb2:			; ARGPROMOTION_NEWPM: bb2:
	; ARGPROMOTION-NEXT: [[TMP:%.*]] = landingpad { i8*, i32 }			; ARGPROMOTION_NEWPM-NEXT: [[TMP:%.*]] = landingpad { i8*, i32 }
	; ARGPROMOTION-NEXT: cleanup			; ARGPROMOTION_NEWPM-NEXT: cleanup
	; ARGPROMOTION-NEXT: unreachable			; ARGPROMOTION_NEWPM-NEXT: unreachable
				;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@zot() #0 personality i32 (...)* @wibble
				; ATTRIBUTOR-NEXT: bb:
				; ATTRIBUTOR-NEXT: call void @hoge()
				; ATTRIBUTOR-NEXT: unreachable
				; ATTRIBUTOR: bb.split:
				; ATTRIBUTOR-NEXT: unreachable
				; ATTRIBUTOR: bb1.i2c:
				; ATTRIBUTOR-NEXT: unreachable
				; ATTRIBUTOR: bb1:
				; ATTRIBUTOR-NEXT: unreachable
				; ATTRIBUTOR: bb2:
				; ATTRIBUTOR-NEXT: unreachable
	;			;
	bb:			bb:
	invoke void @hoge()			invoke void @hoge()
	to label %bb1 unwind label %bb2			to label %bb1 unwind label %bb2

	bb1:			bb1:
	unreachable			unreachable

	bb2:			bb2:
	%tmp = landingpad { i8*, i32 }			%tmp = landingpad { i8*, i32 }
	cleanup			cleanup
	unreachable			unreachable
	}			}

	define internal void @hoge() {			define internal void @hoge() {
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@hoge()
				; ATTRIBUTOR-NEXT: bb:
				; ATTRIBUTOR-NEXT: unreachable
				;
	bb:			bb:
	%tmp = call fastcc i8* @spam(i1 (i8*)* @eggs)			%tmp = call fastcc i8* @spam(i1 (i8*)* @eggs)
	%tmp1 = call fastcc i8* @spam(i1 (i8*)* @barney)			%tmp1 = call fastcc i8* @spam(i1 (i8*)* @barney)
	unreachable			unreachable
	}			}

	define internal fastcc i8* @spam(i1 (i8*)* %arg) {			define internal fastcc i8* @spam(i1 (i8*)* %arg) {
	bb:			bb:
	unreachable			unreachable
	}			}

	define internal i1 @eggs(i8* %arg) {			define internal i1 @eggs(i8* %arg) {
	; ARGPROMOTION_NEWPM-LABEL: define {{[^@]+}}@eggs()			; ARGPROMOTION_NEWPM-LABEL: define {{[^@]+}}@eggs()
	; ARGPROMOTION_NEWPM-NEXT: bb:			; ARGPROMOTION_NEWPM-NEXT: bb:
	; ARGPROMOTION_NEWPM-NEXT: unreachable			; ARGPROMOTION_NEWPM-NEXT: unreachable
	;			;
	; ALL_NEWPM-LABEL: define {{[^@]+}}@eggs()
	; ALL_NEWPM-NEXT: bb:
	; ALL_NEWPM-NEXT: unreachable
	;
	bb:			bb:
	%tmp = call zeroext i1 @barney(i8* %arg)			%tmp = call zeroext i1 @barney(i8* %arg)
	unreachable			unreachable
	}			}

	define internal i1 @barney(i8* %arg) {			define internal i1 @barney(i8* %arg) {
	bb:			bb:
	ret i1 undef			ret i1 undef
	}			}

	define i32 @test_inf_promote_caller(i32 %arg) {			define i32 @test_inf_promote_caller(i32 %arg) {
	; ALL-LABEL: define {{[^@]+}}@test_inf_promote_caller			; ARGPROMOTION_OLDPM-LABEL: define {{[^@]+}}@test_inf_promote_caller
	; ALL-SAME: (i32 [[ARG:%.*]])			; ARGPROMOTION_OLDPM-SAME: (i32 [[ARG:%.*]])
	; ALL-NEXT: bb:			; ARGPROMOTION_OLDPM-NEXT: bb:
	; ALL-NEXT: [[TMP:%.*]] = alloca [[S:%.*]]			; ARGPROMOTION_OLDPM-NEXT: [[TMP:%.*]] = alloca [[S:%.*]]
	; ALL-NEXT: [[TMP1:%.*]] = alloca [[S]]			; ARGPROMOTION_OLDPM-NEXT: [[TMP1:%.*]] = alloca [[S]]
	; ALL-NEXT: [[TMP2:%.*]] = call i32 @test_inf_promote_callee(%S* [[TMP]], %S* [[TMP1]])			; ARGPROMOTION_OLDPM-NEXT: [[TMP2:%.*]] = call i32 @test_inf_promote_callee(%S* [[TMP]], %S* [[TMP1]])
	; ALL-NEXT: ret i32 0			; ARGPROMOTION_OLDPM-NEXT: ret i32 0
	;			;
	; ARGPROMOTION-LABEL: define {{[^@]+}}@test_inf_promote_caller			; ARGPROMOTION_NEWPM-LABEL: define {{[^@]+}}@test_inf_promote_caller
	; ARGPROMOTION-SAME: (i32 [[ARG:%.*]])			; ARGPROMOTION_NEWPM-SAME: (i32 [[ARG:%.*]])
	; ARGPROMOTION-NEXT: bb:			; ARGPROMOTION_NEWPM-NEXT: bb:
	; ARGPROMOTION-NEXT: [[TMP:%.*]] = alloca [[S:%.*]]			; ARGPROMOTION_NEWPM-NEXT: [[TMP:%.*]] = alloca [[S:%.*]]
	; ARGPROMOTION-NEXT: [[TMP1:%.*]] = alloca [[S]]			; ARGPROMOTION_NEWPM-NEXT: [[TMP1:%.*]] = alloca [[S]]
	; ARGPROMOTION-NEXT: [[TMP2:%.*]] = call i32 @test_inf_promote_callee(%S* [[TMP]], %S* [[TMP1]])			; ARGPROMOTION_NEWPM-NEXT: [[TMP2:%.*]] = call i32 @test_inf_promote_callee(%S* [[TMP]], %S* [[TMP1]])
	; ARGPROMOTION-NEXT: ret i32 0			; ARGPROMOTION_NEWPM-NEXT: ret i32 0
				;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@test_inf_promote_caller
				; ATTRIBUTOR-SAME: (i32 [[ARG:%.*]])
				; ATTRIBUTOR-NEXT: bb:
				; ATTRIBUTOR-NEXT: unreachable
				; ATTRIBUTOR: bb.split:
				; ATTRIBUTOR-NEXT: unreachable
	;			;
	bb:			bb:
	%tmp = alloca %S			%tmp = alloca %S
	%tmp1 = alloca %S			%tmp1 = alloca %S
	%tmp2 = call i32 @test_inf_promote_callee(%S* %tmp, %S* %tmp1)			%tmp2 = call i32 @test_inf_promote_callee(%S* %tmp, %S* %tmp1)

	ret i32 0			ret i32 0
	}			}

	define internal i32 @test_inf_promote_callee(%S* %arg, %S* %arg1) {			define internal i32 @test_inf_promote_callee(%S* nocapture readonly %arg, %S* nocapture readonly %arg1) {
	; ALL-LABEL: define {{[^@]+}}@test_inf_promote_callee			; ARGPROMOTION_OLDPM-LABEL: define {{[^@]+}}@test_inf_promote_callee
	; ALL-SAME: (%S* [[ARG:%.*]], %S* [[ARG1:%.*]])			; ARGPROMOTION_OLDPM-SAME: (%S* nocapture readonly [[ARG:%.*]], %S* nocapture readonly [[ARG1:%.*]])
	; ALL-NEXT: bb:			; ARGPROMOTION_OLDPM-NEXT: bb:
	; ALL-NEXT: [[TMP:%.*]] = getelementptr [[S:%.*]], %S* [[ARG1]], i32 0, i32 0			; ARGPROMOTION_OLDPM-NEXT: [[TMP:%.*]] = getelementptr [[S:%.*]], %S* [[ARG1]], i32 0, i32 0
	; ALL-NEXT: [[TMP2:%.*]] = load %S*, %S** [[TMP]]			; ARGPROMOTION_OLDPM-NEXT: [[TMP2:%.*]] = load %S*, %S** [[TMP]]
	; ALL-NEXT: [[TMP3:%.*]] = getelementptr [[S]], %S* [[ARG]], i32 0, i32 0			; ARGPROMOTION_OLDPM-NEXT: [[TMP3:%.*]] = getelementptr [[S]], %S* [[ARG]], i32 0, i32 0
	; ALL-NEXT: [[TMP4:%.*]] = load %S*, %S** [[TMP3]]			; ARGPROMOTION_OLDPM-NEXT: [[TMP4:%.*]] = load %S*, %S** [[TMP3]]
	; ALL-NEXT: [[TMP5:%.*]] = call i32 @test_inf_promote_callee(%S* [[TMP4]], %S* [[TMP2]])			; ARGPROMOTION_OLDPM-NEXT: [[TMP5:%.*]] = call i32 @test_inf_promote_callee(%S* [[TMP4]], %S* [[TMP2]])
	; ALL-NEXT: ret i32 0			; ARGPROMOTION_OLDPM-NEXT: unreachable
	;			;
	; ARGPROMOTION-LABEL: define {{[^@]+}}@test_inf_promote_callee			; ARGPROMOTION_NEWPM-LABEL: define {{[^@]+}}@test_inf_promote_callee
	; ARGPROMOTION-SAME: (%S* [[ARG:%.*]], %S* [[ARG1:%.*]])			; ARGPROMOTION_NEWPM-SAME: (%S* nocapture readonly [[ARG:%.*]], %S* nocapture readonly [[ARG1:%.*]])
	; ARGPROMOTION-NEXT: bb:			; ARGPROMOTION_NEWPM-NEXT: bb:
	; ARGPROMOTION-NEXT: [[TMP:%.*]] = getelementptr [[S:%.*]], %S* [[ARG1]], i32 0, i32 0			; ARGPROMOTION_NEWPM-NEXT: [[TMP:%.*]] = getelementptr [[S:%.*]], %S* [[ARG1]], i32 0, i32 0
	; ARGPROMOTION-NEXT: [[TMP2:%.*]] = load %S*, %S** [[TMP]]			; ARGPROMOTION_NEWPM-NEXT: [[TMP2:%.*]] = load %S*, %S** [[TMP]]
	; ARGPROMOTION-NEXT: [[TMP3:%.*]] = getelementptr [[S]], %S* [[ARG]], i32 0, i32 0			; ARGPROMOTION_NEWPM-NEXT: [[TMP3:%.*]] = getelementptr [[S]], %S* [[ARG]], i32 0, i32 0
	; ARGPROMOTION-NEXT: [[TMP4:%.*]] = load %S*, %S** [[TMP3]]			; ARGPROMOTION_NEWPM-NEXT: [[TMP4:%.*]] = load %S*, %S** [[TMP3]]
	; ARGPROMOTION-NEXT: [[TMP5:%.*]] = call i32 @test_inf_promote_callee(%S* [[TMP4]], %S* [[TMP2]])			; ARGPROMOTION_NEWPM-NEXT: [[TMP5:%.*]] = call i32 @test_inf_promote_callee(%S* [[TMP4]], %S* [[TMP2]])
	; ARGPROMOTION-NEXT: ret i32 0			; ARGPROMOTION_NEWPM-NEXT: unreachable
	;			;
	bb:			bb:
	%tmp = getelementptr %S, %S* %arg1, i32 0, i32 0			%tmp = getelementptr %S, %S* %arg1, i32 0, i32 0
	%tmp2 = load %S*, %S** %tmp			%tmp2 = load %S*, %S** %tmp
	%tmp3 = getelementptr %S, %S* %arg, i32 0, i32 0			%tmp3 = getelementptr %S, %S* %arg, i32 0, i32 0
	%tmp4 = load %S*, %S** %tmp3			%tmp4 = load %S*, %S** %tmp3
	%tmp5 = call i32 @test_inf_promote_callee(%S* %tmp4, %S* %tmp2)			%tmp5 = call i32 @test_inf_promote_callee(%S* %tmp4, %S* %tmp2)
				unreachable
	ret i32 0
	}			}

	declare i32 @wibble(...)			declare i32 @wibble(...)

llvm/test/Transforms/ArgumentPromotion/fp80.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -argpromotion -S | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt < %s -argpromotion -S | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
	; RUN: opt < %s -passes=argpromotion -S | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt < %s -passes=argpromotion -S | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
				; RUN: opt -S -passes=attributor -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=3 < %s | FileCheck %s --check-prefixes=ALL,ATTRIBUTOR

	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	%union.u = type { x86_fp80 }			%union.u = type { x86_fp80 }
	%struct.s = type { double, i16, i8, [5 x i8] }			%struct.s = type { double, i16, i8, [5 x i8] }

	@b = internal global %struct.s { double 3.14, i16 9439, i8 25, [5 x i8] undef }, align 16

	%struct.Foo = type { i32, i64 }			%struct.Foo = type { i32, i64 }
	@a = internal global %struct.Foo { i32 1, i64 2 }, align 8

	define void @run() {			define x86_fp80 @run(%struct.Foo* %a, %struct.s* %b, i8* %i8, i64* %i64a, i64* %i64b) {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@run()			; ARGPROMOTION-LABEL: define {{[^@]+}}@run
				; ARGPROMOTION-SAME: (%struct.Foo* [[A:%.*]], %struct.s* [[B:%.*]], i8* [[I8:%.*]], i64* [[I64A:%.*]], i64* [[I64B:%.*]])
	; ARGPROMOTION-NEXT: entry:			; ARGPROMOTION-NEXT: entry:
	; ARGPROMOTION-NEXT: [[TMP0:%.*]] = tail call i8 @UseLongDoubleUnsafely(%union.u* byval align 16 bitcast (%struct.s* @b to %union.u*))			; ARGPROMOTION-NEXT: [[BC:%.*]] = bitcast %struct.s* [[B]] to %union.u*
	; ARGPROMOTION-NEXT: [[DOT0:%.*]] = getelementptr [[UNION_U:%.*]], %union.u* bitcast (%struct.s* @b to %union.u*), i32 0, i32 0			; ARGPROMOTION-NEXT: [[V0:%.*]] = tail call i8 @UseLongDoubleUnsafely(%union.u* byval align 16 [[BC]])
	; ARGPROMOTION-NEXT: [[DOT0_VAL:%.*]] = load x86_fp80, x86_fp80* [[DOT0]]			; ARGPROMOTION-NEXT: store i8 [[V0]], i8* [[I8]]
	; ARGPROMOTION-NEXT: [[TMP1:%.*]] = tail call x86_fp80 @UseLongDoubleSafely(x86_fp80 [[DOT0_VAL]])			; ARGPROMOTION-NEXT: [[BC_0:%.*]] = getelementptr [[UNION_U:%.*]], %union.u* [[BC]], i32 0, i32 0
	; ARGPROMOTION-NEXT: [[TMP2:%.*]] = call i64 @AccessPaddingOfStruct(%struct.Foo* @a)			; ARGPROMOTION-NEXT: [[BC_0_VAL:%.*]] = load x86_fp80, x86_fp80* [[BC_0]]
	; ARGPROMOTION-NEXT: [[TMP3:%.*]] = call i64 @CaptureAStruct(%struct.Foo* @a)			; ARGPROMOTION-NEXT: [[V1:%.*]] = tail call x86_fp80 @UseLongDoubleSafely(x86_fp80 [[BC_0_VAL]])
	; ARGPROMOTION-NEXT: ret void			; ARGPROMOTION-NEXT: [[V2:%.*]] = call i64 @AccessPaddingOfStruct(%struct.Foo* [[A]])
				; ARGPROMOTION-NEXT: store i64 [[V2]], i64* [[I64A]]
				; ARGPROMOTION-NEXT: [[V3:%.*]] = call i64 @CaptureAStruct(%struct.Foo* [[A]])
				; ARGPROMOTION-NEXT: store i64 [[V3]], i64* [[I64B]]
				; ARGPROMOTION-NEXT: ret x86_fp80 [[V1]]
				;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@run
				; ATTRIBUTOR-SAME: (%struct.Foo* [[A:%.*]], %struct.s* nocapture readonly [[B:%.*]], i8* nocapture writeonly [[I8:%.*]], i64* nocapture writeonly [[I64A:%.*]], i64* nocapture readnone [[I64B:%.*]])
				; ATTRIBUTOR-NEXT: entry:
				; ATTRIBUTOR-NEXT: [[BC:%.*]] = bitcast %struct.s* [[B]] to %union.u*
				; ATTRIBUTOR-NEXT: [[V0:%.*]] = tail call i8 @UseLongDoubleUnsafely(%union.u* nocapture readonly byval align 16 [[BC]])
				; ATTRIBUTOR-NEXT: store i8 [[V0]], i8* [[I8]]
				; ATTRIBUTOR-NEXT: [[V2:%.*]] = call i64 @AccessPaddingOfStruct(%struct.Foo* nocapture [[A]])
				; ATTRIBUTOR-NEXT: store i64 [[V2]], i64* [[I64A]]
				; ATTRIBUTOR-NEXT: [[V3:%.*]] = call i64 @CaptureAStruct(%struct.Foo* [[A]])
				; ATTRIBUTOR-NEXT: unreachable
				; ATTRIBUTOR: entry.split:
				; ATTRIBUTOR-NEXT: unreachable
	;			;
	entry:			entry:
	tail call i8 @UseLongDoubleUnsafely(%union.u* byval align 16 bitcast (%struct.s* @b to %union.u*))			%bc = bitcast %struct.s* %b to %union.u*
	tail call x86_fp80 @UseLongDoubleSafely(%union.u* byval align 16 bitcast (%struct.s* @b to %union.u*))			%v0 = tail call i8 @UseLongDoubleUnsafely(%union.u* byval align 16 %bc)
	call i64 @AccessPaddingOfStruct(%struct.Foo* @a)			store i8 %v0, i8* %i8
	call i64 @CaptureAStruct(%struct.Foo* @a)			%v1 = tail call x86_fp80 @UseLongDoubleSafely(%union.u* byval align 16 %bc)
	ret void			%v2 = call i64 @AccessPaddingOfStruct(%struct.Foo* %a)
				store i64 %v2, i64* %i64a
				%v3 = call i64 @CaptureAStruct(%struct.Foo* %a)
				store i64 %v3, i64* %i64b
				ret x86_fp80 %v1
	}			}

	define internal i8 @UseLongDoubleUnsafely(%union.u* byval align 16 %arg) {			define internal i8 @UseLongDoubleUnsafely(%union.u* byval align 16 %arg) {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@UseLongDoubleUnsafely			; ARGPROMOTION-LABEL: define {{[^@]+}}@UseLongDoubleUnsafely
	; ARGPROMOTION-SAME: (%union.u* byval align 16 [[ARG:%.*]])			; ARGPROMOTION-SAME: (%union.u* byval align 16 [[ARG:%.*]])
	; ARGPROMOTION-NEXT: entry:			; ARGPROMOTION-NEXT: entry:
	; ARGPROMOTION-NEXT: [[BITCAST:%.*]] = bitcast %union.u* [[ARG]] to %struct.s*			; ARGPROMOTION-NEXT: [[BITCAST:%.*]] = bitcast %union.u* [[ARG]] to %struct.s*
	; ARGPROMOTION-NEXT: [[GEP:%.*]] = getelementptr inbounds [[STRUCT_S:%.*]], %struct.s* [[BITCAST]], i64 0, i32 2			; ARGPROMOTION-NEXT: [[GEP:%.*]] = getelementptr inbounds [[STRUCT_S:%.*]], %struct.s* [[BITCAST]], i64 0, i32 2
	; ARGPROMOTION-NEXT: [[RESULT:%.*]] = load i8, i8* [[GEP]]			; ARGPROMOTION-NEXT: [[RESULT:%.*]] = load i8, i8* [[GEP]]
	; ARGPROMOTION-NEXT: ret i8 [[RESULT]]			; ARGPROMOTION-NEXT: ret i8 [[RESULT]]
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@UseLongDoubleUnsafely
				; ATTRIBUTOR-SAME: (%union.u* nocapture readonly byval align 16 [[ARG:%.*]])
				; ATTRIBUTOR-NEXT: entry:
				; ATTRIBUTOR-NEXT: [[BITCAST:%.*]] = bitcast %union.u* [[ARG]] to %struct.s*
				; ATTRIBUTOR-NEXT: [[GEP:%.*]] = getelementptr inbounds [[STRUCT_S:%.*]], %struct.s* [[BITCAST]], i64 0, i32 2
				; ATTRIBUTOR-NEXT: [[RESULT:%.*]] = load i8, i8* [[GEP]]
				; ATTRIBUTOR-NEXT: ret i8 [[RESULT]]
				;
	entry:			entry:
	%bitcast = bitcast %union.u* %arg to %struct.s*			%bitcast = bitcast %union.u* %arg to %struct.s*
	%gep = getelementptr inbounds %struct.s, %struct.s* %bitcast, i64 0, i32 2			%gep = getelementptr inbounds %struct.s, %struct.s* %bitcast, i64 0, i32 2
	%result = load i8, i8* %gep			%result = load i8, i8* %gep
	ret i8 %result			ret i8 %result
	}			}

	define internal x86_fp80 @UseLongDoubleSafely(%union.u* byval align 16 %arg) {			define internal x86_fp80 @UseLongDoubleSafely(%union.u* byval align 16 %arg) {
	Show All 13 Lines

	define internal i64 @AccessPaddingOfStruct(%struct.Foo* byval %a) {			define internal i64 @AccessPaddingOfStruct(%struct.Foo* byval %a) {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@AccessPaddingOfStruct			; ARGPROMOTION-LABEL: define {{[^@]+}}@AccessPaddingOfStruct
	; ARGPROMOTION-SAME: (%struct.Foo* byval [[A:%.*]])			; ARGPROMOTION-SAME: (%struct.Foo* byval [[A:%.*]])
	; ARGPROMOTION-NEXT: [[P:%.*]] = bitcast %struct.Foo* [[A]] to i64*			; ARGPROMOTION-NEXT: [[P:%.*]] = bitcast %struct.Foo* [[A]] to i64*
	; ARGPROMOTION-NEXT: [[V:%.*]] = load i64, i64* [[P]]			; ARGPROMOTION-NEXT: [[V:%.*]] = load i64, i64* [[P]]
	; ARGPROMOTION-NEXT: ret i64 [[V]]			; ARGPROMOTION-NEXT: ret i64 [[V]]
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@AccessPaddingOfStruct
				; ATTRIBUTOR-SAME: (%struct.Foo* nocapture readonly byval [[A:%.*]])
				; ATTRIBUTOR-NEXT: [[P:%.*]] = bitcast %struct.Foo* [[A]] to i64*
				; ATTRIBUTOR-NEXT: [[V:%.*]] = load i64, i64* [[P]]
				; ATTRIBUTOR-NEXT: ret i64 [[V]]
				;
	%p = bitcast %struct.Foo* %a to i64*			%p = bitcast %struct.Foo* %a to i64*
	%v = load i64, i64* %p			%v = load i64, i64* %p
	ret i64 %v			ret i64 %v
	}			}

	define internal i64 @CaptureAStruct(%struct.Foo* byval %a) {			define internal i64 @CaptureAStruct(%struct.Foo* byval %a) {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@CaptureAStruct			; ARGPROMOTION-LABEL: define {{[^@]+}}@CaptureAStruct
	; ARGPROMOTION-SAME: (%struct.Foo* byval [[A:%.*]])			; ARGPROMOTION-SAME: (%struct.Foo* byval [[A:%.*]])
	; ARGPROMOTION-NEXT: entry:			; ARGPROMOTION-NEXT: entry:
	; ARGPROMOTION-NEXT: [[A_PTR:%.*]] = alloca %struct.Foo*			; ARGPROMOTION-NEXT: [[A_PTR:%.*]] = alloca %struct.Foo*
	; ARGPROMOTION-NEXT: br label [[LOOP:%.*]]			; ARGPROMOTION-NEXT: br label [[LOOP:%.*]]
	; ARGPROMOTION: loop:			; ARGPROMOTION: loop:
	; ARGPROMOTION-NEXT: [[PHI:%.*]] = phi %struct.Foo* [ null, [[ENTRY:%.*]] ], [ [[GEP:%.*]], [[LOOP]] ]			; ARGPROMOTION-NEXT: [[PHI:%.*]] = phi %struct.Foo* [ null, [[ENTRY:%.*]] ], [ [[GEP:%.*]], [[LOOP]] ]
	; ARGPROMOTION-NEXT: [[TMP0:%.*]] = phi %struct.Foo* [ [[A]], [[ENTRY]] ], [ [[TMP0]], [[LOOP]] ]			; ARGPROMOTION-NEXT: [[TMP0:%.*]] = phi %struct.Foo* [ [[A]], [[ENTRY]] ], [ [[TMP0]], [[LOOP]] ]
	; ARGPROMOTION-NEXT: store %struct.Foo* [[PHI]], %struct.Foo** [[A_PTR]]			; ARGPROMOTION-NEXT: store %struct.Foo* [[PHI]], %struct.Foo** [[A_PTR]]
	; ARGPROMOTION-NEXT: [[GEP]] = getelementptr [[STRUCT_FOO:%.*]], %struct.Foo* [[A]], i64 0			; ARGPROMOTION-NEXT: [[GEP]] = getelementptr [[STRUCT_FOO:%.*]], %struct.Foo* [[A]], i64 0
	; ARGPROMOTION-NEXT: br label [[LOOP]]			; ARGPROMOTION-NEXT: br label [[LOOP]]
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@CaptureAStruct
				; ATTRIBUTOR-SAME: (%struct.Foo* writeonly byval [[A:%.*]])
				; ATTRIBUTOR-NEXT: entry:
				; ATTRIBUTOR-NEXT: [[A_PTR:%.*]] = alloca %struct.Foo*
				; ATTRIBUTOR-NEXT: br label [[LOOP:%.*]]
				; ATTRIBUTOR: loop:
				; ATTRIBUTOR-NEXT: [[PHI:%.*]] = phi %struct.Foo* [ null, [[ENTRY:%.*]] ], [ [[GEP:%.*]], [[LOOP]] ]
				; ATTRIBUTOR-NEXT: [[TMP0:%.*]] = phi %struct.Foo* [ [[A]], [[ENTRY]] ], [ [[TMP0]], [[LOOP]] ]
				; ATTRIBUTOR-NEXT: store %struct.Foo* [[PHI]], %struct.Foo** [[A_PTR]], align 8
				; ATTRIBUTOR-NEXT: [[GEP]] = getelementptr [[STRUCT_FOO:%.*]], %struct.Foo* [[A]], i64 0
				; ATTRIBUTOR-NEXT: br label [[LOOP]]
				;
	entry:			entry:
	%a_ptr = alloca %struct.Foo*			%a_ptr = alloca %struct.Foo*
	br label %loop			br label %loop

	loop:			loop:
	%phi = phi %struct.Foo* [ null, %entry ], [ %gep, %loop ]			%phi = phi %struct.Foo* [ null, %entry ], [ %gep, %loop ]
	%0 = phi %struct.Foo* [ %a, %entry ], [ %0, %loop ]			%0 = phi %struct.Foo* [ %a, %entry ], [ %0, %loop ]
	store %struct.Foo* %phi, %struct.Foo** %a_ptr			store %struct.Foo* %phi, %struct.Foo** %a_ptr
	%gep = getelementptr %struct.Foo, %struct.Foo* %a, i64 0			%gep = getelementptr %struct.Foo, %struct.Foo* %a, i64 0
	br label %loop			br label %loop
	}			}

llvm/test/Transforms/ArgumentPromotion/inalloca.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt %s -globalopt -argpromotion -sroa -S | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt %s -globalopt -argpromotion -sroa -S | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
	; RUN: opt %s -passes='module(globalopt),cgscc(argpromotion),function(sroa)' -S | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt %s -passes='module(globalopt),cgscc(argpromotion),function(sroa)' -S | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
				; RUN: opt -S -passes='attributor,function(sroa)' -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=2 < %s | FileCheck %s --check-prefixes=ALL,ATTRIBUTOR

	target datalayout = "E-p:64:64:64-a0:0:8-f32:32:32-f64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:32:64-v64:64:64-v128:128:128"			target datalayout = "E-p:64:64:64-a0:0:8-f32:32:32-f64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:32:64-v64:64:64-v128:128:128"

	%struct.ss = type { i32, i32 }			%struct.ss = type { i32, i32 }

	; Argpromote + sroa should change this to passing the two integers by value.
	define internal i32 @f(%struct.ss* inalloca %s) {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@f
	; ARGPROMOTION-SAME: (i32 [[S_0_0_VAL:%.*]], i32 [[S_0_1_VAL:%.*]]) unnamed_addr
	; ARGPROMOTION-NEXT: entry:
	; ARGPROMOTION-NEXT: [[R:%.*]] = add i32 [[S_0_0_VAL]], [[S_0_1_VAL]]
	; ARGPROMOTION-NEXT: ret i32 [[R]]
	;
	entry:
	%f0 = getelementptr %struct.ss, %struct.ss* %s, i32 0, i32 0
	%f1 = getelementptr %struct.ss, %struct.ss* %s, i32 0, i32 1
	%a = load i32, i32* %f0, align 4
	%b = load i32, i32* %f1, align 4
	%r = add i32 %a, %b
	ret i32 %r
	}

	define i32 @main() {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@main() local_unnamed_addr
	; ARGPROMOTION-NEXT: entry:
	; ARGPROMOTION-NEXT: [[R:%.*]] = call fastcc i32 @f(i32 1, i32 2)
	; ARGPROMOTION-NEXT: ret i32 [[R]]
	;
	entry:
	%S = alloca inalloca %struct.ss
	%f0 = getelementptr %struct.ss, %struct.ss* %S, i32 0, i32 0
	%f1 = getelementptr %struct.ss, %struct.ss* %S, i32 0, i32 1
	store i32 1, i32* %f0, align 4
	store i32 2, i32* %f1, align 4
	%r = call i32 @f(%struct.ss* inalloca %S)
	ret i32 %r
	}

	; Argpromote can't promote %a because of the icmp use.			; Argpromote can't promote %a because of the icmp use.
	define internal i1 @g(%struct.ss* %a, %struct.ss* inalloca %b) nounwind {			define internal i1 @g(%struct.ss* %a, %struct.ss* inalloca %b) nounwind {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@g			; ARGPROMOTION-LABEL: define {{[^@]+}}@g
	; ARGPROMOTION-SAME: (%struct.ss* [[A:%.*]], %struct.ss* [[B:%.*]]) unnamed_addr			; ARGPROMOTION-SAME: (%struct.ss* [[A:%.*]], %struct.ss* [[B:%.*]]) unnamed_addr
	; ARGPROMOTION-NEXT: entry:			; ARGPROMOTION-NEXT: entry:
	; ARGPROMOTION-NEXT: [[C:%.*]] = icmp eq %struct.ss* [[A]], [[B]]			; ARGPROMOTION-NEXT: [[C:%.*]] = icmp eq %struct.ss* [[A]], [[B]]
	; ARGPROMOTION-NEXT: ret i1 [[C]]			; ARGPROMOTION-NEXT: ret i1 [[C]]
	;			;
	entry:			entry:
	%c = icmp eq %struct.ss* %a, %b			%c = icmp eq %struct.ss* %a, %b
	ret i1 %c			ret i1 %c
	}			}

	define i32 @test() {			define i32 @test() {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@test() local_unnamed_addr			; ARGPROMOTION-LABEL: define {{[^@]+}}@test() local_unnamed_addr
	; ARGPROMOTION-NEXT: entry:			; ARGPROMOTION-NEXT: entry:
	; ARGPROMOTION-NEXT: [[S:%.*]] = alloca inalloca [[STRUCT_SS:%.*]]			; ARGPROMOTION-NEXT: [[S:%.*]] = alloca inalloca [[STRUCT_SS:%.*]]
	; ARGPROMOTION-NEXT: [[C:%.*]] = call fastcc i1 @g(%struct.ss* [[S]], %struct.ss* [[S]])			; ARGPROMOTION-NEXT: [[C:%.*]] = call fastcc i1 @g(%struct.ss* [[S]], %struct.ss* [[S]])
	; ARGPROMOTION-NEXT: ret i32 0			; ARGPROMOTION-NEXT: ret i32 0
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@test()
				; ATTRIBUTOR-NEXT: entry:
				; ATTRIBUTOR-NEXT: ret i32 0
				;
	entry:			entry:
	%S = alloca inalloca %struct.ss			%S = alloca inalloca %struct.ss
	%c = call i1 @g(%struct.ss* %S, %struct.ss* inalloca %S)			%c = call i1 @g(%struct.ss* %S, %struct.ss* inalloca %S)
	ret i32 0			ret i32 0
	}			}

llvm/test/Transforms/ArgumentPromotion/invalidation.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; Check that when argument promotion changes a function in some parent node of			; Check that when argument promotion changes a function in some parent node of
	; the call graph, any analyses that happened to be cached for that function are			; the call graph, any analyses that happened to be cached for that function are
	; actually invalidated. We are using `demanded-bits` here because when printed			; actually invalidated. We are using `demanded-bits` here because when printed
	; it will end up caching a value for every instruction, making it easy to			; it will end up caching a value for every instruction, making it easy to
	; detect the instruction-level changes that will fail here. With improper			; detect the instruction-level changes that will fail here. With improper
	; invalidation this will crash in the second printer as it tries to reuse			; invalidation this will crash in the second printer as it tries to reuse
	; now-invalid demanded bits.			; now-invalid demanded bits.
	;			;
	; RUN: opt < %s -passes='function(print<demanded-bits>),cgscc(argpromotion,function(print<demanded-bits>))' -S | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt < %s -passes='function(print<demanded-bits>),cgscc(argpromotion,function(print<demanded-bits>))' -S | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
				; RUN: opt -S -passes=attributor -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=6 < %s | FileCheck %s --check-prefixes=ALL,ATTRIBUTOR

	@G = constant i32 0			@G = constant i32 0

	define internal i32 @a(i32* %x) {			define internal i32 @a(i32* %x) {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@a			; ARGPROMOTION-LABEL: define {{[^@]+}}@a
	; ARGPROMOTION-SAME: (i32 [[X_VAL:%.*]])			; ARGPROMOTION-SAME: (i32 [[X_VAL:%.*]])
	; ARGPROMOTION-NEXT: entry:			; ARGPROMOTION-NEXT: entry:
	; ARGPROMOTION-NEXT: ret i32 [[X_VAL]]			; ARGPROMOTION-NEXT: ret i32 [[X_VAL]]
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@a
				; ATTRIBUTOR-SAME: (i32* nocapture nonnull readonly align 4 dereferenceable(4) [[X:%.*]])
				; ATTRIBUTOR-NEXT: entry:
				; ATTRIBUTOR-NEXT: [[V:%.*]] = load i32, i32* @G, align 4
				; ATTRIBUTOR-NEXT: ret i32 [[V]]
				;
	entry:			entry:
	%v = load i32, i32* %x			%v = load i32, i32* %x
	ret i32 %v			ret i32 %v
	}			}

	define i32 @b() {			define i32 @b() {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@b()			; ARGPROMOTION-LABEL: define {{[^@]+}}@b()
	; ARGPROMOTION-NEXT: entry:			; ARGPROMOTION-NEXT: entry:
	; ARGPROMOTION-NEXT: [[G_VAL:%.*]] = load i32, i32* @G			; ARGPROMOTION-NEXT: [[G_VAL:%.*]] = load i32, i32* @G
	; ARGPROMOTION-NEXT: [[V:%.*]] = call i32 @a(i32 [[G_VAL]])			; ARGPROMOTION-NEXT: [[V:%.*]] = call i32 @a(i32 [[G_VAL]])
	; ARGPROMOTION-NEXT: ret i32 [[V]]			; ARGPROMOTION-NEXT: ret i32 [[V]]
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@b()
				; ATTRIBUTOR-NEXT: entry:
				; ATTRIBUTOR-NEXT: [[V:%.*]] = call i32 @a(i32* nonnull align 4 dereferenceable(4) @G)
				; ATTRIBUTOR-NEXT: ret i32 [[V]]
				;
	entry:			entry:
	%v = call i32 @a(i32* @G)			%v = call i32 @a(i32* @G)
	ret i32 %v			ret i32 %v
	}			}

	define i32 @c() {			define i32 @c() {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@c()			; ARGPROMOTION-LABEL: define {{[^@]+}}@c()
	; ARGPROMOTION-NEXT: entry:			; ARGPROMOTION-NEXT: entry:
	; ARGPROMOTION-NEXT: [[G_VAL:%.*]] = load i32, i32* @G			; ARGPROMOTION-NEXT: [[G_VAL:%.*]] = load i32, i32* @G
	; ARGPROMOTION-NEXT: [[V1:%.*]] = call i32 @a(i32 [[G_VAL]])			; ARGPROMOTION-NEXT: [[V1:%.*]] = call i32 @a(i32 [[G_VAL]])
	; ARGPROMOTION-NEXT: [[V2:%.*]] = call i32 @b()			; ARGPROMOTION-NEXT: [[V2:%.*]] = call i32 @b()
	; ARGPROMOTION-NEXT: [[RESULT:%.*]] = add i32 [[V1]], [[V2]]			; ARGPROMOTION-NEXT: [[RESULT:%.*]] = add i32 [[V1]], [[V2]]
	; ARGPROMOTION-NEXT: ret i32 [[RESULT]]			; ARGPROMOTION-NEXT: ret i32 [[RESULT]]
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@c()
				; ATTRIBUTOR-NEXT: entry:
				; ATTRIBUTOR-NEXT: [[V1:%.*]] = call i32 @a(i32* nonnull align 4 dereferenceable(4) @G)
				; ATTRIBUTOR-NEXT: [[V2:%.*]] = call i32 @b()
				; ATTRIBUTOR-NEXT: [[RESULT:%.*]] = add i32 [[V1]], [[V2]]
				; ATTRIBUTOR-NEXT: ret i32 [[RESULT]]
				;
	entry:			entry:
	%v1 = call i32 @a(i32* @G)			%v1 = call i32 @a(i32* @G)
	%v2 = call i32 @b()			%v2 = call i32 @b()
	%result = add i32 %v1, %v2			%result = add i32 %v1, %v2
	ret i32 %result			ret i32 %result
	}			}

llvm/test/Transforms/ArgumentPromotion/naked_functions.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -argpromotion -S | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt < %s -argpromotion -S | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
				; RUN: opt -S -passes=attributor -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=1 < %s | FileCheck %s --check-prefixes=ALL,ATTRIBUTOR

	; Don't promote paramaters of/arguments to naked functions			; Don't promote paramaters of/arguments to naked functions

	@g = common global i32 0, align 4			@g = common global i32 0, align 4

	define i32 @bar() {			define i32 @bar() {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@bar()			; ARGPROMOTION-LABEL: define {{[^@]+}}@bar()
	; ARGPROMOTION-NEXT: entry:			; ARGPROMOTION-NEXT: entry:
	; ARGPROMOTION-NEXT: [[CALL:%.*]] = call i32 @foo(i32* @g)			; ARGPROMOTION-NEXT: [[CALL:%.*]] = call i32 @foo(i32* @g)
	; ARGPROMOTION-NEXT: ret i32 [[CALL]]			; ARGPROMOTION-NEXT: ret i32 [[CALL]]
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@bar()
				; ATTRIBUTOR-NEXT: entry:
				; ATTRIBUTOR-NEXT: [[CALL:%.*]] = call i32 @foo(i32* nonnull align 4 dereferenceable(4) @g)
				; ATTRIBUTOR-NEXT: ret i32 [[CALL]]
				;
	entry:			entry:
	%call = call i32 @foo(i32* @g)			%call = call i32 @foo(i32* @g)
	ret i32 %call			ret i32 %call
	}			}

	define internal i32 @foo(i32*) #0 {			define internal i32 @foo(i32*) #0 {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@foo			; ALL-LABEL: define {{[^@]+}}@foo
	; ARGPROMOTION-SAME: (i32* [[TMP0:%.*]])			; ALL-SAME: (i32* [[TMP0:%.*]])
	; ARGPROMOTION-NEXT: entry:			; ALL-NEXT: entry:
	; ARGPROMOTION-NEXT: [[RETVAL:%.*]] = alloca i32, align 4			; ALL-NEXT: [[RETVAL:%.*]] = alloca i32, align 4
	; ARGPROMOTION-NEXT: call void asm sideeffect "ldr r0, [r0] \0Abx lr \0A", ""()			; ALL-NEXT: call void asm sideeffect "ldr r0, [r0] \0Abx lr \0A", ""()
	; ARGPROMOTION-NEXT: unreachable			; ALL-NEXT: unreachable
	;			;
	entry:			entry:
	%retval = alloca i32, align 4			%retval = alloca i32, align 4
	call void asm sideeffect "ldr r0, [r0] \0Abx lr \0A", ""()			call void asm sideeffect "ldr r0, [r0] \0Abx lr \0A", ""()
	unreachable			unreachable
	}			}


	attributes #0 = { naked }			attributes #0 = { naked }

llvm/test/Transforms/ArgumentPromotion/nonzero-address-spaces.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -argpromotion -S | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt < %s -argpromotion -S | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
				; RUN: opt -S -passes=attributor -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=1 < %s | FileCheck %s --check-prefixes=ALL,ATTRIBUTOR

	; ArgumentPromotion should preserve the default function address space			; ArgumentPromotion should preserve the default function address space
	; from the data layout.			; from the data layout.

	target datalayout = "e-P1-p:16:8-i8:8-i16:8-i32:8-i64:8-f32:8-f64:8-n8-a:8"			target datalayout = "e-P1-p:16:8-i8:8-i16:8-i32:8-i64:8-f32:8-f64:8-n8-a:8"

	@g = common global i32 0, align 4			@g = common global i32 0, align 4

	define i32 @bar() {			define i32 @bar() {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@bar() addrspace(1)			; ARGPROMOTION-LABEL: define {{[^@]+}}@bar() addrspace(1)
	; ARGPROMOTION-NEXT: entry:			; ARGPROMOTION-NEXT: entry:
	; ARGPROMOTION-NEXT: [[CALL:%.*]] = call addrspace(1) i32 @foo()			; ARGPROMOTION-NEXT: [[CALL:%.*]] = call addrspace(1) i32 @foo(i32* @g)
	; ARGPROMOTION-NEXT: ret i32 [[CALL]]			; ARGPROMOTION-NEXT: ret i32 [[CALL]]
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@bar() addrspace(1)
				; ATTRIBUTOR-NEXT: entry:
				; ATTRIBUTOR-NEXT: [[CALL:%.*]] = call addrspace(1) i32 @foo(i32* nonnull align 4 dereferenceable(4) @g)
				; ATTRIBUTOR-NEXT: ret i32 [[CALL]]
				;

	entry:			entry:
	%call = call i32 @foo(i32* @g)			%call = call i32 @foo(i32* @g)
	ret i32 %call			ret i32 %call
	}			}

	define internal i32 @foo(i32*) {			define internal i32 @foo(i32*) naked {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@foo() addrspace(1)			; ALL-LABEL: define {{[^@]+}}@foo
	; ARGPROMOTION-NEXT: entry:			; ALL-SAME: (i32* [[TMP0:%.*]]) addrspace(1)
	; ARGPROMOTION-NEXT: [[RETVAL:%.*]] = alloca i32, align 4			; ALL-NEXT: entry:
	; ARGPROMOTION-NEXT: call addrspace(0) void asm sideeffect "ldr r0, [r0] \0Abx lr \0A", ""()			; ALL-NEXT: [[RETVAL:%.*]] = alloca i32, align 4
	; ARGPROMOTION-NEXT: unreachable			; ALL-NEXT: call addrspace(0) void asm sideeffect "ldr r0, [r0] \0Abx lr \0A", ""()
				; ALL-NEXT: unreachable
	;			;
	entry:			entry:
	%retval = alloca i32, align 4			%retval = alloca i32, align 4
	call void asm sideeffect "ldr r0, [r0] \0Abx lr \0A", ""()			call void asm sideeffect "ldr r0, [r0] \0Abx lr \0A", ""()
	unreachable			unreachable
	}			}

llvm/test/Transforms/ArgumentPromotion/pr27568.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt -S -argpromotion < %s | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt -S -argpromotion < %s | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
	; RUN: opt -S -passes=argpromotion < %s | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt -S -passes=argpromotion < %s | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
				; RUN: opt -S -passes=attributor -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=4 < %s | FileCheck %s --check-prefixes=ALL,ATTRIBUTOR
	; RUN: opt -S -debugify -o /dev/null < %s			; RUN: opt -S -debugify -o /dev/null < %s
	target triple = "x86_64-pc-windows-msvc"			target triple = "x86_64-pc-windows-msvc"

	define internal void @callee(i8*) {			define internal void @callee(i8*) {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@callee()			; ALL-LABEL: define {{[^@]+}}@callee()
	; ARGPROMOTION-NEXT: entry:			; ALL-NEXT: entry:
	; ARGPROMOTION-NEXT: call void @thunk()			; ALL-NEXT: call void @thunk()
	; ARGPROMOTION-NEXT: ret void			; ALL-NEXT: ret void
	;			;
	entry:			entry:
	call void @thunk()			call void @thunk()
	ret void			ret void
	}			}

	define void @test1() personality i32 (...)* @__CxxFrameHandler3 {			define void @test1() personality i32 (...)* @__CxxFrameHandler3 {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@test1() personality i32 (...)* @__CxxFrameHandler3			; ALL-LABEL: define {{[^@]+}}@test1() personality i32 (...)* @__CxxFrameHandler3
	; ARGPROMOTION-NEXT: entry:			; ALL-NEXT: entry:
	; ARGPROMOTION-NEXT: invoke void @thunk()			; ALL-NEXT: invoke void @thunk()
	; ARGPROMOTION-NEXT: to label [[OUT:%.*]] unwind label [[CPAD:%.*]]			; ALL-NEXT: to label [[OUT:%.*]] unwind label [[CPAD:%.*]]
	; ARGPROMOTION: out:			; ALL: out:
	; ARGPROMOTION-NEXT: ret void			; ALL-NEXT: ret void
	; ARGPROMOTION: cpad:			; ALL: cpad:
	; ARGPROMOTION-NEXT: [[PAD:%.*]] = cleanuppad within none []			; ALL-NEXT: [[PAD:%.*]] = cleanuppad within none []
	; ARGPROMOTION-NEXT: call void @callee() [ "funclet"(token [[PAD]]) ]			; ALL-NEXT: call void @callee() [ "funclet"(token [[PAD]]) ]
	; ARGPROMOTION-NEXT: cleanupret from [[PAD]] unwind to caller			; ALL-NEXT: cleanupret from [[PAD]] unwind to caller
	;			;
	entry:			entry:
	invoke void @thunk()			invoke void @thunk()
	to label %out unwind label %cpad			to label %out unwind label %cpad

	out:			out:
	ret void			ret void


llvm/test/Transforms/ArgumentPromotion/pr3085.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -disable-output -loop-extract-single -loop-rotate -loop-reduce -argpromotion			; RUN: opt < %s -disable-output -loop-extract-single -loop-rotate -loop-reduce -argpromotion
				; RUN: opt -disable-output -passes=attributor -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=1 < %s
	; PR 3085			; PR 3085

	%struct.Lit = type { i8 }			%struct.Lit = type { i8 }

	define fastcc %struct.Lit* @import_lit(i32 %lit) nounwind {			define fastcc %struct.Lit* @import_lit(i32 %lit) nounwind {
	entry:			entry:
	br i1 false, label %bb, label %bb1			br i1 false, label %bb, label %bb1


llvm/test/Transforms/ArgumentPromotion/pr32917.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -argpromotion -S | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt < %s -argpromotion -S | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
				; RUN: opt -S -passes=attributor -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=4 < %s | FileCheck %s --check-prefixes=ALL,ATTRIBUTOR
	; PR 32917			; PR 32917

	@b = common local_unnamed_addr global i32 0, align 4			@b = common local_unnamed_addr global i32 0, align 4
	@a = common local_unnamed_addr global i32 0, align 4			@a = common local_unnamed_addr global i32 0, align 4

	define i32 @fn2() local_unnamed_addr {			define i32 @fn2() local_unnamed_addr {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@fn2() local_unnamed_addr			; ARGPROMOTION-LABEL: define {{[^@]+}}@fn2() local_unnamed_addr
	; ARGPROMOTION-NEXT: [[TMP1:%.*]] = load i32, i32* @b, align 4			; ARGPROMOTION-NEXT: [[TMP1:%.*]] = load i32, i32* @b, align 4
	; ARGPROMOTION-NEXT: [[TMP2:%.*]] = sext i32 [[TMP1]] to i64			; ARGPROMOTION-NEXT: [[TMP2:%.*]] = sext i32 [[TMP1]] to i64
	; ARGPROMOTION-NEXT: [[TMP3:%.*]] = inttoptr i64 [[TMP2]] to i32*			; ARGPROMOTION-NEXT: [[TMP3:%.*]] = inttoptr i64 [[TMP2]] to i32*
	; ARGPROMOTION-NEXT: [[DOTIDX:%.*]] = getelementptr i32, i32* [[TMP3]], i64 -1			; ARGPROMOTION-NEXT: [[DOTIDX:%.*]] = getelementptr i32, i32* [[TMP3]], i64 -1
	; ARGPROMOTION-NEXT: [[DOTIDX_VAL:%.*]] = load i32, i32* [[DOTIDX]], align 4			; ARGPROMOTION-NEXT: [[DOTIDX_VAL:%.*]] = load i32, i32* [[DOTIDX]], align 4
	; ARGPROMOTION-NEXT: call fastcc void @fn1(i32 [[DOTIDX_VAL]])			; ARGPROMOTION-NEXT: call fastcc void @fn1(i32 [[DOTIDX_VAL]])
	; ARGPROMOTION-NEXT: ret i32 undef			; ARGPROMOTION-NEXT: ret i32 undef
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@fn2() local_unnamed_addr
				; ATTRIBUTOR-NEXT: [[TMP1:%.*]] = load i32, i32* @b, align 4
				; ATTRIBUTOR-NEXT: [[TMP2:%.*]] = sext i32 [[TMP1]] to i64
				; ATTRIBUTOR-NEXT: [[TMP3:%.*]] = inttoptr i64 [[TMP2]] to i32*
				; ATTRIBUTOR-NEXT: call fastcc void @fn1(i32* [[TMP3]])
				; ATTRIBUTOR-NEXT: ret i32 undef
				;
	%1 = load i32, i32* @b, align 4			%1 = load i32, i32* @b, align 4
	%2 = sext i32 %1 to i64			%2 = sext i32 %1 to i64
	%3 = inttoptr i64 %2 to i32*			%3 = inttoptr i64 %2 to i32*
	call fastcc void @fn1(i32* %3)			call fastcc void @fn1(i32* %3)
	ret i32 undef			ret i32 undef
	}			}

	define internal fastcc void @fn1(i32* nocapture readonly) unnamed_addr {			define internal fastcc void @fn1(i32* nocapture readonly) unnamed_addr {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@fn1			; ARGPROMOTION-LABEL: define {{[^@]+}}@fn1
	; ARGPROMOTION-SAME: (i32 [[DOT18446744073709551615_VAL:%.*]]) unnamed_addr			; ARGPROMOTION-SAME: (i32 [[DOT18446744073709551615_VAL:%.*]]) unnamed_addr
	; ARGPROMOTION-NEXT: store i32 [[DOT18446744073709551615_VAL]], i32* @a, align 4			; ARGPROMOTION-NEXT: store i32 [[DOT18446744073709551615_VAL]], i32* @a, align 4
	; ARGPROMOTION-NEXT: ret void			; ARGPROMOTION-NEXT: ret void
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@fn1
				; ATTRIBUTOR-SAME: (i32* nocapture readonly [[TMP0:%.*]]) unnamed_addr
				; ATTRIBUTOR-NEXT: [[TMP2:%.*]] = getelementptr inbounds i32, i32* [[TMP0]], i64 -1
				; ATTRIBUTOR-NEXT: [[TMP3:%.*]] = load i32, i32* [[TMP2]], align 4
				; ATTRIBUTOR-NEXT: store i32 [[TMP3]], i32* @a, align 4
				; ATTRIBUTOR-NEXT: ret void
				;
	%2 = getelementptr inbounds i32, i32* %0, i64 -1			%2 = getelementptr inbounds i32, i32* %0, i64 -1
	%3 = load i32, i32* %2, align 4			%3 = load i32, i32* %2, align 4
	store i32 %3, i32* @a, align 4			store i32 %3, i32* @a, align 4
	ret void			ret void
	}			}

llvm/test/Transforms/ArgumentPromotion/pr33641_remove_arg_dbgvalue.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt -argpromotion -verify -dse -S %s -o - | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt -argpromotion -verify -dse -S %s -o - | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
				; RUN: opt -S -passes='function(dse),attributor' -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=1 < %s | FileCheck %s --check-prefixes=ALL,ATTRIBUTOR

	; Fix for PR33641. ArgumentPromotion removed the argument to bar but left the call to			; Fix for PR33641. ArgumentPromotion removed the argument to bar but left the call to
	; dbg.value which still used the removed argument.			; dbg.value which still used the removed argument.

	; The %p argument should be removed, and the use of it in dbg.value should be			; The %p argument should be removed, and the use of it in dbg.value should be
	; changed to undef.			; changed to undef.


	%p_t = type i16*			%p_t = type i16*
	%fun_t = type void (%p_t)*			%fun_t = type void (%p_t)*

	define void @foo() {			define void @foo() {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@foo()			; ALL-LABEL: define {{[^@]+}}@foo()
	; ARGPROMOTION-NEXT: ret void			; ALL-NEXT: ret void
	;			;
	%tmp = alloca %fun_t			%tmp = alloca %fun_t
	store %fun_t @bar, %fun_t* %tmp			store %fun_t @bar, %fun_t* %tmp
	ret void			ret void
	}			}

	define internal void @bar(%p_t %p) {			define internal void @bar(%p_t %p) {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@bar()			; ARGPROMOTION-LABEL: define {{[^@]+}}@bar()

llvm/test/Transforms/ArgumentPromotion/profile.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt -argpromotion -mem2reg -S < %s | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt -argpromotion -mem2reg -S < %s | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
				; RUN: opt -S -passes='attributor,function(mem2reg)' -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=2 < %s | FileCheck %s --check-prefixes=ALL,ATTRIBUTOR
	target datalayout = "E-p:64:64:64-a0:0:8-f32:32:32-f64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:32:64-v64:64:64-v128:128:128"			target datalayout = "E-p:64:64:64-a0:0:8-f32:32:32-f64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:32:64-v64:64:64-v128:128:128"

	; Checks if !prof metadata is corret in deadargelim.			; Checks if !prof metadata is corret in deadargelim.

	define void @caller() #0 {			define void @caller() #0 {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@caller()			; ALL-LABEL: define {{[^@]+}}@caller()
	; ARGPROMOTION-NEXT: call void @promote_i32_ptr(i32 42), !prof !0			; ALL-NEXT: call void @promote_i32_ptr(i32 42), !prof !0
	; ARGPROMOTION-NEXT: ret void			; ALL-NEXT: ret void
	;			;
	%x = alloca i32			%x = alloca i32
	store i32 42, i32* %x			store i32 42, i32* %x
	call void @promote_i32_ptr(i32* %x), !prof !0			call void @promote_i32_ptr(i32* %x), !prof !0
	ret void			ret void
	}			}

	define internal void @promote_i32_ptr(i32* %xp) {			define internal void @promote_i32_ptr(i32* %xp) {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@promote_i32_ptr			; ALL-LABEL: define {{[^@]+}}@promote_i32_ptr
	; ARGPROMOTION-SAME: (i32 [[XP_VAL:%.*]])			; ALL-SAME: (i32 [[TMP0:%.*]])
	; ARGPROMOTION-NEXT: call void @use_i32(i32 [[XP_VAL]])			; ALL-NEXT: call void @use_i32(i32 [[TMP0]])
	; ARGPROMOTION-NEXT: ret void			; ALL-NEXT: ret void
	;			;
	%x = load i32, i32* %xp			%x = load i32, i32* %xp
	call void @use_i32(i32 %x)			call void @use_i32(i32 %x)
	ret void			ret void
	}			}

	declare void @use_i32(i32)			declare void @use_i32(i32)

	!0 = !{!"branch_weights", i32 30}			!0 = !{!"branch_weights", i32 30}

llvm/test/Transforms/ArgumentPromotion/reserve-tbaa.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -argpromotion -S | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt < %s -argpromotion -S | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
	; RUN: opt < %s -passes=argpromotion -S | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt < %s -passes=argpromotion -S | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
				; RUN: opt -S -passes=attributor -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=2 < %s | FileCheck %s --check-prefixes=ALL,ATTRIBUTOR

	; PR17906			; PR17906
	; When we promote two arguments in a single function with different types,			; When we promote two arguments in a single function with different types,
	; before the fix, we used the same tag for the newly-created two loads.			; before the fix, we used the same tag for the newly-created two loads.
	; This testing case makes sure that we correctly transfer the tbaa tags from the			; This testing case makes sure that we correctly transfer the tbaa tags from the
	; original loads to the newly-created loads when promoting pointer arguments.			; original loads to the newly-created loads when promoting pointer arguments.
				;
				; TODO: This test doesn't work with the ATTRIBUTOR as it will not promote the arguments but propagate them.

	@a = global i32* null, align 8			@a = global i32* null, align 8
	@e = global i32** @a, align 8			@e = global i32** @a, align 8
	@g = global i32 0, align 4			@g = global i32 0, align 4
	@c = global i64 0, align 8			@c = global i64 0, align 8
	@d = global i8 0, align 1			@d = global i8 0, align 1

	define internal fastcc void @fn(i32* nocapture readonly %p1, i64* nocapture readonly %p2) {			define internal fastcc void @fn(i32* nocapture readonly %p1, i64* nocapture readonly %p2) {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@fn			; ARGPROMOTION-LABEL: define {{[^@]+}}@fn
	; ARGPROMOTION-SAME: (i32 [[P1_VAL:%.*]], i64 [[P2_VAL:%.*]])			; ARGPROMOTION-SAME: (i32 [[P1_VAL:%.*]], i64 [[P2_VAL:%.*]])
	; ARGPROMOTION-NEXT: entry:			; ARGPROMOTION-NEXT: entry:
	; ARGPROMOTION-NEXT: [[CONV:%.*]] = trunc i64 [[P2_VAL]] to i32			; ARGPROMOTION-NEXT: [[CONV:%.*]] = trunc i64 [[P2_VAL]] to i32
	; ARGPROMOTION-NEXT: [[CONV1:%.*]] = trunc i32 [[P1_VAL]] to i8			; ARGPROMOTION-NEXT: [[CONV1:%.*]] = trunc i32 [[P1_VAL]] to i8
	; ARGPROMOTION-NEXT: store i8 [[CONV1]], i8* @d, align 1, !tbaa !0			; ARGPROMOTION-NEXT: store i8 [[CONV1]], i8* @d, align 1, !tbaa !0
	; ARGPROMOTION-NEXT: ret void			; ARGPROMOTION-NEXT: ret void
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@fn
				; ATTRIBUTOR-SAME: (i32* nocapture nonnull readonly align 4 dereferenceable(4) [[P1:%.*]])
				; ATTRIBUTOR-NEXT: entry:
				; ATTRIBUTOR-NEXT: [[TMP0:%.*]] = load i64, i64* @c, align 8, !tbaa !0
				; ATTRIBUTOR-NEXT: [[CONV:%.*]] = trunc i64 [[TMP0]] to i32
				; ATTRIBUTOR-NEXT: [[TMP1:%.*]] = load i32, i32* @g, align 4, !tbaa !4
				; ATTRIBUTOR-NEXT: [[CONV1:%.*]] = trunc i32 [[TMP1]] to i8
				; ATTRIBUTOR-NEXT: store i8 [[CONV1]], i8* @d, align 1, !tbaa !6
				; ATTRIBUTOR-NEXT: ret void
				;
	entry:			entry:
	%0 = load i64, i64* %p2, align 8, !tbaa !1			%0 = load i64, i64* %p2, align 8, !tbaa !1
	%conv = trunc i64 %0 to i32			%conv = trunc i64 %0 to i32
	%1 = load i32, i32* %p1, align 4, !tbaa !5			%1 = load i32, i32* %p1, align 4, !tbaa !5
	%conv1 = trunc i32 %1 to i8			%conv1 = trunc i32 %1 to i8
	store i8 %conv1, i8* @d, align 1, !tbaa !7			store i8 %conv1, i8* @d, align 1, !tbaa !7
	ret void			ret void
	}			}

	define i32 @main() {			define i32 @main() {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@main()			; ARGPROMOTION-LABEL: define {{[^@]+}}@main()
	; ARGPROMOTION-NEXT: entry:			; ARGPROMOTION-NEXT: entry:
	; ARGPROMOTION-NEXT: [[TMP0:%.*]] = load i32**, i32*** @e, align 8, !tbaa !3			; ARGPROMOTION-NEXT: [[TMP0:%.*]] = load i32**, i32*** @e, align 8, !tbaa !3
	; ARGPROMOTION-NEXT: store i32* @g, i32** [[TMP0]], align 8, !tbaa !3			; ARGPROMOTION-NEXT: store i32* @g, i32** [[TMP0]], align 8, !tbaa !3
	; ARGPROMOTION-NEXT: [[TMP1:%.*]] = load i32*, i32** @a, align 8, !tbaa !3			; ARGPROMOTION-NEXT: [[TMP1:%.*]] = load i32*, i32** @a, align 8, !tbaa !3
	; ARGPROMOTION-NEXT: store i32 1, i32* [[TMP1]], align 4, !tbaa !5			; ARGPROMOTION-NEXT: store i32 1, i32* [[TMP1]], align 4, !tbaa !5
	; ARGPROMOTION-NEXT: [[G_VAL:%.*]] = load i32, i32* @g, align 4, !tbaa !5			; ARGPROMOTION-NEXT: [[G_VAL:%.*]] = load i32, i32* @g, align 4, !tbaa !5
	; ARGPROMOTION-NEXT: [[C_VAL:%.*]] = load i64, i64* @c, align 8, !tbaa !7			; ARGPROMOTION-NEXT: [[C_VAL:%.*]] = load i64, i64* @c, align 8, !tbaa !7
	; ARGPROMOTION-NEXT: call fastcc void @fn(i32 [[G_VAL]], i64 [[C_VAL]])			; ARGPROMOTION-NEXT: call fastcc void @fn(i32 [[G_VAL]], i64 [[C_VAL]])
	; ARGPROMOTION-NEXT: ret i32 0			; ARGPROMOTION-NEXT: ret i32 0
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@main()
				; ATTRIBUTOR-NEXT: entry:
				; ATTRIBUTOR-NEXT: [[TMP0:%.*]] = load i32**, i32*** @e, align 8, !tbaa !7
				; ATTRIBUTOR-NEXT: store i32* @g, i32** [[TMP0]], align 8, !tbaa !7
				; ATTRIBUTOR-NEXT: [[TMP1:%.*]] = load i32*, i32** @a, align 8, !tbaa !7
				; ATTRIBUTOR-NEXT: store i32 1, i32* [[TMP1]], align 4, !tbaa !4
				; ATTRIBUTOR-NEXT: call fastcc void @fn(i32* nonnull align 4 dereferenceable(4) @g)
				; ATTRIBUTOR-NEXT: ret i32 0
				;
	entry:			entry:
	%0 = load i32**, i32*** @e, align 8, !tbaa !8			%0 = load i32**, i32*** @e, align 8, !tbaa !8
	store i32* @g, i32** %0, align 8, !tbaa !8			store i32* @g, i32** %0, align 8, !tbaa !8
	%1 = load i32*, i32** @a, align 8, !tbaa !8			%1 = load i32*, i32** @a, align 8, !tbaa !8
	store i32 1, i32* %1, align 4, !tbaa !5			store i32 1, i32* %1, align 4, !tbaa !5
	call fastcc void @fn(i32* @g, i64* @c)			call fastcc void @fn(i32* @g, i64* @c)

	ret i32 0			ret i32 0

llvm/test/Transforms/ArgumentPromotion/sret.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -argpromotion -S | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt < %s -argpromotion -S | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
	; RUN: opt < %s -passes=argpromotion -S | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt < %s -passes=argpromotion -S | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
				; RUN: opt -S -passes=attributor -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=2 < %s | FileCheck %s --check-prefixes=ALL,ATTRIBUTOR

	target datalayout = "e-m:w-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:w-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64-pc-windows-msvc"			target triple = "x86_64-pc-windows-msvc"

	define internal void @add({i32, i32}* %this, i32* sret %r) {			define internal void @add({i32, i32}* %this, i32* sret %r) {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@add			; ARGPROMOTION-LABEL: define {{[^@]+}}@add
	; ARGPROMOTION-SAME: (i32 [[THIS_0_0_VAL:%.*]], i32 [[THIS_0_1_VAL:%.*]], i32* noalias [[R:%.*]])			; ARGPROMOTION-SAME: (i32 [[THIS_0_0_VAL:%.*]], i32 [[THIS_0_1_VAL:%.*]], i32* noalias [[R:%.*]])
	; ARGPROMOTION-NEXT: [[AB:%.*]] = add i32 [[THIS_0_0_VAL]], [[THIS_0_1_VAL]]			; ARGPROMOTION-NEXT: [[AB:%.*]] = add i32 [[THIS_0_0_VAL]], [[THIS_0_1_VAL]]
	; ARGPROMOTION-NEXT: store i32 [[AB]], i32* [[R]]			; ARGPROMOTION-NEXT: store i32 [[AB]], i32* [[R]]
	; ARGPROMOTION-NEXT: ret void			; ARGPROMOTION-NEXT: ret void
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@add
				; ATTRIBUTOR-SAME: ({ i32, i32 }* noalias nocapture nonnull readonly align 8 dereferenceable(8) [[THIS:%.*]], i32* noalias nocapture nonnull sret writeonly align 4 dereferenceable(4) [[R:%.*]])
				; ATTRIBUTOR-NEXT: [[AP:%.*]] = getelementptr { i32, i32 }, { i32, i32 }* [[THIS]], i32 0, i32 0
				; ATTRIBUTOR-NEXT: [[BP:%.*]] = getelementptr { i32, i32 }, { i32, i32 }* [[THIS]], i32 0, i32 1
				; ATTRIBUTOR-NEXT: [[A:%.*]] = load i32, i32* [[AP]], align 8
				; ATTRIBUTOR-NEXT: [[B:%.*]] = load i32, i32* [[BP]]
				; ATTRIBUTOR-NEXT: [[AB:%.*]] = add i32 [[A]], [[B]]
				; ATTRIBUTOR-NEXT: store i32 [[AB]], i32* [[R]], align 4
				; ATTRIBUTOR-NEXT: ret void
				;
	%ap = getelementptr {i32, i32}, {i32, i32}* %this, i32 0, i32 0			%ap = getelementptr {i32, i32}, {i32, i32}* %this, i32 0, i32 0
	%bp = getelementptr {i32, i32}, {i32, i32}* %this, i32 0, i32 1			%bp = getelementptr {i32, i32}, {i32, i32}* %this, i32 0, i32 1
	%a = load i32, i32* %ap			%a = load i32, i32* %ap
	%b = load i32, i32* %bp			%b = load i32, i32* %bp
	%ab = add i32 %a, %b			%ab = add i32 %a, %b
	store i32 %ab, i32* %r			store i32 %ab, i32* %r
	ret void			ret void
	}			}

	define void @f() {			define void @f() {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@f()			; ARGPROMOTION-LABEL: define {{[^@]+}}@f()
	; ARGPROMOTION-NEXT: [[R:%.*]] = alloca i32			; ARGPROMOTION-NEXT: [[R:%.*]] = alloca i32
	; ARGPROMOTION-NEXT: [[PAIR:%.*]] = alloca { i32, i32 }			; ARGPROMOTION-NEXT: [[PAIR:%.*]] = alloca { i32, i32 }
	; ARGPROMOTION-NEXT: [[PAIR_IDX:%.*]] = getelementptr { i32, i32 }, { i32, i32 }* [[PAIR]], i64 0, i32 0			; ARGPROMOTION-NEXT: [[PAIR_IDX:%.*]] = getelementptr { i32, i32 }, { i32, i32 }* [[PAIR]], i64 0, i32 0
	; ARGPROMOTION-NEXT: [[PAIR_IDX_VAL:%.*]] = load i32, i32* [[PAIR_IDX]]			; ARGPROMOTION-NEXT: [[PAIR_IDX_VAL:%.*]] = load i32, i32* [[PAIR_IDX]]
	; ARGPROMOTION-NEXT: [[PAIR_IDX1:%.*]] = getelementptr { i32, i32 }, { i32, i32 }* [[PAIR]], i64 0, i32 1			; ARGPROMOTION-NEXT: [[PAIR_IDX1:%.*]] = getelementptr { i32, i32 }, { i32, i32 }* [[PAIR]], i64 0, i32 1
	; ARGPROMOTION-NEXT: [[PAIR_IDX1_VAL:%.*]] = load i32, i32* [[PAIR_IDX1]]			; ARGPROMOTION-NEXT: [[PAIR_IDX1_VAL:%.*]] = load i32, i32* [[PAIR_IDX1]]
	; ARGPROMOTION-NEXT: call void @add(i32 [[PAIR_IDX_VAL]], i32 [[PAIR_IDX1_VAL]], i32* noalias [[R]])			; ARGPROMOTION-NEXT: call void @add(i32 [[PAIR_IDX_VAL]], i32 [[PAIR_IDX1_VAL]], i32* noalias [[R]])
	; ARGPROMOTION-NEXT: ret void			; ARGPROMOTION-NEXT: ret void
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@f()
				; ATTRIBUTOR-NEXT: [[R:%.*]] = alloca i32
				; ATTRIBUTOR-NEXT: [[PAIR:%.*]] = alloca { i32, i32 }
				; ATTRIBUTOR-NEXT: call void @add({ i32, i32 }* noalias nocapture nonnull readonly align 8 dereferenceable(8) [[PAIR]], i32* noalias nocapture nonnull sret writeonly align 4 dereferenceable(4) [[R]])
				; ATTRIBUTOR-NEXT: ret void
				;
	%r = alloca i32			%r = alloca i32
	%pair = alloca {i32, i32}			%pair = alloca {i32, i32}

	call void @add({i32, i32}* %pair, i32* sret %r)			call void @add({i32, i32}* %pair, i32* sret %r)
	ret void			ret void
	}			}

llvm/test/Transforms/ArgumentPromotion/tail.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt %s -argpromotion -S -o - | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt %s -argpromotion -sroa -S -o - | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
	; RUN: opt %s -passes=argpromotion -S -o - | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt %s -passes='argpromotion,function(sroa)' -S -o - | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
				; RUN: opt -S -passes='attributor,function(sroa)' -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=1 < %s | FileCheck %s --check-prefixes=ALL,ATTRIBUTOR
	; PR14710			; PR14710

	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"

	%pair = type { i32, i32 }			%pair = type { i32, i32 }

	declare i8* @foo(%pair*)			declare i8* @foo(%pair*)

	define internal void @bar(%pair* byval %Data) {			define internal void @bar(%pair* byval %Data) {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@bar			; ARGPROMOTION-LABEL: define {{[^@]+}}@bar
	; ARGPROMOTION-SAME: (i32 [[DATA_0:%.*]], i32 [[DATA_1:%.*]])			; ARGPROMOTION-SAME: (i32 [[DATA_0:%.*]], i32 [[DATA_1:%.*]])
	; ARGPROMOTION-NEXT: [[DATA:%.*]] = alloca [[PAIR:%.*]]			; ARGPROMOTION-NEXT: [[DATA:%.*]] = alloca [[PAIR:%.*]]
	; ARGPROMOTION-NEXT: [[DOT0:%.*]] = getelementptr [[PAIR]], %pair* [[DATA]], i32 0, i32 0			; ARGPROMOTION-NEXT: [[DOT0:%.*]] = getelementptr [[PAIR]], %pair* [[DATA]], i32 0, i32 0
	; ARGPROMOTION-NEXT: store i32 [[DATA_0]], i32* [[DOT0]]			; ARGPROMOTION-NEXT: store i32 [[DATA_0]], i32* [[DOT0]]
	; ARGPROMOTION-NEXT: [[DOT1:%.*]] = getelementptr [[PAIR]], %pair* [[DATA]], i32 0, i32 1			; ARGPROMOTION-NEXT: [[DOT1:%.*]] = getelementptr [[PAIR]], %pair* [[DATA]], i32 0, i32 1
	; ARGPROMOTION-NEXT: store i32 [[DATA_1]], i32* [[DOT1]]			; ARGPROMOTION-NEXT: store i32 [[DATA_1]], i32* [[DOT1]]
	; ARGPROMOTION-NEXT: [[TMP1:%.]] = call i8 @foo(%pair* [[DATA]])			; ARGPROMOTION-NEXT: [[TMP1:%.]] = call i8 @foo(%pair* [[DATA]])
	; ARGPROMOTION-NEXT: ret void			; ARGPROMOTION-NEXT: ret void
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@bar
				; ATTRIBUTOR-SAME: (i32 [[TMP0:%.*]], i32 [[TMP1:%.*]])
				; ATTRIBUTOR-NEXT: [[DATA_PRIV:%.*]] = alloca [[PAIR:%.*]]
				; ATTRIBUTOR-NEXT: [[DATA_PRIV_CAST:%.*]] = bitcast %pair* [[DATA_PRIV]] to i32*
				; ATTRIBUTOR-NEXT: store i32 [[TMP0]], i32* [[DATA_PRIV_CAST]]
				; ATTRIBUTOR-NEXT: [[DATA_PRIV_0_1:%.*]] = getelementptr [[PAIR]], %pair* [[DATA_PRIV]], i32 0, i32 1
				; ATTRIBUTOR-NEXT: store i32 [[TMP1]], i32* [[DATA_PRIV_0_1]]
				; ATTRIBUTOR-NEXT: [[TMP3:%.*]] = call i8* @foo(%pair* nonnull [[DATA_PRIV]])
				; ATTRIBUTOR-NEXT: ret void
				;
	tail call i8* @foo(%pair* %Data)			tail call i8* @foo(%pair* %Data)
	ret void			ret void
	}			}

	define void @zed(%pair* byval %Data) {			define void @zed(%pair* byval %Data) {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@zed			; ARGPROMOTION-LABEL: define {{[^@]+}}@zed
	; ARGPROMOTION-SAME: (%pair* byval [[DATA:%.*]])			; ARGPROMOTION-SAME: (%pair* byval [[DATA:%.*]])
	; ARGPROMOTION-NEXT: [[DATA_0:%.*]] = getelementptr [[PAIR:%.*]], %pair* [[DATA]], i32 0, i32 0			; ARGPROMOTION-NEXT: [[DATA_0:%.*]] = getelementptr [[PAIR:%.*]], %pair* [[DATA]], i32 0, i32 0
	; ARGPROMOTION-NEXT: [[DATA_0_VAL:%.*]] = load i32, i32* [[DATA_0]]			; ARGPROMOTION-NEXT: [[DATA_0_VAL:%.*]] = load i32, i32* [[DATA_0]]
	; ARGPROMOTION-NEXT: [[DATA_1:%.*]] = getelementptr [[PAIR]], %pair* [[DATA]], i32 0, i32 1			; ARGPROMOTION-NEXT: [[DATA_1:%.*]] = getelementptr [[PAIR]], %pair* [[DATA]], i32 0, i32 1
	; ARGPROMOTION-NEXT: [[DATA_1_VAL:%.*]] = load i32, i32* [[DATA_1]]			; ARGPROMOTION-NEXT: [[DATA_1_VAL:%.*]] = load i32, i32* [[DATA_1]]
	; ARGPROMOTION-NEXT: call void @bar(i32 [[DATA_0_VAL]], i32 [[DATA_1_VAL]])			; ARGPROMOTION-NEXT: call void @bar(i32 [[DATA_0_VAL]], i32 [[DATA_1_VAL]])
	; ARGPROMOTION-NEXT: ret void			; ARGPROMOTION-NEXT: ret void
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@zed
				; ATTRIBUTOR-SAME: (%pair* nonnull byval [[DATA:%.*]])
				; ATTRIBUTOR-NEXT: [[DATA_CAST:%.*]] = bitcast %pair* [[DATA]] to i32*
				; ATTRIBUTOR-NEXT: [[TMP1:%.*]] = load i32, i32* [[DATA_CAST]]
				; ATTRIBUTOR-NEXT: [[DATA_0_1:%.*]] = getelementptr [[PAIR:%.*]], %pair* [[DATA]], i32 0, i32 1
				; ATTRIBUTOR-NEXT: [[TMP2:%.*]] = load i32, i32* [[DATA_0_1]]
				; ATTRIBUTOR-NEXT: call void @bar(i32 [[TMP1]], i32 [[TMP2]])
				; ATTRIBUTOR-NEXT: ret void
				;
	call void @bar(%pair* byval %Data)			call void @bar(%pair* byval %Data)
	ret void			ret void
	}			}
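
The RUN lines above share one set of assertions through FileCheck prefixes: a directive is active in a run only when its prefix is listed in that run's --check-prefixes option. The Python sketch below illustrates that selection on a few lines taken from this test. It is a simplification, not FileCheck itself: the helper name `active_directives` is hypothetical, and real FileCheck recognizes more directive forms and options than the regular expression used here.

```python
import re


def active_directives(test_text, prefixes):
    """Return the check directives that a run with the given prefixes would use.

    Simplified model: a directive looks like '; PREFIX:' or '; PREFIX-SUFFIX:'.
    """
    alternatives = "|".join(re.escape(p) for p in prefixes)
    pattern = re.compile(r";\s*(?:%s)(?:-[A-Z]+)?:" % alternatives)
    return [line.strip() for line in test_text.splitlines() if pattern.search(line)]


SAMPLE = """\
; ARGPROMOTION-LABEL: define {{[^@]+}}@bar
; ARGPROMOTION-SAME: (i32 [[DATA_0:%.*]], i32 [[DATA_1:%.*]])
; ATTRIBUTOR-LABEL: define {{[^@]+}}@bar
; ATTRIBUTOR-SAME: (i32 [[TMP0:%.*]], i32 [[TMP1:%.*]])
"""

# The Attributor run only sees the ATTRIBUTOR (and ALL) directives.
attributor_run = active_directives(SAMPLE, ["ALL", "ATTRIBUTOR"])
# The argument promotion runs only see the ARGPROMOTION (and ALL) directives.
argpromotion_run = active_directives(SAMPLE, ["ALL", "ARGPROMOTION"])
```

This is why the same function body can carry two independent blocks of expectations, one per pass, while a directive with the ALL prefix would apply to every run.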

llvm/test/Transforms/ArgumentPromotion/variadic.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -argpromotion -S | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt < %s -argpromotion -S | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
	; RUN: opt < %s -passes=argpromotion -S | FileCheck %s --check-prefixes=ARGPROMOTION,ALL			; RUN: opt < %s -passes=argpromotion -S | FileCheck %s --check-prefixes=ALL,ARGPROMOTION
				; RUN: opt -S -passes=attributor -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=1 < %s | FileCheck %s --check-prefixes=ALL,ATTRIBUTOR

	; Unused arguments from variadic functions cannot be eliminated as that changes			; Unused arguments from variadic functions cannot be eliminated as that changes
	; their classification according to the SysV amd64 ABI. Clang and other frontends			; their classification according to the SysV amd64 ABI. Clang and other frontends
	; bake in the classification when they use things like byval, as in this test.			; bake in the classification when they use things like byval, as in this test.

	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	%struct.tt0 = type { i64, i64 }			%struct.tt0 = type { i64, i64 }
	%struct.__va_list_tag = type { i32, i32, i8*, i8* }			%struct.__va_list_tag = type { i32, i32, i8*, i8* }

	@t45 = internal global %struct.tt0 { i64 1335139741, i64 438042995 }, align 8			@t45 = internal global %struct.tt0 { i64 1335139741, i64 438042995 }, align 8

	; Function Attrs: nounwind uwtable			; Function Attrs: nounwind uwtable
	define i32 @main(i32 %argc, i8** nocapture readnone %argv) #0 {			define i32 @main(i32 %argc, i8** nocapture readnone %argv) #0 {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@main			; ALL-LABEL: define {{[^@]+}}@main
	; ARGPROMOTION-SAME: (i32 [[ARGC:%.*]], i8** nocapture readnone [[ARGV:%.*]])			; ALL-SAME: (i32 [[ARGC:%.*]], i8** nocapture readnone [[ARGV:%.*]])
	; ARGPROMOTION-NEXT: entry:			; ALL-NEXT: entry:
	; ARGPROMOTION-NEXT: tail call void (i8*, i8*, i8*, i8*, i8*, ...) @callee_t0f(i8* undef, i8* undef, i8* undef, i8* undef, i8* undef, %struct.tt0* byval align 8 @t45)			; ALL-NEXT: tail call void (i8*, i8*, i8*, i8*, i8*, ...) @callee_t0f(i8* undef, i8* undef, i8* undef, i8* undef, i8* undef, %struct.tt0* byval align 8 @t45)
	; ARGPROMOTION-NEXT: ret i32 0			; ALL-NEXT: ret i32 0
	;			;
	entry:			entry:
	tail call void (i8*, i8*, i8*, i8*, i8*, ...) @callee_t0f(i8* undef, i8* undef, i8* undef, i8* undef, i8* undef, %struct.tt0* byval align 8 @t45)			tail call void (i8*, i8*, i8*, i8*, i8*, ...) @callee_t0f(i8* undef, i8* undef, i8* undef, i8* undef, i8* undef, %struct.tt0* byval align 8 @t45)
	ret i32 0			ret i32 0
	}			}

	; Function Attrs: nounwind uwtable			; Function Attrs: nounwind uwtable
	define internal void @callee_t0f(i8* nocapture readnone %tp13, i8* nocapture readnone %tp14, i8* nocapture readnone %tp15, i8* nocapture readnone %tp16, i8* nocapture readnone %tp17, ...) {			define internal void @callee_t0f(i8* nocapture readnone %tp13, i8* nocapture readnone %tp14, i8* nocapture readnone %tp15, i8* nocapture readnone %tp16, i8* nocapture readnone %tp17, ...) {
	; ARGPROMOTION-LABEL: define {{[^@]+}}@callee_t0f			; ARGPROMOTION-LABEL: define {{[^@]+}}@callee_t0f
	; ARGPROMOTION-SAME: (i8* nocapture readnone [[TP13:%.*]], i8* nocapture readnone [[TP14:%.*]], i8* nocapture readnone [[TP15:%.*]], i8* nocapture readnone [[TP16:%.*]], i8* nocapture readnone [[TP17:%.*]], ...)			; ARGPROMOTION-SAME: (i8* nocapture readnone [[TP13:%.*]], i8* nocapture readnone [[TP14:%.*]], i8* nocapture readnone [[TP15:%.*]], i8* nocapture readnone [[TP16:%.*]], i8* nocapture readnone [[TP17:%.*]], ...)
	; ARGPROMOTION-NEXT: entry:			; ARGPROMOTION-NEXT: entry:
	; ARGPROMOTION-NEXT: ret void			; ARGPROMOTION-NEXT: ret void
	;			;
				; ATTRIBUTOR-LABEL: define {{[^@]+}}@callee_t0f
				; ATTRIBUTOR-SAME: (i8* noalias nocapture nonnull readnone [[TP13:%.*]], i8* noalias nocapture nonnull readnone [[TP14:%.*]], i8* noalias nocapture nonnull readnone [[TP15:%.*]], i8* noalias nocapture nonnull readnone [[TP16:%.*]], i8* noalias nocapture nonnull readnone [[TP17:%.*]], ...)
				; ATTRIBUTOR-NEXT: entry:
				; ATTRIBUTOR-NEXT: ret void
				;
	entry:			entry:
	ret void			ret void
	}			}

llvm/test/Transforms/FunctionAttrs/callbacks.ll

[18 unchanged lines not shown]
; CHECK-LABEL: @t0_caller(		; CHECK-LABEL: @t0_caller(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[B:%.*]] = alloca i32, align 32		; CHECK-NEXT: [[B:%.*]] = alloca i32, align 32
; CHECK-NEXT: [[C:%.*]] = alloca i32*, align 64		; CHECK-NEXT: [[C:%.*]] = alloca i32*, align 64
; CHECK-NEXT: [[PTR:%.*]] = alloca i32, align 128		; CHECK-NEXT: [[PTR:%.*]] = alloca i32, align 128
; CHECK-NEXT: [[TMP0:%.*]] = bitcast i32* [[B]] to i8*		; CHECK-NEXT: [[TMP0:%.*]] = bitcast i32* [[B]] to i8*
; CHECK-NEXT: store i32 42, i32* [[B]], align 32		; CHECK-NEXT: store i32 42, i32* [[B]], align 32
; CHECK-NEXT: store i32* [[B]], i32** [[C]], align 64		; CHECK-NEXT: store i32* [[B]], i32** [[C]], align 64
; CHECK-NEXT: call void (i32*, i32*, void (i32*, i32*, ...)*, ...) @t0_callback_broker(i32* noalias null, i32* nonnull align 128 dereferenceable(4) [[PTR]], void (i32*, i32*, ...)* nonnull bitcast (void (i32*, i32*, i32*, i64, i32**)* @t0_callback_callee to void (i32*, i32*, ...)*), i32* [[A:%.*]], i64 99, i32** nonnull align 64 dereferenceable(8) [[C]])		; CHECK-NEXT: call void (i32*, i32*, void (i32*, i32*, ...)*, ...) @t0_callback_broker(i32* noalias null, i32* nonnull align 128 dereferenceable(4) [[PTR]], void (i32*, i32*, ...)* nonnull bitcast (void (i32*, i32*, i32*, i64, i32**)* @t0_callback_callee to void (i32*, i32*, ...)*), i32* [[A:%.*]], i64 99, i32** noalias nocapture nonnull readonly align 64 dereferenceable(8) [[C]])
; CHECK-NEXT: ret void		; CHECK-NEXT: ret void
;		;
entry:		entry:
%b = alloca i32, align 32		%b = alloca i32, align 32
%c = alloca i32*, align 64		%c = alloca i32*, align 64
%ptr = alloca i32, align 128		%ptr = alloca i32, align 128
%0 = bitcast i32* %b to i8*		%0 = bitcast i32* %b to i8*
store i32 42, i32* %b, align 4		store i32 42, i32* %b, align 4
[20 unchanged lines not shown]	entry:
tail call void @t0_check(i32* %a, i64 %b, i32* %0)		tail call void @t0_check(i32* %a, i64 %b, i32* %0)
ret void		ret void
}		}

declare void @t0_check(i32* align 256, i64, i32*)		declare void @t0_check(i32* align 256, i64, i32*)

declare !callback !0 void @t0_callback_broker(i32*, i32*, void (i32*, i32*, ...)*, ...)		declare !callback !0 void @t0_callback_broker(i32*, i32*, void (i32*, i32*, ...)*, ...)

		; Test 1
		;
		; Similar to test 0 but with some additional annotations (noalias/nocapture) to make sure
		; we deduce and propagate noalias and others properly.

		define void @t1_caller(i32* noalias %a) {
		; CHECK-LABEL: define {{[^@]+}}@t1_caller
		; CHECK-SAME: (i32* noalias nocapture [[A:%.*]])
		; CHECK-NEXT: entry:
		; CHECK-NEXT: [[B:%.*]] = alloca i32, align 32
		; CHECK-NEXT: [[C:%.*]] = alloca i32*, align 64
		; CHECK-NEXT: [[PTR:%.*]] = alloca i32, align 128
		; CHECK-NEXT: [[TMP0:%.*]] = bitcast i32* [[B]] to i8*
		; CHECK-NEXT: store i32 42, i32* [[B]], align 32
		; CHECK-NEXT: store i32* [[B]], i32** [[C]], align 64
		; CHECK-NEXT: call void (i32*, i32*, void (i32*, i32*, ...)*, ...) @t1_callback_broker(i32* noalias null, i32* noalias nonnull align 128 dereferenceable(4) [[PTR]], void (i32*, i32*, ...)* nonnull bitcast (void (i32*, i32*, i32*, i64, i32**)* @t1_callback_callee to void (i32*, i32*, ...)*), i32* noalias nocapture [[A:%.*]], i64 99, i32** noalias nocapture nonnull readonly align 64 dereferenceable(8) [[C]])
		; CHECK-NEXT: ret void
		;
		entry:
		%b = alloca i32, align 32
		%c = alloca i32*, align 64
		%ptr = alloca i32, align 128
		%0 = bitcast i32* %b to i8*
		store i32 42, i32* %b, align 4
		store i32* %b, i32** %c, align 8
		call void (i32*, i32*, void (i32*, i32*, ...)*, ...) @t1_callback_broker(i32* null, i32* %ptr, void (i32*, i32*, ...)* bitcast (void (i32*, i32*, i32*, i64, i32**)* @t1_callback_callee to void (i32*, i32*, ...)*), i32* %a, i64 99, i32** %c)
		ret void
		}

		; Note that the first two arguments are provided by the callback_broker according to the callback in !1 below!
		; The others are annotated with alignment information, amongst others, or even replaced by the constants passed to the call.
		define internal void @t1_callback_callee(i32* %is_not_null, i32* %ptr, i32* %a, i64 %b, i32** %c) {
		; CHECK-LABEL: define {{[^@]+}}@t1_callback_callee
		; CHECK-SAME: (i32* nocapture nonnull writeonly dereferenceable(4) [[IS_NOT_NULL:%.*]], i32* nocapture nonnull readonly dereferenceable(4) [[PTR:%.*]], i32* noalias nocapture [[A:%.*]], i64 [[B:%.*]], i32** noalias nocapture nonnull readonly align 64 dereferenceable(8) [[C:%.*]])
		; CHECK-NEXT: entry:
		; CHECK-NEXT: [[PTR_VAL:%.*]] = load i32, i32* [[PTR]], align 8
		; CHECK-NEXT: store i32 [[PTR_VAL]], i32* [[IS_NOT_NULL]]
		; CHECK-NEXT: [[TMP0:%.*]] = load i32*, i32** [[C]], align 64
		; CHECK-NEXT: tail call void @t1_check(i32* nocapture align 256 [[A]], i64 99, i32* [[TMP0]])
		; CHECK-NEXT: ret void
		;
		entry:
		%ptr_val = load i32, i32* %ptr, align 8
		store i32 %ptr_val, i32* %is_not_null
		%0 = load i32*, i32** %c, align 8
		tail call void @t1_check(i32* %a, i64 %b, i32* %0)
		ret void
		}

		declare void @t1_check(i32* nocapture align 256, i64, i32* nocapture) nosync

		declare !callback !0 void @t1_callback_broker(i32* nocapture , i32* nocapture , void (i32*, i32*, ...)* nocapture, ...)

		; Test 2
		;
		; Similar to test 1 but checking that the noalias is only placed if potential synchronization through @t2_check is preserved.

		define void @t2_caller(i32* noalias %a) {
		; CHECK-LABEL: define {{[^@]+}}@t2_caller
		; CHECK-SAME: (i32* noalias nocapture [[A:%.*]])
		; CHECK-NEXT: entry:
		; CHECK-NEXT: [[B:%.*]] = alloca i32, align 32
		; CHECK-NEXT: [[C:%.*]] = alloca i32*, align 64
		; CHECK-NEXT: [[PTR:%.*]] = alloca i32, align 128
		; CHECK-NEXT: [[TMP0:%.*]] = bitcast i32* [[B]] to i8*
		; CHECK-NEXT: store i32 42, i32* [[B]], align 32
		; CHECK-NEXT: store i32* [[B]], i32** [[C]], align 64
		; CHECK-NEXT: call void (i32*, i32*, void (i32*, i32*, ...)*, ...) @t2_callback_broker(i32* noalias null, i32* noalias nonnull align 128 dereferenceable(4) [[PTR]], void (i32*, i32*, ...)* nonnull bitcast (void (i32*, i32*, i32*, i64, i32**)* @t2_callback_callee to void (i32*, i32*, ...)*), i32* nocapture [[A:%.*]], i64 99, i32** noalias nocapture nonnull readonly align 64 dereferenceable(8) [[C]])
		; CHECK-NEXT: ret void
		;
		entry:
		%b = alloca i32, align 32
		%c = alloca i32*, align 64
		%ptr = alloca i32, align 128
		%0 = bitcast i32* %b to i8*
		store i32 42, i32* %b, align 4
		store i32* %b, i32** %c, align 8
		call void (i32*, i32*, void (i32*, i32*, ...)*, ...) @t2_callback_broker(i32* null, i32* %ptr, void (i32*, i32*, ...)* bitcast (void (i32*, i32*, i32*, i64, i32**)* @t2_callback_callee to void (i32*, i32*, ...)*), i32* %a, i64 99, i32** %c)
		ret void
		}

		; Note that the first two arguments are provided by the callback_broker according to the callback in !1 below!
		; The others are annotated with alignment information, amongst others, or even replaced by the constants passed to the call.
		;
		; FIXME: We should derive noalias for %a and add a "fake use" of %a in all potentially synchronizing calls.
		define internal void @t2_callback_callee(i32* %is_not_null, i32* %ptr, i32* %a, i64 %b, i32** %c) {
		; CHECK-LABEL: define {{[^@]+}}@t2_callback_callee
		; CHECK-SAME: (i32* nocapture nonnull writeonly dereferenceable(4) [[IS_NOT_NULL:%.*]], i32* nocapture nonnull readonly dereferenceable(4) [[PTR:%.*]], i32* nocapture [[A:%.*]], i64 [[B:%.*]], i32** noalias nocapture nonnull readonly align 64 dereferenceable(8) [[C:%.*]])
		; CHECK-NEXT: entry:
		; CHECK-NEXT: [[PTR_VAL:%.*]] = load i32, i32* [[PTR]], align 8
		; CHECK-NEXT: store i32 [[PTR_VAL]], i32* [[IS_NOT_NULL]]
		; CHECK-NEXT: [[TMP0:%.*]] = load i32*, i32** [[C]], align 64
		; CHECK-NEXT: tail call void @t2_check(i32* nocapture align 256 [[A]], i64 99, i32* [[TMP0]])
		; CHECK-NEXT: ret void
		;
		entry:
		%ptr_val = load i32, i32* %ptr, align 8
		store i32 %ptr_val, i32* %is_not_null
		%0 = load i32*, i32** %c, align 8
		tail call void @t2_check(i32* %a, i64 %b, i32* %0)
		ret void
		}

		declare void @t2_check(i32* nocapture align 256, i64, i32* nocapture)

		declare !callback !0 void @t2_callback_broker(i32* nocapture , i32* nocapture , void (i32*, i32*, ...)* nocapture, ...)

!0 = !{!1}		!0 = !{!1}
!1 = !{i64 2, i64 -1, i64 -1, i1 true}		!1 = !{i64 2, i64 -1, i64 -1, i1 true}
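
The `!callback` operands above are compact, so a small decoder may help when reading the broker declarations. The Python sketch below follows the layout described for callback metadata in the LLVM Language Reference: the first operand names the broker parameter that holds the callee, the middle operands say which broker argument feeds each leading callee parameter (-1 meaning the value is not known statically), and the final flag says whether the broker's variadic arguments are forwarded to the callee. The function name `describe_callback` and the shape of the returned dictionary are hypothetical, chosen only for illustration.

```python
def describe_callback(operands):
    """Decode one !callback encoding such as (2, -1, -1, True)."""
    if len(operands) < 2:
        raise ValueError("need at least a callee index and the varargs flag")
    callee_index = operands[0]
    forwards_varargs = bool(operands[-1])
    payload = list(operands[1:-1])
    return {
        # Which broker parameter is the function that gets called back.
        "callee_parameter": callee_index,
        # Where each leading callee parameter comes from.
        "callee_arguments": [
            "unknown" if index == -1 else "broker arg %d" % index
            for index in payload
        ],
        # Whether the broker's variadic arguments are passed through.
        "forwards_varargs": forwards_varargs,
    }


# !1 = !{i64 2, i64 -1, i64 -1, i1 true}
t0_encoding = describe_callback((2, -1, -1, True))
```

Read this way, the encoding in this test says the callee is broker parameter 2, its first two parameters are supplied by the broker itself, and the remaining ones (%a, %b, %c) are the forwarded variadic arguments, which agrees with the comments on the callee definitions.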

Diff 227230
