This is an archive of the discontinued LLVM Phabricator instance.

Differential D77689

[X86] Codegen for preallocated
Closed · Public

Authored by aeubanks on Apr 7 2020, 4:31 PM.

Details

Reviewers

rnk
jdoerfert
sstefan1
efriedma
craig.topper

Commits

rG8a88755610d0: Reland [X86] Codegen for preallocated
rG810567dc691a: [X86] Codegen for preallocated

Summary

See https://reviews.llvm.org/D74651 for the preallocated IR constructs
and LangRef changes.

In X86TargetLowering::LowerCall(), if a call is preallocated, record
each argument's offset from the stack pointer and the total stack
adjustment. Associate the call Value with an integer index. Store the
info in X86MachineFunctionInfo with the integer index as the key.

This adds two new target independent ISDOpcodes and two new target
dependent Opcodes corresponding to @llvm.call.preallocated.{setup,arg}.

The setup ISelDAG node takes in a chain and outputs a chain and a
SrcValue of the preallocated call Value. It is lowered to a target
dependent node with the SrcValue replaced with the integer index key by
looking in X86MachineFunctionInfo. In
X86TargetLowering::EmitInstrWithCustomInserter() this is lowered to an
%esp adjustment, the exact amount determined by looking in
X86MachineFunctionInfo with the integer index key.

The arg ISelDAG node takes in a chain, a SrcValue of the preallocated
call Value, and the arg index int constant. It produces a chain and the
pointer to the arg. It is lowered to a target dependent node with the
SrcValue replaced with the integer index key by looking in
X86MachineFunctionInfo. In
X86TargetLowering::EmitInstrWithCustomInserter() this is lowered to a
lea of the stack pointer plus an offset determined by looking in
X86MachineFunctionInfo with the integer index key.
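
As a rough illustration of the two lowerings described above, the following is a minimal sketch and not the patch's code. `PreallocatedCallInfo`, `lowerSetup`, and `lowerArg` are hypothetical names, and the sketch assumes the stack adjustment is a subtraction because the x86 stack grows toward lower addresses.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical record of what LowerCall() would store for one preallocated
// call: the total stack adjustment and each argument's offset from the
// stack pointer.
struct PreallocatedCallInfo {
  std::size_t StackSize = 0;
  std::vector<std::size_t> ArgOffsets;
};

// Models the "setup" pseudo: reserve the argument area by moving the stack
// pointer down (assumption: the stack grows toward lower addresses).
inline std::uintptr_t lowerSetup(std::uintptr_t StackPointer,
                                 const PreallocatedCallInfo &Info) {
  return StackPointer - Info.StackSize;
}

// Models the "arg" pseudo: like a lea, compute stack pointer + offset.
inline std::uintptr_t lowerArg(std::uintptr_t StackPointer,
                               const PreallocatedCallInfo &Info,
                               std::size_t ArgIndex) {
  assert(ArgIndex < Info.ArgOffsets.size() && "argument index out of range");
  return StackPointer + Info.ArgOffsets[ArgIndex];
}

// Usage: two 4-byte arguments in an 8-byte preallocated area.
inline void preallocatedLoweringExample() {
  PreallocatedCallInfo Info;
  Info.StackSize = 8;
  Info.ArgOffsets = {0, 4};
  const std::uintptr_t SP = lowerSetup(0x1000, Info);
  assert(SP == 0xFF8);
  assert(lowerArg(SP, Info, 1) == 0xFFC);
}
```

In the patch itself this information lives in X86MachineFunctionInfo and is looked up by the integer index key; the sketch passes the record directly to keep the arithmetic visible.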

Force any function containing a preallocated call to use the frame
pointer.

Does not yet handle a setup without a call, or a conditional call.

Tried to look at all references to inalloca and see if they apply to
preallocated. I've made preallocated versions of tests testing inalloca
whenever possible and when they make sense (e.g. not alloca related,
inalloca edge cases).

Aside from the tests added here, I checked that this codegen produces
correct code for something like

struct A {
        A();
        A(A&&);
        ~A();
};

void bar() {
        foo(foo(foo(foo(foo(A(), 4), 5), 6), 7), 8);
}

by replacing the inalloca version of the .ll file with the appropriate
preallocated code. Running the executable produces the same results as
using the current inalloca implementation.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

aeubanks created this revision. Apr 7 2020, 4:31 PM

Herald added a project: Restricted Project. Apr 7 2020, 4:31 PM

Herald added subscribers: llvm-commits, hiraditya.

aeubanks edited the summary of this revision. Apr 7 2020, 4:33 PM

aeubanks added a reviewer: rnk.

Harbormaster failed remote builds in B52264: Diff 255853! Apr 7 2020, 5:28 PM

Rebase

Harbormaster failed remote builds in B53199: Diff 257424! Apr 14 2020, 12:58 PM

Add tests

Harbormaster failed remote builds in B54142: Diff 259083! Apr 21 2020, 2:06 PM

Add tests
Rename callsetup -> preallocated

Harbormaster failed remote builds in B55004: Diff 260707! Apr 28 2020, 12:24 PM

Look around for inalloca and add preallocated when relevant

Included lots of tests

Update commit message to be much more descriptive

Herald added a reviewer: jdoerfert. May 1 2020, 4:37 PM

Herald added a reviewer: sstefan1.

ready for review

Harbormaster failed remote builds in B55513: Diff 261574! May 1 2020, 7:16 PM

Nit: The commit name is still call setup.
Did the lang ref patch land already? We should link it in the commit message.

The llvm/lib/Transforms/IPO changes look straightforward except the one in llvm/lib/Transforms/IPO/GlobalOpt.cpp. I would suggest to split that one off and add a comment to the new code explaining what is happening and why. It also seems that not all IPO passes have tests with the new attribute. Don't wait for me to review the rest or to accept this.

More tests
Remove some handling of preallocated in places we don't support yet
Rebase

aeubanks retitled this revision from Codegen for call setup to [X86] Codegen for preallocated. May 5 2020, 9:19 AM

aeubanks edited the summary of this revision.

In D77689#2016748, @jdoerfert wrote:

Nit: The commit name is still call setup.
Did the lang ref patch land already? We should link it in the commit message.

Done.
Is there a way to automatically update the Phab change with the commit message if you're only uploading one commit?

The llvm/lib/Transforms/IPO changes look straightforward except the one in llvm/lib/Transforms/IPO/GlobalOpt.cpp. I would suggest to split that one off and add a comment to the new code explaining what is happening and why. It also seems that not all IPO passes have tests with the new attribute. Don't wait for me to review the rest or to accept this.

Specifically for GlobalOpt.cpp, I realized that I probably shouldn't have done that since we haven't implemented the part where you strip off the operand bundle and replace the calls to the intrinsics with something like alloca. So I removed that and added a TODO.
In general I went through all the IPO transforms and did some archaeology to see what tests they had. I did end up finding more tests to copy/add to, but some also never had tests (not a great excuse)... Anyway thanks for the quick review!

Harbormaster failed remote builds in B55807: Diff 262138! May 5 2020, 10:14 AM

I think the approach looks good, sorry for the delay.

In D77689#2020636, @aeubanks wrote:

Is there a way to automatically update the Phab change with the commit message if you're only uploading one commit?

I don't think so, I think the only way to update the commit message is through the web interface.

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
5827	The common code here seems to be this search of the use list to find the call that consumes the setup. I'd suggest factoring that out into a helper. Once that is done, it seems like this code might be simpler if you separate the setup and arg cases.
llvm/lib/Target/X86/X86ISelLowering.cpp
3887	This seems like it could be an assert, but I could go either way. If the condition occurs, it's an LLVM bug: it means the x86 calling conv code missed the case. We generally use report_fatal_error when user input could trigger the condition, and we don't want UB to occur.
llvm/lib/Target/X86/X86MachineFunctionInfo.h
110	This could be tracked as `PreallocatedStackSizes.size()`, perhaps.
112	I tend to micro-optimize data structures, but the keys to this are densely packed integers, so this could be a vector instead of a hash map.
113	DenseMap is open-addressed, so this takes up more memory than you might imagine. If you change it to SmallVector, the common case is that there are no call sites using preallocated, so I'd use 0 builtin elements.
203	Can this function use the get-or-insert pattern: `auto Insert = PreallocatedIds.insert({CS, PreallocatedNextId}); if (Insert.second) ++PreallocatedNextId; return Insert.first->second;`
215	You can shorten this with `.count(Id)`
225	You can shorten this with `.count(Id)`
llvm/lib/Target/X86/X86RegisterInfo.cpp
629–630	Ooh, base pointers are expensive. Is this necessary?
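
The get-or-insert and dense-vector suggestions above can be sketched roughly as follows. This is an illustration under assumptions, not the code that landed: `PreallocatedIdTable` and its members are hypothetical, and `std::unordered_map` and `std::vector` stand in for `llvm::DenseMap` and `llvm::SmallVector`.

```cpp
#include <cassert>
#include <cstddef>
#include <unordered_map>
#include <vector>

// Hypothetical id table; the standard containers stand in for LLVM's.
class PreallocatedIdTable {
public:
  // Get-or-insert: a single lookup either finds the existing id or
  // registers the next free one.
  std::size_t getId(const void *CallSite) {
    auto Insert = Ids.insert({CallSite, StackSizes.size()});
    if (Insert.second)
      StackSizes.push_back(0); // New id: grow the dense per-id storage.
    return Insert.first->second;
  }

  // Membership test written with count(), as suggested in the review.
  bool hasId(const void *CallSite) const { return Ids.count(CallSite) != 0; }

  void setStackSize(std::size_t Id, std::size_t Size) {
    assert(Id < StackSizes.size() && "unknown id");
    StackSizes[Id] = Size;
  }

  std::size_t getStackSize(std::size_t Id) const {
    assert(Id < StackSizes.size() && "unknown id");
    return StackSizes[Id];
  }

private:
  std::unordered_map<const void *, std::size_t> Ids;
  // Ids are dense (0, 1, 2, ...), so a vector indexed by id suffices and
  // its size doubles as the next id.
  std::vector<std::size_t> StackSizes;
};
```

Because the vector's size is the next id, no separate counter is needed, which mirrors the `PreallocatedStackSizes.size()` remark on line 110.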

efriedma added a subscriber: efriedma. May 7 2020, 2:59 PM

Address code review comments

aeubanks marked 10 inline comments as done. May 7 2020, 4:28 PM

aeubanks added inline comments.

llvm/lib/Target/X86/X86ISelLowering.cpp
3887	I copied this from inalloca right above, but yeah assert seems fine.
llvm/lib/Target/X86/X86RegisterInfo.cpp
629–630	Yup, I tried removing it and it makes my test program crash. I spent quite a while trying to figure out how to force a frame pointer with preallocated calls (is the distinction between frame/base pointer relevant here?).

Harbormaster failed remote builds in B56117: Diff 262784! May 7 2020, 5:24 PM

aeubanks added reviewers: efriedma, craig.topper. May 12 2020, 4:31 PM

I have some surface level code style comments, please address those and land this if you agree.

The test case raises an interesting issue: how to handle musttail thunks. However, that will require verifier and maybe langref changes, so let's address that separately.

llvm/lib/Target/X86/X86MachineFunctionInfo.h
199	LLVM is inconsistent about the casing of names, but this class seems to consistently use leadingLowerCamelCase, so let's do the same here and below, unless you see a reason not to.
217	Generally, we try not to pass `SmallVector` by value. It is only small in the sense that we believe it contains few elements. Embedding storage in the vector makes it large, and expensive to copy. Generally to pass in a list of things, we tend to prefer `ArrayRef<size_t>`, so the caller can be flexible about the collection they are using, as long as it's a flat array.
221	Similarly, to give the caller a readonly view of the offsets, I would recommend ArrayRef.
llvm/test/CodeGen/X86/musttail-indirect.ll
58	This is interesting, we forgot about this when writing the LangRef. If you look at the previous inalloca test, it forwards its inalloca parameter directly to the musttail callee. It doesn't allocate new memory. That's the real use case for this `musttail` thing, and we probably need to bend the verifier rules to allow direct forwarding of preallocated arguments to a musttail call site. I think for now, committing this as is with a FIXME and coming back to it later would be fine. I assume that the verifier rejects it if you remove the setup and arg calls.
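
The `ArrayRef` advice for X86MachineFunctionInfo.h above can be illustrated with a minimal, self-contained sketch. `SizeArrayView` is a hypothetical stand-in for `llvm::ArrayRef<size_t>`, and `ArgOffsetStore` with its `setOffsets`/`getOffsets` members is likewise invented for the example; the real accessors in the patch may differ.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical minimal view in the spirit of llvm::ArrayRef: a pointer and a
// length, cheap to copy, never owning the elements.
class SizeArrayView {
public:
  SizeArrayView(const std::size_t *Ptr, std::size_t Count)
      : Data(Ptr), Length(Count) {}
  SizeArrayView(const std::vector<std::size_t> &V)
      : Data(V.data()), Length(V.size()) {}
  template <std::size_t N>
  SizeArrayView(const std::size_t (&Arr)[N]) : Data(Arr), Length(N) {}

  const std::size_t *begin() const { return Data; }
  const std::size_t *end() const { return Data + Length; }
  std::size_t size() const { return Length; }
  std::size_t operator[](std::size_t I) const {
    assert(I < Length && "index out of range");
    return Data[I];
  }

private:
  const std::size_t *Data;
  std::size_t Length;
};

// Hypothetical owner of the offsets. Callers pass any flat array as a view
// and get a read-only view back, so no container is copied by value at the
// interface.
class ArgOffsetStore {
public:
  void setOffsets(SizeArrayView Offsets) {
    Stored.assign(Offsets.begin(), Offsets.end());
  }
  SizeArrayView getOffsets() const { return Stored; }

private:
  std::vector<std::size_t> Stored;
};
```

The view returned by `getOffsets()` is only valid while the store is alive and unmodified, which is the same lifetime caveat that applies to `ArrayRef`.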

This revision is now accepted and ready to land. May 13 2020, 1:47 PM

efriedma added inline comments. May 13 2020, 2:40 PM

llvm/test/CodeGen/X86/musttail-indirect.ll
58	I don't think it makes sense to commit this test as-is. The verifier should reject the combination of musttail and a preallocated bundle. The llvm.call.preallocated.setup would have to allocate memory on top of the argument, and there's no reasonable way to prove that's safe. And yes, we probably need to bend the rules for the "preallocated" attribute to allow forwarding arguments in musttail calls.

Remove musttail tests and add TODOs
Address code review comments
Rebase

Herald added a subscriber: kuter. May 18 2020, 12:22 PM

aeubanks marked 4 inline comments as done. May 18 2020, 12:29 PM

aeubanks added inline comments.

llvm/test/CodeGen/X86/musttail-indirect.ll
58	Added verifier check in https://reviews.llvm.org/D80132. Is it ok to proceed with this and do the LangRef/codegen changes to support musttail and preallocated together in a later change?

Harbormaster failed remote builds in B57113: Diff 264696! May 18 2020, 1:01 PM

Closed by commit rG810567dc691a: [X86] Codegen for preallocated (authored by aeubanks). May 20 2020, 9:49 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

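Path	Size
llvm/include/llvm/CodeGen/ISDOpcodes.h	7 lines
llvm/include/llvm/CodeGen/TargetCallingConv.h	10 lines
llvm/include/llvm/CodeGen/TargetLowering.h	15 lines
llvm/include/llvm/IR/Argument.h	3 lines
llvm/include/llvm/IR/Attributes.h	3 lines
llvm/include/llvm/IR/InstrTypes.h	6 lines
llvm/include/llvm/Support/TargetOpcodes.def	6 lines
llvm/include/llvm/Target/Target.td	12 lines
llvm/include/llvm/Target/TargetCallingConv.td	5 lines
llvm/lib/CodeGen/GlobalISel/CallLowering.cpp	4 lines
llvm/lib/CodeGen/SelectionDAG/FastISel.cpp	11 lines
llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp	3 lines
llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp	79 lines
llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp	4 lines
llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp	6 lines
llvm/lib/IR/Attributes.cpp	4 lines
llvm/lib/IR/Function.cpp	6 lines
llvm/lib/Target/X86/X86CallingConv.td	3 lines
llvm/lib/Target/X86/X86FastISel.cpp	2 lines
llvm/lib/Target/X86/X86FrameLowering.cpp	8 lines
llvm/lib/Target/X86/X86ISelDAGToDAG.cpp	33 lines
llvm/lib/Target/X86/X86ISelLowering.cpp	53 lines
llvm/lib/Target/X86/X86MachineFunctionInfo.h	38 lines
llvm/lib/Target/X86/X86RegisterInfo.cpp	28 lines
llvm/lib/Transforms/Coroutines/CoroSplit.cpp	6 lines
llvm/lib/Transforms/IPO/Attributor.cpp	3 lines
llvm/lib/Transforms/IPO/AttributorAttributes.cpp	5 lines
llvm/lib/Transforms/IPO/DeadArgumentElimination.cpp	7 lines
llvm/lib/Transforms/IPO/FunctionAttrs.cpp	2 lines
llvm/lib/Transforms/IPO/GlobalOpt.cpp	1 line
llvm/lib/Transforms/InstCombine/InstCombineCalls.cpp	1 line
llvm/test/CodeGen/X86/arg-copy-elide.ll	14 lines
llvm/test/CodeGen/X86/musttail-indirect.ll	88 lines
llvm/test/CodeGen/X86/musttail-thiscall.ll	19 lines
llvm/test/CodeGen/X86/preallocated-nocall.ll	22 lines
llvm/test/CodeGen/X86/preallocated-x64.ll	17 lines
llvm/test/CodeGen/X86/preallocated.ll	187 lines
llvm/test/CodeGen/X86/shrink-wrap-chkstk.ll	3 lines
llvm/test/CodeGen/X86/tail-call-mutable-memarg.ll	15 lines
llvm/test/Transforms/Attributor/value-simplify.ll	34 lines
llvm/test/Transforms/DeadArgElim/keepalive.ll	21 lines
llvm/test/Transforms/DeadStoreElimination/MSSA/simple-todo.ll	10 lines
llvm/test/Transforms/DeadStoreElimination/MSSA/simple.ll	10 lines
llvm/test/Transforms/FunctionAttrs/readattrs.ll	6 lines
llvm/test/Transforms/GlobalOpt/fastcc.ll	15 lines
llvm/test/Transforms/InstCombine/call-cast-target-preallocated.ll	28 lines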

Diff 262784

llvm/include/llvm/CodeGen/ISDOpcodes.h

@@ enum NodeType {
   /// a source pointer, a SRCVALUE for the destination, and a SRCVALUE for the
   /// source.
   VACOPY,

   /// VAEND, VASTART - VAEND and VASTART have three operands: an input chain,
   /// pointer, and a SRCVALUE.
   VAEND, VASTART,

+  // PREALLOCATED_SETUP - This has 2 operands: an input chain and a SRCVALUE
+  // with the preallocated call Value.
+  PREALLOCATED_SETUP,
+  // PREALLOCATED_ARG - This has 3 operands: an input chain, a SRCVALUE
+  // with the preallocated call Value, and a constant int.
+  PREALLOCATED_ARG,
+
   /// SRCVALUE - This is a node type that holds a Value* that is used to
   /// make reference to a value in the LLVM IR.
   SRCVALUE,

   /// MDNODE_SDNODE - This is a node that holdes an MDNode*, which is used to
   /// reference metadata in the IR.
   MDNODE_SDNODE,

llvm/include/llvm/CodeGen/TargetCallingConv.h

@@ private:
   unsigned IsSExt : 1; ///< Sign extended
   unsigned IsInReg : 1; ///< Passed in register
   unsigned IsSRet : 1; ///< Hidden struct-ret ptr
   unsigned IsByVal : 1; ///< Struct passed by value
   unsigned IsNest : 1; ///< Nested fn static chain
   unsigned IsReturned : 1; ///< Always returned
   unsigned IsSplit : 1;
   unsigned IsInAlloca : 1; ///< Passed with inalloca
+  unsigned IsPreallocated : 1; ///< ByVal without the copy
   unsigned IsSplitEnd : 1; ///< Last part of a split
   unsigned IsSwiftSelf : 1; ///< Swift self parameter
   unsigned IsSwiftError : 1; ///< Swift error parameter
   unsigned IsCFGuardTarget : 1; ///< Control Flow Guard target
   unsigned IsHva : 1; ///< HVA field for
   unsigned IsHvaStart : 1; ///< HVA structure start
   unsigned IsSecArgPass : 1; ///< Second argument
   unsigned ByValAlign : 4; ///< Log 2 of byval alignment
   unsigned OrigAlign : 5; ///< Log 2 of original alignment
   unsigned IsInConsecutiveRegsLast : 1;
   unsigned IsInConsecutiveRegs : 1;
   unsigned IsCopyElisionCandidate : 1; ///< Argument copy elision candidate
   unsigned IsPointer : 1;

   unsigned ByValSize; ///< Byval struct size

   unsigned PointerAddrSpace; ///< Address space of pointer argument

 public:
   ArgFlagsTy()
       : IsZExt(0), IsSExt(0), IsInReg(0), IsSRet(0), IsByVal(0), IsNest(0),
-        IsReturned(0), IsSplit(0), IsInAlloca(0), IsSplitEnd(0),
-        IsSwiftSelf(0), IsSwiftError(0), IsCFGuardTarget(0), IsHva(0),
-        IsHvaStart(0), IsSecArgPass(0), ByValAlign(0), OrigAlign(0),
+        IsReturned(0), IsSplit(0), IsInAlloca(0), IsPreallocated(0),
+        IsSplitEnd(0), IsSwiftSelf(0), IsSwiftError(0), IsCFGuardTarget(0),
+        IsHva(0), IsHvaStart(0), IsSecArgPass(0), ByValAlign(0), OrigAlign(0),
         IsInConsecutiveRegsLast(0), IsInConsecutiveRegs(0),
         IsCopyElisionCandidate(0), IsPointer(0), ByValSize(0),
         PointerAddrSpace(0) {
     static_assert(sizeof(*this) == 3 * sizeof(unsigned), "flags are too big");
   }

   bool isZExt() const { return IsZExt; }
   void setZExt() { IsZExt = 1; }

   bool isSExt() const { return IsSExt; }
   void setSExt() { IsSExt = 1; }

   bool isInReg() const { return IsInReg; }
   void setInReg() { IsInReg = 1; }

   bool isSRet() const { return IsSRet; }
   void setSRet() { IsSRet = 1; }

   bool isByVal() const { return IsByVal; }
   void setByVal() { IsByVal = 1; }

   bool isInAlloca() const { return IsInAlloca; }
   void setInAlloca() { IsInAlloca = 1; }

+  bool isPreallocated() const { return IsPreallocated; }
+  void setPreallocated() { IsPreallocated = 1; }
+
   bool isSwiftSelf() const { return IsSwiftSelf; }
   void setSwiftSelf() { IsSwiftSelf = 1; }

   bool isSwiftError() const { return IsSwiftError; }
   void setSwiftError() { IsSwiftError = 1; }

   bool isCFGuardTarget() const { return IsCFGuardTarget; }
   void setCFGuardTarget() { IsCFGuardTarget = 1; }
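
To show why a new one-bit flag can be added without growing the structure, here is a hedged, much-reduced sketch of the same pattern. `MiniArgFlags` is a hypothetical type, not part of LLVM, and the size check assumes a mainstream ABI where three one-bit `unsigned` fields share one storage unit.

```cpp
#include <cassert>

// Hypothetical, much smaller cousin of ArgFlagsTy: one-bit flags packed into
// a single unsigned, with a compile-time guard on the total size.
struct MiniArgFlags {
private:
  unsigned IsByVal : 1;
  unsigned IsInAlloca : 1;
  unsigned IsPreallocated : 1;

public:
  MiniArgFlags() : IsByVal(0), IsInAlloca(0), IsPreallocated(0) {
    // Fails to compile if added fields ever spill past one unsigned.
    static_assert(sizeof(*this) == sizeof(unsigned), "flags are too big");
  }

  bool isByVal() const { return IsByVal; }
  void setByVal() { IsByVal = 1; }

  bool isInAlloca() const { return IsInAlloca; }
  void setInAlloca() { IsInAlloca = 1; }

  bool isPreallocated() const { return IsPreallocated; }
  void setPreallocated() { IsPreallocated = 1; }
};

// Usage: flags start cleared and are set independently of each other.
inline void miniArgFlagsExample() {
  MiniArgFlags Flags;
  assert(!Flags.isPreallocated());
  Flags.setPreallocated();
  assert(Flags.isPreallocated() && !Flags.isByVal());
}
```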

llvm/include/llvm/CodeGen/TargetLowering.h

@@ public:
   Type *Ty = nullptr;
   bool IsSExt : 1;
   bool IsZExt : 1;
   bool IsInReg : 1;
   bool IsSRet : 1;
   bool IsNest : 1;
   bool IsByVal : 1;
   bool IsInAlloca : 1;
+  bool IsPreallocated : 1;
   bool IsReturned : 1;
   bool IsSwiftSelf : 1;
   bool IsSwiftError : 1;
   bool IsCFGuardTarget : 1;
   MaybeAlign Alignment = None;
   Type *ByValType = nullptr;
+  Type *PreallocatedType = nullptr;

   ArgListEntry()
       : IsSExt(false), IsZExt(false), IsInReg(false), IsSRet(false),
-        IsNest(false), IsByVal(false), IsInAlloca(false), IsReturned(false),
-        IsSwiftSelf(false), IsSwiftError(false), IsCFGuardTarget(false) {}
+        IsNest(false), IsByVal(false), IsInAlloca(false),
+        IsPreallocated(false), IsReturned(false), IsSwiftSelf(false),
+        IsSwiftError(false), IsCFGuardTarget(false) {}

   void setAttributes(const CallBase *Call, unsigned ArgIdx);
 };
 using ArgListTy = std::vector<ArgListEntry>;

 virtual void markLibCallAttributes(MachineFunction *MF, unsigned CC,
                                    ArgListTy &Args) const {};

@@ struct CallLoweringInfo {
   bool RetSExt : 1;
   bool RetZExt : 1;
   bool IsVarArg : 1;
   bool IsInReg : 1;
   bool DoesNotReturn : 1;
   bool IsReturnValueUsed : 1;
   bool IsConvergent : 1;
   bool IsPatchPoint : 1;
+  bool IsPreallocated : 1;

   // IsTailCall should be modified by implementations of
   // TargetLowering::LowerCall that perform tail call conversions.
   bool IsTailCall = false;

   // Is Call lowering done post SelectionDAG type legalization.
   bool IsPostTypeLegalization = false;

   unsigned NumFixedArgs = -1;
   CallingConv::ID CallConv = CallingConv::C;
   SDValue Callee;
   ArgListTy Args;
   SelectionDAG &DAG;
   SDLoc DL;
   const CallBase *CB = nullptr;
   SmallVector<ISD::OutputArg, 32> Outs;
   SmallVector<SDValue, 32> OutVals;
   SmallVector<ISD::InputArg, 32> Ins;
   SmallVector<SDValue, 4> InVals;

   CallLoweringInfo(SelectionDAG &DAG)
       : RetSExt(false), RetZExt(false), IsVarArg(false), IsInReg(false),
         DoesNotReturn(false), IsReturnValueUsed(true), IsConvergent(false),
-        IsPatchPoint(false), DAG(DAG) {}
+        IsPatchPoint(false), IsPreallocated(false), DAG(DAG) {}

   CallLoweringInfo &setDebugLoc(const SDLoc &dl) {
     DL = dl;
     return *this;
   }

   CallLoweringInfo &setChain(SDValue InChain) {
     Chain = InChain;
@@ CallLoweringInfo &setZExtResult(bool Value = true) {
     return *this;
   }

   CallLoweringInfo &setIsPatchPoint(bool Value = true) {
     IsPatchPoint = Value;
     return *this;
   }

+  CallLoweringInfo &setIsPreallocated(bool Value = true) {
+    IsPreallocated = Value;
+    return *this;
+  }
+
   CallLoweringInfo &setIsPostTypeLegalization(bool Value=true) {
     IsPostTypeLegalization = Value;
     return *this;
   }

   ArgListTy &getArgs() {
     return Args;
   }
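
The new `setIsPreallocated` follows the chained-setter convention of the surrounding class: each setter returns `*this` so calls can be strung together. A minimal sketch of that convention, using a hypothetical `MiniCallLoweringInfo` that only borrows the member names from the diff:

```cpp
#include <cassert>

// Hypothetical reduction of CallLoweringInfo to three flags.
struct MiniCallLoweringInfo {
  bool IsPatchPoint : 1;
  bool IsPreallocated : 1;
  bool IsTailCall = false;

  MiniCallLoweringInfo() : IsPatchPoint(false), IsPreallocated(false) {}

  // Each setter returns the object itself so calls can be chained.
  MiniCallLoweringInfo &setIsPatchPoint(bool Value = true) {
    IsPatchPoint = Value;
    return *this;
  }

  MiniCallLoweringInfo &setIsPreallocated(bool Value = true) {
    IsPreallocated = Value;
    return *this;
  }

  MiniCallLoweringInfo &setTailCall(bool Value = true) {
    IsTailCall = Value;
    return *this;
  }
};

// Usage: configure several flags in one expression.
inline void miniCallLoweringInfoExample() {
  MiniCallLoweringInfo CLI;
  CLI.setIsPreallocated().setTailCall(false);
  assert(CLI.IsPreallocated && !CLI.IsPatchPoint && !CLI.IsTailCall);
}
```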

llvm/include/llvm/IR/Argument.h

@@ public:
   bool hasReturnedAttr() const;

   /// Return true if this argument has the readonly or readnone attribute.
   bool onlyReadsMemory() const;

   /// Return true if this argument has the inalloca attribute.
   bool hasInAllocaAttr() const;

+  /// Return true if this argument has the preallocated attribute.
+  bool hasPreallocatedAttr() const;
+
   /// Return true if this argument has the zext attribute.
   bool hasZExtAttr() const;

   /// Return true if this argument has the sext attribute.
   bool hasSExtAttr() const;

   /// Add attributes to an argument.
   void addAttrs(AttrBuilder &B);

llvm/include/llvm/IR/Attributes.h

@@ public:
   MaybeAlign getRetAlignment() const;

   /// Return the alignment for the specified function parameter.
   MaybeAlign getParamAlignment(unsigned ArgNo) const;

   /// Return the byval type for the specified function parameter.
   Type *getParamByValType(unsigned ArgNo) const;

+  /// Return the preallocated type for the specified function parameter.
+  Type *getParamPreallocatedType(unsigned ArgNo) const;
+
   /// Get the stack alignment.
   MaybeAlign getStackAlignment(unsigned Index) const;

   /// Get the number of dereferenceable bytes (or zero if unknown).
   uint64_t getDereferenceableBytes(unsigned Index) const;

   /// Get the number of dereferenceable bytes (or zero if unknown) of an
   /// arg.

llvm/include/llvm/IR/InstrTypes.h

@@ public:
   }

   /// Extract the byval type for a call or parameter.
   Type *getParamByValType(unsigned ArgNo) const {
     Type *Ty = Attrs.getParamByValType(ArgNo);
     return Ty ? Ty : getArgOperand(ArgNo)->getType()->getPointerElementType();
   }

+  /// Extract the preallocated type for a call or parameter.
+  Type *getParamPreallocatedType(unsigned ArgNo) const {
+    Type *Ty = Attrs.getParamPreallocatedType(ArgNo);
+    return Ty ? Ty : getArgOperand(ArgNo)->getType()->getPointerElementType();
+  }
+
   /// Extract the number of dereferenceable bytes for a call or
   /// parameter (0=unknown).
   uint64_t getDereferenceableBytes(unsigned i) const {
     return Attrs.getDereferenceableBytes(i);
   }

   /// Extract the number of dereferenceable_or_null bytes for a call or
   /// parameter (0=unknown).

llvm/include/llvm/Support/TargetOpcodes.def

 HANDLE_TARGET_OPCODE(PATCHPOINT)

 /// This pseudo-instruction loads the stack guard value. Targets which need
 /// to prevent the stack guard value or address from being spilled to the
 /// stack should override TargetLowering::emitLoadStackGuardNode and
 /// additionally expand this pseudo after register allocation.
 HANDLE_TARGET_OPCODE(LOAD_STACK_GUARD)

+/// These are used to support call sites that must have the stack adjusted
+/// before the call (e.g. to initialize an argument passed by value).
+/// See llvm.call.preallocated.{setup,arg} in the LangRef for more details.
+HANDLE_TARGET_OPCODE(PREALLOCATED_SETUP)
+HANDLE_TARGET_OPCODE(PREALLOCATED_ARG)
+
 /// Call instruction with associated vm state for deoptimization and list
 /// of live pointers for relocation by the garbage collector. It is
 /// intended to support garbage collection with fully precise relocating
 /// collectors and deoptimizations in either the callee or caller.
 HANDLE_TARGET_OPCODE(STATEPOINT)

 /// Instruction that records the offset of a local stack allocation passed to
 /// llvm.localescape. It has two arguments: the symbol for the label and the
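
`HANDLE_TARGET_OPCODE` entries are consumed with the X-macro technique: the same list is expanded several times under different macro definitions, so the enum and any derived tables stay in sync when opcodes such as `PREALLOCATED_SETUP` are added. The sketch below is a self-contained approximation. It uses a hypothetical list macro `MINI_OPCODE_LIST` in place of including TargetOpcodes.def, and `MiniOpcode` and `miniOpcodeName` are invented for the example.

```cpp
#include <cassert>
#include <string>

// Hypothetical opcode list; in LLVM the entries live in TargetOpcodes.def and
// are pulled in with #include rather than through a list macro.
#define MINI_OPCODE_LIST(HANDLE_TARGET_OPCODE) \
  HANDLE_TARGET_OPCODE(LOAD_STACK_GUARD) \
  HANDLE_TARGET_OPCODE(PREALLOCATED_SETUP) \
  HANDLE_TARGET_OPCODE(PREALLOCATED_ARG) \
  HANDLE_TARGET_OPCODE(STATEPOINT)

// First expansion: one enumerator per opcode.
enum MiniOpcode {
#define MINI_ENUM_ENTRY(OPC) OPC,
  MINI_OPCODE_LIST(MINI_ENUM_ENTRY)
#undef MINI_ENUM_ENTRY
  NUM_MINI_OPCODES
};

// Second expansion: the matching name lookup, generated from the same list.
inline std::string miniOpcodeName(MiniOpcode Opc) {
  switch (Opc) {
#define MINI_NAME_CASE(OPC) \
  case OPC: \
    return #OPC;
    MINI_OPCODE_LIST(MINI_NAME_CASE)
#undef MINI_NAME_CASE
  case NUM_MINI_OPCODES:
    break;
  }
  return "<unknown>";
}

// Usage: the two new opcodes are adjacent and carry their own names.
inline void miniOpcodeExample() {
  assert(PREALLOCATED_SETUP + 1 == PREALLOCATED_ARG);
  assert(miniOpcodeName(PREALLOCATED_ARG) == "PREALLOCATED_ARG");
}
```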

llvm/include/llvm/Target/Target.td

 def LOAD_STACK_GUARD : StandardPseudoInstruction {
   let OutOperandList = (outs ptr_rc:$dst);
   let InOperandList = (ins);
   let mayLoad = 1;
   bit isReMaterializable = 1;
   let hasSideEffects = 0;
   bit isPseudo = 1;
 }
+def PREALLOCATED_SETUP : StandardPseudoInstruction {
+  let OutOperandList = (outs);
+  let InOperandList = (ins i32imm:$a);
+  let usesCustomInserter = 1;
+  let hasSideEffects = 1;
+}
+def PREALLOCATED_ARG : StandardPseudoInstruction {
+  let OutOperandList = (outs ptr_rc:$loc);
+  let InOperandList = (ins i32imm:$a, i32imm:$b);
+  let usesCustomInserter = 1;
+  let hasSideEffects = 1;
+}
 def LOCAL_ESCAPE : StandardPseudoInstruction {
   // This instruction is really just a label. It has to be part of the chain so
   // that it doesn't get dropped from the DAG, but it produces nothing and has
   // no side effects.
   let OutOperandList = (outs);
   let InOperandList = (ins ptr_rc:$symbol, i32imm:$id);
   let hasSideEffects = 0;
   let hasCtrlDep = 1;

llvm/include/llvm/Target/TargetCallingConv.td

Show All 35 Lines	class CCIf<string predicate, CCAction A> : CCPredicateAction<A> {
string Predicate = predicate;		string Predicate = predicate;
}		}

/// CCIfByVal - If the current argument has ByVal parameter attribute, apply		/// CCIfByVal - If the current argument has ByVal parameter attribute, apply
/// Action A.		/// Action A.
class CCIfByVal<CCAction A> : CCIf<"ArgFlags.isByVal()", A> {		class CCIfByVal<CCAction A> : CCIf<"ArgFlags.isByVal()", A> {
}		}

		/// CCIfPreallocated - If the current argument has Preallocated parameter attribute,
		/// apply Action A.
		class CCIfPreallocated<CCAction A> : CCIf<"ArgFlags.isPreallocated()", A> {
		}

/// CCIfSwiftSelf - If the current argument has swiftself parameter attribute,		/// CCIfSwiftSelf - If the current argument has swiftself parameter attribute,
/// apply Action A.		/// apply Action A.
class CCIfSwiftSelf<CCAction A> : CCIf<"ArgFlags.isSwiftSelf()", A> {		class CCIfSwiftSelf<CCAction A> : CCIf<"ArgFlags.isSwiftSelf()", A> {
}		}

/// CCIfSwiftError - If the current argument has swifterror parameter attribute,		/// CCIfSwiftError - If the current argument has swifterror parameter attribute,
/// apply Action A.		/// apply Action A.
class CCIfSwiftError<CCAction A> : CCIf<"ArgFlags.isSwiftError()", A> {		class CCIfSwiftError<CCAction A> : CCIf<"ArgFlags.isSwiftError()", A> {
▲ Show 20 Lines • Show All 160 Lines • Show Last 20 Lines

llvm/lib/CodeGen/GlobalISel/CallLowering.cpp

Show First 20 Lines • Show All 90 Lines • ▼ Show 20 Lines	void CallLowering::setArgFlags(CallLowering::ArgInfo &Arg, unsigned OpIdx,
if (Attrs.hasAttribute(OpIdx, Attribute::StructRet))		if (Attrs.hasAttribute(OpIdx, Attribute::StructRet))
Flags.setSRet();		Flags.setSRet();
if (Attrs.hasAttribute(OpIdx, Attribute::SwiftSelf))		if (Attrs.hasAttribute(OpIdx, Attribute::SwiftSelf))
Flags.setSwiftSelf();		Flags.setSwiftSelf();
if (Attrs.hasAttribute(OpIdx, Attribute::SwiftError))		if (Attrs.hasAttribute(OpIdx, Attribute::SwiftError))
Flags.setSwiftError();		Flags.setSwiftError();
if (Attrs.hasAttribute(OpIdx, Attribute::ByVal))		if (Attrs.hasAttribute(OpIdx, Attribute::ByVal))
Flags.setByVal();		Flags.setByVal();
		if (Attrs.hasAttribute(OpIdx, Attribute::Preallocated))
		Flags.setPreallocated();
if (Attrs.hasAttribute(OpIdx, Attribute::InAlloca))		if (Attrs.hasAttribute(OpIdx, Attribute::InAlloca))
Flags.setInAlloca();		Flags.setInAlloca();

if (Flags.isByVal() || Flags.isInAlloca()) {		if (Flags.isByVal() || Flags.isInAlloca() || Flags.isPreallocated()) {
Type *ElementTy = cast<PointerType>(Arg.Ty)->getElementType();		Type *ElementTy = cast<PointerType>(Arg.Ty)->getElementType();

auto Ty = Attrs.getAttribute(OpIdx, Attribute::ByVal).getValueAsType();		auto Ty = Attrs.getAttribute(OpIdx, Attribute::ByVal).getValueAsType();
Flags.setByValSize(DL.getTypeAllocSize(Ty ? Ty : ElementTy));		Flags.setByValSize(DL.getTypeAllocSize(Ty ? Ty : ElementTy));

// For ByVal, alignment should be passed from FE. BE will guess if		// For ByVal, alignment should be passed from FE. BE will guess if
// this info is not there but there are cases it cannot get right.		// this info is not there but there are cases it cannot get right.
Align FrameAlign;		Align FrameAlign;
▲ Show 20 Lines • Show All 389 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/FastISel.cpp

Show First 20 Lines • Show All 1,208 Lines • ▼ Show 20 Lines	if (Arg.IsInAlloca) {
Flags.setInAlloca();		Flags.setInAlloca();
// Set the byval flag for CCAssignFn callbacks that don't know about		// Set the byval flag for CCAssignFn callbacks that don't know about
// inalloca. This way we can know how many bytes we should've allocated		// inalloca. This way we can know how many bytes we should've allocated
// and how many bytes a callee cleanup function will pop. If we port		// and how many bytes a callee cleanup function will pop. If we port
// inalloca to more targets, we'll have to add custom inalloca handling in		// inalloca to more targets, we'll have to add custom inalloca handling in
// the various CC lowering callbacks.		// the various CC lowering callbacks.
Flags.setByVal();		Flags.setByVal();
}		}
if (Arg.IsByVal || Arg.IsInAlloca) {		if (Arg.IsPreallocated) {
		Flags.setPreallocated();
		// Set the byval flag for CCAssignFn callbacks that don't know about
		// preallocated. This way we can know how many bytes we should've
		// allocated and how many bytes a callee cleanup function will pop. If we
		// port preallocated to more targets, we'll have to add custom
		// preallocated handling in the various CC lowering callbacks.
		Flags.setByVal();
		}
		if (Arg.IsByVal || Arg.IsInAlloca || Arg.IsPreallocated) {
PointerType *Ty = cast<PointerType>(Arg.Ty);		PointerType *Ty = cast<PointerType>(Arg.Ty);
Type *ElementTy = Ty->getElementType();		Type *ElementTy = Ty->getElementType();
unsigned FrameSize =		unsigned FrameSize =
DL.getTypeAllocSize(Arg.ByValType ? Arg.ByValType : ElementTy);		DL.getTypeAllocSize(Arg.ByValType ? Arg.ByValType : ElementTy);

// For ByVal, alignment should come from FE. BE will guess if this info		// For ByVal, alignment should come from FE. BE will guess if this info
// is not there, but there are cases it cannot get right.		// is not there, but there are cases it cannot get right.
MaybeAlign FrameAlign = Arg.Alignment;		MaybeAlign FrameAlign = Arg.Alignment;
▲ Show 20 Lines • Show All 1,287 Lines • Show Last 20 Lines
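
The comment added in this hunk says that a preallocated argument is also flagged byval so that CCAssignFn callbacks unaware of preallocated still account for its stack bytes. The following is a minimal standalone sketch of that idea, not LLVM code: `ArgFlags`, `setPreallocated`, and `stackBytesForByVal` are hypothetical names, and the 4-byte slot rounding is an assumption chosen to resemble `CCPassByVal<4, 4>`.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical, simplified model of per-argument flags; the real
// ISD::ArgFlagsTy carries many more fields.
struct ArgFlags {
  bool byVal = false;
  bool preallocated = false;
  std::uint32_t byValSize = 0; // bytes the argument occupies on the stack

  // Marking an argument preallocated also marks it byval, so code that only
  // checks the byval bit still sees its size.
  void setPreallocated(std::uint32_t size) {
    preallocated = true;
    byVal = true;
    byValSize = size;
  }
};

// A callback that knows nothing about preallocated: it sums byval sizes,
// rounding each one up to a 4-byte slot (assumed slot size).
std::uint32_t stackBytesForByVal(const std::vector<ArgFlags> &args) {
  std::uint32_t total = 0;
  for (const ArgFlags &f : args)
    if (f.byVal)
      total += (f.byValSize + 3u) & ~3u;
  return total;
}
```

Because the byval bit is set together with the preallocated bit, the size computation needs no preallocated-specific branch, which seems to be the trade-off the comment describes for targets that have not added custom handling.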

llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp

Show First 20 Lines • Show All 1,867 Lines • ▼ Show 20 Lines	SDValue SelectionDAG::getBlockAddress(const BlockAddress *BA, EVT VT,

auto *N = newSDNode<BlockAddressSDNode>(Opc, VT, BA, Offset, TargetFlags);		auto *N = newSDNode<BlockAddressSDNode>(Opc, VT, BA, Offset, TargetFlags);
CSEMap.InsertNode(N, IP);		CSEMap.InsertNode(N, IP);
InsertNode(N);		InsertNode(N);
return SDValue(N, 0);		return SDValue(N, 0);
}		}

SDValue SelectionDAG::getSrcValue(const Value *V) {		SDValue SelectionDAG::getSrcValue(const Value *V) {
assert((!V || V->getType()->isPointerTy()) &&
"SrcValue is not a pointer?");

FoldingSetNodeID ID;		FoldingSetNodeID ID;
AddNodeIDNode(ID, ISD::SRCVALUE, getVTList(MVT::Other), None);		AddNodeIDNode(ID, ISD::SRCVALUE, getVTList(MVT::Other), None);
ID.AddPointer(V);		ID.AddPointer(V);

void *IP = nullptr;		void *IP = nullptr;
if (SDNode *E = FindNodeOrInsertPos(ID, IP))		if (SDNode *E = FindNodeOrInsertPos(ID, IP))
return SDValue(E, 0);		return SDValue(E, 0);

▲ Show 20 Lines • Show All 7,981 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

Show First 20 Lines • Show All 5,601 Lines • ▼ Show 20 Lines	void SelectionDAGBuilder::lowerCallToExternalSymbol(const CallInst &I,
const char *FunctionName) {		const char *FunctionName) {
assert(FunctionName && "FunctionName must not be nullptr");		assert(FunctionName && "FunctionName must not be nullptr");
SDValue Callee = DAG.getExternalSymbol(		SDValue Callee = DAG.getExternalSymbol(
FunctionName,		FunctionName,
DAG.getTargetLoweringInfo().getPointerTy(DAG.getDataLayout()));		DAG.getTargetLoweringInfo().getPointerTy(DAG.getDataLayout()));
LowerCallTo(I, Callee, I.isTailCall());		LowerCallTo(I, Callee, I.isTailCall());
}		}

		/// Given a @llvm.call.preallocated.setup, return the corresponding
		/// preallocated call.
		static const CallBase *FindPreallocatedCall(const Value *PreallocatedSetup) {
		assert(cast<CallBase>(PreallocatedSetup)
		->getCalledFunction()
		->getIntrinsicID() == Intrinsic::call_preallocated_setup &&
		"expected call_preallocated_setup Value");
		for (auto *U : PreallocatedSetup->users()) {
		auto *UseCall = cast<CallBase>(U);
		const Function *Fn = UseCall->getCalledFunction();
		if (!Fn || Fn->getIntrinsicID() != Intrinsic::call_preallocated_arg) {
		return UseCall;
		}
		}
		llvm_unreachable("expected corresponding call to preallocated setup/arg");
		}

/// Lower the call to the specified intrinsic function.		/// Lower the call to the specified intrinsic function.
void SelectionDAGBuilder::visitIntrinsicCall(const CallInst &I,		void SelectionDAGBuilder::visitIntrinsicCall(const CallInst &I,
unsigned Intrinsic) {		unsigned Intrinsic) {
const TargetLowering &TLI = DAG.getTargetLoweringInfo();		const TargetLowering &TLI = DAG.getTargetLoweringInfo();
SDLoc sdl = getCurSDLoc();		SDLoc sdl = getCurSDLoc();
DebugLoc dl = getCurDebugLoc();		DebugLoc dl = getCurDebugLoc();
SDValue Res;		SDValue Res;

▲ Show 20 Lines • Show All 176 Lines • ▼ Show 20 Lines	case Intrinsic::memset_element_unordered_atomic: {
unsigned ElemSz = MI.getElementSizeInBytes();		unsigned ElemSz = MI.getElementSizeInBytes();
bool isTC = I.isTailCall() && isInTailCallPosition(I, DAG.getTarget());		bool isTC = I.isTailCall() && isInTailCallPosition(I, DAG.getTarget());
SDValue MC = DAG.getAtomicMemset(getRoot(), sdl, Dst, DstAlign, Val, Length,		SDValue MC = DAG.getAtomicMemset(getRoot(), sdl, Dst, DstAlign, Val, Length,
LengthTy, ElemSz, isTC,		LengthTy, ElemSz, isTC,
MachinePointerInfo(MI.getRawDest()));		MachinePointerInfo(MI.getRawDest()));
updateDAGForMaybeTailCall(MC);		updateDAGForMaybeTailCall(MC);
return;		return;
}		}
		case Intrinsic::call_preallocated_setup: {
		const CallBase *PreallocatedCall = FindPreallocatedCall(&I);
		SDValue SrcValue = DAG.getSrcValue(PreallocatedCall);
		SDValue Res = DAG.getNode(ISD::PREALLOCATED_SETUP, sdl, MVT::Other,
		getRoot(), SrcValue);
		setValue(&I, Res);
		DAG.setRoot(Res);
		return;
		}
		rnk: The common code here seems to be this search of the use list to find the call that consumes the setup. I'd suggest factoring that out into a helper. Once that is done, it seems like this code might be simpler if you separate the setup and arg cases.
		case Intrinsic::call_preallocated_arg: {
		const CallBase *PreallocatedCall = FindPreallocatedCall(I.getOperand(0));
		SDValue SrcValue = DAG.getSrcValue(PreallocatedCall);
		SDValue Ops[3];
		Ops[0] = getRoot();
		Ops[1] = SrcValue;
		Ops[2] = DAG.getTargetConstant(*cast<ConstantInt>(I.getArgOperand(1)), sdl,
		MVT::i32); // arg index
		SDValue Res = DAG.getNode(
		ISD::PREALLOCATED_ARG, sdl,
		DAG.getVTList(TLI.getPointerTy(DAG.getDataLayout()), MVT::Other), Ops);
		setValue(&I, Res);
		DAG.setRoot(Res.getValue(1));
		return;
		}
case Intrinsic::dbg_addr:		case Intrinsic::dbg_addr:
case Intrinsic::dbg_declare: {		case Intrinsic::dbg_declare: {
const auto &DI = cast<DbgVariableIntrinsic>(I);		const auto &DI = cast<DbgVariableIntrinsic>(I);
DILocalVariable *Variable = DI.getVariable();		DILocalVariable *Variable = DI.getVariable();
DIExpression *Expression = DI.getExpression();		DIExpression *Expression = DI.getExpression();
dropDanglingDebugInfo(Variable, Expression);		dropDanglingDebugInfo(Variable, Expression);
assert(Variable && "Missing variable");		assert(Variable && "Missing variable");
LLVM_DEBUG(dbgs() << "SelectionDAG visiting debug intrinsic: " << DI		LLVM_DEBUG(dbgs() << "SelectionDAG visiting debug intrinsic: " << DI
▲ Show 20 Lines • Show All 1,304 Lines • ▼ Show 20 Lines	void SelectionDAGBuilder::LowerCallTo(const CallBase &CB, SDValue Callee,
if (TLI.supportSwiftError() && SwiftErrorVal)		if (TLI.supportSwiftError() && SwiftErrorVal)
isTailCall = false;		isTailCall = false;

TargetLowering::CallLoweringInfo CLI(DAG);		TargetLowering::CallLoweringInfo CLI(DAG);
CLI.setDebugLoc(getCurSDLoc())		CLI.setDebugLoc(getCurSDLoc())
.setChain(getRoot())		.setChain(getRoot())
.setCallee(RetTy, FTy, Callee, std::move(Args), CB)		.setCallee(RetTy, FTy, Callee, std::move(Args), CB)
.setTailCall(isTailCall)		.setTailCall(isTailCall)
.setConvergent(CB.isConvergent());		.setConvergent(CB.isConvergent())
		.setIsPreallocated(
		CB.countOperandBundlesOfType(LLVMContext::OB_preallocated) != 0);
std::pair<SDValue, SDValue> Result = lowerInvokable(CLI, EHPadBB);		std::pair<SDValue, SDValue> Result = lowerInvokable(CLI, EHPadBB);

if (Result.first.getNode()) {		if (Result.first.getNode()) {
Result.first = lowerRangeToAssertZExt(DAG, CB, Result.first);		Result.first = lowerRangeToAssertZExt(DAG, CB, Result.first);
setValue(&CB, Result.first);		setValue(&CB, Result.first);
}		}

// The last element of CLI.InVals has the SDValue for swifterror return.		// The last element of CLI.InVals has the SDValue for swifterror return.
▲ Show 20 Lines • Show All 509 Lines • ▼ Show 20 Lines	if (!I.isNoBuiltin() && !I.isStrictFP() && !F->hasLocalLinkage() &&
break;		break;
}		}
}		}
}		}

// Deopt bundles are lowered in LowerCallSiteWithDeoptBundle, and we don't		// Deopt bundles are lowered in LowerCallSiteWithDeoptBundle, and we don't
// have to do anything here to lower funclet bundles.		// have to do anything here to lower funclet bundles.
// CFGuardTarget bundles are lowered in LowerCallTo.		// CFGuardTarget bundles are lowered in LowerCallTo.
assert(!I.hasOperandBundlesOtherThan({LLVMContext::OB_deopt,		assert(!I.hasOperandBundlesOtherThan(
LLVMContext::OB_funclet,		{LLVMContext::OB_deopt, LLVMContext::OB_funclet,
LLVMContext::OB_cfguardtarget}) &&		LLVMContext::OB_cfguardtarget, LLVMContext::OB_preallocated}) &&
"Cannot lower calls with arbitrary operand bundles!");		"Cannot lower calls with arbitrary operand bundles!");

SDValue Callee = getValue(I.getCalledOperand());		SDValue Callee = getValue(I.getCalledOperand());

if (I.countOperandBundlesOfType(LLVMContext::OB_deopt))		if (I.countOperandBundlesOfType(LLVMContext::OB_deopt))
LowerCallSiteWithDeoptBundle(&I, Callee, nullptr);		LowerCallSiteWithDeoptBundle(&I, Callee, nullptr);
else		else
// Check if we can potentially perform a tail call. More detailed checking		// Check if we can potentially perform a tail call. More detailed checking
▲ Show 20 Lines • Show All 944 Lines • ▼ Show 20 Lines	for (unsigned ArgI = ArgIdx, ArgE = ArgIdx + NumArgs;
Entry.setAttributes(Call, ArgI);		Entry.setAttributes(Call, ArgI);
Args.push_back(Entry);		Args.push_back(Entry);
}		}

CLI.setDebugLoc(getCurSDLoc())		CLI.setDebugLoc(getCurSDLoc())
.setChain(getRoot())		.setChain(getRoot())
.setCallee(Call->getCallingConv(), ReturnTy, Callee, std::move(Args))		.setCallee(Call->getCallingConv(), ReturnTy, Callee, std::move(Args))
.setDiscardResult(Call->use_empty())		.setDiscardResult(Call->use_empty())
.setIsPatchPoint(IsPatchPoint);		.setIsPatchPoint(IsPatchPoint)
		.setIsPreallocated(
		Call->countOperandBundlesOfType(LLVMContext::OB_preallocated) != 0);
}		}

/// Add a stack map intrinsic call's live variable operands to a stackmap		/// Add a stack map intrinsic call's live variable operands to a stackmap
/// or patchpoint target node's operand list.		/// or patchpoint target node's operand list.
///		///
/// Constants are converted to TargetConstants purely as an optimization to		/// Constants are converted to TargetConstants purely as an optimization to
/// avoid constant materialization and register allocation.		/// avoid constant materialization and register allocation.
///		///
▲ Show 20 Lines • Show All 503 Lines • ▼ Show 20 Lines	for (unsigned Value = 0, NumValues = ValueVTs.size(); Value != NumValues;
if (Args[i].IsSwiftSelf)		if (Args[i].IsSwiftSelf)
Flags.setSwiftSelf();		Flags.setSwiftSelf();
if (Args[i].IsSwiftError)		if (Args[i].IsSwiftError)
Flags.setSwiftError();		Flags.setSwiftError();
if (Args[i].IsCFGuardTarget)		if (Args[i].IsCFGuardTarget)
Flags.setCFGuardTarget();		Flags.setCFGuardTarget();
if (Args[i].IsByVal)		if (Args[i].IsByVal)
Flags.setByVal();		Flags.setByVal();
		if (Args[i].IsPreallocated) {
		Flags.setPreallocated();
		// Set the byval flag for CCAssignFn callbacks that don't know about
		// preallocated. This way we can know how many bytes we should've
		// allocated and how many bytes a callee cleanup function will pop. If
		// we port preallocated to more targets, we'll have to add custom
		// preallocated handling in the various CC lowering callbacks.
		Flags.setByVal();
		}
if (Args[i].IsInAlloca) {		if (Args[i].IsInAlloca) {
Flags.setInAlloca();		Flags.setInAlloca();
// Set the byval flag for CCAssignFn callbacks that don't know about		// Set the byval flag for CCAssignFn callbacks that don't know about
// inalloca. This way we can know how many bytes we should've allocated		// inalloca. This way we can know how many bytes we should've allocated
// and how many bytes a callee cleanup function will pop. If we port		// and how many bytes a callee cleanup function will pop. If we port
// inalloca to more targets, we'll have to add custom inalloca handling		// inalloca to more targets, we'll have to add custom inalloca handling
// in the various CC lowering callbacks.		// in the various CC lowering callbacks.
Flags.setByVal();		Flags.setByVal();
}		}
if (Args[i].IsByVal || Args[i].IsInAlloca) {		if (Args[i].IsByVal || Args[i].IsInAlloca || Args[i].IsPreallocated) {
PointerType *Ty = cast<PointerType>(Args[i].Ty);		PointerType *Ty = cast<PointerType>(Args[i].Ty);
Type *ElementTy = Ty->getElementType();		Type *ElementTy = Ty->getElementType();

unsigned FrameSize = DL.getTypeAllocSize(		unsigned FrameSize = DL.getTypeAllocSize(
Args[i].ByValType ? Args[i].ByValType : ElementTy);		Args[i].ByValType ? Args[i].ByValType : ElementTy);
Flags.setByValSize(FrameSize);		Flags.setByValSize(FrameSize);

// info is not there but there are cases it cannot get right.		// info is not there but there are cases it cannot get right.
▲ Show 20 Lines • Show All 297 Lines • ▼ Show 20 Lines	for (const Instruction &I : FuncInfo->Fn->getEntryBlock()) {
// Skip allocas that have been initialized or clobbered.		// Skip allocas that have been initialized or clobbered.
if (*Info != StaticAllocaInfo::Unknown)		if (*Info != StaticAllocaInfo::Unknown)
continue;		continue;

// Check if the stored value is an argument, and that this store fully		// Check if the stored value is an argument, and that this store fully
// initializes the alloca. Don't elide copies from the same argument twice.		// initializes the alloca. Don't elide copies from the same argument twice.
const Value *Val = SI->getValueOperand()->stripPointerCasts();		const Value *Val = SI->getValueOperand()->stripPointerCasts();
const auto *Arg = dyn_cast<Argument>(Val);		const auto *Arg = dyn_cast<Argument>(Val);
if (!Arg || Arg->hasInAllocaAttr() || Arg->hasByValAttr() ||		if (!Arg || Arg->hasPassPointeeByValueAttr() ||
Arg->getType()->isEmptyTy() ||		Arg->getType()->isEmptyTy() ||
DL.getTypeStoreSize(Arg->getType()) !=		DL.getTypeStoreSize(Arg->getType()) !=
DL.getTypeAllocSize(AI->getAllocatedType()) ||		DL.getTypeAllocSize(AI->getAllocatedType()) ||
ArgCopyElisionCandidates.count(Arg)) {		ArgCopyElisionCandidates.count(Arg)) {
*Info = StaticAllocaInfo::Clobbered;		*Info = StaticAllocaInfo::Clobbered;
continue;		continue;
}		}

▲ Show 20 Lines • Show All 170 Lines • ▼ Show 20 Lines	for (unsigned Value = 0, NumValues = ValueVTs.size();
Flags.setInAlloca();		Flags.setInAlloca();
// Set the byval flag for CCAssignFn callbacks that don't know about		// Set the byval flag for CCAssignFn callbacks that don't know about
// inalloca. This way we can know how many bytes we should've allocated		// inalloca. This way we can know how many bytes we should've allocated
// and how many bytes a callee cleanup function will pop. If we port		// and how many bytes a callee cleanup function will pop. If we port
// inalloca to more targets, we'll have to add custom inalloca handling		// inalloca to more targets, we'll have to add custom inalloca handling
// in the various CC lowering callbacks.		// in the various CC lowering callbacks.
Flags.setByVal();		Flags.setByVal();
}		}
		if (Arg.hasAttribute(Attribute::Preallocated)) {
		Flags.setPreallocated();
		// Set the byval flag for CCAssignFn callbacks that don't know about
		// preallocated. This way we can know how many bytes we should've
		// allocated and how many bytes a callee cleanup function will pop. If
		// we port preallocated to more targets, we'll have to add custom
		// preallocated handling in the various CC lowering callbacks.
		Flags.setByVal();
		}
if (F.getCallingConv() == CallingConv::X86_INTR) {		if (F.getCallingConv() == CallingConv::X86_INTR) {
// IA Interrupt passes frame (1st parameter) by value in the stack.		// IA Interrupt passes frame (1st parameter) by value in the stack.
if (ArgNo == 0)		if (ArgNo == 0)
Flags.setByVal();		Flags.setByVal();
}		}
if (Flags.isByVal() || Flags.isInAlloca()) {		if (Flags.isByVal() || Flags.isInAlloca() || Flags.isPreallocated()) {
Type *ElementTy = Arg.getParamByValType();		Type *ElementTy = Arg.getParamByValType();

// For ByVal, size and alignment should be passed from FE. BE will		// For ByVal, size and alignment should be passed from FE. BE will
// guess if this info is not there but there are cases it cannot get		// guess if this info is not there but there are cases it cannot get
// right.		// right.
unsigned FrameSize = DL.getTypeAllocSize(Arg.getParamByValType());		unsigned FrameSize = DL.getTypeAllocSize(Arg.getParamByValType());
Flags.setByValSize(FrameSize);		Flags.setByValSize(FrameSize);

▲ Show 20 Lines • Show All 915 Lines • Show Last 20 Lines
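
The helper `FindPreallocatedCall` in this file walks the users of the setup token and returns the first one that is not a `call_preallocated_arg` intrinsic. A minimal standalone sketch of that search follows. It is not the LLVM code: `Node`, `Kind`, and `findConsumingCall` are hypothetical stand-ins for `Value`/`CallBase`, the intrinsic-ID check, and the helper, and it assumes a setup token has exactly one user that is not an arg node.

```cpp
#include <cassert>
#include <vector>

// Hypothetical stand-ins for IR values; not the real llvm::Value API.
enum class Kind { Setup, Arg, Call };

struct Node {
  Kind kind;
  std::vector<const Node *> users; // values that use this node
};

// Return the first user of a setup node that is not an "arg" node.
// Arg nodes are skipped; the remaining user is taken to be the call
// that consumes the setup token.
const Node *findConsumingCall(const Node &setup) {
  assert(setup.kind == Kind::Setup && "expected a setup node");
  for (const Node *user : setup.users)
    if (user->kind != Kind::Arg)
      return user;
  return nullptr; // the helper in the diff treats this case as unreachable
}
```

Returning a null pointer instead of aborting is a choice made only for this sketch, so the no-consumer case can be observed; the helper in the diff uses `llvm_unreachable` there.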

llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp

Show First 20 Lines • Show All 387 Lines • ▼ Show 20 Lines	#endif
case ISD::TRAP: return "trap";		case ISD::TRAP: return "trap";
case ISD::DEBUGTRAP: return "debugtrap";		case ISD::DEBUGTRAP: return "debugtrap";
case ISD::LIFETIME_START: return "lifetime.start";		case ISD::LIFETIME_START: return "lifetime.start";
case ISD::LIFETIME_END: return "lifetime.end";		case ISD::LIFETIME_END: return "lifetime.end";
case ISD::GC_TRANSITION_START: return "gc_transition.start";		case ISD::GC_TRANSITION_START: return "gc_transition.start";
case ISD::GC_TRANSITION_END: return "gc_transition.end";		case ISD::GC_TRANSITION_END: return "gc_transition.end";
case ISD::GET_DYNAMIC_AREA_OFFSET: return "get.dynamic.area.offset";		case ISD::GET_DYNAMIC_AREA_OFFSET: return "get.dynamic.area.offset";
case ISD::FREEZE: return "freeze";		case ISD::FREEZE: return "freeze";
		case ISD::PREALLOCATED_SETUP:
		return "call_setup";
		case ISD::PREALLOCATED_ARG:
		return "call_alloc";

// Bit manipulation		// Bit manipulation
case ISD::ABS: return "abs";		case ISD::ABS: return "abs";
case ISD::BITREVERSE: return "bitreverse";		case ISD::BITREVERSE: return "bitreverse";
case ISD::BSWAP: return "bswap";		case ISD::BSWAP: return "bswap";
case ISD::CTPOP: return "ctpop";		case ISD::CTPOP: return "ctpop";
case ISD::CTTZ: return "cttz";		case ISD::CTTZ: return "cttz";
case ISD::CTTZ_ZERO_UNDEF: return "cttz_zero_undef";		case ISD::CTTZ_ZERO_UNDEF: return "cttz_zero_undef";
▲ Show 20 Lines • Show All 583 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp

	Show First 20 Lines • Show All 104 Lines • ▼ Show 20 Lines
	void TargetLoweringBase::ArgListEntry::setAttributes(const CallBase *Call,			void TargetLoweringBase::ArgListEntry::setAttributes(const CallBase *Call,
	unsigned ArgIdx) {			unsigned ArgIdx) {
	IsSExt = Call->paramHasAttr(ArgIdx, Attribute::SExt);			IsSExt = Call->paramHasAttr(ArgIdx, Attribute::SExt);
	IsZExt = Call->paramHasAttr(ArgIdx, Attribute::ZExt);			IsZExt = Call->paramHasAttr(ArgIdx, Attribute::ZExt);
	IsInReg = Call->paramHasAttr(ArgIdx, Attribute::InReg);			IsInReg = Call->paramHasAttr(ArgIdx, Attribute::InReg);
	IsSRet = Call->paramHasAttr(ArgIdx, Attribute::StructRet);			IsSRet = Call->paramHasAttr(ArgIdx, Attribute::StructRet);
	IsNest = Call->paramHasAttr(ArgIdx, Attribute::Nest);			IsNest = Call->paramHasAttr(ArgIdx, Attribute::Nest);
	IsByVal = Call->paramHasAttr(ArgIdx, Attribute::ByVal);			IsByVal = Call->paramHasAttr(ArgIdx, Attribute::ByVal);
				IsPreallocated = Call->paramHasAttr(ArgIdx, Attribute::Preallocated);
	IsInAlloca = Call->paramHasAttr(ArgIdx, Attribute::InAlloca);			IsInAlloca = Call->paramHasAttr(ArgIdx, Attribute::InAlloca);
	IsReturned = Call->paramHasAttr(ArgIdx, Attribute::Returned);			IsReturned = Call->paramHasAttr(ArgIdx, Attribute::Returned);
	IsSwiftSelf = Call->paramHasAttr(ArgIdx, Attribute::SwiftSelf);			IsSwiftSelf = Call->paramHasAttr(ArgIdx, Attribute::SwiftSelf);
	IsSwiftError = Call->paramHasAttr(ArgIdx, Attribute::SwiftError);			IsSwiftError = Call->paramHasAttr(ArgIdx, Attribute::SwiftError);
	Alignment = Call->getParamAlign(ArgIdx);			Alignment = Call->getParamAlign(ArgIdx);
	ByValType = nullptr;			ByValType = nullptr;
	if (Call->paramHasAttr(ArgIdx, Attribute::ByVal))			if (IsByVal)
	ByValType = Call->getParamByValType(ArgIdx);			ByValType = Call->getParamByValType(ArgIdx);
				PreallocatedType = nullptr;
				if (IsPreallocated)
				PreallocatedType = Call->getParamPreallocatedType(ArgIdx);
	}			}

	/// Generate a libcall taking the given operands as arguments and returning a			/// Generate a libcall taking the given operands as arguments and returning a
	/// result of type RetVT.			/// result of type RetVT.
	std::pair<SDValue, SDValue>			std::pair<SDValue, SDValue>
	TargetLowering::makeLibCall(SelectionDAG &DAG, RTLIB::Libcall LC, EVT RetVT,			TargetLowering::makeLibCall(SelectionDAG &DAG, RTLIB::Libcall LC, EVT RetVT,
	ArrayRef<SDValue> Ops,			ArrayRef<SDValue> Ops,
	MakeLibCallOptions CallOptions,			MakeLibCallOptions CallOptions,
	▲ Show 20 Lines • Show All 7,629 Lines • Show Last 20 Lines

llvm/lib/IR/Attributes.cpp

	Show First 20 Lines • Show All 1,427 Lines • ▼ Show 20 Lines
	MaybeAlign AttributeList::getParamAlignment(unsigned ArgNo) const {			MaybeAlign AttributeList::getParamAlignment(unsigned ArgNo) const {
	return getAttributes(ArgNo + FirstArgIndex).getAlignment();			return getAttributes(ArgNo + FirstArgIndex).getAlignment();
	}			}

	Type *AttributeList::getParamByValType(unsigned Index) const {			Type *AttributeList::getParamByValType(unsigned Index) const {
	return getAttributes(Index+FirstArgIndex).getByValType();			return getAttributes(Index+FirstArgIndex).getByValType();
	}			}

				Type *AttributeList::getParamPreallocatedType(unsigned Index) const {
				return getAttributes(Index + FirstArgIndex).getPreallocatedType();
				}

	MaybeAlign AttributeList::getStackAlignment(unsigned Index) const {			MaybeAlign AttributeList::getStackAlignment(unsigned Index) const {
	return getAttributes(Index).getStackAlignment();			return getAttributes(Index).getStackAlignment();
	}			}

	uint64_t AttributeList::getDereferenceableBytes(unsigned Index) const {			uint64_t AttributeList::getDereferenceableBytes(unsigned Index) const {
	return getAttributes(Index).getDereferenceableBytes();			return getAttributes(Index).getDereferenceableBytes();
	}			}

	▲ Show 20 Lines • Show All 549 Lines • Show Last 20 Lines

llvm/lib/IR/Function.cpp

Show First 20 Lines • Show All 108 Lines • ▼ Show 20 Lines	bool Argument::hasSwiftErrorAttr() const {
return getParent()->hasParamAttribute(getArgNo(), Attribute::SwiftError);		return getParent()->hasParamAttribute(getArgNo(), Attribute::SwiftError);
}		}

bool Argument::hasInAllocaAttr() const {		bool Argument::hasInAllocaAttr() const {
if (!getType()->isPointerTy()) return false;		if (!getType()->isPointerTy()) return false;
return hasAttribute(Attribute::InAlloca);		return hasAttribute(Attribute::InAlloca);
}		}

		bool Argument::hasPreallocatedAttr() const {
		if (!getType()->isPointerTy())
		return false;
		return hasAttribute(Attribute::Preallocated);
		}

bool Argument::hasPassPointeeByValueAttr() const {		bool Argument::hasPassPointeeByValueAttr() const {
if (!getType()->isPointerTy()) return false;		if (!getType()->isPointerTy()) return false;
AttributeList Attrs = getParent()->getAttributes();		AttributeList Attrs = getParent()->getAttributes();
return Attrs.hasParamAttribute(getArgNo(), Attribute::ByVal) ||		return Attrs.hasParamAttribute(getArgNo(), Attribute::ByVal) ||
Attrs.hasParamAttribute(getArgNo(), Attribute::InAlloca) ||		Attrs.hasParamAttribute(getArgNo(), Attribute::InAlloca) ||
Attrs.hasParamAttribute(getArgNo(), Attribute::Preallocated);		Attrs.hasParamAttribute(getArgNo(), Attribute::Preallocated);
}		}

▲ Show 20 Lines • Show All 1,526 Lines • Show Last 20 Lines

llvm/lib/Target/X86/X86CallingConv.td

Show First 20 Lines • Show All 783 Lines • ▼ Show 20 Lines	CCIfNotVarArg<CCIfType<[v64i8, v32i16, v16i32, v8i64, v16f32, v8f64],
CCAssignToReg<[ZMM0, ZMM1, ZMM2, ZMM3]>>>,		CCAssignToReg<[ZMM0, ZMM1, ZMM2, ZMM3]>>>,

CCDelegateTo<CC_X86_32_Vector_Common>		CCDelegateTo<CC_X86_32_Vector_Common>
]>;		]>;

/// CC_X86_32_Common - In all X86-32 calling conventions, extra integers and FP		/// CC_X86_32_Common - In all X86-32 calling conventions, extra integers and FP
/// values are spilled on the stack.		/// values are spilled on the stack.
def CC_X86_32_Common : CallingConv<[		def CC_X86_32_Common : CallingConv<[
// Handles byval parameters.		// Handles byval/preallocated parameters.
CCIfByVal<CCPassByVal<4, 4>>,		CCIfByVal<CCPassByVal<4, 4>>,
		CCIfPreallocated<CCPassByVal<4, 4>>,

// The first 3 float or double arguments, if marked 'inreg' and if the call		// The first 3 float or double arguments, if marked 'inreg' and if the call
// is not a vararg call and if SSE2 is available, are passed in SSE registers.		// is not a vararg call and if SSE2 is available, are passed in SSE registers.
CCIfNotVarArg<CCIfInReg<CCIfType<[f32,f64],		CCIfNotVarArg<CCIfInReg<CCIfType<[f32,f64],
CCIfSubtarget<"hasSSE2()",		CCIfSubtarget<"hasSSE2()",
CCAssignToReg<[XMM0,XMM1,XMM2]>>>>>,		CCAssignToReg<[XMM0,XMM1,XMM2]>>>>>,

// The first 3 __m64 vector arguments are passed in mmx registers if the		// The first 3 __m64 vector arguments are passed in mmx registers if the
▲ Show 20 Lines • Show All 368 Lines • Show Last 20 Lines

llvm/lib/Target/X86/X86FastISel.cpp

Show First 20 Lines • Show All 3,239 Lines • ▼ Show 20 Lines	bool X86FastISel::fastLowerCall(CallLoweringInfo &CLI) {
if (IsVarArg && IsWin64)		if (IsVarArg && IsWin64)
return false;		return false;

// Don't know about inalloca yet.		// Don't know about inalloca yet.
if (CLI.CB && CLI.CB->hasInAllocaArgument())		if (CLI.CB && CLI.CB->hasInAllocaArgument())
return false;		return false;

for (auto Flag : CLI.OutFlags)		for (auto Flag : CLI.OutFlags)
if (Flag.isSwiftError())		if (Flag.isSwiftError() || Flag.isPreallocated())
return false;		return false;

SmallVector<MVT, 16> OutVTs;		SmallVector<MVT, 16> OutVTs;
SmallVector<unsigned, 16> ArgRegs;		SmallVector<unsigned, 16> ArgRegs;

// If this is a constant i1/i8/i16 argument, promote to i32 to avoid an extra		// If this is a constant i1/i8/i16 argument, promote to i32 to avoid an extra
// instruction. This is safe because it is common to all FastISel supported		// instruction. This is safe because it is common to all FastISel supported
// calling conventions on x86.		// calling conventions on x86.

llvm/lib/Target/X86/X86FrameLowering.cpp

Show First 20 Lines • Show All 51 Lines • ▼ Show 20 Lines	X86FrameLowering::X86FrameLowering(const X86Subtarget &STI,
IsLP64 = STI.isTarget64BitLP64();		IsLP64 = STI.isTarget64BitLP64();
// standard x86_64 and NaCl use 64-bit frame/stack pointers, x32 - 32-bit.		// standard x86_64 and NaCl use 64-bit frame/stack pointers, x32 - 32-bit.
Uses64BitFramePtr = STI.isTarget64BitLP64() || STI.isTargetNaCl64();		Uses64BitFramePtr = STI.isTarget64BitLP64() || STI.isTargetNaCl64();
StackPtr = TRI->getStackRegister();		StackPtr = TRI->getStackRegister();
}		}

bool X86FrameLowering::hasReservedCallFrame(const MachineFunction &MF) const {		bool X86FrameLowering::hasReservedCallFrame(const MachineFunction &MF) const {
return !MF.getFrameInfo().hasVarSizedObjects() &&		return !MF.getFrameInfo().hasVarSizedObjects() &&
!MF.getInfo<X86MachineFunctionInfo>()->getHasPushSequences();		!MF.getInfo<X86MachineFunctionInfo>()->getHasPushSequences() &&
		!MF.getInfo<X86MachineFunctionInfo>()->hasPreallocatedCall();
}		}

/// canSimplifyCallFramePseudos - If there is a reserved call frame, the		/// canSimplifyCallFramePseudos - If there is a reserved call frame, the
/// call frame pseudos can be simplified. Having a FP, as in the default		/// call frame pseudos can be simplified. Having a FP, as in the default
/// implementation, is not sufficient here since we can't always use it.		/// implementation, is not sufficient here since we can't always use it.
/// Use a more nuanced condition.		/// Use a more nuanced condition.
bool		bool
X86FrameLowering::canSimplifyCallFramePseudos(const MachineFunction &MF) const {		X86FrameLowering::canSimplifyCallFramePseudos(const MachineFunction &MF) const {
return hasReservedCallFrame(MF) ||		return hasReservedCallFrame(MF) ||
		MF.getInfo<X86MachineFunctionInfo>()->hasPreallocatedCall() ||
(hasFP(MF) && !TRI->needsStackRealignment(MF)) ||		(hasFP(MF) && !TRI->needsStackRealignment(MF)) ||
TRI->hasBasePointer(MF);		TRI->hasBasePointer(MF);
}		}

// needsFrameIndexResolution - Do we need to perform FI resolution for		// needsFrameIndexResolution - Do we need to perform FI resolution for
// this function. Normally, this is required only when the function		// this function. Normally, this is required only when the function
// has any stack objects. However, FI resolution actually has another job,		// has any stack objects. However, FI resolution actually has another job,
// not apparent from the title - it resolves callframesetup/destroy		// not apparent from the title - it resolves callframesetup/destroy
// that were not simplified earlier.		// that were not simplified earlier.
// So, this is required for x86 functions that have push sequences even		// So, this is required for x86 functions that have push sequences even
// when there are no stack objects.		// when there are no stack objects.
bool		bool
X86FrameLowering::needsFrameIndexResolution(const MachineFunction &MF) const {		X86FrameLowering::needsFrameIndexResolution(const MachineFunction &MF) const {
return MF.getFrameInfo().hasStackObjects() ||		return MF.getFrameInfo().hasStackObjects() ||
MF.getInfo<X86MachineFunctionInfo>()->getHasPushSequences();		MF.getInfo<X86MachineFunctionInfo>()->getHasPushSequences();
}		}

/// hasFP - Return true if the specified function should have a dedicated frame		/// hasFP - Return true if the specified function should have a dedicated frame
/// pointer register. This is true if the function has variable sized allocas		/// pointer register. This is true if the function has variable sized allocas
/// or if frame pointer elimination is disabled.		/// or if frame pointer elimination is disabled.
bool X86FrameLowering::hasFP(const MachineFunction &MF) const {		bool X86FrameLowering::hasFP(const MachineFunction &MF) const {
const MachineFrameInfo &MFI = MF.getFrameInfo();		const MachineFrameInfo &MFI = MF.getFrameInfo();
return (MF.getTarget().Options.DisableFramePointerElim(MF) ||		return (MF.getTarget().Options.DisableFramePointerElim(MF) ||
TRI->needsStackRealignment(MF) ||		TRI->needsStackRealignment(MF) || MFI.hasVarSizedObjects() ||
MFI.hasVarSizedObjects() ||
MFI.isFrameAddressTaken() || MFI.hasOpaqueSPAdjustment() ||		MFI.isFrameAddressTaken() || MFI.hasOpaqueSPAdjustment() ||
MF.getInfo<X86MachineFunctionInfo>()->getForceFramePointer() ||		MF.getInfo<X86MachineFunctionInfo>()->getForceFramePointer() ||
		MF.getInfo<X86MachineFunctionInfo>()->hasPreallocatedCall() ||
MF.callsUnwindInit() || MF.hasEHFunclets() || MF.callsEHReturn() ||		MF.callsUnwindInit() || MF.hasEHFunclets() || MF.callsEHReturn() ||
MFI.hasStackMap() || MFI.hasPatchPoint() ||		MFI.hasStackMap() || MFI.hasPatchPoint() ||
MFI.hasCopyImplyingStackAdjustment());		MFI.hasCopyImplyingStackAdjustment());
}		}

static unsigned getSUBriOpcode(bool IsLP64, int64_t Imm) {		static unsigned getSUBriOpcode(bool IsLP64, int64_t Imm) {
if (IsLP64) {		if (IsLP64) {
if (isInt<8>(Imm))		if (isInt<8>(Imm))

llvm/lib/Target/X86/X86ISelDAGToDAG.cpp

Show First 20 Lines • Show All 5,619 Lines • ▼ Show 20 Lines	case X86ISD::MSCATTER: {
SDValue Ops[] = {Base, Scale, Index, Disp, Segment, Mask, Value, Chain};		SDValue Ops[] = {Base, Scale, Index, Disp, Segment, Mask, Value, Chain};

MachineSDNode *NewNode = CurDAG->getMachineNode(Opc, SDLoc(dl), VTs, Ops);		MachineSDNode *NewNode = CurDAG->getMachineNode(Opc, SDLoc(dl), VTs, Ops);
CurDAG->setNodeMemRefs(NewNode, {Sc->getMemOperand()});		CurDAG->setNodeMemRefs(NewNode, {Sc->getMemOperand()});
ReplaceUses(SDValue(Node, 0), SDValue(NewNode, 1));		ReplaceUses(SDValue(Node, 0), SDValue(NewNode, 1));
CurDAG->RemoveDeadNode(Node);		CurDAG->RemoveDeadNode(Node);
return;		return;
}		}
		case ISD::PREALLOCATED_SETUP: {
		auto MFI = CurDAG->getMachineFunction().getInfo<X86MachineFunctionInfo>();
		auto CallId = MFI->PreallocatedIdForCallSite(
		cast<SrcValueSDNode>(Node->getOperand(1))->getValue());
		SDValue Chain = Node->getOperand(0);
		SDValue CallIdValue = CurDAG->getTargetConstant(CallId, dl, MVT::i32);
		MachineSDNode *New = CurDAG->getMachineNode(
		TargetOpcode::PREALLOCATED_SETUP, dl, MVT::Other, CallIdValue, Chain);
		ReplaceUses(SDValue(Node, 0), SDValue(New, 0)); // Chain
		CurDAG->RemoveDeadNode(Node);
		return;
		}
		case ISD::PREALLOCATED_ARG: {
		auto MFI = CurDAG->getMachineFunction().getInfo<X86MachineFunctionInfo>();
		auto CallId = MFI->PreallocatedIdForCallSite(
		cast<SrcValueSDNode>(Node->getOperand(1))->getValue());
		SDValue Chain = Node->getOperand(0);
		SDValue CallIdValue = CurDAG->getTargetConstant(CallId, dl, MVT::i32);
		SDValue ArgIndex = Node->getOperand(2);
		SDValue Ops[3];
		Ops[0] = CallIdValue;
		Ops[1] = ArgIndex;
		Ops[2] = Chain;
		MachineSDNode *New = CurDAG->getMachineNode(
		TargetOpcode::PREALLOCATED_ARG, dl,
		CurDAG->getVTList(TLI->getPointerTy(CurDAG->getDataLayout()),
		MVT::Other),
		Ops);
		ReplaceUses(SDValue(Node, 0), SDValue(New, 0)); // Arg pointer
		ReplaceUses(SDValue(Node, 1), SDValue(New, 1)); // Chain
		CurDAG->RemoveDeadNode(Node);
		return;
		}
}		}

SelectCode(Node);		SelectCode(Node);
}		}

bool X86DAGToDAGISel::		bool X86DAGToDAGISel::
SelectInlineAsmMemoryOperand(const SDValue &Op, unsigned ConstraintID,		SelectInlineAsmMemoryOperand(const SDValue &Op, unsigned ConstraintID,
std::vector<SDValue> &OutOps) {		std::vector<SDValue> &OutOps) {

llvm/lib/Target/X86/X86ISelLowering.cpp


Show First 20 Lines • Show All 3,876 Lines • ▼ Show 20 Lines	X86TargetLowering::LowerCall(TargetLowering::CallLoweringInfo &CLI,
if (!Outs.empty() && Outs.back().Flags.isInAlloca()) {		if (!Outs.empty() && Outs.back().Flags.isInAlloca()) {
NumBytesToPush = 0;		NumBytesToPush = 0;
if (!ArgLocs.back().isMemLoc())		if (!ArgLocs.back().isMemLoc())
report_fatal_error("cannot use inalloca attribute on a register "		report_fatal_error("cannot use inalloca attribute on a register "
"parameter");		"parameter");
if (ArgLocs.back().getLocMemOffset() != 0)		if (ArgLocs.back().getLocMemOffset() != 0)
report_fatal_error("any parameter with the inalloca attribute must be "		report_fatal_error("any parameter with the inalloca attribute must be "
"the only memory argument");		"the only memory argument");
		} else if (CLI.IsPreallocated) {
		assert(ArgLocs.back().isMemLoc() &&
		"cannot use preallocated attribute on a register "
		"parameter");
		rnk (Done): This seems like it could be an assert, but I could go either way. If the condition occurs, it's an LLVM bug: it means the x86 calling conv code missed the case. We generally use report_fatal_error when user input could trigger the condition, and we don't want UB to occur.
		aeubanks (Author, Done): I copied this from inalloca right above, but yeah, assert seems fine.
		SmallVector<size_t, 4> PreallocatedOffsets;
		for (size_t i = 0; i < CLI.OutVals.size(); ++i) {
		if (CLI.CB->paramHasAttr(i, Attribute::Preallocated)) {
		PreallocatedOffsets.push_back(ArgLocs[i].getLocMemOffset());
		}
		}
		auto MFI = DAG.getMachineFunction().getInfo<X86MachineFunctionInfo>();
		size_t PreallocatedId = MFI->PreallocatedIdForCallSite(CLI.CB);
		MFI->SetPreallocatedStackSize(PreallocatedId, NumBytes);
		MFI->SetPreallocatedArgOffsets(PreallocatedId, PreallocatedOffsets);
		NumBytesToPush = 0;
}		}

if (!IsSibcall && !IsMustTail)		if (!IsSibcall && !IsMustTail)
Chain = DAG.getCALLSEQ_START(Chain, NumBytesToPush,		Chain = DAG.getCALLSEQ_START(Chain, NumBytesToPush,
NumBytes - NumBytesToPush, dl);		NumBytes - NumBytesToPush, dl);

SDValue RetAddrFrIdx;		SDValue RetAddrFrIdx;
// Load return address for tail calls.		// Load return address for tail calls.
Show All 11 Lines	assert(isSortedByValueNo(ArgLocs) &&
"Argument Location list must be sorted before lowering");		"Argument Location list must be sorted before lowering");

// Walk the register/memloc assignments, inserting copies/loads. In the case		// Walk the register/memloc assignments, inserting copies/loads. In the case
// of tail call optimization arguments are handle later.		// of tail call optimization arguments are handle later.
const X86RegisterInfo *RegInfo = Subtarget.getRegisterInfo();		const X86RegisterInfo *RegInfo = Subtarget.getRegisterInfo();
for (unsigned I = 0, OutIndex = 0, E = ArgLocs.size(); I != E;		for (unsigned I = 0, OutIndex = 0, E = ArgLocs.size(); I != E;
++I, ++OutIndex) {		++I, ++OutIndex) {
assert(OutIndex < Outs.size() && "Invalid Out index");		assert(OutIndex < Outs.size() && "Invalid Out index");
// Skip inalloca arguments, they have already been written.		// Skip inalloca/preallocated arguments, they have already been written.
ISD::ArgFlagsTy Flags = Outs[OutIndex].Flags;		ISD::ArgFlagsTy Flags = Outs[OutIndex].Flags;
if (Flags.isInAlloca())		if (Flags.isInAlloca() || Flags.isPreallocated())
continue;		continue;

CCValAssign &VA = ArgLocs[I];		CCValAssign &VA = ArgLocs[I];
EVT RegVT = VA.getLocVT();		EVT RegVT = VA.getLocVT();
SDValue Arg = OutVals[OutIndex];		SDValue Arg = OutVals[OutIndex];
bool isByVal = Flags.isByVal();		bool isByVal = Flags.isByVal();

// Promote the value if needed.		// Promote the value if needed.
▲ Show 20 Lines • Show All 171 Lines • ▼ Show 20 Lines	for (unsigned I = 0, OutsIndex = 0, E = ArgLocs.size(); I != E;
}		}

continue;		continue;
}		}

assert(VA.isMemLoc());		assert(VA.isMemLoc());
SDValue Arg = OutVals[OutsIndex];		SDValue Arg = OutVals[OutsIndex];
ISD::ArgFlagsTy Flags = Outs[OutsIndex].Flags;		ISD::ArgFlagsTy Flags = Outs[OutsIndex].Flags;
// Skip inalloca arguments. They don't require any work.		// Skip inalloca/preallocated arguments. They don't require any work.
if (Flags.isInAlloca())		if (Flags.isInAlloca() || Flags.isPreallocated())
continue;		continue;
// Create frame index.		// Create frame index.
int32_t Offset = VA.getLocMemOffset()+FPDiff;		int32_t Offset = VA.getLocMemOffset()+FPDiff;
uint32_t OpSize = (VA.getLocVT().getSizeInBits()+7)/8;		uint32_t OpSize = (VA.getLocVT().getSizeInBits()+7)/8;
FI = MF.getFrameInfo().CreateFixedObject(OpSize, Offset, true);		FI = MF.getFrameInfo().CreateFixedObject(OpSize, Offset, true);
FIN = DAG.getFrameIndex(FI, getPointerTy(DAG.getDataLayout()));		FIN = DAG.getFrameIndex(FI, getPointerTy(DAG.getDataLayout()));

if (Flags.isByVal()) {		if (Flags.isByVal()) {
▲ Show 20 Lines • Show All 28,944 Lines • ▼ Show 20 Lines	X86TargetLowering::EmitInstrWithCustomInserter(MachineInstr &MI,
case X86::LCMPXCHG8B_SAVE_EBX:		case X86::LCMPXCHG8B_SAVE_EBX:
case X86::LCMPXCHG16B_SAVE_RBX: {		case X86::LCMPXCHG16B_SAVE_RBX: {
unsigned BasePtr =		unsigned BasePtr =
MI.getOpcode() == X86::LCMPXCHG8B_SAVE_EBX ? X86::EBX : X86::RBX;		MI.getOpcode() == X86::LCMPXCHG8B_SAVE_EBX ? X86::EBX : X86::RBX;
if (!BB->isLiveIn(BasePtr))		if (!BB->isLiveIn(BasePtr))
BB->addLiveIn(BasePtr);		BB->addLiveIn(BasePtr);
return BB;		return BB;
}		}
		case TargetOpcode::PREALLOCATED_SETUP: {
		assert(Subtarget.is32Bit() && "preallocated only used in 32-bit");
		auto MFI = MF->getInfo<X86MachineFunctionInfo>();
		MFI->setHasPreallocatedCall(true);
		int64_t PreallocatedId = MI.getOperand(0).getImm();
		size_t StackAdjustment = MFI->GetPreallocatedStackSize(PreallocatedId);
		assert(StackAdjustment != 0 && "0 stack adjustment");
		LLVM_DEBUG(dbgs() << "PREALLOCATED_SETUP stack adjustment "
		<< StackAdjustment << "\n");
		BuildMI(*BB, MI, DL, TII->get(X86::SUB32ri), X86::ESP)
		.addReg(X86::ESP)
		.addImm(StackAdjustment);
		MI.eraseFromParent();
		return BB;
		}
		case TargetOpcode::PREALLOCATED_ARG: {
		assert(Subtarget.is32Bit() && "preallocated calls only used in 32-bit");
		int64_t PreallocatedId = MI.getOperand(1).getImm();
		int64_t ArgIdx = MI.getOperand(2).getImm();
		auto MFI = MF->getInfo<X86MachineFunctionInfo>();
		size_t ArgOffset = MFI->GetPreallocatedArgOffsets(PreallocatedId)[ArgIdx];
		LLVM_DEBUG(dbgs() << "PREALLOCATED_ARG arg index " << ArgIdx
		<< ", arg offset " << ArgOffset << "\n");
		// stack pointer + offset
		addRegOffset(
		BuildMI(*BB, MI, DL, TII->get(X86::LEA32r), MI.getOperand(0).getReg()),
		X86::ESP, false, ArgOffset);
		MI.eraseFromParent();
		return BB;
		}
}		}
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// X86 Optimization Hooks		// X86 Optimization Hooks
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

bool		bool
▲ Show 20 Lines • Show All 15,852 Lines • Show Last 20 Lines
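
The two custom-inserter cases above reduce to simple stack-pointer arithmetic: PREALLOCATED_SETUP subtracts the recorded stack size from ESP, and PREALLOCATED_ARG produces ESP plus the recorded argument offset. The following is only a toy model of that arithmetic, not LLVM code; `ToyStack`, `preallocatedSetup`, and `preallocatedArg` are hypothetical names introduced for illustration, and the real lowering emits `SUB32ri` and `LEA32r` machine instructions rather than computing values directly.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for the 32-bit stack pointer register.
struct ToyStack {
  std::uint32_t Esp;
};

// Models PREALLOCATED_SETUP: reserve the whole argument area up front
// (the diff emits SUB32ri ESP, StackAdjustment).
std::uint32_t preallocatedSetup(ToyStack &S, std::uint32_t StackSize) {
  assert(StackSize != 0 && "0 stack adjustment");
  S.Esp -= StackSize;
  return S.Esp;
}

// Models PREALLOCATED_ARG: address of one argument inside the reserved area
// (the diff emits LEA32r dst, [ESP + ArgOffset]).
std::uint32_t preallocatedArg(const ToyStack &S,
                              const std::vector<std::uint32_t> &Offsets,
                              std::size_t ArgIdx) {
  assert(ArgIdx < Offsets.size() && "arg index out of range");
  return S.Esp + Offsets[ArgIdx];
}
```

In this model the argument addresses are only meaningful while ESP stays at the value produced by the setup step, which is one way to read why the patch forces a frame pointer and disables the reserved call frame for functions with preallocated calls.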

llvm/lib/Target/X86/X86MachineFunctionInfo.h

//===-- X86MachineFunctionInfo.h - X86 machine function info ----- C++ --===//		//===-- X86MachineFunctionInfo.h - X86 machine function info ----- C++ --===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
// This file declares X86-specific per-machine-function information.		// This file declares X86-specific per-machine-function information.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef LLVM_LIB_TARGET_X86_X86MACHINEFUNCTIONINFO_H		#ifndef LLVM_LIB_TARGET_X86_X86MACHINEFUNCTIONINFO_H
#define LLVM_LIB_TARGET_X86_X86MACHINEFUNCTIONINFO_H		#define LLVM_LIB_TARGET_X86_X86MACHINEFUNCTIONINFO_H

		#include "llvm/ADT/SmallVector.h"
#include "llvm/CodeGen/CallingConvLower.h"		#include "llvm/CodeGen/CallingConvLower.h"
#include "llvm/CodeGen/MachineFunction.h"		#include "llvm/CodeGen/MachineFunction.h"

namespace llvm {		namespace llvm {

/// X86MachineFunctionInfo - This class is derived from MachineFunction and		/// X86MachineFunctionInfo - This class is derived from MachineFunction and
/// contains private X86 target-specific information for each MachineFunction.		/// contains private X86 target-specific information for each MachineFunction.
class X86MachineFunctionInfo : public MachineFunctionInfo {		class X86MachineFunctionInfo : public MachineFunctionInfo {
▲ Show 20 Lines • Show All 74 Lines • ▼ Show 20 Lines	class X86MachineFunctionInfo : public MachineFunctionInfo {
bool IsSplitCSR = false;		bool IsSplitCSR = false;

/// True if this function uses the red zone.		/// True if this function uses the red zone.
bool UsesRedZone = false;		bool UsesRedZone = false;

/// True if this function has WIN_ALLOCA instructions.		/// True if this function has WIN_ALLOCA instructions.
bool HasWinAlloca = false;		bool HasWinAlloca = false;

		/// True if this function has any preallocated calls.
		bool HasPreallocatedCall = false;

		ValueMap<const Value *, size_t> PreallocatedIds;
		rnk (Done): This could be tracked as `PreallocatedStackSizes.size()`, perhaps.
		SmallVector<size_t, 0> PreallocatedStackSizes;
		SmallVector<SmallVector<size_t, 4>, 0> PreallocatedArgOffsets;
		rnk (Done): I tend to micro-optimize data structures, but the keys to this are densely packed integers, so this could be a vector instead of a hash map.

		rnk (Done): DenseMap is open-addressed, so this takes up more memory than you might imagine. If you change it to SmallVector, the common case is that there are no call sites using preallocated, so I'd use 0 builtin elements.
private:		private:
/// ForwardedMustTailRegParms - A list of virtual and physical registers		/// ForwardedMustTailRegParms - A list of virtual and physical registers
/// that must be forwarded to every musttail call.		/// that must be forwarded to every musttail call.
SmallVector<ForwardedRegister, 1> ForwardedMustTailRegParms;		SmallVector<ForwardedRegister, 1> ForwardedMustTailRegParms;

public:		public:
X86MachineFunctionInfo() = default;		X86MachineFunctionInfo() = default;

▲ Show 20 Lines • Show All 65 Lines • ▼ Show 20 Lines	public:
bool isSplitCSR() const { return IsSplitCSR; }		bool isSplitCSR() const { return IsSplitCSR; }
void setIsSplitCSR(bool s) { IsSplitCSR = s; }		void setIsSplitCSR(bool s) { IsSplitCSR = s; }

bool getUsesRedZone() const { return UsesRedZone; }		bool getUsesRedZone() const { return UsesRedZone; }
void setUsesRedZone(bool V) { UsesRedZone = V; }		void setUsesRedZone(bool V) { UsesRedZone = V; }

bool hasWinAlloca() const { return HasWinAlloca; }		bool hasWinAlloca() const { return HasWinAlloca; }
void setHasWinAlloca(bool v) { HasWinAlloca = v; }		void setHasWinAlloca(bool v) { HasWinAlloca = v; }

		bool hasPreallocatedCall() const { return HasPreallocatedCall; }
		void setHasPreallocatedCall(bool v) { HasPreallocatedCall = v; }

		size_t PreallocatedIdForCallSite(const Value *CS) {
		rnk (Done): LLVM is inconsistent about the casing of names, but this class seems to consistently use leadingLowerCamelCase, so let's do the same here and below, unless you see a reason not to.
		auto Insert = PreallocatedIds.insert({CS, PreallocatedIds.size()});
		if (Insert.second) {
		PreallocatedStackSizes.push_back(0);
		PreallocatedArgOffsets.emplace_back();
		rnk (Done): Can this function use the get-or-insert pattern:
		    auto Insert = PreallocatedIds.insert({CS, PreallocatedNextId});
		    if (Insert.second)
		      ++PreallocatedNextId;
		    return Insert.first->second;
		}
		return Insert.first->second;
		}

		void SetPreallocatedStackSize(size_t Id, size_t StackSize) {
		PreallocatedStackSizes[Id] = StackSize;
		}

		size_t GetPreallocatedStackSize(const size_t Id) {
		assert(PreallocatedStackSizes[Id] != 0 && "stack size not set");
		return PreallocatedStackSizes[Id];
		}
		rnk (Done): You can shorten this with `.count(Id)`

		void SetPreallocatedArgOffsets(size_t Id, SmallVector<size_t, 4> AO) {
		rnk (Done): Generally, we try not to pass `SmallVector` by value. It is only small in the sense that we believe it contains few elements. Embedding storage in the vector makes it large and expensive to copy. To pass in a list of things, we tend to prefer `ArrayRef<size_t>`, so the caller can be flexible about the collection they are using, as long as it's a flat array.
		PreallocatedArgOffsets[Id] = AO;
		}

		const SmallVector<size_t, 4> &GetPreallocatedArgOffsets(const size_t Id) {
		rnk (Done): Similarly, to give the caller a readonly view of the offsets, I would recommend ArrayRef.
		assert(!PreallocatedArgOffsets[Id].empty() && "arg offsets not set");
		return PreallocatedArgOffsets[Id];
		}
};		};
		rnk (Done): You can shorten this with `.count(Id)`

} // End llvm namespace		} // End llvm namespace

#endif		#endif
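
The bookkeeping added to X86MachineFunctionInfo above amounts to a get-or-insert map from call site to a dense integer ID, plus side tables indexed by that ID. A minimal standalone sketch of the same pattern follows, assuming standard-library containers in place of `ValueMap` and `SmallVector`; the class name `PreallocatedRegistry` and its lowerCamelCase method names are hypothetical and are not part of the patch.

```cpp
#include <cassert>
#include <cstddef>
#include <unordered_map>
#include <vector>

// Hypothetical stand-in for the preallocated fields of X86MachineFunctionInfo.
// Call sites are opaque pointers; IDs are dense and start at 0.
class PreallocatedRegistry {
  std::unordered_map<const void *, std::size_t> Ids;
  std::vector<std::size_t> StackSizes;              // indexed by ID
  std::vector<std::vector<std::size_t>> ArgOffsets; // indexed by ID

public:
  // Get-or-insert: a new call site gets the next dense ID and an empty slot
  // in each side table; a known call site returns its existing ID.
  std::size_t idForCallSite(const void *CallSite) {
    auto Insert = Ids.insert({CallSite, Ids.size()});
    if (Insert.second) {
      StackSizes.push_back(0);
      ArgOffsets.emplace_back();
    }
    return Insert.first->second;
  }

  void setStackSize(std::size_t Id, std::size_t Size) { StackSizes[Id] = Size; }

  std::size_t stackSize(std::size_t Id) const {
    assert(StackSizes[Id] != 0 && "stack size not set");
    return StackSizes[Id];
  }

  // Taken by const reference rather than by value, in the spirit of the
  // ArrayRef suggestion in the review comments.
  void setArgOffsets(std::size_t Id, const std::vector<std::size_t> &Offsets) {
    ArgOffsets[Id] = Offsets;
  }

  const std::vector<std::size_t> &argOffsets(std::size_t Id) const {
    assert(!ArgOffsets[Id].empty() && "arg offsets not set");
    return ArgOffsets[Id];
  }
};
```

Because IDs are handed out densely, the side tables can be plain vectors, and a function with no preallocated calls allocates nothing beyond the empty containers.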

llvm/lib/Target/X86/X86RegisterInfo.cpp

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// Stack Frame Processing methods			// Stack Frame Processing methods
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	static bool CantUseSP(const MachineFrameInfo &MFI) {			static bool CantUseSP(const MachineFrameInfo &MFI) {
	return MFI.hasVarSizedObjects() || MFI.hasOpaqueSPAdjustment();			return MFI.hasVarSizedObjects() || MFI.hasOpaqueSPAdjustment();
	}			}

	bool X86RegisterInfo::hasBasePointer(const MachineFunction &MF) const {			bool X86RegisterInfo::hasBasePointer(const MachineFunction &MF) const {
				const X86MachineFunctionInfo *X86FI = MF.getInfo<X86MachineFunctionInfo>();
				rnk (Not Done): Ooh, base pointers are expensive. Is this necessary?
				aeubanks (Author, Done): Yup, I tried removing it and it makes my test program crash. I spent quite a while trying to figure out how to force a frame pointer with preallocated calls (is the distinction between frame/base pointer relevant here?).
				if (X86FI->hasPreallocatedCall())
				return true;

	const MachineFrameInfo &MFI = MF.getFrameInfo();			const MachineFrameInfo &MFI = MF.getFrameInfo();

	if (!EnableBasePointer)			if (!EnableBasePointer)
	return false;			return false;

	// When we need stack realignment, we can't address the stack from the frame			// When we need stack realignment, we can't address the stack from the frame
	// pointer. When we have dynamic allocas or stack-adjusting inline asm, we			// pointer. When we have dynamic allocas or stack-adjusting inline asm, we
	// can't address variables from the stack pointer. MS inline asm can			// can't address variables from the stack pointer. MS inline asm can
	// reference locals while also adjusting the stack pointer. When we can't			// reference locals while also adjusting the stack pointer. When we can't
	// use both the SP and the FP, we need a separate base pointer register.			// use both the SP and the FP, we need a separate base pointer register.
	bool CantUseFP = needsStackRealignment(MF);			bool CantUseFP = needsStackRealignment(MF);
	return CantUseFP && CantUseSP(MFI);			return CantUseFP && CantUseSP(MFI);
	}			}

	bool X86RegisterInfo::canRealignStack(const MachineFunction &MF) const {			bool X86RegisterInfo::canRealignStack(const MachineFunction &MF) const {
	if (!TargetRegisterInfo::canRealignStack(MF))			if (!TargetRegisterInfo::canRealignStack(MF))
	return false;			return false;

	const MachineFrameInfo &MFI = MF.getFrameInfo();			const MachineFrameInfo &MFI = MF.getFrameInfo();
	const MachineRegisterInfo *MRI = &MF.getRegInfo();			const MachineRegisterInfo *MRI = &MF.getRegInfo();

llvm/lib/Transforms/Coroutines/CoroSplit.cpp

Show First 20 Lines • Show All 1,008 Lines • ▼ Show 20 Lines	if (!CalleeParmTy->isPointerTy() ||
(CalleeParmTy->getPointerAddressSpace() != 0))		(CalleeParmTy->getPointerAddressSpace() != 0))
return false;		return false;

if (CI.getCallingConv() != F.getCallingConv())		if (CI.getCallingConv() != F.getCallingConv())
return false;		return false;

// CI should not has any ABI-impacting function attributes.		// CI should not has any ABI-impacting function attributes.
static const Attribute::AttrKind ABIAttrs[] = {		static const Attribute::AttrKind ABIAttrs[] = {
Attribute::StructRet, Attribute::ByVal, Attribute::InAlloca,		Attribute::StructRet, Attribute::ByVal, Attribute::InAlloca,
Attribute::InReg, Attribute::Returned, Attribute::SwiftSelf,		Attribute::Preallocated, Attribute::InReg, Attribute::Returned,
Attribute::SwiftError};		Attribute::SwiftSelf, Attribute::SwiftError};
AttributeList Attrs = CI.getAttributes();		AttributeList Attrs = CI.getAttributes();
for (auto AK : ABIAttrs)		for (auto AK : ABIAttrs)
if (Attrs.hasParamAttribute(0, AK))		if (Attrs.hasParamAttribute(0, AK))
return false;		return false;

return true;		return true;
}		}


llvm/lib/Transforms/IPO/Attributor.cpp

Show First 20 Lines • Show All 1,366 Lines • ▼ Show 20 Lines	if (Fn->isVarArg()) {
LLVM_DEBUG(dbgs() << "[Attributor] Cannot rewrite var-args functions\n");		LLVM_DEBUG(dbgs() << "[Attributor] Cannot rewrite var-args functions\n");
return false;		return false;
}		}

// Avoid functions with complicated argument passing semantics.		// Avoid functions with complicated argument passing semantics.
AttributeList FnAttributeList = Fn->getAttributes();		AttributeList FnAttributeList = Fn->getAttributes();
if (FnAttributeList.hasAttrSomewhere(Attribute::Nest) ||		if (FnAttributeList.hasAttrSomewhere(Attribute::Nest) ||
FnAttributeList.hasAttrSomewhere(Attribute::StructRet) ||		FnAttributeList.hasAttrSomewhere(Attribute::StructRet) ||
FnAttributeList.hasAttrSomewhere(Attribute::InAlloca)) {		FnAttributeList.hasAttrSomewhere(Attribute::InAlloca) ||
		FnAttributeList.hasAttrSomewhere(Attribute::Preallocated)) {
LLVM_DEBUG(		LLVM_DEBUG(
dbgs() << "[Attributor] Cannot rewrite due to complex attribute\n");		dbgs() << "[Attributor] Cannot rewrite due to complex attribute\n");
return false;		return false;
}		}

// Avoid callbacks for now.		// Avoid callbacks for now.
bool AllCallSitesKnown;		bool AllCallSitesKnown;
if (!checkForAllCallSites(CallSiteCanBeChanged, *Fn, true, nullptr,		if (!checkForAllCallSites(CallSiteCanBeChanged, *Fn, true, nullptr,

llvm/lib/Transforms/IPO/AttributorAttributes.cpp


struct AAValueSimplifyArgument final : AAValueSimplifyImpl {		struct AAValueSimplifyArgument final : AAValueSimplifyImpl {
AAValueSimplifyArgument(const IRPosition &IRP, Attributor &A)		AAValueSimplifyArgument(const IRPosition &IRP, Attributor &A)
: AAValueSimplifyImpl(IRP, A) {}		: AAValueSimplifyImpl(IRP, A) {}

void initialize(Attributor &A) override {		void initialize(Attributor &A) override {
AAValueSimplifyImpl::initialize(A);		AAValueSimplifyImpl::initialize(A);
if (!getAnchorScope() || getAnchorScope()->isDeclaration())		if (!getAnchorScope() || getAnchorScope()->isDeclaration())
indicatePessimisticFixpoint();		indicatePessimisticFixpoint();
if (hasAttr({Attribute::InAlloca, Attribute::StructRet, Attribute::Nest},		if (hasAttr({Attribute::InAlloca, Attribute::Preallocated,
		Attribute::StructRet, Attribute::Nest},
/* IgnoreSubsumingPositions */ true))		/* IgnoreSubsumingPositions */ true))
indicatePessimisticFixpoint();		indicatePessimisticFixpoint();

// FIXME: This is a hack to prevent us from propagating function poiner in		// FIXME: This is a hack to prevent us from propagating function poiner in
// the new pass manager CGSCC pass as it creates call edges the		// the new pass manager CGSCC pass as it creates call edges the
// CallGraphUpdater cannot handle yet.		// CallGraphUpdater cannot handle yet.
Value &V = getAssociatedValue();		Value &V = getAssociatedValue();
if (V.getType()->isPointerTy() &&		if (V.getType()->isPointerTy() &&
▲ Show 20 Lines • Show All 1,210 Lines • ▼ Show 20 Lines	struct AAMemoryBehaviorArgument : AAMemoryBehaviorFloating {

ChangeStatus manifest(Attributor &A) override {		ChangeStatus manifest(Attributor &A) override {
// TODO: Pointer arguments are not supported on vectors of pointers yet.		// TODO: Pointer arguments are not supported on vectors of pointers yet.
if (!getAssociatedValue().getType()->isPointerTy())		if (!getAssociatedValue().getType()->isPointerTy())
return ChangeStatus::UNCHANGED;		return ChangeStatus::UNCHANGED;

// TODO: From readattrs.ll: "inalloca parameters are always		// TODO: From readattrs.ll: "inalloca parameters are always
// considered written"		// considered written"
if (hasAttr({Attribute::InAlloca})) {		if (hasAttr({Attribute::InAlloca, Attribute::Preallocated})) {
removeKnownBits(NO_WRITES);		removeKnownBits(NO_WRITES);
removeAssumedBits(NO_WRITES);		removeAssumedBits(NO_WRITES);
}		}
return AAMemoryBehaviorFloating::manifest(A);		return AAMemoryBehaviorFloating::manifest(A);
}		}

/// See AbstractAttribute::trackStatistics()		/// See AbstractAttribute::trackStatistics()
void trackStatistics() const override {		void trackStatistics() const override {

llvm/lib/Transforms/IPO/DeadArgumentElimination.cpp

	// SurveyFunction - This performs the initial survey of the specified function,			// SurveyFunction - This performs the initial survey of the specified function,
	// checking out whether or not it uses any of its incoming arguments or whether			// checking out whether or not it uses any of its incoming arguments or whether
	// any callers use the return value. This fills in the LiveValues set and Uses			// any callers use the return value. This fills in the LiveValues set and Uses
	// map.			// map.
	//			//
	// We consider arguments of non-internal functions to be intrinsically alive as			// We consider arguments of non-internal functions to be intrinsically alive as
	// well as arguments to functions which have their "address taken".			// well as arguments to functions which have their "address taken".
	void DeadArgumentEliminationPass::SurveyFunction(const Function &F) {			void DeadArgumentEliminationPass::SurveyFunction(const Function &F) {
	// Functions with inalloca parameters are expecting args in a particular			// Functions with inalloca/preallocated parameters are expecting args in a
	// register and memory layout.			// particular register and memory layout.
	if (F.getAttributes().hasAttrSomewhere(Attribute::InAlloca)) {			if (F.getAttributes().hasAttrSomewhere(Attribute::InAlloca) ||
				F.getAttributes().hasAttrSomewhere(Attribute::Preallocated)) {
	MarkLive(F);			MarkLive(F);
	return;			return;
	}			}

	// Don't touch naked functions. The assembly might be using an argument, or			// Don't touch naked functions. The assembly might be using an argument, or
	// otherwise rely on the frame layout in a way that this analysis will not			// otherwise rely on the frame layout in a way that this analysis will not
	// see.			// see.
	if (F.hasFnAttribute(Attribute::Naked)) {			if (F.hasFnAttribute(Attribute::Naked)) {
	[...]

llvm/lib/Transforms/IPO/FunctionAttrs.cpp

	[...]
	/// Returns Attribute::None, Attribute::ReadOnly or Attribute::ReadNone.			/// Returns Attribute::None, Attribute::ReadOnly or Attribute::ReadNone.
	static Attribute::AttrKind			static Attribute::AttrKind
	determinePointerReadAttrs(Argument *A,			determinePointerReadAttrs(Argument *A,
	const SmallPtrSet<Argument *, 8> &SCCNodes) {			const SmallPtrSet<Argument *, 8> &SCCNodes) {
	SmallVector<Use *, 32> Worklist;			SmallVector<Use *, 32> Worklist;
	SmallPtrSet<Use *, 32> Visited;			SmallPtrSet<Use *, 32> Visited;

	// inalloca arguments are always clobbered by the call.			// inalloca arguments are always clobbered by the call.
	if (A->hasInAllocaAttr())			if (A->hasInAllocaAttr() || A->hasPreallocatedAttr())
	return Attribute::None;			return Attribute::None;

	bool IsRead = false;			bool IsRead = false;
	// We don't need to track IsWritten. If A is written to, return immediately.			// We don't need to track IsWritten. If A is written to, return immediately.

	for (Use &U : A->uses()) {			for (Use &U : A->uses()) {
	Visited.insert(&U);			Visited.insert(&U);
	Worklist.push_back(&U);			Worklist.push_back(&U);
	[...]

llvm/lib/Transforms/IPO/GlobalOpt.cpp

[...]
for (Module::iterator FI = M.begin(), E = M.end(); FI != E; ) {
if (!F->hasLocalLinkage())		if (!F->hasLocalLinkage())
continue;		continue;

// If we have an inalloca parameter that we can safely remove the		// If we have an inalloca parameter that we can safely remove the
// inalloca attribute from, do so. This unlocks optimizations that		// inalloca attribute from, do so. This unlocks optimizations that
// wouldn't be safe in the presence of inalloca.		// wouldn't be safe in the presence of inalloca.
// FIXME: We should also hoist alloca affected by this to the entry		// FIXME: We should also hoist alloca affected by this to the entry
// block if possible.		// block if possible.
		// FIXME: handle preallocated
if (F->getAttributes().hasAttrSomewhere(Attribute::InAlloca) &&		if (F->getAttributes().hasAttrSomewhere(Attribute::InAlloca) &&
!F->hasAddressTaken()) {		!F->hasAddressTaken()) {
RemoveAttribute(F, Attribute::InAlloca);		RemoveAttribute(F, Attribute::InAlloca);
Changed = true;		Changed = true;
}		}

if (hasChangeableCC(F) && !F->isVarArg() && !F->hasAddressTaken()) {		if (hasChangeableCC(F) && !F->isVarArg() && !F->hasAddressTaken()) {
NumInternalFunc++;		NumInternalFunc++;
[...]

llvm/lib/Transforms/InstCombine/InstCombineCalls.cpp

[...]
bool InstCombiner::transformConstExprCastCall(CallBase &Call) {
// declare void @takes_i32_inalloca(i32* inalloca)		// declare void @takes_i32_inalloca(i32* inalloca)
// call void bitcast (void (i32*)* @takes_i32_inalloca to void (i32)*)(i32 0)		// call void bitcast (void (i32*)* @takes_i32_inalloca to void (i32)*)(i32 0)
//		//
// into:		// into:
// call void @takes_i32_inalloca(i32* null)		// call void @takes_i32_inalloca(i32* null)
//		//
// Similarly, avoid folding away bitcasts of byval calls.		// Similarly, avoid folding away bitcasts of byval calls.
if (Callee->getAttributes().hasAttrSomewhere(Attribute::InAlloca) ||		if (Callee->getAttributes().hasAttrSomewhere(Attribute::InAlloca) ||
		Callee->getAttributes().hasAttrSomewhere(Attribute::Preallocated) ||
Callee->getAttributes().hasAttrSomewhere(Attribute::ByVal))		Callee->getAttributes().hasAttrSomewhere(Attribute::ByVal))
return false;		return false;

auto AI = Call.arg_begin();		auto AI = Call.arg_begin();
for (unsigned i = 0, e = NumCommonArgs; i != e; ++i, ++AI) {		for (unsigned i = 0, e = NumCommonArgs; i != e; ++i, ++AI) {
Type *ParamTy = FT->getParamType(i);		Type *ParamTy = FT->getParamType(i);
Type *ActTy = (*AI)->getType();		Type *ActTy = (*AI)->getType();

[...]

llvm/test/CodeGen/X86/arg-copy-elide.ll

	[...]
	}			}

	; CHECK-LABEL: _avoid_inalloca:			; CHECK-LABEL: _avoid_inalloca:
	; CHECK: leal {{[0-9]+}}(%esp), %[[reg:[^ ]*]]			; CHECK: leal {{[0-9]+}}(%esp), %[[reg:[^ ]*]]
	; CHECK: pushl %[[reg]]			; CHECK: pushl %[[reg]]
	; CHECK: calll _addrof_i32			; CHECK: calll _addrof_i32
	; CHECK: retl			; CHECK: retl

				define void @avoid_preallocated(i32* preallocated(i32) %x) {
				entry:
				%x.p.p = alloca i32*
				store i32* %x, i32** %x.p.p
				call void @addrof_i32(i32* %x)
				ret void
				}

				; CHECK-LABEL: _avoid_preallocated:
				; CHECK: leal {{[0-9]+}}(%esp), %[[reg:[^ ]*]]
				; CHECK: pushl %[[reg]]
				; CHECK: calll _addrof_i32
				; CHECK: retl

	; Don't elide the copy when the alloca is escaped with a store.			; Don't elide the copy when the alloca is escaped with a store.
	define void @escape_with_store(i32 %x) {			define void @escape_with_store(i32 %x) {
	%x1 = alloca i32			%x1 = alloca i32
	%x2 = alloca i32*			%x2 = alloca i32*
	store i32* %x1, i32** %x2			store i32* %x1, i32** %x2
	%x3 = load i32*, i32** %x2			%x3 = load i32*, i32** %x2
	store i32 0, i32* %x3			store i32 0, i32* %x3
	store i32 %x, i32* %x1			store i32 %x, i32* %x1
	[...]

llvm/test/CodeGen/X86/musttail-indirect.ll

[...]
; int (B::*mp_g)(A, int, A) = &B::g;		; int (B::*mp_g)(A, int, A) = &B::g;
; void (B::*mp_h)(A, int, A) = &B::h;		; void (B::*mp_h)(A, int, A) = &B::h;
; A (B::*mp_i)(A, int, A) = &B::i;		; A (B::*mp_i)(A, int, A) = &B::i;
; A (B::*mp_j)(int) = &B::j;		; A (B::*mp_j)(int) = &B::j;

; Each member pointer creates a thunk. The ones with inalloca are required to		; Each member pointer creates a thunk. The ones with inalloca are required to
; tail calls by the ABI, even at O0.		; tail calls by the ABI, even at O0.

		declare token @llvm.call.preallocated.setup(i32)
		declare i8* @llvm.call.preallocated.arg(token, i32)

%struct.B = type { i32 (...)** }		%struct.B = type { i32 (...)** }
%struct.A = type { i32 }		%struct.A = type { i32 }

; CHECK-LABEL: f_thunk:		; CHECK-LABEL: f_thunk:
; CHECK: jmpl		; CHECK: jmpl
; CHECK-NOT: ret		; CHECK-NOT: ret
define x86_thiscallcc i32 @f_thunk(%struct.B* %this, i32) {		define x86_thiscallcc i32 @f_thunk(%struct.B* %this, i32) {
entry:		entry:
[...]
entry:
%1 = bitcast %struct.B* %this to i32 (%struct.B*, <{ %struct.A, i32, %struct.A }>*)***		%1 = bitcast %struct.B* %this to i32 (%struct.B*, <{ %struct.A, i32, %struct.A }>*)***
%vtable = load i32 (%struct.B*, <{ %struct.A, i32, %struct.A }>*)**, i32 (%struct.B*, <{ %struct.A, i32, %struct.A }>*)*** %1		%vtable = load i32 (%struct.B*, <{ %struct.A, i32, %struct.A }>*)**, i32 (%struct.B*, <{ %struct.A, i32, %struct.A }>*)*** %1
%vfn = getelementptr inbounds i32 (%struct.B*, <{ %struct.A, i32, %struct.A }>*)*, i32 (%struct.B*, <{ %struct.A, i32, %struct.A }>*)** %vtable, i32 1		%vfn = getelementptr inbounds i32 (%struct.B*, <{ %struct.A, i32, %struct.A }>*)*, i32 (%struct.B*, <{ %struct.A, i32, %struct.A }>*)** %vtable, i32 1
%2 = load i32 (%struct.B*, <{ %struct.A, i32, %struct.A }>*)*, i32 (%struct.B*, <{ %struct.A, i32, %struct.A }>*)** %vfn		%2 = load i32 (%struct.B*, <{ %struct.A, i32, %struct.A }>*)*, i32 (%struct.B*, <{ %struct.A, i32, %struct.A }>*)** %vfn
%3 = musttail call x86_thiscallcc i32 %2(%struct.B* %this, <{ %struct.A, i32, %struct.A }>* inalloca %0)		%3 = musttail call x86_thiscallcc i32 %2(%struct.B* %this, <{ %struct.A, i32, %struct.A }>* inalloca %0)
ret i32 %3		ret i32 %3
}		}

		; FIXME: This generates a lot of code even at -O2, any better way to do this? Same with all the preallocated versions of functions below.
		rnk: This is interesting, we forgot about this when writing the LangRef. If you look at the previous inalloca test, it forwards its inalloca parameter directly to the musttail callee. It doesn't allocate new memory. That's the real use case for this `musttail` thing, and we probably need to bend the verifier rules to allow direct forwarding of preallocated arguments to a musttail call site. I think for now, committing this as is with a FIXME and coming back to it later would be fine. I assume that the verifier rejects it if you remove the setup and arg calls.
		efriedma: I don't think it makes sense to commit this test as-is. The verifier should reject the combination of musttail and a preallocated bundle. The llvm.call.preallocated.setup would have to allocate memory on top of the argument, and there's no reasonable way to prove that's safe. And yes, we probably need to bend the rules for the "preallocated" attribute to allow forwarding arguments in musttail calls.
		aeubanks (author): Added verifier check in https://reviews.llvm.org/D80132. Is it ok to proceed with this and do the LangRef/codegen changes to support musttail and preallocated together in a later change?
		; CHECK-LABEL: g_thunk_2:
		; CHECK: jmpl
		; CHECK-NOT: ret
		define x86_thiscallcc i32 @g_thunk_2(%struct.B* %this, <{ %struct.A, i32, %struct.A }>* preallocated(<{ %struct.A, i32, %struct.A }>) %0) {
		entry:
		%1 = bitcast %struct.B* %this to i32 (%struct.B*, <{ %struct.A, i32, %struct.A }>*)***
		%vtable = load i32 (%struct.B*, <{ %struct.A, i32, %struct.A }>*)**, i32 (%struct.B*, <{ %struct.A, i32, %struct.A }>*)*** %1
		%vfn = getelementptr inbounds i32 (%struct.B*, <{ %struct.A, i32, %struct.A }>*)*, i32 (%struct.B*, <{ %struct.A, i32, %struct.A }>*)** %vtable, i32 1
		%2 = load i32 (%struct.B*, <{ %struct.A, i32, %struct.A }>*)*, i32 (%struct.B*, <{ %struct.A, i32, %struct.A }>*)** %vfn
		%tmp = load <{ %struct.A, i32, %struct.A }>, <{ %struct.A, i32, %struct.A }>* %0
		%c = call token @llvm.call.preallocated.setup(i32 1)
		%A = call i8* @llvm.call.preallocated.arg(token %c, i32 0) preallocated(<{ %struct.A, i32, %struct.A }>)
		%a = bitcast i8* %A to <{ %struct.A, i32, %struct.A }>*
		store <{ %struct.A, i32, %struct.A }> %tmp, <{ %struct.A, i32, %struct.A }>* %a
		%3 = musttail call x86_thiscallcc i32 %2(%struct.B* %this, <{ %struct.A, i32, %struct.A }>* preallocated(<{ %struct.A, i32, %struct.A }>) %a) ["preallocated"(token %c)]
		ret i32 %3
		}

; CHECK-LABEL: h_thunk:		; CHECK-LABEL: h_thunk:
; CHECK: jmpl		; CHECK: jmpl
; CHECK-NOT: mov %{{.*}}, {{.*(.*esp.*)}}		; CHECK-NOT: mov %{{.*}}, {{.*(.*esp.*)}}
; CHECK-NOT: ret		; CHECK-NOT: ret
define x86_thiscallcc void @h_thunk(%struct.B* %this, <{ %struct.A, i32, %struct.A }>* inalloca) {		define x86_thiscallcc void @h_thunk(%struct.B* %this, <{ %struct.A, i32, %struct.A }>* inalloca) {
entry:		entry:
%1 = bitcast %struct.B* %this to void (%struct.B*, <{ %struct.A, i32, %struct.A }>*)***		%1 = bitcast %struct.B* %this to void (%struct.B*, <{ %struct.A, i32, %struct.A }>*)***
%vtable = load void (%struct.B*, <{ %struct.A, i32, %struct.A }>*)**, void (%struct.B*, <{ %struct.A, i32, %struct.A }>*)*** %1		%vtable = load void (%struct.B*, <{ %struct.A, i32, %struct.A }>*)**, void (%struct.B*, <{ %struct.A, i32, %struct.A }>*)*** %1
%vfn = getelementptr inbounds void (%struct.B*, <{ %struct.A, i32, %struct.A }>*)*, void (%struct.B*, <{ %struct.A, i32, %struct.A }>*)** %vtable, i32 2		%vfn = getelementptr inbounds void (%struct.B*, <{ %struct.A, i32, %struct.A }>*)*, void (%struct.B*, <{ %struct.A, i32, %struct.A }>*)** %vtable, i32 2
%2 = load void (%struct.B*, <{ %struct.A, i32, %struct.A }>*)*, void (%struct.B*, <{ %struct.A, i32, %struct.A }>*)** %vfn		%2 = load void (%struct.B*, <{ %struct.A, i32, %struct.A }>*)*, void (%struct.B*, <{ %struct.A, i32, %struct.A }>*)** %vfn
musttail call x86_thiscallcc void %2(%struct.B* %this, <{ %struct.A, i32, %struct.A }>* inalloca %0)		musttail call x86_thiscallcc void %2(%struct.B* %this, <{ %struct.A, i32, %struct.A }>* inalloca %0)
ret void		ret void
}		}

		; CHECK-LABEL: h_thunk_2:
		; CHECK: jmpl
		; CHECK-NOT: ret
		define x86_thiscallcc void @h_thunk_2(%struct.B* %this, <{ %struct.A, i32, %struct.A }>* preallocated(<{ %struct.A, i32, %struct.A }>)) {
		entry:
		%1 = bitcast %struct.B* %this to void (%struct.B*, <{ %struct.A, i32, %struct.A }>*)***
		%vtable = load void (%struct.B*, <{ %struct.A, i32, %struct.A }>*)**, void (%struct.B*, <{ %struct.A, i32, %struct.A }>*)*** %1
		%vfn = getelementptr inbounds void (%struct.B*, <{ %struct.A, i32, %struct.A }>*)*, void (%struct.B*, <{ %struct.A, i32, %struct.A }>*)** %vtable, i32 2
		%2 = load void (%struct.B*, <{ %struct.A, i32, %struct.A }>*)*, void (%struct.B*, <{ %struct.A, i32, %struct.A }>*)** %vfn
		%c = call token @llvm.call.preallocated.setup(i32 1)
		%A = call i8* @llvm.call.preallocated.arg(token %c, i32 0) preallocated(<{ %struct.A, i32, %struct.A }>)
		%a = bitcast i8* %A to <{ %struct.A, i32, %struct.A }>*
		musttail call x86_thiscallcc void %2(%struct.B* %this, <{ %struct.A, i32, %struct.A }>* preallocated(<{ %struct.A, i32, %struct.A }>) %a) ["preallocated"(token %c)]
		ret void
		}

; CHECK-LABEL: i_thunk:		; CHECK-LABEL: i_thunk:
; CHECK-NOT: mov %{{.*}}, {{.*(.*esp.*)}}		; CHECK-NOT: mov %{{.*}}, {{.*(.*esp.*)}}
; CHECK: jmpl		; CHECK: jmpl
; CHECK-NOT: ret		; CHECK-NOT: ret
define x86_thiscallcc %struct.A* @i_thunk(%struct.B* %this, <{ %struct.A*, %struct.A, i32, %struct.A }>* inalloca) {		define x86_thiscallcc %struct.A* @i_thunk(%struct.B* %this, <{ %struct.A*, %struct.A, i32, %struct.A }>* inalloca) {
entry:		entry:
%1 = bitcast %struct.B* %this to %struct.A* (%struct.B*, <{ %struct.A*, %struct.A, i32, %struct.A }>*)***		%1 = bitcast %struct.B* %this to %struct.A* (%struct.B*, <{ %struct.A*, %struct.A, i32, %struct.A }>*)***
%vtable = load %struct.A* (%struct.B*, <{ %struct.A*, %struct.A, i32, %struct.A }>*)**, %struct.A* (%struct.B*, <{ %struct.A*, %struct.A, i32, %struct.A }>*)*** %1		%vtable = load %struct.A* (%struct.B*, <{ %struct.A*, %struct.A, i32, %struct.A }>*)**, %struct.A* (%struct.B*, <{ %struct.A*, %struct.A, i32, %struct.A }>*)*** %1
%vfn = getelementptr inbounds %struct.A* (%struct.B*, <{ %struct.A*, %struct.A, i32, %struct.A }>*)*, %struct.A* (%struct.B*, <{ %struct.A*, %struct.A, i32, %struct.A }>*)** %vtable, i32 3		%vfn = getelementptr inbounds %struct.A* (%struct.B*, <{ %struct.A*, %struct.A, i32, %struct.A }>*)*, %struct.A* (%struct.B*, <{ %struct.A*, %struct.A, i32, %struct.A }>*)** %vtable, i32 3
%2 = load %struct.A* (%struct.B*, <{ %struct.A*, %struct.A, i32, %struct.A }>*)*, %struct.A* (%struct.B*, <{ %struct.A*, %struct.A, i32, %struct.A }>*)** %vfn		%2 = load %struct.A* (%struct.B*, <{ %struct.A*, %struct.A, i32, %struct.A }>*)*, %struct.A* (%struct.B*, <{ %struct.A*, %struct.A, i32, %struct.A }>*)** %vfn
%3 = musttail call x86_thiscallcc %struct.A* %2(%struct.B* %this, <{ %struct.A*, %struct.A, i32, %struct.A }>* inalloca %0)		%3 = musttail call x86_thiscallcc %struct.A* %2(%struct.B* %this, <{ %struct.A*, %struct.A, i32, %struct.A }>* inalloca %0)
ret %struct.A* %3		ret %struct.A* %3
}		}

		; CHECK-LABEL: i_thunk_2:
		; CHECK: jmpl
		; CHECK-NOT: ret
		define x86_thiscallcc %struct.A* @i_thunk_2(%struct.B* %this, <{ %struct.A*, %struct.A, i32, %struct.A }>* preallocated(<{ %struct.A*, %struct.A, i32, %struct.A }>)) {
		entry:
		%1 = bitcast %struct.B* %this to %struct.A* (%struct.B*, <{ %struct.A*, %struct.A, i32, %struct.A }>*)***
		%vtable = load %struct.A* (%struct.B*, <{ %struct.A*, %struct.A, i32, %struct.A }>*)**, %struct.A* (%struct.B*, <{ %struct.A*, %struct.A, i32, %struct.A }>*)*** %1
		%vfn = getelementptr inbounds %struct.A* (%struct.B*, <{ %struct.A*, %struct.A, i32, %struct.A }>*)*, %struct.A* (%struct.B*, <{ %struct.A*, %struct.A, i32, %struct.A }>*)** %vtable, i32 3
		%2 = load %struct.A* (%struct.B*, <{ %struct.A*, %struct.A, i32, %struct.A }>*)*, %struct.A* (%struct.B*, <{ %struct.A*, %struct.A, i32, %struct.A }>*)** %vfn
		%c = call token @llvm.call.preallocated.setup(i32 1)
		%A = call i8* @llvm.call.preallocated.arg(token %c, i32 0) preallocated(<{ %struct.A, i32, %struct.A }>)
		%a = bitcast i8* %A to <{ %struct.A*, %struct.A, i32, %struct.A }>*
		%3 = musttail call x86_thiscallcc %struct.A* %2(%struct.B* %this, <{ %struct.A*, %struct.A, i32, %struct.A }>* preallocated(<{ %struct.A*, %struct.A, i32, %struct.A }>) %a) ["preallocated"(token %c)]
		ret %struct.A* %3
		}

; CHECK-LABEL: j_thunk:		; CHECK-LABEL: j_thunk:
; CHECK: jmpl		; CHECK: jmpl
; CHECK-NOT: ret		; CHECK-NOT: ret
define x86_thiscallcc void @j_thunk(%struct.A* noalias sret %agg.result, %struct.B* %this, i32) {		define x86_thiscallcc void @j_thunk(%struct.A* noalias sret %agg.result, %struct.B* %this, i32) {
entry:		entry:
%1 = bitcast %struct.B* %this to void (%struct.A*, %struct.B*, i32)***		%1 = bitcast %struct.B* %this to void (%struct.A*, %struct.B*, i32)***
%vtable = load void (%struct.A*, %struct.B*, i32)**, void (%struct.A*, %struct.B*, i32)*** %1		%vtable = load void (%struct.A*, %struct.B*, i32)**, void (%struct.A*, %struct.B*, i32)*** %1
%vfn = getelementptr inbounds void (%struct.A*, %struct.B*, i32)*, void (%struct.A*, %struct.B*, i32)** %vtable, i32 4		%vfn = getelementptr inbounds void (%struct.A*, %struct.B*, i32)*, void (%struct.A*, %struct.B*, i32)** %vtable, i32 4
[...]
entry:
%1 = bitcast %struct.B* %this to i32 (<{ %struct.B*, %struct.A }>*)***		%1 = bitcast %struct.B* %this to i32 (<{ %struct.B*, %struct.A }>*)***
%vtable = load i32 (<{ %struct.B*, %struct.A }>*)**, i32 (<{ %struct.B*, %struct.A }>*)*** %1		%vtable = load i32 (<{ %struct.B*, %struct.A }>*)**, i32 (<{ %struct.B*, %struct.A }>*)*** %1
%vfn = getelementptr inbounds i32 (<{ %struct.B*, %struct.A }>*)*, i32 (<{ %struct.B*, %struct.A }>*)** %vtable, i32 1		%vfn = getelementptr inbounds i32 (<{ %struct.B*, %struct.A }>*)*, i32 (<{ %struct.B*, %struct.A }>*)** %vtable, i32 1
%2 = load i32 (<{ %struct.B*, %struct.A }>*)*, i32 (<{ %struct.B*, %struct.A }>*)** %vfn		%2 = load i32 (<{ %struct.B*, %struct.A }>*)*, i32 (<{ %struct.B*, %struct.A }>*)** %vfn
%3 = musttail call x86_stdcallcc i32 %2(<{ %struct.B*, %struct.A }>* inalloca %0)		%3 = musttail call x86_stdcallcc i32 %2(<{ %struct.B*, %struct.A }>* inalloca %0)
ret i32 %3		ret i32 %3
}		}

		; CHECK-LABEL: _stdcall_thunk_2@8:
		; CHECK: jmpl
		; CHECK-NOT: ret
		define x86_stdcallcc i32 @stdcall_thunk_2(<{ %struct.B*, %struct.A }>* preallocated(<{ %struct.B*, %struct.A }>)) {
		entry:
		%this_ptr = getelementptr inbounds <{ %struct.B*, %struct.A }>, <{ %struct.B*, %struct.A }>* %0, i32 0, i32 0
		%this = load %struct.B*, %struct.B** %this_ptr
		%1 = bitcast %struct.B* %this to i32 (<{ %struct.B*, %struct.A }>*)***
		%vtable = load i32 (<{ %struct.B*, %struct.A }>*)**, i32 (<{ %struct.B*, %struct.A }>*)*** %1
		%vfn = getelementptr inbounds i32 (<{ %struct.B*, %struct.A }>*)*, i32 (<{ %struct.B*, %struct.A }>*)** %vtable, i32 1
		%2 = load i32 (<{ %struct.B*, %struct.A }>*)*, i32 (<{ %struct.B*, %struct.A }>*)** %vfn
		%c = call token @llvm.call.preallocated.setup(i32 1)
		%A = call i8* @llvm.call.preallocated.arg(token %c, i32 0) preallocated(<{ %struct.B*, %struct.A }>)
		%a = bitcast i8* %A to <{ %struct.B*, %struct.A }>*
		%3 = musttail call x86_stdcallcc i32 %2(<{ %struct.B*, %struct.A }>* preallocated(<{ %struct.B*, %struct.A }>) %a) ["preallocated"(token %c)]
		ret i32 %3
		}

; CHECK-LABEL: @fastcall_thunk@8:		; CHECK-LABEL: @fastcall_thunk@8:
; CHECK-NOT: mov %{{.*}}, {{.*(.*esp.*)}}		; CHECK-NOT: mov %{{.*}}, {{.*(.*esp.*)}}
; CHECK: jmpl		; CHECK: jmpl
; CHECK-NOT: ret		; CHECK-NOT: ret
define x86_fastcallcc i32 @fastcall_thunk(%struct.B* inreg %this, <{ %struct.A }>* inalloca) {		define x86_fastcallcc i32 @fastcall_thunk(%struct.B* inreg %this, <{ %struct.A }>* inalloca) {
entry:		entry:
%1 = bitcast %struct.B* %this to i32 (%struct.B*, <{ %struct.A }>*)***		%1 = bitcast %struct.B* %this to i32 (%struct.B*, <{ %struct.A }>*)***
%vtable = load i32 (%struct.B*, <{ %struct.A }>*)**, i32 (%struct.B*, <{ %struct.A }>*)*** %1		%vtable = load i32 (%struct.B*, <{ %struct.A }>*)**, i32 (%struct.B*, <{ %struct.A }>*)*** %1
%vfn = getelementptr inbounds i32 (%struct.B*, <{ %struct.A }>*)*, i32 (%struct.B*, <{ %struct.A }>*)** %vtable, i32 1		%vfn = getelementptr inbounds i32 (%struct.B*, <{ %struct.A }>*)*, i32 (%struct.B*, <{ %struct.A }>*)** %vtable, i32 1
%2 = load i32 (%struct.B*, <{ %struct.A }>*)*, i32 (%struct.B*, <{ %struct.A }>*)** %vfn		%2 = load i32 (%struct.B*, <{ %struct.A }>*)*, i32 (%struct.B*, <{ %struct.A }>*)** %vfn
%3 = musttail call x86_fastcallcc i32 %2(%struct.B* inreg %this, <{ %struct.A }>* inalloca %0)		%3 = musttail call x86_fastcallcc i32 %2(%struct.B* inreg %this, <{ %struct.A }>* inalloca %0)
ret i32 %3		ret i32 %3
}		}

		; CHECK-LABEL: @fastcall_thunk_2@8:
		; CHECK: jmpl
		; CHECK-NOT: ret
		define x86_fastcallcc i32 @fastcall_thunk_2(%struct.B* inreg %this, <{ %struct.A }>* preallocated(<{%struct.A}>)) {
		entry:
		%1 = bitcast %struct.B* %this to i32 (%struct.B*, <{ %struct.A }>*)***
		%vtable = load i32 (%struct.B*, <{ %struct.A }>*)**, i32 (%struct.B*, <{ %struct.A }>*)*** %1
		%vfn = getelementptr inbounds i32 (%struct.B*, <{ %struct.A }>*)*, i32 (%struct.B*, <{ %struct.A }>*)** %vtable, i32 1
		%2 = load i32 (%struct.B*, <{ %struct.A }>*)*, i32 (%struct.B*, <{ %struct.A }>*)** %vfn
		%c = call token @llvm.call.preallocated.setup(i32 1)
		%A = call i8* @llvm.call.preallocated.arg(token %c, i32 0) preallocated(<{ %struct.A }>)
		%a = bitcast i8* %A to <{ %struct.A }>*
		%3 = musttail call x86_fastcallcc i32 %2(%struct.B* inreg %this, <{ %struct.A }>* preallocated(<{ %struct.A }>) %a) ["preallocated"(token %c)]
		ret i32 %3
		}

llvm/test/CodeGen/X86/musttail-thiscall.ll

	; RUN: llc -verify-machineinstrs -mtriple=i686-- < %s | FileCheck %s			; RUN: llc -verify-machineinstrs -mtriple=i686-- < %s | FileCheck %s
	; RUN: llc -verify-machineinstrs -mtriple=i686-- -O0 < %s | FileCheck %s			; RUN: llc -verify-machineinstrs -mtriple=i686-- -O0 < %s | FileCheck %s

				declare token @llvm.call.preallocated.setup(i32)
				declare i8* @llvm.call.preallocated.arg(token, i32)

	; CHECK-LABEL: t1:			; CHECK-LABEL: t1:
	; CHECK: jmp {{_?}}t1_callee			; CHECK: jmp {{_?}}t1_callee
	define x86_thiscallcc void @t1(i8* %this) {			define x86_thiscallcc void @t1(i8* %this) {
	%adj = getelementptr i8, i8* %this, i32 4			%adj = getelementptr i8, i8* %this, i32 4
	musttail call x86_thiscallcc void @t1_callee(i8* %adj)			musttail call x86_thiscallcc void @t1_callee(i8* %adj)
	ret void			ret void
	}			}
	declare x86_thiscallcc void @t1_callee(i8* %this)			declare x86_thiscallcc void @t1_callee(i8* %this)
	[...]
	define x86_thiscallcc i8* @t3(i8* %this, <{ i8*, i32 }>* inalloca %args) {			define x86_thiscallcc i8* @t3(i8* %this, <{ i8*, i32 }>* inalloca %args) {
	%adj = getelementptr i8, i8* %this, i32 4			%adj = getelementptr i8, i8* %this, i32 4
	%a_ptr = getelementptr <{ i8*, i32 }>, <{ i8*, i32 }>* %args, i32 0, i32 1			%a_ptr = getelementptr <{ i8*, i32 }>, <{ i8*, i32 }>* %args, i32 0, i32 1
	store i32 0, i32* %a_ptr			store i32 0, i32* %a_ptr
	%rv = musttail call x86_thiscallcc i8* @t3_callee(i8* %adj, <{ i8*, i32 }>* inalloca %args)			%rv = musttail call x86_thiscallcc i8* @t3_callee(i8* %adj, <{ i8*, i32 }>* inalloca %args)
	ret i8* %rv			ret i8* %rv
	}			}
	declare x86_thiscallcc i8* @t3_callee(i8* %this, <{ i8*, i32 }>* inalloca %args);			declare x86_thiscallcc i8* @t3_callee(i8* %this, <{ i8*, i32 }>* inalloca %args);

				; CHECK-LABEL: t4:
				; CHECK: jmp {{_?}}t4_callee
				define x86_thiscallcc i8* @t4(i8* %this, <{ i8*, i32 }>* preallocated(<{i8*, i32}>) %args) {
				%adj = getelementptr i8, i8* %this, i32 4
				%a_ptr = getelementptr <{ i8*, i32 }>, <{ i8*, i32 }>* %args, i32 0, i32 1
				store i32 0, i32* %a_ptr
				%c = call token @llvm.call.preallocated.setup(i32 1)
				%A = call i8* @llvm.call.preallocated.arg(token %c, i32 0) preallocated(<{i8*, i32}>)
				%a = bitcast i8* %A to <{ i8*, i32 }>*
				%tmp = load <{ i8*, i32 }>, <{ i8*, i32 }>* %args
				store <{ i8*, i32 }> %tmp, <{ i8*, i32 }>* %a
				%rv = musttail call x86_thiscallcc i8* @t4_callee(i8* %adj, <{ i8*, i32 }>* preallocated(<{ i8*, i32 }>) %a) ["preallocated"(token %c)]
				ret i8* %rv
				}
				declare x86_thiscallcc i8* @t4_callee(i8* %this, <{ i8*, i32 }>* preallocated(<{i8*, i32}>) %args);

llvm/test/CodeGen/X86/preallocated-nocall.ll

This file was added.

				; RUN: llc < %s -mtriple=i686-pc-win32 | FileCheck %s
				; XFAIL: *

				declare token @llvm.call.preallocated.setup(i32)
				declare i8* @llvm.call.preallocated.arg(token, i32)

				%Foo = type { i32, i32 }

				declare void @init(%Foo*)



				declare void @foo_p(%Foo* preallocated(%Foo))

				define void @no_call() {
				; CHECK-LABEL: _no_call:
				%t = call token @llvm.call.preallocated.setup(i32 1)
				%a = call i8* @llvm.call.preallocated.arg(token %t, i32 0) preallocated(%Foo)
				%b = bitcast i8* %a to %Foo*
				call void @init(%Foo* %b)
				ret void
				}

llvm/test/CodeGen/X86/preallocated-x64.ll

This file was added.

				; RUN: llc %s -mtriple=x86_64-windows-msvc -o /dev/null 2>&1
				; XFAIL: *

				declare token @llvm.call.preallocated.setup(i32)
				declare i8* @llvm.call.preallocated.arg(token, i32)

				%Foo = type { i32, i32 }

				declare x86_thiscallcc void @f(i32, %Foo* preallocated(%Foo))

				define void @g() {
				%t = call token @llvm.call.preallocated.setup(i32 1)
				%a = call i8* @llvm.call.preallocated.arg(token %t, i32 0) preallocated(%Foo)
				%b = bitcast i8* %a to %Foo*
				call void @f(i32 0, %Foo* preallocated(%Foo) %b) ["preallocated"(token %t)]
				ret void
				}

llvm/test/CodeGen/X86/preallocated.ll

This file was added.

				; RUN: llc < %s -mtriple=i686-pc-win32 | FileCheck %s

				declare token @llvm.call.preallocated.setup(i32)
				declare i8* @llvm.call.preallocated.arg(token, i32)

				%Foo = type { i32, i32 }

				declare void @init(%Foo*)



				declare void @foo_p(%Foo* preallocated(%Foo))

				define void @one_preallocated() {
				; CHECK-LABEL: _one_preallocated:
				%t = call token @llvm.call.preallocated.setup(i32 1)
				%a = call i8* @llvm.call.preallocated.arg(token %t, i32 0) preallocated(%Foo)
				%b = bitcast i8* %a to %Foo*
				; CHECK: subl $8, %esp
				; CHECK: calll _foo_p
				call void @foo_p(%Foo* preallocated(%Foo) %b) ["preallocated"(token %t)]
				ret void
				}

				define void @one_preallocated_two_blocks() {
				; CHECK-LABEL: _one_preallocated_two_blocks:
				%t = call token @llvm.call.preallocated.setup(i32 1)
				br label %second
				second:
				%a = call i8* @llvm.call.preallocated.arg(token %t, i32 0) preallocated(%Foo)
				%b = bitcast i8* %a to %Foo*
				; CHECK: subl $8, %esp
				; CHECK: calll _foo_p
				call void @foo_p(%Foo* preallocated(%Foo) %b) ["preallocated"(token %t)]
				ret void
				}

				define void @preallocated_with_store() {
				; CHECK-LABEL: _preallocated_with_store:
				; CHECK: subl $8, %esp
				%t = call token @llvm.call.preallocated.setup(i32 1)
				; CHECK: leal (%esp), [[REGISTER:%[a-z]+]]
				%a = call i8* @llvm.call.preallocated.arg(token %t, i32 0) preallocated(%Foo)
				%b = bitcast i8* %a to %Foo*
				%p0 = getelementptr %Foo, %Foo* %b, i32 0, i32 0
				%p1 = getelementptr %Foo, %Foo* %b, i32 0, i32 1
				store i32 13, i32* %p0
				store i32 42, i32* %p1
				; CHECK-DAG: movl $13, ([[REGISTER]])
				; CHECK-DAG: movl $42, 4([[REGISTER]])
				; CHECK-NOT: subl {{\$[0-9]+}}, %esp
				; CHECK-NOT: pushl
				; CHECK: calll _foo_p
				call void @foo_p(%Foo* preallocated(%Foo) %b) ["preallocated"(token %t)]
				ret void
				}

				define void @preallocated_with_init() {
				; CHECK-LABEL: _preallocated_with_init:
				; CHECK: subl $8, %esp
				%t = call token @llvm.call.preallocated.setup(i32 1)
				; CHECK: leal (%esp), [[REGISTER:%[a-z]+]]
				%a = call i8* @llvm.call.preallocated.arg(token %t, i32 0) preallocated(%Foo)
				%b = bitcast i8* %a to %Foo*
				; CHECK: pushl [[REGISTER]]
				; CHECK: calll _init
				call void @init(%Foo* %b)
				; CHECK-NOT: subl {{\$[0-9]+}}, %esp
				; CHECK-NOT: pushl
				; CHECK: calll _foo_p
				call void @foo_p(%Foo* preallocated(%Foo) %b) ["preallocated"(token %t)]
				ret void
				}

				declare void @foo_p_p(%Foo* preallocated(%Foo), %Foo* preallocated(%Foo))

				define void @two_preallocated() {
				; CHECK-LABEL: _two_preallocated:
				%t = call token @llvm.call.preallocated.setup(i32 2)
				%a1 = call i8* @llvm.call.preallocated.arg(token %t, i32 0) preallocated(%Foo)
				%b1 = bitcast i8* %a1 to %Foo*
				%a2 = call i8* @llvm.call.preallocated.arg(token %t, i32 1) preallocated(%Foo)
				%b2 = bitcast i8* %a2 to %Foo*
				; CHECK: subl $16, %esp
				; CHECK: calll _foo_p_p
				call void @foo_p_p(%Foo* preallocated(%Foo) %b1, %Foo* preallocated(%Foo) %b2) ["preallocated"(token %t)]
				ret void
				}

				declare void @foo_p_int(%Foo* preallocated(%Foo), i32)

				define void @one_preallocated_one_normal() {
				; CHECK-LABEL: _one_preallocated_one_normal:
				; CHECK: subl $12, %esp
				%t = call token @llvm.call.preallocated.setup(i32 1)
				; CHECK: leal (%esp), [[REGISTER:%[a-z]+]]
				%a = call i8* @llvm.call.preallocated.arg(token %t, i32 0) preallocated(%Foo)
				%b = bitcast i8* %a to %Foo*
				; CHECK: pushl [[REGISTER]]
				; CHECK: calll _init
				call void @init(%Foo* %b)
				; CHECK-NOT: subl {{\$[0-9]+}}, %esp
				; CHECK-NOT: pushl
				; CHECK: movl $2, 8(%esp)
				; CHECK: calll _foo_p_int
				call void @foo_p_int(%Foo* preallocated(%Foo) %b, i32 2) ["preallocated"(token %t)]
				ret void
				}

				declare void @foo_ret_p(%Foo* sret, %Foo* preallocated(%Foo))

				define void @nested_with_init() {
				; CHECK-LABEL: _nested_with_init:
				%tmp = alloca %Foo

				%t1 = call token @llvm.call.preallocated.setup(i32 1)
				; CHECK: subl $12, %esp
				%a1 = call i8* @llvm.call.preallocated.arg(token %t1, i32 0) preallocated(%Foo)
				%b1 = bitcast i8* %a1 to %Foo*
				; CHECK: leal 4(%esp), [[REGISTER1:%[a-z]+]]

				%t2 = call token @llvm.call.preallocated.setup(i32 1)
				; CHECK: subl $12, %esp
				%a2 = call i8* @llvm.call.preallocated.arg(token %t2, i32 0) preallocated(%Foo)
				; CHECK: leal 4(%esp), [[REGISTER2:%[a-z]+]]
				%b2 = bitcast i8* %a2 to %Foo*

				call void @init(%Foo* %b2)
				; CHECK: pushl [[REGISTER2]]
				; CHECK: calll _init

				call void @foo_ret_p(%Foo* %b1, %Foo* preallocated(%Foo) %b2) ["preallocated"(token %t2)]
				; CHECK-NOT: subl {{\$[0-9]+}}, %esp
				; CHECK-NOT: pushl
				; CHECK: calll _foo_ret_p
				call void @foo_ret_p(%Foo* %tmp, %Foo* preallocated(%Foo) %b1) ["preallocated"(token %t1)]
				; CHECK-NOT: subl {{\$[0-9]+}}, %esp
				; CHECK-NOT: pushl
				; CHECK: calll _foo_ret_p
				ret void
				}

				declare void @foo_inreg_p(i32 inreg, %Foo* preallocated(%Foo))

				define void @inreg() {
				; CHECK-LABEL: _inreg:
				%t = call token @llvm.call.preallocated.setup(i32 1)
				%a = call i8* @llvm.call.preallocated.arg(token %t, i32 0) preallocated(%Foo)
				%b = bitcast i8* %a to %Foo*
				; CHECK: subl $8, %esp
				; CHECK: movl $9, %eax
				; CHECK: calll _foo_inreg_p
				call void @foo_inreg_p(i32 9, %Foo* preallocated(%Foo) %b) ["preallocated"(token %t)]
				ret void
				}

				declare x86_thiscallcc void @foo_thiscall_p(i8*, %Foo* preallocated(%Foo))

				define void @thiscall() {
				; CHECK-LABEL: _thiscall:
				%t = call token @llvm.call.preallocated.setup(i32 1)
				%a = call i8* @llvm.call.preallocated.arg(token %t, i32 0) preallocated(%Foo)
				%b = bitcast i8* %a to %Foo*
				; CHECK: subl $8, %esp
				; CHECK: xorl %ecx, %ecx
				; CHECK: calll _foo_thiscall_p
				call x86_thiscallcc void @foo_thiscall_p(i8* null, %Foo* preallocated(%Foo) %b) ["preallocated"(token %t)]
				ret void
				}

				declare x86_stdcallcc void @foo_stdcall_p(%Foo* preallocated(%Foo))
				declare x86_stdcallcc void @i(i32)

				define void @stdcall() {
				; CHECK-LABEL: _stdcall:
				%t = call token @llvm.call.preallocated.setup(i32 1)
				%a = call i8* @llvm.call.preallocated.arg(token %t, i32 0) preallocated(%Foo)
				%b = bitcast i8* %a to %Foo*
				; CHECK: subl $8, %esp
				; CHECK: calll _foo_stdcall_p@8
				call x86_stdcallcc void @foo_stdcall_p(%Foo* preallocated(%Foo) %b) ["preallocated"(token %t)]
				; CHECK-NOT: %esp
				; CHECK: pushl
				; CHECK: calll _i@4
				call x86_stdcallcc void @i(i32 0)
				ret void
				}
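
The `subl` amounts and `%esp` offsets checked in the preallocated.ll tests above follow from laying the stack-passed arguments out in 4-byte slots. The sketch below reproduces that arithmetic in Python. It is an illustration only, not the X86ISelLowering code: the `Arg` class and `layout` function are hypothetical names, and `%Foo` is assumed to be 8 bytes because its definition is not part of this excerpt.

```python
from dataclasses import dataclass

SLOT = 4  # bytes per stack slot on 32-bit x86


@dataclass
class Arg:
    size: int                  # size of the argument in bytes
    in_register: bool = False  # True for inreg arguments or the thiscall 'this'


def layout(args):
    """Return (offsets, total): the offset of each stack argument from %esp
    (None for register arguments) and the total stack adjustment."""
    offsets = []
    total = 0
    for arg in args:
        if arg.in_register:
            offsets.append(None)
            continue
        offsets.append(total)
        # Round each argument up to a whole number of slots.
        total += (arg.size + SLOT - 1) // SLOT * SLOT
    return offsets, total


FOO = 8  # ASSUMPTION: %Foo occupies 8 bytes
PTR = 4  # pointer size on i686
I32 = 4

print(layout([Arg(FOO), Arg(FOO)]))                    # foo_p_p:     subl $16
print(layout([Arg(FOO), Arg(I32)]))                    # foo_p_int:   subl $12, i32 at 8(%esp)
print(layout([Arg(PTR), Arg(FOO)]))                    # foo_ret_p:   subl $12, %Foo at 4(%esp)
print(layout([Arg(I32, in_register=True), Arg(FOO)]))  # foo_inreg_p: subl $8
```

Under these assumptions the computed totals match the `subl $16`, `subl $12`, and `subl $8` CHECK lines, and the offsets match `movl $2, 8(%esp)` and `leal 4(%esp)`.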

llvm/test/CodeGen/X86/shrink-wrap-chkstk.ll

	; RUN: llc < %s -enable-shrink-wrap=true | FileCheck %s			; RUN: llc < %s -enable-shrink-wrap=true | FileCheck %s

				; TODO: add preallocated versions of tests
				; we don't yet support conditionally called preallocated calls after the setup

	; chkstk cannot come before the usual prologue, since it adjusts ESP.			; chkstk cannot come before the usual prologue, since it adjusts ESP.
	; If chkstk is used in the prologue, we also have to be careful about preserving			; If chkstk is used in the prologue, we also have to be careful about preserving
	; EAX if it is used.			; EAX if it is used.

	target datalayout = "e-m:x-p:32:32-i64:64-f80:32-n8:16:32-a:0:32-S32"			target datalayout = "e-m:x-p:32:32-i64:64-f80:32-n8:16:32-a:0:32-S32"
	target triple = "i686-pc-windows-msvc18.0.0"			target triple = "i686-pc-windows-msvc18.0.0"

	%struct.S = type { [8192 x i8] }			%struct.S = type { [8192 x i8] }

llvm/test/CodeGen/X86/tail-call-mutable-memarg.ll

	; RUN: llc < %s | FileCheck %s			; RUN: llc < %s | FileCheck %s

	; Make sure we check that forwarded memory arguments are not modified when tail			; Make sure we check that forwarded memory arguments are not modified when tail
	; calling. inalloca and copy arg elimination make argument slots mutable.			; calling. inalloca and copy arg elimination make argument slots mutable.

	target datalayout = "e-m:x-p:32:32-i64:64-f80:32-n8:16:32-a:0:32-S32"			target datalayout = "e-m:x-p:32:32-i64:64-f80:32-n8:16:32-a:0:32-S32"
	target triple = "i386-pc-windows-msvc19.0.24215"			target triple = "i386-pc-windows-msvc19.0.24215"

	declare x86_stdcallcc void @tail_std(i32)			declare x86_stdcallcc void @tail_std(i32)
	declare void @capture(i32*)			declare void @capture(i32*)

				define x86_thiscallcc void @preallocated(i32* %this, i32* preallocated(i32) %args) {
				entry:
				%val = load i32, i32* %args
				store i32 0, i32* %args
				tail call x86_stdcallcc void @tail_std(i32 %val)
				ret void
				}

				; CHECK-LABEL: _preallocated: # @preallocated
				; CHECK: movl 4(%esp), %[[reg:[^ ]*]]
				; CHECK: movl $0, 4(%esp)
				; CHECK: pushl %[[reg]]
				; CHECK: calll _tail_std@4
				; CHECK: retl $4

	define x86_thiscallcc void @inalloca(i32* %this, i32* inalloca %args) {			define x86_thiscallcc void @inalloca(i32* %this, i32* inalloca %args) {
	entry:			entry:
	%val = load i32, i32* %args			%val = load i32, i32* %args
	store i32 0, i32* %args			store i32 0, i32* %args
	tail call x86_stdcallcc void @tail_std(i32 %val)			tail call x86_stdcallcc void @tail_std(i32 %val)
	ret void			ret void
	}			}


llvm/test/Transforms/Attributor/value-simplify.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py UTC_ARGS: --function-signature --scrub-attributes			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py UTC_ARGS: --function-signature --scrub-attributes
	; RUN: opt -attributor -attributor-manifest-internal -attributor-max-iterations-verify -attributor-annotate-decl-cs -attributor-max-iterations=4 -S < %s | FileCheck %s --check-prefixes=CHECK,NOT_CGSCC_NPM,NOT_CGSCC_OPM,NOT_TUNIT_NPM,IS__TUNIT____,IS________OPM,IS__TUNIT_OPM			; RUN: opt -attributor -attributor-manifest-internal -attributor-max-iterations-verify -attributor-annotate-decl-cs -attributor-max-iterations=4 -S < %s | FileCheck %s --check-prefixes=CHECK,NOT_CGSCC_NPM,NOT_CGSCC_OPM,NOT_TUNIT_NPM,IS__TUNIT____,IS________OPM,IS__TUNIT_OPM
	; RUN: opt -aa-pipeline=basic-aa -passes=attributor -attributor-manifest-internal -attributor-max-iterations-verify -attributor-annotate-decl-cs -attributor-max-iterations=4 -S < %s | FileCheck %s --check-prefixes=CHECK,NOT_CGSCC_OPM,NOT_CGSCC_NPM,NOT_TUNIT_OPM,IS__TUNIT____,IS________NPM,IS__TUNIT_NPM			; RUN: opt -aa-pipeline=basic-aa -passes=attributor -attributor-manifest-internal -attributor-max-iterations-verify -attributor-annotate-decl-cs -attributor-max-iterations=4 -S < %s | FileCheck %s --check-prefixes=CHECK,NOT_CGSCC_OPM,NOT_CGSCC_NPM,NOT_TUNIT_OPM,IS__TUNIT____,IS________NPM,IS__TUNIT_NPM
	; RUN: opt -attributor-cgscc -attributor-manifest-internal -attributor-annotate-decl-cs -S < %s | FileCheck %s --check-prefixes=CHECK,NOT_TUNIT_NPM,NOT_TUNIT_OPM,NOT_CGSCC_NPM,IS__CGSCC____,IS________OPM,IS__CGSCC_OPM			; RUN: opt -attributor-cgscc -attributor-manifest-internal -attributor-annotate-decl-cs -S < %s | FileCheck %s --check-prefixes=CHECK,NOT_TUNIT_NPM,NOT_TUNIT_OPM,NOT_CGSCC_NPM,IS__CGSCC____,IS________OPM,IS__CGSCC_OPM
	; RUN: opt -aa-pipeline=basic-aa -passes=attributor-cgscc -attributor-manifest-internal -attributor-annotate-decl-cs -S < %s | FileCheck %s --check-prefixes=CHECK,NOT_TUNIT_NPM,NOT_TUNIT_OPM,NOT_CGSCC_OPM,IS__CGSCC____,IS________NPM,IS__CGSCC_NPM			; RUN: opt -aa-pipeline=basic-aa -passes=attributor-cgscc -attributor-manifest-internal -attributor-annotate-decl-cs -S < %s | FileCheck %s --check-prefixes=CHECK,NOT_TUNIT_NPM,NOT_TUNIT_OPM,NOT_CGSCC_OPM,IS__CGSCC____,IS________NPM,IS__CGSCC_NPM

	target datalayout = "e-m:o-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:o-i64:64-f80:128-n8:16:32:64-S128"
	declare void @f(i32)			declare void @f(i32)
				declare token @llvm.call.preallocated.setup(i32)
				declare i8* @llvm.call.preallocated.arg(token, i32)

	; Test1: Replace argument with constant			; Test1: Replace argument with constant
	define internal void @test1(i32 %a) {			define internal void @test1(i32 %a) {
	; CHECK-LABEL: define {{[^@]+}}@test1()			; CHECK-LABEL: define {{[^@]+}}@test1()
	; CHECK-NEXT: tail call void @f(i32 1)			; CHECK-NEXT: tail call void @f(i32 1)
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	tail call void @f(i32 %a)			tail call void @f(i32 %a)
	; CHECK-LABEL: define {{[^@]+}}@complicated_args_inalloca()			; CHECK-LABEL: define {{[^@]+}}@complicated_args_inalloca()
	; CHECK-NEXT: [[CALL:%.*]] = call i32* @test_inalloca(i32* noalias nocapture nofree writeonly align 536870912 null)			; CHECK-NEXT: [[CALL:%.*]] = call i32* @test_inalloca(i32* noalias nocapture nofree writeonly align 536870912 null)
	; CHECK-NEXT: ret i32* [[CALL]]			; CHECK-NEXT: ret i32* [[CALL]]
	;			;
	%call = call i32* @test_inalloca(i32* null)			%call = call i32* @test_inalloca(i32* null)
	ret i32* %call			ret i32* %call
	}			}

				define internal i32* @test_preallocated(i32* preallocated(i32) %a) {
				; IS__TUNIT____-LABEL: define {{[^@]+}}@test_preallocated
				; IS__TUNIT____-SAME: (i32* noalias nofree returned writeonly preallocated(i32) align 536870912 "no-capture-maybe-returned" [[A:%.*]])
				; IS__TUNIT____-NEXT: ret i32* [[A]]
				;
				; IS__CGSCC____-LABEL: define {{[^@]+}}@test_preallocated
				; IS__CGSCC____-SAME: (i32* noalias nofree returned writeonly preallocated(i32) "no-capture-maybe-returned" [[A:%.*]])
				; IS__CGSCC____-NEXT: ret i32* [[A]]
				;
				ret i32* %a
				}
				define i32* @complicated_args_preallocated() {
				; IS__TUNIT_OPM-LABEL: define {{[^@]+}}@complicated_args_preallocated()
				; IS__TUNIT_OPM-NEXT: [[C:%.*]] = call token @llvm.call.preallocated.setup(i32 1)
				; IS__TUNIT_OPM-NEXT: [[CALL:%.*]] = call i32* @test_preallocated(i32* noalias nocapture nofree writeonly preallocated(i32) align 536870912 null) #5 [ "preallocated"(token [[C]]) ]
				; IS__TUNIT_OPM-NEXT: ret i32* [[CALL]]
				;
				; IS__TUNIT_NPM-LABEL: define {{[^@]+}}@complicated_args_preallocated()
				; IS__TUNIT_NPM-NEXT: [[C:%.*]] = call token @llvm.call.preallocated.setup(i32 1)
				; IS__TUNIT_NPM-NEXT: [[CALL:%.*]] = call i32* @test_preallocated(i32* noalias nocapture nofree writeonly preallocated(i32) align 536870912 null) #4 [ "preallocated"(token [[C]]) ]
				; IS__TUNIT_NPM-NEXT: ret i32* [[CALL]]
				;
				; IS__CGSCC____-LABEL: define {{[^@]+}}@complicated_args_preallocated()
				; IS__CGSCC____-NEXT: [[C:%.*]] = call token @llvm.call.preallocated.setup(i32 1)
				; IS__CGSCC____-NEXT: [[CALL:%.*]] = call i32* @test_preallocated(i32* noalias nocapture nofree writeonly preallocated(i32) align 536870912 null) #6 [ "preallocated"(token [[C]]) ]
				; IS__CGSCC____-NEXT: ret i32* [[CALL]]
				;
				%c = call token @llvm.call.preallocated.setup(i32 1)
				%call = call i32* @test_preallocated(i32* preallocated(i32) null) ["preallocated"(token %c)]
				ret i32* %call
				}

	define internal void @test_sret(%struct.X* sret %a, %struct.X** %b) {			define internal void @test_sret(%struct.X* sret %a, %struct.X** %b) {
	;			;
	; IS__TUNIT____-LABEL: define {{[^@]+}}@test_sret			; IS__TUNIT____-LABEL: define {{[^@]+}}@test_sret
	; IS__TUNIT____-SAME: (%struct.X* noalias nofree sret writeonly align 536870912 [[A:%.*]], %struct.X** nocapture nofree nonnull writeonly align 8 dereferenceable(8) [[B:%.*]])			; IS__TUNIT____-SAME: (%struct.X* noalias nofree sret writeonly align 536870912 [[A:%.*]], %struct.X** nocapture nofree nonnull writeonly align 8 dereferenceable(8) [[B:%.*]])
	; IS__TUNIT____-NEXT: store %struct.X* [[A]], %struct.X** [[B]], align 8			; IS__TUNIT____-NEXT: store %struct.X* [[A]], %struct.X** [[B]], align 8
	; IS__TUNIT____-NEXT: ret void			; IS__TUNIT____-NEXT: ret void
	;			;
	; IS__CGSCC____-LABEL: define {{[^@]+}}@test_sret			; IS__CGSCC____-LABEL: define {{[^@]+}}@test_sret

llvm/test/Transforms/DeadArgElim/keepalive.ll

	; RUN: opt < %s -deadargelim -S | FileCheck %s			; RUN: opt < %s -deadargelim -S | FileCheck %s

				declare token @llvm.call.preallocated.setup(i32)
				declare i8* @llvm.call.preallocated.arg(token, i32)

	%Ty = type <{ i32, i32 }>			%Ty = type <{ i32, i32 }>

	; Check if the pass doesn't modify anything that doesn't need changing. We feed			; Check if the pass doesn't modify anything that doesn't need changing. We feed
	; an unused argument to each function to lure it into changing _something_ about			; an unused argument to each function to lure it into changing _something_ about
	; the function and then changing too much.			; the function and then changing too much.

	; This checks if the return value attributes are not removed			; This checks if the return value attributes are not removed
	; CHECK: define internal zeroext i32 @test1() #0			; CHECK: define internal zeroext i32 @test1() #0
	define i32 @caller2() {			define i32 @caller2() {
	%t = alloca i32			%t = alloca i32
	%m = alloca inalloca i32			%m = alloca inalloca i32
	store i32 42, i32* %m			store i32 42, i32* %m
	%v = call x86_thiscallcc i32 @unused_this(i32* %t, i32* inalloca %m)			%v = call x86_thiscallcc i32 @unused_this(i32* %t, i32* inalloca %m)
	ret i32 %v			ret i32 %v
	}			}

				; We can't remove 'this' here, as that would put argmem in ecx instead of
				; memory.
				define internal x86_thiscallcc i32 @unused_this_preallocated(i32* %this, i32* preallocated(i32) %argmem) {
				%v = load i32, i32* %argmem
				ret i32 %v
				}
				; CHECK-LABEL: define internal x86_thiscallcc i32 @unused_this_preallocated(i32* %this, i32* preallocated(i32) %argmem)

				define i32 @caller3() {
				%t = alloca i32
				%c = call token @llvm.call.preallocated.setup(i32 1)
				%M = call i8* @llvm.call.preallocated.arg(token %c, i32 0) preallocated(i32)
				%m = bitcast i8* %M to i32*
				store i32 42, i32* %m
				%v = call x86_thiscallcc i32 @unused_this_preallocated(i32* %t, i32* preallocated(i32) %m) ["preallocated"(token %c)]
				ret i32 %v
				}

	; CHECK: attributes #0 = { nounwind }			; CHECK: attributes #0 = { nounwind }

llvm/test/Transforms/DeadStoreElimination/MSSA/simple-todo.ll

	; CHECK-LABEL: @test9_2(			; CHECK-LABEL: @test9_2(
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	%tmp2 = getelementptr %struct.x, %struct.x* %a, i32 0, i32 0			%tmp2 = getelementptr %struct.x, %struct.x* %a, i32 0, i32 0
	store i32 1, i32* %tmp2, align 4			store i32 1, i32* %tmp2, align 4
	ret void			ret void
	}			}

				; Test for preallocated handling.
				define void @test9_3(%struct.x* preallocated(%struct.x) %a) nounwind {
				; CHECK-LABEL: @test9_3(
				; CHECK-NEXT: ret void
				;
				%tmp2 = getelementptr %struct.x, %struct.x* %a, i32 0, i32 0
				store i32 1, i32* %tmp2, align 4
				ret void
				}

	; DSE should delete the dead trampoline.			; DSE should delete the dead trampoline.
	declare void @test11f()			declare void @test11f()
	define void @test11() {			define void @test11() {
	; CHECK-LABEL: @test11(			; CHECK-LABEL: @test11(
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	%storage = alloca [10 x i8], align 16 ; <[10 x i8]*> [#uses=1]			%storage = alloca [10 x i8], align 16 ; <[10 x i8]*> [#uses=1]
	%cast = getelementptr [10 x i8], [10 x i8]* %storage, i32 0, i32 0 ; <i8*> [#uses=1]			%cast = getelementptr [10 x i8], [10 x i8]* %storage, i32 0, i32 0 ; <i8*> [#uses=1]

llvm/test/Transforms/DeadStoreElimination/simple.ll

	; CHECK-LABEL: @test9_2(			; CHECK-LABEL: @test9_2(
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	%tmp2 = getelementptr %struct.x, %struct.x* %a, i32 0, i32 0			%tmp2 = getelementptr %struct.x, %struct.x* %a, i32 0, i32 0
	store i32 1, i32* %tmp2, align 4			store i32 1, i32* %tmp2, align 4
	ret void			ret void
	}			}

				; Test for preallocated handling.
				define void @test9_3(%struct.x* preallocated(%struct.x) %a) nounwind {
				; CHECK-LABEL: @test9_3(
				; CHECK-NEXT: ret void
				;
				%tmp2 = getelementptr %struct.x, %struct.x* %a, i32 0, i32 0
				store i32 1, i32* %tmp2, align 4
				ret void
				}

	; va_arg has fuzzy dependence, the store shouldn't be zapped.			; va_arg has fuzzy dependence, the store shouldn't be zapped.
	define double @test10(i8* %X) {			define double @test10(i8* %X) {
	; CHECK-LABEL: @test10(			; CHECK-LABEL: @test10(
	; CHECK-NEXT: [[X_ADDR:%.*]] = alloca i8*			; CHECK-NEXT: [[X_ADDR:%.*]] = alloca i8*
	; CHECK-NEXT: store i8* [[X:%.*]], i8** [[X_ADDR]]			; CHECK-NEXT: store i8* [[X:%.*]], i8** [[X_ADDR]]
	; CHECK-NEXT: [[TMP_0:%.*]] = va_arg i8** [[X_ADDR]], double			; CHECK-NEXT: [[TMP_0:%.*]] = va_arg i8** [[X_ADDR]], double
	; CHECK-NEXT: ret double [[TMP_0]]			; CHECK-NEXT: ret double [[TMP_0]]
	;			;

llvm/test/Transforms/FunctionAttrs/readattrs.ll

	}			}

	; CHECK: define void @test7_1(i32* inalloca nocapture %a)			; CHECK: define void @test7_1(i32* inalloca nocapture %a)
	; inalloca parameters are always considered written			; inalloca parameters are always considered written
	define void @test7_1(i32* inalloca %a) {			define void @test7_1(i32* inalloca %a) {
	ret void			ret void
	}			}

				; CHECK: define void @test7_2(i32* nocapture preallocated(i32) %a)
				; preallocated parameters are always considered written
				define void @test7_2(i32* preallocated(i32) %a) {
				ret void
				}

	; CHECK: define i32* @test8_1(i32* readnone returned %p)			; CHECK: define i32* @test8_1(i32* readnone returned %p)
	define i32* @test8_1(i32* %p) {			define i32* @test8_1(i32* %p) {
	entry:			entry:
	ret i32* %p			ret i32* %p
	}			}

	; CHECK: define void @test8_2(i32* %p)			; CHECK: define void @test8_2(i32* %p)
	define void @test8_2(i32* %p) {			define void @test8_2(i32* %p) {

llvm/test/Transforms/GlobalOpt/fastcc.ll

	; RUN: opt < %s -globalopt -S | FileCheck %s			; RUN: opt < %s -globalopt -S | FileCheck %s

				declare token @llvm.call.preallocated.setup(i32)
				declare i8* @llvm.call.preallocated.arg(token, i32)

	define internal i32 @f(i32* %m) {			define internal i32 @f(i32* %m) {
	; CHECK-LABEL: define internal fastcc i32 @f			; CHECK-LABEL: define internal fastcc i32 @f
	%v = load i32, i32* %m			%v = load i32, i32* %m
	ret i32 %v			ret i32 %v
	}			}

	define internal x86_thiscallcc i32 @g(i32* %m) {			define internal x86_thiscallcc i32 @g(i32* %m) {
	; CHECK-LABEL: define internal fastcc i32 @g			; CHECK-LABEL: define internal fastcc i32 @g
	}			}

	define internal i32 @inalloca(i32* inalloca %p) {			define internal i32 @inalloca(i32* inalloca %p) {
	; CHECK-LABEL: define internal fastcc i32 @inalloca(i32* %p)			; CHECK-LABEL: define internal fastcc i32 @inalloca(i32* %p)
	%rv = load i32, i32* %p			%rv = load i32, i32* %p
	ret i32 %rv			ret i32 %rv
	}			}

				define internal i32 @preallocated(i32* preallocated(i32) %p) {
				; TODO: handle preallocated:
				; CHECK-NOT-LABEL: define internal fastcc i32 @preallocated(i32* %p)
				%rv = load i32, i32* %p
				ret i32 %rv
				}

	define void @call_things() {			define void @call_things() {
	%m = alloca i32			%m = alloca i32
	call i32 @f(i32* %m)			call i32 @f(i32* %m)
	call x86_thiscallcc i32 @g(i32* %m)			call x86_thiscallcc i32 @g(i32* %m)
	call coldcc i32 @h(i32* %m)			call coldcc i32 @h(i32* %m)
	call i32 @j(i32* %m)			call i32 @j(i32* %m)
	%args = alloca inalloca i32			%args = alloca inalloca i32
	call i32 @inalloca(i32* inalloca %args)			call i32 @inalloca(i32* inalloca %args)
				; TODO: handle preallocated
				;%c = call token @llvm.call.preallocated.setup(i32 1)
				;%N = call i8* @llvm.call.preallocated.arg(token %c, i32 0) preallocated(i32)
				;%n = bitcast i8* %N to i32*
				;call i32 @preallocated(i32* preallocated(i32) %n) ["preallocated"(token %c)]
	ret void			ret void
	}			}

	@llvm.used = appending global [1 x i8*] [			@llvm.used = appending global [1 x i8*] [
	i8* bitcast (i32(i32*)* @j to i8*)			i8* bitcast (i32(i32*)* @j to i8*)
	], section "llvm.metadata"			], section "llvm.metadata"

	; CHECK-LABEL: define void @call_things()			; CHECK-LABEL: define void @call_things()
	; CHECK: call fastcc i32 @f			; CHECK: call fastcc i32 @f
	; CHECK: call fastcc i32 @g			; CHECK: call fastcc i32 @g
	; CHECK: call coldcc i32 @h			; CHECK: call coldcc i32 @h
	; CHECK: call i32 @j			; CHECK: call i32 @j
	; CHECK: call fastcc i32 @inalloca(i32* %args)			; CHECK: call fastcc i32 @inalloca(i32* %args)

llvm/test/Transforms/InstCombine/call-cast-target-preallocated.ll

This file was added.

				; RUN: opt < %s -instcombine -S | FileCheck %s

				target datalayout = "e-p:32:32"
				target triple = "i686-pc-win32"


				declare token @llvm.call.preallocated.setup(i32)
				declare i8* @llvm.call.preallocated.arg(token, i32)

				declare void @takes_i32(i32)
				declare void @takes_i32_preallocated(i32* preallocated(i32))

				define void @f() {
				; CHECK-LABEL: define void @f()
				%t = call token @llvm.call.preallocated.setup(i32 1)
				%a = call i8* @llvm.call.preallocated.arg(token %t, i32 0) preallocated(i32)
				%arg = bitcast i8* %a to i32*
				call void bitcast (void (i32)* @takes_i32 to void (i32*)*)(i32* preallocated(i32) %arg) ["preallocated"(token %t)]
				; CHECK: call void bitcast{{.*}}@takes_i32
				ret void
				}

				define void @g() {
				; CHECK-LABEL: define void @g()
				call void bitcast (void (i32*)* @takes_i32_preallocated to void (i32)*)(i32 0)
				; CHECK: call void bitcast{{.*}}@takes_i32_preallocated
				ret void
				}
