This is an archive of the discontinued LLVM Phabricator instance.

lib/Target/AArch64/AArch64CallLowering.cpp
248 ↗	(On Diff #156734)	Hi Alexander, Thanks for doing this! + @aemerson It feels a little unnatural to get all the EVTs from an IR Type and then map them back to IR Types (see `SplitEVTs[i].getTypeForEVT(Ctx)` below) just to map them back to EVTs again (see `splitToValueTypes`s implementation. It would later on map them back to Type again (and finally to LLT) if I'm not mistaken). It looks like the only functionality of `ComputeValueVTs` we are actually interested in here specifically is flattening the aggregate IR Types. In fact, there is a very similar function `computeValueLLTs` in IRTranslator that computes LLTs from a Value Type by flattening the aggregates. It also dictates the size of the `VRegs` that is asserted below. `computeValueLLTs`: https://github.com/llvm-mirror/llvm/blob/a145774c76ef75206356f92dd4c1b2c0d8dea896/lib/CodeGen/GlobalISel/IRTranslator.cpp#L112-L140 Compare with `ComputeValueVTs`: https://github.com/llvm-mirror/llvm/blob/a145774c76ef75206356f92dd4c1b2c0d8dea896/lib/CodeGen/Analysis.cpp#L77-L115 What I would suggest is to see if we can extract a similar flattening function (if it doesn't already exist somewhere else) that outputs IR Types (and offsets) instead of EVTs or LLTs, implement `computeValueLLTs` and `ComputeValueVTs` via that function (by post-processing the results with `getLLTForType` and `TLI.getValueType(DL, Ty)` respectively), but use it directly here. I think that way we could have: No Type -> EVT -> Type -> EVT -> Type -> LLT conversion, just Type -> EVT -> Type -> LLT (not sure if it could be even further reduced to Type -> LLT as `TargetLoweringBase::getValueType` seems to be doing stuff like replacing pointer types with integers, and it's unclear to me if it's required in GlobalISel's context given we have pointer LLT types) Code deduplication Simpler `*CallLowering::lowerReturn`s Otherwise I think this chain of type conversions gets a little too long to follow. What do you think? That could be a separate patch of course.
lib/Target/X86/X86CallLowering.cpp
210 ↗	(On Diff #156734)	It would also be nice if we could reduce the amount of repetition from target to target if it's possible w/o introducing weird APIs. This could be a separate follow-up patch of course.

aivchenk added inline comments.Jul 27 2018, 7:32 AM

lib/Target/AArch64/AArch64CallLowering.cpp
248 ↗	(On Diff #156734)	Hello Roman, First of all, thank you for your review! Yes, I've seen the similarity between computeValueLLTs and ComputeValueVTs. I tried several things first like moving computeValueLLTs into GlobalISel/Utils.h and have it return another pointer of SmallVectorImpl<Type*>, but it didn't look good. I haven't looked at where we can introduce that generic base functionality, but this is something that I can take a look for sure. I think that another semi-major refactoring that needs to be done - and it goes to the point that you made below - is to change splitToValueTypes. AFAIU, we don't really need to do ComputeValueVTs there, because compound types are already split at this point (at least for lowerReturn), so we only need to have the "second part" : handling how we treat individual parts e.g. with TLI.functionArgumentNeedsConsecutiveRegisters or TLI.getNumRegisters. I can take a look at it as well

Thanks for taking this on. I think some x86 tests would also be good here as there evidently isn't much coverage at the moment.

nhaehnle removed a subscriber: nhaehnle.Jul 29 2018, 12:16 PM

Thanks for your comments. Indeed the struct handling is not very well covered with tests in x86. I added the tests for cases, which are supported now

Hi Alexander,

Do you want to do the refactoring within this patch or as a separate one?

Thanks,
Roman

Hi Roman, I would prefer to do it as separate patches: the first one for Value splitting routine and the second for one splitToValueTypes

In D49660#1184980, @aivchenk wrote:

Hi Roman, I would prefer to do it as separate patches: the first one for Value splitting routine and the second for one splitToValueTypes

Great, thanks!

This revision is now accepted and ready to land.Aug 1 2018, 5:12 PM

Closed by commit rL338685: [GlobalISel] Rewrite CallLowering::lowerReturn to accept multiple VRegs per… (authored by aivchenk). · Explain WhyAug 2 2018, 1:34 AM

This revision was automatically updated to reflect the committed changes.

Herald added a subscriber: jrtc27. · View Herald TranscriptAug 2 2018, 1:34 AM

xiangzhai awarded a token.Sep 3 2018, 8:49 PM

Herald added a subscriber: jvesely. · View Herald TranscriptSep 3 2018, 8:49 PM

rovka mentioned this in D63549: [GlobalISel] Accept multiple vregs in lowerFormalArgs.Jun 19 2019, 6:54 AM

rovka mentioned this in D63550: [GlobalISel] Accept multiple vregs for lowerCall's result.Jun 19 2019, 6:57 AM

rovka mentioned this in D63551: [GlobalISel] Accept multiple vregs for lowerCall's arguments.Jun 19 2019, 6:59 AM

rovka mentioned this in rL364510: [GlobalISel] Accept multiple vregs in lowerFormalArgs.Jun 27 2019, 1:55 AM

rovka mentioned this in rGc3dbe2397792: [GlobalISel] Accept multiple vregs in lowerFormalArgs.

rovka mentioned this in rG8138996128cd: [GlobalISel] Accept multiple vregs for lowerCall's result.Jun 27 2019, 2:21 AM

rovka mentioned this in rL364511: [GlobalISel] Accept multiple vregs for lowerCall's result.

rovka mentioned this in rG43fb5ae50c53: [GlobalISel] Accept multiple vregs for lowerCall's args.

rovka mentioned this in rL364512: [GlobalISel] Accept multiple vregs for lowerCall's args.

Revision Contents

Path

Size

llvm/

trunk/

include/

llvm/

CodeGen/

GlobalISel/

CallLowering.h

6 lines

lib/

CodeGen/

GlobalISel/

IRTranslator.cpp

10 lines

Target/

AArch64/

AArch64CallLowering.h

4 lines

AArch64CallLowering.cpp

47 lines

AMDGPU/

AMDGPUCallLowering.h

4 lines

AMDGPUCallLowering.cpp

3 lines

ARM/

ARMCallLowering.h

7 lines

ARMCallLowering.cpp

35 lines

Mips/

MipsCallLowering.h

4 lines

MipsCallLowering.cpp

24 lines

X86/

X86CallLowering.h

4 lines

X86CallLowering.cpp

41 lines

test/

CodeGen/

AArch64/

GlobalISel/

call-translator.ll

16 lines

irtranslator-exceptions.ll

11 lines

ARM/

GlobalISel/

arm-param-lowering.ll

49 lines

X86/

GlobalISel/

x86_64-irtranslator-struct-return.ll

290 lines

Diff 158700

llvm/trunk/include/llvm/CodeGen/GlobalISel/CallLowering.h

Show First 20 Lines • Show All 132 Lines • ▼ Show 20 Lines	protected:
bool handleAssignments(MachineIRBuilder &MIRBuilder, ArrayRef<ArgInfo> Args,		bool handleAssignments(MachineIRBuilder &MIRBuilder, ArrayRef<ArgInfo> Args,
ValueHandler &Handler) const;		ValueHandler &Handler) const;

public:		public:
CallLowering(const TargetLowering *TLI) : TLI(TLI) {}		CallLowering(const TargetLowering *TLI) : TLI(TLI) {}
virtual ~CallLowering() = default;		virtual ~CallLowering() = default;

/// This hook must be implemented to lower outgoing return values, described		/// This hook must be implemented to lower outgoing return values, described
/// by \p Val, into the specified virtual register \p VReg.		/// by \p Val, into the specified virtual registers \p VRegs.
/// This hook is used by GlobalISel.		/// This hook is used by GlobalISel.
///		///
/// \return True if the lowering succeeds, false otherwise.		/// \return True if the lowering succeeds, false otherwise.
virtual bool lowerReturn(MachineIRBuilder &MIRBuilder,		virtual bool lowerReturn(MachineIRBuilder &MIRBuilder, const Value *Val,
const Value *Val, unsigned VReg) const {		ArrayRef<unsigned> VRegs) const {
return false;		return false;
}		}

/// This hook must be implemented to lower the incoming (formal)		/// This hook must be implemented to lower the incoming (formal)
/// arguments, described by \p Args, for GlobalISel. Each argument		/// arguments, described by \p Args, for GlobalISel. Each argument
/// must end up in the related virtual register described by VRegs.		/// must end up in the related virtual register described by VRegs.
/// In other words, the first argument should end up in VRegs[0],		/// In other words, the first argument should end up in VRegs[0],
/// the second in VRegs[1], and so on.		/// the second in VRegs[1], and so on.
▲ Show 20 Lines • Show All 61 Lines • Show Last 20 Lines

llvm/trunk/lib/CodeGen/GlobalISel/IRTranslator.cpp

Show First 20 Lines • Show All 317 Lines • ▼ Show 20 Lines	bool IRTranslator::translateCompare(const User &U,
return true;		return true;
}		}

bool IRTranslator::translateRet(const User &U, MachineIRBuilder &MIRBuilder) {		bool IRTranslator::translateRet(const User &U, MachineIRBuilder &MIRBuilder) {
const ReturnInst &RI = cast<ReturnInst>(U);		const ReturnInst &RI = cast<ReturnInst>(U);
const Value *Ret = RI.getReturnValue();		const Value *Ret = RI.getReturnValue();
if (Ret && DL->getTypeStoreSize(Ret->getType()) == 0)		if (Ret && DL->getTypeStoreSize(Ret->getType()) == 0)
Ret = nullptr;		Ret = nullptr;

		ArrayRef<unsigned> VRegs;
		if (Ret)
		VRegs = getOrCreateVRegs(*Ret);

// The target may mess up with the insertion point, but		// The target may mess up with the insertion point, but
// this is not important as a return is the last instruction		// this is not important as a return is the last instruction
// of the block anyway.		// of the block anyway.

// FIXME: this interface should simplify when CallLowering gets adapted to		return CLI->lowerReturn(MIRBuilder, Ret, VRegs);
// multiple VRegs per Value.
unsigned VReg = Ret ? packRegs(*Ret, MIRBuilder) : 0;
return CLI->lowerReturn(MIRBuilder, Ret, VReg);
}		}

bool IRTranslator::translateBr(const User &U, MachineIRBuilder &MIRBuilder) {		bool IRTranslator::translateBr(const User &U, MachineIRBuilder &MIRBuilder) {
const BranchInst &BrInst = cast<BranchInst>(U);		const BranchInst &BrInst = cast<BranchInst>(U);
unsigned Succ = 0;		unsigned Succ = 0;
if (!BrInst.isUnconditional()) {		if (!BrInst.isUnconditional()) {
// We want a G_BRCOND to the true BB followed by an unconditional branch.		// We want a G_BRCOND to the true BB followed by an unconditional branch.
unsigned Tst = getOrCreateVReg(*BrInst.getCondition());		unsigned Tst = getOrCreateVReg(*BrInst.getCondition());
▲ Show 20 Lines • Show All 1,340 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/AArch64/AArch64CallLowering.h

	Show All 28 Lines
	class MachineIRBuilder;			class MachineIRBuilder;
	class MachineRegisterInfo;			class MachineRegisterInfo;
	class Type;			class Type;

	class AArch64CallLowering: public CallLowering {			class AArch64CallLowering: public CallLowering {
	public:			public:
	AArch64CallLowering(const AArch64TargetLowering &TLI);			AArch64CallLowering(const AArch64TargetLowering &TLI);

	bool lowerReturn(MachineIRBuilder &MIRBuiler, const Value *Val,			bool lowerReturn(MachineIRBuilder &MIRBuilder, const Value *Val,
	unsigned VReg) const override;			ArrayRef<unsigned> VRegs) const override;

	bool lowerFormalArguments(MachineIRBuilder &MIRBuilder, const Function &F,			bool lowerFormalArguments(MachineIRBuilder &MIRBuilder, const Function &F,
	ArrayRef<unsigned> VRegs) const override;			ArrayRef<unsigned> VRegs) const override;

	bool lowerCall(MachineIRBuilder &MIRBuilder, CallingConv::ID CallConv,			bool lowerCall(MachineIRBuilder &MIRBuilder, CallingConv::ID CallConv,
	const MachineOperand &Callee, const ArgInfo &OrigRet,			const MachineOperand &Callee, const ArgInfo &OrigRet,
	ArrayRef<ArgInfo> OrigArgs) const override;			ArrayRef<ArgInfo> OrigArgs) const override;

	Show All 19 Lines

llvm/trunk/lib/Target/AArch64/AArch64CallLowering.cpp

Show First 20 Lines • Show All 221 Lines • ▼ Show 20 Lines	void AArch64CallLowering::splitToValueTypes(

SplitArgs.back().Flags.setInConsecutiveRegsLast();		SplitArgs.back().Flags.setInConsecutiveRegsLast();

for (unsigned i = 0; i < Offsets.size(); ++i)		for (unsigned i = 0; i < Offsets.size(); ++i)
PerformArgSplit(SplitArgs[FirstRegIdx + i].Reg, Offsets[i] * 8);		PerformArgSplit(SplitArgs[FirstRegIdx + i].Reg, Offsets[i] * 8);
}		}

bool AArch64CallLowering::lowerReturn(MachineIRBuilder &MIRBuilder,		bool AArch64CallLowering::lowerReturn(MachineIRBuilder &MIRBuilder,
const Value *Val, unsigned VReg) const {		const Value *Val,
		ArrayRef<unsigned> VRegs) const {
		auto MIB = MIRBuilder.buildInstrNoInsert(AArch64::RET_ReallyLR);
		assert(((Val && !VRegs.empty()) \|\| (!Val && VRegs.empty())) &&
		"Return value without a vreg");

		bool Success = true;
		if (!VRegs.empty()) {
MachineFunction &MF = MIRBuilder.getMF();		MachineFunction &MF = MIRBuilder.getMF();
const Function &F = MF.getFunction();		const Function &F = MF.getFunction();

auto MIB = MIRBuilder.buildInstrNoInsert(AArch64::RET_ReallyLR);
assert(((Val && VReg) \|\| (!Val && !VReg)) && "Return value without a vreg");
bool Success = true;
if (VReg) {
MachineRegisterInfo &MRI = MF.getRegInfo();		MachineRegisterInfo &MRI = MF.getRegInfo();

// We zero-extend i1s to i8.
if (MRI.getType(VReg).getSizeInBits() == 1)
VReg = MIRBuilder.buildZExt(LLT::scalar(8), VReg)->getOperand(0).getReg();

const AArch64TargetLowering &TLI = *getTLI<AArch64TargetLowering>();		const AArch64TargetLowering &TLI = *getTLI<AArch64TargetLowering>();
CCAssignFn *AssignFn = TLI.CCAssignFnForReturn(F.getCallingConv());		CCAssignFn *AssignFn = TLI.CCAssignFnForReturn(F.getCallingConv());
auto &DL = F.getParent()->getDataLayout();		auto &DL = F.getParent()->getDataLayout();
		LLVMContext &Ctx = Val->getType()->getContext();

ArgInfo OrigArg{VReg, Val->getType()};		SmallVector<EVT, 4> SplitEVTs;
setArgFlags(OrigArg, AttributeList::ReturnIndex, DL, F);		ComputeValueVTs(TLI, DL, Val->getType(), SplitEVTs);
		assert(VRegs.size() == SplitEVTs.size() &&
		"For each split Type there should be exactly one VReg.");

SmallVector<ArgInfo, 8> SplitArgs;		SmallVector<ArgInfo, 8> SplitArgs;
splitToValueTypes(OrigArg, SplitArgs, DL, MRI, F.getCallingConv(),		for (unsigned i = 0; i < SplitEVTs.size(); ++i) {
		// We zero-extend i1s to i8.
		unsigned CurVReg = VRegs[i];
		if (MRI.getType(VRegs[i]).getSizeInBits() == 1) {
		CurVReg = MIRBuilder.buildZExt(LLT::scalar(8), CurVReg)
		->getOperand(0)
		.getReg();
		}

		ArgInfo CurArgInfo = ArgInfo{CurVReg, SplitEVTs[i].getTypeForEVT(Ctx)};
		setArgFlags(CurArgInfo, AttributeList::ReturnIndex, DL, F);
		splitToValueTypes(CurArgInfo, SplitArgs, DL, MRI, F.getCallingConv(),
[&](unsigned Reg, uint64_t Offset) {		[&](unsigned Reg, uint64_t Offset) {
MIRBuilder.buildExtract(Reg, VReg, Offset);		MIRBuilder.buildExtract(Reg, CurVReg, Offset);
});		});
		}

OutgoingArgHandler Handler(MIRBuilder, MRI, MIB, AssignFn, AssignFn);		OutgoingArgHandler Handler(MIRBuilder, MRI, MIB, AssignFn, AssignFn);
Success = handleAssignments(MIRBuilder, SplitArgs, Handler);		Success = handleAssignments(MIRBuilder, SplitArgs, Handler);
}		}

MIRBuilder.insertInstr(MIB);		MIRBuilder.insertInstr(MIB);
return Success;		return Success;
}		}
▲ Show 20 Lines • Show All 154 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/AMDGPU/AMDGPUCallLowering.h

Show All 29 Lines	class AMDGPUCallLowering: public CallLowering {

void lowerParameter(MachineIRBuilder &MIRBuilder, Type *ParamTy,		void lowerParameter(MachineIRBuilder &MIRBuilder, Type *ParamTy,
uint64_t Offset, unsigned Align,		uint64_t Offset, unsigned Align,
unsigned DstReg) const;		unsigned DstReg) const;

public:		public:
AMDGPUCallLowering(const AMDGPUTargetLowering &TLI);		AMDGPUCallLowering(const AMDGPUTargetLowering &TLI);

bool lowerReturn(MachineIRBuilder &MIRBuiler, const Value *Val,		bool lowerReturn(MachineIRBuilder &MIRBuilder, const Value *Val,
unsigned VReg) const override;		ArrayRef<unsigned> VRegs) const override;
bool lowerFormalArguments(MachineIRBuilder &MIRBuilder, const Function &F,		bool lowerFormalArguments(MachineIRBuilder &MIRBuilder, const Function &F,
ArrayRef<unsigned> VRegs) const override;		ArrayRef<unsigned> VRegs) const override;
static CCAssignFn *CCAssignFnForCall(CallingConv::ID CC, bool IsVarArg);		static CCAssignFn *CCAssignFnForCall(CallingConv::ID CC, bool IsVarArg);
static CCAssignFn *CCAssignFnForReturn(CallingConv::ID CC, bool IsVarArg);		static CCAssignFn *CCAssignFnForReturn(CallingConv::ID CC, bool IsVarArg);
};		};
} // End of namespace llvm;		} // End of namespace llvm;
#endif		#endif

llvm/trunk/lib/Target/AMDGPU/AMDGPUCallLowering.cpp

	Show All 26 Lines

	using namespace llvm;			using namespace llvm;

	AMDGPUCallLowering::AMDGPUCallLowering(const AMDGPUTargetLowering &TLI)			AMDGPUCallLowering::AMDGPUCallLowering(const AMDGPUTargetLowering &TLI)
	: CallLowering(&TLI), AMDGPUASI(TLI.getAMDGPUAS()) {			: CallLowering(&TLI), AMDGPUASI(TLI.getAMDGPUAS()) {
	}			}

	bool AMDGPUCallLowering::lowerReturn(MachineIRBuilder &MIRBuilder,			bool AMDGPUCallLowering::lowerReturn(MachineIRBuilder &MIRBuilder,
	const Value *Val, unsigned VReg) const {			const Value *Val,
				ArrayRef<unsigned> VRegs) const {
	// FIXME: Add support for non-void returns.			// FIXME: Add support for non-void returns.
	if (Val)			if (Val)
	return false;			return false;

	MIRBuilder.buildInstr(AMDGPU::S_ENDPGM);			MIRBuilder.buildInstr(AMDGPU::S_ENDPGM);
	return true;			return true;
	}			}

	▲ Show 20 Lines • Show All 207 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/ARM/ARMCallLowering.h

	Show All 27 Lines
	class MachineInstrBuilder;			class MachineInstrBuilder;
	class MachineIRBuilder;			class MachineIRBuilder;
	class Value;			class Value;

	class ARMCallLowering : public CallLowering {			class ARMCallLowering : public CallLowering {
	public:			public:
	ARMCallLowering(const ARMTargetLowering &TLI);			ARMCallLowering(const ARMTargetLowering &TLI);

	bool lowerReturn(MachineIRBuilder &MIRBuiler, const Value *Val,			bool lowerReturn(MachineIRBuilder &MIRBuilder, const Value *Val,
	unsigned VReg) const override;			ArrayRef<unsigned> VRegs) const override;

	bool lowerFormalArguments(MachineIRBuilder &MIRBuilder, const Function &F,			bool lowerFormalArguments(MachineIRBuilder &MIRBuilder, const Function &F,
	ArrayRef<unsigned> VRegs) const override;			ArrayRef<unsigned> VRegs) const override;

	bool lowerCall(MachineIRBuilder &MIRBuilder, CallingConv::ID CallConv,			bool lowerCall(MachineIRBuilder &MIRBuilder, CallingConv::ID CallConv,
	const MachineOperand &Callee, const ArgInfo &OrigRet,			const MachineOperand &Callee, const ArgInfo &OrigRet,
	ArrayRef<ArgInfo> OrigArgs) const override;			ArrayRef<ArgInfo> OrigArgs) const override;

	private:			private:
	bool lowerReturnVal(MachineIRBuilder &MIRBuilder, const Value *Val,			bool lowerReturnVal(MachineIRBuilder &MIRBuilder, const Value *Val,
	unsigned VReg, MachineInstrBuilder &Ret) const;			ArrayRef<unsigned> VRegs,
				MachineInstrBuilder &Ret) const;

	using SplitArgTy = std::function<void(unsigned Reg, uint64_t Offset)>;			using SplitArgTy = std::function<void(unsigned Reg, uint64_t Offset)>;

	/// Split an argument into one or more arguments that the CC lowering can cope			/// Split an argument into one or more arguments that the CC lowering can cope
	/// with (e.g. replace pointers with integers).			/// with (e.g. replace pointers with integers).
	void splitToValueTypes(const ArgInfo &OrigArg,			void splitToValueTypes(const ArgInfo &OrigArg,
	SmallVectorImpl<ArgInfo> &SplitArgs,			SmallVectorImpl<ArgInfo> &SplitArgs,
	MachineFunction &MF,			MachineFunction &MF,
	const SplitArgTy &PerformArgSplit) const;			const SplitArgTy &PerformArgSplit) const;
	};			};

	} // end namespace llvm			} // end namespace llvm

	#endif // LLVM_LIB_TARGET_ARM_ARMCALLLOWERING_H			#endif // LLVM_LIB_TARGET_ARM_ARMCALLLOWERING_H

llvm/trunk/lib/Target/ARM/ARMCallLowering.cpp

Show First 20 Lines • Show All 231 Lines • ▼ Show 20 Lines	void ARMCallLowering::splitToValueTypes(

for (unsigned i = 0; i < Offsets.size(); ++i)		for (unsigned i = 0; i < Offsets.size(); ++i)
PerformArgSplit(SplitArgs[FirstRegIdx + i].Reg, Offsets[i] * 8);		PerformArgSplit(SplitArgs[FirstRegIdx + i].Reg, Offsets[i] * 8);
}		}

/// Lower the return value for the already existing \p Ret. This assumes that		/// Lower the return value for the already existing \p Ret. This assumes that
/// \p MIRBuilder's insertion point is correct.		/// \p MIRBuilder's insertion point is correct.
bool ARMCallLowering::lowerReturnVal(MachineIRBuilder &MIRBuilder,		bool ARMCallLowering::lowerReturnVal(MachineIRBuilder &MIRBuilder,
const Value *Val, unsigned VReg,		const Value *Val, ArrayRef<unsigned> VRegs,
MachineInstrBuilder &Ret) const {		MachineInstrBuilder &Ret) const {
if (!Val)		if (!Val)
// Nothing to do here.		// Nothing to do here.
return true;		return true;

auto &MF = MIRBuilder.getMF();		auto &MF = MIRBuilder.getMF();
const auto &F = MF.getFunction();		const auto &F = MF.getFunction();

auto DL = MF.getDataLayout();		auto DL = MF.getDataLayout();
auto &TLI = *getTLI<ARMTargetLowering>();		auto &TLI = *getTLI<ARMTargetLowering>();
if (!isSupportedType(DL, TLI, Val->getType()))		if (!isSupportedType(DL, TLI, Val->getType()))
return false;		return false;

		SmallVector<EVT, 4> SplitEVTs;
		ComputeValueVTs(TLI, DL, Val->getType(), SplitEVTs);
		assert(VRegs.size() == SplitEVTs.size() &&
		"For each split Type there should be exactly one VReg.");

SmallVector<ArgInfo, 4> SplitVTs;		SmallVector<ArgInfo, 4> SplitVTs;
SmallVector<unsigned, 4> Regs;		LLVMContext &Ctx = Val->getType()->getContext();
ArgInfo RetInfo(VReg, Val->getType());		for (unsigned i = 0; i < SplitEVTs.size(); ++i) {
setArgFlags(RetInfo, AttributeList::ReturnIndex, DL, F);		ArgInfo CurArgInfo(VRegs[i], SplitEVTs[i].getTypeForEVT(Ctx));
splitToValueTypes(RetInfo, SplitVTs, MF, [&](unsigned Reg, uint64_t Offset) {		setArgFlags(CurArgInfo, AttributeList::ReturnIndex, DL, F);
Regs.push_back(Reg);
});

		SmallVector<unsigned, 4> Regs;
		splitToValueTypes(
		CurArgInfo, SplitVTs, MF,
		[&](unsigned Reg, uint64_t Offset) { Regs.push_back(Reg); });
if (Regs.size() > 1)		if (Regs.size() > 1)
MIRBuilder.buildUnmerge(Regs, VReg);		MIRBuilder.buildUnmerge(Regs, VRegs[i]);
		}

CCAssignFn *AssignFn =		CCAssignFn *AssignFn =
TLI.CCAssignFnForReturn(F.getCallingConv(), F.isVarArg());		TLI.CCAssignFnForReturn(F.getCallingConv(), F.isVarArg());

OutgoingValueHandler RetHandler(MIRBuilder, MF.getRegInfo(), Ret, AssignFn);		OutgoingValueHandler RetHandler(MIRBuilder, MF.getRegInfo(), Ret, AssignFn);
return handleAssignments(MIRBuilder, SplitVTs, RetHandler);		return handleAssignments(MIRBuilder, SplitVTs, RetHandler);
}		}

bool ARMCallLowering::lowerReturn(MachineIRBuilder &MIRBuilder,		bool ARMCallLowering::lowerReturn(MachineIRBuilder &MIRBuilder,
const Value *Val, unsigned VReg) const {		const Value *Val,
assert(!Val == !VReg && "Return value without a vreg");		ArrayRef<unsigned> VRegs) const {
		assert(!Val == VRegs.empty() && "Return value without a vreg");

auto const &ST = MIRBuilder.getMF().getSubtarget<ARMSubtarget>();		auto const &ST = MIRBuilder.getMF().getSubtarget<ARMSubtarget>();
unsigned Opcode = ST.getReturnOpcode();		unsigned Opcode = ST.getReturnOpcode();
auto Ret = MIRBuilder.buildInstrNoInsert(Opcode).add(predOps(ARMCC::AL));		auto Ret = MIRBuilder.buildInstrNoInsert(Opcode).add(predOps(ARMCC::AL));

if (!lowerReturnVal(MIRBuilder, Val, VReg, Ret))		if (!lowerReturnVal(MIRBuilder, Val, VRegs, Ret))
return false;		return false;

MIRBuilder.insertInstr(Ret);		MIRBuilder.insertInstr(Ret);
return true;		return true;
}		}

namespace {		namespace {

▲ Show 20 Lines • Show All 305 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/Mips/MipsCallLowering.h

Show First 20 Lines • Show All 44 Lines • ▼ Show 20 Lines	private:

virtual void assignValueToAddress(unsigned ValVReg, unsigned Addr,		virtual void assignValueToAddress(unsigned ValVReg, unsigned Addr,
uint64_t Size,		uint64_t Size,
MachinePointerInfo &MPO) = 0;		MachinePointerInfo &MPO) = 0;
};		};

MipsCallLowering(const MipsTargetLowering &TLI);		MipsCallLowering(const MipsTargetLowering &TLI);

bool lowerReturn(MachineIRBuilder &MIRBuiler, const Value *Val,		bool lowerReturn(MachineIRBuilder &MIRBuilder, const Value *Val,
unsigned VReg) const override;		ArrayRef<unsigned> VRegs) const;

bool lowerFormalArguments(MachineIRBuilder &MIRBuilder, const Function &F,		bool lowerFormalArguments(MachineIRBuilder &MIRBuilder, const Function &F,
ArrayRef<unsigned> VRegs) const override;		ArrayRef<unsigned> VRegs) const override;

bool lowerCall(MachineIRBuilder &MIRBuilder, CallingConv::ID CallConv,		bool lowerCall(MachineIRBuilder &MIRBuilder, CallingConv::ID CallConv,
const MachineOperand &Callee, const ArgInfo &OrigRet,		const MachineOperand &Callee, const ArgInfo &OrigRet,
ArrayRef<ArgInfo> OrigArgs) const override;		ArrayRef<ArgInfo> OrigArgs) const override;

Show All 24 Lines

llvm/trunk/lib/Target/Mips/MipsCallLowering.cpp

Show All 10 Lines
/// This file implements the lowering of LLVM calls to machine code calls for		/// This file implements the lowering of LLVM calls to machine code calls for
/// GlobalISel.		/// GlobalISel.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "MipsCallLowering.h"		#include "MipsCallLowering.h"
#include "MipsCCState.h"		#include "MipsCCState.h"
#include "MipsTargetMachine.h"		#include "MipsTargetMachine.h"
		#include "llvm/CodeGen/Analysis.h"
#include "llvm/CodeGen/GlobalISel/MachineIRBuilder.h"		#include "llvm/CodeGen/GlobalISel/MachineIRBuilder.h"

using namespace llvm;		using namespace llvm;

MipsCallLowering::MipsCallLowering(const MipsTargetLowering &TLI)		MipsCallLowering::MipsCallLowering(const MipsTargetLowering &TLI)
: CallLowering(&TLI) {}		: CallLowering(&TLI) {}

bool MipsCallLowering::MipsHandler::assign(const CCValAssign &VA,		bool MipsCallLowering::MipsHandler::assign(const CCValAssign &VA,
▲ Show 20 Lines • Show All 160 Lines • ▼ Show 20 Lines	static bool isSupportedType(Type *T) {
if (T->isIntegerTy() && T->getScalarSizeInBits() == 32)		if (T->isIntegerTy() && T->getScalarSizeInBits() == 32)
return true;		return true;
if (T->isPointerTy())		if (T->isPointerTy())
return true;		return true;
return false;		return false;
}		}

bool MipsCallLowering::lowerReturn(MachineIRBuilder &MIRBuilder,		bool MipsCallLowering::lowerReturn(MachineIRBuilder &MIRBuilder,
const Value *Val, unsigned VReg) const {		const Value *Val,
		ArrayRef<unsigned> VRegs) const {

MachineInstrBuilder Ret = MIRBuilder.buildInstrNoInsert(Mips::RetRA);		MachineInstrBuilder Ret = MIRBuilder.buildInstrNoInsert(Mips::RetRA);

if (Val != nullptr) {		if (Val != nullptr && !isSupportedType(Val->getType()))
if (!isSupportedType(Val->getType()))
return false;		return false;

		if (!VRegs.empty()) {
MachineFunction &MF = MIRBuilder.getMF();		MachineFunction &MF = MIRBuilder.getMF();
const Function &F = MF.getFunction();		const Function &F = MF.getFunction();
const DataLayout &DL = MF.getDataLayout();		const DataLayout &DL = MF.getDataLayout();
const MipsTargetLowering &TLI = *getTLI<MipsTargetLowering>();		const MipsTargetLowering &TLI = *getTLI<MipsTargetLowering>();
		LLVMContext &Ctx = Val->getType()->getContext();

		SmallVector<EVT, 4> SplitEVTs;
		ComputeValueVTs(TLI, DL, Val->getType(), SplitEVTs);
		assert(VRegs.size() == SplitEVTs.size() &&
		"For each split Type there should be exactly one VReg.");

SmallVector<ArgInfo, 8> RetInfos;		SmallVector<ArgInfo, 8> RetInfos;
SmallVector<unsigned, 8> OrigArgIndices;		SmallVector<unsigned, 8> OrigArgIndices;

ArgInfo ArgRetInfo(VReg, Val->getType());		for (unsigned i = 0; i < SplitEVTs.size(); ++i) {
setArgFlags(ArgRetInfo, AttributeList::ReturnIndex, DL, F);		ArgInfo CurArgInfo = ArgInfo{VRegs[i], SplitEVTs[i].getTypeForEVT(Ctx)};
splitToValueTypes(ArgRetInfo, 0, RetInfos, OrigArgIndices);		setArgFlags(CurArgInfo, AttributeList::ReturnIndex, DL, F);
		splitToValueTypes(CurArgInfo, 0, RetInfos, OrigArgIndices);
		}

SmallVector<ISD::OutputArg, 8> Outs;		SmallVector<ISD::OutputArg, 8> Outs;
subTargetRegTypeForCallingConv(		subTargetRegTypeForCallingConv(
MIRBuilder, RetInfos, OrigArgIndices,		MIRBuilder, RetInfos, OrigArgIndices,
[&](ISD::ArgFlagsTy flags, EVT vt, EVT argvt, bool used,		[&](ISD::ArgFlagsTy flags, EVT vt, EVT argvt, bool used,
unsigned origIdx, unsigned partOffs) {		unsigned origIdx, unsigned partOffs) {
Outs.emplace_back(flags, vt, argvt, used, origIdx, partOffs);		Outs.emplace_back(flags, vt, argvt, used, origIdx, partOffs);
});		});
▲ Show 20 Lines • Show All 221 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/X86/X86CallLowering.h

	Show All 23 Lines
	class DataLayout;			class DataLayout;
	class MachineRegisterInfo;			class MachineRegisterInfo;
	class X86TargetLowering;			class X86TargetLowering;

	class X86CallLowering : public CallLowering {			class X86CallLowering : public CallLowering {
	public:			public:
	X86CallLowering(const X86TargetLowering &TLI);			X86CallLowering(const X86TargetLowering &TLI);

	bool lowerReturn(MachineIRBuilder &MIRBuiler, const Value *Val,			bool lowerReturn(MachineIRBuilder &MIRBuilder, const Value *Val,
	unsigned VReg) const override;			ArrayRef<unsigned> VRegs) const override;

	bool lowerFormalArguments(MachineIRBuilder &MIRBuilder, const Function &F,			bool lowerFormalArguments(MachineIRBuilder &MIRBuilder, const Function &F,
	ArrayRef<unsigned> VRegs) const override;			ArrayRef<unsigned> VRegs) const override;

	bool lowerCall(MachineIRBuilder &MIRBuilder, CallingConv::ID CallConv,			bool lowerCall(MachineIRBuilder &MIRBuilder, CallingConv::ID CallConv,
	const MachineOperand &Callee, const ArgInfo &OrigRet,			const MachineOperand &Callee, const ArgInfo &OrigRet,
	ArrayRef<ArgInfo> OrigArgs) const override;			ArrayRef<ArgInfo> OrigArgs) const override;

	Show All 13 Lines

llvm/trunk/lib/Target/X86/X86CallLowering.cpp

Show First 20 Lines • Show All 59 Lines • ▼ Show 20 Lines	bool X86CallLowering::splitToValueTypes(const ArgInfo &OrigArg,
SplitArgTy PerformArgSplit) const {		SplitArgTy PerformArgSplit) const {
const X86TargetLowering &TLI = *getTLI<X86TargetLowering>();		const X86TargetLowering &TLI = *getTLI<X86TargetLowering>();
LLVMContext &Context = OrigArg.Ty->getContext();		LLVMContext &Context = OrigArg.Ty->getContext();

SmallVector<EVT, 4> SplitVTs;		SmallVector<EVT, 4> SplitVTs;
SmallVector<uint64_t, 4> Offsets;		SmallVector<uint64_t, 4> Offsets;
ComputeValueVTs(TLI, DL, OrigArg.Ty, SplitVTs, &Offsets, 0);		ComputeValueVTs(TLI, DL, OrigArg.Ty, SplitVTs, &Offsets, 0);

if (SplitVTs.size() != 1) {		if (OrigArg.Ty->isVoidTy())
// TODO: support struct/array split		return true;
return false;
}

EVT VT = SplitVTs[0];		EVT VT = SplitVTs[0];
unsigned NumParts = TLI.getNumRegisters(Context, VT);		unsigned NumParts = TLI.getNumRegisters(Context, VT);

if (NumParts == 1) {		if (NumParts == 1) {
// replace the original type ( pointer -> GPR ).		// replace the original type ( pointer -> GPR ).
SplitArgs.emplace_back(OrigArg.Reg, VT.getTypeForEVT(Context),		SplitArgs.emplace_back(OrigArg.Reg, VT.getTypeForEVT(Context),
OrigArg.Flags, OrigArg.IsFixed);		OrigArg.Flags, OrigArg.IsFixed);
▲ Show 20 Lines • Show All 100 Lines • ▼ Show 20 Lines	protected:
uint64_t StackSize = 0;		uint64_t StackSize = 0;
const DataLayout &DL;		const DataLayout &DL;
const X86Subtarget &STI;		const X86Subtarget &STI;
unsigned NumXMMRegs = 0;		unsigned NumXMMRegs = 0;
};		};

} // end anonymous namespace		} // end anonymous namespace

bool X86CallLowering::lowerReturn(MachineIRBuilder &MIRBuilder,		bool X86CallLowering::lowerReturn(
const Value *Val, unsigned VReg) const {		MachineIRBuilder &MIRBuilder, const Value *Val,
assert(((Val && VReg) \|\| (!Val && !VReg)) && "Return value without a vreg");		ArrayRef<unsigned> VRegs) const {
		assert(((Val && !VRegs.empty()) \|\| (!Val && VRegs.empty())) &&
		"Return value without a vreg");
auto MIB = MIRBuilder.buildInstrNoInsert(X86::RET).addImm(0);		auto MIB = MIRBuilder.buildInstrNoInsert(X86::RET).addImm(0);

if (VReg) {		if (!VRegs.empty()) {
MachineFunction &MF = MIRBuilder.getMF();		MachineFunction &MF = MIRBuilder.getMF();
		const Function &F = MF.getFunction();
MachineRegisterInfo &MRI = MF.getRegInfo();		MachineRegisterInfo &MRI = MF.getRegInfo();
auto &DL = MF.getDataLayout();		auto &DL = MF.getDataLayout();
const Function &F = MF.getFunction();		LLVMContext &Ctx = Val->getType()->getContext();
		const X86TargetLowering &TLI = *getTLI<X86TargetLowering>();

ArgInfo OrigArg{VReg, Val->getType()};		SmallVector<EVT, 4> SplitEVTs;
setArgFlags(OrigArg, AttributeList::ReturnIndex, DL, F);		ComputeValueVTs(TLI, DL, Val->getType(), SplitEVTs);
		assert(VRegs.size() == SplitEVTs.size() &&
		"For each split Type there should be exactly one VReg.");

SmallVector<ArgInfo, 8> SplitArgs;		SmallVector<ArgInfo, 8> SplitArgs;
if (!splitToValueTypes(OrigArg, SplitArgs, DL, MRI,		for (unsigned i = 0; i < SplitEVTs.size(); ++i) {
		ArgInfo CurArgInfo = ArgInfo{VRegs[i], SplitEVTs[i].getTypeForEVT(Ctx)};
		setArgFlags(CurArgInfo, AttributeList::ReturnIndex, DL, F);
		if (!splitToValueTypes(CurArgInfo, SplitArgs, DL, MRI,
[&](ArrayRef<unsigned> Regs) {		[&](ArrayRef<unsigned> Regs) {
MIRBuilder.buildUnmerge(Regs, VReg);		MIRBuilder.buildUnmerge(Regs, VRegs[i]);
}))		}))
return false;		return false;
		}

OutgoingValueHandler Handler(MIRBuilder, MRI, MIB, RetCC_X86);		OutgoingValueHandler Handler(MIRBuilder, MRI, MIB, RetCC_X86);
if (!handleAssignments(MIRBuilder, SplitArgs, Handler))		if (!handleAssignments(MIRBuilder, SplitArgs, Handler))
return false;		return false;
}		}

MIRBuilder.insertInstr(MIB);		MIRBuilder.insertInstr(MIB);
return true;		return true;
▲ Show 20 Lines • Show All 259 Lines • Show Last 20 Lines

llvm/trunk/test/CodeGen/AArch64/GlobalISel/call-translator.ll

	Show First 20 Lines • Show All 91 Lines • ▼ Show 20 Lines

	; CHECK: [[LD1:%[0-9]+]]:_(s64) = G_LOAD [[ADDR]](p0) :: (load 8 from %ir.addr)			; CHECK: [[LD1:%[0-9]+]]:_(s64) = G_LOAD [[ADDR]](p0) :: (load 8 from %ir.addr)
	; CHECK: [[CST1:%[0-9]+]]:_(s64) = G_CONSTANT i64 8			; CHECK: [[CST1:%[0-9]+]]:_(s64) = G_CONSTANT i64 8
	; CHECK: [[GEP1:%[0-9]+]]:_(p0) = G_GEP [[ADDR]], [[CST1]](s64)			; CHECK: [[GEP1:%[0-9]+]]:_(p0) = G_GEP [[ADDR]], [[CST1]](s64)
	; CHECK: [[LD2:%[0-9]+]]:_(s64) = G_LOAD [[GEP1]](p0) :: (load 8 from %ir.addr + 8)			; CHECK: [[LD2:%[0-9]+]]:_(s64) = G_LOAD [[GEP1]](p0) :: (load 8 from %ir.addr + 8)
	; CHECK: [[CST2:%[0-9]+]]:_(s64) = G_CONSTANT i64 16			; CHECK: [[CST2:%[0-9]+]]:_(s64) = G_CONSTANT i64 16
	; CHECK: [[GEP2:%[0-9]+]]:_(p0) = G_GEP [[ADDR]], [[CST2]](s64)			; CHECK: [[GEP2:%[0-9]+]]:_(p0) = G_GEP [[ADDR]], [[CST2]](s64)
	; CHECK: [[LD3:%[0-9]+]]:_(s32) = G_LOAD [[GEP2]](p0) :: (load 4 from %ir.addr + 16, align 8)			; CHECK: [[LD3:%[0-9]+]]:_(s32) = G_LOAD [[GEP2]](p0) :: (load 4 from %ir.addr + 16, align 8)
	; CHECK: [[IMPDEF:%[0-9]+]]:_(s192) = G_IMPLICIT_DEF
	; CHECK: [[INS1:%[0-9]+]]:_(s192) = G_INSERT [[IMPDEF]], [[LD1]](s64), 0			; CHECK: $d0 = COPY [[LD1]](s64)
	; CHECK: [[INS2:%[0-9]+]]:_(s192) = G_INSERT [[INS1]], [[LD2]](s64), 64			; CHECK: $x0 = COPY [[LD2]](s64)
	; CHECK: [[VAL:%[0-9]+]]:_(s192) = G_INSERT [[INS2]], [[LD3]](s32), 128			; CHECK: $w1 = COPY [[LD3]](s32)

	; CHECK: [[DBL:%[0-9]+]]:_(s64) = G_EXTRACT [[VAL]](s192), 0
	; CHECK: [[I64:%[0-9]+]]:_(s64) = G_EXTRACT [[VAL]](s192), 64
	; CHECK: [[I32:%[0-9]+]]:_(s32) = G_EXTRACT [[VAL]](s192), 128

	; CHECK: $d0 = COPY [[DBL]](s64)
	; CHECK: $x0 = COPY [[I64]](s64)
	; CHECK: $w1 = COPY [[I32]](s32)
	; CHECK: RET_ReallyLR implicit $d0, implicit $x0, implicit $w1			; CHECK: RET_ReallyLR implicit $d0, implicit $x0, implicit $w1
	define {double, i64, i32} @test_struct_return({double, i64, i32}* %addr) {			define {double, i64, i32} @test_struct_return({double, i64, i32}* %addr) {
	%val = load {double, i64, i32}, {double, i64, i32}* %addr			%val = load {double, i64, i32}, {double, i64, i32}* %addr
	ret {double, i64, i32} %val			ret {double, i64, i32} %val
	}			}

	; CHECK-LABEL: name: test_arr_call			; CHECK-LABEL: name: test_arr_call
	; CHECK: hasCalls: true			; CHECK: hasCalls: true
	▲ Show 20 Lines • Show All 200 Lines • Show Last 20 Lines

llvm/trunk/test/CodeGen/AArch64/GlobalISel/irtranslator-exceptions.ll

	Show All 13 Lines
	; CHECK: $w0 = COPY			; CHECK: $w0 = COPY
	; CHECK: BL @foo, csr_aarch64_aapcs, implicit-def $lr, implicit $sp, implicit $w0, implicit-def $w0			; CHECK: BL @foo, csr_aarch64_aapcs, implicit-def $lr, implicit $sp, implicit $w0, implicit-def $w0
	; CHECK: {{%[0-9]+}}:_(s32) = COPY $w0			; CHECK: {{%[0-9]+}}:_(s32) = COPY $w0
	; CHECK: EH_LABEL			; CHECK: EH_LABEL
	; CHECK: G_BR %[[GOOD]]			; CHECK: G_BR %[[GOOD]]

	; CHECK: [[BAD]].{{[a-z]+}} (landing-pad):			; CHECK: [[BAD]].{{[a-z]+}} (landing-pad):
	; CHECK: EH_LABEL			; CHECK: EH_LABEL
	; CHECK: [[PTR:%[0-9]+]]:_(p0) = COPY $x0			; CHECK: [[PTR_RET:%[0-9]+]]:_(p0) = COPY $x0
	; CHECK: [[SEL_PTR:%[0-9]+]]:_(p0) = COPY $x1			; CHECK: [[SEL_PTR:%[0-9]+]]:_(p0) = COPY $x1
	; CHECK: [[SEL:%[0-9]+]]:_(s32) = G_PTRTOINT [[SEL_PTR]]			; CHECK: [[SEL_RET:%[0-9]+]]:_(s32) = G_PTRTOINT [[SEL_PTR]]
	; CHECK: [[UNDEF:%[0-9]+]]:_(s128) = G_IMPLICIT_DEF
	; CHECK: [[VAL_WITH_PTR:%[0-9]+]]:_(s128) = G_INSERT [[UNDEF]], [[PTR]](p0), 0
	; CHECK: [[PTR_SEL:%[0-9]+]]:_(s128) = G_INSERT [[VAL_WITH_PTR]], [[SEL]](s32), 64
	; CHECK: [[PTR_RET:%[0-9]+]]:_(s64) = G_EXTRACT [[PTR_SEL]](s128), 0
	; CHECK: [[SEL_RET:%[0-9]+]]:_(s32) = G_EXTRACT [[PTR_SEL]](s128), 64
	; CHECK: $x0 = COPY [[PTR_RET]]			; CHECK: $x0 = COPY [[PTR_RET]]
	; CHECK: $w1 = COPY [[SEL_RET]]			; CHECK: $w1 = COPY [[SEL_RET]]

	; CHECK: [[GOOD]].{{[a-z]+}}:			; CHECK: [[GOOD]].{{[a-z]+}}:
	; CHECK: [[SEL:%[0-9]+]]:_(s32) = G_CONSTANT i32 1			; CHECK: [[SEL:%[0-9]+]]:_(s32) = G_CONSTANT i32 1
	; CHECK: {{%[0-9]+}}:_(s128) = G_INSERT {{%[0-9]+}}, [[SEL]](s32), 64			; CHECK: $w1 = COPY [[SEL]]

	define { i8, i32 } @bar() personality i8 bitcast (i32 (...)* @__gxx_personality_v0 to i8*) {			define { i8, i32 } @bar() personality i8 bitcast (i32 (...)* @__gxx_personality_v0 to i8*) {
	%res32 = invoke i32 @foo(i32 42) to label %continue unwind label %broken			%res32 = invoke i32 @foo(i32 42) to label %continue unwind label %broken


	broken:			broken:
	%ptr.sel = landingpad { i8, i32 } catch i8 bitcast(i8** @_ZTIi to i8*)			%ptr.sel = landingpad { i8, i32 } catch i8 bitcast(i8** @_ZTIi to i8*)
	ret { i8*, i32 } %ptr.sel			ret { i8*, i32 } %ptr.sel
	▲ Show 20 Lines • Show All 53 Lines • Show Last 20 Lines

llvm/trunk/test/CodeGen/ARM/GlobalISel/arm-param-lowering.ll

	Show First 20 Lines • Show All 203 Lines • ▼ Show 20 Lines
	; CHECK: [[R0:%[0-9]+]]:_(s32) = COPY $r0			; CHECK: [[R0:%[0-9]+]]:_(s32) = COPY $r0
	; CHECK: [[R1:%[0-9]+]]:_(s32) = COPY $r1			; CHECK: [[R1:%[0-9]+]]:_(s32) = COPY $r1
	; CHECK: [[R2:%[0-9]+]]:_(s32) = COPY $r2			; CHECK: [[R2:%[0-9]+]]:_(s32) = COPY $r2
	; CHECK: [[RES_ARR:%[0-9]+]]:_(s96) = G_MERGE_VALUES [[R0]](s32), [[R1]](s32), [[R2]](s32)			; CHECK: [[RES_ARR:%[0-9]+]]:_(s96) = G_MERGE_VALUES [[R0]](s32), [[R1]](s32), [[R2]](s32)
	; CHECK: ADJCALLSTACKUP 0, 0, 14, $noreg, implicit-def $sp, implicit $sp			; CHECK: ADJCALLSTACKUP 0, 0, 14, $noreg, implicit-def $sp, implicit $sp
	; CHECK: [[EXT3:%[0-9]+]]:_(s32) = G_EXTRACT [[RES_ARR]](s96), 0			; CHECK: [[EXT3:%[0-9]+]]:_(s32) = G_EXTRACT [[RES_ARR]](s96), 0
	; CHECK: [[EXT4:%[0-9]+]]:_(s32) = G_EXTRACT [[RES_ARR]](s96), 32			; CHECK: [[EXT4:%[0-9]+]]:_(s32) = G_EXTRACT [[RES_ARR]](s96), 32
	; CHECK: [[EXT5:%[0-9]+]]:_(s32) = G_EXTRACT [[RES_ARR]](s96), 64			; CHECK: [[EXT5:%[0-9]+]]:_(s32) = G_EXTRACT [[RES_ARR]](s96), 64
	; CHECK: [[IMPDEF2:%[0-9]+]]:_(s96) = G_IMPLICIT_DEF
	; CHECK: [[INS3:%[0-9]+]]:_(s96) = G_INSERT [[IMPDEF2]], [[EXT3]](s32), 0
	; CHECK: [[INS4:%[0-9]+]]:_(s96) = G_INSERT [[INS3]], [[EXT4]](s32), 32
	; CHECK: [[INS5:%[0-9]+]]:_(s96) = G_INSERT [[INS4]], [[EXT5]](s32), 64
	; CHECK: [[R0:%[0-9]+]]:_(s32), [[R1:%[0-9]+]]:_(s32), [[R2:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[INS5]](s96)
	; FIXME: This doesn't seem correct with regard to the AAPCS docs (which say			; FIXME: This doesn't seem correct with regard to the AAPCS docs (which say
	; that composite types larger than 4 bytes should be passed through memory),			; that composite types larger than 4 bytes should be passed through memory),
	; but it's what DAGISel does. We should fix it in the common code for both.			; but it's what DAGISel does. We should fix it in the common code for both.
	; CHECK: $r0 = COPY [[R0]]			; CHECK: $r0 = COPY [[EXT3]]
	; CHECK: $r1 = COPY [[R1]]			; CHECK: $r1 = COPY [[EXT4]]
	; CHECK: $r2 = COPY [[R2]]			; CHECK: $r2 = COPY [[EXT5]]
	; CHECK: BX_RET 14, $noreg, implicit $r0, implicit $r1, implicit $r2			; CHECK: BX_RET 14, $noreg, implicit $r0, implicit $r1, implicit $r2
	entry:			entry:
	%r = notail call arm_aapcscc [3 x i32] @tiny_int_arrays_target([2 x i32] %arr)			%r = notail call arm_aapcscc [3 x i32] @tiny_int_arrays_target([2 x i32] %arr)
	ret [3 x i32] %r			ret [3 x i32] %r
	}			}

	declare arm_aapcscc void @multiple_int_arrays_target([2 x i32], [2 x i32])			declare arm_aapcscc void @multiple_int_arrays_target([2 x i32], [2 x i32])

	▲ Show 20 Lines • Show All 118 Lines • ▼ Show 20 Lines
	; CHECK: G_STORE [[ARR2]](s64), [[ARR2_ADDR]](p0){{.*}}store 8			; CHECK: G_STORE [[ARR2]](s64), [[ARR2_ADDR]](p0){{.*}}store 8
	; CHECK: BL @fp_arrays_aapcs_target, csr_aapcs, implicit-def $lr, implicit $sp, implicit $r0, implicit $r1, implicit $r2, implicit $r3, implicit-def $r0, implicit-def $r1			; CHECK: BL @fp_arrays_aapcs_target, csr_aapcs, implicit-def $lr, implicit $sp, implicit $r0, implicit $r1, implicit $r2, implicit $r3, implicit-def $r0, implicit-def $r1
	; CHECK: [[R0:%[0-9]+]]:_(s32) = COPY $r0			; CHECK: [[R0:%[0-9]+]]:_(s32) = COPY $r0
	; CHECK: [[R1:%[0-9]+]]:_(s32) = COPY $r1			; CHECK: [[R1:%[0-9]+]]:_(s32) = COPY $r1
	; CHECK: [[R_MERGED:%[0-9]+]]:_(s64) = G_MERGE_VALUES [[R0]](s32), [[R1]](s32)			; CHECK: [[R_MERGED:%[0-9]+]]:_(s64) = G_MERGE_VALUES [[R0]](s32), [[R1]](s32)
	; CHECK: ADJCALLSTACKUP 8, 0, 14, $noreg, implicit-def $sp, implicit $sp			; CHECK: ADJCALLSTACKUP 8, 0, 14, $noreg, implicit-def $sp, implicit $sp
	; CHECK: [[EXT4:%[0-9]+]]:_(s32) = G_EXTRACT [[R_MERGED]](s64), 0			; CHECK: [[EXT4:%[0-9]+]]:_(s32) = G_EXTRACT [[R_MERGED]](s64), 0
	; CHECK: [[EXT5:%[0-9]+]]:_(s32) = G_EXTRACT [[R_MERGED]](s64), 32			; CHECK: [[EXT5:%[0-9]+]]:_(s32) = G_EXTRACT [[R_MERGED]](s64), 32
	; CHECK: [[IMPDEF2:%[0-9]+]]:_(s64) = G_IMPLICIT_DEF			; CHECK: $r0 = COPY [[EXT4]]
	; CHECK: [[INS4:%[0-9]+]]:_(s64) = G_INSERT [[IMPDEF2]], [[EXT4]](s32), 0			; CHECK: $r1 = COPY [[EXT5]]
	; CHECK: [[INS5:%[0-9]+]]:_(s64) = G_INSERT [[INS4]], [[EXT5]](s32), 32
	; CHECK: [[R0:%[0-9]+]]:_(s32), [[R1:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[INS5]](s64)
	; CHECK: $r0 = COPY [[R0]]
	; CHECK: $r1 = COPY [[R1]]
	; CHECK: BX_RET 14, $noreg, implicit $r0, implicit $r1			; CHECK: BX_RET 14, $noreg, implicit $r0, implicit $r1
	entry:			entry:
	%r = notail call arm_aapcscc [2 x float] @fp_arrays_aapcs_target([3 x double] %arr)			%r = notail call arm_aapcscc [2 x float] @fp_arrays_aapcs_target([3 x double] %arr)
	ret [2 x float] %r			ret [2 x float] %r
	}			}

	declare arm_aapcs_vfpcc [4 x float] @fp_arrays_aapcs_vfp_target([3 x double], [3 x float], [4 x double])			declare arm_aapcs_vfpcc [4 x float] @fp_arrays_aapcs_vfp_target([3 x double], [3 x float], [4 x double])

	▲ Show 20 Lines • Show All 77 Lines • ▼ Show 20 Lines
	; CHECK: [[R2:%[0-9]+]]:_(s32) = COPY $s2			; CHECK: [[R2:%[0-9]+]]:_(s32) = COPY $s2
	; CHECK: [[R3:%[0-9]+]]:_(s32) = COPY $s3			; CHECK: [[R3:%[0-9]+]]:_(s32) = COPY $s3
	; CHECK: [[R_MERGED:%[0-9]+]]:_(s128) = G_MERGE_VALUES [[R0]](s32), [[R1]](s32), [[R2]](s32), [[R3]](s32)			; CHECK: [[R_MERGED:%[0-9]+]]:_(s128) = G_MERGE_VALUES [[R0]](s32), [[R1]](s32), [[R2]](s32), [[R3]](s32)
	; CHECK: ADJCALLSTACKUP 32, 0, 14, $noreg, implicit-def $sp, implicit $sp			; CHECK: ADJCALLSTACKUP 32, 0, 14, $noreg, implicit-def $sp, implicit $sp
	; CHECK: [[EXT11:%[0-9]+]]:_(s32) = G_EXTRACT [[R_MERGED]](s128), 0			; CHECK: [[EXT11:%[0-9]+]]:_(s32) = G_EXTRACT [[R_MERGED]](s128), 0
	; CHECK: [[EXT12:%[0-9]+]]:_(s32) = G_EXTRACT [[R_MERGED]](s128), 32			; CHECK: [[EXT12:%[0-9]+]]:_(s32) = G_EXTRACT [[R_MERGED]](s128), 32
	; CHECK: [[EXT13:%[0-9]+]]:_(s32) = G_EXTRACT [[R_MERGED]](s128), 64			; CHECK: [[EXT13:%[0-9]+]]:_(s32) = G_EXTRACT [[R_MERGED]](s128), 64
	; CHECK: [[EXT14:%[0-9]+]]:_(s32) = G_EXTRACT [[R_MERGED]](s128), 96			; CHECK: [[EXT14:%[0-9]+]]:_(s32) = G_EXTRACT [[R_MERGED]](s128), 96
	; CHECK: [[IMPDEF4:%[0-9]+]]:_(s128) = G_IMPLICIT_DEF			; CHECK: $s0 = COPY [[EXT11]]
	; CHECK: [[INS11:%[0-9]+]]:_(s128) = G_INSERT [[IMPDEF4]], [[EXT11]](s32), 0			; CHECK: $s1 = COPY [[EXT12]]
	; CHECK: [[INS12:%[0-9]+]]:_(s128) = G_INSERT [[INS11]], [[EXT12]](s32), 32			; CHECK: $s2 = COPY [[EXT13]]
	; CHECK: [[INS13:%[0-9]+]]:_(s128) = G_INSERT [[INS12]], [[EXT13]](s32), 64			; CHECK: $s3 = COPY [[EXT14]]
	; CHECK: [[INS14:%[0-9]+]]:_(s128) = G_INSERT [[INS13]], [[EXT14]](s32), 96
	; CHECK: [[R0:%[0-9]+]]:_(s32), [[R1:%[0-9]+]]:_(s32), [[R2:%[0-9]+]]:_(s32), [[R3:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[INS14]](s128)
	; CHECK: $s0 = COPY [[R0]]
	; CHECK: $s1 = COPY [[R1]]
	; CHECK: $s2 = COPY [[R2]]
	; CHECK: $s3 = COPY [[R3]]
	; CHECK: BX_RET 14, $noreg, implicit $s0, implicit $s1, implicit $s2, implicit $s3			; CHECK: BX_RET 14, $noreg, implicit $s0, implicit $s1, implicit $s2, implicit $s3
	entry:			entry:
	%r = notail call arm_aapcs_vfpcc [4 x float] @fp_arrays_aapcs_vfp_target([3 x double] %x, [3 x float] %y, [4 x double] %z)			%r = notail call arm_aapcs_vfpcc [4 x float] @fp_arrays_aapcs_vfp_target([3 x double] %x, [3 x float] %y, [4 x double] %z)
	ret [4 x float] %r			ret [4 x float] %r
	}			}

	declare arm_aapcscc [2 x i32*] @tough_arrays_target([6 x [4 x i32]] %arr)			declare arm_aapcscc [2 x i32*] @tough_arrays_target([6 x [4 x i32]] %arr)

	Show All 33 Lines
	; CHECK: G_STORE [[LAST_STACK_ELEMENT]](s32), [[LAST_STACK_ARG_ADDR]]{{.*}}store 4			; CHECK: G_STORE [[LAST_STACK_ELEMENT]](s32), [[LAST_STACK_ARG_ADDR]]{{.*}}store 4
	; CHECK: BL @tough_arrays_target, csr_aapcs, implicit-def $lr, implicit $sp, implicit $r0, implicit $r1, implicit $r2, implicit $r3, implicit-def $r0, implicit-def $r1			; CHECK: BL @tough_arrays_target, csr_aapcs, implicit-def $lr, implicit $sp, implicit $r0, implicit $r1, implicit $r2, implicit $r3, implicit-def $r0, implicit-def $r1
	; CHECK: [[R0:%[0-9]+]]:_(s32) = COPY $r0			; CHECK: [[R0:%[0-9]+]]:_(s32) = COPY $r0
	; CHECK: [[R1:%[0-9]+]]:_(s32) = COPY $r1			; CHECK: [[R1:%[0-9]+]]:_(s32) = COPY $r1
	; CHECK: [[RES_ARR:%[0-9]+]]:_(s64) = G_MERGE_VALUES [[R0]](s32), [[R1]](s32)			; CHECK: [[RES_ARR:%[0-9]+]]:_(s64) = G_MERGE_VALUES [[R0]](s32), [[R1]](s32)
	; CHECK: ADJCALLSTACKUP 80, 0, 14, $noreg, implicit-def $sp, implicit $sp			; CHECK: ADJCALLSTACKUP 80, 0, 14, $noreg, implicit-def $sp, implicit $sp
	; CHECK: [[EXT1:%[0-9]+]]:_(p0) = G_EXTRACT [[RES_ARR]](s64), 0			; CHECK: [[EXT1:%[0-9]+]]:_(p0) = G_EXTRACT [[RES_ARR]](s64), 0
	; CHECK: [[EXT2:%[0-9]+]]:_(p0) = G_EXTRACT [[RES_ARR]](s64), 32			; CHECK: [[EXT2:%[0-9]+]]:_(p0) = G_EXTRACT [[RES_ARR]](s64), 32
	; CHECK: [[IMPDEF:%[0-9]+]]:_(s64) = G_IMPLICIT_DEF			; CHECK: $r0 = COPY [[EXT1]]
	; CHECK: [[INS2:%[0-9]+]]:_(s64) = G_INSERT [[IMPDEF]], [[EXT1]](p0), 0			; CHECK: $r1 = COPY [[EXT2]]
	; CHECK: [[INS3:%[0-9]+]]:_(s64) = G_INSERT [[INS2]], [[EXT2]](p0), 32
	; CHECK: [[R0:%[0-9]+]]:_(s32), [[R1:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[INS3]](s64)
	; CHECK: $r0 = COPY [[R0]]
	; CHECK: $r1 = COPY [[R1]]
	; CHECK: BX_RET 14, $noreg, implicit $r0, implicit $r1			; CHECK: BX_RET 14, $noreg, implicit $r0, implicit $r1
	entry:			entry:
	%r = notail call arm_aapcscc [2 x i32*] @tough_arrays_target([6 x [4 x i32]] %arr)			%r = notail call arm_aapcscc [2 x i32*] @tough_arrays_target([6 x [4 x i32]] %arr)
	ret [2 x i32*] %r			ret [2 x i32*] %r
	}			}

	declare arm_aapcscc {i32, i32} @structs_target({i32, i32})			declare arm_aapcscc {i32, i32} @structs_target({i32, i32})

	Show All 14 Lines
	; CHECK-DAG: $r1 = COPY [[X1]](s32)			; CHECK-DAG: $r1 = COPY [[X1]](s32)
	; CHECK: BL @structs_target, csr_aapcs, implicit-def $lr, implicit $sp, implicit $r0, implicit $r1, implicit-def $r0, implicit-def $r1			; CHECK: BL @structs_target, csr_aapcs, implicit-def $lr, implicit $sp, implicit $r0, implicit $r1, implicit-def $r0, implicit-def $r1
	; CHECK: [[R0:%[0-9]+]]:_(s32) = COPY $r0			; CHECK: [[R0:%[0-9]+]]:_(s32) = COPY $r0
	; CHECK: [[R1:%[0-9]+]]:_(s32) = COPY $r1			; CHECK: [[R1:%[0-9]+]]:_(s32) = COPY $r1
	; CHECK: [[R:%[0-9]+]]:_(s64) = G_MERGE_VALUES [[R0]](s32), [[R1]](s32)			; CHECK: [[R:%[0-9]+]]:_(s64) = G_MERGE_VALUES [[R0]](s32), [[R1]](s32)
	; CHECK: ADJCALLSTACKUP 0, 0, 14, $noreg, implicit-def $sp, implicit $sp			; CHECK: ADJCALLSTACKUP 0, 0, 14, $noreg, implicit-def $sp, implicit $sp
	; CHECK: [[EXT3:%[0-9]+]]:_(s32) = G_EXTRACT [[R]](s64), 0			; CHECK: [[EXT3:%[0-9]+]]:_(s32) = G_EXTRACT [[R]](s64), 0
	; CHECK: [[EXT4:%[0-9]+]]:_(s32) = G_EXTRACT [[R]](s64), 32			; CHECK: [[EXT4:%[0-9]+]]:_(s32) = G_EXTRACT [[R]](s64), 32
	; CHECK: [[IMPDEF2:%[0-9]+]]:_(s64) = G_IMPLICIT_DEF			; CHECK: $r0 = COPY [[EXT3]](s32)
	; CHECK: [[INS3:%[0-9]+]]:_(s64) = G_INSERT [[IMPDEF2]], [[EXT3]](s32), 0			; CHECK: $r1 = COPY [[EXT4]](s32)
	; CHECK: [[INS4:%[0-9]+]]:_(s64) = G_INSERT [[INS3]], [[EXT4]](s32), 32
	; CHECK: [[R0:%[0-9]+]]:_(s32), [[R1:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[INS4]](s64)
	; CHECK: $r0 = COPY [[R0]](s32)
	; CHECK: $r1 = COPY [[R1]](s32)
	; CHECK: BX_RET 14, $noreg, implicit $r0, implicit $r1			; CHECK: BX_RET 14, $noreg, implicit $r0, implicit $r1
	%r = notail call arm_aapcscc {i32, i32} @structs_target({i32, i32} %x)			%r = notail call arm_aapcscc {i32, i32} @structs_target({i32, i32} %x)
	ret {i32, i32} %r			ret {i32, i32} %r
	}			}

llvm/trunk/test/CodeGen/X86/GlobalISel/x86_64-irtranslator-struct-return.ll

				; NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py
				; RUN: llc -mtriple=x86_64-linux-gnu -global-isel -stop-after=irtranslator < %s -o - \| FileCheck %s --check-prefix=ALL

				%struct.f1 = type { float }
				%struct.d1 = type { double }
				%struct.d2 = type { double, double }
				%struct.i1 = type { i32 }
				%struct.i2 = type { i32, i32 }
				%struct.i3 = type { i32, i32, i32 }
				%struct.i4 = type { i32, i32, i32, i32 }

				define float @test_return_f1(float %f.coerce) {
				; ALL-LABEL: name: test_return_f1
				; ALL: bb.1.entry:
				; ALL: liveins: $xmm0
				; ALL: [[COPY:%[0-9]+]]:_(s128) = COPY $xmm0
				; ALL: [[TRUNC:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s128)
				; ALL: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 4
				; ALL: [[FRAME_INDEX:%[0-9]+]]:_(p0) = G_FRAME_INDEX %stack.0.retval
				; ALL: [[FRAME_INDEX1:%[0-9]+]]:_(p0) = G_FRAME_INDEX %stack.1.f
				; ALL: G_STORE [[TRUNC]](s32), [[FRAME_INDEX1]](p0) :: (store 4 into %ir.coerce.dive2)
				; ALL: ADJCALLSTACKDOWN64 0, 0, 0, implicit-def $rsp, implicit-def $eflags, implicit-def $ssp, implicit $rsp, implicit $ssp
				; ALL: $rdi = COPY [[FRAME_INDEX]](p0)
				; ALL: $rsi = COPY [[FRAME_INDEX1]](p0)
				; ALL: $rdx = COPY [[C]](s64)
				; ALL: CALL64pcrel32 &memcpy, csr_64, implicit $rsp, implicit $ssp, implicit $rdi, implicit $rsi, implicit $rdx
				; ALL: ADJCALLSTACKUP64 0, 0, implicit-def $rsp, implicit-def $eflags, implicit-def $ssp, implicit $rsp, implicit $ssp
				; ALL: [[LOAD:%[0-9]+]]:_(s32) = G_LOAD [[FRAME_INDEX]](p0) :: (load 4 from %ir.coerce.dive13)
				; ALL: [[ANYEXT:%[0-9]+]]:_(s128) = G_ANYEXT [[LOAD]](s32)
				; ALL: $xmm0 = COPY [[ANYEXT]](s128)
				; ALL: RET 0, implicit $xmm0
				entry:
				%retval = alloca %struct.f1, align 4
				%f = alloca %struct.f1, align 4
				%coerce.dive = getelementptr inbounds %struct.f1, %struct.f1* %f, i32 0, i32 0
				store float %f.coerce, float* %coerce.dive, align 4
				%0 = bitcast %struct.f1* %retval to i8*
				%1 = bitcast %struct.f1* %f to i8*
				call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 4 %0, i8* align 4 %1, i64 4, i1 false)
				%coerce.dive1 = getelementptr inbounds %struct.f1, %struct.f1* %retval, i32 0, i32 0
				%2 = load float, float* %coerce.dive1, align 4
				ret float %2
				}

				declare void @llvm.memcpy.p0i8.p0i8.i64(i8* nocapture writeonly, i8* nocapture readonly, i64, i1) #1

				define double @test_return_d1(double %d.coerce) {
				; ALL-LABEL: name: test_return_d1
				; ALL: bb.1.entry:
				; ALL: liveins: $xmm0
				; ALL: [[COPY:%[0-9]+]]:_(s128) = COPY $xmm0
				; ALL: [[TRUNC:%[0-9]+]]:_(s64) = G_TRUNC [[COPY]](s128)
				; ALL: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 8
				; ALL: [[FRAME_INDEX:%[0-9]+]]:_(p0) = G_FRAME_INDEX %stack.0.retval
				; ALL: [[FRAME_INDEX1:%[0-9]+]]:_(p0) = G_FRAME_INDEX %stack.1.d
				; ALL: G_STORE [[TRUNC]](s64), [[FRAME_INDEX1]](p0) :: (store 8 into %ir.coerce.dive2)
				; ALL: ADJCALLSTACKDOWN64 0, 0, 0, implicit-def $rsp, implicit-def $eflags, implicit-def $ssp, implicit $rsp, implicit $ssp
				; ALL: $rdi = COPY [[FRAME_INDEX]](p0)
				; ALL: $rsi = COPY [[FRAME_INDEX1]](p0)
				; ALL: $rdx = COPY [[C]](s64)
				; ALL: CALL64pcrel32 &memcpy, csr_64, implicit $rsp, implicit $ssp, implicit $rdi, implicit $rsi, implicit $rdx
				; ALL: ADJCALLSTACKUP64 0, 0, implicit-def $rsp, implicit-def $eflags, implicit-def $ssp, implicit $rsp, implicit $ssp
				; ALL: [[LOAD:%[0-9]+]]:_(s64) = G_LOAD [[FRAME_INDEX]](p0) :: (load 8 from %ir.coerce.dive13)
				; ALL: [[ANYEXT:%[0-9]+]]:_(s128) = G_ANYEXT [[LOAD]](s64)
				; ALL: $xmm0 = COPY [[ANYEXT]](s128)
				; ALL: RET 0, implicit $xmm0
				entry:
				%retval = alloca %struct.d1, align 8
				%d = alloca %struct.d1, align 8
				%coerce.dive = getelementptr inbounds %struct.d1, %struct.d1* %d, i32 0, i32 0
				store double %d.coerce, double* %coerce.dive, align 8
				%0 = bitcast %struct.d1* %retval to i8*
				%1 = bitcast %struct.d1* %d to i8*
				call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 8 %0, i8* align 8 %1, i64 8, i1 false)
				%coerce.dive1 = getelementptr inbounds %struct.d1, %struct.d1* %retval, i32 0, i32 0
				%2 = load double, double* %coerce.dive1, align 8
				ret double %2
				}

				define { double, double } @test_return_d2(double %d.coerce0, double %d.coerce1) {
				; ALL-LABEL: name: test_return_d2
				; ALL: bb.1.entry:
				; ALL: liveins: $xmm0, $xmm1
				; ALL: [[COPY:%[0-9]+]]:_(s128) = COPY $xmm0
				; ALL: [[TRUNC:%[0-9]+]]:_(s64) = G_TRUNC [[COPY]](s128)
				; ALL: [[COPY1:%[0-9]+]]:_(s128) = COPY $xmm1
				; ALL: [[TRUNC1:%[0-9]+]]:_(s64) = G_TRUNC [[COPY1]](s128)
				; ALL: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 8
				; ALL: [[C1:%[0-9]+]]:_(s64) = G_CONSTANT i64 16
				; ALL: [[FRAME_INDEX:%[0-9]+]]:_(p0) = G_FRAME_INDEX %stack.0.retval
				; ALL: [[FRAME_INDEX1:%[0-9]+]]:_(p0) = G_FRAME_INDEX %stack.1.d
				; ALL: G_STORE [[TRUNC]](s64), [[FRAME_INDEX1]](p0) :: (store 8 into %ir.1)
				; ALL: [[GEP:%[0-9]+]]:_(p0) = G_GEP [[FRAME_INDEX1]], [[C]](s64)
				; ALL: G_STORE [[TRUNC1]](s64), [[GEP]](p0) :: (store 8 into %ir.2)
				; ALL: ADJCALLSTACKDOWN64 0, 0, 0, implicit-def $rsp, implicit-def $eflags, implicit-def $ssp, implicit $rsp, implicit $ssp
				; ALL: $rdi = COPY [[FRAME_INDEX]](p0)
				; ALL: $rsi = COPY [[FRAME_INDEX1]](p0)
				; ALL: $rdx = COPY [[C1]](s64)
				; ALL: CALL64pcrel32 &memcpy, csr_64, implicit $rsp, implicit $ssp, implicit $rdi, implicit $rsi, implicit $rdx
				; ALL: ADJCALLSTACKUP64 0, 0, implicit-def $rsp, implicit-def $eflags, implicit-def $ssp, implicit $rsp, implicit $ssp
				; ALL: [[LOAD:%[0-9]+]]:_(s64) = G_LOAD [[FRAME_INDEX]](p0) :: (load 8 from %ir.5)
				; ALL: [[C2:%[0-9]+]]:_(s64) = G_CONSTANT i64 8
				; ALL: [[GEP1:%[0-9]+]]:_(p0) = G_GEP [[FRAME_INDEX]], [[C2]](s64)
				; ALL: [[LOAD1:%[0-9]+]]:_(s64) = G_LOAD [[GEP1]](p0) :: (load 8 from %ir.5 + 8)
				; ALL: [[ANYEXT:%[0-9]+]]:_(s128) = G_ANYEXT [[LOAD]](s64)
				; ALL: $xmm0 = COPY [[ANYEXT]](s128)
				; ALL: [[ANYEXT1:%[0-9]+]]:_(s128) = G_ANYEXT [[LOAD1]](s64)
				; ALL: $xmm1 = COPY [[ANYEXT1]](s128)
				; ALL: RET 0, implicit $xmm0, implicit $xmm1
				entry:
				%retval = alloca %struct.d2, align 8
				%d = alloca %struct.d2, align 8
				%0 = bitcast %struct.d2* %d to { double, double }*
				%1 = getelementptr inbounds { double, double }, { double, double }* %0, i32 0, i32 0
				store double %d.coerce0, double* %1, align 8
				%2 = getelementptr inbounds { double, double }, { double, double }* %0, i32 0, i32 1
				store double %d.coerce1, double* %2, align 8
				%3 = bitcast %struct.d2* %retval to i8*
				%4 = bitcast %struct.d2* %d to i8*
				call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 8 %3, i8* align 8 %4, i64 16, i1 false)
				%5 = bitcast %struct.d2* %retval to { double, double }*
				%6 = load { double, double }, { double, double }* %5, align 8
				ret { double, double } %6
				}

				define i32 @test_return_i1(i32 %i.coerce) {
				; ALL-LABEL: name: test_return_i1
				; ALL: bb.1.entry:
				; ALL: liveins: $edi
				; ALL: [[COPY:%[0-9]+]]:_(s32) = COPY $edi
				; ALL: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 4
				; ALL: [[FRAME_INDEX:%[0-9]+]]:_(p0) = G_FRAME_INDEX %stack.0.retval
				; ALL: [[FRAME_INDEX1:%[0-9]+]]:_(p0) = G_FRAME_INDEX %stack.1.i
				; ALL: G_STORE [[COPY]](s32), [[FRAME_INDEX1]](p0) :: (store 4 into %ir.coerce.dive2)
				; ALL: ADJCALLSTACKDOWN64 0, 0, 0, implicit-def $rsp, implicit-def $eflags, implicit-def $ssp, implicit $rsp, implicit $ssp
				; ALL: $rdi = COPY [[FRAME_INDEX]](p0)
				; ALL: $rsi = COPY [[FRAME_INDEX1]](p0)
				; ALL: $rdx = COPY [[C]](s64)
				; ALL: CALL64pcrel32 &memcpy, csr_64, implicit $rsp, implicit $ssp, implicit $rdi, implicit $rsi, implicit $rdx
				; ALL: ADJCALLSTACKUP64 0, 0, implicit-def $rsp, implicit-def $eflags, implicit-def $ssp, implicit $rsp, implicit $ssp
				; ALL: [[LOAD:%[0-9]+]]:_(s32) = G_LOAD [[FRAME_INDEX]](p0) :: (load 4 from %ir.coerce.dive13)
				; ALL: $eax = COPY [[LOAD]](s32)
				; ALL: RET 0, implicit $eax
				entry:
				%retval = alloca %struct.i1, align 4
				%i = alloca %struct.i1, align 4
				%coerce.dive = getelementptr inbounds %struct.i1, %struct.i1* %i, i32 0, i32 0
				store i32 %i.coerce, i32* %coerce.dive, align 4
				%0 = bitcast %struct.i1* %retval to i8*
				%1 = bitcast %struct.i1* %i to i8*
				call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 4 %0, i8* align 4 %1, i64 4, i1 false)
				%coerce.dive1 = getelementptr inbounds %struct.i1, %struct.i1* %retval, i32 0, i32 0
				%2 = load i32, i32* %coerce.dive1, align 4
				ret i32 %2
				}

				define i64 @test_return_i2(i64 %i.coerce) {
				; ALL-LABEL: name: test_return_i2
				; ALL: bb.1.entry:
				; ALL: liveins: $rdi
				; ALL: [[COPY:%[0-9]+]]:_(s64) = COPY $rdi
				; ALL: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 8
				; ALL: [[FRAME_INDEX:%[0-9]+]]:_(p0) = G_FRAME_INDEX %stack.0.retval
				; ALL: [[FRAME_INDEX1:%[0-9]+]]:_(p0) = G_FRAME_INDEX %stack.1.i
				; ALL: G_STORE [[COPY]](s64), [[FRAME_INDEX1]](p0) :: (store 8 into %ir.0, align 4)
				; ALL: ADJCALLSTACKDOWN64 0, 0, 0, implicit-def $rsp, implicit-def $eflags, implicit-def $ssp, implicit $rsp, implicit $ssp
				; ALL: $rdi = COPY [[FRAME_INDEX]](p0)
				; ALL: $rsi = COPY [[FRAME_INDEX1]](p0)
				; ALL: $rdx = COPY [[C]](s64)
				; ALL: CALL64pcrel32 &memcpy, csr_64, implicit $rsp, implicit $ssp, implicit $rdi, implicit $rsi, implicit $rdx
				; ALL: ADJCALLSTACKUP64 0, 0, implicit-def $rsp, implicit-def $eflags, implicit-def $ssp, implicit $rsp, implicit $ssp
				; ALL: [[LOAD:%[0-9]+]]:_(s64) = G_LOAD [[FRAME_INDEX]](p0) :: (load 8 from %ir.3, align 4)
				; ALL: $rax = COPY [[LOAD]](s64)
				; ALL: RET 0, implicit $rax
				entry:
				%retval = alloca %struct.i2, align 4
				%i = alloca %struct.i2, align 4
				%0 = bitcast %struct.i2* %i to i64*
				store i64 %i.coerce, i64* %0, align 4
				%1 = bitcast %struct.i2* %retval to i8*
				%2 = bitcast %struct.i2* %i to i8*
				call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 4 %1, i8* align 4 %2, i64 8, i1 false)
				%3 = bitcast %struct.i2* %retval to i64*
				%4 = load i64, i64* %3, align 4
				ret i64 %4
				}

				define { i64, i32 } @test_return_i3(i64 %i.coerce0, i32 %i.coerce1) {
				; ALL-LABEL: name: test_return_i3
				; ALL: bb.1.entry:
				; ALL: liveins: $esi, $rdi
				; ALL: [[COPY:%[0-9]+]]:_(s64) = COPY $rdi
				; ALL: [[COPY1:%[0-9]+]]:_(s32) = COPY $esi
				; ALL: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 8
				; ALL: [[C1:%[0-9]+]]:_(s64) = G_CONSTANT i64 12
				; ALL: [[FRAME_INDEX:%[0-9]+]]:_(p0) = G_FRAME_INDEX %stack.0.retval
				; ALL: [[FRAME_INDEX1:%[0-9]+]]:_(p0) = G_FRAME_INDEX %stack.1.i
				; ALL: [[FRAME_INDEX2:%[0-9]+]]:_(p0) = G_FRAME_INDEX %stack.2.coerce
				; ALL: [[FRAME_INDEX3:%[0-9]+]]:_(p0) = G_FRAME_INDEX %stack.3.tmp
				; ALL: G_STORE [[COPY]](s64), [[FRAME_INDEX2]](p0) :: (store 8 into %ir.0, align 4)
				; ALL: [[GEP:%[0-9]+]]:_(p0) = G_GEP [[FRAME_INDEX2]], [[C]](s64)
				; ALL: G_STORE [[COPY1]](s32), [[GEP]](p0) :: (store 4 into %ir.1)
				; ALL: ADJCALLSTACKDOWN64 0, 0, 0, implicit-def $rsp, implicit-def $eflags, implicit-def $ssp, implicit $rsp, implicit $ssp
				; ALL: $rdi = COPY [[FRAME_INDEX1]](p0)
				; ALL: $rsi = COPY [[FRAME_INDEX2]](p0)
				; ALL: $rdx = COPY [[C1]](s64)
				; ALL: CALL64pcrel32 &memcpy, csr_64, implicit $rsp, implicit $ssp, implicit $rdi, implicit $rsi, implicit $rdx
				; ALL: ADJCALLSTACKUP64 0, 0, implicit-def $rsp, implicit-def $eflags, implicit-def $ssp, implicit $rsp, implicit $ssp
				; ALL: ADJCALLSTACKDOWN64 0, 0, 0, implicit-def $rsp, implicit-def $eflags, implicit-def $ssp, implicit $rsp, implicit $ssp
				; ALL: $rdi = COPY [[FRAME_INDEX]](p0)
				; ALL: $rsi = COPY [[FRAME_INDEX1]](p0)
				; ALL: $rdx = COPY [[C1]](s64)
				; ALL: CALL64pcrel32 &memcpy, csr_64, implicit $rsp, implicit $ssp, implicit $rdi, implicit $rsi, implicit $rdx
				; ALL: ADJCALLSTACKUP64 0, 0, implicit-def $rsp, implicit-def $eflags, implicit-def $ssp, implicit $rsp, implicit $ssp
				; ALL: ADJCALLSTACKDOWN64 0, 0, 0, implicit-def $rsp, implicit-def $eflags, implicit-def $ssp, implicit $rsp, implicit $ssp
				; ALL: $rdi = COPY [[FRAME_INDEX3]](p0)
				; ALL: $rsi = COPY [[FRAME_INDEX]](p0)
				; ALL: $rdx = COPY [[C1]](s64)
				; ALL: CALL64pcrel32 &memcpy, csr_64, implicit $rsp, implicit $ssp, implicit $rdi, implicit $rsi, implicit $rdx
				; ALL: ADJCALLSTACKUP64 0, 0, implicit-def $rsp, implicit-def $eflags, implicit-def $ssp, implicit $rsp, implicit $ssp
				; ALL: [[LOAD:%[0-9]+]]:_(s64) = G_LOAD [[FRAME_INDEX3]](p0) :: (load 8 from %ir.tmp)
				; ALL: [[C2:%[0-9]+]]:_(s64) = G_CONSTANT i64 8
				; ALL: [[GEP1:%[0-9]+]]:_(p0) = G_GEP [[FRAME_INDEX3]], [[C2]](s64)
				; ALL: [[LOAD1:%[0-9]+]]:_(s32) = G_LOAD [[GEP1]](p0) :: (load 4 from %ir.tmp + 8, align 8)
				; ALL: $rax = COPY [[LOAD]](s64)
				; ALL: $edx = COPY [[LOAD1]](s32)
				; ALL: RET 0, implicit $rax, implicit $edx
				entry:
				%retval = alloca %struct.i3, align 4
				%i = alloca %struct.i3, align 4
				%coerce = alloca { i64, i32 }, align 4
				%tmp = alloca { i64, i32 }, align 8
				%0 = getelementptr inbounds { i64, i32 }, { i64, i32 }* %coerce, i32 0, i32 0
				store i64 %i.coerce0, i64* %0, align 4
				%1 = getelementptr inbounds { i64, i32 }, { i64, i32 }* %coerce, i32 0, i32 1
				store i32 %i.coerce1, i32* %1, align 4
				%2 = bitcast %struct.i3* %i to i8*
				%3 = bitcast { i64, i32 }* %coerce to i8*
				call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 4 %2, i8* align 4 %3, i64 12, i1 false)
				%4 = bitcast %struct.i3* %retval to i8*
				%5 = bitcast %struct.i3* %i to i8*
				call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 4 %4, i8* align 4 %5, i64 12, i1 false)
				%6 = bitcast { i64, i32 }* %tmp to i8*
				%7 = bitcast %struct.i3* %retval to i8*
				call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 8 %6, i8* align 4 %7, i64 12, i1 false)
				%8 = load { i64, i32 }, { i64, i32 }* %tmp, align 8
				ret { i64, i32 } %8
				}

				define { i64, i64 } @test_return_i4(i64 %i.coerce0, i64 %i.coerce1) {
				; ALL-LABEL: name: test_return_i4
				; ALL: bb.1.entry:
				; ALL: liveins: $rdi, $rsi
				; ALL: [[COPY:%[0-9]+]]:_(s64) = COPY $rdi
				; ALL: [[COPY1:%[0-9]+]]:_(s64) = COPY $rsi
				; ALL: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 8
				; ALL: [[C1:%[0-9]+]]:_(s64) = G_CONSTANT i64 16
				; ALL: [[FRAME_INDEX:%[0-9]+]]:_(p0) = G_FRAME_INDEX %stack.0.retval
				; ALL: [[FRAME_INDEX1:%[0-9]+]]:_(p0) = G_FRAME_INDEX %stack.1.i
				; ALL: G_STORE [[COPY]](s64), [[FRAME_INDEX1]](p0) :: (store 8 into %ir.1, align 4)
				; ALL: [[GEP:%[0-9]+]]:_(p0) = G_GEP [[FRAME_INDEX1]], [[C]](s64)
				; ALL: G_STORE [[COPY1]](s64), [[GEP]](p0) :: (store 8 into %ir.2, align 4)
				; ALL: ADJCALLSTACKDOWN64 0, 0, 0, implicit-def $rsp, implicit-def $eflags, implicit-def $ssp, implicit $rsp, implicit $ssp
				; ALL: $rdi = COPY [[FRAME_INDEX]](p0)
				; ALL: $rsi = COPY [[FRAME_INDEX1]](p0)
				; ALL: $rdx = COPY [[C1]](s64)
				; ALL: CALL64pcrel32 &memcpy, csr_64, implicit $rsp, implicit $ssp, implicit $rdi, implicit $rsi, implicit $rdx
				; ALL: ADJCALLSTACKUP64 0, 0, implicit-def $rsp, implicit-def $eflags, implicit-def $ssp, implicit $rsp, implicit $ssp
				; ALL: [[LOAD:%[0-9]+]]:_(s64) = G_LOAD [[FRAME_INDEX]](p0) :: (load 8 from %ir.5, align 4)
				; ALL: [[C2:%[0-9]+]]:_(s64) = G_CONSTANT i64 8
				; ALL: [[GEP1:%[0-9]+]]:_(p0) = G_GEP [[FRAME_INDEX]], [[C2]](s64)
				; ALL: [[LOAD1:%[0-9]+]]:_(s64) = G_LOAD [[GEP1]](p0) :: (load 8 from %ir.5 + 8, align 4)
				; ALL: $rax = COPY [[LOAD]](s64)
				; ALL: $rdx = COPY [[LOAD1]](s64)
				; ALL: RET 0, implicit $rax, implicit $rdx
				entry:
				%retval = alloca %struct.i4, align 4
				%i = alloca %struct.i4, align 4
				%0 = bitcast %struct.i4* %i to { i64, i64 }*
				%1 = getelementptr inbounds { i64, i64 }, { i64, i64 }* %0, i32 0, i32 0
				store i64 %i.coerce0, i64* %1, align 4
				%2 = getelementptr inbounds { i64, i64 }, { i64, i64 }* %0, i32 0, i32 1
				store i64 %i.coerce1, i64* %2, align 4
				%3 = bitcast %struct.i4* %retval to i8*
				%4 = bitcast %struct.i4* %i to i8*
				call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 4 %3, i8* align 4 %4, i64 16, i1 false)
				%5 = bitcast %struct.i4* %retval to { i64, i64 }*
				%6 = load { i64, i64 }, { i64, i64 }* %5, align 4
				ret { i64, i64 } %6
				}

This is an archive of the discontinued LLVM Phabricator instance.

[GlobalISel] Rewrite CallLowering::lowerReturn to accept multiple VRegs per ValueClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 158700

llvm/trunk/include/llvm/CodeGen/GlobalISel/CallLowering.h

llvm/trunk/lib/CodeGen/GlobalISel/IRTranslator.cpp

llvm/trunk/lib/Target/AArch64/AArch64CallLowering.h

llvm/trunk/lib/Target/AArch64/AArch64CallLowering.cpp

llvm/trunk/lib/Target/AMDGPU/AMDGPUCallLowering.h

llvm/trunk/lib/Target/AMDGPU/AMDGPUCallLowering.cpp

llvm/trunk/lib/Target/ARM/ARMCallLowering.h

llvm/trunk/lib/Target/ARM/ARMCallLowering.cpp

llvm/trunk/lib/Target/Mips/MipsCallLowering.h

llvm/trunk/lib/Target/Mips/MipsCallLowering.cpp

llvm/trunk/lib/Target/X86/X86CallLowering.h

llvm/trunk/lib/Target/X86/X86CallLowering.cpp

llvm/trunk/test/CodeGen/AArch64/GlobalISel/call-translator.ll

llvm/trunk/test/CodeGen/AArch64/GlobalISel/irtranslator-exceptions.ll

llvm/trunk/test/CodeGen/ARM/GlobalISel/arm-param-lowering.ll

llvm/trunk/test/CodeGen/X86/GlobalISel/x86_64-irtranslator-struct-return.ll

[GlobalISel] Rewrite CallLowering::lowerReturn to accept multiple VRegs per Value
ClosedPublic