This is an archive of the discontinued LLVM Phabricator instance.

Differential D122381

[NVPTX] Remove code duplication in LowerCall
Closed · Public

Authored by kovdan01 on Mar 24 2022, 4:17 AM.

Details

Reviewers

tra
jholewinski
jlebar

Commits

rG5bf86d9e88fa: [NVPTX] Remove code duplication in LowerCall

Summary

In D120129 we enhanced vectorization options of byval parameters. This patch
removes code duplication when handling byval and non-byval cases.
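
The kind of deduplication described here can be illustrated outside LLVM. The following is a minimal sketch, not the patch itself: `ArgInfo`, `ParamDecl`, and `declareParam` are hypothetical names, and plain integers stand in for LLVM's alignment type. It assumes the only per-case differences are how the alignment and size are chosen, so a single code path can serve both byval and non-byval arguments.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for the per-argument facts the lowering inspects.
struct ArgInfo {
  bool IsByVal;
  std::uint64_t ByValAlign;   // used only when IsByVal is true
  std::uint64_t ByValSize;    // used only when IsByVal is true
  std::uint64_t DefaultAlign; // what the non-byval alignment helper would return
  std::uint64_t AllocSize;    // allocation size of the non-byval type
};

// Hypothetical result: the alignment and size of the declared parameter.
struct ParamDecl {
  std::uint64_t Align;
  std::uint64_t Size;
};

// One code path for both kinds of argument; only the inputs differ.
ParamDecl declareParam(const ArgInfo &A, std::uint64_t OptimizedAlign) {
  std::uint64_t Align = A.DefaultAlign;
  if (A.IsByVal) {
    assert(A.ByValAlign != 0 && "byval alignment is expected to be set");
    // Take the largest of the byval alignment, the optimized alignment,
    // and the minimum of 4 discussed in the review.
    Align = std::max({A.ByValAlign, OptimizedAlign, std::uint64_t{4}});
  }
  const std::uint64_t Size = A.IsByVal ? A.ByValSize : A.AllocSize;
  return {Align, Size};
}
```

With these assumptions, a byval argument with alignment 1 and size 12 is declared with alignment 4 and size 12, while a non-byval argument keeps whatever its default alignment helper reports.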

Diff Detail

Unit Tests: Failed

Time	Test
60,040 ms	x64 debian > libFuzzer.libFuzzer::large.test

Event Timeline

kovdan01 created this revision. Mar 24 2022, 4:17 AM

Herald added a project: Restricted Project. Mar 24 2022, 4:17 AM

Herald added subscribers: asavonic, hiraditya.

kovdan01 requested review of this revision. Mar 24 2022, 4:17 AM

Herald added a subscriber: llvm-commits.

Harbormaster completed remote builds in B156025: Diff 417879. Mar 24 2022, 5:07 AM

tra: Nice. LGTM with a few style nits.

llvm/lib/Target/NVPTX/NVPTXISelLowering.cpp
1476	alway->always
1477	"whether it's naturally aligned or not."
1479–1490	This all could be folded:

ArgAlign = std::max(
    std::max(Outs[OIdx].Flags.getNonZeroByValAlign(),
             getFunctionParamOptimizedAlign(CB->getCalledFunction(), ETy, DL)),
    4
);

It could be further extended to incorporate the `if (IsByVal)` -> `IsByVal ? ...`. I think calculating ArgAlign in one place is a bit easier to follow than a series of conditional adjustments.

This revision is now accepted and ready to land. Mar 24 2022, 10:57 AM

kovdan01 updated this revision to Diff 418163. Mar 25 2022, 2:34 AM

kovdan01 added inline comments.

llvm/lib/Target/NVPTX/NVPTXISelLowering.cpp

1479–1490

Creating a one-liner here does not look very good to me. After clang-format we get the following code:

Align ArgAlign =
        IsByVal
            ? std::max(std::max(
                           // The ByValAlign in the Outs[OIdx].Flags is always
                           // set at this point, so we don't need to worry
                           // whether it's naturally aligned or not. See
                           // TargetLowering::LowerCallTo().
                           Outs[OIdx].Flags.getNonZeroByValAlign(),
                           // Try to increase alignment to enhance vectorization
                           // options.
                           getFunctionParamOptimizedAlign(
                               CB->getCalledFunction(), ETy, DL)),
                       // Enforce minumum alignment of 4 to work around ptxas
                       // miscompile for sm_50+. See corresponding alignment
                       // adjustment in emitFunctionParamList() for details.
                       Align(4))
            : getArgumentAlignment(Callee, CB, Ty, ParamCount + 1, DL);

I just removed some intermediate variables and used temporary values instead. Looks clear enough IMHO, and there is no problem in reading code like

// ...
ArgAlign = ...;
// ...
ArgAlign = std::max(ArgAlign, ...);
// ...
ArgAlign = std::max(ArgAlign, ...);
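
Both shapes discussed in this thread compute the same value; the disagreement is only about readability. A minimal sketch of the two forms, using plain integers instead of LLVM's alignment type (the function names `foldedAlign` and `stepwiseAlign` are hypothetical, introduced only for illustration):

```cpp
#include <algorithm>
#include <cstdint>

// Folded form: a single nested expression, as suggested in the review.
std::uint64_t foldedAlign(std::uint64_t ByValAlign, std::uint64_t OptAlign) {
  return std::max(std::max(ByValAlign, OptAlign), std::uint64_t{4});
}

// Stepwise form: a series of adjustments, as kept by the author.
std::uint64_t stepwiseAlign(std::uint64_t ByValAlign, std::uint64_t OptAlign) {
  std::uint64_t ArgAlign = ByValAlign;
  ArgAlign = std::max(ArgAlign, OptAlign);         // try to increase alignment
  ArgAlign = std::max(ArgAlign, std::uint64_t{4}); // enforce a minimum of 4
  return ArgAlign;
}
```

Note that `std::max` deduces a single type for both arguments, so the minimum is spelled with an explicit type here rather than as a bare `4`; the clang-formatted snippet above does the same with `Align(4)`.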

kovdan01 marked 3 inline comments as done. Mar 25 2022, 2:35 AM

This revision was landed with ongoing or failed builds. Mar 25 2022, 2:43 AM

Closed by commit rG5bf86d9e88fa: [NVPTX] Remove code duplication in LowerCall (authored by kovdan01).

This revision was automatically updated to reflect the committed changes.

kovdan01 added a commit: rG5bf86d9e88fa: [NVPTX] Remove code duplication in LowerCall.

Harbormaster completed remote builds in B156240: Diff 418163. Mar 25 2022, 3:25 AM

kovdan01 mentioned this in D120129: [NVPTX] Enhance vectorization of ld.param & st.param. Mar 27 2022, 8:52 PM

Revision Contents

Path	Size
llvm/lib/Target/NVPTX/NVPTXISelLowering.cpp	258 lines

Diff 418163

llvm/lib/Target/NVPTX/NVPTXISelLowering.cpp

[1,435 lines not shown]	SDValue NVPTXTargetLowering::LowerCall(TargetLowering::CallLoweringInfo &CLI,
const DataLayout &DL = DAG.getDataLayout();		const DataLayout &DL = DAG.getDataLayout();

bool isABI = (STI.getSmVersion() >= 20);		bool isABI = (STI.getSmVersion() >= 20);
assert(isABI && "Non-ABI compilation is not supported");		assert(isABI && "Non-ABI compilation is not supported");
if (!isABI)		if (!isABI)
return Chain;		return Chain;

unsigned UniqueCallSite = GlobalUniqueCallSite.fetch_add(1);		unsigned UniqueCallSite = GlobalUniqueCallSite.fetch_add(1);
SDValue tempChain = Chain;		SDValue TempChain = Chain;
Chain = DAG.getCALLSEQ_START(Chain, UniqueCallSite, 0, dl);		Chain = DAG.getCALLSEQ_START(Chain, UniqueCallSite, 0, dl);
SDValue InFlag = Chain.getValue(1);		SDValue InFlag = Chain.getValue(1);

unsigned paramCount = 0;		unsigned ParamCount = 0;
// Args.size() and Outs.size() need not match.		// Args.size() and Outs.size() need not match.
// Outs.size() will be larger		// Outs.size() will be larger
// * if there is an aggregate argument with multiple fields (each field		// * if there is an aggregate argument with multiple fields (each field
// showing up separately in Outs)		// showing up separately in Outs)
// * if there is a vector argument with more than typical vector-length		// * if there is a vector argument with more than typical vector-length
// elements (generally if more than 4) where each vector element is		// elements (generally if more than 4) where each vector element is
// individually present in Outs.		// individually present in Outs.
// So a different index should be used for indexing into Outs/OutVals.		// So a different index should be used for indexing into Outs/OutVals.
// See similar issue in LowerFormalArguments.		// See similar issue in LowerFormalArguments.
unsigned OIdx = 0;		unsigned OIdx = 0;
// Declare the .params or .reg need to pass values		// Declare the .params or .reg need to pass values
// to the function		// to the function
for (unsigned i = 0, e = Args.size(); i != e; ++i, ++OIdx) {		for (unsigned i = 0, e = Args.size(); i != e; ++i, ++OIdx) {
EVT VT = Outs[OIdx].VT;		EVT VT = Outs[OIdx].VT;
Type *Ty = Args[i].Ty;		Type *Ty = Args[i].Ty;
		bool IsByVal = Outs[OIdx].Flags.isByVal();

if (!Outs[OIdx].Flags.isByVal()) {
SmallVector<EVT, 16> VTs;		SmallVector<EVT, 16> VTs;
SmallVector<uint64_t, 16> Offsets;		SmallVector<uint64_t, 16> Offsets;
ComputePTXValueVTs(*this, DL, Ty, VTs, &Offsets);
Align ArgAlign = getArgumentAlignment(Callee, CB, Ty, paramCount + 1, DL);		assert((!IsByVal || Args[i].IndirectType) &&
unsigned AllocSize = DL.getTypeAllocSize(Ty);		"byval arg must have indirect type");
		Type *ETy = (IsByVal ? Args[i].IndirectType : Ty);
		ComputePTXValueVTs(*this, DL, ETy, VTs, &Offsets);

		Align ArgAlign;
		if (IsByVal) {
		// The ByValAlign in the Outs[OIdx].Flags is always set at this point,
		// so we don't need to worry whether it's naturally aligned or not.
		// See TargetLowering::LowerCallTo().
		ArgAlign = Outs[OIdx].Flags.getNonZeroByValAlign();

		// Try to increase alignment to enhance vectorization options.
		ArgAlign = std::max(ArgAlign, getFunctionParamOptimizedAlign(
		CB->getCalledFunction(), ETy, DL));

		// Enforce minumum alignment of 4 to work around ptxas miscompile
		// for sm_50+. See corresponding alignment adjustment in
		// emitFunctionParamList() for details.
		ArgAlign = std::max(ArgAlign, Align(4));
		} else {
		ArgAlign = getArgumentAlignment(Callee, CB, Ty, ParamCount + 1, DL);
		}

		unsigned TypeSize =
		(IsByVal ? Outs[OIdx].Flags.getByValSize() : DL.getTypeAllocSize(Ty));
SDVTList DeclareParamVTs = DAG.getVTList(MVT::Other, MVT::Glue);		SDVTList DeclareParamVTs = DAG.getVTList(MVT::Other, MVT::Glue);

bool NeedAlign; // Does argument declaration specify alignment?		bool NeedAlign; // Does argument declaration specify alignment?
if (Ty->isAggregateType() || Ty->isVectorTy() || Ty->isIntegerTy(128)) {		if (IsByVal ||
		(Ty->isAggregateType() || Ty->isVectorTy() || Ty->isIntegerTy(128))) {
// declare .param .align <align> .b8 .param<n>[<size>];		// declare .param .align <align> .b8 .param<n>[<size>];
SDValue DeclareParamOps[] = {		SDValue DeclareParamOps[] = {
Chain, DAG.getConstant(ArgAlign.value(), dl, MVT::i32),		Chain, DAG.getConstant(ArgAlign.value(), dl, MVT::i32),
DAG.getConstant(paramCount, dl, MVT::i32),		DAG.getConstant(ParamCount, dl, MVT::i32),
DAG.getConstant(AllocSize, dl, MVT::i32), InFlag};		DAG.getConstant(TypeSize, dl, MVT::i32), InFlag};
Chain = DAG.getNode(NVPTXISD::DeclareParam, dl, DeclareParamVTs,		Chain = DAG.getNode(NVPTXISD::DeclareParam, dl, DeclareParamVTs,
DeclareParamOps);		DeclareParamOps);
NeedAlign = true;		NeedAlign = true;
} else {		} else {
// declare .param .b<size> .param<n>;		// declare .param .b<size> .param<n>;
if ((VT.isInteger() || VT.isFloatingPoint()) && AllocSize < 4) {		if ((VT.isInteger() || VT.isFloatingPoint()) && TypeSize < 4) {
// PTX ABI requires integral types to be at least 32 bits in		// PTX ABI requires integral types to be at least 32 bits in
// size. FP16 is loaded/stored using i16, so it's handled		// size. FP16 is loaded/stored using i16, so it's handled
// here as well.		// here as well.
AllocSize = 4;		TypeSize = 4;
}		}
SDValue DeclareScalarParamOps[] = {		SDValue DeclareScalarParamOps[] = {
Chain, DAG.getConstant(paramCount, dl, MVT::i32),		Chain, DAG.getConstant(ParamCount, dl, MVT::i32),
DAG.getConstant(AllocSize * 8, dl, MVT::i32),		DAG.getConstant(TypeSize * 8, dl, MVT::i32),
DAG.getConstant(0, dl, MVT::i32), InFlag};		DAG.getConstant(0, dl, MVT::i32), InFlag};
Chain = DAG.getNode(NVPTXISD::DeclareScalarParam, dl, DeclareParamVTs,		Chain = DAG.getNode(NVPTXISD::DeclareScalarParam, dl, DeclareParamVTs,
DeclareScalarParamOps);		DeclareScalarParamOps);
NeedAlign = false;		NeedAlign = false;
}		}
InFlag = Chain.getValue(1);		InFlag = Chain.getValue(1);

// PTX Interoperability Guide 3.3(A): [Integer] Values shorter		// PTX Interoperability Guide 3.3(A): [Integer] Values shorter
// than 32-bits are sign extended or zero extended, depending on		// than 32-bits are sign extended or zero extended, depending on
// whether they are signed or unsigned types. This case applies		// whether they are signed or unsigned types. This case applies
// only to scalar parameters and not to aggregate values.		// only to scalar parameters and not to aggregate values.
bool ExtendIntegerParam =		bool ExtendIntegerParam =
Ty->isIntegerTy() && DL.getTypeAllocSizeInBits(Ty) < 32;		Ty->isIntegerTy() && DL.getTypeAllocSizeInBits(Ty) < 32;

auto VectorInfo = VectorizePTXValueVTs(VTs, Offsets, ArgAlign);		auto VectorInfo = VectorizePTXValueVTs(VTs, Offsets, ArgAlign);
SmallVector<SDValue, 6> StoreOperands;		SmallVector<SDValue, 6> StoreOperands;
for (unsigned j = 0, je = VTs.size(); j != je; ++j) {		for (unsigned j = 0, je = VTs.size(); j != je; ++j) {
		EVT EltVT = VTs[j];
		int CurOffset = Offsets[j];
		MaybeAlign PartAlign;
		if (NeedAlign)
		PartAlign = commonAlignment(ArgAlign, CurOffset);

// New store.		// New store.
if (VectorInfo[j] & PVF_FIRST) {		if (VectorInfo[j] & PVF_FIRST) {
assert(StoreOperands.empty() && "Unfinished preceding store.");		assert(StoreOperands.empty() && "Unfinished preceding store.");
StoreOperands.push_back(Chain);		StoreOperands.push_back(Chain);
StoreOperands.push_back(DAG.getConstant(paramCount, dl, MVT::i32));		StoreOperands.push_back(DAG.getConstant(ParamCount, dl, MVT::i32));
StoreOperands.push_back(DAG.getConstant(Offsets[j], dl, MVT::i32));		StoreOperands.push_back(DAG.getConstant(CurOffset, dl, MVT::i32));
}		}

EVT EltVT = VTs[j];
SDValue StVal = OutVals[OIdx];		SDValue StVal = OutVals[OIdx];
if (ExtendIntegerParam) {		if (IsByVal) {
		auto PtrVT = getPointerTy(DL);
		SDValue srcAddr = DAG.getNode(ISD::ADD, dl, PtrVT, StVal,
		DAG.getConstant(CurOffset, dl, PtrVT));
		StVal = DAG.getLoad(EltVT, dl, TempChain, srcAddr, MachinePointerInfo(),
		PartAlign);
		} else if (ExtendIntegerParam) {
assert(VTs.size() == 1 && "Scalar can't have multiple parts.");		assert(VTs.size() == 1 && "Scalar can't have multiple parts.");
// zext/sext to i32		// zext/sext to i32
StVal = DAG.getNode(Outs[OIdx].Flags.isSExt() ? ISD::SIGN_EXTEND		StVal = DAG.getNode(Outs[OIdx].Flags.isSExt() ? ISD::SIGN_EXTEND
: ISD::ZERO_EXTEND,		: ISD::ZERO_EXTEND,
dl, MVT::i32, StVal);		dl, MVT::i32, StVal);
} else if (EltVT.getSizeInBits() < 16) {		}

		if (!ExtendIntegerParam && EltVT.getSizeInBits() < 16) {
// Use 16-bit registers for small stores as it's the		// Use 16-bit registers for small stores as it's the
// smallest general purpose register size supported by NVPTX.		// smallest general purpose register size supported by NVPTX.
StVal = DAG.getNode(ISD::ANY_EXTEND, dl, MVT::i16, StVal);		StVal = DAG.getNode(ISD::ANY_EXTEND, dl, MVT::i16, StVal);
}		}

// Record the value to store.		// Record the value to store.
StoreOperands.push_back(StVal);		StoreOperands.push_back(StVal);

if (VectorInfo[j] & PVF_LAST) {		if (VectorInfo[j] & PVF_LAST) {
unsigned NumElts = StoreOperands.size() - 3;		unsigned NumElts = StoreOperands.size() - 3;
NVPTXISD::NodeType Op;		NVPTXISD::NodeType Op;
switch (NumElts) {		switch (NumElts) {
case 1:		case 1:
Op = NVPTXISD::StoreParam;		Op = NVPTXISD::StoreParam;
break;		break;
case 2:		case 2:
Op = NVPTXISD::StoreParamV2;		Op = NVPTXISD::StoreParamV2;
break;		break;
case 4:		case 4:
Op = NVPTXISD::StoreParamV4;		Op = NVPTXISD::StoreParamV4;
break;		break;
default:		default:
llvm_unreachable("Invalid vector info.");		llvm_unreachable("Invalid vector info.");
}		}

StoreOperands.push_back(InFlag);		StoreOperands.push_back(InFlag);

// Adjust type of the store op if we've extended the scalar		// Adjust type of the store op if we've extended the scalar
// return value.		// return value.
EVT TheStoreType = ExtendIntegerParam ? MVT::i32 : VTs[j];		EVT TheStoreType = ExtendIntegerParam ? MVT::i32 : EltVT;
MaybeAlign EltAlign;
if (NeedAlign)
EltAlign = commonAlignment(ArgAlign, Offsets[j]);

Chain = DAG.getMemIntrinsicNode(		Chain = DAG.getMemIntrinsicNode(
Op, dl, DAG.getVTList(MVT::Other, MVT::Glue), StoreOperands,		Op, dl, DAG.getVTList(MVT::Other, MVT::Glue), StoreOperands,
TheStoreType, MachinePointerInfo(), EltAlign,		TheStoreType, MachinePointerInfo(), PartAlign,
MachineMemOperand::MOStore);		MachineMemOperand::MOStore);
InFlag = Chain.getValue(1);		InFlag = Chain.getValue(1);

// Cleanup.		// Cleanup.
StoreOperands.clear();		StoreOperands.clear();
}		}
		if (!IsByVal)
++OIdx;		++OIdx;
}		}
assert(StoreOperands.empty() && "Unfinished parameter store.");		assert(StoreOperands.empty() && "Unfinished parameter store.");
if (VTs.size() > 0)		if (!IsByVal && VTs.size() > 0)
--OIdx;		--OIdx;
++paramCount;		++ParamCount;
continue;
}

// ByVal arguments
// TODO: remove code duplication when handling byval and non-byval cases.
SmallVector<EVT, 16> VTs;
SmallVector<uint64_t, 16> Offsets;
Type *ETy = Args[i].IndirectType;
assert(ETy && "byval arg must have indirect type");
ComputePTXValueVTs(*this, DL, ETy, VTs, &Offsets, 0);

// declare .param .align <align> .b8 .param<n>[<size>];
unsigned sz = Outs[OIdx].Flags.getByValSize();
SDVTList DeclareParamVTs = DAG.getVTList(MVT::Other, MVT::Glue);

// The ByValAlign in the Outs[OIdx].Flags is alway set at this point,
// so we don't need to worry about natural alignment or not.
// See TargetLowering::LowerCallTo().
Align ArgAlign = Outs[OIdx].Flags.getNonZeroByValAlign();

// Try to increase alignment to enhance vectorization options.
const Function *F = CB->getCalledFunction();
Align AlignCandidate = getFunctionParamOptimizedAlign(F, ETy, DL);
ArgAlign = std::max(ArgAlign, AlignCandidate);

// Enforce minumum alignment of 4 to work around ptxas miscompile
// for sm_50+. See corresponding alignment adjustment in
// emitFunctionParamList() for details.
if (ArgAlign < Align(4))
ArgAlign = Align(4);
SDValue DeclareParamOps[] = {
Chain, DAG.getConstant(ArgAlign.value(), dl, MVT::i32),
DAG.getConstant(paramCount, dl, MVT::i32),
DAG.getConstant(sz, dl, MVT::i32), InFlag};
Chain = DAG.getNode(NVPTXISD::DeclareParam, dl, DeclareParamVTs,
DeclareParamOps);
InFlag = Chain.getValue(1);

auto VectorInfo = VectorizePTXValueVTs(VTs, Offsets, ArgAlign);
SmallVector<SDValue, 6> StoreOperands;
for (unsigned j = 0, je = VTs.size(); j != je; ++j) {
EVT elemtype = VTs[j];
int curOffset = Offsets[j];
Align PartAlign = commonAlignment(ArgAlign, curOffset);

// New store.
if (VectorInfo[j] & PVF_FIRST) {
assert(StoreOperands.empty() && "Unfinished preceding store.");
StoreOperands.push_back(Chain);
StoreOperands.push_back(DAG.getConstant(paramCount, dl, MVT::i32));
StoreOperands.push_back(DAG.getConstant(curOffset, dl, MVT::i32));
}

auto PtrVT = getPointerTy(DL);
SDValue srcAddr = DAG.getNode(ISD::ADD, dl, PtrVT, OutVals[OIdx],
DAG.getConstant(curOffset, dl, PtrVT));
SDValue theVal = DAG.getLoad(elemtype, dl, tempChain, srcAddr,
MachinePointerInfo(), PartAlign);

if (elemtype.getSizeInBits() < 16) {
// Use 16-bit registers for small stores as it's the
// smallest general purpose register size supported by NVPTX.
theVal = DAG.getNode(ISD::ANY_EXTEND, dl, MVT::i16, theVal);
}

// Record the value to store.
StoreOperands.push_back(theVal);

if (VectorInfo[j] & PVF_LAST) {
unsigned NumElts = StoreOperands.size() - 3;
NVPTXISD::NodeType Op;
switch (NumElts) {
case 1:
Op = NVPTXISD::StoreParam;
break;
case 2:
Op = NVPTXISD::StoreParamV2;
break;
case 4:
Op = NVPTXISD::StoreParamV4;
break;
default:
llvm_unreachable("Invalid vector info.");
}

StoreOperands.push_back(InFlag);

Chain = DAG.getMemIntrinsicNode(
Op, dl, DAG.getVTList(MVT::Other, MVT::Glue), StoreOperands,
elemtype, MachinePointerInfo(), PartAlign,
MachineMemOperand::MOStore);
InFlag = Chain.getValue(1);

// Cleanup.
StoreOperands.clear();
}
}
assert(StoreOperands.empty() && "Unfinished parameter store.");
++paramCount;
}		}

GlobalAddressSDNode *Func = dyn_cast<GlobalAddressSDNode>(Callee.getNode());		GlobalAddressSDNode *Func = dyn_cast<GlobalAddressSDNode>(Callee.getNode());
MaybeAlign retAlignment = None;		MaybeAlign retAlignment = None;

// Handle Result		// Handle Result
if (Ins.size() > 0) {		if (Ins.size() > 0) {
SmallVector<EVT, 16> resvtparts;		SmallVector<EVT, 16> resvtparts;
[90 lines not shown]	SDValue NVPTXTargetLowering::LowerCall(TargetLowering::CallLoweringInfo &CLI,

// Ops to print out the param list		// Ops to print out the param list
SDVTList CallArgBeginVTs = DAG.getVTList(MVT::Other, MVT::Glue);		SDVTList CallArgBeginVTs = DAG.getVTList(MVT::Other, MVT::Glue);
SDValue CallArgBeginOps[] = { Chain, InFlag };		SDValue CallArgBeginOps[] = { Chain, InFlag };
Chain = DAG.getNode(NVPTXISD::CallArgBegin, dl, CallArgBeginVTs,		Chain = DAG.getNode(NVPTXISD::CallArgBegin, dl, CallArgBeginVTs,
CallArgBeginOps);		CallArgBeginOps);
InFlag = Chain.getValue(1);		InFlag = Chain.getValue(1);

for (unsigned i = 0, e = paramCount; i != e; ++i) {		for (unsigned i = 0, e = ParamCount; i != e; ++i) {
unsigned opcode;		unsigned opcode;
if (i == (e - 1))		if (i == (e - 1))
opcode = NVPTXISD::LastCallArg;		opcode = NVPTXISD::LastCallArg;
else		else
opcode = NVPTXISD::CallArg;		opcode = NVPTXISD::CallArg;
SDVTList CallArgVTs = DAG.getVTList(MVT::Other, MVT::Glue);		SDVTList CallArgVTs = DAG.getVTList(MVT::Other, MVT::Glue);
SDValue CallArgOps[] = { Chain, DAG.getConstant(1, dl, MVT::i32),		SDValue CallArgOps[] = { Chain, DAG.getConstant(1, dl, MVT::i32),
DAG.getConstant(i, dl, MVT::i32), InFlag };		DAG.getConstant(i, dl, MVT::i32), InFlag };
[3,430 lines not shown]
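
The diff computes a per-part alignment as `commonAlignment(ArgAlign, CurOffset)` before each load and store. As a rough standalone illustration of that idea, the sketch below returns the largest power of two that divides both the argument alignment and the part's offset. It mirrors my understanding of what the LLVM helper computes and is not its source; the name `commonAlignmentSketch` is hypothetical, and plain integers stand in for LLVM's alignment type.

```cpp
#include <cassert>
#include <cstdint>

// Largest power of two dividing both Align and Offset.
// Assumes Align is a nonzero power of two; an Offset of 0 yields Align itself.
std::uint64_t commonAlignmentSketch(std::uint64_t Align, std::uint64_t Offset) {
  assert(Align != 0 && (Align & (Align - 1)) == 0 &&
         "alignment must be a power of two");
  const std::uint64_t Bits = Align | Offset;
  // Isolate the lowest set bit of (Align | Offset).
  return Bits & (~Bits + 1);
}
```

For example, a part at offset 4 inside a 16-byte-aligned parameter can only be assumed 4-byte aligned, while a part at offset 0 keeps the full 16-byte alignment.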
