This is an archive of the discontinued LLVM Phabricator instance.

[SelectionDAG] Add expansion and promotion of [US]MUL_LOHI
ClosedPublic

Authored by nhaehnle on Sep 27 2016, 3:36 AM.

Download Raw Diff

Details

Reviewers

spatel
nadav
• tstellarAMD
venkatra
bkramer
hfinkel
ast
efriedma

Commits

rGf08dc90253f4: [SelectionDAG] Add expansion and promotion of [US]MUL_LOHI
rL289050: [SelectionDAG] Add expansion and promotion of [US]MUL_LOHI

Summary

Most targets set the action for these nodes to Expand even though there
isn't actually any code for them in ExpandNode. Instead, targets simply
relied on the fact that no code generates these nodes as long as the
nodes aren't legal or custom.

However, generating these nodes can be useful e.g. for divide-by-constant
in wider integer types. Targets will want to deal with [US]MUL_LOHI
differently depending on the available instructions, e.g. targets with a
native 64-bit multiply will want to Promote the 32-bit [US]MUL_LOHI
instructions, and targets with native MULH[US] instructions will want to
Split.

This patch intends to not change the generated code, but indirect effects
are possible since expansions/promotions that were previously done in
DAGCombine may now be done in LegalizeDAG.

See D24822 for a change that actually uses the new expansion.

Diff Detail

Event Timeline

nhaehnle updated this revision to Diff 72620.Sep 27 2016, 3:36 AM

nhaehnle retitled this revision from to [SelectionDAG] Add expansion and promotion of [US]MUL_LOHI.

nhaehnle updated this object.

nhaehnle added reviewers: spatel, bkramer, venkatra, efriedma, hfinkel, ast, nadav, • tstellarAMD.

nhaehnle added a subscriber: llvm-commits.

Herald added subscribers: nhaehnle, wdng, nemanjai and 2 others. · View Herald TranscriptSep 27 2016, 3:36 AM

nhaehnle mentioned this in D24822: [SelectionDAG] Enable division-by-constant optimization for wide types.Sep 27 2016, 3:38 AM

nhaehnle added a child revision: D24822: [SelectionDAG] Enable division-by-constant optimization for wide types.Sep 27 2016, 3:39 AM

Ping.

efriedma added inline comments.Oct 4 2016, 9:51 AM

include/llvm/Target/TargetLowering.h
147	It doesn't look like None is actually used anywhere... did you mean to use it somehow?
lib/CodeGen/SelectionDAG/TargetLowering.cpp
3156	Why are you using ADDC+ADDE rather than just plain ADD? It seems strange to split the operation, given that it might be legal. (I might be missing something here, though.)
lib/Target/Sparc/SparcISelLowering.cpp
1696	This FIXME doesn't make sense here; Promote is just the right thing to do here.
lib/Target/X86/X86ISelLowering.cpp
32532	I'm confused; you're returning "Scalarize" for scalar types?

RKSimon added a subscriber: RKSimon.Oct 5 2016, 7:04 AM

Thank you for taking a look. This patch should address all your comments,
plus I'm adding a guard against incorrectly using the known bits
optimizations with vector types.

Herald edited edge metadata. · View Herald TranscriptOct 5 2016, 11:34 AM

nhaehnle added inline comments.Oct 5 2016, 11:35 AM

include/llvm/Target/TargetLowering.h
147	I used it at some point to preserve the original behaviour. I'll remove it.
lib/CodeGen/SelectionDAG/TargetLowering.cpp
3156	That's just how the code happened to evolve because AMDGPU (which is where this all started) doesn't have 64-bit additions, but it's a good point. I'm changing it to use additions in the larger type.
lib/Target/Sparc/SparcISelLowering.cpp
1696	FIXME removed.
lib/Target/X86/X86ISelLowering.cpp
32532	On X86 this only gets called for vector types. I'm adding an assertion to make that explicit.

I don't really understand why this needs to be a new target hook. Can't it have similar code as the MUL expansion code at LegalizeDAG.cpp:3316 (in the base version): if MULH and MUL are LegalOrCustom, then you'd split the MUL_LOHI into those two operations. Otherwise, you'd do half-width multiplies?

Also, I've fixed the sparc backend to properly implement [US]MUL_LOHI in r283381, removing the FIXME here.

In D24956#562725, @jyknight wrote:

I don't really understand why this needs to be a new target hook. Can't it have similar code as the MUL expansion code at LegalizeDAG.cpp:3316 (in the base version): if MULH and MUL are LegalOrCustom, then you'd split the MUL_LOHI into those two operations. Otherwise, you'd do half-width multiplies?

IMHO, explicit is better than implicit. Also, how would you choose between scalarizing and half-width multiplies for vector types? You might be able to come up with a magic heuristic that works now, only to be broken when a new backend comes along.

Also, I've fixed the sparc backend to properly implement [US]MUL_LOHI in r283381, removing the FIXME here.

Thanks!

IMHO, explicit is better than implicit. Also, how would you choose between scalarizing and half-width multiplies for vector types? You might be able to come up with a magic heuristic that works now, only to be broken when a new backend comes along.

I don't think it makes much sense to require backends to decide between Split and Half; whenever you can split into MULH and MUL, you should, and do 4 half-width multiplies otherwise.

I don't know much about the vector question -- how is that done for other operations? Surely the question about which axis to "expand" when expanding a vector operation isn't only an question for multiplication, so a MUL-specific solution to this issue feels strange -- I'd expect that either there's already a solution in use elsewhere, or a wider problem.

RKSimon added inline comments.Oct 12 2016, 8:59 AM

lib/CodeGen/SelectionDAG/TargetLowering.cpp
3035	If this is going to start being used by vectors, shouldn't these getSizeInBits() calls be replaced with getScalarSizeInBits()?

Legalization for certain vector operations uses "PROMOTE" to indicate that the operation should be performed in a different vector type. That really stretches the meaning if we're using it to mean "split into multiple vector multiplies", but I guess it could work. See VectorLegalizer::Promote in LegalizeVectorOps.cpp.

lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
3331	Sorry, I really haven't worked with this code in a while; I should have spotted the issue here earlier. :( We can't ever scalarize an operation in LegalizeDAG: scalarizing can produce illegal types, and LegalizeDAG can't handle them. We have to handle this a bit earlier, in LegalizeVectorOps.

Thank you all for looking into this and sorry for the delay. I think I've
addressed all your comments:

getMulExpansion is gone
vector scalarization is done in LegalizeVectorOps
use getScalarSizeInBits() throughout to prepare for vector use

There's also the question of "promoting" / using half-width multiplies for
vectors. That could become useful to have more aggressive replacement of
divide-by-constant with the equivalent multiply+shift sequence also for
vector types, i.e. pushing D24822 even further.

However, there are more problems there (X86 gets unnecessarily worse looking
instruction sequences in some tests), so that should be a separate change.

Simplify the Expand part in LegalizeDAG, since vectors are now expanded
earlier.

Ping.

lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
3331	Okay, fixing this.
lib/CodeGen/SelectionDAG/TargetLowering.cpp
3035	Fixed, thanks.

Ping^2

Sorry about the delay...

Have you seen https://reviews.llvm.org/D26628 ? Does that interact with the patch in any way?

TargetLowering::expandMUL_LOHI is kind of difficult to review with all the different unrelated changes involved; would it be possible to separate out the non-functional changes?

lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
3330	Maybe add an assertion here that HalfType is a legal type? Some architectures could trip over this...
lib/CodeGen/SelectionDAG/TargetLowering.cpp
3203	Should there be an assertion here? If expandMUL_LOHI isn't returning exactly two results, something went wrong.

nhaehnle mentioned this in D27063: [SelectionDAG] Early-out in TargetLowering::expandMUL (NFC).Nov 23 2016, 1:08 PM

nhaehnle mentioned this in D27064: [SelectionDAG] Refactor TargetLowering::expandMUL (NFC).

Thank you for taking a look.

D26628 does not directly interact, since it affected a custom sequence used
by DAGTypeLegalizer if the call to TargetLowering has failed. I suppose now
that TargetLowering no longer insists on only using legal instructions one
could think about using that mode instead of the custom sequence in
DAGTypeLegalizer, but that sequence is also qualitatively different (using a
regular NxN-bit multiply on values where everything but the low N/2 bits
have been masked out).

I've addressed your minor comments and split this up into three patches,
the first two being D27063 and D27064.

I'm going to add them as dependencies as well. Unfortunately, handling
patch series with Phabricator seems really awkward :(

nhaehnle added parent revisions: D27063: [SelectionDAG] Early-out in TargetLowering::expandMUL (NFC), D27064: [SelectionDAG] Refactor TargetLowering::expandMUL (NFC).Nov 23 2016, 1:12 PM

nhaehnle added inline comments.Nov 23 2016, 1:15 PM

lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
3330	I missed this one, sorry.

Add assertion that HalfType is legal.

Diffusion mentioned this in rL287831: [SelectionDAG] Early-out in TargetLowering::expandMUL (NFC).Nov 23 2016, 2:24 PM

Diffusion mentioned this in rL288248: [SelectionDAG] Refactor TargetLowering::expandMUL (NFC).Nov 30 2016, 8:36 AM

Ping.

efriedma added inline comments.Dec 2 2016, 10:58 AM

include/llvm/Target/TargetLowering.h
3031	Please make OnlyLegalOrCustom an enum; otherwise it's completely impossible to tell what the parameter does without looking at the header.
lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
2197	Can you replace this code with a call to expandMUL?

Use an enum instead of OnlyLegalOrCustom
Handle a corner case where shift amounts are too large for the native shift type

nhaehnle added inline comments.Dec 7 2016, 6:52 AM

lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
2197	I played around with that, but I'd rather not do it in this patch. The generated code sequences are different, and in particular we'd generate non-legal instructions that at least the X86 target doesn't expect and doesn't handle properly. This is visible in test/CodeGen/X86/imul-256/512/1024.ll.

LGTM with minor comments addressed.

lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
2197	Okay, that's fine.
lib/CodeGen/SelectionDAG/TargetLowering.cpp
3079	MaskedValueIsZero should do the right thing for vectors, I think, with recent changes to computeKnownBits.
3113	Hmm... I was going to say that ShiftAmount always fits into ShiftAmountTy because it's a defined shift (ShiftAmount < OuterBitSize < maximum unsigned value), but getShiftAmountTy doesn't always return a sensible result for illegal types. Please just note this problem with a FIXME.

This revision is now accepted and ready to land.Dec 7 2016, 10:51 AM

Closed by commit rL289050: [SelectionDAG] Add expansion and promotion of [US]MUL_LOHI (authored by nha). · Explain WhyDec 8 2016, 6:18 AM

This revision was automatically updated to reflect the committed changes.

I pruned three of \param(s) in r289057. Could you recheck them?

llvm/trunk/include/llvm/Target/TargetLowering.h
3051 ↗	(On Diff #80754)	HalfVT and OnlyLegalOrCustom cannot be seen here.
3067 ↗	(On Diff #80754)	ditto.

Revision Contents

Path

Size

include/

llvm/

Target/

TargetLowering.h

40 lines

lib/

CodeGen/

SelectionDAG/

LegalizeDAG.cpp

98 lines

LegalizeIntegerTypes.cpp

2 lines

TargetLowering.cpp

262 lines

Target/

AMDGPU/

AMDGPUISelLowering.h

2 lines

AMDGPUISelLowering.cpp

7 lines

BPF/

BPFISelLowering.h

2 lines

BPFISelLowering.cpp

5 lines

PowerPC/

PPCISelLowering.h

2 lines

PPCISelLowering.cpp

7 lines

Sparc/

SparcISelLowering.h

2 lines

SparcISelLowering.cpp

19 lines

X86/

X86ISelLowering.h

2 lines

X86ISelLowering.cpp

7 lines

Diff 72620

include/llvm/Target/TargetLowering.h

Show First 20 Lines • Show All 136 Lines • ▼ Show 20 Lines	enum class AtomicExpansionKind {
None, // Don't expand the instruction.		None, // Don't expand the instruction.
LLSC, // Expand the instruction into loadlinked/storeconditional; used		LLSC, // Expand the instruction into loadlinked/storeconditional; used
// by ARM/AArch64.		// by ARM/AArch64.
LLOnly, // Expand the (load) instruction into just a load-linked, which has		LLOnly, // Expand the (load) instruction into just a load-linked, which has
// greater atomic guarantees than a normal load.		// greater atomic guarantees than a normal load.
CmpXChg, // Expand the instruction into cmpxchg; used by at least X86.		CmpXChg, // Expand the instruction into cmpxchg; used by at least X86.
};		};

		/// Enum that specifies what [US]MUL_LOHI is expanded to.
		enum class MulExpansion {
		None, // Do not actually expand anything
		efriedmaUnsubmitted Done Reply Inline Actions It doesn't look like None is actually used anywhere... did you mean to use it somehow? efriedma: It doesn't look like None is actually used anywhere... did you mean to use it somehow?
		nhaehnleAuthorUnsubmitted Not Done Reply Inline Actions I used it at some point to preserve the original behaviour. I'll remove it. nhaehnle: I used it at some point to preserve the original behaviour. I'll remove it.
		Split, // Split into MUL + MULH[US]
		Scalarize, // Scalarize vector operations
		HalfWidth, // Sequence of half-width multiplications and additions
		};

static ISD::NodeType getExtendForContent(BooleanContent Content) {		static ISD::NodeType getExtendForContent(BooleanContent Content) {
switch (Content) {		switch (Content) {
case UndefinedBooleanContent:		case UndefinedBooleanContent:
// Extend by adding rubbish bits.		// Extend by adding rubbish bits.
return ISD::ANY_EXTEND;		return ISD::ANY_EXTEND;
case ZeroOrOneBooleanContent:		case ZeroOrOneBooleanContent:
// Extend by adding zero bits.		// Extend by adding zero bits.
return ISD::ZERO_EXTEND;		return ISD::ZERO_EXTEND;
▲ Show 20 Lines • Show All 100 Lines • ▼ Show 20 Lines	public:
bool isSlowDivBypassed() const { return !BypassSlowDivWidths.empty(); }		bool isSlowDivBypassed() const { return !BypassSlowDivWidths.empty(); }

/// Returns map of slow types for division or remainder with corresponding		/// Returns map of slow types for division or remainder with corresponding
/// fast types		/// fast types
const DenseMap<unsigned int, unsigned int> &getBypassSlowDivWidths() const {		const DenseMap<unsigned int, unsigned int> &getBypassSlowDivWidths() const {
return BypassSlowDivWidths;		return BypassSlowDivWidths;
}		}

		/// Returns which of a number of possible expansions should be used for
		/// [US]MUL_LOHI instructions. This is only used when the corresponding
		/// operation actions is set to Expand.
		virtual MulExpansion getMulExpansion(unsigned Opcode, MVT VT) const {
		return MulExpansion::Split;
		}

/// Return true if Flow Control is an expensive operation that should be		/// Return true if Flow Control is an expensive operation that should be
/// avoided.		/// avoided.
bool isJumpExpensive() const { return JumpIsExpensive; }		bool isJumpExpensive() const { return JumpIsExpensive; }

/// Return true if selects are only cheaper than branches if the branch is		/// Return true if selects are only cheaper than branches if the branch is
/// unlikely to be predicted right.		/// unlikely to be predicted right.
bool isPredictableSelectExpensive() const {		bool isPredictableSelectExpensive() const {
return PredictableSelectIsExpensive;		return PredictableSelectIsExpensive;
▲ Show 20 Lines • Show All 2,725 Lines • ▼ Show 20 Lines	virtual SDValue getRecipEstimate(SDValue Operand, DAGCombinerInfo &DCI,
unsigned &RefinementSteps) const {		unsigned &RefinementSteps) const {
return SDValue();		return SDValue();
}		}

//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
// Legalization utility functions		// Legalization utility functions
//		//

		/// Expand a MUL or [US]MUL_LOHI of n-bit values into two or four nodes,
		/// respectively, each computing an n/2-bit part of the result.
		/// \param Result A vector that will be filled with the parts of the result
		/// in little-endian order.
		/// \param HalfVT The value type to use for the result nodes.
		/// \param OnlyLegalOrCustom Only legal or custom instructions are used.
		/// \param LL Low bits of the LHS of the MUL. You can use this parameter
		/// if you want to control how low bits are extracted from the LHS.
		/// \param LH High bits of the LHS of the MUL. See LL for meaning.
		/// \param RL Low bits of the RHS of the MUL. See LL for meaning
		/// \param RH High bits of the RHS of the MUL. See LL for meaning.
		/// \returns true if the node has been expanded, false if it has not
		bool expandMUL_LOHI(unsigned Opcode, EVT VT, SDLoc dl, SDValue LHS,
		SDValue RHS, SmallVectorImpl<SDValue> &Result, EVT HalfVT,
		SelectionDAG &DAG, bool OnlyLegalOrCustom,
		efriedmaUnsubmitted Done Reply Inline Actions Please make OnlyLegalOrCustom an enum; otherwise it's completely impossible to tell what the parameter does without looking at the header. efriedma: Please make OnlyLegalOrCustom an enum; otherwise it's completely impossible to tell what the…
		SDValue LL = SDValue(), SDValue LH = SDValue(),
		SDValue RL = SDValue(), SDValue RH = SDValue()) const;

/// Expand a MUL into two nodes. One that computes the high bits of		/// Expand a MUL into two nodes. One that computes the high bits of
/// the result and one that computes the low bits.		/// the result and one that computes the low bits.
/// \param HiLoVT The value type to use for the Lo and Hi nodes.		/// \param HiLoVT The value type to use for the Lo and Hi nodes.
		/// \param OnlyLegalOrCustom Only legal or custom instructions are used.
/// \param LL Low bits of the LHS of the MUL. You can use this parameter		/// \param LL Low bits of the LHS of the MUL. You can use this parameter
/// if you want to control how low bits are extracted from the LHS.		/// if you want to control how low bits are extracted from the LHS.
/// \param LH High bits of the LHS of the MUL. See LL for meaning.		/// \param LH High bits of the LHS of the MUL. See LL for meaning.
/// \param RL Low bits of the RHS of the MUL. See LL for meaning		/// \param RL Low bits of the RHS of the MUL. See LL for meaning
/// \param RH High bits of the RHS of the MUL. See LL for meaning.		/// \param RH High bits of the RHS of the MUL. See LL for meaning.
/// \returns true if the node has been expanded. false if it has not		/// \returns true if the node has been expanded. false if it has not
bool expandMUL(SDNode *N, SDValue &Lo, SDValue &Hi, EVT HiLoVT,		bool expandMUL(SDNode *N, SDValue &Lo, SDValue &Hi, EVT HiLoVT,
SelectionDAG &DAG, SDValue LL = SDValue(),		SelectionDAG &DAG, bool OnlyLegalOrCustom,
SDValue LH = SDValue(), SDValue RL = SDValue(),		SDValue LL = SDValue(), SDValue LH = SDValue(),
SDValue RH = SDValue()) const;		SDValue RL = SDValue(), SDValue RH = SDValue()) const;

/// Expand float(f32) to SINT(i64) conversion		/// Expand float(f32) to SINT(i64) conversion
/// \param N Node to expand		/// \param N Node to expand
/// \param Result output after conversion		/// \param Result output after conversion
/// \returns True, if the expansion was successful, false otherwise		/// \returns True, if the expansion was successful, false otherwise
bool expandFP_TO_SINT(SDNode *N, SDValue &Result, SelectionDAG &DAG) const;		bool expandFP_TO_SINT(SDNode *N, SDValue &Result, SelectionDAG &DAG) const;

/// Turn load of vector type into a load of the individual elements.		/// Turn load of vector type into a load of the individual elements.
▲ Show 20 Lines • Show All 72 Lines • Show Last 20 Lines

lib/CodeGen/SelectionDAG/LegalizeDAG.cpp

Show First 20 Lines • Show All 3,296 Lines • ▼ Show 20 Lines	if (TLI.isOperationLegalOrCustom(DivRemOpc, VT)) {
Tmp1 = DAG.getNode(DivRemOpc, dl, VTs, Node->getOperand(0),		Tmp1 = DAG.getNode(DivRemOpc, dl, VTs, Node->getOperand(0),
Node->getOperand(1));		Node->getOperand(1));
Results.push_back(Tmp1);		Results.push_back(Tmp1);
}		}
break;		break;
}		}
case ISD::MULHU:		case ISD::MULHU:
case ISD::MULHS: {		case ISD::MULHS: {
unsigned ExpandOpcode = Node->getOpcode() == ISD::MULHU ? ISD::UMUL_LOHI :		unsigned ExpandOpcode =
ISD::SMUL_LOHI;		Node->getOpcode() == ISD::MULHU ? ISD::UMUL_LOHI : ISD::SMUL_LOHI;
EVT VT = Node->getValueType(0);		EVT VT = Node->getValueType(0);
SDVTList VTs = DAG.getVTList(VT, VT);		SDVTList VTs = DAG.getVTList(VT, VT);
assert(TLI.isOperationLegalOrCustom(ExpandOpcode, VT) &&
"If this wasn't legal, it shouldn't have been created!");
Tmp1 = DAG.getNode(ExpandOpcode, dl, VTs, Node->getOperand(0),		Tmp1 = DAG.getNode(ExpandOpcode, dl, VTs, Node->getOperand(0),
Node->getOperand(1));		Node->getOperand(1));
Results.push_back(Tmp1.getValue(1));		Results.push_back(Tmp1.getValue(1));
break;		break;
}		}
		case ISD::UMUL_LOHI:
		case ISD::SMUL_LOHI: {
		SDValue LHS = Node->getOperand(0);
		SDValue RHS = Node->getOperand(1);
		MVT VT = LHS.getSimpleValueType();
		TargetLowering::MulExpansion Kind =
		TLI.getMulExpansion(Node->getOpcode(), VT);

		switch (Kind) {
		case TargetLowering::MulExpansion::None:
		break;

		case TargetLowering::MulExpansion::Split:
		Results.push_back(DAG.getNode(ISD::MUL, dl, VT, LHS, RHS));
		Results.push_back(DAG.getNode(
		Node->getOpcode() == ISD::UMUL_LOHI ? ISD::MULHU : ISD::MULHS, dl, VT,
		efriedmaUnsubmitted Not Done Reply Inline Actions Maybe add an assertion here that HalfType is a legal type? Some architectures could trip over this... efriedma: Maybe add an assertion here that HalfType is a legal type? Some architectures could trip over…
		nhaehnleAuthorUnsubmitted Not Done Reply Inline Actions I missed this one, sorry. nhaehnle: I missed this one, sorry.
		LHS, RHS));
		efriedmaUnsubmitted Done Reply Inline Actions Sorry, I really haven't worked with this code in a while; I should have spotted the issue here earlier. :( We can't ever scalarize an operation in LegalizeDAG: scalarizing can produce illegal types, and LegalizeDAG can't handle them. We have to handle this a bit earlier, in LegalizeVectorOps. efriedma: Sorry, I really haven't worked with this code in a while; I should have spotted the issue here…
		nhaehnleAuthorUnsubmitted Not Done Reply Inline Actions Okay, fixing this. nhaehnle: Okay, fixing this.
		break;

		case TargetLowering::MulExpansion::Scalarize: {
		MVT ScalarVT = VT.getScalarType();
		unsigned NumElements = VT.getVectorNumElements();
		MVT IdxTy = TLI.getVectorIdxTy(DAG.getDataLayout());
		SmallVector<SDValue, 16> Scalars;

		for (unsigned i = 0; i < NumElements; ++i) {
		SDValue Idx = DAG.getConstant(i, dl, IdxTy);
		Scalars.push_back(DAG.getNode(
		Node->getOpcode(), dl, DAG.getVTList(ScalarVT, ScalarVT),
		DAG.getNode(ISD::EXTRACT_VECTOR_ELT, dl, ScalarVT, LHS, Idx),
		DAG.getNode(ISD::EXTRACT_VECTOR_ELT, dl, ScalarVT, RHS, Idx)));
		}

		for (unsigned i = 0; i < 2; ++i) {
		SmallVector<SDValue, 16> Elements;
		for (unsigned j = 0; j < NumElements; ++j)
		Elements.push_back(Scalars[j].getValue(i));
		Results.push_back(DAG.getNode(ISD::BUILD_VECTOR, dl, VT, Elements));
		}
		break;
		}

		case TargetLowering::MulExpansion::HalfWidth: {
		SmallVector<SDValue, 4> Halves;
		EVT HalfType;
		if (!VT.isVector()) {
		HalfType = EVT(VT).getHalfSizedIntegerVT(*DAG.getContext());
		} else {
		HalfType = EVT::getVectorVT(
		*DAG.getContext(),
		EVT(VT.getScalarType()).getHalfSizedIntegerVT(*DAG.getContext()),
		VT.getVectorNumElements());
		}
		if (TLI.expandMUL_LOHI(Node->getOpcode(), VT, Node, LHS, RHS, Halves,
		HalfType, DAG, false)) {
		for (unsigned i = 0; i < 2; ++i) {
		SDValue Lo = DAG.getNode(ISD::ZERO_EXTEND, dl, VT, Halves[2 * i]);
		SDValue Hi = DAG.getNode(ISD::ANY_EXTEND, dl, VT, Halves[2 * i + 1]);
		SDValue Shift = DAG.getConstant(
		HalfType.getSizeInBits(), dl,
		TLI.getShiftAmountTy(HalfType, DAG.getDataLayout()));
		Hi = DAG.getNode(ISD::SHL, dl, VT, Hi, Shift);
		Results.push_back(DAG.getNode(ISD::OR, dl, VT, Lo, Hi));
		}
		break;
		}
		break;
		}
		}
		break;
		}
case ISD::MUL: {		case ISD::MUL: {
EVT VT = Node->getValueType(0);		EVT VT = Node->getValueType(0);
SDVTList VTs = DAG.getVTList(VT, VT);		SDVTList VTs = DAG.getVTList(VT, VT);
// See if multiply or divide can be lowered using two-result operations.		// See if multiply or divide can be lowered using two-result operations.
// We just need the low half of the multiply; try both the signed		// We just need the low half of the multiply; try both the signed
// and unsigned forms. If the target supports both SMUL_LOHI and		// and unsigned forms. If the target supports both SMUL_LOHI and
// UMUL_LOHI, form a preference by checking which forms of plain		// UMUL_LOHI, form a preference by checking which forms of plain
// MULH it supports.		// MULH it supports.
Show All 18 Lines	case ISD::MUL: {
}		}

SDValue Lo, Hi;		SDValue Lo, Hi;
EVT HalfType = VT.getHalfSizedIntegerVT(*DAG.getContext());		EVT HalfType = VT.getHalfSizedIntegerVT(*DAG.getContext());
if (TLI.isOperationLegalOrCustom(ISD::ZERO_EXTEND, VT) &&		if (TLI.isOperationLegalOrCustom(ISD::ZERO_EXTEND, VT) &&
TLI.isOperationLegalOrCustom(ISD::ANY_EXTEND, VT) &&		TLI.isOperationLegalOrCustom(ISD::ANY_EXTEND, VT) &&
TLI.isOperationLegalOrCustom(ISD::SHL, VT) &&		TLI.isOperationLegalOrCustom(ISD::SHL, VT) &&
TLI.isOperationLegalOrCustom(ISD::OR, VT) &&		TLI.isOperationLegalOrCustom(ISD::OR, VT) &&
TLI.expandMUL(Node, Lo, Hi, HalfType, DAG)) {		TLI.expandMUL(Node, Lo, Hi, HalfType, DAG, true)) {
Lo = DAG.getNode(ISD::ZERO_EXTEND, dl, VT, Lo);		Lo = DAG.getNode(ISD::ZERO_EXTEND, dl, VT, Lo);
Hi = DAG.getNode(ISD::ANY_EXTEND, dl, VT, Hi);		Hi = DAG.getNode(ISD::ANY_EXTEND, dl, VT, Hi);
SDValue Shift =		SDValue Shift =
DAG.getConstant(HalfType.getSizeInBits(), dl,		DAG.getConstant(HalfType.getSizeInBits(), dl,
TLI.getShiftAmountTy(HalfType, DAG.getDataLayout()));		TLI.getShiftAmountTy(HalfType, DAG.getDataLayout()));
Hi = DAG.getNode(ISD::SHL, dl, VT, Hi, Shift);		Hi = DAG.getNode(ISD::SHL, dl, VT, Hi, Shift);
Results.push_back(DAG.getNode(ISD::OR, dl, VT, Lo, Hi));		Results.push_back(DAG.getNode(ISD::OR, dl, VT, Lo, Hi));
}		}
▲ Show 20 Lines • Show All 783 Lines • ▼ Show 20 Lines	case ISD::XOR: {
// Promote each of the values to the new type.		// Promote each of the values to the new type.
Tmp1 = DAG.getNode(ExtOp, dl, NVT, Node->getOperand(0));		Tmp1 = DAG.getNode(ExtOp, dl, NVT, Node->getOperand(0));
Tmp2 = DAG.getNode(ExtOp, dl, NVT, Node->getOperand(1));		Tmp2 = DAG.getNode(ExtOp, dl, NVT, Node->getOperand(1));
// Perform the larger operation, then convert back		// Perform the larger operation, then convert back
Tmp1 = DAG.getNode(Node->getOpcode(), dl, NVT, Tmp1, Tmp2);		Tmp1 = DAG.getNode(Node->getOpcode(), dl, NVT, Tmp1, Tmp2);
Results.push_back(DAG.getNode(TruncOp, dl, OVT, Tmp1));		Results.push_back(DAG.getNode(TruncOp, dl, OVT, Tmp1));
break;		break;
}		}
		case ISD::UMUL_LOHI:
		case ISD::SMUL_LOHI: {
		// Promote to a multiply in a wider integer type.
		unsigned ExtOp = Node->getOpcode() == ISD::UMUL_LOHI ? ISD::ZERO_EXTEND
		: ISD::SIGN_EXTEND;
		Tmp1 = DAG.getNode(ExtOp, dl, NVT, Node->getOperand(0));
		Tmp2 = DAG.getNode(ExtOp, dl, NVT, Node->getOperand(1));
		Tmp1 = DAG.getNode(ISD::MUL, dl, NVT, Tmp1, Tmp2);

		auto &DL = DAG.getDataLayout();
		unsigned OriginalSize = OVT.getScalarSizeInBits();
		Tmp2 = DAG.getNode(
		ISD::SRL, dl, NVT, Tmp1,
		DAG.getConstant(OriginalSize, dl, TLI.getScalarShiftAmountTy(DL, NVT)));
		Results.push_back(DAG.getNode(ISD::TRUNCATE, dl, OVT, Tmp1));
		Results.push_back(DAG.getNode(ISD::TRUNCATE, dl, OVT, Tmp2));
		break;
		}
case ISD::SELECT: {		case ISD::SELECT: {
unsigned ExtOp, TruncOp;		unsigned ExtOp, TruncOp;
if (Node->getValueType(0).isVector() \|\|		if (Node->getValueType(0).isVector() \|\|
Node->getValueType(0).getSizeInBits() == NVT.getSizeInBits()) {		Node->getValueType(0).getSizeInBits() == NVT.getSizeInBits()) {
ExtOp = ISD::BITCAST;		ExtOp = ISD::BITCAST;
TruncOp = ISD::BITCAST;		TruncOp = ISD::BITCAST;
} else if (Node->getValueType(0).isInteger()) {		} else if (Node->getValueType(0).isInteger()) {
ExtOp = ISD::ANY_EXTEND;		ExtOp = ISD::ANY_EXTEND;
▲ Show 20 Lines • Show All 335 Lines • Show Last 20 Lines

lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp

Show First 20 Lines • Show All 2,170 Lines • ▼ Show 20 Lines	void DAGTypeLegalizer::ExpandIntRes_MUL(SDNode *N,
EVT VT = N->getValueType(0);		EVT VT = N->getValueType(0);
EVT NVT = TLI.getTypeToTransformTo(*DAG.getContext(), VT);		EVT NVT = TLI.getTypeToTransformTo(*DAG.getContext(), VT);
SDLoc dl(N);		SDLoc dl(N);

SDValue LL, LH, RL, RH;		SDValue LL, LH, RL, RH;
GetExpandedInteger(N->getOperand(0), LL, LH);		GetExpandedInteger(N->getOperand(0), LL, LH);
GetExpandedInteger(N->getOperand(1), RL, RH);		GetExpandedInteger(N->getOperand(1), RL, RH);

if (TLI.expandMUL(N, Lo, Hi, NVT, DAG, LL, LH, RL, RH))		if (TLI.expandMUL(N, Lo, Hi, NVT, DAG, true, LL, LH, RL, RH))
return;		return;

// If nothing else, we can make a libcall.		// If nothing else, we can make a libcall.
RTLIB::Libcall LC = RTLIB::UNKNOWN_LIBCALL;		RTLIB::Libcall LC = RTLIB::UNKNOWN_LIBCALL;
if (VT == MVT::i16)		if (VT == MVT::i16)
LC = RTLIB::MUL_I16;		LC = RTLIB::MUL_I16;
else if (VT == MVT::i32)		else if (VT == MVT::i32)
LC = RTLIB::MUL_I32;		LC = RTLIB::MUL_I32;
else if (VT == MVT::i64)		else if (VT == MVT::i64)
LC = RTLIB::MUL_I64;		LC = RTLIB::MUL_I64;
else if (VT == MVT::i128)		else if (VT == MVT::i128)
LC = RTLIB::MUL_I128;		LC = RTLIB::MUL_I128;

if (LC == RTLIB::UNKNOWN_LIBCALL) {		if (LC == RTLIB::UNKNOWN_LIBCALL) {
// We'll expand the multiplication by brute force because we have no other		// We'll expand the multiplication by brute force because we have no other
// options. This is a trivially-generalized version of the code from		// options. This is a trivially-generalized version of the code from
// Hacker's Delight (itself derived from Knuth's Algorithm M from section		// Hacker's Delight (itself derived from Knuth's Algorithm M from section
// 4.3.1).		// 4.3.1).
		efriedmaUnsubmitted Not Done Reply Inline Actions Can you replace this code with a call to expandMUL? efriedma: Can you replace this code with a call to expandMUL?
		nhaehnleAuthorUnsubmitted Not Done Reply Inline Actions I played around with that, but I'd rather not do it in this patch. The generated code sequences are different, and in particular we'd generate non-legal instructions that at least the X86 target doesn't expect and doesn't handle properly. This is visible in test/CodeGen/X86/imul-256/512/1024.ll. nhaehnle: I played around with that, but I'd rather not do it in this patch. The generated code sequences…
		efriedmaUnsubmitted Not Done Reply Inline Actions Okay, that's fine. efriedma: Okay, that's fine.
unsigned Bits = NVT.getSizeInBits();		unsigned Bits = NVT.getSizeInBits();
unsigned HalfBits = Bits >> 1;		unsigned HalfBits = Bits >> 1;
SDValue Mask = DAG.getConstant(APInt::getLowBitsSet(Bits, HalfBits), dl,		SDValue Mask = DAG.getConstant(APInt::getLowBitsSet(Bits, HalfBits), dl,
NVT);		NVT);
SDValue LLL = DAG.getNode(ISD::AND, dl, NVT, LL, Mask);		SDValue LLL = DAG.getNode(ISD::AND, dl, NVT, LL, Mask);
SDValue RLL = DAG.getNode(ISD::AND, dl, NVT, RL, Mask);		SDValue RLL = DAG.getNode(ISD::AND, dl, NVT, RL, Mask);

SDValue T = DAG.getNode(ISD::MUL, dl, NVT, LLL, RLL);		SDValue T = DAG.getNode(ISD::MUL, dl, NVT, LLL, RLL);
▲ Show 20 Lines • Show All 1,198 Lines • Show Last 20 Lines

lib/CodeGen/SelectionDAG/TargetLowering.cpp

Show First 20 Lines • Show All 3,009 Lines • ▼ Show 20 Lines	verifyReturnAddressArgumentIsConstant(SDValue Op, SelectionDAG &DAG) const {

return false;		return false;
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Legalization Utilities		// Legalization Utilities
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

bool TargetLowering::expandMUL(SDNode *N, SDValue &Lo, SDValue &Hi, EVT HiLoVT,		bool TargetLowering::expandMUL_LOHI(unsigned Opcode, EVT VT, SDLoc dl,
SelectionDAG &DAG, SDValue LL, SDValue LH,		SDValue LHS, SDValue RHS,
SDValue RL, SDValue RH) const {		SmallVectorImpl<SDValue> &Result,
EVT VT = N->getValueType(0);		EVT HalfVT, SelectionDAG &DAG,
SDLoc dl(N);		bool OnlyLegalOrCustom, SDValue LL,
		SDValue LH, SDValue RL, SDValue RH) const {
bool HasMULHS = isOperationLegalOrCustom(ISD::MULHS, HiLoVT);		assert(Opcode == ISD::MUL \|\| Opcode == ISD::UMUL_LOHI \|\|
bool HasMULHU = isOperationLegalOrCustom(ISD::MULHU, HiLoVT);		Opcode == ISD::SMUL_LOHI);
bool HasSMUL_LOHI = isOperationLegalOrCustom(ISD::SMUL_LOHI, HiLoVT);
bool HasUMUL_LOHI = isOperationLegalOrCustom(ISD::UMUL_LOHI, HiLoVT);		bool HasMULHS =
if (HasMULHU \|\| HasMULHS \|\| HasUMUL_LOHI \|\| HasSMUL_LOHI) {		!OnlyLegalOrCustom \|\| isOperationLegalOrCustom(ISD::MULHS, HalfVT);
		bool HasMULHU =
		!OnlyLegalOrCustom \|\| isOperationLegalOrCustom(ISD::MULHU, HalfVT);
		bool HasSMUL_LOHI =
		!OnlyLegalOrCustom \|\| isOperationLegalOrCustom(ISD::SMUL_LOHI, HalfVT);
		bool HasUMUL_LOHI =
		!OnlyLegalOrCustom \|\| isOperationLegalOrCustom(ISD::UMUL_LOHI, HalfVT);
unsigned OuterBitSize = VT.getSizeInBits();		unsigned OuterBitSize = VT.getSizeInBits();
		RKSimonUnsubmitted Done Reply Inline Actions If this is going to start being used by vectors, shouldn't these getSizeInBits() calls be replaced with getScalarSizeInBits()? RKSimon: If this is going to start being used by vectors, shouldn't these getSizeInBits() calls be…
		nhaehnleAuthorUnsubmitted Not Done Reply Inline Actions Fixed, thanks. nhaehnle: Fixed, thanks.
unsigned InnerBitSize = HiLoVT.getSizeInBits();		unsigned InnerBitSize = HalfVT.getSizeInBits();
unsigned LHSSB = DAG.ComputeNumSignBits(N->getOperand(0));		unsigned LHSSB = DAG.ComputeNumSignBits(LHS);
unsigned RHSSB = DAG.ComputeNumSignBits(N->getOperand(1));		unsigned RHSSB = DAG.ComputeNumSignBits(RHS);

// LL, LH, RL, and RH must be either all NULL or all set to a value.		// LL, LH, RL, and RH must be either all NULL or all set to a value.
assert((LL.getNode() && LH.getNode() && RL.getNode() && RH.getNode()) \|\|		assert((LL.getNode() && LH.getNode() && RL.getNode() && RH.getNode()) \|\|
(!LL.getNode() && !LH.getNode() && !RL.getNode() && !RH.getNode()));		(!LL.getNode() && !LH.getNode() && !RL.getNode() && !RH.getNode()));

if (!LL.getNode() && !RL.getNode() &&		if (!HasMULHS && !HasMULHU && !HasSMUL_LOHI && !HasUMUL_LOHI)
isOperationLegalOrCustom(ISD::TRUNCATE, HiLoVT)) {
LL = DAG.getNode(ISD::TRUNCATE, dl, HiLoVT, N->getOperand(0));
RL = DAG.getNode(ISD::TRUNCATE, dl, HiLoVT, N->getOperand(1));
}

if (!LL.getNode())
return false;		return false;

APInt HighMask = APInt::getHighBitsSet(OuterBitSize, InnerBitSize);		SDVTList VTs = DAG.getVTList(HalfVT, HalfVT);
if (DAG.MaskedValueIsZero(N->getOperand(0), HighMask) &&		auto MakeUMUL_LOHI = [&](SDValue L, SDValue R, SDValue &Lo,
DAG.MaskedValueIsZero(N->getOperand(1), HighMask)) {		SDValue &Hi) -> bool {
// The inputs are both zero-extended.
if (HasUMUL_LOHI) {		if (HasUMUL_LOHI) {
// We can emit a umul_lohi.		Lo = DAG.getNode(ISD::UMUL_LOHI, dl, VTs, L, R);
Lo = DAG.getNode(ISD::UMUL_LOHI, dl, DAG.getVTList(HiLoVT, HiLoVT), LL,
RL);
Hi = SDValue(Lo.getNode(), 1);		Hi = SDValue(Lo.getNode(), 1);
return true;		return true;
}		}
if (HasMULHU) {		if (HasMULHU) {
// We can emit a mulhu+mul.		Lo = DAG.getNode(ISD::MUL, dl, HalfVT, L, R);
Lo = DAG.getNode(ISD::MUL, dl, HiLoVT, LL, RL);		Hi = DAG.getNode(ISD::MULHU, dl, HalfVT, L, R);
Hi = DAG.getNode(ISD::MULHU, dl, HiLoVT, LL, RL);
return true;		return true;
}		}
}		return false;
if (LHSSB > InnerBitSize && RHSSB > InnerBitSize) {		};
// The input values are both sign-extended.		auto MakeSMUL_LOHI = [&](SDValue L, SDValue R, SDValue &Lo,
		SDValue &Hi) -> bool {
if (HasSMUL_LOHI) {		if (HasSMUL_LOHI) {
// We can emit a smul_lohi.		Lo = DAG.getNode(ISD::SMUL_LOHI, dl, VTs, L, R);
Lo = DAG.getNode(ISD::SMUL_LOHI, dl, DAG.getVTList(HiLoVT, HiLoVT), LL,
RL);
Hi = SDValue(Lo.getNode(), 1);		Hi = SDValue(Lo.getNode(), 1);
return true;		return true;
}		}
if (HasMULHS) {		if (HasMULHS) {
// We can emit a mulhs+mul.		Lo = DAG.getNode(ISD::MUL, dl, HalfVT, L, R);
Lo = DAG.getNode(ISD::MUL, dl, HiLoVT, LL, RL);		Hi = DAG.getNode(ISD::MULHS, dl, HalfVT, L, R);
Hi = DAG.getNode(ISD::MULHS, dl, HiLoVT, LL, RL);		return true;
		}
		return false;
		};

		SDValue Lo, Hi;

		if (!LL.getNode() && !RL.getNode() &&
		efriedmaUnsubmitted Not Done Reply Inline Actions MaskedValueIsZero should do the right thing for vectors, I think, with recent changes to computeKnownBits. efriedma: MaskedValueIsZero should do the right thing for vectors, I think, with recent changes to…
		isOperationLegalOrCustom(ISD::TRUNCATE, HalfVT)) {
		LL = DAG.getNode(ISD::TRUNCATE, dl, HalfVT, LHS);
		RL = DAG.getNode(ISD::TRUNCATE, dl, HalfVT, RHS);
		}

		if (!LL.getNode())
		return false;

		APInt HighMask = APInt::getHighBitsSet(OuterBitSize, InnerBitSize);
		if (DAG.MaskedValueIsZero(LHS, HighMask) &&
		DAG.MaskedValueIsZero(RHS, HighMask)) {
		// The inputs are both zero-extended.
		if (MakeUMUL_LOHI(LL, RL, Lo, Hi)) {
		Result.push_back(Lo);
		Result.push_back(Hi);
		if (Opcode != ISD::MUL) {
		SDValue Zero = DAG.getConstant(0, dl, HalfVT);
		Result.push_back(Zero);
		Result.push_back(Zero);
		}
		return true;
		}
		}

		if (LHSSB > InnerBitSize && RHSSB > InnerBitSize && Opcode == ISD::MUL) {
		// The input values are both sign-extended.
		// TODO non-MUL case?
		if (MakeSMUL_LOHI(LL, RL, Lo, Hi)) {
		Result.push_back(Lo);
		Result.push_back(Hi);
return true;		return true;
}		}
}		}

		efriedmaUnsubmitted Not Done Reply Inline Actions Hmm... I was going to say that ShiftAmount always fits into ShiftAmountTy because it's a defined shift (ShiftAmount < OuterBitSize < maximum unsigned value), but getShiftAmountTy doesn't always return a sensible result for illegal types. Please just note this problem with a FIXME. efriedma: Hmm... I was going to say that ShiftAmount always fits into ShiftAmountTy because it's a…
if (!LH.getNode() && !RH.getNode() &&		if (!LH.getNode() && !RH.getNode() &&
isOperationLegalOrCustom(ISD::SRL, VT) &&		isOperationLegalOrCustom(ISD::SRL, VT) &&
isOperationLegalOrCustom(ISD::TRUNCATE, HiLoVT)) {		isOperationLegalOrCustom(ISD::TRUNCATE, HalfVT)) {
auto &DL = DAG.getDataLayout();		auto &DL = DAG.getDataLayout();
unsigned ShiftAmt = VT.getSizeInBits() - HiLoVT.getSizeInBits();		unsigned ShiftAmt = VT.getSizeInBits() - HalfVT.getSizeInBits();
SDValue Shift = DAG.getConstant(ShiftAmt, dl, getShiftAmountTy(VT, DL));		SDValue Shift = DAG.getConstant(ShiftAmt, dl, getShiftAmountTy(VT, DL));
LH = DAG.getNode(ISD::SRL, dl, VT, N->getOperand(0), Shift);		LH = DAG.getNode(ISD::SRL, dl, VT, LHS, Shift);
LH = DAG.getNode(ISD::TRUNCATE, dl, HiLoVT, LH);		LH = DAG.getNode(ISD::TRUNCATE, dl, HalfVT, LH);
RH = DAG.getNode(ISD::SRL, dl, VT, N->getOperand(1), Shift);		RH = DAG.getNode(ISD::SRL, dl, VT, RHS, Shift);
RH = DAG.getNode(ISD::TRUNCATE, dl, HiLoVT, RH);		RH = DAG.getNode(ISD::TRUNCATE, dl, HalfVT, RH);
}		}

if (!LH.getNode())		if (!LH.getNode())
return false;		return false;

if (HasUMUL_LOHI) {		if (!MakeUMUL_LOHI(LL, RL, Lo, Hi))
// Lo,Hi = umul LHS, RHS.		return false;
SDValue UMulLOHI = DAG.getNode(ISD::UMUL_LOHI, dl,
DAG.getVTList(HiLoVT, HiLoVT), LL, RL);		Result.push_back(Lo);
Lo = UMulLOHI;
Hi = UMulLOHI.getValue(1);		if (Opcode == ISD::MUL) {
RH = DAG.getNode(ISD::MUL, dl, HiLoVT, LL, RH);		RH = DAG.getNode(ISD::MUL, dl, HalfVT, LL, RH);
LH = DAG.getNode(ISD::MUL, dl, HiLoVT, LH, RL);		LH = DAG.getNode(ISD::MUL, dl, HalfVT, LH, RL);
Hi = DAG.getNode(ISD::ADD, dl, HiLoVT, Hi, RH);		Hi = DAG.getNode(ISD::ADD, dl, HalfVT, Hi, RH);
Hi = DAG.getNode(ISD::ADD, dl, HiLoVT, Hi, LH);		Hi = DAG.getNode(ISD::ADD, dl, HalfVT, Hi, LH);
		Result.push_back(Hi);
return true;		return true;
}		}
if (HasMULHU) {
Lo = DAG.getNode(ISD::MUL, dl, HiLoVT, LL, RL);		SDValue Next = Hi;
Hi = DAG.getNode(ISD::MULHU, dl, HiLoVT, LL, RL);		if (!MakeUMUL_LOHI(LL, RH, Lo, Hi))
RH = DAG.getNode(ISD::MUL, dl, HiLoVT, LL, RH);		return false;
LH = DAG.getNode(ISD::MUL, dl, HiLoVT, LH, RL);
Hi = DAG.getNode(ISD::ADD, dl, HiLoVT, Hi, RH);		SDVTList CarryVTs = DAG.getVTList(HalfVT, MVT::Glue);
Hi = DAG.getNode(ISD::ADD, dl, HiLoVT, Hi, LH);		SDValue SumLo, SumHi;
		SumHi = Hi;
		SumLo = DAG.getNode(ISD::ADDC, dl, CarryVTs, Next, Lo);

		if (!MakeUMUL_LOHI(LH, RL, Lo, Hi))
		return false;

		SumHi = DAG.getNode(ISD::ADDE, dl, CarryVTs, SumHi, Hi, SumLo.getValue(1));
		SumLo = DAG.getNode(ISD::ADDC, dl, CarryVTs, SumLo, Lo);
		efriedmaUnsubmitted Done Reply Inline Actions Why are you using ADDC+ADDE rather than just plain ADD? It seems strange to split the operation, given that it might be legal. (I might be missing something here, though.) efriedma: Why are you using ADDC+ADDE rather than just plain ADD? It seems strange to split the…
		nhaehnleAuthorUnsubmitted Not Done Reply Inline Actions That's just how the code happened to evolve because AMDGPU (which is where this all started) doesn't have 64-bit additions, but it's a good point. I'm changing it to use additions in the larger type. nhaehnle: That's just how the code happened to evolve because AMDGPU (which is where this all started)…
		Result.push_back(SumLo);

		if (!(Opcode == ISD::UMUL_LOHI ? MakeUMUL_LOHI(LH, RH, Lo, Hi)
		: MakeSMUL_LOHI(LH, RH, Lo, Hi)))
		return false;

		SDValue Zero = DAG.getConstant(0, dl, HalfVT);
		SumLo = DAG.getNode(ISD::ADDE, dl, CarryVTs, SumHi, Lo, SumLo.getValue(1));
		SumHi = DAG.getNode(ISD::ADDE, dl, CarryVTs, Hi, Zero, SumHi.getValue(1));
		SumHi = DAG.getNode(ISD::ADDE, dl, CarryVTs, SumHi, Zero, SumLo.getValue(1));

		if (Opcode == ISD::SMUL_LOHI) {
		SDValue LoSub = DAG.getNode(ISD::SUBC, dl, CarryVTs, SumLo, RL);
		SDValue HiSub =
		DAG.getNode(ISD::SUBE, dl, CarryVTs, SumHi, Zero, LoSub.getValue(1));

		SumLo = DAG.getSelectCC(dl, LH, Zero, LoSub, SumLo, ISD::SETLT);
		SumHi = DAG.getSelectCC(dl, LH, Zero, HiSub, SumHi, ISD::SETLT);

		LoSub = DAG.getNode(ISD::SUBC, dl, CarryVTs, SumLo, LL);
		HiSub =
		DAG.getNode(ISD::SUBE, dl, CarryVTs, SumHi, Zero, LoSub.getValue(1));

		SumLo = DAG.getSelectCC(dl, RH, Zero, LoSub, SumLo, ISD::SETLT);
		SumHi = DAG.getSelectCC(dl, RH, Zero, HiSub, SumHi, ISD::SETLT);
		}

		Result.push_back(SumLo);
		Result.push_back(SumHi);
return true;		return true;
}		}

		bool TargetLowering::expandMUL(SDNode *N, SDValue &Lo, SDValue &Hi, EVT HiLoVT,
		SelectionDAG &DAG, bool OnlyLegalOrCustom,
		SDValue LL, SDValue LH, SDValue RL,
		SDValue RH) const {
		SmallVector<SDValue, 2> Result;
		bool Ok = expandMUL_LOHI(N->getOpcode(), N->getValueType(0), N,
		N->getOperand(0), N->getOperand(1), Result, HiLoVT,
		DAG, OnlyLegalOrCustom, LL, LH, RL, RH);
		if (Result.size() >= 2) {
		Lo = Result[0];
		Hi = Result[1];
}		}
return false;		return Ok;
}		}

		efriedmaUnsubmitted Not Done Reply Inline Actions Should there be an assertion here? If expandMUL_LOHI isn't returning exactly two results, something went wrong. efriedma: Should there be an assertion here? If expandMUL_LOHI isn't returning exactly two results…
bool TargetLowering::expandFP_TO_SINT(SDNode *Node, SDValue &Result,		bool TargetLowering::expandFP_TO_SINT(SDNode *Node, SDValue &Result,
SelectionDAG &DAG) const {		SelectionDAG &DAG) const {
EVT VT = Node->getOperand(0).getValueType();		EVT VT = Node->getOperand(0).getValueType();
EVT NVT = Node->getValueType(0);		EVT NVT = Node->getValueType(0);
SDLoc dl(SDValue(Node, 0));		SDLoc dl(SDValue(Node, 0));

// FIXME: Only f32 to i64 conversions are supported.		// FIXME: Only f32 to i64 conversions are supported.
if (VT != MVT::f32 \|\| NVT != MVT::i64)		if (VT != MVT::f32 \|\| NVT != MVT::i64)
▲ Show 20 Lines • Show All 484 Lines • Show Last 20 Lines

lib/Target/AMDGPU/AMDGPUISelLowering.h

Show First 20 Lines • Show All 115 Lines • ▼ Show 20 Lines	public:
bool isTruncateFree(Type Src, Type Dest) const override;		bool isTruncateFree(Type Src, Type Dest) const override;

bool isZExtFree(Type Src, Type Dest) const override;		bool isZExtFree(Type Src, Type Dest) const override;
bool isZExtFree(EVT Src, EVT Dest) const override;		bool isZExtFree(EVT Src, EVT Dest) const override;
bool isZExtFree(SDValue Val, EVT VT2) const override;		bool isZExtFree(SDValue Val, EVT VT2) const override;

bool isNarrowingProfitable(EVT VT1, EVT VT2) const override;		bool isNarrowingProfitable(EVT VT1, EVT VT2) const override;

		MulExpansion getMulExpansion(unsigned Opcode, MVT VT) const override;

MVT getVectorIdxTy(const DataLayout &) const override;		MVT getVectorIdxTy(const DataLayout &) const override;
bool isSelectSupported(SelectSupportKind) const override;		bool isSelectSupported(SelectSupportKind) const override;

bool isFPImmLegal(const APFloat &Imm, EVT VT) const override;		bool isFPImmLegal(const APFloat &Imm, EVT VT) const override;
bool ShouldShrinkFPConstant(EVT VT) const override;		bool ShouldShrinkFPConstant(EVT VT) const override;
bool shouldReduceLoadWidth(SDNode *Load,		bool shouldReduceLoadWidth(SDNode *Load,
ISD::LoadExtType ExtType,		ISD::LoadExtType ExtType,
EVT ExtVT) const override;		EVT ExtVT) const override;
▲ Show 20 Lines • Show All 193 Lines • Show Last 20 Lines

lib/Target/AMDGPU/AMDGPUISelLowering.cpp

Show First 20 Lines • Show All 616 Lines • ▼ Show 20 Lines	bool AMDGPUTargetLowering::isNarrowingProfitable(EVT SrcVT, EVT DestVT) const {
// limited number of native 64-bit operations. Shrinking an operation to fit		// limited number of native 64-bit operations. Shrinking an operation to fit
// in a single 32-bit register should always be helpful. As currently used,		// in a single 32-bit register should always be helpful. As currently used,
// this is much less general than the name suggests, and is only used in		// this is much less general than the name suggests, and is only used in
// places trying to reduce the sizes of loads. Shrinking loads to < 32-bits is		// places trying to reduce the sizes of loads. Shrinking loads to < 32-bits is
// not profitable, and may actually be harmful.		// not profitable, and may actually be harmful.
return SrcVT.getSizeInBits() > 32 && DestVT.getSizeInBits() == 32;		return SrcVT.getSizeInBits() > 32 && DestVT.getSizeInBits() == 32;
}		}

		TargetLowering::MulExpansion
		AMDGPUTargetLowering::getMulExpansion(unsigned Opcode, MVT VT) const {
		if (VT.getSizeInBits() > 32)
		return MulExpansion::HalfWidth;
		return MulExpansion::Split;
		}

//===---------------------------------------------------------------------===//		//===---------------------------------------------------------------------===//
// TargetLowering Callbacks		// TargetLowering Callbacks
//===---------------------------------------------------------------------===//		//===---------------------------------------------------------------------===//

/// The SelectionDAGBuilder will automatically promote function arguments		/// The SelectionDAGBuilder will automatically promote function arguments
/// with illegal types. However, this does not work for the AMDGPU targets		/// with illegal types. However, this does not work for the AMDGPU targets
/// since the function arguments are stored in memory as these illegal types.		/// since the function arguments are stored in memory as these illegal types.
/// In order to handle this properly we need to get the original types sizes		/// In order to handle this properly we need to get the original types sizes
▲ Show 20 Lines • Show All 2,343 Lines • Show Last 20 Lines

lib/Target/BPF/BPFISelLowering.h

Show All 40 Lines	public:

// This method returns the name of a target specific DAG node.		// This method returns the name of a target specific DAG node.
const char *getTargetNodeName(unsigned Opcode) const override;		const char *getTargetNodeName(unsigned Opcode) const override;

MachineBasicBlock *		MachineBasicBlock *
EmitInstrWithCustomInserter(MachineInstr &MI,		EmitInstrWithCustomInserter(MachineInstr &MI,
MachineBasicBlock *BB) const override;		MachineBasicBlock *BB) const override;

		MulExpansion getMulExpansion(unsigned Opcode, MVT VT) const override;

private:		private:
SDValue LowerBR_CC(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerBR_CC(SDValue Op, SelectionDAG &DAG) const;
SDValue LowerSELECT_CC(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerSELECT_CC(SDValue Op, SelectionDAG &DAG) const;
SDValue LowerGlobalAddress(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerGlobalAddress(SDValue Op, SelectionDAG &DAG) const;

// Lower the result values of a call, copying them out of physregs into vregs		// Lower the result values of a call, copying them out of physregs into vregs
SDValue LowerCallResult(SDValue Chain, SDValue InFlag,		SDValue LowerCallResult(SDValue Chain, SDValue InFlag,
CallingConv::ID CallConv, bool IsVarArg,		CallingConv::ID CallConv, bool IsVarArg,
Show All 37 Lines

lib/Target/BPF/BPFISelLowering.cpp

Show First 20 Lines • Show All 126 Lines • ▼ Show 20 Lines	BPFTargetLowering::BPFTargetLowering(const TargetMachine &TM,
setPrefFunctionAlignment(3);		setPrefFunctionAlignment(3);

// inline memcpy() for kernel to see explicit copy		// inline memcpy() for kernel to see explicit copy
MaxStoresPerMemset = MaxStoresPerMemsetOptSize = 128;		MaxStoresPerMemset = MaxStoresPerMemsetOptSize = 128;
MaxStoresPerMemcpy = MaxStoresPerMemcpyOptSize = 128;		MaxStoresPerMemcpy = MaxStoresPerMemcpyOptSize = 128;
MaxStoresPerMemmove = MaxStoresPerMemmoveOptSize = 128;		MaxStoresPerMemmove = MaxStoresPerMemmoveOptSize = 128;
}		}

		TargetLowering::MulExpansion BPFTargetLowering::getMulExpansion(unsigned Opcode,
		MVT VT) const {
		return MulExpansion::HalfWidth;
		}

SDValue BPFTargetLowering::LowerOperation(SDValue Op, SelectionDAG &DAG) const {		SDValue BPFTargetLowering::LowerOperation(SDValue Op, SelectionDAG &DAG) const {
switch (Op.getOpcode()) {		switch (Op.getOpcode()) {
case ISD::BR_CC:		case ISD::BR_CC:
return LowerBR_CC(Op, DAG);		return LowerBR_CC(Op, DAG);
case ISD::GlobalAddress:		case ISD::GlobalAddress:
return LowerGlobalAddress(Op, DAG);		return LowerGlobalAddress(Op, DAG);
case ISD::SELECT_CC:		case ISD::SELECT_CC:
return LowerSELECT_CC(Op, DAG);		return LowerSELECT_CC(Op, DAG);
▲ Show 20 Lines • Show All 454 Lines • Show Last 20 Lines

lib/Target/PowerPC/PPCISelLowering.h

Show First 20 Lines • Show All 561 Lines • ▼ Show 20 Lines	public:
SDValue expandVSXLoadForLE(SDNode *N, DAGCombinerInfo &DCI) const;		SDValue expandVSXLoadForLE(SDNode *N, DAGCombinerInfo &DCI) const;
SDValue expandVSXStoreForLE(SDNode *N, DAGCombinerInfo &DCI) const;		SDValue expandVSXStoreForLE(SDNode *N, DAGCombinerInfo &DCI) const;

SDValue PerformDAGCombine(SDNode *N, DAGCombinerInfo &DCI) const override;		SDValue PerformDAGCombine(SDNode *N, DAGCombinerInfo &DCI) const override;

SDValue BuildSDIVPow2(SDNode *N, const APInt &Divisor, SelectionDAG &DAG,		SDValue BuildSDIVPow2(SDNode *N, const APInt &Divisor, SelectionDAG &DAG,
std::vector<SDNode > Created) const override;		std::vector<SDNode > Created) const override;

		MulExpansion getMulExpansion(unsigned Opcode, MVT VT) const override;

unsigned getRegisterByName(const char* RegName, EVT VT,		unsigned getRegisterByName(const char* RegName, EVT VT,
SelectionDAG &DAG) const override;		SelectionDAG &DAG) const override;

void computeKnownBitsForTargetNode(const SDValue Op,		void computeKnownBitsForTargetNode(const SDValue Op,
APInt &KnownZero,		APInt &KnownZero,
APInt &KnownOne,		APInt &KnownOne,
const SelectionDAG &DAG,		const SelectionDAG &DAG,
unsigned Depth = 0) const override;		unsigned Depth = 0) const override;
▲ Show 20 Lines • Show All 419 Lines • Show Last 20 Lines

lib/Target/PowerPC/PPCISelLowering.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 11,429 Lines • ▼ Show 20 Lines	if (IsNegPow2) {
Op = DAG.getNode(ISD::SUB, DL, VT, DAG.getConstant(0, DL, VT), Op);		Op = DAG.getNode(ISD::SUB, DL, VT, DAG.getConstant(0, DL, VT), Op);
if (Created)		if (Created)
Created->push_back(Op.getNode());		Created->push_back(Op.getNode());
}		}

return Op;		return Op;
}		}

		TargetLowering::MulExpansion PPCTargetLowering::getMulExpansion(unsigned Opcode,
		MVT VT) const {
		if (VT.isVector())
		return MulExpansion::Scalarize;
		return MulExpansion::Split;
		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Inline Assembly Support		// Inline Assembly Support
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

void PPCTargetLowering::computeKnownBitsForTargetNode(const SDValue Op,		void PPCTargetLowering::computeKnownBitsForTargetNode(const SDValue Op,
APInt &KnownZero,		APInt &KnownZero,
APInt &KnownOne,		APInt &KnownOne,
const SelectionDAG &DAG,		const SelectionDAG &DAG,
▲ Show 20 Lines • Show All 903 Lines • Show Last 20 Lines

lib/Target/Sparc/SparcISelLowering.h

Show First 20 Lines • Show All 199 Lines • ▼ Show 20 Lines	public:

bool shouldInsertFencesForAtomic(const Instruction *I) const override {		bool shouldInsertFencesForAtomic(const Instruction *I) const override {
// FIXME: We insert fences for each atomics and generate		// FIXME: We insert fences for each atomics and generate
// sub-optimal code for PSO/TSO. (Approximately nobody uses any		// sub-optimal code for PSO/TSO. (Approximately nobody uses any
// mode but TSO, which makes this even more silly)		// mode but TSO, which makes this even more silly)
return true;		return true;
}		}

		MulExpansion getMulExpansion(unsigned Opcode, MVT VT) const override;

AtomicExpansionKind shouldExpandAtomicRMWInIR(AtomicRMWInst *AI) const override;		AtomicExpansionKind shouldExpandAtomicRMWInIR(AtomicRMWInst *AI) const override;

void ReplaceNodeResults(SDNode *N,		void ReplaceNodeResults(SDNode *N,
SmallVectorImpl<SDValue>& Results,		SmallVectorImpl<SDValue>& Results,
SelectionDAG &DAG) const override;		SelectionDAG &DAG) const override;

MachineBasicBlock expandSelectCC(MachineInstr &MI, MachineBasicBlock BB,		MachineBasicBlock expandSelectCC(MachineInstr &MI, MachineBasicBlock BB,
unsigned BROpcode) const;		unsigned BROpcode) const;
MachineBasicBlock *emitEHSjLjSetJmp(MachineInstr &MI,		MachineBasicBlock *emitEHSjLjSetJmp(MachineInstr &MI,
MachineBasicBlock *MBB) const;		MachineBasicBlock *MBB) const;
MachineBasicBlock *emitEHSjLjLongJmp(MachineInstr &MI,		MachineBasicBlock *emitEHSjLjLongJmp(MachineInstr &MI,
MachineBasicBlock *MBB) const;		MachineBasicBlock *MBB) const;
};		};
} // end namespace llvm		} // end namespace llvm

#endif // SPARC_ISELLOWERING_H		#endif // SPARC_ISELLOWERING_H

lib/Target/Sparc/SparcISelLowering.cpp

Show First 20 Lines • Show All 1,388 Lines • ▼ Show 20 Lines	SparcTargetLowering::LowerCall_64(TargetLowering::CallLoweringInfo &CLI,

return Chain;		return Chain;
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// TargetLowering Implementation		// TargetLowering Implementation
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

		TargetLowering::MulExpansion
		SparcTargetLowering::getMulExpansion(unsigned Opcode, MVT VT) const {
		if (VT.getSizeInBits() > 32)
		return MulExpansion::HalfWidth;
		return MulExpansion::Split;
		}

TargetLowering::AtomicExpansionKind SparcTargetLowering::shouldExpandAtomicRMWInIR(AtomicRMWInst *AI) const {		TargetLowering::AtomicExpansionKind SparcTargetLowering::shouldExpandAtomicRMWInIR(AtomicRMWInst *AI) const {
if (AI->getOperation() == AtomicRMWInst::Xchg &&		if (AI->getOperation() == AtomicRMWInst::Xchg &&
AI->getType()->getPrimitiveSizeInBits() == 32)		AI->getType()->getPrimitiveSizeInBits() == 32)
return AtomicExpansionKind::None; // Uses xchg instruction		return AtomicExpansionKind::None; // Uses xchg instruction

return AtomicExpansionKind::CmpXChg;		return AtomicExpansionKind::CmpXChg;
}		}

▲ Show 20 Lines • Show All 275 Lines • ▼ Show 20 Lines	SparcTargetLowering::SparcTargetLowering(const TargetMachine &TM,
setOperationAction(ISD::FPOW , MVT::f128, Expand);		setOperationAction(ISD::FPOW , MVT::f128, Expand);
setOperationAction(ISD::FPOW , MVT::f64, Expand);		setOperationAction(ISD::FPOW , MVT::f64, Expand);
setOperationAction(ISD::FPOW , MVT::f32, Expand);		setOperationAction(ISD::FPOW , MVT::f32, Expand);

setOperationAction(ISD::SHL_PARTS, MVT::i32, Expand);		setOperationAction(ISD::SHL_PARTS, MVT::i32, Expand);
setOperationAction(ISD::SRA_PARTS, MVT::i32, Expand);		setOperationAction(ISD::SRA_PARTS, MVT::i32, Expand);
setOperationAction(ISD::SRL_PARTS, MVT::i32, Expand);		setOperationAction(ISD::SRL_PARTS, MVT::i32, Expand);

		if (Subtarget->is64Bit()) {
// FIXME: Sparc provides these multiplies, but we don't have them yet.		// FIXME: Sparc provides these multiplies, but we don't have them yet.
		efriedmaUnsubmitted Done Reply Inline Actions This FIXME doesn't make sense here; Promote is just the right thing to do here. efriedma: This FIXME doesn't make sense here; Promote is just the right thing to do here.
		nhaehnleAuthorUnsubmitted Not Done Reply Inline Actions FIXME removed. nhaehnle: FIXME removed.
setOperationAction(ISD::UMUL_LOHI, MVT::i32, Expand);		setOperationAction(ISD::UMUL_LOHI, MVT::i32, Promote);
setOperationAction(ISD::SMUL_LOHI, MVT::i32, Expand);		setOperationAction(ISD::SMUL_LOHI, MVT::i32, Promote);

if (Subtarget->is64Bit()) {
setOperationAction(ISD::UMUL_LOHI, MVT::i64, Expand);		setOperationAction(ISD::UMUL_LOHI, MVT::i64, Expand);
setOperationAction(ISD::SMUL_LOHI, MVT::i64, Expand);		setOperationAction(ISD::SMUL_LOHI, MVT::i64, Expand);
setOperationAction(ISD::MULHU, MVT::i64, Expand);		setOperationAction(ISD::MULHU, MVT::i64, Expand);
setOperationAction(ISD::MULHS, MVT::i64, Expand);		setOperationAction(ISD::MULHS, MVT::i64, Expand);

setOperationAction(ISD::UMULO, MVT::i64, Custom);		setOperationAction(ISD::UMULO, MVT::i64, Custom);
setOperationAction(ISD::SMULO, MVT::i64, Custom);		setOperationAction(ISD::SMULO, MVT::i64, Custom);

setOperationAction(ISD::SHL_PARTS, MVT::i64, Expand);		setOperationAction(ISD::SHL_PARTS, MVT::i64, Expand);
setOperationAction(ISD::SRA_PARTS, MVT::i64, Expand);		setOperationAction(ISD::SRA_PARTS, MVT::i64, Expand);
setOperationAction(ISD::SRL_PARTS, MVT::i64, Expand);		setOperationAction(ISD::SRL_PARTS, MVT::i64, Expand);
		} else {
		// FIXME: Sparc provides these multiplies, but we don't have them yet.
		setOperationAction(ISD::UMUL_LOHI, MVT::i32, Expand);
		setOperationAction(ISD::SMUL_LOHI, MVT::i32, Expand);
}		}

// VASTART needs to be custom lowered to use the VarArgsFrameIndex.		// VASTART needs to be custom lowered to use the VarArgsFrameIndex.
setOperationAction(ISD::VASTART , MVT::Other, Custom);		setOperationAction(ISD::VASTART , MVT::Other, Custom);
// VAARG needs to be lowered to not do unaligned accesses for doubles.		// VAARG needs to be lowered to not do unaligned accesses for doubles.
setOperationAction(ISD::VAARG , MVT::Other, Custom);		setOperationAction(ISD::VAARG , MVT::Other, Custom);

setOperationAction(ISD::TRAP , MVT::Other, Legal);		setOperationAction(ISD::TRAP , MVT::Other, Legal);
▲ Show 20 Lines • Show All 1,822 Lines • Show Last 20 Lines

lib/Target/X86/X86ISelLowering.h

Show First 20 Lines • Show All 1,020 Lines • ▼ Show 20 Lines	public:

bool isNoopAddrSpaceCast(unsigned SrcAS, unsigned DestAS) const override;		bool isNoopAddrSpaceCast(unsigned SrcAS, unsigned DestAS) const override;

/// \brief Customize the preferred legalization strategy for certain types.		/// \brief Customize the preferred legalization strategy for certain types.
LegalizeTypeAction getPreferredVectorAction(EVT VT) const override;		LegalizeTypeAction getPreferredVectorAction(EVT VT) const override;

bool isIntDivCheap(EVT VT, AttributeSet Attr) const override;		bool isIntDivCheap(EVT VT, AttributeSet Attr) const override;

		MulExpansion getMulExpansion(unsigned Opcode, MVT VT) const override;

bool supportSwiftError() const override;		bool supportSwiftError() const override;

protected:		protected:
std::pair<const TargetRegisterClass *, uint8_t>		std::pair<const TargetRegisterClass *, uint8_t>
findRepresentativeClass(const TargetRegisterInfo *TRI,		findRepresentativeClass(const TargetRegisterInfo *TRI,
MVT VT) const override;		MVT VT) const override;

private:		private:
▲ Show 20 Lines • Show All 237 Lines • Show Last 20 Lines

lib/Target/X86/X86ISelLowering.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 32,520 Lines • ▼ Show 20 Lines	bool X86TargetLowering::isIntDivCheap(EVT VT, AttributeSet Attr) const {
// integer division, leaving the division as-is is a loss even in terms of		// integer division, leaving the division as-is is a loss even in terms of
// size, because it will have to be scalarized, while the alternative code		// size, because it will have to be scalarized, while the alternative code
// sequence can be performed in vector form.		// sequence can be performed in vector form.
bool OptSize = Attr.hasAttribute(AttributeSet::FunctionIndex,		bool OptSize = Attr.hasAttribute(AttributeSet::FunctionIndex,
Attribute::MinSize);		Attribute::MinSize);
return OptSize && !VT.isVector();		return OptSize && !VT.isVector();
}		}

		TargetLowering::MulExpansion X86TargetLowering::getMulExpansion(unsigned Opcode,
		MVT VT) const {
		if (VT.getScalarSizeInBits() >= 64)
		return MulExpansion::Scalarize;
		efriedmaUnsubmitted Done Reply Inline Actions I'm confused; you're returning "Scalarize" for scalar types? efriedma: I'm confused; you're returning "Scalarize" for scalar types?
		nhaehnleAuthorUnsubmitted Not Done Reply Inline Actions On X86 this only gets called for vector types. I'm adding an assertion to make that explicit. nhaehnle: On X86 this only gets called for vector types. I'm adding an assertion to make that explicit.
		return MulExpansion::Split;
		}

void X86TargetLowering::initializeSplitCSR(MachineBasicBlock *Entry) const {		void X86TargetLowering::initializeSplitCSR(MachineBasicBlock *Entry) const {
if (!Subtarget.is64Bit())		if (!Subtarget.is64Bit())
return;		return;

// Update IsSplitCSR in X86MachineFunctionInfo.		// Update IsSplitCSR in X86MachineFunctionInfo.
X86MachineFunctionInfo *AFI =		X86MachineFunctionInfo *AFI =
Entry->getParent()->getInfo<X86MachineFunctionInfo>();		Entry->getParent()->getInfo<X86MachineFunctionInfo>();
AFI->setIsSplitCSR(true);		AFI->setIsSplitCSR(true);
▲ Show 20 Lines • Show All 44 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[SelectionDAG] Add expansion and promotion of [US]MUL_LOHIClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 72620

include/llvm/Target/TargetLowering.h

lib/CodeGen/SelectionDAG/LegalizeDAG.cpp

lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp

lib/CodeGen/SelectionDAG/TargetLowering.cpp

lib/Target/AMDGPU/AMDGPUISelLowering.h

lib/Target/AMDGPU/AMDGPUISelLowering.cpp

lib/Target/BPF/BPFISelLowering.h

lib/Target/BPF/BPFISelLowering.cpp

lib/Target/PowerPC/PPCISelLowering.h

lib/Target/PowerPC/PPCISelLowering.cpp

lib/Target/Sparc/SparcISelLowering.h

lib/Target/Sparc/SparcISelLowering.cpp

lib/Target/X86/X86ISelLowering.h

lib/Target/X86/X86ISelLowering.cpp

[SelectionDAG] Add expansion and promotion of [US]MUL_LOHI
ClosedPublic