This is an archive of the discontinued LLVM Phabricator instance.

Differential D25223

[SelectionDAG] Fix calling convention in expansion of ?MULO.
ClosedPublic

Authored by whitequark on Oct 3 2016, 8:59 PM.

Details

Reviewers

echristo
resistor

Commits

rG7c4fe0e9a323: [SelectionDAG] Fix calling convention in expansion of ?MULO.
rL283203: [SelectionDAG] Fix calling convention in expansion of ?MULO.

Summary

The SMULO/UMULO DAG nodes, when not directly supported by the target,
expand to a multiplication twice as wide. If the resulting
type is not legal, an __mul?i3 intrinsic is used. Since the type is
not legal, the legalizer cannot directly call the intrinsic with
the wide arguments; instead, it "pre-lowers" them by splitting them
into halves.
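
As a rough illustration of the "multiplication twice as wide" idea, here is a minimal C++ sketch of an overflow-checking 32-bit multiply done through a 64-bit product. The helper names `smulo32` and `umulo32` are hypothetical and are not LLVM APIs; the overflow tests shown (top half must equal the sign-extension of the bottom half for signed, top half must be zero for unsigned) are the usual way to phrase the check and are an assumption about what the expansion computes, not a copy of the legalizer code.

```cpp
#include <cstdint>
#include <utility>

// Hypothetical helper (not an LLVM API): signed multiply with overflow flag.
// Returns {truncated product, overflowed?}.
std::pair<int32_t, bool> smulo32(int32_t a, int32_t b) {
    // Multiply in a type twice as wide, so the full product is representable.
    int64_t wide = static_cast<int64_t>(a) * static_cast<int64_t>(b);
    // Split the wide product into halves. The narrowing conversion and the
    // right shifts of negative values assume the usual two's-complement,
    // arithmetic-shift behaviour (guaranteed since C++20).
    int32_t lo = static_cast<int32_t>(static_cast<uint32_t>(wide));
    int32_t hi = static_cast<int32_t>(wide >> 32);
    // No overflow iff the top half is just the sign-extension of the bottom.
    bool overflow = hi != (lo >> 31);
    return {lo, overflow};
}

// Hypothetical helper (not an LLVM API): unsigned variant.
std::pair<uint32_t, bool> umulo32(uint32_t a, uint32_t b) {
    uint64_t wide = static_cast<uint64_t>(a) * static_cast<uint64_t>(b);
    // Overflow iff any bit of the top half is set.
    return {static_cast<uint32_t>(wide), (wide >> 32) != 0};
}
```

When the doubled type is itself not legal for the target, the wide multiplication in this sketch is what would be replaced by the runtime library call discussed in this revision.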

The "pre-lowering" code in essence made assumptions about
the calling convention, specifically that i(N*2) values will be
split into two iN values and passed in consecutive registers in
little-endian order. This, naturally, breaks on a big-endian system,
such as our OR1K out-of-tree backend.
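
The ordering problem described above can be sketched outside of LLVM. The following C++ is purely illustrative: `Halves`, `split`, and `libcallArgs` are hypothetical names, and 64-bit operands with 32-bit registers are assumed for concreteness. It shows how the four register-sized pieces of two wide operands would be ordered depending on endianness.

```cpp
#include <array>
#include <cstdint>

// Hypothetical illustration, not LLVM code: the two halves of a wide value.
struct Halves {
    uint32_t lo;
    uint32_t hi;
};

// Split a 64-bit value into its low and high 32-bit halves.
Halves split(uint64_t v) {
    return {static_cast<uint32_t>(v), static_cast<uint32_t>(v >> 32)};
}

// Order in which the four pieces would be passed to a __muldi3-style call
// when each 64-bit operand occupies two consecutive 32-bit registers.
std::array<uint32_t, 4> libcallArgs(uint64_t lhs, uint64_t rhs,
                                    bool littleEndian) {
    Halves l = split(lhs);
    Halves r = split(rhs);
    if (littleEndian) {
        // Low half first: { LHS, HiLHS, RHS, HiRHS }.
        return {l.lo, l.hi, r.lo, r.hi};
    }
    // Big-endian: high half first: { HiLHS, LHS, HiRHS, RHS }.
    return {l.hi, l.lo, r.hi, r.lo};
}
```

Passing the little-endian order on a big-endian target would make the callee see each operand with its halves swapped, which is the breakage this revision addresses.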

Thanks to James Miller <james@aatch.net> for help in debugging.

Diff Detail

Repository: rL LLVM

Event Timeline

whitequark updated this revision to Diff 73393. Oct 3 2016, 8:59 PM

whitequark retitled this revision to [SelectionDAG] Fix calling convention in expansion of ?MULO.

whitequark updated this object.

whitequark added a reviewer: resistor.

whitequark set the repository for this revision to rL LLVM.

It is quite regrettable that this change comes without tests, but I wasn't able to trick any of the existing in-tree bi-endian backends into generating the __muldi3 sequence that I need to match against. Even worse, just unconditionally swapping the registers doesn't break any of the tests, implying that this branch wasn't tested in the first place (and also that there's nothing I can easily modify).

whitequark updated this object. Oct 3 2016, 9:08 PM

Maybe ppc32 big endian for the testcase? (If you've tried it and it doesn't work that's fine, just a thought).

-eric

lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
3689–3696 (On Diff #73393): Comment about the difference here?

> Maybe ppc32 big endian for the testcase?

Nope, for __builtin_mul_overflow this generates a completely incomprehensible sequence that nevertheless includes no builtin calls:

stw 31, -4(1)
stwu 1, -16(1)
mulhwu 6, 3, 4
mullw 3, 3, 4
mr 31, 1
cntlzw 12, 6
stw 3, 0(5)
nor 4, 12, 12
rlwinm 3, 4, 27, 31, 31
addi 1, 1, 16
lwz 31, -4(1)
blr

Something similar happens with MIPS.
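
For readers trying to follow the PowerPC listing above, here is a hedged C++ rendering of what the arithmetic instructions appear to compute. The function names are hypothetical, the register roles (r3 and r4 as operands, r5 as the result pointer) are read off the listing, and the remaining instructions are taken to be stack frame setup and teardown. The sequence multiplies inline and flags overflow when the high word of the product is non-zero, so no library call is needed.

```cpp
#include <cstdint>

// Hypothetical stand-in for cntlzw: count leading zero bits, 32 for zero.
uint32_t countLeadingZeros32(uint32_t x) {
    uint32_t n = 0;
    for (uint32_t bit = 0x80000000u; bit != 0 && (x & bit) == 0; bit >>= 1) {
        ++n;
    }
    return n;
}

// Hypothetical rendering of the listing: a in r3, b in r4, out in r5.
bool umulOverflowLikeListing(uint32_t a, uint32_t b, uint32_t *out) {
    uint32_t hi = static_cast<uint32_t>(
        (static_cast<uint64_t>(a) * b) >> 32);   // mulhwu 6, 3, 4
    uint32_t lo = a * b;                         // mullw 3, 3, 4
    uint32_t clz = countLeadingZeros32(hi);      // cntlzw 12, 6
    *out = lo;                                   // stw 3, 0(5)
    uint32_t inverted = ~clz;                    // nor 4, 12, 12
    // clz == 32 (binary 100000) only when hi == 0, so bit 5 of ~clz is set
    // exactly when hi != 0, i.e. when the product overflowed 32 bits.
    return ((inverted >> 5) & 1u) != 0;          // rlwinm 3, 4, 27, 31, 31
}
```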

Oh well.

Anyhow, one inline comment then I guess it looks OK?

Thanks.

Added explanatory comment

LGTM.

This revision is now accepted and ready to land. Oct 3 2016, 11:55 PM

Closed by commit rL283203: [SelectionDAG] Fix calling convention in expansion of ?MULO. (authored by whitequark). Oct 4 2016, 2:16 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path: llvm/trunk/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
Size: 14 lines

Diff 73435

llvm/trunk/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp

@@ ... @@ if (TLI.isOperationLegalOrCustom(Ops[isSigned][0], VT)) {
         DAG.getNode(ISD::SRA, dl, VT, LHS,
                     DAG.getConstant(LoSize - 1, dl,
                                     TLI.getPointerTy(DAG.getDataLayout())));

     // Here we're passing the 2 arguments explicitly as 4 arguments that are
     // pre-lowered to the correct types. This all depends upon WideVT not
     // being a legal type for the architecture and thus has to be split to
     // two arguments.
-    SDValue Args[] = { LHS, HiLHS, RHS, HiRHS };
-    SDValue Ret = ExpandLibCall(LC, WideVT, Args, 4, isSigned, dl);
+    SDValue Ret;
+    if(DAG.getDataLayout().isLittleEndian()) {
+      // Halves of WideVT are packed into registers in different order
+      // depending on platform endianness. This is usually handled by
+      // the C calling convention, but we can't defer to it in
+      // the legalizer.
+      SDValue Args[] = { LHS, HiLHS, RHS, HiRHS };
+      Ret = ExpandLibCall(LC, WideVT, Args, 4, isSigned, dl);
+    } else {
+      SDValue Args[] = { HiLHS, LHS, HiRHS, RHS };
+      Ret = ExpandLibCall(LC, WideVT, Args, 4, isSigned, dl);
+    }
     BottomHalf = DAG.getNode(ISD::EXTRACT_ELEMENT, dl, VT, Ret,
                              DAG.getIntPtrConstant(0, dl));
     TopHalf = DAG.getNode(ISD::EXTRACT_ELEMENT, dl, VT, Ret,
                           DAG.getIntPtrConstant(1, dl));
     // Ret is a node with an illegal type. Because such things are not
     // generally permitted during this phase of legalization, make sure the
     // node has no more uses. The above EXTRACT_ELEMENT nodes should have been
     // folded.