This is an archive of the discontinued LLVM Phabricator instance.

Differential D46414

[GlobalISel][Legalizer] More concise and faster widenScalar, NFC
Closed · Public

Authored by rtereshin on May 3 2018, 5:35 PM.

Details

Reviewers

t.p.northover
ab
bogner
aditya_nandakumar
dsanders
arsenm
javed.absar

Commits

rG25cbfe680ef0: [GlobalISel][Legalizer] More concise and faster widenScalar, NFC
rL331819: [GlobalISel][Legalizer] More concise and faster widenScalar, NFC

Summary

Refactoring the LegalizerHelper::widenScalar member function, reducing its
size by approximately a factor of 2 and (hopefully) making it more
straightforward and regular by introducing the widenScalarSrc and
widenScalarDst helper methods.

The new widenScalar* methods mutate the instructions in place instead
of recreating them from scratch and removing the originals. The
compile-time implications of this were measured on the sqlite3
amalgamation, targeting AArch64 at -O0:

LegalizerHelper::widenScalar: > 25% faster
Legalizer::runOnMachineFunction: ~ 4.0 - 4.5% faster

Also adding MachineOperand::setCImm and refactoring out
MachineIRBuilder::recordInsertion methods to make the change possible.
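
The in-place approach described in this summary can be illustrated with a small self-contained sketch. Everything in it (ToyInstr, ToyLegalizer, widenSrc, widenDst, widenToyAdd) is a hypothetical stand-in invented for illustration and is not the LLVM API. It only assumes that a source operand is widened by inserting an extension before the instruction, that a destination is widened by inserting a truncate after it, and that the instruction itself is kept and re-recorded rather than erased.

```cpp
#include <cassert>
#include <functional>
#include <iterator>
#include <list>
#include <string>
#include <vector>

// Hypothetical stand-in for a machine instruction: Ops[0] is the def,
// the remaining entries are uses. Registers are plain indices.
struct ToyInstr {
  std::string Opcode;
  std::vector<unsigned> Ops;
};

// Hypothetical stand-in for the legalizer helper plus its builder.
struct ToyLegalizer {
  using Iter = std::list<ToyInstr>::iterator;
  std::list<ToyInstr> Block;                   // Instructions in program order.
  std::vector<unsigned> RegBits;               // RegBits[R] is the width of register R.
  std::function<void(ToyInstr *)> OnInsertion; // Optional observer, may be empty.

  unsigned createReg(unsigned Bits) {
    RegBits.push_back(Bits);
    return static_cast<unsigned>(RegBits.size() - 1);
  }

  void recordInsertion(ToyInstr *I) const {
    if (OnInsertion)
      OnInsertion(I);
  }

  // Insert "Wide = ExtOpcode Use" before MI, then rewrite the use in place.
  void widenSrc(Iter MI, unsigned WideBits, unsigned OpIdx,
                const std::string &ExtOpcode) {
    assert(OpIdx < MI->Ops.size() && "operand index out of range");
    unsigned Wide = createReg(WideBits);
    Iter Ext = Block.insert(MI, ToyInstr{ExtOpcode, {Wide, MI->Ops[OpIdx]}});
    recordInsertion(&*Ext);
    MI->Ops[OpIdx] = Wide;
  }

  // Insert "OldDef = G_TRUNC Wide" after MI, then make MI define Wide.
  void widenDst(Iter MI, unsigned WideBits) {
    unsigned Wide = createReg(WideBits);
    Iter Trunc =
        Block.insert(std::next(MI), ToyInstr{"G_TRUNC", {MI->Ops[0], Wide}});
    recordInsertion(&*Trunc);
    MI->Ops[0] = Wide;
  }
};

// Builds "%2(s8) = G_ADD %0, %1", widens it to 32 bits in place, and returns
// the resulting opcode sequence. Recorded counts recordInsertion calls.
std::string widenToyAdd(unsigned &Recorded) {
  ToyLegalizer L;
  Recorded = 0;
  L.OnInsertion = [&Recorded](ToyInstr *) { ++Recorded; };
  unsigned A = L.createReg(8), B = L.createReg(8), D = L.createReg(8);
  L.Block.push_back(ToyInstr{"G_ADD", {D, A, B}});
  ToyLegalizer::Iter MI = L.Block.begin();
  L.widenSrc(MI, 32, 1, "G_ANYEXT");
  L.widenSrc(MI, 32, 2, "G_ANYEXT");
  L.widenDst(MI, 32);
  L.recordInsertion(&*MI); // The mutated instruction is reported as well.
  std::string Out;
  for (const ToyInstr &I : L.Block)
    Out += (Out.empty() ? "" : " ") + I.Opcode;
  return Out;
}
```

In this model the original G_ADD node is never erased or rebuilt; only its operand list changes, which is the property the measurements above are attributed to.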

Diff Detail

Repository: rL LLVM

Event Timeline

rtereshin created this revision. May 3 2018, 5:35 PM

Herald added a reviewer: javed.absar. May 3 2018, 5:35 PM

Herald added subscribers: llvm-commits, kristof.beyls, rovka, wdng.

rtereshin added a child revision: D46413: [GlobalISel][Legalizer] Widening the second src op of shifts bug fix. May 3 2018, 5:36 PM

I have simplified widenScalar{Src,Dst} methods a bit by using a tad more high-level MachineIRBuilder methods.

@rtereshin . Looks good and much cleaner. Thanks for working on this.

lib/CodeGen/GlobalISel/LegalizerHelper.cpp
730 ↗	(On Diff #145301)	This bit looks like it's not NFC (but seems more correct). If it's indeed not NFC, then please push this in a separate commit.

rtereshin marked an inline comment as done. May 8 2018, 2:31 PM

rtereshin added inline comments.

lib/CodeGen/GlobalISel/LegalizerHelper.cpp
730 ↗	(On Diff #145301)	I believe it is NFC. Take a look at the `MachineIRBuilder::buildConstant` that was here before: https://github.com/llvm-mirror/llvm/blob/0cb4ac08f0b0388854df5ff419fc56c5e63361dc/lib/CodeGen/GlobalISel/MachineIRBuilder.cpp#L243 `sextOrTrunc` will always resolve to `sext` there because the new type is always larger; it's a `widenScalar`, after all. Also, I have recently committed a `MachineVerifier` patch that checks that we don't extend down or truncate up: https://reviews.llvm.org/rL331718 Hopefully that helps a bit.
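
The argument that a sign-extend-or-truncate always behaves as a plain sign extension when the destination is wider can be checked with a small model. The helpers below (truncTo, sextFrom, sextOrTruncModel) are hypothetical and operate on plain 64-bit integers rather than on LLVM's APInt; they only assume the usual two's-complement meaning of sign extension and truncation.

```cpp
#include <cassert>
#include <cstdint>

// Keep only the low Bits bits of V (1 <= Bits <= 64).
uint64_t truncTo(uint64_t V, unsigned Bits) {
  assert(Bits >= 1 && Bits <= 64);
  return Bits == 64 ? V : (V & ((uint64_t{1} << Bits) - 1));
}

// Sign-extend the low FromBits bits of V to ToBits bits (FromBits <= ToBits).
uint64_t sextFrom(uint64_t V, unsigned FromBits, unsigned ToBits) {
  assert(FromBits <= ToBits);
  uint64_t Low = truncTo(V, FromBits);
  uint64_t Sign = uint64_t{1} << (FromBits - 1);
  // (Low ^ Sign) - Sign replicates the sign bit into all higher bits.
  return truncTo((Low ^ Sign) - Sign, ToBits);
}

// Model of "sign-extend or truncate": the branch taken depends only on
// how the two widths compare.
uint64_t sextOrTruncModel(uint64_t V, unsigned FromBits, unsigned ToBits) {
  return ToBits > FromBits ? sextFrom(V, FromBits, ToBits)
                           : truncTo(V, ToBits);
}
```

When ToBits is greater than FromBits, as is assumed for any widening, the truncating branch is unreachable, so the model and a direct sign extension agree on every input.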

aditya_nandakumar accepted this revision. May 8 2018, 3:34 PM

aditya_nandakumar added inline comments.

lib/CodeGen/GlobalISel/LegalizerHelper.cpp
730 ↗	(On Diff #145301)	Thanks. I hadn't seen the implementation of buildConstant. Should be good.

This revision is now accepted and ready to land. May 8 2018, 3:34 PM

@aditya_nandakumar Yes, I do have my doubts that the original implementation with sign extension is always correct, but it's NFC here anyway. I'm thinking to take a look at it at some point and see if I can come up with a breaking test case for that sign extend.

I'm also worried about how we legalize some "generic-generic" opcodes, like G_PHI - the ones that are supposedly universal and can handle floating point values and integer values both - if we always G_TRUNCate them, for instance, how is it going to inter-operate with floating point values and floating point opcodes?

Thank you for looking into this,
Roman

Closed by commit rL331819: [GlobalISel][Legalizer] More concise and faster widenScalar, NFC (authored by rtereshin). May 8 2018, 3:56 PM

This revision was automatically updated to reflect the committed changes.

Yes, I do have my doubts that the original implementation with sign extension is always correct, but it's NFC here anyway. I'm thinking to take a look at it at some point and see if I can come up with a breaking test case for that sign extend.

This should be ok. It's really an any-extend until another MI forces it one way or the other. Suppose we have:

%0:_(s32) = G_CONSTANT i32 ...
%1:_(s32) = G_OPCODE %0

after widening G_CONSTANT we get:

%0:_(s64) = G_CONSTANT i64 anyext_from_i32(...)
%1:_(s32) = G_TRUNC %0
%2:_(s32) = G_OPCODE %1

When we legalize G_OPCODE we get one of the following cases:
It consumes the original type, leaving the truncate and removing the undefined bits:

%0:_(s64) = G_CONSTANT i64 anyext_from_i32(...)
%1:_(s32) = G_TRUNC %0
%2:_(s32) = G_OPCODE %1

It widens and doesn't care about the excess bits:

%0:_(s64) = G_CONSTANT i64 anyext_from_i32(...)
%1:_(s32) = G_TRUNC %0
%2:_(s64) = G_ANYEXT %1
%3:_(s64) = G_OPCODE %2

which simplifies to:

%0:_(s64) = G_CONSTANT i64 anyext_from_i32(...)
%3:_(s64) = G_OPCODE %0

It widens and does care about the excess bits:

%0:_(s64) = G_CONSTANT i64 anyext_from_i32(...)
%1:_(s32) = G_TRUNC %0
%2:_(s64) = G_SEXT %1
%3:_(s64) = G_OPCODE %2

which simplifies to:

%0:_(s64) = G_CONSTANT i64 signext_from_i32(...)
%3:_(s64) = G_OPCODE %0

The narrowScalar case is a mixture of the two. The portions that correspond solely to extension bits end up dead and are removed, while the portions that partially correspond to both extension bits and value bits act like widenScalar.
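
The three cases above can be mirrored with ordinary integer arithmetic. This is only a sketch: trunc32, sext32to64 and anyext32to64 are hypothetical helpers, and the Junk parameter is an assumption used to model the unspecified high bits that an any-extend is allowed to produce.

```cpp
#include <cstdint>

// Model of G_TRUNC from 64 to 32 bits: the high half is discarded.
uint32_t trunc32(uint64_t V) { return static_cast<uint32_t>(V); }

// Model of G_SEXT from 32 to 64 bits.
uint64_t sext32to64(uint32_t V) {
  return static_cast<uint64_t>(static_cast<int64_t>(static_cast<int32_t>(V)));
}

// Model of G_ANYEXT from 32 to 64 bits: the high half is unspecified, so
// the caller supplies an arbitrary Junk pattern to stand in for it.
uint64_t anyext32to64(uint32_t V, uint32_t Junk) {
  return (uint64_t{Junk} << 32) | V;
}
```

A consumer that keeps the original type sees trunc32(anyext32to64(V, Junk)), which is V for every Junk. A consumer such as an addition that widens and ignores the excess bits still produces the same low 32 bits. A consumer that needs sign-extended inputs sees sext32to64(trunc32(...)), which depends only on V, so the constant can be rewritten as its sign-extended form.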

Revision Contents

Path	Size
llvm/trunk/include/llvm/CodeGen/GlobalISel/LegalizerHelper.h	12 lines
llvm/trunk/include/llvm/CodeGen/GlobalISel/MachineIRBuilder.h	1 line
llvm/trunk/include/llvm/CodeGen/MachineOperand.h	5 lines
llvm/trunk/lib/CodeGen/GlobalISel/LegalizerHelper.cpp	370 lines
llvm/trunk/lib/CodeGen/GlobalISel/MachineIRBuilder.cpp	8 lines
llvm/trunk/test/CodeGen/AArch64/GlobalISel/legalize-phi.mir	24 lines

Diff 145806

llvm/trunk/include/llvm/CodeGen/GlobalISel/LegalizerHelper.h

public:
/// Expose MIRBuilder so clients can set their own RecordInsertInstruction		/// Expose MIRBuilder so clients can set their own RecordInsertInstruction
/// functions		/// functions
MachineIRBuilder MIRBuilder;		MachineIRBuilder MIRBuilder;

/// Expose LegalizerInfo so the clients can re-use.		/// Expose LegalizerInfo so the clients can re-use.
const LegalizerInfo &getLegalizerInfo() const { return LI; }		const LegalizerInfo &getLegalizerInfo() const { return LI; }

private:		private:
		/// Legalize a single operand \p OpIdx of the machine instruction \p MI as a
		/// Use by extending the operand's type to \p WideTy using the specified \p
		/// ExtOpcode for the extension instruction, and replacing the vreg of the
		/// operand in place.
		void widenScalarSrc(MachineInstr &MI, LLT WideTy, unsigned OpIdx,
		unsigned ExtOpcode);

		/// Legalize a single operand \p OpIdx of the machine instruction \p MI as a
		/// Def by extending the operand's type to \p WideTy and truncating it back
		/// with the \p TruncOpcode, and replacing the vreg of the operand in place.
		void widenScalarDst(MachineInstr &MI, LLT WideTy, unsigned OpIdx = 0,
		unsigned TruncOpcode = TargetOpcode::G_TRUNC);

/// Helper function to split a wide generic register into bitwise blocks with		/// Helper function to split a wide generic register into bitwise blocks with
/// the given Type (which implies the number of blocks needed). The generic		/// the given Type (which implies the number of blocks needed). The generic
/// registers created are appended to Ops, starting at bit 0 of Reg.		/// registers created are appended to Ops, starting at bit 0 of Reg.
void extractParts(unsigned Reg, LLT Ty, int NumParts,		void extractParts(unsigned Reg, LLT Ty, int NumParts,
SmallVectorImpl<unsigned> &Ops);		SmallVectorImpl<unsigned> &Ops);

MachineRegisterInfo &MRI;		MachineRegisterInfo &MRI;

llvm/trunk/include/llvm/CodeGen/GlobalISel/MachineIRBuilder.h

public:
/// Set the insertion point to before MI.		/// Set the insertion point to before MI.
/// \pre MI must be in getMF().		/// \pre MI must be in getMF().
void setInstr(MachineInstr &MI);		void setInstr(MachineInstr &MI);
/// @}		/// @}

/// \name Control where instructions we create are recorded (typically for		/// \name Control where instructions we create are recorded (typically for
/// visiting again later during legalization).		/// visiting again later during legalization).
/// @{		/// @{
		void recordInsertion(MachineInstr *InsertedInstr) const;
void recordInsertions(std::function<void(MachineInstr *)> InsertedInstr);		void recordInsertions(std::function<void(MachineInstr *)> InsertedInstr);
void stopRecordingInsertions();		void stopRecordingInsertions();
/// @}		/// @}

/// Set the debug location to \p DL for all the next build instructions.		/// Set the debug location to \p DL for all the next build instructions.
void setDebugLoc(const DebugLoc &DL) { this->State.DL = DL; }		void setDebugLoc(const DebugLoc &DL) { this->State.DL = DL; }

/// Get the current instruction's debug location.		/// Get the current instruction's debug location.

llvm/trunk/include/llvm/CodeGen/MachineOperand.h

public:
// Mutators for various operand types.		// Mutators for various operand types.
//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//

void setImm(int64_t immVal) {		void setImm(int64_t immVal) {
assert(isImm() && "Wrong MachineOperand mutator");		assert(isImm() && "Wrong MachineOperand mutator");
Contents.ImmVal = immVal;		Contents.ImmVal = immVal;
}		}

		void setCImm(const ConstantInt *CI) {
		assert(isCImm() && "Wrong MachineOperand mutator");
		Contents.CI = CI;
		}

void setFPImm(const ConstantFP *CFP) {		void setFPImm(const ConstantFP *CFP) {
assert(isFPImm() && "Wrong MachineOperand mutator");		assert(isFPImm() && "Wrong MachineOperand mutator");
Contents.CFP = CFP;		Contents.CFP = CFP;
}		}

void setOffset(int64_t Offset) {		void setOffset(int64_t Offset) {
assert((isGlobal() || isSymbol() || isMCSymbol() || isCPI() ||		assert((isGlobal() || isSymbol() || isMCSymbol() || isCPI() ||
isTargetIndex() || isBlockAddress()) &&		isTargetIndex() || isBlockAddress()) &&

llvm/trunk/lib/CodeGen/GlobalISel/LegalizerHelper.cpp

case TargetOpcode::G_OR: {
unsigned DstReg = MI.getOperand(0).getReg();		unsigned DstReg = MI.getOperand(0).getReg();
MIRBuilder.buildMerge(DstReg, DstRegs);		MIRBuilder.buildMerge(DstReg, DstRegs);
MI.eraseFromParent();		MI.eraseFromParent();
return Legalized;		return Legalized;
}		}
}		}
}		}

		void LegalizerHelper::widenScalarSrc(MachineInstr &MI, LLT WideTy,
		unsigned OpIdx, unsigned ExtOpcode) {
		MachineOperand &MO = MI.getOperand(OpIdx);
		auto ExtB = MIRBuilder.buildInstr(ExtOpcode, WideTy, MO.getReg());
		MO.setReg(ExtB->getOperand(0).getReg());
		}

		void LegalizerHelper::widenScalarDst(MachineInstr &MI, LLT WideTy,
		unsigned OpIdx, unsigned TruncOpcode) {
		MachineOperand &MO = MI.getOperand(OpIdx);
		unsigned DstExt = MRI.createGenericVirtualRegister(WideTy);
		MIRBuilder.setInsertPt(MIRBuilder.getMBB(), ++MIRBuilder.getInsertPt());
		MIRBuilder.buildInstr(TruncOpcode, MO.getReg(), DstExt);
		MO.setReg(DstExt);
		}

LegalizerHelper::LegalizeResult		LegalizerHelper::LegalizeResult
LegalizerHelper::widenScalar(MachineInstr &MI, unsigned TypeIdx, LLT WideTy) {		LegalizerHelper::widenScalar(MachineInstr &MI, unsigned TypeIdx, LLT WideTy) {
MIRBuilder.setInstr(MI);		MIRBuilder.setInstr(MI);

switch (MI.getOpcode()) {		switch (MI.getOpcode()) {
default:		default:
return UnableToLegalize;		return UnableToLegalize;

case TargetOpcode::G_ADD:		case TargetOpcode::G_ADD:
case TargetOpcode::G_AND:		case TargetOpcode::G_AND:
case TargetOpcode::G_MUL:		case TargetOpcode::G_MUL:
case TargetOpcode::G_OR:		case TargetOpcode::G_OR:
case TargetOpcode::G_XOR:		case TargetOpcode::G_XOR:
case TargetOpcode::G_SUB:		case TargetOpcode::G_SUB:
case TargetOpcode::G_SHL: {		case TargetOpcode::G_SHL:
// Perform operation at larger width (any extension is fine here, high bits		// Perform operation at larger width (any extension is fine here, high bits
// don't affect the result) and then truncate the result back to the		// don't affect the result) and then truncate the result back to the
// original type.		// original type.
unsigned Src1Ext = MRI.createGenericVirtualRegister(WideTy);		widenScalarSrc(MI, WideTy, 1, TargetOpcode::G_ANYEXT);
unsigned Src2Ext = MRI.createGenericVirtualRegister(WideTy);		widenScalarSrc(MI, WideTy, 2, TargetOpcode::G_ANYEXT);
MIRBuilder.buildAnyExt(Src1Ext, MI.getOperand(1).getReg());		widenScalarDst(MI, WideTy);
MIRBuilder.buildAnyExt(Src2Ext, MI.getOperand(2).getReg());		MIRBuilder.recordInsertion(&MI);

unsigned DstExt = MRI.createGenericVirtualRegister(WideTy);
MIRBuilder.buildInstr(MI.getOpcode())
.addDef(DstExt)
.addUse(Src1Ext)
.addUse(Src2Ext);

MIRBuilder.buildTrunc(MI.getOperand(0).getReg(), DstExt);
MI.eraseFromParent();
return Legalized;		return Legalized;
}
case TargetOpcode::G_SDIV:		case TargetOpcode::G_SDIV:
case TargetOpcode::G_UDIV:
case TargetOpcode::G_SREM:		case TargetOpcode::G_SREM:
case TargetOpcode::G_UREM:
case TargetOpcode::G_ASHR:		case TargetOpcode::G_ASHR:
case TargetOpcode::G_LSHR: {		widenScalarSrc(MI, WideTy, 1, TargetOpcode::G_SEXT);
unsigned ExtOp = MI.getOpcode() == TargetOpcode::G_SDIV ||		widenScalarSrc(MI, WideTy, 2, TargetOpcode::G_SEXT);
MI.getOpcode() == TargetOpcode::G_SREM ||		widenScalarDst(MI, WideTy);
MI.getOpcode() == TargetOpcode::G_ASHR		MIRBuilder.recordInsertion(&MI);
? TargetOpcode::G_SEXT		return Legalized;
: TargetOpcode::G_ZEXT;

unsigned LHSExt = MRI.createGenericVirtualRegister(WideTy);
MIRBuilder.buildInstr(ExtOp).addDef(LHSExt).addUse(
MI.getOperand(1).getReg());

unsigned RHSExt = MRI.createGenericVirtualRegister(WideTy);
MIRBuilder.buildInstr(ExtOp).addDef(RHSExt).addUse(
MI.getOperand(2).getReg());

unsigned ResExt = MRI.createGenericVirtualRegister(WideTy);
MIRBuilder.buildInstr(MI.getOpcode())
.addDef(ResExt)
.addUse(LHSExt)
.addUse(RHSExt);

MIRBuilder.buildTrunc(MI.getOperand(0).getReg(), ResExt);		case TargetOpcode::G_UDIV:
MI.eraseFromParent();		case TargetOpcode::G_UREM:
		case TargetOpcode::G_LSHR:
		widenScalarSrc(MI, WideTy, 1, TargetOpcode::G_ZEXT);
		widenScalarSrc(MI, WideTy, 2, TargetOpcode::G_ZEXT);
		widenScalarDst(MI, WideTy);
		MIRBuilder.recordInsertion(&MI);
return Legalized;		return Legalized;
}
case TargetOpcode::G_SELECT: {		case TargetOpcode::G_SELECT:
if (TypeIdx != 0)		if (TypeIdx != 0)
return UnableToLegalize;		return UnableToLegalize;

// Perform operation at larger width (any extension is fine here, high bits		// Perform operation at larger width (any extension is fine here, high bits
// don't affect the result) and then truncate the result back to the		// don't affect the result) and then truncate the result back to the
// original type.		// original type.
unsigned Src1Ext = MRI.createGenericVirtualRegister(WideTy);		widenScalarSrc(MI, WideTy, 2, TargetOpcode::G_ANYEXT);
unsigned Src2Ext = MRI.createGenericVirtualRegister(WideTy);		widenScalarSrc(MI, WideTy, 3, TargetOpcode::G_ANYEXT);
MIRBuilder.buildAnyExt(Src1Ext, MI.getOperand(2).getReg());		widenScalarDst(MI, WideTy);
MIRBuilder.buildAnyExt(Src2Ext, MI.getOperand(3).getReg());		MIRBuilder.recordInsertion(&MI);

unsigned DstExt = MRI.createGenericVirtualRegister(WideTy);
MIRBuilder.buildInstr(TargetOpcode::G_SELECT)
.addDef(DstExt)
.addReg(MI.getOperand(1).getReg())
.addUse(Src1Ext)
.addUse(Src2Ext);

MIRBuilder.buildTrunc(MI.getOperand(0).getReg(), DstExt);
MI.eraseFromParent();
return Legalized;		return Legalized;
}
case TargetOpcode::G_FPTOSI:		case TargetOpcode::G_FPTOSI:
case TargetOpcode::G_FPTOUI: {		case TargetOpcode::G_FPTOUI:
if (TypeIdx != 0)		if (TypeIdx != 0)
return UnableToLegalize;		return UnableToLegalize;
		widenScalarDst(MI, WideTy);
unsigned DstExt = MRI.createGenericVirtualRegister(WideTy);		MIRBuilder.recordInsertion(&MI);
MIRBuilder.buildInstr(MI.getOpcode())
.addDef(DstExt)
.addUse(MI.getOperand(1).getReg());

MIRBuilder.buildTrunc(MI.getOperand(0).getReg(), DstExt);
MI.eraseFromParent();
return Legalized;		return Legalized;
}
case TargetOpcode::G_SITOFP:		case TargetOpcode::G_SITOFP:
case TargetOpcode::G_UITOFP: {
if (TypeIdx != 1)		if (TypeIdx != 1)
return UnableToLegalize;		return UnableToLegalize;
		widenScalarSrc(MI, WideTy, 1, TargetOpcode::G_SEXT);
		MIRBuilder.recordInsertion(&MI);
		return Legalized;

unsigned Src = MI.getOperand(1).getReg();		case TargetOpcode::G_UITOFP:
unsigned SrcExt = MRI.createGenericVirtualRegister(WideTy);		if (TypeIdx != 1)
		return UnableToLegalize;
if (MI.getOpcode() == TargetOpcode::G_SITOFP) {		widenScalarSrc(MI, WideTy, 1, TargetOpcode::G_ZEXT);
MIRBuilder.buildSExt(SrcExt, Src);		MIRBuilder.recordInsertion(&MI);
} else {
assert(MI.getOpcode() == TargetOpcode::G_UITOFP && "Unexpected conv op");
MIRBuilder.buildZExt(SrcExt, Src);
}

MIRBuilder.buildInstr(MI.getOpcode())
.addDef(MI.getOperand(0).getReg())
.addUse(SrcExt);

MI.eraseFromParent();
return Legalized;		return Legalized;
}
case TargetOpcode::G_INSERT: {		case TargetOpcode::G_INSERT:
if (TypeIdx != 0)		if (TypeIdx != 0)
return UnableToLegalize;		return UnableToLegalize;
		widenScalarSrc(MI, WideTy, 1, TargetOpcode::G_ANYEXT);
unsigned Src = MI.getOperand(1).getReg();		widenScalarDst(MI, WideTy);
unsigned SrcExt = MRI.createGenericVirtualRegister(WideTy);		MIRBuilder.recordInsertion(&MI);
MIRBuilder.buildAnyExt(SrcExt, Src);

unsigned DstExt = MRI.createGenericVirtualRegister(WideTy);
auto MIB = MIRBuilder.buildInsert(DstExt, SrcExt, MI.getOperand(2).getReg(),
MI.getOperand(3).getImm());
for (unsigned OpNum = 4; OpNum < MI.getNumOperands(); OpNum += 2) {
MIB.addReg(MI.getOperand(OpNum).getReg());
MIB.addImm(MI.getOperand(OpNum + 1).getImm());
}

MIRBuilder.buildTrunc(MI.getOperand(0).getReg(), DstExt);
MI.eraseFromParent();
return Legalized;		return Legalized;
}
case TargetOpcode::G_LOAD:		case TargetOpcode::G_LOAD:
// For some types like i24, we might try to widen to i32. To properly handle		// For some types like i24, we might try to widen to i32. To properly handle
// this we should be using a dedicated extending load, until then avoid		// this we should be using a dedicated extending load, until then avoid
// trying to legalize.		// trying to legalize.
if (alignTo(MRI.getType(MI.getOperand(0).getReg()).getSizeInBits(), 8) !=		if (alignTo(MRI.getType(MI.getOperand(0).getReg()).getSizeInBits(), 8) !=
WideTy.getSizeInBits())		WideTy.getSizeInBits())
return UnableToLegalize;		return UnableToLegalize;
LLVM_FALLTHROUGH;		LLVM_FALLTHROUGH;
case TargetOpcode::G_SEXTLOAD:		case TargetOpcode::G_SEXTLOAD:
case TargetOpcode::G_ZEXTLOAD: {		case TargetOpcode::G_ZEXTLOAD:
unsigned DstExt = MRI.createGenericVirtualRegister(WideTy);		widenScalarDst(MI, WideTy);
MIRBuilder.buildLoadInstr(MI.getOpcode(), DstExt, MI.getOperand(1).getReg(),		MIRBuilder.recordInsertion(&MI);
**MI.memoperands_begin());
MIRBuilder.buildTrunc(MI.getOperand(0).getReg(), DstExt);
MI.eraseFromParent();
return Legalized;		return Legalized;
}
case TargetOpcode::G_STORE: {		case TargetOpcode::G_STORE: {
if (MRI.getType(MI.getOperand(0).getReg()) != LLT::scalar(1) ||		if (MRI.getType(MI.getOperand(0).getReg()) != LLT::scalar(1) ||
WideTy != LLT::scalar(8))		WideTy != LLT::scalar(8))
return UnableToLegalize;		return UnableToLegalize;

auto &TLI = *MIRBuilder.getMF().getSubtarget().getTargetLowering();		const auto &TLI = *MIRBuilder.getMF().getSubtarget().getTargetLowering();
auto Content = TLI.getBooleanContents(false, false);		switch (TLI.getBooleanContents(false, false)) {
		case TargetLoweringBase::ZeroOrNegativeOneBooleanContent:
unsigned ExtOp = TargetOpcode::G_ANYEXT;		widenScalarSrc(MI, WideTy, 0, TargetOpcode::G_SEXT);
if (Content == TargetLoweringBase::ZeroOrOneBooleanContent)		break;
ExtOp = TargetOpcode::G_ZEXT;		case TargetLoweringBase::ZeroOrOneBooleanContent:
else if (Content == TargetLoweringBase::ZeroOrNegativeOneBooleanContent)		widenScalarSrc(MI, WideTy, 0, TargetOpcode::G_ZEXT);
ExtOp = TargetOpcode::G_SEXT;		break;
else		default:
ExtOp = TargetOpcode::G_ANYEXT;		widenScalarSrc(MI, WideTy, 0, TargetOpcode::G_ANYEXT);
		}
unsigned SrcExt = MRI.createGenericVirtualRegister(WideTy);		MIRBuilder.recordInsertion(&MI);
MIRBuilder.buildInstr(ExtOp).addDef(SrcExt).addUse(
MI.getOperand(0).getReg());
MIRBuilder.buildStore(SrcExt, MI.getOperand(1).getReg(),
**MI.memoperands_begin());
MI.eraseFromParent();
return Legalized;		return Legalized;
}		}
case TargetOpcode::G_CONSTANT: {		case TargetOpcode::G_CONSTANT: {
unsigned DstExt = MRI.createGenericVirtualRegister(WideTy);		MachineOperand &SrcMO = MI.getOperand(1);
MIRBuilder.buildConstant(DstExt, *MI.getOperand(1).getCImm());		LLVMContext &Ctx = MIRBuilder.getMF().getFunction().getContext();
MIRBuilder.buildTrunc(MI.getOperand(0).getReg(), DstExt);		const APInt &Val = SrcMO.getCImm()->getValue().sext(WideTy.getSizeInBits());
MI.eraseFromParent();		SrcMO.setCImm(ConstantInt::get(Ctx, Val));

		widenScalarDst(MI, WideTy);
		MIRBuilder.recordInsertion(&MI);
return Legalized;		return Legalized;
}		}
case TargetOpcode::G_FCONSTANT: {		case TargetOpcode::G_FCONSTANT: {
const ConstantFP *CFP = MI.getOperand(1).getFPImm();		MachineOperand &SrcMO = MI.getOperand(1);
APFloat Val = CFP->getValueAPF();
LLVMContext &Ctx = MIRBuilder.getMF().getFunction().getContext();		LLVMContext &Ctx = MIRBuilder.getMF().getFunction().getContext();
auto LLT2Sem = [](LLT Ty) {		APFloat Val = SrcMO.getFPImm()->getValueAPF();
switch (Ty.getSizeInBits()) {		bool LosesInfo;
		switch (WideTy.getSizeInBits()) {
case 32:		case 32:
return &APFloat::IEEEsingle();		Val.convert(APFloat::IEEEsingle(), APFloat::rmTowardZero, &LosesInfo);
break;		break;
case 64:		case 64:
return &APFloat::IEEEdouble();		Val.convert(APFloat::IEEEdouble(), APFloat::rmTowardZero, &LosesInfo);
break;		break;
default:		default:
llvm_unreachable("Unhandled fp widen type");		llvm_unreachable("Unhandled fp widen type");
}		}
};		SrcMO.setFPImm(ConstantFP::get(Ctx, Val));
bool LosesInfo;
Val.convert(*LLT2Sem(WideTy), APFloat::rmTowardZero, &LosesInfo);		widenScalarDst(MI, WideTy, 0, TargetOpcode::G_FPTRUNC);
auto Cst = MIRBuilder.buildFConstant(WideTy, *ConstantFP::get(Ctx, Val));		MIRBuilder.recordInsertion(&MI);
MIRBuilder.buildFPTrunc(MI.getOperand(0).getReg(), Cst);
MI.eraseFromParent();
return Legalized;		return Legalized;
}		}
case TargetOpcode::G_BRCOND: {		case TargetOpcode::G_BRCOND:
unsigned TstExt = MRI.createGenericVirtualRegister(WideTy);		widenScalarSrc(MI, WideTy, 0, TargetOpcode::G_ANYEXT);
MIRBuilder.buildAnyExt(TstExt, MI.getOperand(0).getReg());		MIRBuilder.recordInsertion(&MI);
MIRBuilder.buildBrCond(TstExt, *MI.getOperand(1).getMBB());
MI.eraseFromParent();
return Legalized;		return Legalized;
}
case TargetOpcode::G_FCMP: {		case TargetOpcode::G_FCMP:
unsigned Op0Ext, Op1Ext, DstReg;
unsigned Cmp1 = MI.getOperand(2).getReg();
unsigned Cmp2 = MI.getOperand(3).getReg();
if (TypeIdx == 0) {
Op0Ext = Cmp1;
Op1Ext = Cmp2;
DstReg = MRI.createGenericVirtualRegister(WideTy);
} else {
Op0Ext = MRI.createGenericVirtualRegister(WideTy);
Op1Ext = MRI.createGenericVirtualRegister(WideTy);
DstReg = MI.getOperand(0).getReg();
MIRBuilder.buildInstr(TargetOpcode::G_FPEXT, Op0Ext, Cmp1);
MIRBuilder.buildInstr(TargetOpcode::G_FPEXT, Op1Ext, Cmp2);
}
MIRBuilder.buildFCmp(
static_cast<CmpInst::Predicate>(MI.getOperand(1).getPredicate()),
DstReg, Op0Ext, Op1Ext);
if (TypeIdx == 0)		if (TypeIdx == 0)
MIRBuilder.buildInstr(TargetOpcode::G_TRUNC, MI.getOperand(0).getReg(),		widenScalarDst(MI, WideTy);
DstReg);		else {
MI.eraseFromParent();		widenScalarSrc(MI, WideTy, 2, TargetOpcode::G_FPEXT);
return Legalized;		widenScalarSrc(MI, WideTy, 3, TargetOpcode::G_FPEXT);
}		}
case TargetOpcode::G_ICMP: {		MIRBuilder.recordInsertion(&MI);
bool IsSigned = CmpInst::isSigned(
static_cast<CmpInst::Predicate>(MI.getOperand(1).getPredicate()));
unsigned Cmp1 = MI.getOperand(2).getReg();
unsigned Cmp2 = MI.getOperand(3).getReg();
unsigned Op0Ext, Op1Ext, DstReg;
if (TypeIdx == 0) {
Op0Ext = Cmp1;
Op1Ext = Cmp2;
DstReg = MRI.createGenericVirtualRegister(WideTy);
} else {
Op0Ext = MRI.createGenericVirtualRegister(WideTy);
Op1Ext = MRI.createGenericVirtualRegister(WideTy);
DstReg = MI.getOperand(0).getReg();
if (IsSigned) {
MIRBuilder.buildSExt(Op0Ext, Cmp1);
MIRBuilder.buildSExt(Op1Ext, Cmp2);
} else {
MIRBuilder.buildZExt(Op0Ext, Cmp1);
MIRBuilder.buildZExt(Op1Ext, Cmp2);
}
}
MIRBuilder.buildICmp(
static_cast<CmpInst::Predicate>(MI.getOperand(1).getPredicate()),
DstReg, Op0Ext, Op1Ext);
if (TypeIdx == 0)
MIRBuilder.buildInstr(TargetOpcode::G_TRUNC, MI.getOperand(0).getReg(),
DstReg);
MI.eraseFromParent();
return Legalized;		return Legalized;

		case TargetOpcode::G_ICMP:
		if (TypeIdx == 0)
		widenScalarDst(MI, WideTy);
		else {
		unsigned ExtOpcode = CmpInst::isSigned(static_cast<CmpInst::Predicate>(
		MI.getOperand(1).getPredicate()))
		? TargetOpcode::G_SEXT
		: TargetOpcode::G_ZEXT;
		widenScalarSrc(MI, WideTy, 2, ExtOpcode);
		widenScalarSrc(MI, WideTy, 3, ExtOpcode);
}		}
case TargetOpcode::G_GEP: {		MIRBuilder.recordInsertion(&MI);
		return Legalized;

		case TargetOpcode::G_GEP:
assert(TypeIdx == 1 && "unable to legalize pointer of GEP");		assert(TypeIdx == 1 && "unable to legalize pointer of GEP");
unsigned OffsetExt = MRI.createGenericVirtualRegister(WideTy);		widenScalarSrc(MI, WideTy, 2, TargetOpcode::G_SEXT);
MIRBuilder.buildSExt(OffsetExt, MI.getOperand(2).getReg());		MIRBuilder.recordInsertion(&MI);
MI.getOperand(2).setReg(OffsetExt);
return Legalized;		return Legalized;
}
case TargetOpcode::G_PHI: {		case TargetOpcode::G_PHI: {
assert(TypeIdx == 0 && "Expecting only Idx 0");		assert(TypeIdx == 0 && "Expecting only Idx 0");
auto getExtendedReg = [&](unsigned Reg, MachineBasicBlock &MBB) {
auto FirstTermIt = MBB.getFirstTerminator();		for (unsigned I = 1; I < MI.getNumOperands(); I += 2) {
MIRBuilder.setInsertPt(MBB, FirstTermIt);		MachineBasicBlock &OpMBB = *MI.getOperand(I + 1).getMBB();
MachineInstr *DefMI = MRI.getVRegDef(Reg);		MIRBuilder.setInsertPt(OpMBB, OpMBB.getFirstTerminator());
MachineInstrBuilder MIB;		widenScalarSrc(MI, WideTy, I, TargetOpcode::G_ANYEXT);
if (DefMI->getOpcode() == TargetOpcode::G_TRUNC)		}
MIB = MIRBuilder.buildAnyExtOrTrunc(WideTy,
DefMI->getOperand(1).getReg());		MachineBasicBlock &MBB = *MI.getParent();
else		MIRBuilder.setInsertPt(MBB, --MBB.getFirstNonPHI());
MIB = MIRBuilder.buildAnyExt(WideTy, Reg);		widenScalarDst(MI, WideTy);
return MIB->getOperand(0).getReg();		MIRBuilder.recordInsertion(&MI);
};
auto MIB = MIRBuilder.buildInstr(TargetOpcode::G_PHI, WideTy);
for (auto OpIt = MI.operands_begin() + 1, OpE = MI.operands_end();
OpIt != OpE;) {
unsigned Reg = OpIt++->getReg();
MachineBasicBlock *OpMBB = OpIt++->getMBB();
MIB.addReg(getExtendedReg(Reg, *OpMBB));
MIB.addMBB(OpMBB);
}
auto *MBB = MI.getParent();
MIRBuilder.setInsertPt(*MBB, MBB->getFirstNonPHI());
MIRBuilder.buildTrunc(MI.getOperand(0).getReg(),
MIB->getOperand(0).getReg());
MI.eraseFromParent();
return Legalized;		return Legalized;
}		}
}		}
}		}

LegalizerHelper::LegalizeResult		LegalizerHelper::LegalizeResult
LegalizerHelper::lower(MachineInstr &MI, unsigned TypeIdx, LLT Ty) {		LegalizerHelper::lower(MachineInstr &MI, unsigned TypeIdx, LLT Ty) {
using namespace TargetOpcode;		using namespace TargetOpcode;

llvm/trunk/lib/CodeGen/GlobalISel/MachineIRBuilder.cpp

	void MachineIRBuilderBase::setInsertPt(MachineBasicBlock &MBB,			void MachineIRBuilderBase::setInsertPt(MachineBasicBlock &MBB,
	MachineBasicBlock::iterator II) {			MachineBasicBlock::iterator II) {
	assert(MBB.getParent() == &getMF() &&			assert(MBB.getParent() == &getMF() &&
	"Basic block is in a different function");			"Basic block is in a different function");
	State.MBB = &MBB;			State.MBB = &MBB;
	State.II = II;			State.II = II;
	}			}

				void MachineIRBuilderBase::recordInsertion(MachineInstr *InsertedInstr) const {
				if (State.InsertedInstr)
				State.InsertedInstr(InsertedInstr);
				}

	void MachineIRBuilderBase::recordInsertions(			void MachineIRBuilderBase::recordInsertions(
	std::function<void(MachineInstr *)> Inserted) {			std::function<void(MachineInstr *)> Inserted) {
	State.InsertedInstr = std::move(Inserted);			State.InsertedInstr = std::move(Inserted);
	}			}

	void MachineIRBuilderBase::stopRecordingInsertions() {			void MachineIRBuilderBase::stopRecordingInsertions() {
	State.InsertedInstr = nullptr;			State.InsertedInstr = nullptr;
	}			}

	//------------------------------------------------------------------------------			//------------------------------------------------------------------------------
	// Build instruction variants.			// Build instruction variants.
	//------------------------------------------------------------------------------			//------------------------------------------------------------------------------

	MachineInstrBuilder MachineIRBuilderBase::buildInstr(unsigned Opcode) {			MachineInstrBuilder MachineIRBuilderBase::buildInstr(unsigned Opcode) {
	return insertInstr(buildInstrNoInsert(Opcode));			return insertInstr(buildInstrNoInsert(Opcode));
	}			}

	MachineInstrBuilder MachineIRBuilderBase::buildInstrNoInsert(unsigned Opcode) {			MachineInstrBuilder MachineIRBuilderBase::buildInstrNoInsert(unsigned Opcode) {
	MachineInstrBuilder MIB = BuildMI(getMF(), getDL(), getTII().get(Opcode));			MachineInstrBuilder MIB = BuildMI(getMF(), getDL(), getTII().get(Opcode));
	return MIB;			return MIB;
	}			}

	MachineInstrBuilder MachineIRBuilderBase::insertInstr(MachineInstrBuilder MIB) {			MachineInstrBuilder MachineIRBuilderBase::insertInstr(MachineInstrBuilder MIB) {
	getMBB().insert(getInsertPt(), MIB);			getMBB().insert(getInsertPt(), MIB);
	if (State.InsertedInstr)			recordInsertion(MIB);
	State.InsertedInstr(MIB);
	return MIB;			return MIB;
	}			}

	MachineInstrBuilder			MachineInstrBuilder
	MachineIRBuilderBase::buildDirectDbgValue(unsigned Reg, const MDNode *Variable,			MachineIRBuilderBase::buildDirectDbgValue(unsigned Reg, const MDNode *Variable,
	const MDNode *Expr) {			const MDNode *Expr) {
	assert(isa<DILocalVariable>(Variable) && "not a variable");			assert(isa<DILocalVariable>(Variable) && "not a variable");
	assert(cast<DIExpression>(Expr)->isValid() && "not an expression");			assert(cast<DIExpression>(Expr)->isValid() && "not an expression");

llvm/trunk/test/CodeGen/AArch64/GlobalISel/legalize-phi.mir

# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py
# RUN: llc -mtriple=aarch64-unknown-unknown -verify-machineinstrs -run-pass=legalizer %s -o - | FileCheck %s		# RUN: llc -mtriple=aarch64-unknown-unknown -verify-machineinstrs -run-pass=legalizer %s -o - | FileCheck %s
--- |		--- |
; ModuleID = '/tmp/test.ll'		; ModuleID = '/tmp/test.ll'
source_filename = "/tmp/test.ll"		source_filename = "/tmp/test.ll"
target datalayout = "e-m:e-i8:8:32-i16:16:32-i64:64-i128:128-n32:64-S128"		target datalayout = "e-m:e-i8:8:32-i16:16:32-i64:64-i128:128-n32:64-S128"
target triple = "aarch64-unknown-unknown"		target triple = "aarch64-unknown-unknown"

define i32 @legalize_phi(i32 %argc) {		define i32 @legalize_phi(i32 %argc) {
... (281 lines not shown) ...
body: |
; CHECK: successors: %bb.1(0x80000000)		; CHECK: successors: %bb.1(0x80000000)
; CHECK: liveins: $w0		; CHECK: liveins: $w0
; CHECK: [[COPY:%[0-9]+]]:_(s32) = COPY $w0		; CHECK: [[COPY:%[0-9]+]]:_(s32) = COPY $w0
; CHECK: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 1		; CHECK: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 1
; CHECK: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 0		; CHECK: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 0
; CHECK: [[TRUNC:%[0-9]+]]:_(s16) = G_TRUNC [[C1]](s32)		; CHECK: [[TRUNC:%[0-9]+]]:_(s16) = G_TRUNC [[C1]](s32)
; CHECK: bb.1:		; CHECK: bb.1:
; CHECK: successors: %bb.1(0x40000000), %bb.2(0x40000000)		; CHECK: successors: %bb.1(0x40000000), %bb.2(0x40000000)
; CHECK: [[PHI:%[0-9]+]]:_(s16) = G_PHI [[TRUNC]](s16), %bb.0, %14(s16), %bb.1		; CHECK: [[PHI:%[0-9]+]]:_(s16) = G_PHI [[TRUNC]](s16), %bb.0, [[TRUNC3:%[0-9]+]](s16), %bb.1
; CHECK: [[ANYEXT:%[0-9]+]]:_(s32) = G_ANYEXT [[PHI]](s16)		; CHECK: [[ANYEXT:%[0-9]+]]:_(s32) = G_ANYEXT [[PHI]](s16)
; CHECK: [[COPY1:%[0-9]+]]:_(s32) = COPY [[C]](s32)		; CHECK: [[COPY1:%[0-9]+]]:_(s32) = COPY [[C]](s32)
; CHECK: [[ADD:%[0-9]+]]:_(s32) = G_ADD [[ANYEXT]], [[COPY1]]		; CHECK: [[ADD:%[0-9]+]]:_(s32) = G_ADD [[ANYEXT]], [[COPY1]]
; CHECK: [[TRUNC1:%[0-9]+]]:_(s8) = G_TRUNC [[ADD]](s32)		; CHECK: [[TRUNC1:%[0-9]+]]:_(s8) = G_TRUNC [[ADD]](s32)
; CHECK: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 255		; CHECK: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 255
; CHECK: [[COPY2:%[0-9]+]]:_(s32) = COPY [[ADD]](s32)		; CHECK: [[COPY2:%[0-9]+]]:_(s32) = COPY [[ADD]](s32)
; CHECK: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY2]], [[C2]]		; CHECK: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY2]], [[C2]]
; CHECK: [[ICMP:%[0-9]+]]:_(s32) = G_ICMP intpred(ugt), [[AND]](s32), [[COPY]]		; CHECK: [[ICMP:%[0-9]+]]:_(s32) = G_ICMP intpred(ugt), [[AND]](s32), [[COPY]]
; CHECK: [[TRUNC2:%[0-9]+]]:_(s1) = G_TRUNC [[ICMP]](s32)		; CHECK: [[TRUNC2:%[0-9]+]]:_(s1) = G_TRUNC [[ICMP]](s32)
; CHECK: [[TRUNC3:%[0-9]+]]:_(s16) = G_TRUNC [[ADD]](s32)		; CHECK: [[TRUNC3]]:_(s16) = G_TRUNC [[ADD]](s32)
; CHECK: G_BRCOND [[TRUNC2]](s1), %bb.1		; CHECK: G_BRCOND [[TRUNC2]](s1), %bb.1
; CHECK: bb.2:		; CHECK: bb.2:
; CHECK: [[C3:%[0-9]+]]:_(s32) = G_CONSTANT i32 255		; CHECK: [[C3:%[0-9]+]]:_(s32) = G_CONSTANT i32 255
; CHECK: [[COPY3:%[0-9]+]]:_(s32) = COPY [[ADD]](s32)		; CHECK: [[COPY3:%[0-9]+]]:_(s32) = COPY [[ADD]](s32)
; CHECK: [[AND1:%[0-9]+]]:_(s32) = G_AND [[COPY3]], [[C3]]		; CHECK: [[AND1:%[0-9]+]]:_(s32) = G_AND [[COPY3]], [[C3]]
; CHECK: $w0 = COPY [[AND1]](s32)		; CHECK: $w0 = COPY [[AND1]](s32)
; CHECK: RET_ReallyLR implicit $w0		; CHECK: RET_ReallyLR implicit $w0
bb.0:		bb.0:
... (40 lines not shown) ...
body: |
; CHECK: bb.0:		; CHECK: bb.0:
; CHECK: successors: %bb.1(0x80000000)		; CHECK: successors: %bb.1(0x80000000)
; CHECK: liveins: $w0		; CHECK: liveins: $w0
; CHECK: [[COPY:%[0-9]+]]:_(s32) = COPY $w0		; CHECK: [[COPY:%[0-9]+]]:_(s32) = COPY $w0
; CHECK: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 0		; CHECK: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 0
; CHECK: [[TRUNC:%[0-9]+]]:_(s16) = G_TRUNC [[C]](s32)		; CHECK: [[TRUNC:%[0-9]+]]:_(s16) = G_TRUNC [[C]](s32)
; CHECK: bb.1:		; CHECK: bb.1:
; CHECK: successors: %bb.1(0x40000000), %bb.2(0x40000000)		; CHECK: successors: %bb.1(0x40000000), %bb.2(0x40000000)
; CHECK: [[PHI:%[0-9]+]]:_(s16) = G_PHI [[TRUNC]](s16), %bb.0, %8(s16), %bb.1		; CHECK: [[PHI:%[0-9]+]]:_(s16) = G_PHI [[TRUNC]](s16), %bb.0, [[COPY1:%[0-9]+]](s16), %bb.1
; CHECK: [[TRUNC1:%[0-9]+]]:_(s8) = G_TRUNC [[PHI]](s16)		; CHECK: [[TRUNC1:%[0-9]+]]:_(s8) = G_TRUNC [[PHI]](s16)
; CHECK: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 255		; CHECK: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 255
; CHECK: [[ANYEXT:%[0-9]+]]:_(s32) = G_ANYEXT [[PHI]](s16)		; CHECK: [[ANYEXT:%[0-9]+]]:_(s32) = G_ANYEXT [[PHI]](s16)
; CHECK: [[AND:%[0-9]+]]:_(s32) = G_AND [[ANYEXT]], [[C1]]		; CHECK: [[AND:%[0-9]+]]:_(s32) = G_AND [[ANYEXT]], [[C1]]
; CHECK: [[ICMP:%[0-9]+]]:_(s32) = G_ICMP intpred(ugt), [[AND]](s32), [[COPY]]		; CHECK: [[ICMP:%[0-9]+]]:_(s32) = G_ICMP intpred(ugt), [[AND]](s32), [[COPY]]
; CHECK: [[TRUNC2:%[0-9]+]]:_(s1) = G_TRUNC [[ICMP]](s32)		; CHECK: [[TRUNC2:%[0-9]+]]:_(s1) = G_TRUNC [[ICMP]](s32)
; CHECK: [[COPY1:%[0-9]+]]:_(s16) = COPY [[PHI]](s16)		; CHECK: [[COPY1]]:_(s16) = COPY [[PHI]](s16)
; CHECK: G_BRCOND [[TRUNC2]](s1), %bb.1		; CHECK: G_BRCOND [[TRUNC2]](s1), %bb.1
; CHECK: bb.2:		; CHECK: bb.2:
; CHECK: $w0 = COPY [[AND]](s32)		; CHECK: $w0 = COPY [[AND]](s32)
; CHECK: RET_ReallyLR implicit $w0		; CHECK: RET_ReallyLR implicit $w0
bb.0:		bb.0:
successors: %bb.1(0x80000000)		successors: %bb.1(0x80000000)
liveins: $w0		liveins: $w0
; Test that we properly legalize a phi that uses itself		; Test that we properly legalize a phi that uses itself
... (69 lines not shown) ...
body: |
; CHECK: [[PHI:%[0-9]+]]:_(s16) = G_PHI [[TRUNC2]](s16), %bb.1, [[TRUNC4]](s16), %bb.2		; CHECK: [[PHI:%[0-9]+]]:_(s16) = G_PHI [[TRUNC2]](s16), %bb.1, [[TRUNC4]](s16), %bb.2
; CHECK: [[PHI1:%[0-9]+]]:_(s16) = G_PHI [[TRUNC1]](s16), %bb.1, [[TRUNC3]](s16), %bb.2		; CHECK: [[PHI1:%[0-9]+]]:_(s16) = G_PHI [[TRUNC1]](s16), %bb.1, [[TRUNC3]](s16), %bb.2
; CHECK: [[C4:%[0-9]+]]:_(s32) = G_CONSTANT i32 255		; CHECK: [[C4:%[0-9]+]]:_(s32) = G_CONSTANT i32 255
; CHECK: [[ANYEXT:%[0-9]+]]:_(s32) = G_ANYEXT [[PHI]](s16)		; CHECK: [[ANYEXT:%[0-9]+]]:_(s32) = G_ANYEXT [[PHI]](s16)
; CHECK: [[AND:%[0-9]+]]:_(s32) = G_AND [[ANYEXT]], [[C4]]		; CHECK: [[AND:%[0-9]+]]:_(s32) = G_AND [[ANYEXT]], [[C4]]
; CHECK: [[C5:%[0-9]+]]:_(s32) = G_CONSTANT i32 255		; CHECK: [[C5:%[0-9]+]]:_(s32) = G_CONSTANT i32 255
; CHECK: [[ANYEXT1:%[0-9]+]]:_(s32) = G_ANYEXT [[PHI1]](s16)		; CHECK: [[ANYEXT1:%[0-9]+]]:_(s32) = G_ANYEXT [[PHI1]](s16)
; CHECK: [[AND1:%[0-9]+]]:_(s32) = G_AND [[ANYEXT1]], [[C5]]		; CHECK: [[AND1:%[0-9]+]]:_(s32) = G_AND [[ANYEXT1]], [[C5]]
; CHECK: [[ADD2:%[0-9]+]]:_(s32) = G_ADD [[C]]1, [[C]]2		; CHECK: [[ADD2:%[0-9]+]]:_(s32) = G_ADD [[AND]], [[AND1]]
; CHECK: $w0 = COPY [[C]]3(s32)		; CHECK: $w0 = COPY [[ADD2]](s32)
; CHECK: RET_ReallyLR implicit $w0		; CHECK: RET_ReallyLR implicit $w0
bb.0:		bb.0:
successors: %bb.1(0x40000000), %bb.2(0x40000000)		successors: %bb.1(0x40000000), %bb.2(0x40000000)
liveins: $w0		liveins: $w0
; Make sure that we correctly insert the new legalized G_PHI at the		; Make sure that we correctly insert the new legalized G_PHI at the
; correct location (ie make sure G_PHIs are the first insts in the BB).		; correct location (ie make sure G_PHIs are the first insts in the BB).


... (58 lines not shown) ...
registers:
- { id: 15, class: _, preferred-register: '' }		- { id: 15, class: _, preferred-register: '' }
liveins:		liveins:
body: |		body: |
; CHECK-LABEL: name: legalize_phi_diff_bb		; CHECK-LABEL: name: legalize_phi_diff_bb
; CHECK: bb.0:		; CHECK: bb.0:
; CHECK: successors: %bb.1(0x40000000), %bb.2(0x40000000)		; CHECK: successors: %bb.1(0x40000000), %bb.2(0x40000000)
; CHECK: liveins: $w0, $w1		; CHECK: liveins: $w0, $w1
; CHECK: [[COPY:%[0-9]+]]:_(s32) = COPY $w0		; CHECK: [[COPY:%[0-9]+]]:_(s32) = COPY $w0
		; CHECK: [[COPY1:%[0-9]+]]:_(s32) = COPY $w1
; CHECK: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 0		; CHECK: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 0
; CHECK: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 3		; CHECK: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 3
; CHECK: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 1		; CHECK: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 1
; CHECK: [[C3:%[0-9]+]]:_(s32) = G_CONSTANT i32 44		; CHECK: [[C3:%[0-9]+]]:_(s32) = G_CONSTANT i32 44
; CHECK: [[C4:%[0-9]+]]:_(s32) = G_CONSTANT i32 43		; CHECK: [[C4:%[0-9]+]]:_(s32) = G_CONSTANT i32 43
; CHECK: [[ICMP:%[0-9]+]]:_(s32) = G_ICMP intpred(ugt), [[COPY]](s32), [[C]]		; CHECK: [[ICMP:%[0-9]+]]:_(s32) = G_ICMP intpred(ugt), [[COPY]](s32), [[C]]
; CHECK: [[TRUNC:%[0-9]+]]:_(s1) = G_TRUNC [[ICMP]](s32)		; CHECK: [[TRUNC:%[0-9]+]]:_(s1) = G_TRUNC [[ICMP]](s32)
; CHECK: [[ADD:%[0-9]+]]:_(s32) = G_ADD [[COPY]], [[C1]]		; CHECK: [[ADD:%[0-9]+]]:_(s32) = G_ADD [[COPY]], [[C1]]
; CHECK: [[TRUNC1:%[0-9]+]]:_(s16) = G_TRUNC [[ADD]](s32)		; CHECK: [[TRUNC1:%[0-9]+]]:_(s16) = G_TRUNC [[ADD]](s32)
; CHECK: [[TRUNC2:%[0-9]+]]:_(s16) = G_TRUNC [[ADD]](s32)		; CHECK: [[TRUNC2:%[0-9]+]]:_(s16) = G_TRUNC [[ADD]](s32)
; CHECK: G_BRCOND [[TRUNC]](s1), %bb.1		; CHECK: G_BRCOND [[TRUNC]](s1), %bb.1
; CHECK: G_BR %bb.2		; CHECK: G_BR %bb.2
; CHECK: bb.1:		; CHECK: bb.1:
; CHECK: successors: %bb.2(0x40000000), %bb.1(0x40000000)		; CHECK: successors: %bb.2(0x40000000), %bb.1(0x40000000)
; CHECK: [[PHI:%[0-9]+]]:_(s16) = G_PHI [[TRUNC2]](s16), %bb.0, [[C]]2(s16), %bb.1		; CHECK: [[PHI:%[0-9]+]]:_(s16) = G_PHI [[TRUNC2]](s16), %bb.0, [[TRUNC5:%[0-9]+]](s16), %bb.1
; CHECK: [[TRUNC3:%[0-9]+]]:_(s8) = G_TRUNC [[PHI]](s16)		; CHECK: [[TRUNC3:%[0-9]+]]:_(s8) = G_TRUNC [[PHI]](s16)
; CHECK: [[C5:%[0-9]+]]:_(s32) = G_CONSTANT i32 255		; CHECK: [[C5:%[0-9]+]]:_(s32) = G_CONSTANT i32 255
; CHECK: [[ANYEXT:%[0-9]+]]:_(s32) = G_ANYEXT [[PHI]](s16)		; CHECK: [[ANYEXT:%[0-9]+]]:_(s32) = G_ANYEXT [[PHI]](s16)
; CHECK: [[AND:%[0-9]+]]:_(s32) = G_AND [[ANYEXT]], [[C5]]		; CHECK: [[AND:%[0-9]+]]:_(s32) = G_AND [[ANYEXT]], [[C5]]
; CHECK: [[ADD1:%[0-9]+]]:_(s32) = G_ADD [[AND]], [[C2]]		; CHECK: [[ADD1:%[0-9]+]]:_(s32) = G_ADD [[AND]], [[C2]]
; CHECK: [[ICMP1:%[0-9]+]]:_(s32) = G_ICMP intpred(ugt), [[ADD1]](s32), [[C3]]		; CHECK: [[ICMP1:%[0-9]+]]:_(s32) = G_ICMP intpred(ugt), [[ADD1]](s32), [[C3]]
; CHECK: [[TRUNC4:%[0-9]+]]:_(s1) = G_TRUNC [[ICMP1]](s32)		; CHECK: [[TRUNC4:%[0-9]+]]:_(s1) = G_TRUNC [[ICMP1]](s32)
; CHECK: [[COPY1:%[0-9]+]]:_(s16) = COPY [[PHI]](s16)		; CHECK: [[COPY2:%[0-9]+]]:_(s16) = COPY [[PHI]](s16)
; CHECK: [[TRUNC5:%[0-9]+]]:_(s16) = G_TRUNC [[C4]](s32)		; CHECK: [[TRUNC5]]:_(s16) = G_TRUNC [[C4]](s32)
; CHECK: G_BRCOND [[TRUNC4]](s1), %bb.2		; CHECK: G_BRCOND [[TRUNC4]](s1), %bb.2
; CHECK: G_BR %bb.1		; CHECK: G_BR %bb.1
; CHECK: bb.2:		; CHECK: bb.2:
; CHECK: [[PHI1:%[0-9]+]]:_(s16) = G_PHI [[COPY1]](s16), %bb.1, [[TRUNC1]](s16), %bb.0		; CHECK: [[PHI1:%[0-9]+]]:_(s16) = G_PHI [[COPY2]](s16), %bb.1, [[TRUNC1]](s16), %bb.0
; CHECK: [[C6:%[0-9]+]]:_(s32) = G_CONSTANT i32 255		; CHECK: [[C6:%[0-9]+]]:_(s32) = G_CONSTANT i32 255
; CHECK: [[ANYEXT1:%[0-9]+]]:_(s32) = G_ANYEXT [[PHI1]](s16)		; CHECK: [[ANYEXT1:%[0-9]+]]:_(s32) = G_ANYEXT [[PHI1]](s16)
; CHECK: [[AND1:%[0-9]+]]:_(s32) = G_AND [[C]]8, [[C]]7		; CHECK: [[AND1:%[0-9]+]]:_(s32) = G_AND [[ANYEXT1]], [[C6]]
; CHECK: $w0 = COPY [[AND1]](s32)		; CHECK: $w0 = COPY [[AND1]](s32)
; CHECK: RET_ReallyLR implicit $w0		; CHECK: RET_ReallyLR implicit $w0
bb.0:		bb.0:
successors: %bb.1(0x40000000), %bb.3(0x40000000)		successors: %bb.1(0x40000000), %bb.3(0x40000000)
liveins: $w0, $w1		liveins: $w0, $w1
; Make sure that we correctly legalize PHIs sharing common defs		; Make sure that we correctly legalize PHIs sharing common defs
; in different BBs.		; in different BBs.

... (32 lines not shown) ...


Diff 145806
