This is an archive of the discontinued LLVM Phabricator instance.

llvm/lib/Target/AArch64/AArch64ISelLowering.h
101	It is more than a rename but I can do that, it might even allow an easier path to lowering for scalable vector types. I do want the signed and unsigned variants to be consistent though. So, any objections to me creating a matching predicated zero_extend_inreg?

efriedma added inline comments.Aug 7 2020, 4:52 PM

llvm/lib/Target/AArch64/AArch64ISelLowering.h
101	That's fine. We actually have a function on SelectionDAG called getZeroExtendInReg(), so that isn't completely new terminology.

Changed new nodes to SIGN/ZERO_EXTEND_INREG_MERGE_PATHRU that closer resemble SIGN_EXTEND_INREG but using a predicate and passthru positioned as required by _MERGE_PATHRU style nodes.

paulwalker-arm retitled this revision from [SVE] Add ISD nodes for integer extend inreg operations. to [SVE] Add ISD nodes for predicated integer extend inreg operations.Aug 7 2020, 6:16 PM

Harbormaster completed remote builds in B67575: Diff 284108.Aug 7 2020, 6:44 PM

paulwalker-arm added a child revision: D85640: [SVE] Lower fixed length integer extend operations..Aug 10 2020, 5:19 AM

paulwalker-arm mentioned this in D85640: [SVE] Lower fixed length integer extend operations..Aug 10 2020, 5:22 AM

LGTM

This revision is now accepted and ready to land.Aug 10 2020, 10:31 AM

Closed by commit rGb6c7b7fa31b7: [SVE] Add ISD nodes for predicated integer extend inreg operations. (authored by paulwalker-arm). · Explain WhyAug 11 2020, 3:41 AM

This revision was automatically updated to reflect the committed changes.

paulwalker-arm added a commit: rGb6c7b7fa31b7: [SVE] Add ISD nodes for predicated integer extend inreg operations..

These patterns might need attention. ISD::SIGN_EXTEND_INREG expects both the input and output registers to have the same type, extending the small values in place. I.e. the input is unpacked.

But the AArch64ISD::SIGN_EXTEND_INREG_MERGE_PASSTHRU patterns are expecting to explicitly change the element size. I.e. the input is packed.

I have another patch that's needed to show the problem, but legalize can produce something like this:

    t78: v16i8 = BUILD_VECTOR t217, t216, t215, t214, t213, t212, t211, t210, t209, t208, t207, t206, t205, t204, t203, t202
  t80: v16i16 = any_extend t78
t82: v16i16 = sign_extend_inreg t80, ValueType:ch:v16i1

This revision is now accepted and ready to land.Aug 21 2020, 2:47 PM

I'm not sure I understand what you think the issue is here. ISD::SIGN_EXTEND_INREG, AArch64ISD::SIGN_EXTEND_INREG_MERGE_PASSTHRU, and the SVE sxtb instruction are all the same, as far as I know. The operand and result types are the same, and the operation in each lane is independent.

By contrast, ISD::ANY_EXTEND does change the element size.

Ah, you're right. I misread the register classes.

def : SVE_InReg_Extend<nxv8i16, op, nxv8i1, nxv8i8, !cast<Instruction>(NAME # _H)>;

I was reading nxv16i8 instead of nxv8i8 for the input operand. Since it's nxv8i8, they are unpacked.

I'll have to find a way to generate the nxv8i8 type, to cast the operand to it. Will take that to another Diff though...

Eh, thinking some more, it's still a little weird:

class SVE_InReg_Extend<ValueType vt, SDPatternOperator op, ValueType pt,
                       ValueType inreg_vt, Instruction inst>
: Pat<(vt (op pt:$Pg, vt:$Src, inreg_vt, vt:$PassThru)),
      (inst $PassThru, $Pg, $Src)>;

ISD::SIGN_EXTEND_INREG has the same type for the input and output reg, and an additional VT operand to encode the input type.

AArch64ISD::ZERO_EXTEND_INREG_MERGE_PASSTHRU encodes the input type in the register, and tosses the VT operand.

Maybe it would be better to lower based on the VT operand? That way we don't need to get into the half vector type business (e.g. nxv8i8).

Perhaps I've misunderstood but based on Cameron's original message I suspect he's hit a bug because I lowered all SIGN_EXTEND_INREG operations regardless of the inreg type. This is wrong because there's no patterns for non-power-of-2 non-byte-based inreg types and thus I guess Cameron has hit a selection failure?

If so then the fix is easy enough, we should expand those cases, which looking at the common code does the correct thing of replacing with shifts. I've create D86394, even if I have misunderstood it fixes a bug anyway.

D86394 addresses a different issue, I think. Posted some new code in D85364 to expose the AArch64ISD::ZERO_EXTEND_INREG_MERGE_PASSTHRU issue.

Revision Contents

Path

Size

llvm/

lib/

Target/

AArch64/

AArch64ISelLowering.h

5 lines

AArch64ISelLowering.cpp

39 lines

AArch64SVEInstrInfo.td

25 lines

SVEInstrFormats.td

17 lines

Diff 284638

llvm/lib/Target/AArch64/AArch64ISelLowering.h

Show First 20 Lines • Show All 85 Lines • ▼ Show 20 Lines	enum NodeType : unsigned {
SMIN_PRED,		SMIN_PRED,
SRA_PRED,		SRA_PRED,
SRL_PRED,		SRL_PRED,
SUB_PRED,		SUB_PRED,
UDIV_PRED,		UDIV_PRED,
UMAX_PRED,		UMAX_PRED,
UMIN_PRED,		UMIN_PRED,

		// Predicated instructions with the result of inactive lanes provided by the
		// last operand.
		SIGN_EXTEND_INREG_MERGE_PASSTHRU,
		ZERO_EXTEND_INREG_MERGE_PASSTHRU,

SETCC_MERGE_ZERO,		SETCC_MERGE_ZERO,

// Arithmetic instructions which write flags.		// Arithmetic instructions which write flags.
		efriedmaUnsubmitted Not Done Reply Inline Actions The signed versions are essentially predicated versions of ISD::SIGN_EXTEND_INREG? Can we use names that reflect that? efriedma: The signed versions are essentially predicated versions of ISD::SIGN_EXTEND_INREG? Can we use…
		paulwalker-armAuthorUnsubmitted Done Reply Inline Actions It is more than a rename but I can do that, it might even allow an easier path to lowering for scalable vector types. I do want the signed and unsigned variants to be consistent though. So, any objections to me creating a matching predicated zero_extend_inreg? paulwalker-arm: It is more than a rename but I can do that, it might even allow an easier path to lowering for…
		efriedmaUnsubmitted Not Done Reply Inline Actions That's fine. We actually have a function on SelectionDAG called getZeroExtendInReg(), so that isn't completely new terminology. efriedma: That's fine. We actually have a function on SelectionDAG called getZeroExtendInReg(), so that…
ADDS,		ADDS,
SUBS,		SUBS,
ADCS,		ADCS,
SBCS,		SBCS,
ANDS,		ANDS,

// Conditional compares. Operands: left,right,falsecc,cc,flags		// Conditional compares. Operands: left,right,falsecc,cc,flags
CCMP,		CCMP,
▲ Show 20 Lines • Show All 879 Lines • Show Last 20 Lines

llvm/lib/Target/AArch64/AArch64ISelLowering.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 1,411 Lines • ▼ Show 20 Lines	case AArch64ISD::FIRST_NUMBER:
MAKE_CASE(AArch64ISD::SMAX_PRED)		MAKE_CASE(AArch64ISD::SMAX_PRED)
MAKE_CASE(AArch64ISD::SMIN_PRED)		MAKE_CASE(AArch64ISD::SMIN_PRED)
MAKE_CASE(AArch64ISD::SRA_PRED)		MAKE_CASE(AArch64ISD::SRA_PRED)
MAKE_CASE(AArch64ISD::SRL_PRED)		MAKE_CASE(AArch64ISD::SRL_PRED)
MAKE_CASE(AArch64ISD::SUB_PRED)		MAKE_CASE(AArch64ISD::SUB_PRED)
MAKE_CASE(AArch64ISD::UDIV_PRED)		MAKE_CASE(AArch64ISD::UDIV_PRED)
MAKE_CASE(AArch64ISD::UMAX_PRED)		MAKE_CASE(AArch64ISD::UMAX_PRED)
MAKE_CASE(AArch64ISD::UMIN_PRED)		MAKE_CASE(AArch64ISD::UMIN_PRED)
		MAKE_CASE(AArch64ISD::SIGN_EXTEND_INREG_MERGE_PASSTHRU)
		MAKE_CASE(AArch64ISD::ZERO_EXTEND_INREG_MERGE_PASSTHRU)
MAKE_CASE(AArch64ISD::SETCC_MERGE_ZERO)		MAKE_CASE(AArch64ISD::SETCC_MERGE_ZERO)
MAKE_CASE(AArch64ISD::ADC)		MAKE_CASE(AArch64ISD::ADC)
MAKE_CASE(AArch64ISD::SBC)		MAKE_CASE(AArch64ISD::SBC)
MAKE_CASE(AArch64ISD::ADDS)		MAKE_CASE(AArch64ISD::ADDS)
MAKE_CASE(AArch64ISD::SUBS)		MAKE_CASE(AArch64ISD::SUBS)
MAKE_CASE(AArch64ISD::ADCS)		MAKE_CASE(AArch64ISD::ADCS)
MAKE_CASE(AArch64ISD::SBCS)		MAKE_CASE(AArch64ISD::SBCS)
MAKE_CASE(AArch64ISD::ANDS)		MAKE_CASE(AArch64ISD::ANDS)
▲ Show 20 Lines • Show All 1,866 Lines • ▼ Show 20 Lines	case Intrinsic::aarch64_sve_insr: {
EVT ScalarTy = Scalar.getValueType();		EVT ScalarTy = Scalar.getValueType();
if ((ScalarTy == MVT::i8) \|\| (ScalarTy == MVT::i16))		if ((ScalarTy == MVT::i8) \|\| (ScalarTy == MVT::i16))
Scalar = DAG.getNode(ISD::ANY_EXTEND, dl, MVT::i32, Scalar);		Scalar = DAG.getNode(ISD::ANY_EXTEND, dl, MVT::i32, Scalar);

return DAG.getNode(AArch64ISD::INSR, dl, Op.getValueType(),		return DAG.getNode(AArch64ISD::INSR, dl, Op.getValueType(),
Op.getOperand(1), Scalar);		Op.getOperand(1), Scalar);
}		}

		case Intrinsic::aarch64_sve_sxtb:
		return DAG.getNode(
		AArch64ISD::SIGN_EXTEND_INREG_MERGE_PASSTHRU, dl, Op.getValueType(),
		Op.getOperand(2), Op.getOperand(3),
		DAG.getValueType(Op.getValueType().changeVectorElementType(MVT::i8)),
		Op.getOperand(1));
		case Intrinsic::aarch64_sve_sxth:
		return DAG.getNode(
		AArch64ISD::SIGN_EXTEND_INREG_MERGE_PASSTHRU, dl, Op.getValueType(),
		Op.getOperand(2), Op.getOperand(3),
		DAG.getValueType(Op.getValueType().changeVectorElementType(MVT::i16)),
		Op.getOperand(1));
		case Intrinsic::aarch64_sve_sxtw:
		return DAG.getNode(
		AArch64ISD::SIGN_EXTEND_INREG_MERGE_PASSTHRU, dl, Op.getValueType(),
		Op.getOperand(2), Op.getOperand(3),
		DAG.getValueType(Op.getValueType().changeVectorElementType(MVT::i32)),
		Op.getOperand(1));
		case Intrinsic::aarch64_sve_uxtb:
		return DAG.getNode(
		AArch64ISD::ZERO_EXTEND_INREG_MERGE_PASSTHRU, dl, Op.getValueType(),
		Op.getOperand(2), Op.getOperand(3),
		DAG.getValueType(Op.getValueType().changeVectorElementType(MVT::i8)),
		Op.getOperand(1));
		case Intrinsic::aarch64_sve_uxth:
		return DAG.getNode(
		AArch64ISD::ZERO_EXTEND_INREG_MERGE_PASSTHRU, dl, Op.getValueType(),
		Op.getOperand(2), Op.getOperand(3),
		DAG.getValueType(Op.getValueType().changeVectorElementType(MVT::i16)),
		Op.getOperand(1));
		case Intrinsic::aarch64_sve_uxtw:
		return DAG.getNode(
		AArch64ISD::ZERO_EXTEND_INREG_MERGE_PASSTHRU, dl, Op.getValueType(),
		Op.getOperand(2), Op.getOperand(3),
		DAG.getValueType(Op.getValueType().changeVectorElementType(MVT::i32)),
		Op.getOperand(1));

case Intrinsic::localaddress: {		case Intrinsic::localaddress: {
const auto &MF = DAG.getMachineFunction();		const auto &MF = DAG.getMachineFunction();
const auto *RegInfo = Subtarget->getRegisterInfo();		const auto *RegInfo = Subtarget->getRegisterInfo();
unsigned Reg = RegInfo->getLocalAddressRegister(MF);		unsigned Reg = RegInfo->getLocalAddressRegister(MF);
return DAG.getCopyFromReg(DAG.getEntryNode(), dl, Reg,		return DAG.getCopyFromReg(DAG.getEntryNode(), dl, Reg,
Op.getSimpleValueType());		Op.getSimpleValueType());
}		}

▲ Show 20 Lines • Show All 12,023 Lines • Show Last 20 Lines

llvm/lib/Target/AArch64/AArch64SVEInstrInfo.td

Show First 20 Lines • Show All 158 Lines • ▼ Show 20 Lines
def AArch64orv_p : SDNode<"AArch64ISD::ORV_PRED", SDT_AArch64Reduce>;		def AArch64orv_p : SDNode<"AArch64ISD::ORV_PRED", SDT_AArch64Reduce>;
def AArch64eorv_p : SDNode<"AArch64ISD::EORV_PRED", SDT_AArch64Reduce>;		def AArch64eorv_p : SDNode<"AArch64ISD::EORV_PRED", SDT_AArch64Reduce>;
def AArch64andv_p : SDNode<"AArch64ISD::ANDV_PRED", SDT_AArch64Reduce>;		def AArch64andv_p : SDNode<"AArch64ISD::ANDV_PRED", SDT_AArch64Reduce>;
def AArch64lasta : SDNode<"AArch64ISD::LASTA", SDT_AArch64Reduce>;		def AArch64lasta : SDNode<"AArch64ISD::LASTA", SDT_AArch64Reduce>;
def AArch64lastb : SDNode<"AArch64ISD::LASTB", SDT_AArch64Reduce>;		def AArch64lastb : SDNode<"AArch64ISD::LASTB", SDT_AArch64Reduce>;

def SDT_AArch64Arith : SDTypeProfile<1, 3, [		def SDT_AArch64Arith : SDTypeProfile<1, 3, [
SDTCisVec<0>, SDTCisVec<1>, SDTCisVec<2>, SDTCisVec<3>,		SDTCisVec<0>, SDTCisVec<1>, SDTCisVec<2>, SDTCisVec<3>,
SDTCVecEltisVT<1,i1>, SDTCisSameAs<2,3>		SDTCVecEltisVT<1,i1>, SDTCisSameAs<0,2>, SDTCisSameAs<2,3>
]>;		]>;

def SDT_AArch64FMA : SDTypeProfile<1, 4, [		def SDT_AArch64FMA : SDTypeProfile<1, 4, [
SDTCisVec<0>, SDTCisVec<1>, SDTCisVec<2>, SDTCisVec<3>, SDTCisVec<4>,		SDTCisVec<0>, SDTCisVec<1>, SDTCisVec<2>, SDTCisVec<3>, SDTCisVec<4>,
SDTCVecEltisVT<1,i1>, SDTCisSameAs<2,3>, SDTCisSameAs<3,4>		SDTCVecEltisVT<1,i1>, SDTCisSameAs<0,2>, SDTCisSameAs<2,3>, SDTCisSameAs<3,4>
]>;		]>;

// Predicated operations with the result of inactive lanes being unspecified.		// Predicated operations with the result of inactive lanes being unspecified.
def AArch64add_p : SDNode<"AArch64ISD::ADD_PRED", SDT_AArch64Arith>;		def AArch64add_p : SDNode<"AArch64ISD::ADD_PRED", SDT_AArch64Arith>;
def AArch64asr_p : SDNode<"AArch64ISD::SRA_PRED", SDT_AArch64Arith>;		def AArch64asr_p : SDNode<"AArch64ISD::SRA_PRED", SDT_AArch64Arith>;
def AArch64fadd_p : SDNode<"AArch64ISD::FADD_PRED", SDT_AArch64Arith>;		def AArch64fadd_p : SDNode<"AArch64ISD::FADD_PRED", SDT_AArch64Arith>;
def AArch64fdiv_p : SDNode<"AArch64ISD::FDIV_PRED", SDT_AArch64Arith>;		def AArch64fdiv_p : SDNode<"AArch64ISD::FDIV_PRED", SDT_AArch64Arith>;
def AArch64fma_p : SDNode<"AArch64ISD::FMA_PRED", SDT_AArch64FMA>;		def AArch64fma_p : SDNode<"AArch64ISD::FMA_PRED", SDT_AArch64FMA>;
def AArch64fmul_p : SDNode<"AArch64ISD::FMUL_PRED", SDT_AArch64Arith>;		def AArch64fmul_p : SDNode<"AArch64ISD::FMUL_PRED", SDT_AArch64Arith>;
def AArch64fsub_p : SDNode<"AArch64ISD::FSUB_PRED", SDT_AArch64Arith>;		def AArch64fsub_p : SDNode<"AArch64ISD::FSUB_PRED", SDT_AArch64Arith>;
def AArch64lsl_p : SDNode<"AArch64ISD::SHL_PRED", SDT_AArch64Arith>;		def AArch64lsl_p : SDNode<"AArch64ISD::SHL_PRED", SDT_AArch64Arith>;
def AArch64lsr_p : SDNode<"AArch64ISD::SRL_PRED", SDT_AArch64Arith>;		def AArch64lsr_p : SDNode<"AArch64ISD::SRL_PRED", SDT_AArch64Arith>;
def AArch64mul_p : SDNode<"AArch64ISD::MUL_PRED", SDT_AArch64Arith>;		def AArch64mul_p : SDNode<"AArch64ISD::MUL_PRED", SDT_AArch64Arith>;
def AArch64sdiv_p : SDNode<"AArch64ISD::SDIV_PRED", SDT_AArch64Arith>;		def AArch64sdiv_p : SDNode<"AArch64ISD::SDIV_PRED", SDT_AArch64Arith>;
def AArch64smax_p : SDNode<"AArch64ISD::SMAX_PRED", SDT_AArch64Arith>;		def AArch64smax_p : SDNode<"AArch64ISD::SMAX_PRED", SDT_AArch64Arith>;
def AArch64smin_p : SDNode<"AArch64ISD::SMIN_PRED", SDT_AArch64Arith>;		def AArch64smin_p : SDNode<"AArch64ISD::SMIN_PRED", SDT_AArch64Arith>;
def AArch64sub_p : SDNode<"AArch64ISD::SUB_PRED", SDT_AArch64Arith>;		def AArch64sub_p : SDNode<"AArch64ISD::SUB_PRED", SDT_AArch64Arith>;
def AArch64udiv_p : SDNode<"AArch64ISD::UDIV_PRED", SDT_AArch64Arith>;		def AArch64udiv_p : SDNode<"AArch64ISD::UDIV_PRED", SDT_AArch64Arith>;
def AArch64umax_p : SDNode<"AArch64ISD::UMAX_PRED", SDT_AArch64Arith>;		def AArch64umax_p : SDNode<"AArch64ISD::UMAX_PRED", SDT_AArch64Arith>;
def AArch64umin_p : SDNode<"AArch64ISD::UMIN_PRED", SDT_AArch64Arith>;		def AArch64umin_p : SDNode<"AArch64ISD::UMIN_PRED", SDT_AArch64Arith>;

		def SDT_AArch64IntExtend : SDTypeProfile<1, 4, [
		SDTCisVec<0>, SDTCisVec<1>, SDTCisVec<2>, SDTCisVT<3, OtherVT>, SDTCisVec<4>,
		SDTCVecEltisVT<1,i1>, SDTCisSameAs<0,2>, SDTCisVTSmallerThanOp<3, 2>, SDTCisSameAs<0,4>
		]>;

		// Predicated operations with the result of inactive lanes provided by the last operand.
		def AArch64sxt_mt : SDNode<"AArch64ISD::SIGN_EXTEND_INREG_MERGE_PASSTHRU", SDT_AArch64IntExtend>;
		def AArch64uxt_mt : SDNode<"AArch64ISD::ZERO_EXTEND_INREG_MERGE_PASSTHRU", SDT_AArch64IntExtend>;

def SDT_AArch64ReduceWithInit : SDTypeProfile<1, 3, [SDTCisVec<1>, SDTCisVec<3>]>;		def SDT_AArch64ReduceWithInit : SDTypeProfile<1, 3, [SDTCisVec<1>, SDTCisVec<3>]>;
def AArch64clasta_n : SDNode<"AArch64ISD::CLASTA_N", SDT_AArch64ReduceWithInit>;		def AArch64clasta_n : SDNode<"AArch64ISD::CLASTA_N", SDT_AArch64ReduceWithInit>;
def AArch64clastb_n : SDNode<"AArch64ISD::CLASTB_N", SDT_AArch64ReduceWithInit>;		def AArch64clastb_n : SDNode<"AArch64ISD::CLASTB_N", SDT_AArch64ReduceWithInit>;
def AArch64fadda_p : SDNode<"AArch64ISD::FADDA_PRED", SDT_AArch64ReduceWithInit>;		def AArch64fadda_p : SDNode<"AArch64ISD::FADDA_PRED", SDT_AArch64ReduceWithInit>;

def SDT_AArch64Rev : SDTypeProfile<1, 1, [SDTCisVec<0>, SDTCisSameAs<0,1>]>;		def SDT_AArch64Rev : SDTypeProfile<1, 1, [SDTCisVec<0>, SDTCisSameAs<0,1>]>;
def AArch64rev : SDNode<"AArch64ISD::REV", SDT_AArch64Rev>;		def AArch64rev : SDNode<"AArch64ISD::REV", SDT_AArch64Rev>;

▲ Show 20 Lines • Show All 107 Lines • ▼ Show 20 Lines	let Predicates = [HasSVE] in {
defm UDIV_ZPZZ : sve_int_bin_pred_sd<AArch64udiv_p>;		defm UDIV_ZPZZ : sve_int_bin_pred_sd<AArch64udiv_p>;

defm SDOT_ZZZ : sve_intx_dot<0b0, "sdot", int_aarch64_sve_sdot>;		defm SDOT_ZZZ : sve_intx_dot<0b0, "sdot", int_aarch64_sve_sdot>;
defm UDOT_ZZZ : sve_intx_dot<0b1, "udot", int_aarch64_sve_udot>;		defm UDOT_ZZZ : sve_intx_dot<0b1, "udot", int_aarch64_sve_udot>;

defm SDOT_ZZZI : sve_intx_dot_by_indexed_elem<0b0, "sdot", int_aarch64_sve_sdot_lane>;		defm SDOT_ZZZI : sve_intx_dot_by_indexed_elem<0b0, "sdot", int_aarch64_sve_sdot_lane>;
defm UDOT_ZZZI : sve_intx_dot_by_indexed_elem<0b1, "udot", int_aarch64_sve_udot_lane>;		defm UDOT_ZZZI : sve_intx_dot_by_indexed_elem<0b1, "udot", int_aarch64_sve_udot_lane>;

defm SXTB_ZPmZ : sve_int_un_pred_arit_0_h<0b000, "sxtb", int_aarch64_sve_sxtb>;		defm SXTB_ZPmZ : sve_int_un_pred_arit_0_h<0b000, "sxtb", AArch64sxt_mt>;
defm UXTB_ZPmZ : sve_int_un_pred_arit_0_h<0b001, "uxtb", int_aarch64_sve_uxtb>;		defm UXTB_ZPmZ : sve_int_un_pred_arit_0_h<0b001, "uxtb", AArch64uxt_mt>;
defm SXTH_ZPmZ : sve_int_un_pred_arit_0_w<0b010, "sxth", int_aarch64_sve_sxth>;		defm SXTH_ZPmZ : sve_int_un_pred_arit_0_w<0b010, "sxth", AArch64sxt_mt>;
defm UXTH_ZPmZ : sve_int_un_pred_arit_0_w<0b011, "uxth", int_aarch64_sve_uxth>;		defm UXTH_ZPmZ : sve_int_un_pred_arit_0_w<0b011, "uxth", AArch64uxt_mt>;
defm SXTW_ZPmZ : sve_int_un_pred_arit_0_d<0b100, "sxtw", int_aarch64_sve_sxtw>;		defm SXTW_ZPmZ : sve_int_un_pred_arit_0_d<0b100, "sxtw", AArch64sxt_mt>;
defm UXTW_ZPmZ : sve_int_un_pred_arit_0_d<0b101, "uxtw", int_aarch64_sve_uxtw>;		defm UXTW_ZPmZ : sve_int_un_pred_arit_0_d<0b101, "uxtw", AArch64uxt_mt>;
defm ABS_ZPmZ : sve_int_un_pred_arit_0< 0b110, "abs", int_aarch64_sve_abs>;		defm ABS_ZPmZ : sve_int_un_pred_arit_0< 0b110, "abs", int_aarch64_sve_abs>;
defm NEG_ZPmZ : sve_int_un_pred_arit_0< 0b111, "neg", int_aarch64_sve_neg>;		defm NEG_ZPmZ : sve_int_un_pred_arit_0< 0b111, "neg", int_aarch64_sve_neg>;

defm CLS_ZPmZ : sve_int_un_pred_arit_1< 0b000, "cls", int_aarch64_sve_cls>;		defm CLS_ZPmZ : sve_int_un_pred_arit_1< 0b000, "cls", int_aarch64_sve_cls>;
defm CLZ_ZPmZ : sve_int_un_pred_arit_1< 0b001, "clz", int_aarch64_sve_clz>;		defm CLZ_ZPmZ : sve_int_un_pred_arit_1< 0b001, "clz", int_aarch64_sve_clz>;
defm CNT_ZPmZ : sve_int_un_pred_arit_1< 0b010, "cnt", int_aarch64_sve_cnt>;		defm CNT_ZPmZ : sve_int_un_pred_arit_1< 0b010, "cnt", int_aarch64_sve_cnt>;

let Predicates = [HasSVE, HasBF16] in {		let Predicates = [HasSVE, HasBF16] in {
▲ Show 20 Lines • Show All 2,302 Lines • Show Last 20 Lines

llvm/lib/Target/AArch64/SVEInstrFormats.td

This file is larger than 256 KB, so syntax highlighting is disabled by default.

	Show First 20 Lines • Show All 398 Lines • ▼ Show 20 Lines
	: Pat<(vtd (op vt1:$Op1)),			: Pat<(vtd (op vt1:$Op1)),
	(inst (IMPLICIT_DEF), (ptrue 31), $Op1)>;			(inst (IMPLICIT_DEF), (ptrue 31), $Op1)>;

	class SVE_2_Op_AllActive_Pat<ValueType vtd, SDPatternOperator op, ValueType vt1,			class SVE_2_Op_AllActive_Pat<ValueType vtd, SDPatternOperator op, ValueType vt1,
	ValueType vt2, Instruction inst, Instruction ptrue>			ValueType vt2, Instruction inst, Instruction ptrue>
	: Pat<(vtd (op vt1:$Op1, vt2:$Op2)),			: Pat<(vtd (op vt1:$Op1, vt2:$Op2)),
	(inst (ptrue 31), $Op1, $Op2)>;			(inst (ptrue 31), $Op1, $Op2)>;

				class SVE_InReg_Extend<ValueType vt, SDPatternOperator op, ValueType pt,
				ValueType inreg_vt, Instruction inst>
				: Pat<(vt (op pt:$Pg, vt:$Src, inreg_vt, vt:$PassThru)),
				(inst $PassThru, $Pg, $Src)>;

	//			//
	// Pseudo -> Instruction mappings			// Pseudo -> Instruction mappings
	//			//
	def getSVEPseudoMap : InstrMapping {			def getSVEPseudoMap : InstrMapping {
	let FilterClass = "SVEPseudo2Instr";			let FilterClass = "SVEPseudo2Instr";
	let RowFields = ["PseudoName"];			let RowFields = ["PseudoName"];
	let ColFields = ["IsInstr"];			let ColFields = ["IsInstr"];
	let KeyCol = ["0"];			let KeyCol = ["0"];
	▲ Show 20 Lines • Show All 3,268 Lines • ▼ Show 20 Lines
	}			}

	multiclass sve_int_un_pred_arit_0_h<bits<3> opc, string asm,			multiclass sve_int_un_pred_arit_0_h<bits<3> opc, string asm,
	SDPatternOperator op> {			SDPatternOperator op> {
	def _H : sve_int_un_pred_arit<0b01, { opc, 0b0 }, asm, ZPR16>;			def _H : sve_int_un_pred_arit<0b01, { opc, 0b0 }, asm, ZPR16>;
	def _S : sve_int_un_pred_arit<0b10, { opc, 0b0 }, asm, ZPR32>;			def _S : sve_int_un_pred_arit<0b10, { opc, 0b0 }, asm, ZPR32>;
	def _D : sve_int_un_pred_arit<0b11, { opc, 0b0 }, asm, ZPR64>;			def _D : sve_int_un_pred_arit<0b11, { opc, 0b0 }, asm, ZPR64>;

	def : SVE_3_Op_Pat<nxv8i16, op, nxv8i16, nxv8i1, nxv8i16, !cast<Instruction>(NAME # _H)>;			def : SVE_InReg_Extend<nxv8i16, op, nxv8i1, nxv8i8, !cast<Instruction>(NAME # _H)>;
	def : SVE_3_Op_Pat<nxv4i32, op, nxv4i32, nxv4i1, nxv4i32, !cast<Instruction>(NAME # _S)>;			def : SVE_InReg_Extend<nxv4i32, op, nxv4i1, nxv4i8, !cast<Instruction>(NAME # _S)>;
	def : SVE_3_Op_Pat<nxv2i64, op, nxv2i64, nxv2i1, nxv2i64, !cast<Instruction>(NAME # _D)>;			def : SVE_InReg_Extend<nxv2i64, op, nxv2i1, nxv2i8, !cast<Instruction>(NAME # _D)>;
	}			}

	multiclass sve_int_un_pred_arit_0_w<bits<3> opc, string asm,			multiclass sve_int_un_pred_arit_0_w<bits<3> opc, string asm,
	SDPatternOperator op> {			SDPatternOperator op> {
	def _S : sve_int_un_pred_arit<0b10, { opc, 0b0 }, asm, ZPR32>;			def _S : sve_int_un_pred_arit<0b10, { opc, 0b0 }, asm, ZPR32>;
	def _D : sve_int_un_pred_arit<0b11, { opc, 0b0 }, asm, ZPR64>;			def _D : sve_int_un_pred_arit<0b11, { opc, 0b0 }, asm, ZPR64>;

	def : SVE_3_Op_Pat<nxv4i32, op, nxv4i32, nxv4i1, nxv4i32, !cast<Instruction>(NAME # _S)>;			def : SVE_InReg_Extend<nxv4i32, op, nxv4i1, nxv4i16, !cast<Instruction>(NAME # _S)>;
	def : SVE_3_Op_Pat<nxv2i64, op, nxv2i64, nxv2i1, nxv2i64, !cast<Instruction>(NAME # _D)>;			def : SVE_InReg_Extend<nxv2i64, op, nxv2i1, nxv2i16, !cast<Instruction>(NAME # _D)>;
	}			}

	multiclass sve_int_un_pred_arit_0_d<bits<3> opc, string asm,			multiclass sve_int_un_pred_arit_0_d<bits<3> opc, string asm,
	SDPatternOperator op> {			SDPatternOperator op> {
	def _D : sve_int_un_pred_arit<0b11, { opc, 0b0 }, asm, ZPR64>;			def _D : sve_int_un_pred_arit<0b11, { opc, 0b0 }, asm, ZPR64>;

	def : SVE_3_Op_Pat<nxv2i64, op, nxv2i64, nxv2i1, nxv2i64, !cast<Instruction>(NAME # _D)>;			def : SVE_InReg_Extend<nxv2i64, op, nxv2i1, nxv2i32, !cast<Instruction>(NAME # _D)>;
	}			}

	multiclass sve_int_un_pred_arit_1<bits<3> opc, string asm,			multiclass sve_int_un_pred_arit_1<bits<3> opc, string asm,
	SDPatternOperator op> {			SDPatternOperator op> {
	def _B : sve_int_un_pred_arit<0b00, { opc, 0b1 }, asm, ZPR8>;			def _B : sve_int_un_pred_arit<0b00, { opc, 0b1 }, asm, ZPR8>;
	def _H : sve_int_un_pred_arit<0b01, { opc, 0b1 }, asm, ZPR16>;			def _H : sve_int_un_pred_arit<0b01, { opc, 0b1 }, asm, ZPR16>;
	def _S : sve_int_un_pred_arit<0b10, { opc, 0b1 }, asm, ZPR32>;			def _S : sve_int_un_pred_arit<0b10, { opc, 0b1 }, asm, ZPR32>;
	def _D : sve_int_un_pred_arit<0b11, { opc, 0b1 }, asm, ZPR64>;			def _D : sve_int_un_pred_arit<0b11, { opc, 0b1 }, asm, ZPR64>;
	▲ Show 20 Lines • Show All 4,148 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[SVE] Add ISD nodes for predicated integer extend inreg operationsClosedPublic

Details

Diff Detail