This is an archive of the discontinued LLVM Phabricator instance.

[POC][SVE] Allow code generation for fixed length vectorised loops [Patch 2/2].
AbandonedPublic

Authored by paulwalker-arm on Dec 20 2019, 6:46 AM.

Download Raw Diff

Details

Reviewers

rengolin
efriedma

Summary

No expectations for review at this stage unless you are super keen.

This extends the proof of concept introduced by https://reviews.llvm.org/D71760 to show the work required to custom lower the main fixed length vector operations. The general scheme being:

<n x ty> op (<n x ty> op1, <n x ty> op2...

becomes

pg = create_predicate_for(<n x ty>)
new_op1 = convertToSVE(op1)
new_op2 = convertToSVE(op2)
...
return convertFromSVE(sve_op(pg, new_op1, new_op2...

To keep the patch small I've reused the existing intrinsic isel rules. This provides a route to reasonable performance whilst allowing us to start work on immediate packing, condition code handling and better utilisation of reversed instructions and movprfx. The ultimately goal is that the first patch ensures everything can run, whilst this and subsequent patches improve performance.

WARNING: The more operations we lower the more existing code paths will need to be cleaned so they no longer assume that (isTypeLegal(VectorType) implies VectorType.getSizeInBits() >= 128). This can be seen in this patch which required a charge to one of the AArch64 DAGCombines.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

paulwalker-arm created this revision.Dec 20 2019, 6:46 AM

Herald added a reviewer: rengolin. · View Herald TranscriptDec 20 2019, 6:46 AM

Herald added a reviewer: efriedma. · View Herald Transcript

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: llvm-commits, psnobl, rkruppe and 3 others. · View Herald Transcript

amehsan added a subscriber: amehsan.Dec 23 2019, 6:40 AM

sdesmalen added a subscriber: sdesmalen.Jan 2 2020, 6:57 AM

rogfer01 added a subscriber: rogfer01.Jan 12 2020, 10:39 PM

cameron.mcinally added a subscriber: cameron.mcinally.Mar 19 2020, 8:12 AM

Herald added a subscriber: danielkiss. · View Herald TranscriptMar 19 2020, 8:12 AM

Is there any particular reason to lower ISD::AND on a fixed vector to an intrinsic, as opposed to simply lowering it to an ISD::AND on a scalable vector? The patterns should mostly be there to make it work.

Otherwise, the general approach seems reasonable.

Err, oops, ignore me; somehow I thought this was new.

This seems like a good approach.

Do we have any tests for it? I notice a simple fixed-width argument test blows up (unless I'm making a mistake on my side):

; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
; RUN: llc -mtriple=aarch64-linux-gnu -mattr=+sve -aarch64-isel-sve-vector-bits=512 < %s | FileCheck %s

define <16 x float> @fadd_float_512b(<16 x float> %a, <16 x float> %b) {
; CHECK-LABEL: fadd_float_512b:
; CHECK:       // %bb.0:
; CHECK-NEXT:    fadd z0.s, z0.s, z1.s
; CHECK-NEXT:    ret
  %fadd = fadd <16 x float> %a, %b
  ret <16 x float> %fadd
}

I have a somewhat hacky solution to the calling convention issues, which I'll share soon. Hoping for a better solution though, since mine is clunky...

The handling of larger than NEON fixed length vector function parameters is still up in the air. We cannot break existing NEON support so we're almost certainly going to need a new function attribute for fixed length SVE. For this reason and generally not knowing if the approach this patch is based on will be acceptable I did not prioritise writing tests, instead I focused on execution testing. I'll see about resolving the lack of tests albeit at this stage using pass by reference rather than value.

fhahn added a subscriber: fhahn.Mar 20 2020, 10:45 AM

Ah, ok. So you wouldn't see fixed-width vector arguments since the vectorizers works at the loop/block/etc level. That's what I misunderstood.

We have a pre-IR vectorizer and vectorize lib calls, so our needs are a little askew.

What about a scheme where we make the fixed-width vector types proper subregs of the scalable types? That way we can add the fixed-width types to the calling conventions and register classes. Would that be of interest you?

It would be similar to X86's XMM->YMM->ZMM registers (i.e. 128b->256b->512b), except that all of SVE's sub-registers would print the same register name. it's a little hacky, but might polish up ok.

I'll see about resolving the lack of tests albeit at this stage using pass by reference rather than value.

Thanks, but I'm fine without them for now. Up to you...

What about a scheme where we make the fixed-width vector types proper subregs of the scalable types? That way we can add the fixed-width types to the calling conventions and register classes. Would that be of interest you?

It would be similar to X86's XMM->YMM->ZMM registers (i.e. 128b->256b->512b), except that all of SVE's sub-registers would print the same register name. it's a little hacky, but might polish

The problem here is what fixed-width vector types would you use? The mapping would be dependent on the target. We did experiment with HwMode but it didn't work out. At this stage I think the simplest option is for the vectorised functions to use scalable argument and then provide a way to insert/extract fixed width vectors into/from them. During code generation these insert/extract operations will become _SUBVECTOR operations that should generate no code. This is akin to the general philosophy I have used for fixed width code generation.

In D71767#1954496, @paulwalker-arm wrote:

What about a scheme where we make the fixed-width vector types proper subregs of the scalable types? That way we can add the fixed-width types to the calling conventions and register classes. Would that be of interest you?

It would be similar to X86's XMM->YMM->ZMM registers (i.e. 128b->256b->512b), except that all of SVE's sub-registers would print the same register name. it's a little hacky, but might polish

The problem here is what fixed-width vector types would you use? The mapping would be dependent on the target.

I'm not sure if I'm following. If we don't add a vector type to a register class for a target, then it shouldn't be a problem, I think. We should be able to do that programmatically in ISel since we know the max vector width at compile time. E.g. Don't add the 1024b vector types to the ZPR register class for a 512b wide target (or we could create a ZPR and subreg hierarchy instead).

I've posted a back-of-the-envelope patch so we can look at something concrete, D77224. It definitely has some wart, but I think it could be workable going forward.

vkmr added a subscriber: vkmr.Apr 2 2020, 7:18 AM

dancgr added a subscriber: dancgr.May 28 2020, 10:39 AM

paulwalker-arm mentioned this in D83568: [SVE] Ensure fixed length vector fptrunc operations bigger than NEON are not considered legal..Jul 10 2020, 9:26 AM

Nothing to see here, just rebasing for those who want to experiment.

Herald added a subscriber: steven.zhang. · View Herald TranscriptJul 13 2020, 9:19 AM

paulwalker-arm planned changes to this revision.Jul 13 2020, 10:18 AM

Harbormaster completed remote builds in B63978: Diff 277458.Jul 13 2020, 10:38 AM

paulwalker-arm mentioned this in D85558: [SVE] Implement fixed-width ZEXT lowering.Aug 7 2020, 4:06 PM

paulwalker-arm updated this revision to Diff 284380.Aug 10 2020, 8:13 AM

Herald added a project: Restricted Project. · View Herald TranscriptAug 10 2020, 8:13 AM

Herald added a subscriber: cfe-commits. · View Herald Transcript

paulwalker-arm planned changes to this revision.Aug 10 2020, 8:13 AM

@cameron.mcinally this is the patch I mentioned the other day, which contains the nodes where once I've written suitable tests I'll push separate patches for. Anything else is fair game. This patch implements VSELECT but that was just to investigate what we talked about during the previous sync call so I'll ignore it if you're planning to push on with your original work?

Harbormaster completed remote builds in B67721: Diff 284380.Aug 10 2020, 8:58 AM

In D71767#2206947, @paulwalker-arm wrote:

@cameron.mcinally this is the patch I mentioned the other day, which contains the nodes where once I've written suitable tests I'll push separate patches for.

Thanks, Paul. You mentioned that you would be focusing on another project for a few weeks. Would it help if I attempted to cherry-pick some of this Diff into individual patches (with new tests) for you? Or would I be stepping on your toes too much?

Anything else is fair game. This patch implements VSELECT but that was just to investigate what we talked about during the previous sync call so I'll ignore it if you're planning to push on with your original work?

I can carry on with VSELECT. I think I have enough intuition built around it now. I haven't yet looked into the state of the unpack[lo|hi] instructions, but will do so today.

In D71767#2207158, @cameron.mcinally wrote:

In D71767#2206947, @paulwalker-arm wrote:

@cameron.mcinally this is the patch I mentioned the other day, which contains the nodes where once I've written suitable tests I'll push separate patches for.

Thanks, Paul. You mentioned that you would be focusing on another project for a few weeks. Would it help if I attempted to cherry-pick some of this Diff into individual patches (with new tests) for you? Or would I be stepping on your toes too much?

That would be great, thanks. I already have patches up for the extends and am currently focusing on setcc, sub and the shifts, which leaves min/max and divides. That said, one area I've not looked at yet are the VECREDUCE_ nodes. I don't anticipate them to be that problematic but having proof of this would be nice. Let me know what you decide.

MIN/MAX and DIVs are up my alley. I'll try those. Will check out VREDUCE if I get through the others.

cameron.mcinally mentioned this in D85744: [SVE] Lower fixed length FP minnum/maxnum.Aug 11 2020, 9:10 AM

cameron.mcinally mentioned this in rGce2c991061bf: [SVE] Lower fixed length FP minnum/maxnum.Aug 12 2020, 10:03 AM

rebase

paulwalker-arm planned changes to this revision.Aug 13 2020, 5:15 AM

Harbormaster completed remote builds in B68244: Diff 285336.Aug 13 2020, 6:08 AM

With the exception of VSELECT lowering, which is being worked under D85364, everything else is available in master.

Herald added a subscriber: ecnelises. · View Herald TranscriptSep 2 2020, 4:22 AM

Revision Contents

Path

Size

clang/

lib/

Driver/

ToolChains/

Clang.cpp

10 lines

llvm/

lib/

CodeGen/

SelectionDAG/

DAGCombiner.cpp

3 lines

Target/

AArch64/

AArch64ISelLowering.h

1 line

AArch64ISelLowering.cpp

34 lines

Diff 285336

clang/lib/Driver/ToolChains/Clang.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 1,718 Lines • ▼ Show 20 Lines	if (IndirectBranches)
CmdArgs.push_back("-mbranch-target-enforce");		CmdArgs.push_back("-mbranch-target-enforce");
}		}

// Handle -msve_vector_bits=<bits>		// Handle -msve_vector_bits=<bits>
if (Arg *A = Args.getLastArg(options::OPT_msve_vector_bits_EQ)) {		if (Arg *A = Args.getLastArg(options::OPT_msve_vector_bits_EQ)) {
StringRef Val = A->getValue();		StringRef Val = A->getValue();
const Driver &D = getToolChain().getDriver();		const Driver &D = getToolChain().getDriver();
if (Val.equals("128") \|\| Val.equals("256") \|\| Val.equals("512") \|\|		if (Val.equals("128") \|\| Val.equals("256") \|\| Val.equals("512") \|\|
Val.equals("1024") \|\| Val.equals("2048"))		Val.equals("1024") \|\| Val.equals("2048")) {
CmdArgs.push_back(		CmdArgs.push_back(
Args.MakeArgString(llvm::Twine("-msve-vector-bits=") + Val));		Args.MakeArgString(llvm::Twine("-msve-vector-bits=") + Val));

		CmdArgs.push_back("-mllvm");
		CmdArgs.push_back(
		Args.MakeArgString("-aarch64-sve-vector-bits-min=" + Val));
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - Args.MakeArgString("-aarch64-sve-vector-bits-min=" + Val)); + Args.MakeArgString("-aarch64-sve-vector-bits-min=" + Val)); Lint: Pre-merge checks: clang-format: please reformat the code ``` - Args.MakeArgString("-aarch64-sve-vector…
		// CmdArgs.push_back("-mllvm");
		// CmdArgs.push_back(
		// Args.MakeArgString("-aarch64-sve-vector-bits-max=" + Val));
		}
// Silently drop requests for vector-length agnostic code as it's implied.		// Silently drop requests for vector-length agnostic code as it's implied.
else if (!Val.equals("scalable"))		else if (!Val.equals("scalable"))
// Handle the unsupported values passed to msve-vector-bits.		// Handle the unsupported values passed to msve-vector-bits.
D.Diag(diag::err_drv_unsupported_option_argument)		D.Diag(diag::err_drv_unsupported_option_argument)
<< A->getOption().getName() << Val;		<< A->getOption().getName() << Val;
}		}
}		}

▲ Show 20 Lines • Show All 5,444 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 20,571 Lines • ▼ Show 20 Lines	SDValue DAGCombiner::visitINSERT_SUBVECTOR(SDNode *N) {
// -> bitcast(insert_subvector(v, s, c2))		// -> bitcast(insert_subvector(v, s, c2))
if ((N0.isUndef() \|\| N0.getOpcode() == ISD::BITCAST) &&		if ((N0.isUndef() \|\| N0.getOpcode() == ISD::BITCAST) &&
N1.getOpcode() == ISD::BITCAST) {		N1.getOpcode() == ISD::BITCAST) {
SDValue N0Src = peekThroughBitcasts(N0);		SDValue N0Src = peekThroughBitcasts(N0);
SDValue N1Src = peekThroughBitcasts(N1);		SDValue N1Src = peekThroughBitcasts(N1);
EVT N0SrcSVT = N0Src.getValueType().getScalarType();		EVT N0SrcSVT = N0Src.getValueType().getScalarType();
EVT N1SrcSVT = N1Src.getValueType().getScalarType();		EVT N1SrcSVT = N1Src.getValueType().getScalarType();
if ((N0.isUndef() \|\| N0SrcSVT == N1SrcSVT) &&		if ((N0.isUndef() \|\| N0SrcSVT == N1SrcSVT) &&
N0Src.getValueType().isVector() && N1Src.getValueType().isVector()) {		N0Src.getValueType().isFixedLengthVector() &&
		N1Src.getValueType().isFixedLengthVector()) {
EVT NewVT;		EVT NewVT;
SDLoc DL(N);		SDLoc DL(N);
SDValue NewIdx;		SDValue NewIdx;
LLVMContext &Ctx = *DAG.getContext();		LLVMContext &Ctx = *DAG.getContext();
unsigned NumElts = VT.getVectorNumElements();		unsigned NumElts = VT.getVectorNumElements();
unsigned EltSizeInBits = VT.getScalarSizeInBits();		unsigned EltSizeInBits = VT.getScalarSizeInBits();
if ((EltSizeInBits % N1SrcSVT.getSizeInBits()) == 0) {		if ((EltSizeInBits % N1SrcSVT.getSizeInBits()) == 0) {
unsigned Scale = EltSizeInBits / N1SrcSVT.getSizeInBits();		unsigned Scale = EltSizeInBits / N1SrcSVT.getSizeInBits();
▲ Show 20 Lines • Show All 1,605 Lines • Show Last 20 Lines

llvm/lib/Target/AArch64/AArch64ISelLowering.h

Show First 20 Lines • Show All 872 Lines • ▼ Show 20 Lines	private:
SDValue LowerToScalableOp(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerToScalableOp(SDValue Op, SelectionDAG &DAG) const;
SDValue LowerEXTRACT_SUBVECTOR(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerEXTRACT_SUBVECTOR(SDValue Op, SelectionDAG &DAG) const;
SDValue LowerINSERT_SUBVECTOR(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerINSERT_SUBVECTOR(SDValue Op, SelectionDAG &DAG) const;
SDValue LowerDIV(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerDIV(SDValue Op, SelectionDAG &DAG) const;
SDValue LowerMUL(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerMUL(SDValue Op, SelectionDAG &DAG) const;
SDValue LowerVectorSRA_SRL_SHL(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerVectorSRA_SRL_SHL(SDValue Op, SelectionDAG &DAG) const;
SDValue LowerShiftLeftParts(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerShiftLeftParts(SDValue Op, SelectionDAG &DAG) const;
SDValue LowerShiftRightParts(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerShiftRightParts(SDValue Op, SelectionDAG &DAG) const;
		SDValue LowerVSELECT(SDValue Op, SelectionDAG &DAG) const;
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for function 'LowerVSELECT' [readability-identifier-naming] not useful Lint: Pre-merge checks: clang-tidy: warning: invalid case style for function 'LowerVSELECT' [readability-identifier…
SDValue LowerVSETCC(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerVSETCC(SDValue Op, SelectionDAG &DAG) const;
SDValue LowerCTPOP(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerCTPOP(SDValue Op, SelectionDAG &DAG) const;
SDValue LowerF128Call(SDValue Op, SelectionDAG &DAG,		SDValue LowerF128Call(SDValue Op, SelectionDAG &DAG,
RTLIB::Libcall Call) const;		RTLIB::Libcall Call) const;
SDValue LowerFCOPYSIGN(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerFCOPYSIGN(SDValue Op, SelectionDAG &DAG) const;
SDValue LowerFP_EXTEND(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerFP_EXTEND(SDValue Op, SelectionDAG &DAG) const;
SDValue LowerFP_ROUND(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerFP_ROUND(SDValue Op, SelectionDAG &DAG) const;
SDValue LowerVectorFP_TO_INT(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerVectorFP_TO_INT(SDValue Op, SelectionDAG &DAG) const;
▲ Show 20 Lines • Show All 105 Lines • Show Last 20 Lines

llvm/lib/Target/AArch64/AArch64ISelLowering.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 1,104 Lines • ▼ Show 20 Lines	void AArch64TargetLowering::addTypeForFixedLengthSVE(MVT VT) {
setOperationAction(ISD::FSUB, VT, Custom);		setOperationAction(ISD::FSUB, VT, Custom);
setOperationAction(ISD::LOAD, VT, Custom);		setOperationAction(ISD::LOAD, VT, Custom);
setOperationAction(ISD::MUL, VT, Custom);		setOperationAction(ISD::MUL, VT, Custom);
setOperationAction(ISD::OR, VT, Custom);		setOperationAction(ISD::OR, VT, Custom);
setOperationAction(ISD::SETCC, VT, Custom);		setOperationAction(ISD::SETCC, VT, Custom);
setOperationAction(ISD::SHL, VT, Custom);		setOperationAction(ISD::SHL, VT, Custom);
setOperationAction(ISD::SIGN_EXTEND, VT, Custom);		setOperationAction(ISD::SIGN_EXTEND, VT, Custom);
setOperationAction(ISD::SIGN_EXTEND_INREG, VT, Custom);		setOperationAction(ISD::SIGN_EXTEND_INREG, VT, Custom);
		setOperationAction(ISD::SMAX, VT, Custom);
		setOperationAction(ISD::SMIN, VT, Custom);
setOperationAction(ISD::SRA, VT, Custom);		setOperationAction(ISD::SRA, VT, Custom);
setOperationAction(ISD::SRL, VT, Custom);		setOperationAction(ISD::SRL, VT, Custom);
setOperationAction(ISD::STORE, VT, Custom);		setOperationAction(ISD::STORE, VT, Custom);
setOperationAction(ISD::SUB, VT, Custom);		setOperationAction(ISD::SUB, VT, Custom);
setOperationAction(ISD::TRUNCATE, VT, Custom);		setOperationAction(ISD::TRUNCATE, VT, Custom);
		setOperationAction(ISD::UMAX, VT, Custom);
		setOperationAction(ISD::UMIN, VT, Custom);
		setOperationAction(ISD::VSELECT, VT, Custom);
setOperationAction(ISD::XOR, VT, Custom);		setOperationAction(ISD::XOR, VT, Custom);
setOperationAction(ISD::ZERO_EXTEND, VT, Custom);		setOperationAction(ISD::ZERO_EXTEND, VT, Custom);

		if (VT.getVectorElementType() == MVT::i32 \|\|
		VT.getVectorElementType() == MVT::i64) {
		setOperationAction(ISD::SDIV, VT, Custom);
		setOperationAction(ISD::UDIV, VT, Custom);
		}
}		}

void AArch64TargetLowering::addDRTypeForNEON(MVT VT) {		void AArch64TargetLowering::addDRTypeForNEON(MVT VT) {
addRegisterClass(VT, &AArch64::FPR64RegClass);		addRegisterClass(VT, &AArch64::FPR64RegClass);
addTypeForNEON(VT, MVT::v2i32);		addTypeForNEON(VT, MVT::v2i32);
}		}

void AArch64TargetLowering::addQRTypeForNEON(MVT VT) {		void AArch64TargetLowering::addQRTypeForNEON(MVT VT) {
▲ Show 20 Lines • Show All 2,577 Lines • ▼ Show 20 Lines	SDValue AArch64TargetLowering::LowerOperation(SDValue Op,
case ISD::AND:		case ISD::AND:
return LowerToScalableOp(Op, DAG);		return LowerToScalableOp(Op, DAG);
case ISD::SUB:		case ISD::SUB:
return LowerToPredicatedOp(Op, DAG, AArch64ISD::SUB_PRED);		return LowerToPredicatedOp(Op, DAG, AArch64ISD::SUB_PRED);
case ISD::FMAXNUM:		case ISD::FMAXNUM:
return LowerToPredicatedOp(Op, DAG, AArch64ISD::FMAXNM_PRED);		return LowerToPredicatedOp(Op, DAG, AArch64ISD::FMAXNM_PRED);
case ISD::FMINNUM:		case ISD::FMINNUM:
return LowerToPredicatedOp(Op, DAG, AArch64ISD::FMINNM_PRED);		return LowerToPredicatedOp(Op, DAG, AArch64ISD::FMINNM_PRED);
		case ISD::VSELECT:
		return LowerVSELECT(Op, DAG);
}		}
}		}

bool AArch64TargetLowering::useSVEForFixedLengthVectors() const {		bool AArch64TargetLowering::useSVEForFixedLengthVectors() const {
// Prefer NEON unless larger SVE registers are available.		// Prefer NEON unless larger SVE registers are available.
return Subtarget->hasSVE() && Subtarget->getMinSVEVectorSizeInBits() >= 256;		return Subtarget->hasSVE() && Subtarget->getMinSVEVectorSizeInBits() >= 256;
}		}

▲ Show 20 Lines • Show All 11,765 Lines • ▼ Show 20 Lines	SDValue AArch64TargetLowering::LowerFixedLengthVectorSetccToSVE(
EVT CmpVT = Pg.getValueType();		EVT CmpVT = Pg.getValueType();
SmallVector<SDValue, 4> CmpOps = {Pg, Op1, Op2, Op.getOperand(2)};		SmallVector<SDValue, 4> CmpOps = {Pg, Op1, Op2, Op.getOperand(2)};
auto Cmp = DAG.getNode(AArch64ISD::SETCC_MERGE_ZERO, DL, CmpVT, CmpOps);		auto Cmp = DAG.getNode(AArch64ISD::SETCC_MERGE_ZERO, DL, CmpVT, CmpOps);

EVT PromoteVT = ContainerVT.changeTypeToInteger();		EVT PromoteVT = ContainerVT.changeTypeToInteger();
auto Promote = DAG.getBoolExtOrTrunc(Cmp, DL, PromoteVT, InVT);		auto Promote = DAG.getBoolExtOrTrunc(Cmp, DL, PromoteVT, InVT);
return convertFromScalableVector(DAG, Op.getValueType(), Promote);		return convertFromScalableVector(DAG, Op.getValueType(), Promote);
}		}

		SDValue AArch64TargetLowering::LowerVSELECT(SDValue Op,
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for function 'LowerVSELECT' [readability-identifier-naming] not useful Lint: Pre-merge checks: clang-tidy: warning: invalid case style for function 'LowerVSELECT' [readability-identifier…
		SelectionDAG &DAG) const {
		SDLoc DL(Op);

		EVT InVT = Op.getOperand(1).getValueType();
		EVT ContainerVT = getContainerForFixedLengthVector(DAG, InVT);
		auto Op1 = convertToScalableVector(DAG, ContainerVT, Op.getOperand(1));
		auto Op2 = convertToScalableVector(DAG, ContainerVT, Op.getOperand(2));

		// Convert the mask to a predicated (NOTE: We don't need to worry about
		// inactive lanes since VSELECT is safe when given undefined elements).
		EVT MaskVT = Op.getOperand(0).getValueType();
		EVT MaskContainerVT = getContainerForFixedLengthVector(DAG, MaskVT);
		auto Mask = convertToScalableVector(DAG, MaskContainerVT, Op.getOperand(0));
		Mask = DAG.getNode(ISD::TRUNCATE, DL,
		MaskContainerVT.changeVectorElementType(MVT::i1), Mask);

		auto VSel = DAG.getNode(ISD::VSELECT, DL, ContainerVT, Mask, Op1, Op2);
		return convertFromScalableVector(DAG, InVT, VSel);
		}