Download Raw Diff

Details

Reviewers

aemerson
dzhidzhoev
arsenm
paquette
dmgreen
tschuett

Commits

rG0aaeb885326a: [AArch64][GlobalISel] Legalize <2 x s8> and <4 x s8> for G_BUILD_VECTOR

Summary

Refer to commit ccffc27, the remaining types <2 x s8> and <4 x s8> should
also be promoted to <2 x s32> and <4 x s16>.

Fixes https://github.com/llvm/llvm-project/issues/58274

Diff Detail

Unit TestsFailed

	Time	Test
	4,390 ms	x64 debian > AddressSanitizer-x86_64-linux.TestCases/Linux::auto_memory_profile_test.cpp

Event Timeline

Allen created this revision.Jun 20 2023, 7:01 PM

Herald added a project: Restricted Project. · View Herald TranscriptJun 20 2023, 7:01 PM

Herald added subscribers: hiraditya, kristof.beyls. · View Herald Transcript

Allen requested review of this revision.Jun 20 2023, 7:01 PM

Herald added a project: Restricted Project. · View Herald TranscriptJun 20 2023, 7:01 PM

Herald added subscribers: llvm-commits, wdng. · View Herald Transcript

Harbormaster completed remote builds in B240133: Diff 533100.Jun 20 2023, 8:05 PM

This test, whilst technically correct, doesn't look right. I don't think it can just generate a SUBREG_TO_REG in the same way it does for integer. Can you change the test to return <i16 0, i16 1>, and make sure the returned value would be the same as SDAG.

update test case according comment

Allen added a reviewer: dmgreen.Jun 20 2023, 8:51 PM

Harbormaster completed remote builds in B240144: Diff 533117.Jun 20 2023, 9:43 PM

tschuett added a subscriber: tschuett.Jun 25 2023, 12:43 AM

tschuett added inline comments.

llvm/lib/Target/AArch64/GISel/AArch64CallLowering.cpp
420	Could you instead query OldLLT and NewLLT whether they are `isScalar()`? Looks odd to query MVTs in GISel.

tschuett added inline comments.Jun 25 2023, 12:51 AM

llvm/lib/Target/AArch64/GISel/AArch64CallLowering.cpp
420	It is about integer and floats?

Allen added inline comments.Jun 27 2023, 5:48 AM

llvm/lib/Target/AArch64/GISel/AArch64CallLowering.cpp
420	Yes, It is about integer and floats (not about the scalar and vector).

If I'm reading the test correctly, then SDAG version is returning (as bytes) 0,0,0,0,1,0,0,0. i.e the v2i16 is promoted to a v2i32. The GlobalISel version is returning 0,0,1,0,0,0,0,0, as the v2i16 is widened to a v4i16.

I don't think we can have a difference in the calling convention between the two, it would mean they are ABI incompatible. Either they both need to change to widen (which looks like a lot of work, including an ABI break for SDAG), or the GISel code need return values in the same way as SDAG does.

llvm/lib/Target/AArch64/GISel/AArch64InstructionSelector.cpp
3230	I don't think this will handle vector extends correctly.

In D153394#4452589, @dmgreen wrote:

If I'm reading the test correctly, then SDAG version is returning (as bytes) 0,0,0,0,1,0,0,0. i.e the v2i16 is promoted to a v2i32. The GlobalISel version is returning 0,0,1,0,0,0,0,0, as the v2i16 is widened to a v4i16.

I don't think we can have a difference in the calling convention between the two, it would mean they are ABI incompatible. Either they both need to change to widen (which looks like a lot of work, including an ABI break for SDAG), or the GISel code need return values in the same way as SDAG does.

Thanks, I find it seems to be an optimization in SDAG version. So GISel version needs to do something like
SelectionDAG::FoldConstantArithmetic and TryToFoldExtendOfConstant for const vector constant ?

In D153394#4482586, @Allen wrote:

In D153394#4452589, @dmgreen wrote:

If I'm reading the test correctly, then SDAG version is returning (as bytes) 0,0,0,0,1,0,0,0. i.e the v2i16 is promoted to a v2i32. The GlobalISel version is returning 0,0,1,0,0,0,0,0, as the v2i16 is widened to a v4i16.

I don't think we can have a difference in the calling convention between the two, it would mean they are ABI incompatible. Either they both need to change to widen (which looks like a lot of work, including an ABI break for SDAG), or the GISel code need return values in the same way as SDAG does.

Thanks, I find it seems to be an optimization in SDAG version. So GISel version needs to do something like
SelectionDAG::FoldConstantArithmetic and TryToFoldExtendOfConstant for const vector constant ?

I believe that you are mixing optimizations with ABI. The SDAG ABI result has to win indepent of the inefficient GISel code.

Add tryToFoldExtendOfVectorConstant to adjust the ABI

Harbormaster completed remote builds in B244081: Diff 538557.Jul 10 2023, 3:30 AM

Allen edited the summary of this revision. (Show Details)Jul 10 2023, 3:51 AM

Can you add a MIR testcase?

llvm/lib/Target/AArch64/GISel/AArch64InstructionSelector.cpp
3212	Might be good to update this comment Scalar G_ANYEXT on bank...
5608	Comment?
5615	Could use a comment explaining that you're looking for G_BUILD_VECTORs with all constant source operands?
5634	emitConstantVector should return a nullptr on failure, right? So then we can save one LOC: // Try to replace ExtI with a constant vector. MachineInstr *MaybeCVec = emitConstantVector(ExtI.getOperand(0).getReg(), CV, MIB, MRI); if (MaybeCVec) ExtI.eraseFromParent(); return MaybeCVec;

address comment and add a new mir test llvm/test/CodeGen/AArch64/GlobalISel/select-neon-vector-const.mir

Allen marked 5 inline comments as done.Jul 10 2023, 7:57 PM

Allen added inline comments.

llvm/lib/Target/AArch64/GISel/AArch64InstructionSelector.cpp
3212	Done, thanks
3230	Thanks, add a new function tryToFoldExtendOfVectorConstant to handle this case.
5634	Thanks, apply your comment

Harbormaster completed remote builds in B244329: Diff 538896.Jul 10 2023, 9:12 PM

paquette added inline comments.Jul 10 2023, 11:11 PM

llvm/lib/Target/AArch64/GISel/AArch64InstructionSelector.cpp
5621	can you add a testcase that shows what happens when one of the G_BUILD_VECTOR sources is a constant? e.g %x = G_BUILD_VECTOR %constant, %not_a_constant
5638	can you add a test to the MIR testcase that shows what happens when `emitConstantVector` returns nullptr?
llvm/test/CodeGen/AArch64/GlobalISel/select-neon-vector-const.mir
3 ↗	(On Diff #538896)	you can delete the IR portion
17 ↗	(On Diff #538896)	you can delete the registers section
22 ↗	(On Diff #538896)	if you delete the IR section, then this will need to be renamed so that it does not reference the IR

Allen marked 3 inline comments as done.Jul 12 2023, 5:54 AM

Allen added inline comments.

llvm/lib/Target/AArch64/GISel/AArch64InstructionSelector.cpp
5621	it will crash before regbankselect, so it seems another independent issue, https://gcc.godbolt.org/z/e3Wq9Mdar, so I fire a issue https://github.com/llvm/llvm-project/issues/63826
5638	I add this function refer to above function tryOptConstantBuildVec, and I don't have the idea how to construct a constant that meets the scenario where returns null , do you have any suggestion ?

I haven't forgotten about this patch, I just need to find some time to look into this issue. In the mean time: as a rule, anything that's produced by the translator must be correct by itself, it can't rely on any optimizations to run in order to generate the correct code. It's easiest to check this by writing an additional MIR test for the irtranslator change. It should be clear from that test whether or not the change is correct. The optimizations, if needed and appropriate, can be a separate patch.

I'm glad to know about your plans，thank you for your time.

tschuett mentioned this in D155274: [GIsel][AArch64] extend legalization of G_INSERT_VECTOR_ELT.Jul 16 2023, 11:41 PM

So I had a look at this particular test case, and from what I can tell there's nothing we're doing wrong in the IRTranslator. lowerReturn() is correctly widening the <2 x i16> return type to <2 x i32>. This leaves the following MIR:

body:             |
  bb.1 (%ir-block.0):
    %1:_(s16) = G_CONSTANT i16 0
    %2:_(s16) = G_CONSTANT i16 1
    %0:_(<2 x s16>) = G_BUILD_VECTOR %1(s16), %2(s16)
    %3:_(<2 x s32>) = G_ANYEXT %0(<2 x s16>)
    $d0 = COPY %3(<2 x s32>)
    RET_ReallyLR implicit $d0

I think the problem is that G_BUILD_VECTOR of <2 x i16> needs to be widened to a supported type. Since this was a trivial change, I went ahead and did it in ccffc2705054

Thanks anyway for taking a look at it!

Thank you for your guidance

Allen abandoned this revision.Jul 21 2023, 1:44 AM

Allen updated this revision to Diff 542866.Jul 21 2023, 5:08 AM

Allen retitled this revision from [AArch64][GlobalISel] Selection support for v2s16 G_ANYEXT to [AArch64][GlobalISel] Legalize <2 x s16> and <4 x s8> for G_BUILD_VECTOR.

Allen edited the summary of this revision. (Show Details)

aemerson added inline comments.Jul 21 2023, 1:11 PM

llvm/lib/Target/AArch64/GISel/AArch64LegalizerInfo.cpp
723–729 ↗	(On Diff #542866)	Now that we're going to do this multiple times, I think it's worth factoring out the logic to make it easier to re-use. The underlying logic is I believe: "vectors must be at least 64 bits wide", right? I think we could make this easier by adding a new action/predicates in LegalizerInfo.h, so that we could do something like: .promoteVectoreEltsToVectorMinSize(0, 64) I think there are other places in this file that could also use this new action to simplify the code. P.S. please attach more context to your diffs (-U9999 works).

address comment, add new function promoteVectorEltsToVectorMinSize

Allen marked an inline comment as done.Jul 22 2023, 5:01 AM

Allen added inline comments.

llvm/lib/Target/AArch64/GISel/AArch64LegalizerInfo.cpp
723–729 ↗	(On Diff #542866)	thanks for your detail suggestion, apply your comment.

tschuett added inline comments.Jul 22 2023, 11:03 AM

llvm/include/llvm/CodeGen/GlobalISel/LegalizerInfo.h
944 ↗	(On Diff #543173)	Nice! s/Ty/VectorSize/ .

LGTM with a few nits. Thanks for working on this!

llvm/include/llvm/CodeGen/GlobalISel/LegalizerInfo.h
945 ↗	(On Diff #543173)	Sorry, the name I suggested didn't fit with the rest of the naming scheme. I think `widenVectorEltsToVectorMinSize` is better.
953 ↗	(On Diff #543173)	`LLT::isScalable()`

This revision is now accepted and ready to land.Jul 22 2023, 10:23 PM

address comments

Allen marked 3 inline comments as done.Jul 23 2023, 7:55 PM

Allen added inline comments.

llvm/include/llvm/CodeGen/GlobalISel/LegalizerInfo.h
944 ↗	(On Diff #543173)	Done, thanks
945 ↗	(On Diff #543173)	Done, thanks

This revision was landed with ongoing or failed builds.Jul 23 2023, 8:28 PM

Closed by commit rG0aaeb885326a: [AArch64][GlobalISel] Legalize <2 x s8> and <4 x s8> for G_BUILD_VECTOR (authored by Allen). · Explain Why

This revision was automatically updated to reflect the committed changes.

Allen marked 2 inline comments as done.

Allen added a commit: rG0aaeb885326a: [AArch64][GlobalISel] Legalize <2 x s8> and <4 x s8> for G_BUILD_VECTOR.

Harbormaster completed remote builds in B247545: Diff 543349.Jul 23 2023, 9:26 PM

GitHub <noreply@github.com> mentioned this in rGeaf23b2480a1: [GIsel][AArch64] Legalize <2 x i16> for G_INSERT_VECTOR_ELT (#65830).Sep 12 2023, 6:15 AM

Diff 533100

llvm/lib/Target/AArch64/GISel/AArch64CallLowering.cpp

	Show First 20 Lines • Show All 91 Lines • ▼ Show 20 Lines
	if (EVT(NewVT) != SplitEVTs[i]) {			if (EVT(NewVT) != SplitEVTs[i]) {
	unsigned ExtendOp = TargetOpcode::G_ANYEXT;			unsigned ExtendOp = TargetOpcode::G_ANYEXT;
	if (F.getAttributes().hasRetAttr(Attribute::SExt))			if (F.getAttributes().hasRetAttr(Attribute::SExt))
	ExtendOp = TargetOpcode::G_SEXT;			ExtendOp = TargetOpcode::G_SEXT;
	else if (F.getAttributes().hasRetAttr(Attribute::ZExt))			else if (F.getAttributes().hasRetAttr(Attribute::ZExt))
	ExtendOp = TargetOpcode::G_ZEXT;			ExtendOp = TargetOpcode::G_ZEXT;

	LLT NewLLT(NewVT);			LLT NewLLT(NewVT);
	LLT OldLLT(MVT::getVT(CurArgInfo.Ty));			MVT OldVT = MVT::getVT(CurArgInfo.Ty);
				LLT OldLLT(OldVT);
	CurArgInfo.Ty = EVT(NewVT).getTypeForEVT(Ctx);			CurArgInfo.Ty = EVT(NewVT).getTypeForEVT(Ctx);
	// Instead of an extend, we might have a vector type which needs			// Instead of an extend, we might have a vector type which needs
	// padding with more elements, e.g. <2 x half> -> <4 x half>.			// padding with more elements, e.g. <2 x half> -> <4 x half>.
	if (NewVT.isVector()) {			if (NewVT.isVector()) {
	if (OldLLT.isVector()) {			if (OldLLT.isVector()) {
	if (NewLLT.getNumElements() > OldLLT.getNumElements()) {			if (NewLLT.getNumElements() > OldLLT.getNumElements()) {
	// We don't handle VA types which are not exactly twice the			// We don't handle VA types which are not exactly twice the
	// size, but can easily be done in future.			// size, but can easily be done in future.
	if (NewLLT.getNumElements() != OldLLT.getNumElements() * 2) {			if (NewLLT.getNumElements() != OldLLT.getNumElements() * 2) {
	LLVM_DEBUG(dbgs() << "Outgoing vector ret has too many elts");			LLVM_DEBUG(dbgs() << "Outgoing vector ret has too many elts");
	return false;			return false;
	}			}
	auto Undef = MIRBuilder.buildUndef({OldLLT});			auto Undef = MIRBuilder.buildUndef({OldLLT});
	CurVReg =			CurVReg =
	MIRBuilder.buildMergeLikeInstr({NewLLT}, {CurVReg, Undef})			MIRBuilder.buildMergeLikeInstr({NewLLT}, {CurVReg, Undef})
	.getReg(0);			.getReg(0);
	} else {			} else if (OldVT.isInteger() && NewVT.isInteger()) {
				tschuettUnsubmitted Not Done Reply Inline Actions Could you instead query OldLLT and NewLLT whether they are `isScalar()`? Looks odd to query MVTs in GISel. tschuett: Could you instead query OldLLT and NewLLT whether they are `isScalar()`? Looks odd to query…
				tschuettUnsubmitted Not Done Reply Inline Actions It is about integer and floats? tschuett: It is about integer and floats?
				AllenAuthorUnsubmitted Done Reply Inline Actions Yes, It is about integer and floats (not about the scalar and vector). Allen: Yes, It is about integer and floats (not about the scalar and vector).
	// Just do a vector extend.			// Just do a vector extend.
	CurVReg = MIRBuilder.buildInstr(ExtendOp, {NewLLT}, {CurVReg})			CurVReg = MIRBuilder.buildInstr(ExtendOp, {NewLLT}, {CurVReg})
	.getReg(0);			.getReg(0);
				} else {
				LLVM_DEBUG(dbgs() << "Could not handle float type\n");
				return false;
	}			}
	} else if (NewLLT.getNumElements() == 2) {			} else if (NewLLT.getNumElements() == 2) {
	// We need to pad a <1 x S> type to <2 x S>. Since we don't have			// We need to pad a <1 x S> type to <2 x S>. Since we don't have
	// <1 x S> vector types in GISel we use a build_vector instead			// <1 x S> vector types in GISel we use a build_vector instead
	// of a vector merge/concat.			// of a vector merge/concat.
	auto Undef = MIRBuilder.buildUndef({OldLLT});			auto Undef = MIRBuilder.buildUndef({OldLLT});
	CurVReg =			CurVReg =
	MIRBuilder			MIRBuilder
	▲ Show 20 Lines • Show All 91 Lines • Show Last 20 Lines

llvm/lib/Target/AArch64/GISel/AArch64InstructionSelector.cpp

	Show First 20 Lines • Show All 91 Lines • ▼ Show 20 Lines
	case TargetOpcode::G_ANYEXT: {	case TargetOpcode::G_ANYEXT: {
	if (selectUSMovFromExtend(I, MRI))	if (selectUSMovFromExtend(I, MRI))
	return true;	return true;

	const Register DstReg = I.getOperand(0).getReg();	const Register DstReg = I.getOperand(0).getReg();
	const Register SrcReg = I.getOperand(1).getReg();	const Register SrcReg = I.getOperand(1).getReg();

	const RegisterBank &RBDst = *RBI.getRegBank(DstReg, MRI, TRI);	const RegisterBank &RBDst = *RBI.getRegBank(DstReg, MRI, TRI);
	if (RBDst.getID() != AArch64::GPRRegBankID) {	// The integer vector can be extend
		if (RBDst.getID() != AArch64::GPRRegBankID &&
		!MRI.getType(DstReg).isVector()) {
	LLVM_DEBUG(dbgs() << "G_ANYEXT on bank: " << RBDst	LLVM_DEBUG(dbgs() << "G_ANYEXT on bank: " << RBDst
	<< ", expected: GPR\n");	<< ", expected: GPR\n");
	return false;	return false;
	}	}

	const RegisterBank &RBSrc = *RBI.getRegBank(SrcReg, MRI, TRI);	const RegisterBank &RBSrc = *RBI.getRegBank(SrcReg, MRI, TRI);
	if (RBSrc.getID() != AArch64::GPRRegBankID) {	if (RBSrc.getID() != AArch64::GPRRegBankID &&
		!MRI.getType(SrcReg).isVector()) {
		paquetteUnsubmitted Done Reply Inline Actions Might be good to update this comment Scalar G_ANYEXT on bank... paquette: Might be good to update this comment Scalar G_ANYEXT on bank...
		AllenAuthorUnsubmitted Done Reply Inline Actions Done, thanks Allen: Done, thanks
	LLVM_DEBUG(dbgs() << "G_ANYEXT on bank: " << RBSrc	LLVM_DEBUG(dbgs() << "G_ANYEXT on bank: " << RBSrc
	<< ", expected: GPR\n");	<< ", expected: GPR\n");
	return false;	return false;
	}	}

	const unsigned DstSize = MRI.getType(DstReg).getSizeInBits();	const unsigned DstSize = MRI.getType(DstReg).getSizeInBits();

	if (DstSize == 0) {	if (DstSize == 0) {
	LLVM_DEBUG(dbgs() << "G_ANYEXT operand has no size, not a gvreg?\n");	LLVM_DEBUG(dbgs() << "G_ANYEXT operand has no size, not a gvreg?\n");
	return false;	return false;
	}	}

	if (DstSize != 64 && DstSize > 32) {	if (DstSize != 64 && DstSize > 32) {
	LLVM_DEBUG(dbgs() << "G_ANYEXT to size: " << DstSize	LLVM_DEBUG(dbgs() << "G_ANYEXT to size: " << DstSize
	<< ", expected: 32 or 64\n");	<< ", expected: 32 or 64\n");
	return false;	return false;
	}	}
	// At this point G_ANYEXT is just like a plain COPY, but we need	// At this point G_ANYEXT is just like a plain COPY, but we need
	// to explicitly form the 64-bit value if any.	// to explicitly form the 64-bit value if any.
	if (DstSize > 32) {	if (DstSize > 32) {
	Register ExtSrc = MRI.createVirtualRegister(&AArch64::GPR64allRegClass);	Register ExtSrc = MRI.createVirtualRegister(&AArch64::GPR64allRegClass);
	dmgreenUnsubmitted Done Reply Inline Actions I don't think this will handle vector extends correctly. dmgreen: I don't think this will handle vector extends correctly.
	AllenAuthorUnsubmitted Done Reply Inline Actions Thanks, add a new function tryToFoldExtendOfVectorConstant to handle this case. Allen: Thanks, add a new function tryToFoldExtendOfVectorConstant to handle this case.
	BuildMI(MBB, I, I.getDebugLoc(), TII.get(AArch64::SUBREG_TO_REG))	BuildMI(MBB, I, I.getDebugLoc(), TII.get(AArch64::SUBREG_TO_REG))
	.addDef(ExtSrc)	.addDef(ExtSrc)
	.addImm(0)	.addImm(0)
	.addUse(SrcReg)	.addUse(SrcReg)
	.addImm(AArch64::sub_32);	.addImm(AArch64::sub_32);
	I.getOperand(1).setReg(ExtSrc);	I.getOperand(1).setReg(ExtSrc);
	}	}
	return selectCopy(I, TII, MRI, TRI, RBI);	return selectCopy(I, TII, MRI, TRI, RBI);
	▲ Show 20 Lines • Show All 61 Lines • ▼ Show 20 Lines

	if (!RBI.constrainGenericRegister(DefReg, AArch64::GPR64RegClass,	if (!RBI.constrainGenericRegister(DefReg, AArch64::GPR64RegClass,
	MRI)) {	MRI)) {
	LLVM_DEBUG(dbgs() << "Failed to constrain G_ZEXT destination\n");	LLVM_DEBUG(dbgs() << "Failed to constrain G_ZEXT destination\n");
	return false;	return false;
	}	}

	if (!RBI.constrainGenericRegister(SrcReg, AArch64::GPR32RegClass,	if (!RBI.constrainGenericRegister(SrcReg, AArch64::GPR32RegClass,
	MRI)) {	MRI)) {
Context not available.
		paquetteUnsubmitted Done Reply Inline Actions emitConstantVector should return a nullptr on failure, right? So then we can save one LOC: // Try to replace ExtI with a constant vector. MachineInstr MaybeCVec = emitConstantVector(ExtI.getOperand(0).getReg(), CV, MIB, MRI); if (MaybeCVec) ExtI.eraseFromParent(); return MaybeCVec; paquette:* emitConstantVector should return a nullptr on failure, right? So then we can save one LOC…
		AllenAuthorUnsubmitted Done Reply Inline Actions Thanks, apply your comment Allen: Thanks, apply your comment
		paquetteUnsubmitted Done Reply Inline Actions Comment? paquette: Comment?
		paquetteUnsubmitted Done Reply Inline Actions Could use a comment explaining that you're looking for G_BUILD_VECTORs with all constant source operands? paquette: Could use a comment explaining that you're looking for G_BUILD_VECTORs with all constant source…
		paquetteUnsubmitted Not Done Reply Inline Actions can you add a test to the MIR testcase that shows what happens when `emitConstantVector` returns nullptr? paquette: can you add a test to the MIR testcase that shows what happens when `emitConstantVector`…
		AllenAuthorUnsubmitted Done Reply Inline Actions I add this function refer to above function tryOptConstantBuildVec, and I don't have the idea how to construct a constant that meets the scenario where returns null , do you have any suggestion ? Allen: I add this function refer to above function tryOptConstantBuildVec, and I don't have the idea…
		paquetteUnsubmitted Not Done Reply Inline Actions can you add a testcase that shows what happens when one of the G_BUILD_VECTOR sources is a constant? e.g %x = G_BUILD_VECTOR %constant, %not_a_constant paquette: can you add a testcase that shows what happens when one of the G_BUILD_VECTOR sources is a…
		AllenAuthorUnsubmitted Done Reply Inline Actions it will crash before regbankselect, so it seems another independent issue, https://gcc.godbolt.org/z/e3Wq9Mdar, so I fire a issue https://github.com/llvm/llvm-project/issues/63826 Allen: it will crash before regbankselect, so it seems another independent issue, https://gcc.godbolt.

llvm/test/CodeGen/AArch64/extract-sext-zext.ll

	Show First 20 Lines • Show All 91 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: eor x0, x8, x8, lsl #56	; CHECK-NEXT: eor x0, x8, x8, lsl #56
	; CHECK-NEXT: ret	; CHECK-NEXT: ret
	%e = extractelement <8 x i8> %x, i64 2	%e = extractelement <8 x i8> %x, i64 2
	%s = sext i8 %e to i64	%s = sext i8 %e to i64
	%t = shl i64 %s, 56	%t = shl i64 %s, 56
	%u = xor i64 %s, %t	%u = xor i64 %s, %t
	ret i64 %u	ret i64 %u
	}	}

		define <2 x i16> @extend_v2i16() {
		; CHECK-ISEL-LABEL: extend_v2i16:
		; CHECK-ISEL: // %bb.0:
		; CHECK-ISEL-NEXT: movi v0.2d, #0000000000000000
		; CHECK-ISEL-NEXT: ret
		;
		; CHECK-GLOBAL-LABEL: extend_v2i16:
		; CHECK-GLOBAL: // %bb.0:
		; CHECK-GLOBAL-NEXT: adrp x8, .LCPI42_0
		; CHECK-GLOBAL-NEXT: ldr s0, [x8, :lo12:.LCPI42_0]
		; CHECK-GLOBAL-NEXT: fmov w0, s0
		; CHECK-GLOBAL-NEXT: fmov d0, x0
		; CHECK-GLOBAL-NEXT: ret
		ret <2 x i16> <i16 0, i16 0>
		}
Context not available.

This is an archive of the discontinued LLVM Phabricator instance.

[AArch64][GlobalISel] Legalize <2 x s8> and <4 x s8> for G_BUILD_VECTOR
ClosedPublic

Details

Diff Detail

Unit TestsFailed

Event Timeline

Revision Contents

Diff 533100

llvm/lib/Target/AArch64/GISel/AArch64CallLowering.cpp

llvm/lib/Target/AArch64/GISel/AArch64InstructionSelector.cpp

llvm/test/CodeGen/AArch64/extract-sext-zext.ll

This is an archive of the discontinued LLVM Phabricator instance.

[AArch64][GlobalISel] Legalize <2 x s8> and <4 x s8> for G_BUILD_VECTORClosedPublic

Details

Diff Detail

Unit TestsFailed

Event Timeline

Revision Contents

Diff 533100

llvm/lib/Target/AArch64/GISel/AArch64CallLowering.cpp

llvm/lib/Target/AArch64/GISel/AArch64InstructionSelector.cpp

llvm/test/CodeGen/AArch64/extract-sext-zext.ll

[AArch64][GlobalISel] Legalize <2 x s8> and <4 x s8> for G_BUILD_VECTOR
ClosedPublic