Details

Reviewers

aemerson
dzhidzhoev
arsenm
paquette
dmgreen
tschuett

Commits

rG0aaeb885326a: [AArch64][GlobalISel] Legalize <2 x s8> and <4 x s8> for G_BUILD_VECTOR

Summary

Following commit ccffc27, the remaining types <2 x s8> and <4 x s8> should
also be promoted, to <2 x s32> and <4 x s16> respectively.

Fixes https://github.com/llvm/llvm-project/issues/58274

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

Allen created this revision. Jun 20 2023, 7:01 PM

Herald added a project: Restricted Project. Jun 20 2023, 7:01 PM

Herald added subscribers: hiraditya, kristof.beyls.

Allen requested review of this revision. Jun 20 2023, 7:01 PM

Herald added a project: Restricted Project. Jun 20 2023, 7:01 PM

Herald added subscribers: llvm-commits, wdng.

Harbormaster completed remote builds in B240133: Diff 533100. Jun 20 2023, 8:05 PM

This test, whilst technically correct, doesn't look right. I don't think it can just generate a SUBREG_TO_REG in the same way it does for integer. Can you change the test to return <i16 0, i16 1>, and make sure the returned value would be the same as SDAG.

Update the test case according to the comment.

Allen added a reviewer: dmgreen. Jun 20 2023, 8:51 PM

Harbormaster completed remote builds in B240144: Diff 533117. Jun 20 2023, 9:43 PM

tschuett added a subscriber: tschuett. Jun 25 2023, 12:43 AM

tschuett added inline comments.

llvm/lib/Target/AArch64/GISel/AArch64CallLowering.cpp
420 ↗	(On Diff #533117)	Could you instead query OldLLT and NewLLT whether they are `isScalar()`? Looks odd to query MVTs in GISel.

tschuett added inline comments. Jun 25 2023, 12:51 AM

llvm/lib/Target/AArch64/GISel/AArch64CallLowering.cpp
420 ↗	(On Diff #533117)	It is about integer and floats?

Allen added inline comments. Jun 27 2023, 5:48 AM

llvm/lib/Target/AArch64/GISel/AArch64CallLowering.cpp
420 ↗	(On Diff #533117)	Yes, it is about integers and floats (not about scalars and vectors).

If I'm reading the test correctly, then SDAG version is returning (as bytes) 0,0,0,0,1,0,0,0. i.e the v2i16 is promoted to a v2i32. The GlobalISel version is returning 0,0,1,0,0,0,0,0, as the v2i16 is widened to a v4i16.

I don't think we can have a difference in the calling convention between the two; it would mean they are ABI incompatible. Either they both need to change to widen (which looks like a lot of work, including an ABI break for SDAG), or the GISel code needs to return values in the same way as SDAG does.
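
To make the two byte layouts above concrete, here is a small standalone C++ sketch (not LLVM code). The helper names `promoteV2I16ToV2I32` and `widenV2I16ToV4I16` are hypothetical, little-endian lane order is assumed, and the extended bits and padding lanes, which are undefined in practice, are modeled as zero for illustration.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Append the low `bytes` bytes of `v` in little-endian order.
static void putLE(std::vector<uint8_t> &out, uint64_t v, unsigned bytes) {
  for (unsigned i = 0; i < bytes; ++i)
    out.push_back(static_cast<uint8_t>((v >> (8 * i)) & 0xff));
}

// Promotion: keep two lanes, make each lane 32 bits wide (<2 x i32>).
// Assumption: the extended high bits are modeled as zero.
std::vector<uint8_t> promoteV2I16ToV2I32(uint16_t e0, uint16_t e1) {
  std::vector<uint8_t> bytes;
  putLE(bytes, e0, 4);
  putLE(bytes, e1, 4);
  return bytes;
}

// Widening: keep 16-bit lanes, pad with two extra lanes (<4 x i16>).
// Assumption: the padding lanes are modeled as zero.
std::vector<uint8_t> widenV2I16ToV4I16(uint16_t e0, uint16_t e1) {
  std::vector<uint8_t> bytes;
  putLE(bytes, e0, 2);
  putLE(bytes, e1, 2);
  putLE(bytes, 0, 2);
  putLE(bytes, 0, 2);
  return bytes;
}
```

For the vector <i16 0, i16 1>, the first helper yields 0,0,0,0,1,0,0,0 and the second yields 0,0,1,0,0,0,0,0, so the second element lands at a different byte offset in the 64-bit register depending on the strategy.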

llvm/lib/Target/AArch64/GISel/AArch64InstructionSelector.cpp
3230 ↗	(On Diff #533117)	I don't think this will handle vector extends correctly.

In D153394#4452589, @dmgreen wrote:

If I'm reading the test correctly, then SDAG version is returning (as bytes) 0,0,0,0,1,0,0,0. i.e the v2i16 is promoted to a v2i32. The GlobalISel version is returning 0,0,1,0,0,0,0,0, as the v2i16 is widened to a v4i16.

I don't think we can have a difference in the calling convention between the two, it would mean they are ABI incompatible. Either they both need to change to widen (which looks like a lot of work, including an ABI break for SDAG), or the GISel code need return values in the same way as SDAG does.

Thanks. It seems to be an optimization in the SDAG version. So does the GISel version need to do something like
SelectionDAG::FoldConstantArithmetic and TryToFoldExtendOfConstant for constant vectors?

In D153394#4482586, @Allen wrote:

Thanks, I find it seems to be an optimization in SDAG version. So GISel version needs to do something like
SelectionDAG::FoldConstantArithmetic and TryToFoldExtendOfConstant for const vector constant ?

I believe that you are mixing up optimizations with ABI. The SDAG ABI result has to win, independent of the inefficient GISel code.

Add tryToFoldExtendOfVectorConstant to adjust the ABI

Harbormaster completed remote builds in B244081: Diff 538557. Jul 10 2023, 3:30 AM

Allen edited the summary of this revision. Jul 10 2023, 3:51 AM

Can you add a MIR testcase?

llvm/lib/Target/AArch64/GISel/AArch64InstructionSelector.cpp
3212 ↗	(On Diff #538557)	Might be good to update this comment Scalar G_ANYEXT on bank...
5608 ↗	(On Diff #538557)	Comment?
5615 ↗	(On Diff #538557)	Could use a comment explaining that you're looking for G_BUILD_VECTORs with all constant source operands?
5634 ↗	(On Diff #538557)	emitConstantVector should return a nullptr on failure, right? So then we can save one LOC:

    // Try to replace ExtI with a constant vector.
    MachineInstr *MaybeCVec = emitConstantVector(ExtI.getOperand(0).getReg(), CV, MIB, MRI);
    if (MaybeCVec)
      ExtI.eraseFromParent();
    return MaybeCVec;
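
The "all constant source operands" check requested above can be illustrated with a standalone C++ sketch. This is not the patch's code: `Source` and `collectConstantLanes` are hypothetical stand-ins, with an empty `std::optional` playing the role of a non-constant G_BUILD_VECTOR operand and of the nullptr-on-failure convention.

```cpp
#include <cassert>
#include <cstdint>
#include <optional>
#include <vector>

// Hypothetical model of a G_BUILD_VECTOR source operand: a known constant,
// or std::nullopt when the operand is not a constant.
using Source = std::optional<int64_t>;

// Returns the constant lanes only if every source is a constant.
// Any non-constant source makes the whole fold fail (std::nullopt),
// mirroring a "return nullptr on failure" helper.
std::optional<std::vector<int64_t>>
collectConstantLanes(const std::vector<Source> &sources) {
  std::vector<int64_t> lanes;
  lanes.reserve(sources.size());
  for (const Source &src : sources) {
    if (!src)
      return std::nullopt; // one non-constant operand: give up
    lanes.push_back(*src);
  }
  return lanes;
}
```

With this shape the caller only needs one check on the returned value, and a build vector that mixes a constant with a non-constant operand is simply left alone.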

Address comments and add a new MIR test, llvm/test/CodeGen/AArch64/GlobalISel/select-neon-vector-const.mir

Allen marked 5 inline comments as done. Jul 10 2023, 7:57 PM

Allen added inline comments.

llvm/lib/Target/AArch64/GISel/AArch64InstructionSelector.cpp
3212 ↗	(On Diff #538557)	Done, thanks
5634 ↗	(On Diff #538557)	Thanks, applied your suggestion.
3230 ↗	(On Diff #533117)	Thanks, added a new function tryToFoldExtendOfVectorConstant to handle this case.

Harbormaster completed remote builds in B244329: Diff 538896. Jul 10 2023, 9:12 PM

paquette added inline comments. Jul 10 2023, 11:11 PM

llvm/lib/Target/AArch64/GISel/AArch64InstructionSelector.cpp
5621 ↗	(On Diff #538896)	can you add a testcase that shows what happens when one of the G_BUILD_VECTOR sources is a constant? e.g %x = G_BUILD_VECTOR %constant, %not_a_constant
5638 ↗	(On Diff #538896)	can you add a test to the MIR testcase that shows what happens when `emitConstantVector` returns nullptr?
llvm/test/CodeGen/AArch64/GlobalISel/select-neon-vector-const.mir
3 ↗	(On Diff #538896)	you can delete the IR portion
17 ↗	(On Diff #538896)	you can delete the registers section
22 ↗	(On Diff #538896)	if you delete the IR section, then this will need to be renamed so that it does not reference the IR

Allen marked 3 inline comments as done. Jul 12 2023, 5:54 AM

Allen added inline comments.

llvm/lib/Target/AArch64/GISel/AArch64InstructionSelector.cpp
5621 ↗	(On Diff #538896)	It crashes before regbankselect, so it seems to be a separate, independent issue (https://gcc.godbolt.org/z/e3Wq9Mdar). I filed https://github.com/llvm/llvm-project/issues/63826 for it.
5638 ↗	(On Diff #538896)	I wrote this function by following tryOptConstantBuildVec above, and I don't know how to construct a constant for which it returns null. Do you have any suggestions?

I haven't forgotten about this patch, I just need to find some time to look into this issue. In the meantime: as a rule, anything that's produced by the translator must be correct by itself; it can't rely on any optimizations running in order to generate correct code. It's easiest to check this by writing an additional MIR test for the irtranslator change. It should be clear from that test whether or not the change is correct. The optimizations, if needed and appropriate, can be a separate patch.

I'm glad to know about your plans, thank you for your time.

tschuett mentioned this in D155274: [GIsel][AArch64] extend legalization of G_INSERT_VECTOR_ELT. Jul 16 2023, 11:41 PM

So I had a look at this particular test case, and from what I can tell there's nothing we're doing wrong in the IRTranslator. lowerReturn() is correctly widening the <2 x i16> return type to <2 x i32>. This leaves the following MIR:

body:             |
  bb.1 (%ir-block.0):
    %1:_(s16) = G_CONSTANT i16 0
    %2:_(s16) = G_CONSTANT i16 1
    %0:_(<2 x s16>) = G_BUILD_VECTOR %1(s16), %2(s16)
    %3:_(<2 x s32>) = G_ANYEXT %0(<2 x s16>)
    $d0 = COPY %3(<2 x s32>)
    RET_ReallyLR implicit $d0

I think the problem is that G_BUILD_VECTOR of <2 x i16> needs to be widened to a supported type. Since this was a trivial change, I went ahead and did it in ccffc2705054

Thanks anyway for taking a look at it!

Thank you for your guidance

Allen abandoned this revision. Jul 21 2023, 1:44 AM

Allen updated this revision to Diff 542866. Jul 21 2023, 5:08 AM

Allen retitled this revision from [AArch64][GlobalISel] Selection support for v2s16 G_ANYEXT to [AArch64][GlobalISel] Legalize <2 x s16> and <4 x s8> for G_BUILD_VECTOR.

Allen edited the summary of this revision.

aemerson added inline comments. Jul 21 2023, 1:11 PM

llvm/lib/Target/AArch64/GISel/AArch64LegalizerInfo.cpp
723–729	Now that we're going to do this multiple times, I think it's worth factoring out the logic to make it easier to re-use. The underlying logic is, I believe: "vectors must be at least 64 bits wide", right?

I think we could make this easier by adding a new action/predicate in LegalizerInfo.h, so that we could do something like:

    .promoteVectorEltsToVectorMinSize(0, 64)

I think there are other places in this file that could also use this new action to simplify the code.

P.S. please attach more context to your diffs (-U9999 works).
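
The "vectors must be at least 64 bits wide" rule can be sketched outside LLVM as plain arithmetic. The following standalone C++ model is only an illustration: `FixedVec` and `widenEltsToMinVectorSize` are hypothetical names, not LLVM APIs, and the model covers fixed-length vectors only.

```cpp
#include <cassert>
#include <optional>

// Hypothetical model of a fixed-length vector type: element count and
// element width in bits.
struct FixedVec {
  unsigned NumElts;
  unsigned EltBits;
  unsigned sizeInBits() const { return NumElts * EltBits; }
};

// If the vector is narrower than MinVectorBits, return a type with the same
// number of elements whose elements are MinVectorBits / NumElts bits wide.
// Return std::nullopt when the vector is already wide enough.
std::optional<FixedVec> widenEltsToMinVectorSize(FixedVec Ty,
                                                 unsigned MinVectorBits) {
  assert(Ty.NumElts > 0 && "vector must have at least one element");
  if (Ty.sizeInBits() >= MinVectorBits)
    return std::nullopt;
  return FixedVec{Ty.NumElts, MinVectorBits / Ty.NumElts};
}
```

With a 64-bit minimum, <2 x s8> and <2 x s16> both become <2 x s32>, <4 x s8> becomes <4 x s16>, and <8 x s8> is left unchanged because it is already 64 bits wide.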

address comment, add new function promoteVectorEltsToVectorMinSize

Allen marked an inline comment as done. Jul 22 2023, 5:01 AM

Allen added inline comments.

llvm/lib/Target/AArch64/GISel/AArch64LegalizerInfo.cpp
723–729	Thanks for your detailed suggestion; applied.

tschuett added inline comments. Jul 22 2023, 11:03 AM

llvm/include/llvm/CodeGen/GlobalISel/LegalizerInfo.h
944	Nice! s/Ty/VectorSize/ .

LGTM with a few nits. Thanks for working on this!

llvm/include/llvm/CodeGen/GlobalISel/LegalizerInfo.h
945	Sorry, the name I suggested didn't fit with the rest of the naming scheme. I think `widenVectorEltsToVectorMinSize` is better.
953	`LLT::isScalable()`

This revision is now accepted and ready to land. Jul 22 2023, 10:23 PM

address comments

Allen marked 3 inline comments as done. Jul 23 2023, 7:55 PM

Allen added inline comments.

llvm/include/llvm/CodeGen/GlobalISel/LegalizerInfo.h
944	Done, thanks
945	Done, thanks

This revision was landed with ongoing or failed builds. Jul 23 2023, 8:28 PM

Closed by commit rG0aaeb885326a: [AArch64][GlobalISel] Legalize <2 x s8> and <4 x s8> for G_BUILD_VECTOR (authored by Allen).

This revision was automatically updated to reflect the committed changes.

Allen marked 2 inline comments as done.

Allen added a commit: rG0aaeb885326a: [AArch64][GlobalISel] Legalize <2 x s8> and <4 x s8> for G_BUILD_VECTOR.

Harbormaster completed remote builds in B247545: Diff 543349. Jul 23 2023, 9:26 PM

GitHub <noreply@github.com> mentioned this in rGeaf23b2480a1: [GIsel][AArch64] Legalize <2 x i16> for G_INSERT_VECTOR_ELT (#65830). Sep 12 2023, 6:15 AM

Diff 543354

llvm/include/llvm/CodeGen/GlobalISel/LegalizerInfo.h

(935 lines hidden) LegalizeRuleSet &minScalarOrEltIf(LegalityPredicate Predicate,
     using namespace LegalityPredicates;
     using namespace LegalizeMutations;
     return actionIf(LegalizeAction::WidenScalar,
                     all(Predicate, scalarOrEltNarrowerThan(
                                        TypeIdx, Ty.getScalarSizeInBits())),
                     changeElementTo(typeIdx(TypeIdx), Ty));
   }
 
+  /// Ensure the vector size is at least as wide as VectorSize by promoting the
+  /// element.
+  LegalizeRuleSet &widenVectorEltsToVectorMinSize(unsigned TypeIdx,
+                                                  unsigned VectorSize) {
+    using namespace LegalityPredicates;
+    using namespace LegalizeMutations;
+    return actionIf(
+        LegalizeAction::WidenScalar,
+        [=](const LegalityQuery &Query) {
+          const LLT VecTy = Query.Types[TypeIdx];
+          return VecTy.isVector() && !VecTy.isScalable() &&
+                 VecTy.getSizeInBits() < VectorSize;
+        },
+        [=](const LegalityQuery &Query) {
+          const LLT VecTy = Query.Types[TypeIdx];
+          unsigned NumElts = VecTy.getNumElements();
+          unsigned MinSize = VectorSize / NumElts;
+          LLT NewTy = LLT::fixed_vector(NumElts, LLT::scalar(MinSize));
+          return std::make_pair(TypeIdx, NewTy);
+        });
+  }
+
   /// Ensure the scalar is at least as wide as Ty.
   LegalizeRuleSet &minScalar(unsigned TypeIdx, const LLT Ty) {
     using namespace LegalityPredicates;
     using namespace LegalizeMutations;
     return actionIf(LegalizeAction::WidenScalar,
                     scalarNarrowerThan(TypeIdx, Ty.getSizeInBits()),
                     changeTo(typeIdx(TypeIdx), Ty));
   }
(354 lines hidden)

llvm/lib/Target/AArch64/GISel/AArch64LegalizerInfo.cpp

(44 lines hidden) AArch64LegalizerInfo::AArch64LegalizerInfo(const AArch64Subtarget &ST)
   const LLT s8 = LLT::scalar(8);
   const LLT s16 = LLT::scalar(16);
   const LLT s32 = LLT::scalar(32);
   const LLT s64 = LLT::scalar(64);
   const LLT s128 = LLT::scalar(128);
   const LLT v16s8 = LLT::fixed_vector(16, 8);
   const LLT v8s8 = LLT::fixed_vector(8, 8);
   const LLT v4s8 = LLT::fixed_vector(4, 8);
+  const LLT v2s8 = LLT::fixed_vector(2, 8);
   const LLT v8s16 = LLT::fixed_vector(8, 16);
   const LLT v4s16 = LLT::fixed_vector(4, 16);
   const LLT v2s16 = LLT::fixed_vector(2, 16);
   const LLT v2s32 = LLT::fixed_vector(2, 32);
   const LLT v4s32 = LLT::fixed_vector(4, 32);
   const LLT v2s64 = LLT::fixed_vector(2, 64);
   const LLT v2p0 = LLT::fixed_vector(2, p0);
 
(653 lines hidden) getActionDefinitionsBuilder(G_BUILD_VECTOR)
                  {v4s16, s16},
                  {v8s16, s16},
                  {v2s32, s32},
                  {v4s32, s32},
                  {v2p0, p0},
                  {v2s64, s64}})
       .clampNumElements(0, v4s32, v4s32)
       .clampNumElements(0, v2s64, v2s64)
       .minScalarOrElt(0, s8)
-      .minScalarOrEltIf(
-          [=](const LegalityQuery &Query) { return Query.Types[0] == v2s16; },
-          0, s32)
+      .widenVectorEltsToVectorMinSize(0, 64)
       .minScalarSameAs(1, 0);
 
   getActionDefinitionsBuilder(G_BUILD_VECTOR_TRUNC).lower();
 
   getActionDefinitionsBuilder(G_CTLZ)
       .legalForCartesianProduct(
           {s32, s64, v8s8, v16s8, v4s16, v8s16, v2s32, v4s32})
       .scalarize(1)
       .widenScalarToNextPow2(1, /*Min=*/32)
       .clampScalar(1, s32, s64)
       .scalarSameSizeAs(0, 1);
   getActionDefinitionsBuilder(G_CTLZ_ZERO_UNDEF).lower();
 
(953 lines hidden)

llvm/test/CodeGen/AArch64/GlobalISel/legalize-build-vector.mir

(127 lines hidden) bb.0:
     ; CHECK-NEXT: RET_ReallyLR
     %0:_(s16) = COPY $h0
     %1:_(s16) = COPY $h1
     %2:_(<2 x s16>) = G_BUILD_VECTOR %0(s16), %1(s16)
     %ext:_(<2 x s32>) = G_ANYEXT %2(<2 x s16>)
     $d0 = COPY %ext(<2 x s32>)
     RET_ReallyLR
 ...
 
+---
+name: widen_v2s8
+body: |
+  bb.0:
+    ; CHECK-LABEL: name: widen_v2s8
+    ; CHECK: [[DEF:%[0-9]+]]:_(s32) = G_IMPLICIT_DEF
+    ; CHECK-NEXT: [[COPY:%[0-9]+]]:_(s32) = COPY [[DEF]](s32)
+    ; CHECK-NEXT: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s32>) = G_BUILD_VECTOR [[COPY]](s32), [[DEF]](s32)
+    ; CHECK-NEXT: $d0 = COPY [[BUILD_VECTOR]](<2 x s32>)
+    ; CHECK-NEXT: RET_ReallyLR
+    %0:_(s8) = G_IMPLICIT_DEF
+    %1:_(s8) = G_IMPLICIT_DEF
+    %2:_(<2 x s8>) = G_BUILD_VECTOR %0(s8), %1(s8)
+    %ext:_(<2 x s32>) = G_ANYEXT %2(<2 x s8>)
+    $d0 = COPY %ext(<2 x s32>)
+    RET_ReallyLR
+...
+
+---
+name: widen_v4s8
+body: |
+  bb.0:
+    ; CHECK-LABEL: name: widen_v4s8
+    ; CHECK: [[DEF:%[0-9]+]]:_(s16) = G_IMPLICIT_DEF
+    ; CHECK-NEXT: [[COPY:%[0-9]+]]:_(s16) = COPY [[DEF]](s16)
+    ; CHECK-NEXT: [[COPY1:%[0-9]+]]:_(s16) = COPY [[DEF]](s16)
+    ; CHECK-NEXT: [[COPY2:%[0-9]+]]:_(s16) = COPY [[DEF]](s16)
+    ; CHECK-NEXT: [[BUILD_VECTOR:%[0-9]+]]:_(<4 x s16>) = G_BUILD_VECTOR [[COPY]](s16), [[COPY1]](s16), [[COPY2]](s16), [[DEF]](s16)
+    ; CHECK-NEXT: $d0 = COPY [[BUILD_VECTOR]](<4 x s16>)
+    ; CHECK-NEXT: RET_ReallyLR
+    %0:_(s8) = G_IMPLICIT_DEF
+    %1:_(s8) = G_IMPLICIT_DEF
+    %2:_(s8) = G_IMPLICIT_DEF
+    %3:_(s8) = G_IMPLICIT_DEF
+    %4:_(<4 x s8>) = G_BUILD_VECTOR %0(s8), %1(s8), %2(s8), %3(s8)
+    %ext:_(<4 x s16>) = G_ANYEXT %4(<4 x s8>)
+    $d0 = COPY %ext(<4 x s16>)
+    RET_ReallyLR
+...

llvm/test/CodeGen/AArch64/GlobalISel/legalize-itofp.mir

(268 lines hidden)
 ---
 name: test_uitofp_v2s64_v2i1
 body: |
   bb.0:
     liveins: $q0
     ; CHECK-LABEL: name: test_uitofp_v2s64_v2i1
     ; CHECK: liveins: $q0
     ; CHECK-NEXT: {{ $}}
-    ; CHECK-NEXT: [[DEF:%[0-9]+]]:_(s8) = G_IMPLICIT_DEF
-    ; CHECK-NEXT: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s8>) = G_BUILD_VECTOR [[DEF]](s8), [[DEF]](s8)
+    ; CHECK-NEXT: [[DEF:%[0-9]+]]:_(s32) = G_IMPLICIT_DEF
+    ; CHECK-NEXT: [[COPY:%[0-9]+]]:_(s32) = COPY [[DEF]](s32)
+    ; CHECK-NEXT: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s32>) = G_BUILD_VECTOR [[COPY]](s32), [[DEF]](s32)
     ; CHECK-NEXT: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 1
     ; CHECK-NEXT: [[BUILD_VECTOR1:%[0-9]+]]:_(<2 x s64>) = G_BUILD_VECTOR [[C]](s64), [[C]](s64)
-    ; CHECK-NEXT: [[ANYEXT:%[0-9]+]]:_(<2 x s64>) = G_ANYEXT [[BUILD_VECTOR]](<2 x s8>)
+    ; CHECK-NEXT: [[ANYEXT:%[0-9]+]]:_(<2 x s64>) = G_ANYEXT [[BUILD_VECTOR]](<2 x s32>)
     ; CHECK-NEXT: [[AND:%[0-9]+]]:_(<2 x s64>) = G_AND [[ANYEXT]], [[BUILD_VECTOR1]]
     ; CHECK-NEXT: [[UITOFP:%[0-9]+]]:_(<2 x s64>) = G_UITOFP [[AND]](<2 x s64>)
     ; CHECK-NEXT: $q0 = COPY [[UITOFP]](<2 x s64>)
     %0:_(<2 x s1>) = G_IMPLICIT_DEF
     %1:_(<2 x s64>) = G_UITOFP %0
     $q0 = COPY %1
 ...
 
 ---
 name: test_sitofp_v2s64_v2i1
 body: |
   bb.0:
     liveins: $q0
     ; CHECK-LABEL: name: test_sitofp_v2s64_v2i1
     ; CHECK: liveins: $q0
     ; CHECK-NEXT: {{ $}}
-    ; CHECK-NEXT: [[DEF:%[0-9]+]]:_(s8) = G_IMPLICIT_DEF
-    ; CHECK-NEXT: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s8>) = G_BUILD_VECTOR [[DEF]](s8), [[DEF]](s8)
-    ; CHECK-NEXT: [[ANYEXT:%[0-9]+]]:_(<2 x s64>) = G_ANYEXT [[BUILD_VECTOR]](<2 x s8>)
+    ; CHECK-NEXT: [[DEF:%[0-9]+]]:_(s32) = G_IMPLICIT_DEF
+    ; CHECK-NEXT: [[COPY:%[0-9]+]]:_(s32) = COPY [[DEF]](s32)
+    ; CHECK-NEXT: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s32>) = G_BUILD_VECTOR [[COPY]](s32), [[DEF]](s32)
+    ; CHECK-NEXT: [[ANYEXT:%[0-9]+]]:_(<2 x s64>) = G_ANYEXT [[BUILD_VECTOR]](<2 x s32>)
     ; CHECK-NEXT: [[SEXT_INREG:%[0-9]+]]:_(<2 x s64>) = G_SEXT_INREG [[ANYEXT]], 1
     ; CHECK-NEXT: [[SITOFP:%[0-9]+]]:_(<2 x s64>) = G_SITOFP [[SEXT_INREG]](<2 x s64>)
     ; CHECK-NEXT: $q0 = COPY [[SITOFP]](<2 x s64>)
     %0:_(<2 x s1>) = G_IMPLICIT_DEF
     %1:_(<2 x s64>) = G_SITOFP %0
     $q0 = COPY %1
 ...
 
(36 lines hidden)

This is an archive of the discontinued LLVM Phabricator instance.

[AArch64][GlobalISel] Legalize <2 x s8> and <4 x s8> for G_BUILD_VECTOR
Closed, Public