This is an archive of the discontinued LLVM Phabricator instance.

Differential D80437

[CodeGen] Let computeKnownBits do something sensible for scalable vectors
ClosedPublic

Authored by david-arm on May 22 2020, 6:14 AM.

Details

Reviewers

sdesmalen
rengolin
ctetreau
efriedma

Commits

rGbd97342a0c2c: [CodeGen] Let computeKnownBits do something sensible for scalable vectors

Summary

Until we have a real need for computing known bits for scalable
vectors I have simply changed the code to bail out for now and
pretend we know nothing. I've also fixed up some simple callers of
computeKnownBits too.
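The bail-out described in the summary can be illustrated with a small standalone model. This is only a sketch under stated assumptions: `ToyKnownBits`, `ToyVT`, `ToyConstantVec` and `toyComputeKnownBits` are hypothetical stand-ins invented here (limited to element widths of at most 64 bits), not the LLVM classes or the code in this patch.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for llvm::KnownBits, limited to widths <= 64.
struct ToyKnownBits {
  unsigned BitWidth;
  std::uint64_t Zero; // bits known to be 0
  std::uint64_t One;  // bits known to be 1
  explicit ToyKnownBits(unsigned W) : BitWidth(W), Zero(0), One(0) {}
  bool isUnknown() const { return Zero == 0 && One == 0; }
};

// Hypothetical value type: a fixed vector has exactly MinNumElts lanes,
// a scalable one has MinNumElts * vscale lanes, with vscale unknown at
// compile time.
struct ToyVT {
  unsigned ScalarBits;
  unsigned MinNumElts;
  bool Scalable;
};

// Toy node: a vector whose lanes all hold the same constant.
struct ToyConstantVec {
  ToyVT VT;
  std::uint64_t LaneValue;
};

ToyKnownBits toyComputeKnownBits(const ToyConstantVec &Op) {
  ToyKnownBits Known(Op.VT.ScalarBits); // Don't know anything.
  if (Op.VT.Scalable)
    return Known; // Bail out: pretend we know nothing.
  const std::uint64_t Mask =
      Op.VT.ScalarBits >= 64 ? ~0ull : ((1ull << Op.VT.ScalarBits) - 1);
  Known.One = Op.LaneValue & Mask;
  Known.Zero = ~Op.LaneValue & Mask;
  return Known;
}

// Usage: the same all-zero constant is fully known as a fixed vector and
// completely unknown as a scalable one.
void toyBailOutDemo() {
  ToyConstantVec Fixed{ToyVT{16, 2, false}, 0};
  ToyConstantVec Scalable{ToyVT{16, 2, true}, 0};
  assert(toyComputeKnownBits(Fixed).Zero == 0xFFFFu);
  assert(toyComputeKnownBits(Scalable).isUnknown());
}
```

Returning an empty result is conservative: a caller can only act on bits that are reported as known, so reporting none disables folds rather than enabling wrong ones.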

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

david-arm created this revision. May 22 2020, 6:14 AM

Herald added a reviewer: rengolin. May 22 2020, 6:14 AM

Herald added a project: Restricted Project.

Herald added subscribers: llvm-commits, hiraditya, kristof.beyls.

david-arm retitled this revision from "[CodeGen] Add support for extracting elements of scalable vectors" to "[CodeGen] Let computeKnownBits do something sensible for scalable vectors". May 22 2020, 6:15 AM

david-arm edited the summary of this revision.

Harbormaster completed remote builds in B57642: Diff 265726. May 22 2020, 7:29 AM

david-arm added reviewers: ctetreau, efriedma. May 25 2020, 11:44 PM

I think we should be able to make computeKnownBits work to some extent, but I guess it's not high priority at the moment.

llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp
2278	If we're going to bail out anyway, can we bail out explicitly, instead of waiting for computeKnownBits to bail out?

david-arm marked an inline comment as done. Jun 1 2020, 5:57 AM

david-arm added inline comments.

llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp
2278	I thought about that, but then I was also worried about duplicating the behaviour of the called routines in the callers. If the behaviour of MaskedValueIsZero or computeKnownBits changes, i.e. how we return values and so on, then this becomes out of sync. If you don't think that's a problem I can bail out early as you suggested?

Hi @efriedma, I don't suppose you've had any further thoughts about my reply to your comment? I was specifically worried that also bailing out early for all functions that call computeKnownBits means we're reliant on the behaviour of computeKnownBits not changing, i.e. in terms of its returned values. If you don't think this is a concern, then I'm happy to do that.

It's not clear how we actually want to represent DemandedElts for scalable vectors. I think we'd go through all the callers before changing anything in that respect, so explicitly bailing out seems better than intentionally synthesizing something which might be wrong anyway.

For MaskedValueIsZero specifically, maybe it makes sense to change the implementation to return Mask.isSubsetOf(computeKnownBits(V, Depth).Zero) instead of trying to synthesize DemandedElts.
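The subset test suggested above can be sketched with plain 64-bit masks. The names `toyIsSubsetOf` and `toyMaskedValueIsZero` are hypothetical helpers written for this illustration; they only mimic the documented meaning of `APInt::isSubsetOf` (every bit set in the left operand is also set in the right operand) and are not LLVM code.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical 64-bit analogue of APInt::isSubsetOf: true when every bit
// set in Mask is also set in KnownZero.
constexpr bool toyIsSubsetOf(std::uint64_t Mask, std::uint64_t KnownZero) {
  return (Mask & ~KnownZero) == 0;
}

// 'V & Mask' is known to be zero exactly when all masked bits of V are
// among the bits known to be zero.
constexpr bool toyMaskedValueIsZero(std::uint64_t KnownZero,
                                    std::uint64_t Mask) {
  return toyIsSubsetOf(Mask, KnownZero);
}

// Usage: with nothing known (as for a scalable vector after the bail-out),
// only the empty mask passes the test.
void toyMaskedValueIsZeroDemo() {
  assert(toyMaskedValueIsZero(0xFF00, 0x0F00));  // mask inside known zeros
  assert(!toyMaskedValueIsZero(0xFF00, 0x0F0F)); // low bits are not known
  assert(!toyMaskedValueIsZero(0, 0x1));         // nothing known
  assert(toyMaskedValueIsZero(0, 0));            // empty mask is trivial
}
```

Written this way the caller never builds a DemandedElts mask itself, so whatever the known-bits query decides for scalable vectors is inherited automatically.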

Hi @efriedma, thanks for the reply! Your comment about changing MaskedValueIsZero makes a lot of sense and improves code reuse. Hopefully the patch looks better now!

The code looks right.

For both this and the SimplifyDemandedBits/VectorElts patches, I'm not sure what we want to do about testing. Currently, I assume some optimizations "work", and this breaks them. And probably we do some incorrect optimizations. It would be nice if we could have at least a little test coverage showing these changes have some effect.

Hi @efriedma, I tried to find some pre-existing folds that we were doing before my patch, but couldn't find anything. The problem is that fundamentally many of these functions need to understand SPLAT_VECTORs in order to return something known. I tried adding SPLAT_VECTOR support to isConstOrConstSplat, since that allowed us to call MaskedValueIsZero a bit more, but we then end up crashing in SimplifyDemandedBits. I do understand your concern, though, so I have added a unit test that confirms we pretend we know nothing. This patch kills off the last remaining warnings in quite a few tests, so I have added checks for no warnings in those files.
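As an aside on why splats matter here: known bits for a vector are the bits shared by every element, and a splat repeats one scalar in every lane, so the shared bits do not depend on how many lanes there are. The sketch below models that observation only. `ToyKnown`, `toyKnownOfConstant` and `toyKnownShared` are hypothetical names, and this is not something the patch implements (the patch bails out instead).

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical known-bits pair for element widths <= 64.
struct ToyKnown {
  std::uint64_t Zero; // bits known to be 0
  std::uint64_t One;  // bits known to be 1
};

// All bits of a constant scalar are known.
ToyKnown toyKnownOfConstant(std::uint64_t Value, unsigned BitWidth) {
  const std::uint64_t Mask =
      BitWidth >= 64 ? ~0ull : ((1ull << BitWidth) - 1);
  return ToyKnown{~Value & Mask, Value & Mask};
}

// Bits shared by every lane: keep only the facts that hold in all lanes.
// Precondition: Lanes is not empty.
ToyKnown toyKnownShared(const std::vector<ToyKnown> &Lanes) {
  assert(!Lanes.empty());
  ToyKnown Shared = Lanes.front();
  for (const ToyKnown &Lane : Lanes) {
    Shared.Zero &= Lane.Zero;
    Shared.One &= Lane.One;
  }
  return Shared;
}

// Usage: a splat of 0x0F gives the same answer for any lane count, which
// stands in for any run-time value of vscale.
void toySplatDemo() {
  const ToyKnown Scalar = toyKnownOfConstant(0x0F, 8);
  for (unsigned VScale = 1; VScale <= 4; ++VScale) {
    const std::vector<ToyKnown> Lanes(4 * VScale, Scalar);
    const ToyKnown Shared = toyKnownShared(Lanes);
    assert(Shared.Zero == Scalar.Zero && Shared.One == Scalar.One);
  }
}
```

By contrast, lanes that differ lose information: sharing the facts of 0x0F and 0x0E leaves bit 0 unknown.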

Herald added a subscriber: arphaman. Jun 10 2020, 4:44 AM

LGTM

This revision is now accepted and ready to land. Jun 10 2020, 12:00 PM

Closed by commit rGbd97342a0c2c: [CodeGen] Let computeKnownBits do something sensible for scalable vectors (authored by david-arm). Jun 11 2020, 12:37 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path    Size

llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp    19 lines
llvm/test/CodeGen/AArch64/sve-intrinsics-gather-prefetches-scalar-base-vector-indexes.ll    5 lines
llvm/test/CodeGen/AArch64/sve-intrinsics-ld1ro.ll    5 lines
llvm/test/CodeGen/AArch64/sve-intrinsics-scatter-stores-32bit-scaled-offsets.ll    5 lines
llvm/test/CodeGen/AArch64/sve-intrinsics-scatter-stores-32bit-unscaled-offsets.ll    5 lines
llvm/test/CodeGen/AArch64/sve-masked-ldst-nonext.ll    5 lines
llvm/test/CodeGen/AArch64/sve-masked-ldst-sext.ll    5 lines
llvm/test/CodeGen/AArch64/sve-masked-ldst-trunc.ll    5 lines
llvm/test/CodeGen/AArch64/sve-masked-ldst-zext.ll    5 lines
llvm/test/CodeGen/AArch64/sve-pred-contiguous-ldst-addressing-mode-reg-imm.ll    5 lines
llvm/test/CodeGen/AArch64/sve-pred-contiguous-ldst-addressing-mode-reg-reg.ll    5 lines
llvm/test/CodeGen/AArch64/sve-setcc.ll    5 lines
llvm/unittests/CodeGen/AArch64SelectionDAGTest.cpp    19 lines

Diff 270047

llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp

@@ bool SelectionDAG::SignBitIsZero(SDValue Op, unsigned Depth) const {
   return MaskedValueIsZero(Op, APInt::getSignMask(BitWidth), Depth);
 }

 /// MaskedValueIsZero - Return true if 'V & Mask' is known to be zero. We use
 /// this predicate to simplify operations downstream. Mask is known to be zero
 /// for bits that V cannot have.
 bool SelectionDAG::MaskedValueIsZero(SDValue V, const APInt &Mask,
                                      unsigned Depth) const {
-  EVT VT = V.getValueType();
-  APInt DemandedElts = VT.isVector()
-                           ? APInt::getAllOnesValue(VT.getVectorNumElements())
-                           : APInt(1, 1);
-  return MaskedValueIsZero(V, Mask, DemandedElts, Depth);
+  return Mask.isSubsetOf(computeKnownBits(V, Depth).Zero);
 }

 /// MaskedValueIsZero - Return true if 'V & Mask' is known to be zero in
 /// DemandedElts. We use this predicate to simplify operations downstream.
 /// Mask is known to be zero for bits that V cannot have.
 bool SelectionDAG::MaskedValueIsZero(SDValue V, const APInt &Mask,
                                      const APInt &DemandedElts,
                                      unsigned Depth) const {

@@ const APInt *SelectionDAG::getValidMaximumShiftAmountConstant(
   return MaxShAmt;
 }

 /// Determine which bits of Op are known to be either zero or one and return
 /// them in Known. For vectors, the known bits are those that are shared by
 /// every vector element.
 KnownBits SelectionDAG::computeKnownBits(SDValue Op, unsigned Depth) const {
   EVT VT = Op.getValueType();
+
+  // TOOD: Until we have a plan for how to represent demanded elements for
+  // scalable vectors, we can just bail out for now.
+  if (Op.getValueType().isScalableVector()) {
+    unsigned BitWidth = Op.getScalarValueSizeInBits();
+    return KnownBits(BitWidth);
+  }
+
   APInt DemandedElts = VT.isVector()
                            ? APInt::getAllOnesValue(VT.getVectorNumElements())
                            : APInt(1, 1);
   return computeKnownBits(Op, DemandedElts, Depth);
 }

 /// Determine which bits of Op are known to be either zero or one and return
 /// them in Known. The DemandedElts argument allows us to only collect the known
 /// bits that are shared by the requested vector elements.
 KnownBits SelectionDAG::computeKnownBits(SDValue Op, const APInt &DemandedElts,
                                          unsigned Depth) const {
   unsigned BitWidth = Op.getScalarValueSizeInBits();

   KnownBits Known(BitWidth); // Don't know anything.

+  // TOOD: Until we have a plan for how to represent demanded elements for
+  // scalable vectors, we can just bail out for now.
+  if (Op.getValueType().isScalableVector())
+    return Known;
+
   if (auto *C = dyn_cast<ConstantSDNode>(Op)) {
     // We know all of the bits for a constant!
     Known.One = C->getAPIntValue();
     Known.Zero = ~Known.One;
     return Known;
   }
   if (auto *C = dyn_cast<ConstantFPSDNode>(Op)) {
     // We know all of the bits for a constant fp!

llvm/test/CodeGen/AArch64/sve-intrinsics-gather-prefetches-scalar-base-vector-indexes.ll

-; RUN: llc -mtriple=aarch64--linux-gnu -mattr=+sve --asm-verbose=false < %s | FileCheck %s
+; RUN: llc -mtriple=aarch64--linux-gnu -mattr=+sve --asm-verbose=false < %s 2>%t | FileCheck %s
+; RUN: FileCheck --check-prefix=WARN --allow-empty %s <%t
+
+; WARN-NOT: warning

 ; PRFB <prfop>, <Pg>, [<Xn|SP>, <Zm>.S, <mod>] -> 32-bit indexes
 define void @llvm_aarch64_sve_prfb_gather_uxtw_index_nx4vi32(<vscale x 4 x i1> %Pg, i8* %base, <vscale x 4 x i32> %indexes) nounwind {
 ; CHECK-LABEL: llvm_aarch64_sve_prfb_gather_uxtw_index_nx4vi32:
 ; CHECK-NEXT: prfb pldl1strm, p0, [x0, z0.s, uxtw]
 ; CHECK-NEXT: ret
   call void @llvm.aarch64.sve.prfb.gather.uxtw.index.nx4vi32(<vscale x 4 x i1> %Pg, i8* %base, <vscale x 4 x i32> %indexes, i32 1)
   ret void

llvm/test/CodeGen/AArch64/sve-intrinsics-ld1ro.ll

-; RUN: llc -mtriple=aarch64-linux-gnu -mattr=+sve,+f64mm -asm-verbose=0 < %s | FileCheck %s
+; RUN: llc -mtriple=aarch64-linux-gnu -mattr=+sve,+f64mm -asm-verbose=0 < %s 2>%t | FileCheck %s
+; RUN: FileCheck --check-prefix=WARN --allow-empty %s <%t
+
+; WARN-NOT: warning

 ;
 ; LD1ROB
 ;

 define <vscale x 16 x i8> @ld1rob_i8(<vscale x 16 x i1> %pred, i8* %addr) nounwind {
 ; CHECK-LABEL: ld1rob_i8:
 ; CHECK-NEXT: ld1rob { z0.b }, p0/z, [x0]

llvm/test/CodeGen/AArch64/sve-intrinsics-scatter-stores-32bit-scaled-offsets.ll

-; RUN: llc -mtriple=aarch64-linux-gnu -mattr=+sve < %s | FileCheck %s
+; RUN: llc -mtriple=aarch64-linux-gnu -mattr=+sve < %s 2>%t | FileCheck %s
+; RUN: FileCheck --check-prefix=WARN --allow-empty %s <%t
+
+; WARN-NOT: warning

 ;
 ; ST1H, ST1W, ST1D: base + 32-bit scaled offset, sign (sxtw) or zero
 ; (uxtw) extended to 64 bits.
 ; e.g. st1h { z0.d }, p0, [x0, z1.d, uxtw #1]
 ;

 ; ST1H

llvm/test/CodeGen/AArch64/sve-intrinsics-scatter-stores-32bit-unscaled-offsets.ll

-; RUN: llc -mtriple=aarch64-linux-gnu -mattr=+sve < %s | FileCheck %s
+; RUN: llc -mtriple=aarch64-linux-gnu -mattr=+sve < %s 2>%t | FileCheck %s
+; RUN: FileCheck --check-prefix=WARN --allow-empty %s <%t
+
+; WARN-NOT: warning

 ;
 ; ST1B, ST1W, ST1H, ST1D: base + 32-bit unscaled offset, sign (sxtw) or zero
 ; (uxtw) extended to 64 bits.
 ; e.g. st1h { z0.d }, p0, [x0, z1.d, uxtw]
 ;

 ; ST1B

llvm/test/CodeGen/AArch64/sve-masked-ldst-nonext.ll

-; RUN: llc -mtriple=aarch64--linux-gnu -mattr=+sve -asm-verbose=0 < %s | FileCheck %s
+; RUN: llc -mtriple=aarch64--linux-gnu -mattr=+sve -asm-verbose=0 < %s 2>%t | FileCheck %s
+; RUN: FileCheck --check-prefix=WARN --allow-empty %s <%t
+
+; WARN-NOT: warning

 ;
 ; Masked Loads
 ;

 define <vscale x 2 x i64> @masked_load_nxv2i64(<vscale x 2 x i64> *%a, <vscale x 2 x i1> %mask) nounwind {
 ; CHECK-LABEL: masked_load_nxv2i64:
 ; CHECK-NEXT: ld1d { z0.d }, p0/z, [x0]

llvm/test/CodeGen/AArch64/sve-masked-ldst-sext.ll

-; RUN: llc -mtriple=aarch64--linux-gnu -mattr=+sve < %s | FileCheck %s
+; RUN: llc -mtriple=aarch64--linux-gnu -mattr=+sve < %s 2>%t | FileCheck %s
+; RUN: FileCheck --check-prefix=WARN --allow-empty %s <%t
+
+; WARN-NOT: warning

 ;
 ; Masked Loads
 ;

 define <vscale x 2 x i64> @masked_sload_nxv2i8(<vscale x 2 x i8> *%a, <vscale x 2 x i1> %mask) {
 ; CHECK-LABEL: masked_sload_nxv2i8:
 ; CHECK: ld1sb { [[IN:z[0-9]+]].d }, [[PG:p[0-9]+]]/z, [x0]

llvm/test/CodeGen/AArch64/sve-masked-ldst-trunc.ll

-; RUN: llc -mtriple=aarch64--linux-gnu -mattr=+sve -asm-verbose=0 < %s | FileCheck %s
+; RUN: llc -mtriple=aarch64--linux-gnu -mattr=+sve -asm-verbose=0 < %s 2>%t | FileCheck %s
+; RUN: FileCheck --check-prefix=WARN --allow-empty %s <%t
+
+; WARN-NOT: warning

 ;
 ; Masked Stores
 ;

 define void @masked_trunc_store_nxv2i8(<vscale x 2 x i64> %a, <vscale x 2 x i64> %val, <vscale x 2 x i8> %b, <vscale x 2 x i1> %mask) nounwind {
 ; CHECK-LABEL: masked_trunc_store_nxv2i8:
 ; CHECK-NEXT: st1b { z0.d }, p0, [x1]

llvm/test/CodeGen/AArch64/sve-masked-ldst-zext.ll

-; RUN: llc -mtriple=aarch64--linux-gnu -mattr=+sve < %s | FileCheck %s
+; RUN: llc -mtriple=aarch64--linux-gnu -mattr=+sve < %s 2>%t | FileCheck %s
+; RUN: FileCheck --check-prefix=WARN --allow-empty %s <%t
+
+; WARN-NOT: warning

 ;
 ; Masked Loads
 ;

 define <vscale x 2 x i64> @masked_zload_nxv2i8(<vscale x 2 x i8>* %src, <vscale x 2 x i1> %mask) {
 ; CHECK-LABEL: masked_zload_nxv2i8:
 ; CHECK-NOT: ld1sb

llvm/test/CodeGen/AArch64/sve-pred-contiguous-ldst-addressing-mode-reg-imm.ll

-; RUN: llc -mtriple=aarch64--linux-gnu -mattr=+sve --asm-verbose=false < %s | FileCheck %s
+; RUN: llc -mtriple=aarch64--linux-gnu -mattr=+sve --asm-verbose=false < %s 2>%t | FileCheck %s
+; RUN: FileCheck --check-prefix=WARN --allow-empty %s <%t
+
+; WARN-NOT: warning

 ; Range checks: for all the instruction tested in this file, the
 ; immediate must be within the range [-8, 7] (4-bit immediate). Out of
 ; range values are tested only in one case (following). Valid values
 ; are tested all through the rest of the file.

 define void @imm_out_of_range(<vscale x 2 x i64> * %base, <vscale x 2 x i1> %mask) nounwind {
 ; CHECK-LABEL: imm_out_of_range:

llvm/test/CodeGen/AArch64/sve-pred-contiguous-ldst-addressing-mode-reg-reg.ll

-; RUN: llc -mtriple=aarch64--linux-gnu -mattr=+sve --asm-verbose=false < %s | FileCheck %s
+; RUN: llc -mtriple=aarch64--linux-gnu -mattr=+sve --asm-verbose=false < %s 2>%t | FileCheck %s
+; RUN: FileCheck --check-prefix=WARN --allow-empty %s <%t
+
+; WARN-NOT: warning

 ; 2-lane contiguous load/stores

 define void @test_masked_ldst_sv2i8(i8 * %base, <vscale x 2 x i1> %mask, i64 %offset) nounwind {
 ; CHECK-LABEL: test_masked_ldst_sv2i8:
 ; CHECK-NEXT: ld1sb { z[[DATA:[0-9]+]].d }, p0/z, [x0, x1]
 ; CHECK-NEXT: st1b { z[[DATA]].d }, p0, [x0, x1]
 ; CHECK-NEXT: ret

llvm/test/CodeGen/AArch64/sve-setcc.ll

-; RUN: llc -mtriple=aarch64--linux-gnu -mattr=+sve < %s | FileCheck %s
+; RUN: llc -mtriple=aarch64--linux-gnu -mattr=+sve < %s 2>%t | FileCheck %s
+; RUN: FileCheck --check-prefix=WARN --allow-empty %s <%t
+
+; WARN-NOT: warning

 ; Ensure we use the inverted CC result of SVE compare instructions when branching.
 define void @sve_cmplt_setcc_inverted(<vscale x 8 x i16>* %out, <vscale x 8 x i16> %in, <vscale x 8 x i1> %pg) {
 ; CHECK-LABEL: @sve_cmplt_setcc_inverted
 ; CHECK: cmplt p1.h, p0/z, z0.h, #0
 ; CHECK-NEXT: ptest p0, p1.b
 ; CHECK-NEXT: b.ne
 entry:

llvm/unittests/CodeGen/AArch64SelectionDAGTest.cpp

@@ TEST_F(AArch64SelectionDAGTest, computeKnownBits_ZERO_EXTEND_VECTOR_INREG) {
   auto OutVecVT = EVT::getVectorVT(Context, Int16VT, 2);
   auto InVec = DAG->getConstant(0, Loc, InVecVT);
   auto Op = DAG->getNode(ISD::ZERO_EXTEND_VECTOR_INREG, Loc, OutVecVT, InVec);
   auto DemandedElts = APInt(2, 3);
   KnownBits Known = DAG->computeKnownBits(Op, DemandedElts);
   EXPECT_TRUE(Known.isZero());
 }

+TEST_F(AArch64SelectionDAGTest, computeKnownBitsSVE_ZERO_EXTEND_VECTOR_INREG) {
+  if (!TM)
+    return;
+  SDLoc Loc;
+  auto Int8VT = EVT::getIntegerVT(Context, 8);
+  auto Int16VT = EVT::getIntegerVT(Context, 16);
+  auto InVecVT = EVT::getVectorVT(Context, Int8VT, 4, true);
+  auto OutVecVT = EVT::getVectorVT(Context, Int16VT, 2, true);
+  auto InVec = DAG->getConstant(0, Loc, InVecVT);
+  auto Op = DAG->getNode(ISD::ZERO_EXTEND_VECTOR_INREG, Loc, OutVecVT, InVec);
+  auto DemandedElts = APInt(2, 3);
+  KnownBits Known = DAG->computeKnownBits(Op, DemandedElts);
+
+  // We don't know anything for SVE at the moment.
+  EXPECT_EQ(Known.Zero, APInt(16, 0u));
+  EXPECT_EQ(Known.One, APInt(16, 0u));
+  EXPECT_FALSE(Known.isZero());
+}
+
 TEST_F(AArch64SelectionDAGTest, computeKnownBits_EXTRACT_SUBVECTOR) {
   if (!TM)
     return;
   SDLoc Loc;
   auto IntVT = EVT::getIntegerVT(Context, 8);
   auto VecVT = EVT::getVectorVT(Context, IntVT, 3);
   auto IdxVT = EVT::getIntegerVT(Context, 64);
   auto Vec = DAG->getConstant(0, Loc, VecVT);
