The Mask (DemandedBits) passed to MaskedValueIsZero in MaskedElementsAreZero does not seem to make sense.
This patch is a first attempt to refine it.
Reviewers: RKSimon
Hello @RKSimon, I added you here because git blame shows that this code is your contribution.
I'd like to discuss the logic of MaskedElementsAreZero with you first, before I change it and add related tests.
We want to check that the masked elements of a vector are zero, so the Mask passed to MaskedValueIsZero (in your old code) should be a mask of the whole vector.
It is not clear to me why it is just APInt::getAllOnesValue(Op.getScalarValueSizeInBits()).
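For reference, a minimal sketch of the helper as I read it (the exact signature and surrounding code are paraphrased from memory and may differ from the tree):

```cpp
// Paraphrased sketch of the current helper, not the verbatim tree code.
// DemandedBits covers every bit of a *single* scalar element, while
// DemandedElts selects which vector elements MaskedValueIsZero tests.
bool MaskedElementsAreZero(SelectionDAG &DAG, SDValue Op,
                           const APInt &DemandedElts) {
  APInt DemandedBits =
      APInt::getAllOnesValue(Op.getScalarValueSizeInBits());
  return DAG.MaskedValueIsZero(Op, DemandedBits, DemandedElts);
}
```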
Do you have a test case that reproduces the issue you encountered?
MaskedElementsAreZero should only be called for vector types, although I failed to add an assert to ensure that.
The SelectionDAG value tracking code doesn't handle individual vector elements; it just analyses the common bits of all the vector elements specified in the DemandedElts mask. If you really need to determine the known bits of every vector element, you need to call computeKnownBits for every element, demanding just that element index.
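For example, something along these lines (an untested sketch inside SelectionDAG-based code; `Op` and `DAG` are assumed to be in scope):

```cpp
// Query known bits separately for each vector element by demanding
// exactly one element index per computeKnownBits call.
unsigned NumElts = Op.getValueType().getVectorNumElements();
SmallVector<KnownBits, 16> PerEltKnown;
for (unsigned Idx = 0; Idx != NumElts; ++Idx) {
  APInt DemandedElt = APInt::getOneBitSet(NumElts, Idx);
  PerEltKnown.push_back(DAG.computeKnownBits(Op, DemandedElt));
}
```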
Yes, I changed this because we hit a runtime failure in one of our projects. Let me commit a small reproducer a little later.
Inline comment on llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp, line 2450:

Thanks for your explanation. In fact, I read the computeKnownBits code yesterday and it puzzled me a lot. I thought the DemandedBits here should have the same bit width as Op (e.g. 128 for v16i8).

Let me take an example: if Op has type v16i8 and DemandedElts is 0x2222 (16 bits; demanded indices 1, 5, 9, 13), do you mean computeKnownBits will return the common zeros/ones of Op's elements with indices 1, 5, 9, 13 (i.e. zeros/ones = element1 & element5 & element9 & element13)?

Yes, the KnownBits result is the common bits in those demanded elements - if any bit is set differently or is unknown in any element, then the KnownBits result doesn't 'know' that bit. If you have a use case for a 'computeAllKnownBits' style wrapper that returns the known bits for each element (either as an array of KnownBits or a concatenated KnownBits struct), then it'd definitely be worth a patch along with suitable test coverage.
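To make the intersection semantics concrete, here is a small self-contained model in plain C++ (not LLVM's KnownBits class; all names and values are illustrative). With fully-known elements, a bit is commonly known zero (or one) only if it has that value in every demanded element:

```cpp
#include <cstdint>
#include <cstdio>

int main() {
  // A v16i8-like value; only indices 1, 5, 9, 13 will be demanded.
  uint8_t Elts[16] = {0xFF, 0x01, 0xFF, 0xFF, 0xFF, 0x03, 0xFF, 0xFF,
                      0xFF, 0x01, 0xFF, 0xFF, 0xFF, 0x81, 0xFF, 0xFF};
  uint16_t DemandedElts = 0x2222; // demand indices 1, 5, 9, 13

  uint8_t CommonOnes = 0xFF;  // bits known one in every demanded element
  uint8_t CommonZeros = 0xFF; // bits known zero in every demanded element
  for (unsigned Idx = 0; Idx != 16; ++Idx) {
    if (!(DemandedElts & (1u << Idx)))
      continue; // skip elements that are not demanded
    CommonOnes &= Elts[Idx];                         // elt1 & elt5 & elt9 & elt13
    CommonZeros &= static_cast<uint8_t>(~Elts[Idx]); // same intersection on zeros
  }
  // Prints ones = 0x01 and zeros = 0x7C: bit 1 is lost because element 5
  // sets it, and bit 7 is lost because element 13 sets it.
  printf("common ones = 0x%02X, common zeros = 0x%02X\n", CommonOnes,
         CommonZeros);
  return 0;
}
```

A per-element 'computeAllKnownBits' style wrapper would instead keep one zeros/ones pair per demanded index rather than intersecting them.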