This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
include/llvm/ADT/
-
llvm/
-
ADT/
-
APInt.h
-
lib/
-
Support/
4
APInt.cpp
-
Target/X86/
-
X86/
-
X86ISelLowering.cpp
-
X86ShuffleDecodeConstantPool.cpp
-
unittests/ADT/
-
ADT/
1
APIntTest.cpp

Differential D30336

[APInt] Add APInt::extractBits() method to extract APInt subrange
ClosedPublic

Authored by RKSimon on Feb 24 2017, 5:52 AM.

Download Raw Diff

Details

Reviewers

spatel
hans
filcab
majnemer
andreadb
craig.topper
davide

Commits

rG0f5fb5f54909: [APInt] Add APInt::extractBits() method to extract APInt subrange (reapplied)
rGcdf2bd656a68: Revert: r296141 [APInt] Add APInt::extractBits() method to extract APInt…
rGbd9fb2ae959d: [APInt] Add APInt::extractBits() method to extract APInt subrange
rL296272: [APInt] Add APInt::extractBits() method to extract APInt subrange (reapplied)
rL296147: Revert: r296141 [APInt] Add APInt::extractBits() method to extract APInt…
rL296141: [APInt] Add APInt::extractBits() method to extract APInt subrange

Summary

The current pattern for extract bits in range is typically:

Mask.lshr(BitOffset).trunc(SubSizeInBits);

Which can be particularly slow for large APInts (MaskSizeInBits > 64) as they require the allocation of memory for the temporary variable.

This is another of the compile time issues identified in PR32037 (see also D30265).

This patch adds the APInt::extractBits() helper method which avoids the temporary memory allocation.

Diff Detail

Repository: rL LLVM

Event Timeline

RKSimon created this revision.Feb 24 2017, 5:52 AM

LGTM

lib/Support/APInt.cpp

640

Looks deprecated

/// Equivalent to APInt(numBits, ArrayRef<uint64_t>(bigVal, numWords)), but
/// deprecated because this constructor is prone to ambiguity with the
/// APInt(unsigned, uint64_t, bool) constructor.
///
/// If this overload is ever deleted, care should be taken to prevent calls
/// from being incorrectly captured by the APInt(unsigned, uint64_t, bool)
/// constructor.
APInt(unsigned numBits, unsigned numWords, const uint64_t bigVal[]);

Maybe return APInt(numBits, makeArrayRef(pVal + loWord, 1 + hiWord - loWord));?

LGTM

This revision is now accepted and ready to land.Feb 24 2017, 8:38 AM

Closed by commit rL296141: [APInt] Add APInt::extractBits() method to extract APInt subrange (authored by RKSimon). · Explain WhyFeb 24 2017, 9:58 AM

This revision was automatically updated to reflect the committed changes.

hans added inline comments.Feb 24 2017, 10:25 AM

lib/Support/APInt.cpp
624	This is very nit-picky, but I find `numBits > 0` much more natural than `0 < numBits`. And I'd suggest splitting up the second assert into two, to make it clearer what went wrong if it fires.
628	For the single-word case we know bitPosition is in the first word, so there's no need for the remainder operation in `whichBit`. I'd suggest moving the if-statement before computing loBit, and just using bitPosition in the shift.
645	I'm not sure this loop is correct. It will never set `Result.pVal[hiWord]`? The only test case I see that hits this loop is `EXPECT_EQ(-8388481, i256.extractBits(128, 1).getSExtValue());` In that one, the result is computed from three words, but indeed fits into two words. But what if it didn't -- what about `i256.extractBits(129, 1)`?
unittests/ADT/APIntTest.cpp
1441	Would it be possible to use hex constants here and in the expectations somehow? Especially for the last one, it's hard to see what the expected result should be..

Thanks for catching that Hans - I've reverted the commit and will recommit with your recommendations later on.

Revision Contents

Path

Size

include/

llvm/

ADT/

	APInt.h
	APInt.h (revision 296120)

3 lines

lib/

Support/

	APInt.cpp
	APInt.cpp (revision 296120)

32 lines

Target/

X86/

	X86ISelLowering.cpp
	X86ISelLowering.cpp (revision 296120)

8 lines

	X86ShuffleDecodeConstantPool.cpp
	X86ShuffleDecodeConstantPool.cpp (revision 296120)

7 lines

unittests/

ADT/

	APIntTest.cpp
	APIntTest.cpp (revision 296120)

12 lines

Diff 89656

include/llvm/ADT/APInt.h

	Show First 20 Lines • Show All 1,266 Lines • ▼ Show 20 Lines
	}			}

	/// \brief Toggles a given bit to its opposite value.			/// \brief Toggles a given bit to its opposite value.
	///			///
	/// Toggle a given bit to its opposite value whose position is given			/// Toggle a given bit to its opposite value whose position is given
	/// as "bitPosition".			/// as "bitPosition".
	void flipBit(unsigned bitPosition);			void flipBit(unsigned bitPosition);

				/// Return an APInt with the extracted bits [bitPosition,bitPosition+numBits).
				APInt extractBits(unsigned numBits, unsigned bitPosition) const;

	/// @}			/// @}
	/// \name Value Characterization Functions			/// \name Value Characterization Functions
	/// @{			/// @{

	/// \brief Return the number of bits in the APInt.			/// \brief Return the number of bits in the APInt.
	unsigned getBitWidth() const { return BitWidth; }			unsigned getBitWidth() const { return BitWidth; }

	/// \brief Get the number of words.			/// \brief Get the number of words.
	▲ Show 20 Lines • Show All 739 Lines • Show Last 20 Lines

lib/Support/APInt.cpp

	Show First 20 Lines • Show All 612 Lines • ▼ Show 20 Lines
	/// as "bitPosition".			/// as "bitPosition".
	/// @brief Toggles a given bit to its opposite value.			/// @brief Toggles a given bit to its opposite value.
	void APInt::flipBit(unsigned bitPosition) {			void APInt::flipBit(unsigned bitPosition) {
	assert(bitPosition < BitWidth && "Out of the bit-width range!");			assert(bitPosition < BitWidth && "Out of the bit-width range!");
	if ((*this)[bitPosition]) clearBit(bitPosition);			if ((*this)[bitPosition]) clearBit(bitPosition);
	else setBit(bitPosition);			else setBit(bitPosition);
	}			}

				APInt APInt::extractBits(unsigned numBits, unsigned bitPosition) const {
				assert(0 < numBits && "Can't extract zero bits");
				assert(bitPosition < BitWidth && (numBits + bitPosition) <= BitWidth &&
				"Illegal bit extraction");
				hansUnsubmitted Not Done Reply Inline Actions This is very nit-picky, but I find `numBits > 0` much more natural than `0 < numBits`. And I'd suggest splitting up the second assert into two, to make it clearer what went wrong if it fires. hans: This is very nit-picky, but I find `numBits > 0` much more natural than `0 < numBits`. And I'd…

				unsigned loBit = whichBit(bitPosition);
				if (isSingleWord())
				return APInt(numBits, VAL >> loBit);
				hansUnsubmitted Not Done Reply Inline Actions For the single-word case we know bitPosition is in the first word, so there's no need for the remainder operation in `whichBit`. I'd suggest moving the if-statement before computing loBit, and just using bitPosition in the shift. hans: For the single-word case we know bitPosition is in the first word, so there's no need for the…

				unsigned loWord = whichWord(bitPosition);
				unsigned hiWord = whichWord(bitPosition + numBits - 1);

				// Single word result extracting bits from a single word source.
				if (loWord == hiWord)
				return APInt(numBits, pVal[loWord] >> loBit);

				// Extracting bits that start on a source word boundary can be done
				// as a fast memory copy.
				if (loBit == 0)
				return APInt(numBits, 1 + hiWord - loWord, pVal + loWord);
				filcabUnsubmitted Not Done Reply Inline Actions Looks deprecated /// Equivalent to APInt(numBits, ArrayRef<uint64_t>(bigVal, numWords)), but /// deprecated because this constructor is prone to ambiguity with the /// APInt(unsigned, uint64_t, bool) constructor. /// /// If this overload is ever deleted, care should be taken to prevent calls /// from being incorrectly captured by the APInt(unsigned, uint64_t, bool) /// constructor. APInt(unsigned numBits, unsigned numWords, const uint64_t bigVal[]); Maybe `return APInt(numBits, makeArrayRef(pVal + loWord, 1 + hiWord - loWord));`? filcab: Looks deprecated ``` /// Equivalent to APInt(numBits, ArrayRef<uint64_t>(bigVal, numWords))…

				// General case - shift + copy source words into place.
				APInt Result(numBits, 0);
				uint64_t *pDst = Result.pVal;
				for (unsigned word = loWord; word < hiWord; ++word, ++pDst) {
				hansUnsubmitted Not Done Reply Inline Actions I'm not sure this loop is correct. It will never set `Result.pVal[hiWord]`? The only test case I see that hits this loop is `EXPECT_EQ(-8388481, i256.extractBits(128, 1).getSExtValue());` In that one, the result is computed from three words, but indeed fits into two words. But what if it didn't -- what about `i256.extractBits(129, 1)`? hans: I'm not sure this loop is correct. It will never set `Result.pVal[hiWord]`? The only test case…
				uint64_t w0 = pVal[word + 0];
				uint64_t w1 = pVal[word + 1];
				*pDst = (w0 >> loBit) \| (w1 << (APINT_BITS_PER_WORD - loBit));
				}
				return Result.clearUnusedBits();
				}

	unsigned APInt::getBitsNeeded(StringRef str, uint8_t radix) {			unsigned APInt::getBitsNeeded(StringRef str, uint8_t radix) {
	assert(!str.empty() && "Invalid string length");			assert(!str.empty() && "Invalid string length");
	assert((radix == 10 \|\| radix == 8 \|\| radix == 16 \|\| radix == 2 \|\|			assert((radix == 10 \|\| radix == 8 \|\| radix == 16 \|\| radix == 2 \|\|
	radix == 36) &&			radix == 36) &&
	"Radix should be 2, 8, 10, 16, or 36!");			"Radix should be 2, 8, 10, 16, or 36!");

	size_t slen = str.size();			size_t slen = str.size();

	▲ Show 20 Lines • Show All 2,314 Lines • Show Last 20 Lines

lib/Target/X86/X86ISelLowering.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 5,201 Lines • ▼ Show 20 Lines	auto SplitBitData = [&]() {
bool AllowUndefs = AllowWholeUndefs \|\| AllowPartialUndefs;		bool AllowUndefs = AllowWholeUndefs \|\| AllowPartialUndefs;
if (UndefBits.getBoolValue() && !AllowUndefs)		if (UndefBits.getBoolValue() && !AllowUndefs)
return false;		return false;

UndefElts = SmallBitVector(NumElts, false);		UndefElts = SmallBitVector(NumElts, false);
EltBits.resize(NumElts, APInt(EltSizeInBits, 0));		EltBits.resize(NumElts, APInt(EltSizeInBits, 0));

for (unsigned i = 0; i != NumElts; ++i) {		for (unsigned i = 0; i != NumElts; ++i) {
APInt UndefEltBits = UndefBits.lshr(i * EltSizeInBits);		unsigned BitOffset = i * EltSizeInBits;
UndefEltBits = UndefEltBits.zextOrTrunc(EltSizeInBits);		APInt UndefEltBits = UndefBits.extractBits(EltSizeInBits, BitOffset);

// Only treat an element as UNDEF if all bits are UNDEF.		// Only treat an element as UNDEF if all bits are UNDEF.
if (UndefEltBits.isAllOnesValue()) {		if (UndefEltBits.isAllOnesValue()) {
if (!AllowWholeUndefs)		if (!AllowWholeUndefs)
return false;		return false;
UndefElts[i] = true;		UndefElts[i] = true;
continue;		continue;
}		}

// If only some bits are UNDEF then treat them as zero (or bail if not		// If only some bits are UNDEF then treat them as zero (or bail if not
// supported).		// supported).
if (UndefEltBits.getBoolValue() && !AllowPartialUndefs)		if (UndefEltBits.getBoolValue() && !AllowPartialUndefs)
return false;		return false;

APInt Bits = MaskBits.lshr(i * EltSizeInBits).zextOrTrunc(EltSizeInBits);		APInt Bits = MaskBits.extractBits(EltSizeInBits, BitOffset);
EltBits[i] = Bits.getZExtValue();		EltBits[i] = Bits.getZExtValue();
}		}
return true;		return true;
};		};

// Collect constant bits and insert into mask/undef bit masks.		// Collect constant bits and insert into mask/undef bit masks.
auto CollectConstantBits = [SizeInBits](const Constant *Cst, APInt &Mask,		auto CollectConstantBits = [SizeInBits](const Constant *Cst, APInt &Mask,
APInt &Undefs, unsigned BitOffset) {		APInt &Undefs, unsigned BitOffset) {
▲ Show 20 Lines • Show All 1,181 Lines • ▼ Show 20 Lines

static Constant *getConstantVector(MVT VT, const APInt &SplatValue,		static Constant *getConstantVector(MVT VT, const APInt &SplatValue,
unsigned SplatBitSize, LLVMContext &C) {		unsigned SplatBitSize, LLVMContext &C) {
unsigned ScalarSize = VT.getScalarSizeInBits();		unsigned ScalarSize = VT.getScalarSizeInBits();
unsigned NumElm = SplatBitSize / ScalarSize;		unsigned NumElm = SplatBitSize / ScalarSize;

SmallVector<Constant *, 32> ConstantVec;		SmallVector<Constant *, 32> ConstantVec;
for (unsigned i = 0; i < NumElm; i++) {		for (unsigned i = 0; i < NumElm; i++) {
APInt Val = SplatValue.lshr(ScalarSize * i).trunc(ScalarSize);		APInt Val = SplatValue.extractBits(ScalarSize, ScalarSize * i);
Constant *Const;		Constant *Const;
if (VT.isFloatingPoint()) {		if (VT.isFloatingPoint()) {
assert((ScalarSize == 32 \|\| ScalarSize == 64) &&		assert((ScalarSize == 32 \|\| ScalarSize == 64) &&
"Unsupported floating point scalar size");		"Unsupported floating point scalar size");
if (ScalarSize == 32)		if (ScalarSize == 32)
Const = ConstantFP::get(Type::getFloatTy(C), Val.bitsToFloat());		Const = ConstantFP::get(Type::getFloatTy(C), Val.bitsToFloat());
else		else
Const = ConstantFP::get(Type::getDoubleTy(C), Val.bitsToDouble());		Const = ConstantFP::get(Type::getDoubleTy(C), Val.bitsToDouble());
▲ Show 20 Lines • Show All 28,990 Lines • Show Last 20 Lines

lib/Target/X86/X86ShuffleDecodeConstantPool.cpp

Show First 20 Lines • Show All 71 Lines • ▼ Show 20 Lines	static bool extractConstantMask(const Constant *C, unsigned MaskEltSizeInBits,
assert((CstSizeInBits % MaskEltSizeInBits) == 0 &&		assert((CstSizeInBits % MaskEltSizeInBits) == 0 &&
"Unaligned shuffle mask size");		"Unaligned shuffle mask size");

unsigned NumMaskElts = CstSizeInBits / MaskEltSizeInBits;		unsigned NumMaskElts = CstSizeInBits / MaskEltSizeInBits;
UndefElts = SmallBitVector(NumMaskElts, false);		UndefElts = SmallBitVector(NumMaskElts, false);
RawMask.resize(NumMaskElts, 0);		RawMask.resize(NumMaskElts, 0);

for (unsigned i = 0; i != NumMaskElts; ++i) {		for (unsigned i = 0; i != NumMaskElts; ++i) {
APInt EltUndef = UndefBits.lshr(i * MaskEltSizeInBits);		unsigned BitOffset = i * MaskEltSizeInBits;
EltUndef = EltUndef.zextOrTrunc(MaskEltSizeInBits);		APInt EltUndef = UndefBits.extractBits(MaskEltSizeInBits, BitOffset);

// Only treat the element as UNDEF if all bits are UNDEF, otherwise		// Only treat the element as UNDEF if all bits are UNDEF, otherwise
// treat it as zero.		// treat it as zero.
if (EltUndef.isAllOnesValue()) {		if (EltUndef.isAllOnesValue()) {
UndefElts[i] = true;		UndefElts[i] = true;
RawMask[i] = 0;		RawMask[i] = 0;
continue;		continue;
}		}

APInt EltBits = MaskBits.lshr(i * MaskEltSizeInBits);		APInt EltBits = MaskBits.extractBits(MaskEltSizeInBits, BitOffset);
EltBits = EltBits.zextOrTrunc(MaskEltSizeInBits);
RawMask[i] = EltBits.getZExtValue();		RawMask[i] = EltBits.getZExtValue();
}		}

return true;		return true;
}		}

void DecodePSHUFBMask(const Constant *C, SmallVectorImpl<int> &ShuffleMask) {		void DecodePSHUFBMask(const Constant *C, SmallVectorImpl<int> &ShuffleMask) {
Type *MaskTy = C->getType();		Type *MaskTy = C->getType();
▲ Show 20 Lines • Show All 233 Lines • Show Last 20 Lines

unittests/ADT/APIntTest.cpp

Show First 20 Lines • Show All 1,428 Lines • ▼ Show 20 Lines	for (int N : { 1, 2, 3, 4, 7, 8, 16, 32, 64, 127, 128, 129, 256 }) {
APInt One(N, 1);		APInt One(N, 1);
for (int I = 1; I <= N; ++I) {		for (int I = 1; I <= N; ++I) {
APInt MaskVal = One.shl(I) - 1;		APInt MaskVal = One.shl(I) - 1;
EXPECT_TRUE(APIntOps::isMask(MaskVal));		EXPECT_TRUE(APIntOps::isMask(MaskVal));
}		}
}		}
}		}

		TEST(APIntTest, extractBits) {
		APInt i32(32, 0x1234567);
		EXPECT_EQ(0x3456, i32.extractBits(16, 4));

		APInt i256(256, -16776961 /* 0xFFFFFFFFFFFFFFFFFFFFFFFFFF0000FF */, true);
		hansUnsubmitted Not Done Reply Inline Actions Would it be possible to use hex constants here and in the expectations somehow? Especially for the last one, it's hard to see what the expected result should be.. hans: Would it be possible to use hex constants here and in the expectations somehow? Especially for…
		EXPECT_EQ(255, i256.extractBits(16, 0));
		EXPECT_EQ(127, i256.extractBits(16, 1));
		EXPECT_EQ(-1, i256.extractBits(32, 64).getSExtValue());
		EXPECT_EQ(-1, i256.extractBits(128, 128).getSExtValue());
		EXPECT_EQ(-8388481, i256.extractBits(128, 1).getSExtValue());
		}

#if defined(__clang__)		#if defined(__clang__)
// Disable the pragma warning from versions of Clang without -Wself-move		// Disable the pragma warning from versions of Clang without -Wself-move
#pragma clang diagnostic push		#pragma clang diagnostic push
#pragma clang diagnostic ignored "-Wunknown-pragmas"		#pragma clang diagnostic ignored "-Wunknown-pragmas"
// Disable the warning that triggers on exactly what is being tested.		// Disable the warning that triggers on exactly what is being tested.
#pragma clang diagnostic push		#pragma clang diagnostic push
#pragma clang diagnostic ignored "-Wself-move"		#pragma clang diagnostic ignored "-Wself-move"
#endif		#endif
▲ Show 20 Lines • Show All 63 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[APInt] Add APInt::extractBits() method to extract APInt subrangeClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 89656

include/llvm/ADT/APInt.h

lib/Support/APInt.cpp

lib/Target/X86/X86ISelLowering.cpp

lib/Target/X86/X86ShuffleDecodeConstantPool.cpp

unittests/ADT/APIntTest.cpp

[APInt] Add APInt::extractBits() method to extract APInt subrange
ClosedPublic