This is an archive of the discontinued LLVM Phabricator instance.

Current upstream crash with llvm/lib/IR/Value.cpp:404: void llvm::Value::doRAUW(llvm::Value *, llvm::Value::ReplaceMetadataUses): Assertion `New->getType() == getType() && "replaceAllUses of value with new value of different type!"' failed.

run: opt -S -constprop bitcast.ll -o -

bitcast.ll

define <vscale x 4 x float> @bitcast_scalable_constant() {
  %i1 = insertelement <vscale x 4 x i32> undef, i32 1, i32 0
  %i2 = shufflevector <vscale x 4 x i32> %i1, <vscale x 4 x i32> undef, <vscale x 4 x i32> zeroinitializer
  %i3 = bitcast <vscale x 4 x i32> %i2 to <vscale x 4 x float>
  ret <vscale x 4 x float> %i3
}

LGTM. I don't know how many of these patches are needed, but it would be slightly more efficient to create a "vscale.ll" test file and put all of the related tests in that 1 file.

This revision is now accepted and ready to land. Dec 12 2019, 4:41 AM

sdesmalen added inline comments. Dec 12 2019, 5:14 AM

llvm/lib/IR/ConstantFold.cpp
52	Do we want to add support for a special case here for splat operations (like the one in the test)?

In D71389#1781620, @spatel wrote:

LGTM. I don't know how many of these patches are needed, but it would be slightly more efficient to create a "vscale.ll" test file and put all of the related tests in that 1 file.

Thank you Sanjay!
Yes, I will try my best to combine them. The current code base has a lot of assumptions based on fixed-length vectors.
I am going through the IR passes and trying to fix them as best I can.

llvm/lib/IR/ConstantFold.cpp
52	Actually we cannot; we would end up with a scalable vector constant. Instead of <4 x i32> <i32 1, i32 1, i32 1, i32 1>, we would need a form like "<vscale x 4 x i32> [?]". I don't know how to express a scalable vector constant in current upstream; please let me know if I got anything wrong.

efriedma added inline comments. Dec 12 2019, 12:17 PM

llvm/lib/IR/ConstantFold.cpp
52	I think the suggestion is to special-case the specific sequence of insertelement+splatvector+bitcast, and bitcast the operand of the insertelement. Not sure how useful that would be in general, though.

spatel added inline comments. Dec 12 2019, 12:40 PM

llvm/lib/IR/ConstantFold.cpp
52	If we want to do that transform, it would go in instsimplify since we're doing a multi-instruction analysis/fold that becomes a constant.

spatel added inline comments. Dec 12 2019, 12:42 PM

llvm/lib/IR/ConstantFold.cpp
52	Sorry - might've misread that. If we're creating a new bitcast, that would have to go in instcombine (we don't create new instructions in instsimplify).

Yes, this transformation will need to go in InstCombine.
We should probably implement this when constant scalable vector support is ready, so that the effect will be most obvious.

efriedma added inline comments. Dec 12 2019, 1:30 PM

llvm/lib/IR/ConstantFold.cpp
52	By the time we get here, the shuffle and insertelement are ConstantExprs, not instructions.

sdesmalen added inline comments. Dec 13 2019, 2:57 AM

llvm/lib/IR/ConstantFold.cpp
52	In terms of usefulness, this would make it easier to determine whether the value is a splat when calling `Constant::getSplatValue` without having to look through the bitcast first.

huihuiz mentioned this in D71637: [PatternMatch] Add support for matching ConstantExpr. Dec 17 2019, 4:29 PM

huihuiz added a child revision: D71637: [PatternMatch] Add support for matching ConstantExpr.

Add special case handling:

For splat vector, fold bitcast to splat value.

Thanks for adding the extra constant fold @huihuiz!

llvm/lib/IR/ConstantFold.cpp
580	Because we're matching for a zero mask, explicitly matching `m_Undef` here seems unnecessarily restrictive, `m_Value()` should be sufficient.
587	Can we match this value directly with `m_Zero(Zero)`, rather than needing `auto InsertElem` and `InsertElem->getOperand(2)` ?
590	Can we match the value for `CE->getOperand(1)` directly with something like `m_Value(Vec2)`? I'd expect `CE->getOperand(2)` can reuse `Zero` as suggested above.

bjope added a subscriber: bjope. Dec 18 2019, 6:52 AM

bjope added inline comments.

llvm/lib/IR/ConstantFold.cpp
572	Don't you need some check that both sides of the bitcast are vectors with matching number of elements?
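
To illustrate the concern about matching element counts, here is a minimal, self-contained sketch. It does not use the LLVM API: `ToyVecType`, `sameElementCount`, `isLegalBitcast` and `canFoldSplatBitcast` are hypothetical names invented for this example, and the legality rule is a simplified assumption. The point is that rewriting bitcast(splat(x)) as splat(bitcast(x)) only makes sense when x can be bitcast on its own, which requires both vector types to have the same element count.

```cpp
#include <cassert>

// Hypothetical toy model of a vector type: <[vscale x] minElts x iN>,
// where N is eltBits. This is NOT llvm::VectorType.
struct ToyVecType {
  unsigned minElts;  // minimum number of elements
  bool scalable;     // true for <vscale x N x ...>
  unsigned eltBits;  // width of one element in bits
};

// Element counts match only if both the minimum count and the scalable
// flag agree; "4" and "vscale x 4" are different counts.
inline bool sameElementCount(const ToyVecType &a, const ToyVecType &b) {
  return a.minElts == b.minElts && a.scalable == b.scalable;
}

// Simplified assumption: a whole-vector bitcast is legal when scalability
// agrees and the total (minimum) bit widths are equal.
inline bool isLegalBitcast(const ToyVecType &a, const ToyVecType &b) {
  assert(a.minElts > 0 && b.minElts > 0 && "vectors must be non-empty");
  return a.scalable == b.scalable &&
         a.minElts * a.eltBits == b.minElts * b.eltBits;
}

// The splat fold moves the bitcast onto the splatted scalar, so the scalar
// types must have equal width, i.e. the element counts must match.
inline bool canFoldSplatBitcast(const ToyVecType &src, const ToyVecType &dst) {
  return isLegalBitcast(src, dst) && sameElementCount(src, dst);
}
```

In this model, `<vscale x 4 x i32>` to `<vscale x 4 x float>` can be folded, while `<vscale x 4 x i32>` to `<vscale x 8 x i16>` is a legal bitcast that must not be folded this way, because a single i32 cannot be bitcast to a single i16.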

Clean up code based on reviewer feedback.

llvm/lib/IR/ConstantFold.cpp
580	Yes, we don't need to restrict it to m_Undef(). I think we should restrict it with m_Constant(), since I would expect the operands to be constants in ConstantFoldCastInstruction(). Doing this also avoids unnecessary cast<Constant> calls when using ConstantExpr to build the new InsertElement or to cast operands to the new type.
587	The current pattern matcher doesn't support m_Zero(Zero). I use m_CombineAnd(m_Zero(), m_Constant(Zero)) instead.
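
The idea behind combining a predicate matcher with a capturing matcher can be sketched without LLVM. Everything below is a hypothetical stand-in written for illustration (`ToyConstant`, `MatchZero`, `BindConstant`, `CombineAnd`, `combineAnd`, `captureIfZero`); it is not the PatternMatch implementation. It only mirrors the shape of `m_CombineAnd(m_Zero(), m_Constant(Zero))`: the first matcher checks a property and the second one binds the operand, and the capture happens only if the check succeeds.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for a constant operand: just an integer payload.
struct ToyConstant {
  std::int64_t value;
};

// Predicate-only matcher (in the spirit of m_Zero()): captures nothing.
struct MatchZero {
  bool match(const ToyConstant *c) const { return c->value == 0; }
};

// Capturing matcher (in the spirit of m_Constant(X)): always succeeds and
// stores the matched operand in the caller's variable.
struct BindConstant {
  const ToyConstant *&slot;
  bool match(const ToyConstant *c) const {
    slot = c;
    return true;
  }
};

// Succeeds only if both sub-matchers succeed on the same operand. The
// right-hand matcher runs only after the left-hand one has succeeded.
template <typename L, typename R> struct CombineAnd {
  L lhs;
  R rhs;
  bool match(const ToyConstant *c) const {
    return lhs.match(c) && rhs.match(c);
  }
};

template <typename L, typename R> CombineAnd<L, R> combineAnd(L l, R r) {
  return {l, r};
}

// Returns the operand if it is zero, otherwise nullptr.
inline const ToyConstant *captureIfZero(const ToyConstant *c) {
  assert(c != nullptr && "operand must not be null");
  const ToyConstant *captured = nullptr;
  combineAnd(MatchZero{}, BindConstant{captured}).match(c);
  return captured;
}
```

Putting the predicate first matters in this sketch: because `&&` short-circuits, a failed zero check leaves the captured pointer untouched.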

huihuiz mentioned this in D74095: [ConstantFold][SVE] Fold bitcast into splat value for splat vector. Feb 5 2020, 3:32 PM

huihuiz added a child revision: D74095: [ConstantFold][SVE] Fold bitcast into splat value for splat vector.

Closed by commit rG801857c59ea6: [ConstantFold][SVE] Fix constant folding for bitcast. (authored by huihuiz). Feb 5 2020, 3:42 PM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path	Size
llvm/lib/IR/ConstantFold.cpp	28 lines
llvm/test/Analysis/ConstantFolding/bitcast.ll	12 lines

Diff 234639

llvm/lib/IR/ConstantFold.cpp

[... 41 lines not shown ...]
 /// Convert the specified vector Constant node to the specified vector type.
 /// At this point, we know that the elements of the input vector constant are
 /// all simple integer or FP values.
 static Constant *BitCastConstantVector(Constant *CV, VectorType *DstTy) {

   if (CV->isAllOnesValue()) return Constant::getAllOnesValue(DstTy);
   if (CV->isNullValue()) return Constant::getNullValue(DstTy);

+  // Do not iterate on scalable vector. The num of elements is unknown at
+  // compile-time.
+  if (DstTy->isScalable())
+    return nullptr;

   // If this cast changes element count then we can't handle it here:
   // doing so requires endianness information. This should be handled by
   // Analysis/ConstantFolding.cpp
   unsigned NumElts = DstTy->getNumElements();
   if (NumElts != CV->getType()->getVectorNumElements())
     return nullptr;

   Type *DstEltTy = DstTy->getElementType();
[... 500 lines not shown ...] if (CE->isCast()) {
       for (unsigned i = 1, e = CE->getNumOperands(); i != e; ++i)
         if (!CE->getOperand(i)->isNullValue()) {
           isAllNull = false;
           break;
         }
       if (isAllNull)
         // This is casting one pointer type to another, always BitCast
         return ConstantExpr::getPointerCast(CE->getOperand(0), DestTy);
+    } else if (CE->getType()->isVectorTy() && DestTy->isVectorTy() &&
+               CE->getType()->getVectorElementCount() ==
+                   DestTy->getVectorElementCount() &&
+               CE->getOpcode() == Instruction::ShuffleVector &&
+               opc == Instruction::BitCast) {
+      // For splat vector, fold bitcast to splat value.
+      // BitCast(ShuffleVector(InsertElement(C1, SplatV, Zero), C2, Zero)) to NewType
+      // into
+      // ShuffleVector(InsertElement(C1, BitCast(SplatV) to NewType, Zero), C2, Zero)
+      Constant *SplatV, *C1, *C2, *ZeroIdx, *ZeroMask;
+      if (match(CE, m_ShuffleVector(
+                        m_InsertElement(
+                            m_Constant(C1), m_Constant(SplatV),
+                            m_CombineAnd(m_Zero(), m_Constant(ZeroIdx))),
+                        m_Constant(C2),
+                        m_CombineAnd(m_Zero(), m_Constant(ZeroMask))))) {
+        auto *CastedSplatV =
+            ConstantExpr::getCast(opc, SplatV, DestTy->getScalarType());
+        return ConstantExpr::getShuffleVector(
+            ConstantExpr::getInsertElement(
+                ConstantExpr::getCast(opc, C1, DestTy), CastedSplatV, ZeroIdx),
+            ConstantExpr::getCast(opc, C2, DestTy), ZeroMask);
+      }
     }
   }

   // If the cast operand is a constant vector, perform the cast by
   // operating on each element. In the cast of bitcasts, the element
   // count may be mismatched; don't attempt to handle that here.
   if ((isa<ConstantVector>(V) || isa<ConstantDataVector>(V)) &&
       DestTy->isVectorTy() &&
[... 991 lines not shown ...]

llvm/test/Analysis/ConstantFolding/bitcast.ll

This file was added.

; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
; RUN: opt < %s -constprop -S -verify | FileCheck %s

define <vscale x 4 x float> @bitcast_scalable_constant() {
; CHECK-LABEL: @bitcast_scalable_constant(
; CHECK-NEXT: ret <vscale x 4 x float> shufflevector (<vscale x 4 x float> insertelement (<vscale x 4 x float> undef, float 0x36A0000000000000, i32 0), <vscale x 4 x float> undef, <vscale x 4 x i32> zeroinitializer)
;
  %i1 = insertelement <vscale x 4 x i32> undef, i32 1, i32 0
  %i2 = shufflevector <vscale x 4 x i32> %i1, <vscale x 4 x i32> undef, <vscale x 4 x i32> zeroinitializer
  %i3 = bitcast <vscale x 4 x i32> %i2 to <vscale x 4 x float>
  ret <vscale x 4 x float> %i3
}
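
The constant `float 0x36A0000000000000` in the CHECK line can be reproduced by hand. The sketch below is standalone C++ with no LLVM dependency, and the helper names (`bitcastI32ToFloat`, `doubleBitsOf`, `checkSplatValue`) are made up for this example. It assumes IEEE-754 floats and default denormal handling. It reinterprets the bits of `i32 1` as a float, which gives the smallest positive denormal (2^-149), then widens it to double and reads back the 64-bit pattern, since the IR printer writes such float constants as the hex encoding of the equivalent double.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Reinterpret the 32 bits of an i32 as a single-precision float, which is
// what a scalar bitcast from i32 to float does.
inline float bitcastI32ToFloat(std::uint32_t bits) {
  static_assert(sizeof(float) == sizeof(std::uint32_t),
                "this sketch assumes a 32-bit float");
  float f;
  std::memcpy(&f, &bits, sizeof f);
  return f;
}

// Widen to double (exact for every finite float) and return the raw 64-bit
// pattern of that double.
inline std::uint64_t doubleBitsOf(float f) {
  static_assert(sizeof(double) == sizeof(std::uint64_t),
                "this sketch assumes a 64-bit double");
  double d = f;
  std::uint64_t bits;
  std::memcpy(&bits, &d, sizeof bits);
  return bits;
}

// Self-check mirroring the splat value in the CHECK line: 2^-149 as a double
// has biased exponent 1023 - 149 = 874 = 0x36A and a zero mantissa.
inline void checkSplatValue() {
  assert(doubleBitsOf(bitcastI32ToFloat(1u)) == 0x36A0000000000000ULL);
}
```

So the folded output is the same splat as the input, with the scalar `i32 1` replaced by its bitcast to float.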

[ConstantFold][SVE] Fix constant folding for bitcast. (Closed, Public)
