This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/trunk/
-
trunk/
-
lib/Transforms/InstCombine/
-
Transforms/
-
InstCombine/
-
InstCombineSelect.cpp
-
test/Transforms/InstCombine/
-
Transforms/
-
InstCombine/
-
apint-select.ll

Differential D21899

[InstCombine] extend (select X, C1, C2 --> ext X) to vectors
ClosedPublic

Authored by spatel on Jun 30 2016, 10:29 AM.

Download Raw Diff

Details

Reviewers

majnemer
eli.friedman
hfinkel

Commits

rG65a51c25c12a: [InstCombine] enhance (select X, C1, C2 --> ext X) to handle vectors
rL274696: [InstCombine] enhance (select X, C1, C2 --> ext X) to handle vectors

Summary

The code change is hopefully straightforward: replace dyn_casts with m_APInt, and we get transforms for splat vectors.

But these transforms raise some questions:

In the cases where we need a 'not', we're increasing the instruction count. Is this justified because ext/not is always assumed cheaper/more canonical than a select?

In the not+zext case, notice that the zext gets moved ahead of the xor. This is because visitZext() has: // zext (xor i1 X, true) to i32 --> xor (zext i1 X to i32), 1

There's no code comment for the motivation. Assuming there is good reason to break the m_Not pattern should sext+xor do the same?

If we're ok with increasing the instruction count for #1, is the scalar select of vectors example also a good transform? In the worst case, we'd need 4 instructions in place of the select: xor, zext, insertelement, shufflevector.

Diff Detail

Repository: rL LLVM

Event Timeline

spatel updated this revision to Diff 62377.Jun 30 2016, 10:29 AM

spatel retitled this revision from to [InstCombine] extend (select X, C1, C2 --> ext X) to vectors.

spatel updated this object.

spatel added reviewers: eli.friedman, majnemer, hfinkel.

spatel added a subscriber: llvm-commits.

Herald added a subscriber: mcrosier. · View Herald TranscriptJun 30 2016, 10:30 AM

Generally looks fine.

lib/Transforms/InstCombine/InstCombineSelect.cpp
965 ↗	(On Diff #62377)	Extra newline.
968 ↗	(On Diff #62377)	Won't this crash if SI.getType() is `<4 x i1>`?

Re: your questions about canonicalization:

The canonical representation of a calculation is whatever instcombine declares it to be. We generally try to favor arithmetic over selects because selects tend to be more expensive, but there's a limit to where that's worthwhile.
Not sure exactly why we prefer to move xor out of a zext rather than in... apparently the transform was added in r21713 to cover this testcase:

define i32 @test22(i1 %X) {
; CHECK-LABEL: @test22(
; CHECK-NEXT: %1 = zext i1 %X to i32
; CHECK-NEXT: ret i32 %1
	%Y = xor i1 %X, true		; <i1> [#uses=1]
	%Z = zext i1 %Y to i32		; <i32> [#uses=1]
	%Q = xor i32 %Z, 1		; <i32> [#uses=1]
	ret i32 %Q
}

Obviously, that doesn't really motivate the canonical representation either way. That said, moving the xor in rather than out in general raises awkward questions about what to do in cases like "(zext i8 x to i32) ^ 257". And changing the canonical representation probably involves some work to figure out what other transforms depend on the canonical representation of zext+xor.

Patch updated:
Added assert for i1 types (vectors should be handled after rL274465).

In D21899#473188, @eli.friedman wrote:

Re: your questions about canonicalization:

Obviously, that doesn't really motivate the canonical representation either way. That said, moving the xor in rather than out in general raises awkward questions about what to do in cases like "(zext i8 x to i32) ^ 257". And changing the canonical representation probably involves some work to figure out what other transforms depend on the canonical representation of zext+xor.

Thanks for the answers! I don't think we need to worry about your ^257 example because we're only looking at i1 types for this transform.

lib/Transforms/InstCombine/InstCombineSelect.cpp
971 ↗	(On Diff #62898)	Yes - scalar i1 selects were all handled above, but vectors were not. That should be fixed with rL274465, so now we can assert that no i1 types make it down here.

LGTM.

This revision is now accepted and ready to land.Jul 6 2016, 11:16 AM

Closed by commit rL274696: [InstCombine] enhance (select X, C1, C2 --> ext X) to handle vectors (authored by spatel). · Explain WhyJul 6 2016, 3:30 PM

This revision was automatically updated to reflect the committed changes.

spatel mentioned this in D22271: [InstCombine] reverse canonicalization of xor(zext i1 A), 1 <---> zext(not i1 A, true) (PR28476).Jul 12 2016, 11:10 AM

Revision Contents

Path

Size

llvm/

trunk/

lib/

Transforms/

InstCombine/

InstCombineSelect.cpp

50 lines

test/

Transforms/

InstCombine/

apint-select.ll

23 lines

Diff 62985

llvm/trunk/lib/Transforms/InstCombine/InstCombineSelect.cpp

Show First 20 Lines • Show All 948 Lines • ▼ Show 20 Lines	if (SI.getType()->getScalarType()->isIntegerTy(1) &&
// select a, ~a, b -> (~a) & b		// select a, ~a, b -> (~a) & b
// select a, b, ~a -> (~a) \| b		// select a, b, ~a -> (~a) \| b
if (match(TrueVal, m_Not(m_Specific(CondVal))))		if (match(TrueVal, m_Not(m_Specific(CondVal))))
return BinaryOperator::CreateAnd(TrueVal, FalseVal);		return BinaryOperator::CreateAnd(TrueVal, FalseVal);
if (match(FalseVal, m_Not(m_Specific(CondVal))))		if (match(FalseVal, m_Not(m_Specific(CondVal))))
return BinaryOperator::CreateOr(TrueVal, FalseVal);		return BinaryOperator::CreateOr(TrueVal, FalseVal);
}		}

// Selecting between two integer constants?		// Selecting between two integer or vector splat integer constants?
if (ConstantInt *TrueValC = dyn_cast<ConstantInt>(TrueVal))		//
if (ConstantInt *FalseValC = dyn_cast<ConstantInt>(FalseVal)) {		// Note that we don't handle a scalar select of vectors:
		// select i1 %c, <2 x i8> <1, 1>, <2 x i8> <0, 0>
		// because that may need 3 instructions to splat the condition value:
		// extend, insertelement, shufflevector.
		if (CondVal->getType()->isVectorTy() == SI.getType()->isVectorTy()) {
// select C, 1, 0 -> zext C to int		// select C, 1, 0 -> zext C to int
if (FalseValC->isZero() && TrueValC->getValue() == 1)		if (match(TrueVal, m_One()) && match(FalseVal, m_Zero()))
return new ZExtInst(CondVal, SI.getType());		return new ZExtInst(CondVal, SI.getType());

// select C, -1, 0 -> sext C to int		// select C, -1, 0 -> sext C to int
if (FalseValC->isZero() && TrueValC->isAllOnesValue())		if (match(TrueVal, m_AllOnes()) && match(FalseVal, m_Zero()))
return new SExtInst(CondVal, SI.getType());		return new SExtInst(CondVal, SI.getType());

// select C, 0, 1 -> zext !C to int		// select C, 0, 1 -> zext !C to int
if (TrueValC->isZero() && FalseValC->getValue() == 1) {		if (match(TrueVal, m_Zero()) && match(FalseVal, m_One())) {
Value *NotCond = Builder->CreateNot(CondVal, "not."+CondVal->getName());		Value *NotCond = Builder->CreateNot(CondVal, "not." + CondVal->getName());
return new ZExtInst(NotCond, SI.getType());		return new ZExtInst(NotCond, SI.getType());
}		}

// select C, 0, -1 -> sext !C to int		// select C, 0, -1 -> sext !C to int
if (TrueValC->isZero() && FalseValC->isAllOnesValue()) {		if (match(TrueVal, m_Zero()) && match(FalseVal, m_AllOnes())) {
Value *NotCond = Builder->CreateNot(CondVal, "not."+CondVal->getName());		Value *NotCond = Builder->CreateNot(CondVal, "not." + CondVal->getName());
return new SExtInst(NotCond, SI.getType());		return new SExtInst(NotCond, SI.getType());
}		}
		}

		if (ConstantInt *TrueValC = dyn_cast<ConstantInt>(TrueVal))
		if (ConstantInt *FalseValC = dyn_cast<ConstantInt>(FalseVal))
if (Value *V = foldSelectICmpAnd(SI, TrueValC, FalseValC, Builder))		if (Value *V = foldSelectICmpAnd(SI, TrueValC, FalseValC, Builder))
return replaceInstUsesWith(SI, V);		return replaceInstUsesWith(SI, V);
}

// See if we are selecting two values based on a comparison of the two values.		// See if we are selecting two values based on a comparison of the two values.
if (FCmpInst *FCI = dyn_cast<FCmpInst>(CondVal)) {		if (FCmpInst *FCI = dyn_cast<FCmpInst>(CondVal)) {
if (FCI->getOperand(0) == TrueVal && FCI->getOperand(1) == FalseVal) {		if (FCI->getOperand(0) == TrueVal && FCI->getOperand(1) == FalseVal) {
// Transform (X == Y) ? X : Y -> Y		// Transform (X == Y) ? X : Y -> Y
if (FCI->getPredicate() == FCmpInst::FCMP_OEQ) {		if (FCI->getPredicate() == FCmpInst::FCMP_OEQ) {
// This is not safe in general for floating point:		// This is not safe in general for floating point:
// consider X== -0, Y== +0.		// consider X== -0, Y== +0.
▲ Show 20 Lines • Show All 270 Lines • Show Last 20 Lines

llvm/trunk/test/Transforms/InstCombine/apint-select.ll

	Show All 35 Lines
	; CHECK-NEXT: [[NOT_C:%.*]] = xor i1 %C, true			; CHECK-NEXT: [[NOT_C:%.*]] = xor i1 %C, true
	; CHECK-NEXT: [[V:%.*]] = sext i1 [[NOT_C]] to i999			; CHECK-NEXT: [[V:%.*]] = sext i1 [[NOT_C]] to i999
	; CHECK-NEXT: ret i999 [[V]]			; CHECK-NEXT: ret i999 [[V]]
	;			;
	%V = select i1 %C, i999 0, i999 -1			%V = select i1 %C, i999 0, i999 -1
	ret i999 %V			ret i999 %V
	}			}

	; FIXME: Vector selects of vector splat constants match APInt too.			; Vector selects of vector splat constants match APInt too.

	define <2 x i41> @zext_vec(<2 x i1> %C) {			define <2 x i41> @zext_vec(<2 x i1> %C) {
	; CHECK-LABEL: @zext_vec(			; CHECK-LABEL: @zext_vec(
	; CHECK-NEXT: [[V:%.*]] = select <2 x i1> %C, <2 x i41> <i41 1, i41 1>, <2 x i41> zeroinitializer			; CHECK-NEXT: [[V:%.*]] = zext <2 x i1> %C to <2 x i41>
	; CHECK-NEXT: ret <2 x i41> [[V]]			; CHECK-NEXT: ret <2 x i41> [[V]]
	;			;
	%V = select <2 x i1> %C, <2 x i41> <i41 1, i41 1>, <2 x i41> <i41 0, i41 0>			%V = select <2 x i1> %C, <2 x i41> <i41 1, i41 1>, <2 x i41> <i41 0, i41 0>
	ret <2 x i41> %V			ret <2 x i41> %V
	}			}

	define <2 x i32> @sext_vec(<2 x i1> %C) {			define <2 x i32> @sext_vec(<2 x i1> %C) {
	; CHECK-LABEL: @sext_vec(			; CHECK-LABEL: @sext_vec(
	; CHECK-NEXT: [[V:%.*]] = select <2 x i1> %C, <2 x i32> <i32 -1, i32 -1>, <2 x i32> zeroinitializer			; CHECK-NEXT: [[V:%.*]] = sext <2 x i1> %C to <2 x i32>
	; CHECK-NEXT: ret <2 x i32> [[V]]			; CHECK-NEXT: ret <2 x i32> [[V]]
	;			;
	%V = select <2 x i1> %C, <2 x i32> <i32 -1, i32 -1>, <2 x i32> <i32 0, i32 0>			%V = select <2 x i1> %C, <2 x i32> <i32 -1, i32 -1>, <2 x i32> <i32 0, i32 0>
	ret <2 x i32> %V			ret <2 x i32> %V
	}			}

	define <2 x i999> @not_zext_vec(<2 x i1> %C) {			define <2 x i999> @not_zext_vec(<2 x i1> %C) {
	; CHECK-LABEL: @not_zext_vec(			; CHECK-LABEL: @not_zext_vec(
	; CHECK-NEXT: [[V:%.*]] = select <2 x i1> %C, <2 x i999> zeroinitializer, <2 x i999> <i999 1, i999 1>			; CHECK-NEXT: [[TMP1:%.*]] = zext <2 x i1> %C to <2 x i999>
				; CHECK-NEXT: [[V:%.*]] = xor <2 x i999> [[TMP1]], <i999 1, i999 1>
	; CHECK-NEXT: ret <2 x i999> [[V]]			; CHECK-NEXT: ret <2 x i999> [[V]]
	;			;
	%V = select <2 x i1> %C, <2 x i999> <i999 0, i999 0>, <2 x i999> <i999 1, i999 1>			%V = select <2 x i1> %C, <2 x i999> <i999 0, i999 0>, <2 x i999> <i999 1, i999 1>
	ret <2 x i999> %V			ret <2 x i999> %V
	}			}

	define <2 x i64> @not_sext_vec(<2 x i1> %C) {			define <2 x i64> @not_sext_vec(<2 x i1> %C) {
	; CHECK-LABEL: @not_sext_vec(			; CHECK-LABEL: @not_sext_vec(
	; CHECK-NEXT: [[V:%.*]] = select <2 x i1> %C, <2 x i64> zeroinitializer, <2 x i64> <i64 -1, i64 -1>			; CHECK-NEXT: [[NOT_C:%.*]] = xor <2 x i1> %C, <i1 true, i1 true>
				; CHECK-NEXT: [[V:%.*]] = sext <2 x i1> [[NOT_C]] to <2 x i64>
	; CHECK-NEXT: ret <2 x i64> [[V]]			; CHECK-NEXT: ret <2 x i64> [[V]]
	;			;
	%V = select <2 x i1> %C, <2 x i64> <i64 0, i64 0>, <2 x i64> <i64 -1, i64 -1>			%V = select <2 x i1> %C, <2 x i64> <i64 0, i64 0>, <2 x i64> <i64 -1, i64 -1>
	ret <2 x i64> %V			ret <2 x i64> %V
	}			}

				; But don't touch this - we would need 3 instructions to extend and splat the scalar select condition.

				define <2 x i32> @scalar_select_of_vectors(i1 %c) {
				; CHECK-LABEL: @scalar_select_of_vectors(
				; CHECK-NEXT: [[V:%.*]] = select i1 %c, <2 x i32> <i32 1, i32 1>, <2 x i32> zeroinitializer
				; CHECK-NEXT: ret <2 x i32> [[V]]
				;
				%V = select i1 %c, <2 x i32> <i32 1, i32 1>, <2 x i32> zeroinitializer
				ret <2 x i32> %V
				}

	;; (x <s 0) ? -1 : 0 -> ashr x, 31			;; (x <s 0) ? -1 : 0 -> ashr x, 31

	define i41 @test3(i41 %X) {			define i41 @test3(i41 %X) {
	; CHECK-LABEL: @test3(			; CHECK-LABEL: @test3(
	; CHECK-NEXT: [[X_LOBIT:%.*]] = ashr i41 %X, 40			; CHECK-NEXT: [[X_LOBIT:%.*]] = ashr i41 %X, 40
	; CHECK-NEXT: ret i41 [[X_LOBIT]]			; CHECK-NEXT: ret i41 [[X_LOBIT]]
	;			;
	%t = icmp slt i41 %X, 0			%t = icmp slt i41 %X, 0
	▲ Show 20 Lines • Show All 42 Lines • Show Last 20 Lines