This is an archive of the discontinued LLVM Phabricator instance.

[InstCombine] don't widen an arbitrary sequence of vector ops (PR40032)
ClosedPublic

Authored by spatel on Dec 16 2018, 9:07 AM.

Download Raw Diff

Details

Reviewers

efriedma
craig.topper
RKSimon

Commits

rG1a6e9ec43431: [InstCombine] don't widen an arbitrary sequence of vector ops (PR40032)
rL349389: [InstCombine] don't widen an arbitrary sequence of vector ops (PR40032)

Summary

The problem is shown specifically for a case with vector multiply here:
https://bugs.llvm.org/show_bug.cgi?id=40032
...and this might mask the original backend bug for ARM shown in:
https://bugs.llvm.org/show_bug.cgi?id=39967

As the test diffs here show, we were (and probably still aren't) doing these kinds of transforms in a principled way. We are producing more or equal wide instructions than we started with in some cases, so we still need to restrict/correct other transforms from overstepping.

If there are perf regressions from this change, we can either carve out exceptions to the general IR rules, or improve the backend to do these transforms when we know the transform is profitable. That's probably similar to a change I just made in D55448.

Diff Detail

Repository: rL LLVM

Event Timeline

spatel created this revision.Dec 16 2018, 9:07 AM

Herald added subscribers: kristof.beyls, javed.absar, mcrosier. · View Herald TranscriptDec 16 2018, 9:07 AM

RKSimon added inline comments.Dec 16 2018, 10:08 AM

lib/Transforms/InstCombine/InstCombineCasts.cpp
1117 ↗	(On Diff #178401)	What about putting !DestTy->isVectorTy() inside InstCombiner::shouldChangeType ?

spatel marked 2 inline comments as done.Dec 17 2018, 4:19 AM

spatel added inline comments.

lib/Transforms/InstCombine/InstCombineCasts.cpp
1117 ↗	(On Diff #178401)	Yes, that's cleaner - currently we just assert that we have scalar inside there.

Patch updated:
Move vector type check into shouldChangeType() rather than asserting we have scalars.

No more comments from me

LGTM

This revision is now accepted and ready to land.Dec 17 2018, 11:26 AM

Closed by commit rL349389: [InstCombine] don't widen an arbitrary sequence of vector ops (PR40032) (authored by spatel). · Explain WhyDec 17 2018, 12:31 PM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

llvm/

trunk/

lib/

Transforms/

InstCombine/

InstCombineCasts.cpp

15 lines

InstructionCombining.cpp

5 lines

test/

Transforms/

InstCombine/

cast.ll

19 lines

select-bitext.ll

28 lines

vector-casts.ll

30 lines

Diff 178515

llvm/trunk/lib/Transforms/InstCombine/InstCombineCasts.cpp

Show First 20 Lines • Show All 1,101 Lines • ▼ Show 20 Lines	Instruction *InstCombiner::visitZExt(ZExtInst &CI) {

// If one of the common conversion will work, do it.		// If one of the common conversion will work, do it.
if (Instruction *Result = commonCastTransforms(CI))		if (Instruction *Result = commonCastTransforms(CI))
return Result;		return Result;

Value *Src = CI.getOperand(0);		Value *Src = CI.getOperand(0);
Type SrcTy = Src->getType(), DestTy = CI.getType();		Type SrcTy = Src->getType(), DestTy = CI.getType();

// Attempt to extend the entire input expression tree to the destination		// Try to extend the entire expression tree to the wide destination type.
// type. Only do this if the dest type is a simple type, don't convert the
// expression tree to something weird like i93 unless the source is also
// strange.
unsigned BitsToClear;		unsigned BitsToClear;
if ((DestTy->isVectorTy() \|\| shouldChangeType(SrcTy, DestTy)) &&		if (shouldChangeType(SrcTy, DestTy) &&
canEvaluateZExtd(Src, DestTy, BitsToClear, *this, &CI)) {		canEvaluateZExtd(Src, DestTy, BitsToClear, *this, &CI)) {
assert(BitsToClear <= SrcTy->getScalarSizeInBits() &&		assert(BitsToClear <= SrcTy->getScalarSizeInBits() &&
"Can't clear more bits than in SrcTy");		"Can't clear more bits than in SrcTy");

// Okay, we can transform this! Insert the new expression now.		// Okay, we can transform this! Insert the new expression now.
LLVM_DEBUG(		LLVM_DEBUG(
dbgs() << "ICE: EvaluateInDifferentType converting expression type"		dbgs() << "ICE: EvaluateInDifferentType converting expression type"
" to avoid zero extend: "		" to avoid zero extend: "
▲ Show 20 Lines • Show All 260 Lines • ▼ Show 20 Lines	Instruction *InstCombiner::visitSExt(SExtInst &CI) {
// If we know that the value being extended is positive, we can use a zext		// If we know that the value being extended is positive, we can use a zext
// instead.		// instead.
KnownBits Known = computeKnownBits(Src, 0, &CI);		KnownBits Known = computeKnownBits(Src, 0, &CI);
if (Known.isNonNegative()) {		if (Known.isNonNegative()) {
Value *ZExt = Builder.CreateZExt(Src, DestTy);		Value *ZExt = Builder.CreateZExt(Src, DestTy);
return replaceInstUsesWith(CI, ZExt);		return replaceInstUsesWith(CI, ZExt);
}		}

// Attempt to extend the entire input expression tree to the destination		// Try to extend the entire expression tree to the wide destination type.
// type. Only do this if the dest type is a simple type, don't convert the		if (shouldChangeType(SrcTy, DestTy) && canEvaluateSExtd(Src, DestTy)) {
// expression tree to something weird like i93 unless the source is also
// strange.
if ((DestTy->isVectorTy() \|\| shouldChangeType(SrcTy, DestTy)) &&
canEvaluateSExtd(Src, DestTy)) {
// Okay, we can transform this! Insert the new expression now.		// Okay, we can transform this! Insert the new expression now.
LLVM_DEBUG(		LLVM_DEBUG(
dbgs() << "ICE: EvaluateInDifferentType converting expression type"		dbgs() << "ICE: EvaluateInDifferentType converting expression type"
" to avoid sign extend: "		" to avoid sign extend: "
<< CI << '\n');		<< CI << '\n');
Value *Res = EvaluateInDifferentType(Src, DestTy, true);		Value *Res = EvaluateInDifferentType(Src, DestTy, true);
assert(Res->getType() == DestTy);		assert(Res->getType() == DestTy);

▲ Show 20 Lines • Show All 1,040 Lines • Show Last 20 Lines

llvm/trunk/lib/Transforms/InstCombine/InstructionCombining.cpp

	Show First 20 Lines • Show All 177 Lines • ▼ Show 20 Lines
	}			}

	/// Return true if it is desirable to convert a computation from 'From' to 'To'.			/// Return true if it is desirable to convert a computation from 'From' to 'To'.
	/// We don't want to convert from a legal to an illegal type or from a smaller			/// We don't want to convert from a legal to an illegal type or from a smaller
	/// to a larger illegal type. i1 is always treated as a legal type because it is			/// to a larger illegal type. i1 is always treated as a legal type because it is
	/// a fundamental type in IR, and there are many specialized optimizations for			/// a fundamental type in IR, and there are many specialized optimizations for
	/// i1 types.			/// i1 types.
	bool InstCombiner::shouldChangeType(Type From, Type To) const {			bool InstCombiner::shouldChangeType(Type From, Type To) const {
	assert(From->isIntegerTy() && To->isIntegerTy());			// TODO: This could be extended to allow vectors. Datalayout changes might be
				// needed to properly support that.
				if (!From->isIntegerTy() \|\| !To->isIntegerTy())
				return false;

	unsigned FromWidth = From->getPrimitiveSizeInBits();			unsigned FromWidth = From->getPrimitiveSizeInBits();
	unsigned ToWidth = To->getPrimitiveSizeInBits();			unsigned ToWidth = To->getPrimitiveSizeInBits();
	return shouldChangeType(FromWidth, ToWidth);			return shouldChangeType(FromWidth, ToWidth);
	}			}

	// Return true, if No Signed Wrap should be maintained for I.			// Return true, if No Signed Wrap should be maintained for I.
	// The No Signed Wrap flag can be kept if the operation "B (I.getOpcode) C",			// The No Signed Wrap flag can be kept if the operation "B (I.getOpcode) C",
	▲ Show 20 Lines • Show All 3,338 Lines • Show Last 20 Lines

llvm/trunk/test/Transforms/InstCombine/cast.ll

Show First 20 Lines • Show All 586 Lines • ▼ Show 20 Lines	;
%C = and i32 %B, 42		%C = and i32 %B, 42
%D = shl i32 %C, 8		%D = shl i32 %C, 8
%E = zext i32 %D to i64		%E = zext i32 %D to i64
ret i64 %E		ret i64 %E
}		}

define <2 x i64> @test46vec(<2 x i64> %A) {		define <2 x i64> @test46vec(<2 x i64> %A) {
; CHECK-LABEL: @test46vec(		; CHECK-LABEL: @test46vec(
; CHECK-NEXT: [[C:%.]] = shl <2 x i64> [[A:%.]], <i64 8, i64 8>		; CHECK-NEXT: [[B:%.]] = trunc <2 x i64> [[A:%.]] to <2 x i32>
; CHECK-NEXT: [[D:%.*]] = and <2 x i64> [[C]], <i64 10752, i64 10752>		; CHECK-NEXT: [[C:%.*]] = shl <2 x i32> [[B]], <i32 8, i32 8>
; CHECK-NEXT: ret <2 x i64> [[D]]		; CHECK-NEXT: [[D:%.*]] = and <2 x i32> [[C]], <i32 10752, i32 10752>
		; CHECK-NEXT: [[E:%.*]] = zext <2 x i32> [[D]] to <2 x i64>
		; CHECK-NEXT: ret <2 x i64> [[E]]
;		;
%B = trunc <2 x i64> %A to <2 x i32>		%B = trunc <2 x i64> %A to <2 x i32>
%C = and <2 x i32> %B, <i32 42, i32 42>		%C = and <2 x i32> %B, <i32 42, i32 42>
%D = shl <2 x i32> %C, <i32 8, i32 8>		%D = shl <2 x i32> %C, <i32 8, i32 8>
%E = zext <2 x i32> %D to <2 x i64>		%E = zext <2 x i32> %D to <2 x i64>
ret <2 x i64> %E		ret <2 x i64> %E
}		}

▲ Show 20 Lines • Show All 138 Lines • ▼ Show 20 Lines	;
%p353 = sext i16 %A to i32		%p353 = sext i16 %A to i32
%p354 = lshr i32 %p353, 5		%p354 = lshr i32 %p353, 5
%p355 = zext i32 %p354 to i64		%p355 = zext i32 %p354 to i64
ret i64 %p355		ret i64 %p355
}		}

define <2 x i64> @test56vec(<2 x i16> %A) {		define <2 x i64> @test56vec(<2 x i16> %A) {
; CHECK-LABEL: @test56vec(		; CHECK-LABEL: @test56vec(
; CHECK-NEXT: [[P353:%.]] = sext <2 x i16> [[A:%.]] to <2 x i64>		; CHECK-NEXT: [[P353:%.]] = sext <2 x i16> [[A:%.]] to <2 x i32>
; CHECK-NEXT: [[P354:%.*]] = lshr <2 x i64> [[P353]], <i64 5, i64 5>		; CHECK-NEXT: [[P354:%.*]] = lshr <2 x i32> [[P353]], <i32 5, i32 5>
; CHECK-NEXT: [[P355:%.*]] = and <2 x i64> [[P354]], <i64 134217727, i64 134217727>		; CHECK-NEXT: [[P355:%.*]] = zext <2 x i32> [[P354]] to <2 x i64>
; CHECK-NEXT: ret <2 x i64> [[P355]]		; CHECK-NEXT: ret <2 x i64> [[P355]]
;		;
%p353 = sext <2 x i16> %A to <2 x i32>		%p353 = sext <2 x i16> %A to <2 x i32>
%p354 = lshr <2 x i32> %p353, <i32 5, i32 5>		%p354 = lshr <2 x i32> %p353, <i32 5, i32 5>
%p355 = zext <2 x i32> %p354 to <2 x i64>		%p355 = zext <2 x i32> %p354 to <2 x i64>
ret <2 x i64> %p355		ret <2 x i64> %p355
}		}

define i64 @test57(i64 %A) {		define i64 @test57(i64 %A) {
; CHECK-LABEL: @test57(		; CHECK-LABEL: @test57(
; CHECK-NEXT: [[C:%.]] = lshr i64 [[A:%.]], 8		; CHECK-NEXT: [[C:%.]] = lshr i64 [[A:%.]], 8
; CHECK-NEXT: [[E:%.*]] = and i64 [[C]], 16777215		; CHECK-NEXT: [[E:%.*]] = and i64 [[C]], 16777215
; CHECK-NEXT: ret i64 [[E]]		; CHECK-NEXT: ret i64 [[E]]
;		;
%B = trunc i64 %A to i32		%B = trunc i64 %A to i32
%C = lshr i32 %B, 8		%C = lshr i32 %B, 8
%E = zext i32 %C to i64		%E = zext i32 %C to i64
ret i64 %E		ret i64 %E
}		}

define <2 x i64> @test57vec(<2 x i64> %A) {		define <2 x i64> @test57vec(<2 x i64> %A) {
; CHECK-LABEL: @test57vec(		; CHECK-LABEL: @test57vec(
; CHECK-NEXT: [[C:%.]] = lshr <2 x i64> [[A:%.]], <i64 8, i64 8>		; CHECK-NEXT: [[B:%.]] = trunc <2 x i64> [[A:%.]] to <2 x i32>
; CHECK-NEXT: [[E:%.*]] = and <2 x i64> [[C]], <i64 16777215, i64 16777215>		; CHECK-NEXT: [[C:%.*]] = lshr <2 x i32> [[B]], <i32 8, i32 8>
		; CHECK-NEXT: [[E:%.*]] = zext <2 x i32> [[C]] to <2 x i64>
; CHECK-NEXT: ret <2 x i64> [[E]]		; CHECK-NEXT: ret <2 x i64> [[E]]
;		;
%B = trunc <2 x i64> %A to <2 x i32>		%B = trunc <2 x i64> %A to <2 x i32>
%C = lshr <2 x i32> %B, <i32 8, i32 8>		%C = lshr <2 x i32> %B, <i32 8, i32 8>
%E = zext <2 x i32> %C to <2 x i64>		%E = zext <2 x i32> %C to <2 x i64>
ret <2 x i64> %E		ret <2 x i64> %E
}		}

▲ Show 20 Lines • Show All 772 Lines • Show Last 20 Lines

llvm/trunk/test/Transforms/InstCombine/select-bitext.ll

Show First 20 Lines • Show All 92 Lines • ▼ Show 20 Lines	;
%trunc = trunc i32 %a to i16		%trunc = trunc i32 %a to i16
%sel = select i1 %cmp, i16 %trunc, i16 42		%sel = select i1 %cmp, i16 %trunc, i16 42
%ext = sext i16 %sel to i64		%ext = sext i16 %sel to i64
ret i64 %ext		ret i64 %ext
}		}

define <2 x i64> @trunc_sel_larger_sext_vec(<2 x i32> %a, <2 x i1> %cmp) {		define <2 x i64> @trunc_sel_larger_sext_vec(<2 x i32> %a, <2 x i1> %cmp) {
; CHECK-LABEL: @trunc_sel_larger_sext_vec(		; CHECK-LABEL: @trunc_sel_larger_sext_vec(
; CHECK-NEXT: [[TRUNC:%.]] = zext <2 x i32> [[A:%.]] to <2 x i64>		; CHECK-NEXT: [[TRUNC:%.]] = trunc <2 x i32> [[A:%.]] to <2 x i16>
; CHECK-NEXT: [[SEXT:%.*]] = shl <2 x i64> [[TRUNC]], <i64 48, i64 48>		; CHECK-NEXT: [[TMP1:%.*]] = sext <2 x i16> [[TRUNC]] to <2 x i64>
; CHECK-NEXT: [[TMP1:%.*]] = ashr exact <2 x i64> [[SEXT]], <i64 48, i64 48>
; CHECK-NEXT: [[EXT:%.]] = select <2 x i1> [[CMP:%.]], <2 x i64> [[TMP1]], <2 x i64> <i64 42, i64 43>		; CHECK-NEXT: [[EXT:%.]] = select <2 x i1> [[CMP:%.]], <2 x i64> [[TMP1]], <2 x i64> <i64 42, i64 43>
; CHECK-NEXT: ret <2 x i64> [[EXT]]		; CHECK-NEXT: ret <2 x i64> [[EXT]]
;		;
%trunc = trunc <2 x i32> %a to <2 x i16>		%trunc = trunc <2 x i32> %a to <2 x i16>
%sel = select <2 x i1> %cmp, <2 x i16> %trunc, <2 x i16> <i16 42, i16 43>		%sel = select <2 x i1> %cmp, <2 x i16> %trunc, <2 x i16> <i16 42, i16 43>
%ext = sext <2 x i16> %sel to <2 x i64>		%ext = sext <2 x i16> %sel to <2 x i64>
ret <2 x i64> %ext		ret <2 x i64> %ext
}		}

define i32 @trunc_sel_smaller_sext(i64 %a, i1 %cmp) {		define i32 @trunc_sel_smaller_sext(i64 %a, i1 %cmp) {
; CHECK-LABEL: @trunc_sel_smaller_sext(		; CHECK-LABEL: @trunc_sel_smaller_sext(
; CHECK-NEXT: [[TRUNC:%.]] = trunc i64 [[A:%.]] to i16		; CHECK-NEXT: [[TRUNC:%.]] = trunc i64 [[A:%.]] to i16
; CHECK-NEXT: [[TMP1:%.*]] = sext i16 [[TRUNC]] to i32		; CHECK-NEXT: [[TMP1:%.*]] = sext i16 [[TRUNC]] to i32
; CHECK-NEXT: [[EXT:%.]] = select i1 [[CMP:%.]], i32 [[TMP1]], i32 42		; CHECK-NEXT: [[EXT:%.]] = select i1 [[CMP:%.]], i32 [[TMP1]], i32 42
; CHECK-NEXT: ret i32 [[EXT]]		; CHECK-NEXT: ret i32 [[EXT]]
;		;
%trunc = trunc i64 %a to i16		%trunc = trunc i64 %a to i16
%sel = select i1 %cmp, i16 %trunc, i16 42		%sel = select i1 %cmp, i16 %trunc, i16 42
%ext = sext i16 %sel to i32		%ext = sext i16 %sel to i32
ret i32 %ext		ret i32 %ext
}		}

define <2 x i32> @trunc_sel_smaller_sext_vec(<2 x i64> %a, <2 x i1> %cmp) {		define <2 x i32> @trunc_sel_smaller_sext_vec(<2 x i64> %a, <2 x i1> %cmp) {
; CHECK-LABEL: @trunc_sel_smaller_sext_vec(		; CHECK-LABEL: @trunc_sel_smaller_sext_vec(
; CHECK-NEXT: [[TRUNC:%.]] = trunc <2 x i64> [[A:%.]] to <2 x i32>		; CHECK-NEXT: [[TRUNC:%.]] = trunc <2 x i64> [[A:%.]] to <2 x i16>
; CHECK-NEXT: [[SEXT:%.*]] = shl <2 x i32> [[TRUNC]], <i32 16, i32 16>		; CHECK-NEXT: [[TMP1:%.*]] = sext <2 x i16> [[TRUNC]] to <2 x i32>
; CHECK-NEXT: [[TMP1:%.*]] = ashr exact <2 x i32> [[SEXT]], <i32 16, i32 16>
; CHECK-NEXT: [[EXT:%.]] = select <2 x i1> [[CMP:%.]], <2 x i32> [[TMP1]], <2 x i32> <i32 42, i32 43>		; CHECK-NEXT: [[EXT:%.]] = select <2 x i1> [[CMP:%.]], <2 x i32> [[TMP1]], <2 x i32> <i32 42, i32 43>
; CHECK-NEXT: ret <2 x i32> [[EXT]]		; CHECK-NEXT: ret <2 x i32> [[EXT]]
;		;
%trunc = trunc <2 x i64> %a to <2 x i16>		%trunc = trunc <2 x i64> %a to <2 x i16>
%sel = select <2 x i1> %cmp, <2 x i16> %trunc, <2 x i16> <i16 42, i16 43>		%sel = select <2 x i1> %cmp, <2 x i16> %trunc, <2 x i16> <i16 42, i16 43>
%ext = sext <2 x i16> %sel to <2 x i32>		%ext = sext <2 x i16> %sel to <2 x i32>
ret <2 x i32> %ext		ret <2 x i32> %ext
}		}

define i32 @trunc_sel_equal_sext(i32 %a, i1 %cmp) {		define i32 @trunc_sel_equal_sext(i32 %a, i1 %cmp) {
; CHECK-LABEL: @trunc_sel_equal_sext(		; CHECK-LABEL: @trunc_sel_equal_sext(
; CHECK-NEXT: [[TMP1:%.]] = shl i32 [[A:%.]], 16		; CHECK-NEXT: [[TMP1:%.]] = shl i32 [[A:%.]], 16
; CHECK-NEXT: [[TMP2:%.*]] = ashr exact i32 [[TMP1]], 16		; CHECK-NEXT: [[TMP2:%.*]] = ashr exact i32 [[TMP1]], 16
; CHECK-NEXT: [[EXT:%.]] = select i1 [[CMP:%.]], i32 [[TMP2]], i32 42		; CHECK-NEXT: [[EXT:%.]] = select i1 [[CMP:%.]], i32 [[TMP2]], i32 42
; CHECK-NEXT: ret i32 [[EXT]]		; CHECK-NEXT: ret i32 [[EXT]]
;		;
%trunc = trunc i32 %a to i16		%trunc = trunc i32 %a to i16
%sel = select i1 %cmp, i16 %trunc, i16 42		%sel = select i1 %cmp, i16 %trunc, i16 42
%ext = sext i16 %sel to i32		%ext = sext i16 %sel to i32
ret i32 %ext		ret i32 %ext
}		}

define <2 x i32> @trunc_sel_equal_sext_vec(<2 x i32> %a, <2 x i1> %cmp) {		define <2 x i32> @trunc_sel_equal_sext_vec(<2 x i32> %a, <2 x i1> %cmp) {
; CHECK-LABEL: @trunc_sel_equal_sext_vec(		; CHECK-LABEL: @trunc_sel_equal_sext_vec(
; CHECK-NEXT: [[SEXT:%.]] = shl <2 x i32> [[A:%.]], <i32 16, i32 16>		; CHECK-NEXT: [[TMP1:%.]] = shl <2 x i32> [[A:%.]], <i32 16, i32 16>
; CHECK-NEXT: [[TMP1:%.*]] = ashr exact <2 x i32> [[SEXT]], <i32 16, i32 16>		; CHECK-NEXT: [[TMP2:%.*]] = ashr exact <2 x i32> [[TMP1]], <i32 16, i32 16>
; CHECK-NEXT: [[EXT:%.]] = select <2 x i1> [[CMP:%.]], <2 x i32> [[TMP1]], <2 x i32> <i32 42, i32 43>		; CHECK-NEXT: [[EXT:%.]] = select <2 x i1> [[CMP:%.]], <2 x i32> [[TMP2]], <2 x i32> <i32 42, i32 43>
; CHECK-NEXT: ret <2 x i32> [[EXT]]		; CHECK-NEXT: ret <2 x i32> [[EXT]]
;		;
%trunc = trunc <2 x i32> %a to <2 x i16>		%trunc = trunc <2 x i32> %a to <2 x i16>
%sel = select <2 x i1> %cmp, <2 x i16> %trunc, <2 x i16> <i16 42, i16 43>		%sel = select <2 x i1> %cmp, <2 x i16> %trunc, <2 x i16> <i16 42, i16 43>
%ext = sext <2 x i16> %sel to <2 x i32>		%ext = sext <2 x i16> %sel to <2 x i32>
ret <2 x i32> %ext		ret <2 x i32> %ext
}		}

define i64 @trunc_sel_larger_zext(i32 %a, i1 %cmp) {		define i64 @trunc_sel_larger_zext(i32 %a, i1 %cmp) {
; CHECK-LABEL: @trunc_sel_larger_zext(		; CHECK-LABEL: @trunc_sel_larger_zext(
; CHECK-NEXT: [[TRUNC_MASK:%.]] = and i32 [[A:%.]], 65535		; CHECK-NEXT: [[TRUNC_MASK:%.]] = and i32 [[A:%.]], 65535
; CHECK-NEXT: [[TMP1:%.*]] = zext i32 [[TRUNC_MASK]] to i64		; CHECK-NEXT: [[TMP1:%.*]] = zext i32 [[TRUNC_MASK]] to i64
; CHECK-NEXT: [[EXT:%.]] = select i1 [[CMP:%.]], i64 [[TMP1]], i64 42		; CHECK-NEXT: [[EXT:%.]] = select i1 [[CMP:%.]], i64 [[TMP1]], i64 42
; CHECK-NEXT: ret i64 [[EXT]]		; CHECK-NEXT: ret i64 [[EXT]]
;		;
%trunc = trunc i32 %a to i16		%trunc = trunc i32 %a to i16
%sel = select i1 %cmp, i16 %trunc, i16 42		%sel = select i1 %cmp, i16 %trunc, i16 42
%ext = zext i16 %sel to i64		%ext = zext i16 %sel to i64
ret i64 %ext		ret i64 %ext
}		}

define <2 x i64> @trunc_sel_larger_zext_vec(<2 x i32> %a, <2 x i1> %cmp) {		define <2 x i64> @trunc_sel_larger_zext_vec(<2 x i32> %a, <2 x i1> %cmp) {
; CHECK-LABEL: @trunc_sel_larger_zext_vec(		; CHECK-LABEL: @trunc_sel_larger_zext_vec(
; CHECK-NEXT: [[TMP1:%.]] = and <2 x i32> [[A:%.]], <i32 65535, i32 65535>		; CHECK-NEXT: [[TRUNC_MASK:%.]] = and <2 x i32> [[A:%.]], <i32 65535, i32 65535>
; CHECK-NEXT: [[TMP2:%.*]] = zext <2 x i32> [[TMP1]] to <2 x i64>		; CHECK-NEXT: [[TMP1:%.*]] = zext <2 x i32> [[TRUNC_MASK]] to <2 x i64>
; CHECK-NEXT: [[EXT:%.]] = select <2 x i1> [[CMP:%.]], <2 x i64> [[TMP2]], <2 x i64> <i64 42, i64 43>		; CHECK-NEXT: [[EXT:%.]] = select <2 x i1> [[CMP:%.]], <2 x i64> [[TMP1]], <2 x i64> <i64 42, i64 43>
; CHECK-NEXT: ret <2 x i64> [[EXT]]		; CHECK-NEXT: ret <2 x i64> [[EXT]]
;		;
%trunc = trunc <2 x i32> %a to <2 x i16>		%trunc = trunc <2 x i32> %a to <2 x i16>
%sel = select <2 x i1> %cmp, <2 x i16> %trunc, <2 x i16> <i16 42, i16 43>		%sel = select <2 x i1> %cmp, <2 x i16> %trunc, <2 x i16> <i16 42, i16 43>
%ext = zext <2 x i16> %sel to <2 x i64>		%ext = zext <2 x i16> %sel to <2 x i64>
ret <2 x i64> %ext		ret <2 x i64> %ext
}		}

define i32 @trunc_sel_smaller_zext(i64 %a, i1 %cmp) {		define i32 @trunc_sel_smaller_zext(i64 %a, i1 %cmp) {
; CHECK-LABEL: @trunc_sel_smaller_zext(		; CHECK-LABEL: @trunc_sel_smaller_zext(
; CHECK-NEXT: [[TMP1:%.]] = trunc i64 [[A:%.]] to i32		; CHECK-NEXT: [[TMP1:%.]] = trunc i64 [[A:%.]] to i32
; CHECK-NEXT: [[TMP2:%.*]] = and i32 [[TMP1]], 65535		; CHECK-NEXT: [[TMP2:%.*]] = and i32 [[TMP1]], 65535
; CHECK-NEXT: [[EXT:%.]] = select i1 [[CMP:%.]], i32 [[TMP2]], i32 42		; CHECK-NEXT: [[EXT:%.]] = select i1 [[CMP:%.]], i32 [[TMP2]], i32 42
; CHECK-NEXT: ret i32 [[EXT]]		; CHECK-NEXT: ret i32 [[EXT]]
;		;
%trunc = trunc i64 %a to i16		%trunc = trunc i64 %a to i16
%sel = select i1 %cmp, i16 %trunc, i16 42		%sel = select i1 %cmp, i16 %trunc, i16 42
%ext = zext i16 %sel to i32		%ext = zext i16 %sel to i32
ret i32 %ext		ret i32 %ext
}		}

define <2 x i32> @trunc_sel_smaller_zext_vec(<2 x i64> %a, <2 x i1> %cmp) {		define <2 x i32> @trunc_sel_smaller_zext_vec(<2 x i64> %a, <2 x i1> %cmp) {
; CHECK-LABEL: @trunc_sel_smaller_zext_vec(		; CHECK-LABEL: @trunc_sel_smaller_zext_vec(
; CHECK-NEXT: [[TRUNC:%.]] = trunc <2 x i64> [[A:%.]] to <2 x i32>		; CHECK-NEXT: [[TMP1:%.]] = trunc <2 x i64> [[A:%.]] to <2 x i32>
; CHECK-NEXT: [[TMP1:%.*]] = and <2 x i32> [[TRUNC]], <i32 65535, i32 65535>		; CHECK-NEXT: [[TMP2:%.*]] = and <2 x i32> [[TMP1]], <i32 65535, i32 65535>
; CHECK-NEXT: [[EXT:%.]] = select <2 x i1> [[CMP:%.]], <2 x i32> [[TMP1]], <2 x i32> <i32 42, i32 43>		; CHECK-NEXT: [[EXT:%.]] = select <2 x i1> [[CMP:%.]], <2 x i32> [[TMP2]], <2 x i32> <i32 42, i32 43>
; CHECK-NEXT: ret <2 x i32> [[EXT]]		; CHECK-NEXT: ret <2 x i32> [[EXT]]
;		;
%trunc = trunc <2 x i64> %a to <2 x i16>		%trunc = trunc <2 x i64> %a to <2 x i16>
%sel = select <2 x i1> %cmp, <2 x i16> %trunc, <2 x i16> <i16 42, i16 43>		%sel = select <2 x i1> %cmp, <2 x i16> %trunc, <2 x i16> <i16 42, i16 43>
%ext = zext <2 x i16> %sel to <2 x i32>		%ext = zext <2 x i16> %sel to <2 x i32>
ret <2 x i32> %ext		ret <2 x i32> %ext
}		}

▲ Show 20 Lines • Show All 404 Lines • Show Last 20 Lines

llvm/trunk/test/Transforms/InstCombine/vector-casts.ll

Show First 20 Lines • Show All 157 Lines • ▼ Show 20 Lines	;
%val = trunc <2 x i64> %src to <2 x i32>		%val = trunc <2 x i64> %src to <2 x i32>
%add = add <2 x i32> %val, <i32 1, i32 1>		%add = add <2 x i32> %val, <i32 1, i32 1>
store <2 x i32> %add, <2 x i32>* %dst.addr		store <2 x i32> %add, <2 x i32>* %dst.addr
ret void		ret void
}		}

define <2 x i65> @foo(<2 x i64> %t) {		define <2 x i65> @foo(<2 x i64> %t) {
; CHECK-LABEL: @foo(		; CHECK-LABEL: @foo(
; CHECK-NEXT: [[TMP1:%.]] = and <2 x i64> [[T:%.]], <i64 4294967295, i64 4294967295>		; CHECK-NEXT: [[A_MASK:%.]] = and <2 x i64> [[T:%.]], <i64 4294967295, i64 4294967295>
; CHECK-NEXT: [[B:%.*]] = zext <2 x i64> [[TMP1]] to <2 x i65>		; CHECK-NEXT: [[B:%.*]] = zext <2 x i64> [[A_MASK]] to <2 x i65>
; CHECK-NEXT: ret <2 x i65> [[B]]		; CHECK-NEXT: ret <2 x i65> [[B]]
;		;
%a = trunc <2 x i64> %t to <2 x i32>		%a = trunc <2 x i64> %t to <2 x i32>
%b = zext <2 x i32> %a to <2 x i65>		%b = zext <2 x i32> %a to <2 x i65>
ret <2 x i65> %b		ret <2 x i65> %b
}		}

define <2 x i64> @bar(<2 x i65> %t) {		define <2 x i64> @bar(<2 x i65> %t) {
; CHECK-LABEL: @bar(		; CHECK-LABEL: @bar(
; CHECK-NEXT: [[A:%.]] = trunc <2 x i65> [[T:%.]] to <2 x i64>		; CHECK-NEXT: [[TMP1:%.]] = trunc <2 x i65> [[T:%.]] to <2 x i64>
; CHECK-NEXT: [[B:%.*]] = and <2 x i64> [[A]], <i64 4294967295, i64 4294967295>		; CHECK-NEXT: [[B:%.*]] = and <2 x i64> [[TMP1]], <i64 4294967295, i64 4294967295>
; CHECK-NEXT: ret <2 x i64> [[B]]		; CHECK-NEXT: ret <2 x i64> [[B]]
;		;
%a = trunc <2 x i65> %t to <2 x i32>		%a = trunc <2 x i65> %t to <2 x i32>
%b = zext <2 x i32> %a to <2 x i64>		%b = zext <2 x i32> %a to <2 x i64>
ret <2 x i64> %b		ret <2 x i64> %b
}		}

define <2 x i64> @bars(<2 x i65> %t) {		define <2 x i64> @bars(<2 x i65> %t) {
; CHECK-LABEL: @bars(		; CHECK-LABEL: @bars(
; CHECK-NEXT: [[A:%.]] = trunc <2 x i65> [[T:%.]] to <2 x i64>		; CHECK-NEXT: [[A:%.]] = trunc <2 x i65> [[T:%.]] to <2 x i32>
; CHECK-NEXT: [[SEXT:%.*]] = shl <2 x i64> [[A]], <i64 32, i64 32>		; CHECK-NEXT: [[B:%.*]] = sext <2 x i32> [[A]] to <2 x i64>
; CHECK-NEXT: [[B:%.*]] = ashr exact <2 x i64> [[SEXT]], <i64 32, i64 32>
; CHECK-NEXT: ret <2 x i64> [[B]]		; CHECK-NEXT: ret <2 x i64> [[B]]
;		;
%a = trunc <2 x i65> %t to <2 x i32>		%a = trunc <2 x i65> %t to <2 x i32>
%b = sext <2 x i32> %a to <2 x i64>		%b = sext <2 x i32> %a to <2 x i64>
ret <2 x i64> %b		ret <2 x i64> %b
}		}

define <2 x i64> @quxs(<2 x i64> %t) {		define <2 x i64> @quxs(<2 x i64> %t) {
; CHECK-LABEL: @quxs(		; CHECK-LABEL: @quxs(
; CHECK-NEXT: [[SEXT:%.]] = shl <2 x i64> [[T:%.]], <i64 32, i64 32>		; CHECK-NEXT: [[TMP1:%.]] = shl <2 x i64> [[T:%.]], <i64 32, i64 32>
; CHECK-NEXT: [[B:%.*]] = ashr exact <2 x i64> [[SEXT]], <i64 32, i64 32>		; CHECK-NEXT: [[B:%.*]] = ashr exact <2 x i64> [[TMP1]], <i64 32, i64 32>
; CHECK-NEXT: ret <2 x i64> [[B]]		; CHECK-NEXT: ret <2 x i64> [[B]]
;		;
%a = trunc <2 x i64> %t to <2 x i32>		%a = trunc <2 x i64> %t to <2 x i32>
%b = sext <2 x i32> %a to <2 x i64>		%b = sext <2 x i32> %a to <2 x i64>
ret <2 x i64> %b		ret <2 x i64> %b
}		}

define <2 x i64> @quxt(<2 x i64> %t) {		define <2 x i64> @quxt(<2 x i64> %t) {
▲ Show 20 Lines • Show All 169 Lines • ▼ Show 20 Lines

; Converting to a wide type might reduce instruction count,		; Converting to a wide type might reduce instruction count,
; but we can not do that unless the backend can recover from		; but we can not do that unless the backend can recover from
; the creation of a potentially illegal op (like a 64-bit vmul).		; the creation of a potentially illegal op (like a 64-bit vmul).
; PR40032 - https://bugs.llvm.org/show_bug.cgi?id=40032		; PR40032 - https://bugs.llvm.org/show_bug.cgi?id=40032

define <2 x i64> @sext_less_casting_with_wideop(<2 x i64> %x, <2 x i64> %y) {		define <2 x i64> @sext_less_casting_with_wideop(<2 x i64> %x, <2 x i64> %y) {
; CHECK-LABEL: @sext_less_casting_with_wideop(		; CHECK-LABEL: @sext_less_casting_with_wideop(
; CHECK-NEXT: [[MUL:%.]] = mul <2 x i64> [[X:%.]], [[Y:%.*]]		; CHECK-NEXT: [[XNARROW:%.]] = trunc <2 x i64> [[X:%.]] to <2 x i32>
; CHECK-NEXT: [[SEXT:%.*]] = shl <2 x i64> [[MUL]], <i64 32, i64 32>		; CHECK-NEXT: [[YNARROW:%.]] = trunc <2 x i64> [[Y:%.]] to <2 x i32>
; CHECK-NEXT: [[R:%.*]] = ashr exact <2 x i64> [[SEXT]], <i64 32, i64 32>		; CHECK-NEXT: [[MUL:%.*]] = mul <2 x i32> [[XNARROW]], [[YNARROW]]
		; CHECK-NEXT: [[R:%.*]] = sext <2 x i32> [[MUL]] to <2 x i64>
; CHECK-NEXT: ret <2 x i64> [[R]]		; CHECK-NEXT: ret <2 x i64> [[R]]
;		;
%xnarrow = trunc <2 x i64> %x to <2 x i32>		%xnarrow = trunc <2 x i64> %x to <2 x i32>
%ynarrow = trunc <2 x i64> %y to <2 x i32>		%ynarrow = trunc <2 x i64> %y to <2 x i32>
%mul = mul <2 x i32> %xnarrow, %ynarrow		%mul = mul <2 x i32> %xnarrow, %ynarrow
%r = sext <2 x i32> %mul to <2 x i64>		%r = sext <2 x i32> %mul to <2 x i64>
ret <2 x i64> %r		ret <2 x i64> %r
}		}

define <2 x i64> @zext_less_casting_with_wideop(<2 x i64> %x, <2 x i64> %y) {		define <2 x i64> @zext_less_casting_with_wideop(<2 x i64> %x, <2 x i64> %y) {
; CHECK-LABEL: @zext_less_casting_with_wideop(		; CHECK-LABEL: @zext_less_casting_with_wideop(
; CHECK-NEXT: [[MUL:%.]] = mul <2 x i64> [[X:%.]], [[Y:%.*]]		; CHECK-NEXT: [[XNARROW:%.]] = trunc <2 x i64> [[X:%.]] to <2 x i32>
; CHECK-NEXT: [[R:%.*]] = and <2 x i64> [[MUL]], <i64 4294967295, i64 4294967295>		; CHECK-NEXT: [[YNARROW:%.]] = trunc <2 x i64> [[Y:%.]] to <2 x i32>
		; CHECK-NEXT: [[MUL:%.*]] = mul <2 x i32> [[XNARROW]], [[YNARROW]]
		; CHECK-NEXT: [[R:%.*]] = zext <2 x i32> [[MUL]] to <2 x i64>
; CHECK-NEXT: ret <2 x i64> [[R]]		; CHECK-NEXT: ret <2 x i64> [[R]]
;		;
%xnarrow = trunc <2 x i64> %x to <2 x i32>		%xnarrow = trunc <2 x i64> %x to <2 x i32>
%ynarrow = trunc <2 x i64> %y to <2 x i32>		%ynarrow = trunc <2 x i64> %y to <2 x i32>
%mul = mul <2 x i32> %xnarrow, %ynarrow		%mul = mul <2 x i32> %xnarrow, %ynarrow
%r = zext <2 x i32> %mul to <2 x i64>		%r = zext <2 x i32> %mul to <2 x i64>
ret <2 x i64> %r		ret <2 x i64> %r
}		}