
Details

Reviewers

spatel
RKSimon
craig.topper
kpn
efriedma

Commits

rGb1b7fb6f20b0: [InstCombine] trunc (fptoui|fptosi)

Summary

Attempt to fold the trunc into the fp-to-int conversion.

https://alive2.llvm.org/ce/z/8RCNou

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

samparker created this revision. Jan 19 2023, 2:03 AM

Herald added a project: Restricted Project. Jan 19 2023, 2:03 AM

Herald added subscribers: StephenFan, hiraditya.

samparker requested review of this revision. Jan 19 2023, 2:03 AM

Herald added a project: Restricted Project. Jan 19 2023, 2:03 AM

Now only checking for poison/undef for the signed case.

samparker added inline comments. Jan 19 2023, 3:33 AM

llvm/lib/Transforms/InstCombine/InstCombineCasts.cpp
506	And I'm still not sure whether this is needed? Alive seems to want it to be happy, but I'm pretty sure integer transforms are performed elsewhere without considering fp-to-int conversions as inputs.

Harbormaster completed remote builds in B208704: Diff 490438. Jan 19 2023, 4:07 AM

I realize it's unlikely in practice, but is there a reason not to support any FP type? For fptoui, the integer type just needs one more bit than the max exponent for a given FP semantic?

For the tests, it should be sufficient to have the intermediate integer width be one more than the minimum required type width, so "%i = fptoui half %x to i17".

Please pre-commit baseline tests (either locally or push to main) and label tests that should not change as negative tests (either in the function name or with a code comment).

it should be sufficient to have the intermediate integer width be one more than the minimum required type width, so "%i = fptoui half %x to i17".

IIUC, for half fptoui we don't need an i17, as an i16 can hold the max normal value (65504). I can add support for float conversions, though. As this logic only triggers for simple types, I assume the only conversion that will work is float -> i128 -> i64.

I would really appreciate it if someone could help me understand the complication with fptosi w.r.t. checking for poison/undef too.

nikic added a subscriber: nikic. Jan 20 2023, 3:31 AM

nikic added inline comments.

llvm/lib/Transforms/InstCombine/InstCombineCasts.cpp
506	Can you share the problematic proof? It shouldn't be needed.

samparker added inline comments. Jan 20 2023, 4:14 AM

llvm/lib/Transforms/InstCombine/InstCombineCasts.cpp
506	It doesn't make sense to me, but I'm hopeless with FP, so just alive... https://alive2.llvm.org/ce/z/fr5kdx. It will only compile with `--disable-undef-input` or a noundef operand.

In D142093#4068247, @samparker wrote:

it should be sufficient to have the intermediate integer width be one more than the minimum required type width, so "%i = fptoui half %x to i17".

IIUC, for half fptoui we don't need an i17, as an i16 can hold the max normal value (65504). I can add support for float conversions, though. As this logic only triggers for simple types, I assume the only conversion that will work is float -> i128 -> i64.

i16 is the smallest final type for fptoui; i17 is one bit bigger because we need to truncate at least one bit. We don't really want a "simple type" limit here in IR either unless there's some codegen concern. I'd add tests with float and bfloat. There's a current discussion about adding various other small format FP types to IR, so we should try to future-proof this transform for those types in case they make it into IR.

I would really appreciate it if someone could help me understand the complication with fptosi w.r.t. checking for poison/undef too.

There is no undef problem - I think it's just that the online instance times out with larger widths:
https://alive2.llvm.org/ce/z/6EZXLQ

Okay, great. Thanks for the clarification on both fronts. I'm just about to commit some tests.

If you want to add some more, the initial set is in: https://github.com/llvm/llvm-project/commit/714286f9e641209411609deaf80dd865aa2198c5

Removed non-poison input restriction for fptosi.

Harbormaster completed remote builds in B208986: Diff 490840. Jan 20 2023, 8:31 AM

efriedma added inline comments. Jan 20 2023, 11:00 AM

llvm/test/Transforms/InstCombine/trunc-fp-to-int.ll
249	From alive2:

    define i33 @src(float noundef %x) {
    %0:
      %conv = fptosi float noundef %x to i64
      %conv.1 = trunc i64 %conv to i33
      ret i33 %conv.1
    }
    =>
    define i33 @tgt(float noundef %x) {
    %0:
      %conv = fptosi float noundef %x to i33
      ret i33 %conv
    }
    Transformation doesn't verify!
    ERROR: Target is more poisonous than source

The final result type must be able to hold the largest finite number representable in the floating-point type; otherwise, the transform isn't legal. For float, that's an i128 or i129, I think? AArch64 does have an instruction FJCVTZS you could theoretically use for this kind of thing, but that seems unlikely to be worthwhile.
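To make the bit-count argument concrete, here is a minimal standalone sketch in plain C++ (not LLVM code). The helper names `pow2`, `fitsUnsigned`, `fitsSigned`, and `checkFloatExamples` are hypothetical and exist only for illustration. It checks whether the largest finite float magnitude fits in an N-bit integer, which suggests why the i33 result above is rejected while i128 (unsigned) or i129 (signed) would be wide enough.

```cpp
#include <cassert>
#include <limits>

// Hypothetical helper (not an LLVM API): 2^n as a double. Exact for the
// small exponents used below; it would overflow to infinity for n >= 1024.
constexpr double pow2(int n) {
  double r = 1.0;
  for (int i = 0; i < n; ++i)
    r *= 2.0;
  return r;
}

// An unsigned iN holds [0, 2^N - 1]. Conversion truncates toward zero, so
// the largest magnitude fits exactly when it is below 2^N.
constexpr bool fitsUnsigned(double maxMagnitude, int bits) {
  return maxMagnitude < pow2(bits);
}

// A signed iN holds [-2^(N-1), 2^(N-1) - 1], so one bit goes to the sign.
constexpr bool fitsSigned(double maxMagnitude, int bits) {
  return maxMagnitude < pow2(bits - 1);
}

// Spot checks for float, whose largest finite value lies in [2^127, 2^128).
inline void checkFloatExamples() {
  const double floatMax = std::numeric_limits<float>::max();
  assert(!fitsSigned(floatMax, 33));    // the rejected i33 case
  assert(!fitsSigned(floatMax, 128));   // still one bit short when signed
  assert(fitsSigned(floatMax, 129));    // wide enough for fptosi
  assert(fitsUnsigned(floatMax, 128));  // wide enough for fptoui
  assert(!fitsUnsigned(floatMax, 127)); // one bit short when unsigned
}
```

Under these assumptions the answer to "i128 or i129" depends on signedness: 128 bits for the unsigned conversion and 129 for the signed one.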

spatel mentioned this in rGcb29ba9c0f87: [InstCombine] adjust tests for fptoui + trunc; NFC. Jan 20 2023, 11:25 AM

I updated the test file; see if this covers everything for fptoui:
cb29ba9c0f87
(if yes, then we duplicate each test for fptoui with one extra bit for the integer types)

llvm/lib/Transforms/InstCombine/InstCombineCasts.cpp
503–506	This isn't correct - we want to use semanticsMaxExponent() or something like that to determine the minimum bitwidth. Signed cast needs one extra bit to not truncate the sign bit in integer form.
llvm/test/Transforms/InstCombine/trunc-fp-to-int.ll
116–117	This and the following tests are miscompiles (noundef is used here only to prevent the timeout): https://alive2.llvm.org/ce/z/UCo_Py
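The rule described in this comment (derive the minimum bitwidth from the maximum exponent, plus one extra bit for the signed cast) can be sketched outside LLVM as follows. This is an illustration under stated assumptions, not the patch itself: `canNarrowFPToInt` is a hypothetical name, and the maximum exponent is passed in as a plain integer (15 for half, 127 for bfloat and float, 1023 for double) instead of being queried from the FP semantics.

```cpp
#include <cassert>

// Hypothetical stand-in for the legality check discussed in this review.
// maxExponent is the largest unbiased exponent of a finite value in the
// source FP format; resultBits is the width of the narrowed integer type.
constexpr bool canNarrowFPToInt(unsigned maxExponent, bool isSigned,
                                unsigned resultBits) {
  unsigned minBitWidth = maxExponent;
  // A signed result needs one more bit so the sign bit is not truncated.
  if (isSigned)
    ++minBitWidth;
  return resultBits > minBitWidth;
}

// The expectations below mirror the positive and negative tests in
// trunc-fp-to-int.ll.
static_assert(canNarrowFPToInt(15, false, 16), "half fptoui -> i16");
static_assert(!canNarrowFPToInt(15, false, 15), "half fptoui -> i15");
static_assert(canNarrowFPToInt(15, true, 17), "half fptosi -> i17");
static_assert(!canNarrowFPToInt(15, true, 16), "half fptosi -> i16");
static_assert(canNarrowFPToInt(127, false, 128), "float fptoui -> i128");
static_assert(!canNarrowFPToInt(127, false, 127), "float fptoui -> i127");
static_assert(canNarrowFPToInt(127, true, 129), "float fptosi -> i129");
static_assert(!canNarrowFPToInt(127, true, 128), "float fptosi -> i128");
static_assert(canNarrowFPToInt(1023, false, 1024), "double fptoui -> i1024");
static_assert(canNarrowFPToInt(1023, true, 1025), "double fptosi -> i1025");
```

Keeping the predicate purely in terms of the exponent means it does not depend on a fixed list of FP types, which is in line with the request to future-proof the transform for other small FP formats.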

Using semanticsMaxExponent, so hopefully correct now...

A scalar trunc, to a non-simple type, is still only explored if the input is also a non-simple type but I presume that would be better changed, if at all, in a separate patch.

Avoiding integer comparison warning.

In D142093#4073138, @samparker wrote:

Using semanticsMaxExponent, so hopefully correct now...

A scalar trunc, to a non-simple type, is still only explored if the input is also a non-simple type but I presume that would be better changed, if at all, in a separate patch.

Yes, presumably that's a rarer possibility (and covered by the "wider final type" tests), but we'd need to ease the type check leading into canEvaluateTruncated().

llvm/lib/Transforms/InstCombine/InstCombineCasts.cpp
502–503	This looks correct, but it's non-obvious, so it could use some explanatory code comments. Also, the code as written had a signed compare warning. I'd rewrite it as something like:

    // If the integer type can hold the max FP value, it is safe to cast
    // directly to that type. Otherwise, we may create poison via overflow
    // that did not exist in the original code.
    //
    // The max FP value is pow(2, MaxExponent) * (1 + MaxFraction), so we need
    // at least one more bit than the MaxExponent to hold the max FP value.
    Type *InputTy = I->getOperand(0)->getType()->getScalarType();
    unsigned MinBitWidth =
        APFloat::semanticsMaxExponent(InputTy->getFltSemantics());
    // We need one more bit to preserve the signbit through truncation.
    if (I->getOpcode() == Instruction::FPToSI)
      ++MinBitWidth;
    return Ty->getScalarSizeInBits() > MinBitWidth;

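The bound stated in the suggested code comment can be spelled out as a short derivation. The symbols are notation introduced here, not taken from the patch: E is MaxExponent, p is the number of stored fraction bits (so 1 + MaxFraction = 2 - 2^{-p}), and N is the width of the result integer type.

```latex
\begin{align*}
x_{\max} &= 2^{E}\,\bigl(2 - 2^{-p}\bigr) = 2^{E+1} - 2^{E-p},
  \qquad 2^{E} \le x_{\max} < 2^{E+1}, \\
\text{fptoui:}\quad & \lfloor x_{\max} \rfloor \le 2^{N} - 1
  \iff N \ge E + 1 \iff N > E, \\
\text{fptosi:}\quad & \lfloor x_{\max} \rfloor \le 2^{N-1} - 1
  \iff N \ge E + 2 \iff N > E + 1.
\end{align*}
```

For half (E = 15, p = 10) this gives 2^16 - 2^5 = 65504, the value quoted in the tests, so i16 is the narrowest legal fptoui result and i17 the narrowest legal fptosi result.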
llvm/test/Transforms/InstCombine/trunc-fp-to-int.ll
43–44	Put a TODO comment on this since we don't do it yet.
134–135	Oops - yes, I typo'd the test names for doubles.
153	These look good - please pre-commit the tests with baseline results in a preliminary NFC patch. We should add one more test with an extra use like this:

    declare void @use(i129)
    define i128 @float_fptoui_i129_i128_use(float %x) {
      %i = fptoui float %x to i129
      call void @use(i129 %i)
      %r = trunc i129 %i to i128
      ret i128 %r
    }

We won't transform that currently, but we could allow that.

Rebased and added comment.

Harbormaster completed remote builds in B209354: Diff 491340. Jan 23 2023, 7:13 AM

LGTM

llvm/lib/Transforms/InstCombine/InstCombineCasts.cpp
508	Formatting nit: variable should have a capitalized name.

This revision is now accepted and ready to land. Jan 23 2023, 8:22 AM

This revision was landed with ongoing or failed builds. Jan 24 2023, 1:16 AM

Closed by commit rGb1b7fb6f20b0: [InstCombine] trunc (fptoui|fptosi) (authored by samparker).

This revision was automatically updated to reflect the committed changes.

samparker added a commit: rGb1b7fb6f20b0: [InstCombine] trunc (fptoui|fptosi).

samparker mentioned this in D141926: [WebAssembly] Add passes for GEP lowering. Feb 3 2023, 7:01 AM

Diff 491652

llvm/lib/Transforms/InstCombine/InstCombineCasts.cpp

[241 lines not shown]	case Instruction::PHI: {
for (unsigned i = 0, e = OPN->getNumIncomingValues(); i != e; ++i) {		for (unsigned i = 0, e = OPN->getNumIncomingValues(); i != e; ++i) {
Value *V =		Value *V =
EvaluateInDifferentType(OPN->getIncomingValue(i), Ty, isSigned);		EvaluateInDifferentType(OPN->getIncomingValue(i), Ty, isSigned);
NPN->addIncoming(V, OPN->getIncomingBlock(i));		NPN->addIncoming(V, OPN->getIncomingBlock(i));
}		}
Res = NPN;		Res = NPN;
break;		break;
}		}
		case Instruction::FPToUI:
		case Instruction::FPToSI:
		Res = CastInst::Create(
		static_cast<Instruction::CastOps>(Opc), I->getOperand(0), Ty);
		break;
default:		default:
// TODO: Can handle more cases here.		// TODO: Can handle more cases here.
llvm_unreachable("Unreachable!");		llvm_unreachable("Unreachable!");
}		}

Res->takeName(I);		Res->takeName(I);
return InsertNewInstWith(Res, *I);		return InsertNewInstWith(Res, *I);
}		}
[228 lines not shown]	case Instruction::PHI: {
// get into trouble with cyclic PHIs here because we only consider		// get into trouble with cyclic PHIs here because we only consider
// instructions with a single use.		// instructions with a single use.
PHINode *PN = cast<PHINode>(I);		PHINode *PN = cast<PHINode>(I);
for (Value *IncValue : PN->incoming_values())		for (Value *IncValue : PN->incoming_values())
if (!canEvaluateTruncated(IncValue, Ty, IC, CxtI))		if (!canEvaluateTruncated(IncValue, Ty, IC, CxtI))
return false;		return false;
return true;		return true;
}		}
		case Instruction::FPToUI:
		case Instruction::FPToSI: {
		// If the integer type can hold the max FP value, it is safe to cast
		// directly to that type. Otherwise, we may create poison via overflow
		// that did not exist in the original code.
		//
		// The max FP value is pow(2, MaxExponent) * (1 + MaxFraction), so we need
		// at least one more bit than the MaxExponent to hold the max FP value.
		Type *InputTy = I->getOperand(0)->getType()->getScalarType();
		const fltSemantics &Semantics = InputTy->getFltSemantics();
		uint32_t MinBitWidth = APFloatBase::semanticsMaxExponent(Semantics);
		// Extra sign bit needed.
		if (I->getOpcode() == Instruction::FPToSI)
		++MinBitWidth;
		return Ty->getScalarSizeInBits() > MinBitWidth;
		}
default:		default:
// TODO: Can handle more cases here.		// TODO: Can handle more cases here.
break;		break;
}		}

return false;		return false;
}		}

[2,407 lines not shown]

llvm/test/Transforms/InstCombine/trunc-fp-to-int.ll

; NOTE: Assertions have been autogenerated by utils/update_test_checks.py		; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
; RUN: opt -passes=instcombine -S -o - %s | FileCheck %s		; RUN: opt -passes=instcombine -S -o - %s | FileCheck %s

; Tests if an integer type can cover the entire range of an input		; Tests if an integer type can cover the entire range of an input
; FP value. If so, we can remove an intermediate cast to a smaller		; FP value. If so, we can remove an intermediate cast to a smaller
; int type (remove a truncate).		; int type (remove a truncate).

define i16 @half_fptoui_i17_i16(half %x) {		define i16 @half_fptoui_i17_i16(half %x) {
; CHECK-LABEL: @half_fptoui_i17_i16(		; CHECK-LABEL: @half_fptoui_i17_i16(
; CHECK-NEXT: [[I:%.*]] = fptoui half [[X:%.*]] to i17		; CHECK-NEXT: [[I:%.*]] = fptoui half [[X:%.*]] to i16
; CHECK-NEXT: [[R:%.*]] = trunc i17 [[I]] to i16		; CHECK-NEXT: ret i16 [[I]]
; CHECK-NEXT: ret i16 [[R]]
;		;
%i = fptoui half %x to i17		%i = fptoui half %x to i17
%r = trunc i17 %i to i16		%r = trunc i17 %i to i16
ret i16 %r		ret i16 %r
}		}

; Negative test - not enough bits to hold max half value (65504).		; Negative test - not enough bits to hold max half value (65504).

define i15 @half_fptoui_i17_i15(half %x) {		define i15 @half_fptoui_i17_i15(half %x) {
; CHECK-LABEL: @half_fptoui_i17_i15(		; CHECK-LABEL: @half_fptoui_i17_i15(
; CHECK-NEXT: [[I:%.*]] = fptoui half [[X:%.*]] to i17		; CHECK-NEXT: [[I:%.*]] = fptoui half [[X:%.*]] to i17
; CHECK-NEXT: [[R:%.*]] = trunc i17 [[I]] to i15		; CHECK-NEXT: [[R:%.*]] = trunc i17 [[I]] to i15
; CHECK-NEXT: ret i15 [[R]]		; CHECK-NEXT: ret i15 [[R]]
;		;
%i = fptoui half %x to i17		%i = fptoui half %x to i17
%r = trunc i17 %i to i15		%r = trunc i17 %i to i15
ret i15 %r		ret i15 %r
}		}

; Wider intermediate type is ok.		; Wider intermediate type is ok.

define i16 @half_fptoui_i32_i16(half %x) {		define i16 @half_fptoui_i32_i16(half %x) {
; CHECK-LABEL: @half_fptoui_i32_i16(		; CHECK-LABEL: @half_fptoui_i32_i16(
; CHECK-NEXT: [[I:%.*]] = fptoui half [[X:%.*]] to i32		; CHECK-NEXT: [[I:%.*]] = fptoui half [[X:%.*]] to i16
; CHECK-NEXT: [[R:%.*]] = trunc i32 [[I]] to i16		; CHECK-NEXT: ret i16 [[I]]
; CHECK-NEXT: ret i16 [[R]]
;		;
%i = fptoui half %x to i32		%i = fptoui half %x to i32
%r = trunc i32 %i to i16		%r = trunc i32 %i to i16
ret i16 %r		ret i16 %r
}		}

; Wider final type is ok.		; Wider final type is ok.
; TODO: Handle non-simple result type.		; TODO: Handle non-simple result type.

define i17 @half_fptoui_i32_i17(half %x) {		define i17 @half_fptoui_i32_i17(half %x) {
; CHECK-LABEL: @half_fptoui_i32_i17(		; CHECK-LABEL: @half_fptoui_i32_i17(
; CHECK-NEXT: [[I:%.*]] = fptoui half [[X:%.*]] to i32		; CHECK-NEXT: [[I:%.*]] = fptoui half [[X:%.*]] to i32
; CHECK-NEXT: [[R:%.*]] = trunc i32 [[I]] to i17		; CHECK-NEXT: [[R:%.*]] = trunc i32 [[I]] to i17
; CHECK-NEXT: ret i17 [[R]]		; CHECK-NEXT: ret i17 [[R]]
;		;
%i = fptoui half %x to i32		%i = fptoui half %x to i32
%r = trunc i32 %i to i17		%r = trunc i32 %i to i17
ret i17 %r		ret i17 %r
}		}

; Vectors work too.		; Vectors work too.

define <4 x i16> @half_fptoui_4xi32_4xi16(<4 x half> %x) {		define <4 x i16> @half_fptoui_4xi32_4xi16(<4 x half> %x) {
; CHECK-LABEL: @half_fptoui_4xi32_4xi16(		; CHECK-LABEL: @half_fptoui_4xi32_4xi16(
; CHECK-NEXT: [[I:%.*]] = fptoui <4 x half> [[X:%.*]] to <4 x i32>		; CHECK-NEXT: [[I:%.*]] = fptoui <4 x half> [[X:%.*]] to <4 x i16>
; CHECK-NEXT: [[R:%.*]] = trunc <4 x i32> [[I]] to <4 x i16>		; CHECK-NEXT: ret <4 x i16> [[I]]
; CHECK-NEXT: ret <4 x i16> [[R]]
;		;
%i = fptoui <4 x half> %x to <4 x i32>		%i = fptoui <4 x half> %x to <4 x i32>
%r = trunc <4 x i32> %i to <4 x i16>		%r = trunc <4 x i32> %i to <4 x i16>
ret <4 x i16> %r		ret <4 x i16> %r
}		}

define i128 @bfloat_fptoui_i129_i128(bfloat %x) {		define i128 @bfloat_fptoui_i129_i128(bfloat %x) {
; CHECK-LABEL: @bfloat_fptoui_i129_i128(		; CHECK-LABEL: @bfloat_fptoui_i129_i128(
; CHECK-NEXT: [[I:%.*]] = fptoui bfloat [[X:%.*]] to i129		; CHECK-NEXT: [[I:%.*]] = fptoui bfloat [[X:%.*]] to i128
; CHECK-NEXT: [[R:%.*]] = trunc i129 [[I]] to i128		; CHECK-NEXT: ret i128 [[I]]
; CHECK-NEXT: ret i128 [[R]]
;		;
%i = fptoui bfloat %x to i129		%i = fptoui bfloat %x to i129
%r = trunc i129 %i to i128		%r = trunc i129 %i to i128
ret i128 %r		ret i128 %r
}		}

; Negative test - not enough bits to hold max bfloat value (2**127 * (2 − 2**−7))		; Negative test - not enough bits to hold max bfloat value (2**127 * (2 − 2**−7))

define i127 @bfloat_fptoui_i128_i127(bfloat %x) {		define i127 @bfloat_fptoui_i128_i127(bfloat %x) {
; CHECK-LABEL: @bfloat_fptoui_i128_i127(		; CHECK-LABEL: @bfloat_fptoui_i128_i127(
; CHECK-NEXT: [[I:%.*]] = fptoui bfloat [[X:%.*]] to i128		; CHECK-NEXT: [[I:%.*]] = fptoui bfloat [[X:%.*]] to i128
; CHECK-NEXT: [[R:%.*]] = trunc i128 [[I]] to i127		; CHECK-NEXT: [[R:%.*]] = trunc i128 [[I]] to i127
; CHECK-NEXT: ret i127 [[R]]		; CHECK-NEXT: ret i127 [[R]]
;		;
%i = fptoui bfloat %x to i128		%i = fptoui bfloat %x to i128
%r = trunc i128 %i to i127		%r = trunc i128 %i to i127
ret i127 %r		ret i127 %r
}		}

define i128 @float_fptoui_i129_i128(float %x) {		define i128 @float_fptoui_i129_i128(float %x) {
; CHECK-LABEL: @float_fptoui_i129_i128(		; CHECK-LABEL: @float_fptoui_i129_i128(
; CHECK-NEXT: [[I:%.*]] = fptoui float [[X:%.*]] to i129		; CHECK-NEXT: [[I:%.*]] = fptoui float [[X:%.*]] to i128
; CHECK-NEXT: [[R:%.*]] = trunc i129 [[I]] to i128		; CHECK-NEXT: ret i128 [[I]]
; CHECK-NEXT: ret i128 [[R]]
;		;
%i = fptoui float %x to i129		%i = fptoui float %x to i129
%r = trunc i129 %i to i128		%r = trunc i129 %i to i128
ret i128 %r		ret i128 %r
}		}

; TODO: We could transform with multiple users.		; TODO: We could transform with multiple users.
declare void @use(i129)		declare void @use(i129)
define i128 @float_fptoui_i129_i128_use(float %x) {		define i128 @float_fptoui_i129_i128_use(float %x) {
; CHECK-LABEL: @float_fptoui_i129_i128_use(		; CHECK-LABEL: @float_fptoui_i129_i128_use(
; CHECK-NEXT: [[I:%.*]] = fptoui float [[X:%.*]] to i129		; CHECK-NEXT: [[I:%.*]] = fptoui float [[X:%.*]] to i129
; CHECK-NEXT: call void @use(i129 [[I]])		; CHECK-NEXT: call void @use(i129 [[I]])
; CHECK-NEXT: [[R:%.*]] = trunc i129 [[I]] to i128		; CHECK-NEXT: [[R:%.*]] = trunc i129 [[I]] to i128
; CHECK-NEXT: ret i128 [[R]]		; CHECK-NEXT: ret i128 [[R]]
;		;
%i = fptoui float %x to i129		%i = fptoui float %x to i129
call void @use(i129 %i)		call void @use(i129 %i)
%r = trunc i129 %i to i128		%r = trunc i129 %i to i128
ret i128 %r		ret i128 %r
}		}

; Negative test - not enough bits to hold max float value (2**127 * (2 − 2**−23))		; Negative test - not enough bits to hold max float value (2**127 * (2 − 2**−23))

define i127 @float_fptoui_i128_i127(float %x) {		define i127 @float_fptoui_i128_i127(float %x) {
; CHECK-LABEL: @float_fptoui_i128_i127(		; CHECK-LABEL: @float_fptoui_i128_i127(
; CHECK-NEXT: [[I:%.*]] = fptoui float [[X:%.*]] to i128		; CHECK-NEXT: [[I:%.*]] = fptoui float [[X:%.*]] to i128
; CHECK-NEXT: [[R:%.*]] = trunc i128 [[I]] to i127		; CHECK-NEXT: [[R:%.*]] = trunc i128 [[I]] to i127
; CHECK-NEXT: ret i127 [[R]]		; CHECK-NEXT: ret i127 [[R]]
;		;
%i = fptoui float %x to i128		%i = fptoui float %x to i128
%r = trunc i128 %i to i127		%r = trunc i128 %i to i127
ret i127 %r		ret i127 %r
}		}

define i1024 @double_fptoui_i1025_i1024(double %x) {		define i1024 @double_fptoui_i1025_i1024(double %x) {
; CHECK-LABEL: @double_fptoui_i1025_i1024(		; CHECK-LABEL: @double_fptoui_i1025_i1024(
; CHECK-NEXT: [[I:%.*]] = fptoui double [[X:%.*]] to i1025		; CHECK-NEXT: [[I:%.*]] = fptoui double [[X:%.*]] to i1024
; CHECK-NEXT: [[R:%.*]] = trunc i1025 [[I]] to i1024		; CHECK-NEXT: ret i1024 [[I]]
; CHECK-NEXT: ret i1024 [[R]]
;		;
%i = fptoui double %x to i1025		%i = fptoui double %x to i1025
%r = trunc i1025 %i to i1024		%r = trunc i1025 %i to i1024
ret i1024 %r		ret i1024 %r
}		}

; Negative test - not enough bits to hold max double value (2**1023 * (2 − 2**−52))		; Negative test - not enough bits to hold max double value (2**1023 * (2 − 2**−52))

define i1023 @double_fptoui_i1024_i1023(double %x) {		define i1023 @double_fptoui_i1024_i1023(double %x) {
; CHECK-LABEL: @double_fptoui_i1024_i1023(		; CHECK-LABEL: @double_fptoui_i1024_i1023(
; CHECK-NEXT: [[I:%.*]] = fptoui double [[X:%.*]] to i1024		; CHECK-NEXT: [[I:%.*]] = fptoui double [[X:%.*]] to i1024
; CHECK-NEXT: [[R:%.*]] = trunc i1024 [[I]] to i1023		; CHECK-NEXT: [[R:%.*]] = trunc i1024 [[I]] to i1023
; CHECK-NEXT: ret i1023 [[R]]		; CHECK-NEXT: ret i1023 [[R]]
;		;
%i = fptoui double %x to i1024		%i = fptoui double %x to i1024
%r = trunc i1024 %i to i1023		%r = trunc i1024 %i to i1023
ret i1023 %r		ret i1023 %r
}		}

; Negative test - not enough bits to hold min half value (-65504).		; Negative test - not enough bits to hold min half value (-65504).

define i16 @half_fptosi_i17_i16(half %x) {		define i16 @half_fptosi_i17_i16(half %x) {
; CHECK-LABEL: @half_fptosi_i17_i16(		; CHECK-LABEL: @half_fptosi_i17_i16(
; CHECK-NEXT: [[I:%.*]] = fptosi half [[X:%.*]] to i17		; CHECK-NEXT: [[I:%.*]] = fptosi half [[X:%.*]] to i17
; CHECK-NEXT: [[R:%.*]] = trunc i17 [[I]] to i16		; CHECK-NEXT: [[R:%.*]] = trunc i17 [[I]] to i16
; CHECK-NEXT: ret i16 [[R]]		; CHECK-NEXT: ret i16 [[R]]
;		;
%i = fptosi half %x to i17		%i = fptosi half %x to i17
%r = trunc i17 %i to i16		%r = trunc i17 %i to i16
ret i16 %r		ret i16 %r
}		}

define i17 @half_fptosi_i18_i17(half %x) {		define i17 @half_fptosi_i18_i17(half %x) {
; CHECK-LABEL: @half_fptosi_i18_i17(		; CHECK-LABEL: @half_fptosi_i18_i17(
; CHECK-NEXT: [[I:%.*]] = fptosi half [[X:%.*]] to i18		; CHECK-NEXT: [[I:%.*]] = fptosi half [[X:%.*]] to i17
; CHECK-NEXT: [[R:%.*]] = trunc i18 [[I]] to i17		; CHECK-NEXT: ret i17 [[I]]
; CHECK-NEXT: ret i17 [[R]]
;		;
%i = fptosi half %x to i18		%i = fptosi half %x to i18
%r = trunc i18 %i to i17		%r = trunc i18 %i to i17
ret i17 %r		ret i17 %r
}		}

; Wider intermediate type is ok.		; Wider intermediate type is ok.
; TODO: Handle non-simple result type.		; TODO: Handle non-simple result type.
[22 lines not shown]	;
%r = trunc i32 %i to i18		%r = trunc i32 %i to i18
ret i18 %r		ret i18 %r
}		}

; Vectors work too.		; Vectors work too.

define <4 x i17> @half_fptosi_4xi32_4xi17(<4 x half> %x) {		define <4 x i17> @half_fptosi_4xi32_4xi17(<4 x half> %x) {
; CHECK-LABEL: @half_fptosi_4xi32_4xi17(		; CHECK-LABEL: @half_fptosi_4xi32_4xi17(
; CHECK-NEXT: [[I:%.*]] = fptosi <4 x half> [[X:%.*]] to <4 x i32>		; CHECK-NEXT: [[I:%.*]] = fptosi <4 x half> [[X:%.*]] to <4 x i17>
; CHECK-NEXT: [[R:%.*]] = trunc <4 x i32> [[I]] to <4 x i17>		; CHECK-NEXT: ret <4 x i17> [[I]]
; CHECK-NEXT: ret <4 x i17> [[R]]
;		;
%i = fptosi <4 x half> %x to <4 x i32>		%i = fptosi <4 x half> %x to <4 x i32>
%r = trunc <4 x i32> %i to <4 x i17>		%r = trunc <4 x i32> %i to <4 x i17>
ret <4 x i17> %r		ret <4 x i17> %r
}		}

; Negative test - not enough bits to hold min float value.		; Negative test - not enough bits to hold min float value.

define i128 @bfloat_fptosi_i129_i128(bfloat %x) {		define i128 @bfloat_fptosi_i129_i128(bfloat %x) {
; CHECK-LABEL: @bfloat_fptosi_i129_i128(		; CHECK-LABEL: @bfloat_fptosi_i129_i128(
; CHECK-NEXT: [[I:%.*]] = fptosi bfloat [[X:%.*]] to i129		; CHECK-NEXT: [[I:%.*]] = fptosi bfloat [[X:%.*]] to i129
; CHECK-NEXT: [[R:%.*]] = trunc i129 [[I]] to i128		; CHECK-NEXT: [[R:%.*]] = trunc i129 [[I]] to i128
; CHECK-NEXT: ret i128 [[R]]		; CHECK-NEXT: ret i128 [[R]]
;		;
%i = fptosi bfloat %x to i129		%i = fptosi bfloat %x to i129
%r = trunc i129 %i to i128		%r = trunc i129 %i to i128
ret i128 %r		ret i128 %r
}		}

define i129 @bfloat_fptosi_i130_i129(bfloat %x) {		define i129 @bfloat_fptosi_i130_i129(bfloat %x) {
; CHECK-LABEL: @bfloat_fptosi_i130_i129(		; CHECK-LABEL: @bfloat_fptosi_i130_i129(
; CHECK-NEXT: [[I:%.*]] = fptosi bfloat [[X:%.*]] to i130		; CHECK-NEXT: [[I:%.*]] = fptosi bfloat [[X:%.*]] to i129
; CHECK-NEXT: [[R:%.*]] = trunc i130 [[I]] to i129		; CHECK-NEXT: ret i129 [[I]]
; CHECK-NEXT: ret i129 [[R]]
;		;
%i = fptosi bfloat %x to i130		%i = fptosi bfloat %x to i130
%r = trunc i130 %i to i129		%r = trunc i130 %i to i129
ret i129 %r		ret i129 %r
}		}

define i129 @float_fptosi_i130_i129(float %x) {		define i129 @float_fptosi_i130_i129(float %x) {
; CHECK-LABEL: @float_fptosi_i130_i129(		; CHECK-LABEL: @float_fptosi_i130_i129(
; CHECK-NEXT: [[I:%.*]] = fptosi float [[X:%.*]] to i130		; CHECK-NEXT: [[I:%.*]] = fptosi float [[X:%.*]] to i129
; CHECK-NEXT: [[R:%.*]] = trunc i130 [[I]] to i129		; CHECK-NEXT: ret i129 [[I]]
; CHECK-NEXT: ret i129 [[R]]
;		;
%i = fptosi float %x to i130		%i = fptosi float %x to i130
%r = trunc i130 %i to i129		%r = trunc i130 %i to i129
ret i129 %r		ret i129 %r
}		}

; Negative test - not enough bits to hold min float value.		; Negative test - not enough bits to hold min float value.

define i128 @float_fptosi_i129_i128(float %x) {		define i128 @float_fptosi_i129_i128(float %x) {
; CHECK-LABEL: @float_fptosi_i129_i128(		; CHECK-LABEL: @float_fptosi_i129_i128(
; CHECK-NEXT: [[I:%.*]] = fptosi float [[X:%.*]] to i129		; CHECK-NEXT: [[I:%.*]] = fptosi float [[X:%.*]] to i129
; CHECK-NEXT: [[R:%.*]] = trunc i129 [[I]] to i128		; CHECK-NEXT: [[R:%.*]] = trunc i129 [[I]] to i128
; CHECK-NEXT: ret i128 [[R]]		; CHECK-NEXT: ret i128 [[R]]
;		;
%i = fptosi float %x to i129		%i = fptosi float %x to i129
%r = trunc i129 %i to i128		%r = trunc i129 %i to i128
ret i128 %r		ret i128 %r
}		}

define i1025 @double_fptosi_i1026_i1025(double %x) {		define i1025 @double_fptosi_i1026_i1025(double %x) {
; CHECK-LABEL: @double_fptosi_i1026_i1025(		; CHECK-LABEL: @double_fptosi_i1026_i1025(
; CHECK-NEXT: [[I:%.*]] = fptosi double [[X:%.*]] to i1026		; CHECK-NEXT: [[I:%.*]] = fptosi double [[X:%.*]] to i1025
; CHECK-NEXT: [[R:%.*]] = trunc i1026 [[I]] to i1025		; CHECK-NEXT: ret i1025 [[I]]
; CHECK-NEXT: ret i1025 [[R]]
;		;
%i = fptosi double %x to i1026		%i = fptosi double %x to i1026
%r = trunc i1026 %i to i1025		%r = trunc i1026 %i to i1025
ret i1025 %r		ret i1025 %r
}		}

; Negative test - not enough bits to hold min double value.		; Negative test - not enough bits to hold min double value.

[10 lines not shown]

This is an archive of the discontinued LLVM Phabricator instance.

[InstCombine] trunc (fptoui|fptosi)
ClosedPublic
