This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/trunk/
-
trunk/
-
docs/
-
LangRef.rst
-
include/llvm/
-
llvm/
-
CodeGen/
-
ISDOpcodes.h
-
SelectionDAGNodes.h
-
TargetLowering.h
-
IR/
-
IntrinsicInst.h
-
Intrinsics.td
-
lib/
-
CodeGen/SelectionDAG/
-
SelectionDAG/
-
LegalizeDAG.cpp
-
LegalizeTypes.h
-
LegalizeVectorOps.cpp
-
LegalizeVectorTypes.cpp
-
SelectionDAG.cpp
-
SelectionDAGBuilder.cpp
-
SelectionDAGDumper.cpp
-
IR/
-
IntrinsicInst.cpp
-
Verifier.cpp
-
test/
-
CodeGen/X86/
-
X86/
-
fp-intrinsics.ll
-
vector-constrained-fp-intrinsics.ll
-
Feature/
-
fp-intrinsics.ll

Differential D55897

Add constrained fptrunc and fpext intrinsics
ClosedPublic

Authored by kpn on Dec 19 2018, 12:20 PM.

Download Raw Diff

Details

Reviewers

andrew.w.kaylor
craig.topper
hfinkel
mehdi_amini
aemerson
javed.absar

Commits

rZORG635c65f4803c: Add constrained fptrunc and fpext intrinsics.
rZORG3cc1796eb454: Add constrained fptrunc and fpext intrinsics.
rG635c65f4803c: Add constrained fptrunc and fpext intrinsics.
rG3cc1796eb454: Add constrained fptrunc and fpext intrinsics.
rG5987749e33bb: Add constrained fptrunc and fpext intrinsics.
rL360581: Add constrained fptrunc and fpext intrinsics.

Summary

This ticket was split off from D43515. It contains just the experimental constrained fptrunc and fpext intrinsics plus related changes like documentation. These two intrinsics are simpler to implement so I don't see a reason they need to wait on the rest of D43515.

Diff Detail

Repository: rL LLVM

Event Timeline

kpn created this revision.Dec 19 2018, 12:20 PM

Herald added a subscriber: llvm-commits. · View Herald TranscriptDec 19 2018, 12:20 PM

I just realized I missed a couple of changes. Let me work on those and I'll update later.

It was pointed out before I split this ticket out that I wasn't properly passing the correct operand in ExpandNode() when handling the two new STRICT nodes. I've corrected that.

cwabbott added a subscriber: cwabbott.Jan 24 2019, 3:57 AM

cwabbott added inline comments.

docs/LangRef.rst
14529 ↗	(On Diff #182019)	Hi, I've been working on implementing a Vulkan extension which allows the user to specifiy different rounding modes for the AMDGPU backend. I'm not sure how this works in C/C++, but we're required to support floating-point truncation with non-standard rounding modes. Is there a reason the rounding mode isn't an argument here?

kpn marked an inline comment as done.Jan 25 2019, 9:24 AM

kpn added inline comments.

docs/LangRef.rst
14529 ↗	(On Diff #182019)	Going back and rereading D43515, I don't see an explicit reason given back then. And I can't find anything in the C99 or IEEE 754 standards, or in the LLVM documentation, that would mandate any particular rounding mode. So I'm open to adding a rounding mode argument to the constrained fptrunc. Andrew? What do you think of making constrained fptrunc go back to taking a rounding mode argument? Having said that, the constrained FP intrinsics are to avoid optimizations that change program behavior taking traps into account. Is this the behavior you need for Vulcan?

cwabbott added inline comments.Jan 28 2019, 7:30 AM

docs/LangRef.rst
14529 ↗	(On Diff #182019)	Vulkan doesn't support trapping floating-point exceptions, so we don't have to worry about that. However, my understanding is that we still need to communicate the rounding mode to LLVM, to prevent it from constant folding floating-point operations with the wrong rounding mode, so we still need to use the intrinsics. There's also the complication that we may need to emit some "internal" floating-point operations which need to have some defined rounding mode different from what the user specified. The backend has total control over the control register that specifies the rounding mode, and in some cases (e.g. fp32 -> fp16 truncation) the rounding mode is actually specified statically by the instruction itself rather than the control register, so my thought was that we could make AMDGPU just always use the rounding specified by the argument to the constrained intrinsic, emitting changes to the control register if necessary. I'm not sure if the CodeGen infrastructure is set up to do that.

cwabbott added inline comments.Jan 28 2019, 9:16 AM

docs/LangRef.rst
14529 ↗	(On Diff #182019)	Oh, and I forgot another thing: the extension also adds support for letting the user either flush or preserve denormalized values. However, this is per-source-module, and sometimes we need to stitch together multiple source modules which have different rounding needs, emitting an instruction in between them to change the denorm flushing and/or rounding mode. So it seems we really do need to use the constrained intrinsics, to prevent code motion of floating-point operations around that register setting.

OK, I'm working on it now.

Add back a rounding mode argument to constrained fptrunc as requested by Connor Abbott.

andrew.w.kaylor added inline comments.Jan 30 2019, 11:09 AM

docs/LangRef.rst
14529 ↗	(On Diff #182019)	We should definitely have a rounding mode argument for fptrunc. I think the reason we missed that the first time around is probably due to the unfortunate naming of this operation (i.e. it isn't actually truncating) and the confusion with ISD::ROUND.

This is looking pretty good. I don't think I know the Selection DAG well enough to offer a proper review of that. I'll see if I can get Craig's attention on it.

docs/index.rst
194 ↗	(On Diff #184127)	This seems a bit too prominently placed. Most people don't care about these intrinsics. I would recommend sinking this down into the subsytem documentation section. There is no clear organization there, so it's hard to say where it should go. Maybe just above or below the exception handling section (just based on my perception of the generality of each).
lib/IR/Verifier.cpp
4670 ↗	(On Diff #184127)	The formatting is non-standard and inconsistent in this section.
4685 ↗	(On Diff #184127)	You could combine these two lines as: if (auto *VecTy = dyn_cast<VectorType>(OperandTy))
4696 ↗	(On Diff #184127)	Redefining "Operand" here makes the code confusing. I'd rather see Operand and Result as separate variables and make local variables for their types. I would also make the Assert statements more specific to what they are actually checking. For instance, if (OperandTy->isVectorTy()) { Assert(ResultTy->isVectorTy(), ... Assert(OperandTy->getVectorNumElements() == ResultTy->getVectorNumElements(), ... } Is there a reason that these vector checks are specific to fptrunc and fpext? Is it just because they don't have a "same type" restriction in the intrinsic definition? The floating point type assertions could be simplified as Assert(OperandTy->isFPorFPVectorTy(),... Assert(ResultTy->isFPorFPVectorTy(),... You should also be checking that OperandTy->getScalarSizeInBits() > ResultTy->getScalarSizeInBits for fptrunc and vice versa for fpext.

craig.topper added inline comments.Jan 31 2019, 2:19 PM

lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp
710 ↗	(On Diff #184127)	What makes us reach this case? I would expect we'd scalarize based on the result type before we got to the operand type.
3058 ↗	(On Diff #184127)	We can't really widen this can we? Won't that put garbage in the upper elements?

craig.topper added inline comments.Jan 31 2019, 2:19 PM

test/CodeGen/X86/vector-constrained-fp-intrinsics.ll
2566 ↗	(On Diff #184127)	Please add an AVX command line so v4f64 will be a legal type.

kpn marked 6 inline comments as done.Feb 5 2019, 9:46 AM

kpn added inline comments.

lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp
710 ↗	(On Diff #184127)	Test case CodeGen/AArch64/neon-fpround_f128.ll says this code is needed. That case goes through the non-STRICT version of this function. This STRICT function was copied from the non-STRICT function and called in the appropriate places alongside that function. And a trivial conversion of the test case to use the constrained intrinsics does indeed go through this STRICT function. I did want to try to keep the functions unified, but sometimes the result was too ugly to live.
3058 ↗	(On Diff #184127)	Is that what getUNDEF() does? Give llvm license to put garbage in registers? My assumption is that the code is fine because this function is a copy of WidenVecRes_Convert() with the needed changes for the strict node being chained. If if there's a problem here there's also a problem in that function.
lib/IR/Verifier.cpp
4685 ↗	(On Diff #184127)	The rewrite moots this point.
4696 ↗	(On Diff #184127)	I've rewritten this code and with your suggestions it does look much nicer. I think I did put the vector checks are for fptrunc and fpext because they aren't checked earlier. I've also added the checks for the appropriate changes in ScalarSizeInBits.

Address review comments.

craig.topper added inline comments.Feb 5 2019, 10:19 AM

lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp
3058 ↗	(On Diff #184127)	That is what the undef means. The existing code isn't required to be exception safe so garbage is fine. The constrained intrinsics have to be exception safe. This is why the implementation of WidenVecRes_StrictFP is different than the non-trapping case in WidenVecRes_BinaryCanTrap. I believe FADD/FSUB/FDIV/FMUL are considered non-trapping on most targets.

I'm seeing a regression in WebAssembly/PR40267.ll that I need to look into. If anyone else is seeing this failure let me know.

lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp
3058 ↗	(On Diff #184127)	It looks like this "WidenNumElts % InVTNumElts" block can just be eliminated. The fallback at the end of the function should handle the case without any undefs.

cameron.mcinally added inline comments.Feb 13 2019, 8:27 AM

lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
2845 ↗	(On Diff #185337)	Would this lowering to `EmitStackConvert(...)` discard the rounding mode? I'm not familiar with this code, but I think it would. E.g. STRICT_FP_ROUND would lower to a truncating store.

cameron.mcinally added inline comments.Feb 13 2019, 9:02 AM

lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp
710 ↗	(On Diff #184127)	Have you considered scalarizing the operands with a generic function? Something like `ScalarizeVecRes_StrictFPOp(...)`? I suspect we'll see reuse of this code. @craig.topper What makes us reach this case? I would expect we'd scalarize based on the result type before we got to the operand type. The different operand and result types for these operations are probably why `ScalarizeVecRes_StrictFPOp(...)` didn't trip. Just guessing though...

kpn marked 2 inline comments as done.Feb 18 2019, 11:31 AM

kpn added inline comments.

lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
2845 ↗	(On Diff #185337)	Yes it would, assuming EmitStackConvert() works correctly for floating point types. A 'make check' of llvm with the default targets enabled doesn't trigger the call to DAG.getTruncStore(), so I'm not actually sure it works. But none of the other intrinsics handle rounding modes currently, so I think we should leave this to future work.
lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp
710 ↗	(On Diff #184127)	You know, I thought about it. But trying to generalize it would hide the parallel with ScalarizeVecRes_FP_ROUND(). Right now the connection between the two is obvious. I'm not sure that we'd gain much at this point in time by generalizing and losing that bit of readability. If we do end up needing to generalize this code in the future then we can do it then.

Address review comments.

Fix regression on WebAssembly accidentally caused by incorrect svn update.

jsji added a subscriber: jsji.Feb 25 2019, 10:42 AM

Rebase. Ping.

andrew.w.kaylor added a subscriber: pengfei.Mar 21 2019, 5:48 PM

craig.topper added inline comments.Mar 25 2019, 12:24 PM

docs/LangRef.rst
14812 ↗	(On Diff #191120)	This should be in a separate patch.
include/llvm/IR/Intrinsics.td
611 ↗	(On Diff #191120)	This line [ should probably be lined up with the line above. Same with fpext.
lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
1798 ↗	(On Diff #191120)	This line doesn't make sense to me. This is replacing the users of result 1 of the load you just created with Chain. But no one has seen this load yet so how can its result 1 have any users?
2806 ↗	(On Diff #191120)	Shouldn't you be passing the input chain operand of Node into the last argument? Right now it looks like you're using the Chain result from Node itself. But we want to delete Node.
lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp
773 ↗	(On Diff #191120)	Assert that OpNo is 1.
1648 ↗	(On Diff #191120)	use a temporary for the OpNo instead of of repeatedly calling N->isStrictFPOpcode()
2051 ↗	(On Diff #191120)	This comment must have been copy pasted. We aren't operating on a "load" here
4208 ↗	(On Diff #191120)	Add curly braces to the else and fix the indentation
lib/CodeGen/SelectionDAG/SelectionDAG.cpp
7659 ↗	(On Diff #191120)	Can this be Node->getValueType(0)? Accessing ValueList directly seems pretty unusual.
lib/IR/Verifier.cpp
4639 ↗	(On Diff #191120)	Most of this looks like it belongs in a separate patch. This patch shoudl focus on adding the new intrinsics. Gaps in old intrinsics should be separated.
test/CodeGen/X86/vector-constrained-fp-intrinsics.ll
3 ↗	(On Diff #191120)	Do this as a pre-commit?

kpn marked an inline comment as done.Mar 26 2019, 7:55 AM

kpn added inline comments.

docs/LangRef.rst
14812 ↗	(On Diff #191120)	What should be in a separate patch? The new AddingConstrainedIntrinsics.rst file?

kpn marked an inline comment as not done.Mar 26 2019, 7:56 AM

kpn mentioned this in D59830: [FPEnv] Make constrained FP IR verification more flexible..Mar 26 2019, 11:33 AM

kpn mentioned this in D59833: [FPEnv] New document for adding new constrained FP intrinsics.Mar 26 2019, 11:58 AM

kpn mentioned this in rL357065: The IR verifier currently supports the constrained floating point intrinsics,.Mar 27 2019, 6:30 AM

kpn mentioned this in rG4f3cdc6555ca: The IR verifier currently supports the constrained floating point intrinsics….

kpn marked 4 inline comments as done.Apr 15 2019, 5:56 AM

kpn marked 6 inline comments as done.Apr 17 2019, 9:47 AM

kpn added inline comments.

lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
1798 ↗	(On Diff #191120)	Agreed. I'm fixing it now.
2806 ↗	(On Diff #191120)	I'm now changing EmitStackConvert to splice itself into the chain. So passing Node is about to become correct since I need both ends of the chain to do the splicing.

Address review comments.

I did find that I didn't need a new, special version of ReplaceNode(). One of the existing ones appears to do the job. That meant I could simplify the changes to EmitStackConvert() and avoid chain splicing there. This, in turn, means I only needed to pass in one end of the chain to EmitStackConvert(). I hope the result is more readable.

andrew.w.kaylor added inline comments.Apr 25 2019, 3:06 PM

lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
2787 ↗	(On Diff #196690)	Can this cause the wrong rounding mode to be used? For X86 targets I would guess it will result in an instruction that uses the runtime rounding mode, but I'm not sure we can count on that for all targets. Also, if the value being converted is a constant, might this get folded using the default rounding mode? And if it is a constant and we knew the rounding mode based on an argument to the intrinsic, we might want to fold it (though ideally that would have happened before this).

kpn added inline comments.Apr 26 2019, 7:56 AM

lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
2787 ↗	(On Diff #196690)	Isn't this an argument for a strict load node types and maybe a strict store type as well? I'd be surprised if we were doing constant folding of a store+load combination, but if we are then either a strict load or store node would disable that folding. Strictly speaking, using the runtime rounding mode would be incorrect if that wasn't what was specified in the rounding mode field of the intrinsic. So even X86 could be wrong. We aren't doing anything with the rounding and exception arguments yet for any of the new intrinsics. So how about I file a bug noting that fact and noting that we need to not forget about this case? Then we can move on and come back later.

andrew.w.kaylor added inline comments.Apr 26 2019, 6:19 PM

lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
2787 ↗	(On Diff #196690)	Actually, if the runtime rounding mode doesn't match the rounding argument for non-dynamic cases that's a user error or a bug somewhere upstream in the compiler. The rounding mode argument is supposed to tell us what the rounding mode is at this point in the program. It is not supposed to control the rounding mode. The non-dynamic rounding modes are essentially an "assume" kind of directive. I see that the constant folding case I'm concerned about will actually be covered by one of your test cases, so I think we can safely move forward with this the way you have it.
lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp
1651 ↗	(On Diff #196690)	I'd prefer to see this as: unsigned OpNo = N->isStrictFPOpcode() ? 1 : 0;
2043 ↗	(On Diff #196690)	Again, the use of a boolean result as an index is awkward.
test/CodeGen/X86/fp-intrinsics.ll
303 ↗	(On Diff #196690)	This comment is wrong.
test/CodeGen/X86/vector-constrained-fp-intrinsics.ll
3 ↗	(On Diff #191120)	Expanding on Craig's comment here. I think he is suggesting that you add the AVX run line and all of the associated new checks as a separate patch before the fpext/fptrrunc patch lands.

kpn mentioned this in rL359461: Add AVX support to this test..Apr 29 2019, 9:04 AM

kpn mentioned this in rGa25c92830219: Add AVX support to this test..

Address review comments.

andrew.w.kaylor added inline comments.Apr 29 2019, 11:38 AM

lib/CodeGen/SelectionDAG/SelectionDAG.cpp
7711 ↗	(On Diff #197134)	It might be helpful to have a comment here explaining why STRICT_FP_ROUND isn't unary. That can be done in a separate change if everything else here is ready to commit. It all looks good to me, but I'm hoping Craig can give your widen vector changes one last look to make sure that's doing what he expected.

craig.topper added inline comments.Apr 29 2019, 12:12 PM

lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
2793 ↗	(On Diff #197134)	This break is unreachable.
2808 ↗	(On Diff #197134)	This break is unreachable
lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp
3225 ↗	(On Diff #197134)	I'm not sure this is safe. There's no way of knowing how the input was widened. I don't think you can access more than the original vector width worth of elements.
3237 ↗	(On Diff #197134)	I don't think this is safe either.
4184 ↗	(On Diff #197134)	Again I don't think this is safe. We don't what's in widened elements of the input.
test/CodeGen/X86/vector-constrained-fp-intrinsics.ll
3895 ↗	(On Diff #197134)	I think this is wrong. It's reading 4 elements, but we don't know what is in the 4th element.
3966 ↗	(On Diff #197134)	This is reading 4 float elements from memory. We should only be reading 2.
3992 ↗	(On Diff #197134)	This reads 4 floats from memory, but we should only read 3.

cameron.mcinally added inline comments.May 1 2019, 9:51 AM

lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
2787 ↗	(On Diff #196690)	Can this cause the wrong rounding mode to be used? For X86 targets I would guess it will result in an instruction that uses the runtime rounding mode, but I'm not sure we can count on that for all targets. I think it's more severe than that. The details are not fresh in my mind, but IIRC EmitStackConvert(...) emits a straight truncate on X86 (well, at least for one case I looked at. Could be wrong too.). We probably shouldn't be calling it for a strict round, and rather go to an instruction that honors rounding mode.

kpn marked 2 inline comments as done.May 1 2019, 10:06 AM

kpn added inline comments.

lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
2787 ↗	(On Diff #196690)	My impression is that EmitStackConvert() is one of those functions that is only used as a last resort when an ISA lacks a good reg+reg instruction. So this is probably already handled for most cases. If you know of a way to write a strict FP test case that uses it I'd love to see it. And if you have one then we should probably turn it into a library call to honor the rounding mode.
lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp
3225 ↗	(On Diff #197134)	I believe you are correct. I'm looking at this now.

andrew.w.kaylor added inline comments.May 1 2019, 11:24 AM

lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
2787 ↗	(On Diff #196690)	@cameron.mcinally , what do you mean by "a straight truncate"?

cameron.mcinally added inline comments.May 1 2019, 12:14 PM

lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
2787 ↗	(On Diff #196690)	Nope, I made a mistake. EmitStackConvert(...) ends up creating a truncstore, which could end up as an FST. I had mistakenly thought that this rounded to nearest, but it looks like it uses the FPU control word. Assuming that fenv.h sets the FPU control word correctly (I didn't check), it should be fine.

andrew.w.kaylor added inline comments.May 1 2019, 1:14 PM

lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
2787 ↗	(On Diff #196690)	Regarding fenv.h, if you use the fesetround() function from fenv.h to set the rounding mode, it will update both MXCSR and FPCW. If you use an intrinsic or inline assembly or whatever to set one of them without the other, you're on your own (and fegetround() will fail if they have different settings).

Address review comments: Add a comment as requested. Remove bogus optimizations to let the base case work. Update tests.

Why does Verifier::visitIntrinsicCall() not choke when given an intrinsic it doesn't know about?

I added back the lines accidentally lost so the new intrinsics do now get checked.

kpn added a subscriber: kbarton.May 10 2019, 12:28 PM

craig.topper added inline comments.May 10 2019, 12:51 PM

include/llvm/IR/Intrinsics.td
691 ↗	(On Diff #198890)	A lot of this FIXME still applies doesn't it?
lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp
3215 ↗	(On Diff #198890)	I think this is an unused variable
lib/IR/Verifier.cpp
4708 ↗	(On Diff #198890)	Indented too far
4710 ↗	(On Diff #198890)	Indented too far

cameron.mcinally added inline comments.May 10 2019, 1:03 PM

include/llvm/IR/Intrinsics.td
691 ↗	(On Diff #198890)	We definitely need FCMP. I attempted a patch for this, D54649, but FCMP is Custom lowered on X86. We still need a good way to handle STRICT nodes that need to be Custom lowered. Also FYI that the line below can be removed. We decided not to implement constrained FABS and FCOPYSIGN... unless a problem is found.

Address review comments.

LGTM with that one remaining unused variable fixed.

lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp
3214 ↗	(On Diff #199074)	Is InWidenVT also unused?

This revision is now accepted and ready to land.May 10 2019, 1:34 PM

Excellent. Thank you for all the reviews! I will commit this on Monday morning so I can keep an eye on the bots.

cameron.mcinally mentioned this in D24422: Unsafe copysign xform in DAGCombiner.May 10 2019, 1:49 PM

Closed by commit rL360581: Add constrained fptrunc and fpext intrinsics. (authored by kpn). · Explain WhyMay 13 2019, 6:21 AM

This revision was automatically updated to reflect the committed changes.

Herald added a project: Restricted Project. · View Herald TranscriptMay 13 2019, 6:21 AM

kpn mentioned this in D43515: More math intrinsics for conservative math handling.May 15 2019, 10:55 AM

Revision Contents

Path

Size

llvm/

trunk/

docs/

LangRef.rst

71 lines

include/

llvm/

CodeGen/

ISDOpcodes.h

20 lines

SelectionDAGNodes.h

2 lines

TargetLowering.h

2 lines

IR/

IntrinsicInst.h

2 lines

Intrinsics.td

13 lines

lib/

CodeGen/

SelectionDAG/

LegalizeDAG.cpp

35 lines

LegalizeTypes.h

3 lines

LegalizeVectorOps.cpp

4 lines

LegalizeVectorTypes.cpp

198 lines

SelectionDAG.cpp

17 lines

SelectionDAGBuilder.cpp

15 lines

SelectionDAGDumper.cpp

2 lines

IR/

IntrinsicInst.cpp

2 lines

Verifier.cpp

43 lines

test/

CodeGen/

X86/

fp-intrinsics.ll

26 lines

vector-constrained-fp-intrinsics.ll

219 lines

Feature/

fp-intrinsics.ll

26 lines

Diff 199261

llvm/trunk/docs/LangRef.rst

This file is larger than 256 KB, so syntax highlighting is disabled by default.

	Show First 20 Lines • Show All 14,817 Lines • ▼ Show 20 Lines

	Semantics:			Semantics:
	""""""""""			""""""""""

	The result produced is the product of the first two operands added to the third			The result produced is the product of the first two operands added to the third
	operand computed with infinite precision, and then rounded to the target			operand computed with infinite precision, and then rounded to the target
	precision.			precision.

				'``llvm.experimental.constrained.fptrunc``' Intrinsic
				^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

				Syntax:
				"""""""

				::

				declare <ty2>
				@llvm.experimental.constrained.fptrunc(<type> <value>,
				metadata <rounding mode>,
				metadata <exception behavior>)

				Overview:
				"""""""""

				The '``llvm.experimental.constrained.fptrunc``' intrinsic truncates ``value``
				to type ``ty2``.

				Arguments:
				""""""""""

				The first argument to the '``llvm.experimental.constrained.fptrunc``'
				intrinsic must be :ref:`floating point <t_floating>` or :ref:`vector
				<t_vector>` of floating point values. This argument must be larger in size
				than the result.

				The second and third arguments specify the rounding mode and exception
				behavior as described above.

				Semantics:
				""""""""""

				The result produced is a floating point value truncated to be smaller in size
				than the operand.

				'``llvm.experimental.constrained.fpext``' Intrinsic
				^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

				Syntax:
				"""""""

				::

				declare <ty2>
				@llvm.experimental.constrained.fpext(<type> <value>,
				metadata <exception behavior>)

				Overview:
				"""""""""

				The '``llvm.experimental.constrained.fpext``' intrinsic extends a
				floating-point ``value`` to a larger floating-point value.

				Arguments:
				""""""""""

				The first argument to the '``llvm.experimental.constrained.fpext``'
				intrinsic must be :ref:`floating point <t_floating>` or :ref:`vector
				<t_vector>` of floating point values. This argument must be smaller in size
				than the result.

				The second argument specifies the exception behavior as described above.

				Semantics:
				""""""""""

				The result produced is a floating point value extended to be larger in size
				than the operand. All restrictions that apply to the fpext instruction also
				apply to this intrinsic.

	Constrained libm-equivalent Intrinsics			Constrained libm-equivalent Intrinsics
	--------------------------------------			--------------------------------------

	In addition to the basic floating-point operations for which constrained			In addition to the basic floating-point operations for which constrained
	intrinsics are described above, there are constrained versions of various			intrinsics are described above, there are constrained versions of various
	operations which provide equivalent behavior to a corresponding libm function.			operations which provide equivalent behavior to a corresponding libm function.
	These intrinsics allow the precise behavior of these operations with respect to			These intrinsics allow the precise behavior of these operations with respect to
	rounding mode and exception behavior to be controlled.			rounding mode and exception behavior to be controlled.
	▲ Show 20 Lines • Show All 2,036 Lines • Show Last 20 Lines

llvm/trunk/include/llvm/CodeGen/ISDOpcodes.h

Show First 20 Lines • Show All 291 Lines • ▼ Show 20 Lines	enum NodeType {
/// These will be lowered to the equivalent non-constrained pseudo-op		/// These will be lowered to the equivalent non-constrained pseudo-op
/// (or expanded to the equivalent library call) before final selection.		/// (or expanded to the equivalent library call) before final selection.
/// They are used to limit optimizations while the DAG is being optimized.		/// They are used to limit optimizations while the DAG is being optimized.
STRICT_FSQRT, STRICT_FPOW, STRICT_FPOWI, STRICT_FSIN, STRICT_FCOS,		STRICT_FSQRT, STRICT_FPOW, STRICT_FPOWI, STRICT_FSIN, STRICT_FCOS,
STRICT_FEXP, STRICT_FEXP2, STRICT_FLOG, STRICT_FLOG10, STRICT_FLOG2,		STRICT_FEXP, STRICT_FEXP2, STRICT_FLOG, STRICT_FLOG10, STRICT_FLOG2,
STRICT_FRINT, STRICT_FNEARBYINT, STRICT_FMAXNUM, STRICT_FMINNUM,		STRICT_FRINT, STRICT_FNEARBYINT, STRICT_FMAXNUM, STRICT_FMINNUM,
STRICT_FCEIL, STRICT_FFLOOR, STRICT_FROUND, STRICT_FTRUNC,		STRICT_FCEIL, STRICT_FFLOOR, STRICT_FROUND, STRICT_FTRUNC,

		/// X = STRICT_FP_ROUND(Y, TRUNC) - Rounding 'Y' from a larger floating
		/// point type down to the precision of the destination VT. TRUNC is a
		/// flag, which is always an integer that is zero or one. If TRUNC is 0,
		/// this is a normal rounding, if it is 1, this FP_ROUND is known to not
		/// change the value of Y.
		///
		/// The TRUNC = 1 case is used in cases where we know that the value will
		/// not be modified by the node, because Y is not using any of the extra
		/// precision of source type. This allows certain transformations like
		/// STRICT_FP_EXTEND(STRICT_FP_ROUND(X,1)) -> X which are not safe for
		/// STRICT_FP_EXTEND(STRICT_FP_ROUND(X,0)) because the extra bits aren't
		/// removed.
		/// It is used to limit optimizations while the DAG is being optimized.
		STRICT_FP_ROUND,

		/// X = STRICT_FP_EXTEND(Y) - Extend a smaller FP type into a larger FP
		/// type.
		/// It is used to limit optimizations while the DAG is being optimized.
		STRICT_FP_EXTEND,

/// FMA - Perform a * b + c with no intermediate rounding step.		/// FMA - Perform a * b + c with no intermediate rounding step.
FMA,		FMA,

/// FMAD - Perform a * b + c, while getting the same result as the		/// FMAD - Perform a * b + c, while getting the same result as the
/// separately rounded operations.		/// separately rounded operations.
FMAD,		FMAD,

/// FCOPYSIGN(X, Y) - Return the value of X with the sign of Y. NOTE: This		/// FCOPYSIGN(X, Y) - Return the value of X with the sign of Y. NOTE: This
▲ Show 20 Lines • Show All 740 Lines • Show Last 20 Lines

llvm/trunk/include/llvm/CodeGen/SelectionDAGNodes.h

Show First 20 Lines • Show All 685 Lines • ▼ Show 20 Lines	switch (NodeType) {
case ISD::STRICT_FRINT:		case ISD::STRICT_FRINT:
case ISD::STRICT_FNEARBYINT:		case ISD::STRICT_FNEARBYINT:
case ISD::STRICT_FMAXNUM:		case ISD::STRICT_FMAXNUM:
case ISD::STRICT_FMINNUM:		case ISD::STRICT_FMINNUM:
case ISD::STRICT_FCEIL:		case ISD::STRICT_FCEIL:
case ISD::STRICT_FFLOOR:		case ISD::STRICT_FFLOOR:
case ISD::STRICT_FROUND:		case ISD::STRICT_FROUND:
case ISD::STRICT_FTRUNC:		case ISD::STRICT_FTRUNC:
		case ISD::STRICT_FP_ROUND:
		case ISD::STRICT_FP_EXTEND:
return true;		return true;
}		}
}		}

/// Test if this node has a post-isel opcode, directly		/// Test if this node has a post-isel opcode, directly
/// corresponding to a MachineInstr opcode.		/// corresponding to a MachineInstr opcode.
bool isMachineOpcode() const { return NodeType < 0; }		bool isMachineOpcode() const { return NodeType < 0; }

▲ Show 20 Lines • Show All 1,922 Lines • Show Last 20 Lines

llvm/trunk/include/llvm/CodeGen/TargetLowering.h

Show First 20 Lines • Show All 885 Lines • ▼ Show 20 Lines	switch (Op) {
case ISD::STRICT_FRINT: EqOpc = ISD::FRINT; break;		case ISD::STRICT_FRINT: EqOpc = ISD::FRINT; break;
case ISD::STRICT_FNEARBYINT: EqOpc = ISD::FNEARBYINT; break;		case ISD::STRICT_FNEARBYINT: EqOpc = ISD::FNEARBYINT; break;
case ISD::STRICT_FMAXNUM: EqOpc = ISD::FMAXNUM; break;		case ISD::STRICT_FMAXNUM: EqOpc = ISD::FMAXNUM; break;
case ISD::STRICT_FMINNUM: EqOpc = ISD::FMINNUM; break;		case ISD::STRICT_FMINNUM: EqOpc = ISD::FMINNUM; break;
case ISD::STRICT_FCEIL: EqOpc = ISD::FCEIL; break;		case ISD::STRICT_FCEIL: EqOpc = ISD::FCEIL; break;
case ISD::STRICT_FFLOOR: EqOpc = ISD::FFLOOR; break;		case ISD::STRICT_FFLOOR: EqOpc = ISD::FFLOOR; break;
case ISD::STRICT_FROUND: EqOpc = ISD::FROUND; break;		case ISD::STRICT_FROUND: EqOpc = ISD::FROUND; break;
case ISD::STRICT_FTRUNC: EqOpc = ISD::FTRUNC; break;		case ISD::STRICT_FTRUNC: EqOpc = ISD::FTRUNC; break;
		case ISD::STRICT_FP_ROUND: EqOpc = ISD::FP_ROUND; break;
		case ISD::STRICT_FP_EXTEND: EqOpc = ISD::FP_EXTEND; break;
}		}

auto Action = getOperationAction(EqOpc, VT);		auto Action = getOperationAction(EqOpc, VT);

// We don't currently handle Custom or Promote for strict FP pseudo-ops.		// We don't currently handle Custom or Promote for strict FP pseudo-ops.
// For now, we just expand for those cases.		// For now, we just expand for those cases.
if (Action != Legal)		if (Action != Legal)
Action = Expand;		Action = Expand;
▲ Show 20 Lines • Show All 3,103 Lines • Show Last 20 Lines

llvm/trunk/include/llvm/IR/IntrinsicInst.h

Show First 20 Lines • Show All 232 Lines • ▼ Show 20 Lines	public:
static bool classof(const IntrinsicInst *I) {		static bool classof(const IntrinsicInst *I) {
switch (I->getIntrinsicID()) {		switch (I->getIntrinsicID()) {
case Intrinsic::experimental_constrained_fadd:		case Intrinsic::experimental_constrained_fadd:
case Intrinsic::experimental_constrained_fsub:		case Intrinsic::experimental_constrained_fsub:
case Intrinsic::experimental_constrained_fmul:		case Intrinsic::experimental_constrained_fmul:
case Intrinsic::experimental_constrained_fdiv:		case Intrinsic::experimental_constrained_fdiv:
case Intrinsic::experimental_constrained_frem:		case Intrinsic::experimental_constrained_frem:
case Intrinsic::experimental_constrained_fma:		case Intrinsic::experimental_constrained_fma:
		case Intrinsic::experimental_constrained_fptrunc:
		case Intrinsic::experimental_constrained_fpext:
case Intrinsic::experimental_constrained_sqrt:		case Intrinsic::experimental_constrained_sqrt:
case Intrinsic::experimental_constrained_pow:		case Intrinsic::experimental_constrained_pow:
case Intrinsic::experimental_constrained_powi:		case Intrinsic::experimental_constrained_powi:
case Intrinsic::experimental_constrained_sin:		case Intrinsic::experimental_constrained_sin:
case Intrinsic::experimental_constrained_cos:		case Intrinsic::experimental_constrained_cos:
case Intrinsic::experimental_constrained_exp:		case Intrinsic::experimental_constrained_exp:
case Intrinsic::experimental_constrained_exp2:		case Intrinsic::experimental_constrained_exp2:
case Intrinsic::experimental_constrained_log:		case Intrinsic::experimental_constrained_log:
▲ Show 20 Lines • Show All 570 Lines • Show Last 20 Lines

llvm/trunk/include/llvm/IR/Intrinsics.td

Show First 20 Lines • Show All 601 Lines • ▼ Show 20 Lines	let IntrProperties = [IntrInaccessibleMemOnly] in {

def int_experimental_constrained_fma : Intrinsic<[ llvm_anyfloat_ty ],		def int_experimental_constrained_fma : Intrinsic<[ llvm_anyfloat_ty ],
[ LLVMMatchType<0>,		[ LLVMMatchType<0>,
LLVMMatchType<0>,		LLVMMatchType<0>,
LLVMMatchType<0>,		LLVMMatchType<0>,
llvm_metadata_ty,		llvm_metadata_ty,
llvm_metadata_ty ]>;		llvm_metadata_ty ]>;

		def int_experimental_constrained_fptrunc : Intrinsic<[ llvm_anyfloat_ty ],
		[ llvm_anyfloat_ty,
		llvm_metadata_ty,
		llvm_metadata_ty ]>;

		def int_experimental_constrained_fpext : Intrinsic<[ llvm_anyfloat_ty ],
		[ llvm_anyfloat_ty,
		llvm_metadata_ty ]>;

// These intrinsics are sensitive to the rounding mode so we need constrained		// These intrinsics are sensitive to the rounding mode so we need constrained
// versions of each of them. When strict rounding and exception control are		// versions of each of them. When strict rounding and exception control are
// not required the non-constrained versions of these intrinsics should be		// not required the non-constrained versions of these intrinsics should be
// used.		// used.
def int_experimental_constrained_sqrt : Intrinsic<[ llvm_anyfloat_ty ],		def int_experimental_constrained_sqrt : Intrinsic<[ llvm_anyfloat_ty ],
[ LLVMMatchType<0>,		[ LLVMMatchType<0>,
llvm_metadata_ty,		llvm_metadata_ty,
llvm_metadata_ty ]>;		llvm_metadata_ty ]>;
▲ Show 20 Lines • Show All 65 Lines • ▼ Show 20 Lines	def int_experimental_constrained_round : Intrinsic<[ llvm_anyfloat_ty ],
[ LLVMMatchType<0>,		[ LLVMMatchType<0>,
llvm_metadata_ty,		llvm_metadata_ty,
llvm_metadata_ty ]>;		llvm_metadata_ty ]>;
def int_experimental_constrained_trunc : Intrinsic<[ llvm_anyfloat_ty ],		def int_experimental_constrained_trunc : Intrinsic<[ llvm_anyfloat_ty ],
[ LLVMMatchType<0>,		[ LLVMMatchType<0>,
llvm_metadata_ty,		llvm_metadata_ty,
llvm_metadata_ty ]>;		llvm_metadata_ty ]>;
}		}
// FIXME: Add intrinsics for fcmp, fptrunc, fpext, fptoui and fptosi.		// FIXME: Add intrinsics for fcmp, fptoui and fptosi.
// FIXME: Add intrinsics for fabs and copysign?


//===------------------------- Expect Intrinsics --------------------------===//		//===------------------------- Expect Intrinsics --------------------------===//
//		//
def int_expect : Intrinsic<[llvm_anyint_ty],		def int_expect : Intrinsic<[llvm_anyint_ty],
[LLVMMatchType<0>, LLVMMatchType<0>], [IntrNoMem]>;		[LLVMMatchType<0>, LLVMMatchType<0>], [IntrNoMem]>;

//===-------------------- Bit Manipulation Intrinsics ---------------------===//		//===-------------------- Bit Manipulation Intrinsics ---------------------===//
//		//
▲ Show 20 Lines • Show All 486 Lines • Show Last 20 Lines

llvm/trunk/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp

Show First 20 Lines • Show All 148 Lines • ▼ Show 20 Lines	SDValue ExpandIntLibCall(SDNode *Node, bool isSigned,
RTLIB::Libcall Call_I32,		RTLIB::Libcall Call_I32,
RTLIB::Libcall Call_I64,		RTLIB::Libcall Call_I64,
RTLIB::Libcall Call_I128);		RTLIB::Libcall Call_I128);
void ExpandDivRemLibCall(SDNode *Node, SmallVectorImpl<SDValue> &Results);		void ExpandDivRemLibCall(SDNode *Node, SmallVectorImpl<SDValue> &Results);
void ExpandSinCosLibCall(SDNode *Node, SmallVectorImpl<SDValue> &Results);		void ExpandSinCosLibCall(SDNode *Node, SmallVectorImpl<SDValue> &Results);

SDValue EmitStackConvert(SDValue SrcOp, EVT SlotVT, EVT DestVT,		SDValue EmitStackConvert(SDValue SrcOp, EVT SlotVT, EVT DestVT,
const SDLoc &dl);		const SDLoc &dl);
		SDValue EmitStackConvert(SDValue SrcOp, EVT SlotVT, EVT DestVT,
		const SDLoc &dl, SDValue ChainIn);
SDValue ExpandBUILD_VECTOR(SDNode *Node);		SDValue ExpandBUILD_VECTOR(SDNode *Node);
SDValue ExpandSCALAR_TO_VECTOR(SDNode *Node);		SDValue ExpandSCALAR_TO_VECTOR(SDNode *Node);
void ExpandDYNAMIC_STACKALLOC(SDNode *Node,		void ExpandDYNAMIC_STACKALLOC(SDNode *Node,
SmallVectorImpl<SDValue> &Results);		SmallVectorImpl<SDValue> &Results);
void getSignAsIntValue(FloatSignAsInt &State, const SDLoc &DL,		void getSignAsIntValue(FloatSignAsInt &State, const SDLoc &DL,
SDValue Value) const;		SDValue Value) const;
SDValue modifySignAsInt(const FloatSignAsInt &State, const SDLoc &DL,		SDValue modifySignAsInt(const FloatSignAsInt &State, const SDLoc &DL,
SDValue NewIntValue) const;		SDValue NewIntValue) const;
▲ Show 20 Lines • Show All 945 Lines • ▼ Show 20 Lines	#endif
case ISD::STRICT_FRINT:		case ISD::STRICT_FRINT:
case ISD::STRICT_FNEARBYINT:		case ISD::STRICT_FNEARBYINT:
case ISD::STRICT_FMAXNUM:		case ISD::STRICT_FMAXNUM:
case ISD::STRICT_FMINNUM:		case ISD::STRICT_FMINNUM:
case ISD::STRICT_FCEIL:		case ISD::STRICT_FCEIL:
case ISD::STRICT_FFLOOR:		case ISD::STRICT_FFLOOR:
case ISD::STRICT_FROUND:		case ISD::STRICT_FROUND:
case ISD::STRICT_FTRUNC:		case ISD::STRICT_FTRUNC:
		case ISD::STRICT_FP_ROUND:
		case ISD::STRICT_FP_EXTEND:
// These pseudo-ops get legalized as if they were their non-strict		// These pseudo-ops get legalized as if they were their non-strict
// equivalent. For instance, if ISD::FSQRT is legal then ISD::STRICT_FSQRT		// equivalent. For instance, if ISD::FSQRT is legal then ISD::STRICT_FSQRT
// is also legal, but if ISD::FSQRT requires expansion then so does		// is also legal, but if ISD::FSQRT requires expansion then so does
// ISD::STRICT_FSQRT.		// ISD::STRICT_FSQRT.
Action = TLI.getStrictFPOperationAction(Node->getOpcode(),		Action = TLI.getStrictFPOperationAction(Node->getOpcode(),
Node->getValueType(0));		Node->getValueType(0));
break;		break;
case ISD::SADDSAT:		case ISD::SADDSAT:
▲ Show 20 Lines • Show All 610 Lines • ▼ Show 20 Lines
}		}

/// Emit a store/load combination to the stack. This stores		/// Emit a store/load combination to the stack. This stores
/// SrcOp to a stack slot of type SlotVT, truncating it if needed. It then does		/// SrcOp to a stack slot of type SlotVT, truncating it if needed. It then does
/// a load from the stack slot to DestVT, extending it if needed.		/// a load from the stack slot to DestVT, extending it if needed.
/// The resultant code need not be legal.		/// The resultant code need not be legal.
SDValue SelectionDAGLegalize::EmitStackConvert(SDValue SrcOp, EVT SlotVT,		SDValue SelectionDAGLegalize::EmitStackConvert(SDValue SrcOp, EVT SlotVT,
EVT DestVT, const SDLoc &dl) {		EVT DestVT, const SDLoc &dl) {
		return EmitStackConvert(SrcOp, SlotVT, DestVT, dl, DAG.getEntryNode());
		}

		SDValue SelectionDAGLegalize::EmitStackConvert(SDValue SrcOp, EVT SlotVT,
		EVT DestVT, const SDLoc &dl,
		SDValue Chain) {
// Create the stack frame object.		// Create the stack frame object.
unsigned SrcAlign = DAG.getDataLayout().getPrefTypeAlignment(		unsigned SrcAlign = DAG.getDataLayout().getPrefTypeAlignment(
SrcOp.getValueType().getTypeForEVT(*DAG.getContext()));		SrcOp.getValueType().getTypeForEVT(*DAG.getContext()));
SDValue FIPtr = DAG.CreateStackTemporary(SlotVT, SrcAlign);		SDValue FIPtr = DAG.CreateStackTemporary(SlotVT, SrcAlign);

FrameIndexSDNode *StackPtrFI = cast<FrameIndexSDNode>(FIPtr);		FrameIndexSDNode *StackPtrFI = cast<FrameIndexSDNode>(FIPtr);
int SPFI = StackPtrFI->getIndex();		int SPFI = StackPtrFI->getIndex();
MachinePointerInfo PtrInfo =		MachinePointerInfo PtrInfo =
MachinePointerInfo::getFixedStack(DAG.getMachineFunction(), SPFI);		MachinePointerInfo::getFixedStack(DAG.getMachineFunction(), SPFI);

unsigned SrcSize = SrcOp.getValueSizeInBits();		unsigned SrcSize = SrcOp.getValueSizeInBits();
unsigned SlotSize = SlotVT.getSizeInBits();		unsigned SlotSize = SlotVT.getSizeInBits();
unsigned DestSize = DestVT.getSizeInBits();		unsigned DestSize = DestVT.getSizeInBits();
Type DestType = DestVT.getTypeForEVT(DAG.getContext());		Type DestType = DestVT.getTypeForEVT(DAG.getContext());
unsigned DestAlign = DAG.getDataLayout().getPrefTypeAlignment(DestType);		unsigned DestAlign = DAG.getDataLayout().getPrefTypeAlignment(DestType);

// Emit a store to the stack slot. Use a truncstore if the input value is		// Emit a store to the stack slot. Use a truncstore if the input value is
// later than DestVT.		// later than DestVT.
SDValue Store;		SDValue Store;

if (SrcSize > SlotSize)		if (SrcSize > SlotSize)
Store = DAG.getTruncStore(DAG.getEntryNode(), dl, SrcOp, FIPtr, PtrInfo,		Store = DAG.getTruncStore(Chain, dl, SrcOp, FIPtr, PtrInfo,
SlotVT, SrcAlign);		SlotVT, SrcAlign);
else {		else {
assert(SrcSize == SlotSize && "Invalid store");		assert(SrcSize == SlotSize && "Invalid store");
Store =		Store =
DAG.getStore(DAG.getEntryNode(), dl, SrcOp, FIPtr, PtrInfo, SrcAlign);		DAG.getStore(Chain, dl, SrcOp, FIPtr, PtrInfo, SrcAlign);
}		}

// Result is a load from the stack slot.		// Result is a load from the stack slot.
if (SlotSize == DestSize)		if (SlotSize == DestSize)
return DAG.getLoad(DestVT, dl, Store, FIPtr, PtrInfo, DestAlign);		return DAG.getLoad(DestVT, dl, Store, FIPtr, PtrInfo, DestAlign);

assert(SlotSize < DestSize && "Unknown extension!");		assert(SlotSize < DestSize && "Unknown extension!");
return DAG.getExtLoad(ISD::EXTLOAD, dl, DestVT, Store, FIPtr, PtrInfo, SlotVT,		return DAG.getExtLoad(ISD::EXTLOAD, dl, DestVT, Store, FIPtr, PtrInfo, SlotVT,
DestAlign);		DestAlign);
}		}

SDValue SelectionDAGLegalize::ExpandSCALAR_TO_VECTOR(SDNode *Node) {		SDValue SelectionDAGLegalize::ExpandSCALAR_TO_VECTOR(SDNode *Node) {
SDLoc dl(Node);		SDLoc dl(Node);
// Create a vector sized/aligned stack slot, store the value to element #0,		// Create a vector sized/aligned stack slot, store the value to element #0,
▲ Show 20 Lines • Show All 987 Lines • ▼ Show 20 Lines	case ISD::UNDEF: {
if (VT.isInteger())		if (VT.isInteger())
Results.push_back(DAG.getConstant(0, dl, VT));		Results.push_back(DAG.getConstant(0, dl, VT));
else {		else {
assert(VT.isFloatingPoint() && "Unknown value type!");		assert(VT.isFloatingPoint() && "Unknown value type!");
Results.push_back(DAG.getConstantFP(0, dl, VT));		Results.push_back(DAG.getConstantFP(0, dl, VT));
}		}
break;		break;
}		}
		case ISD::STRICT_FP_ROUND:
		Tmp1 = EmitStackConvert(Node->getOperand(1),
		Node->getValueType(0),
		Node->getValueType(0), dl, Node->getOperand(0));
		ReplaceNode(Node, Tmp1.getNode());
		LLVM_DEBUG(dbgs() << "Successfully expanded STRICT_FP_ROUND node\n");
		return true;
case ISD::FP_ROUND:		case ISD::FP_ROUND:
case ISD::BITCAST:		case ISD::BITCAST:
Tmp1 = EmitStackConvert(Node->getOperand(0), Node->getValueType(0),		Tmp1 = EmitStackConvert(Node->getOperand(0),
		Node->getValueType(0),
Node->getValueType(0), dl);		Node->getValueType(0), dl);
Results.push_back(Tmp1);		Results.push_back(Tmp1);
break;		break;
		case ISD::STRICT_FP_EXTEND:
		Tmp1 = EmitStackConvert(Node->getOperand(1),
		Node->getOperand(1).getValueType(),
		Node->getValueType(0), dl, Node->getOperand(0));
		ReplaceNode(Node, Tmp1.getNode());
		LLVM_DEBUG(dbgs() << "Successfully expanded STRICT_FP_EXTEND node\n");
		return true;
case ISD::FP_EXTEND:		case ISD::FP_EXTEND:
Tmp1 = EmitStackConvert(Node->getOperand(0),		Tmp1 = EmitStackConvert(Node->getOperand(0),
Node->getOperand(0).getValueType(),		Node->getOperand(0).getValueType(),
Node->getValueType(0), dl);		Node->getValueType(0), dl);
Results.push_back(Tmp1);		Results.push_back(Tmp1);
break;		break;
case ISD::SIGN_EXTEND_INREG: {		case ISD::SIGN_EXTEND_INREG: {
EVT ExtraVT = cast<VTSDNode>(Node->getOperand(1))->getVT();		EVT ExtraVT = cast<VTSDNode>(Node->getOperand(1))->getVT();
▲ Show 20 Lines • Show All 1,781 Lines • Show Last 20 Lines

llvm/trunk/lib/CodeGen/SelectionDAG/LegalizeTypes.h

Show First 20 Lines • Show All 681 Lines • ▼ Show 20 Lines	private:
SDValue ScalarizeVecRes_OverflowOp(SDNode *N, unsigned ResNo);		SDValue ScalarizeVecRes_OverflowOp(SDNode *N, unsigned ResNo);
SDValue ScalarizeVecRes_InregOp(SDNode *N);		SDValue ScalarizeVecRes_InregOp(SDNode *N);
SDValue ScalarizeVecRes_VecInregOp(SDNode *N);		SDValue ScalarizeVecRes_VecInregOp(SDNode *N);

SDValue ScalarizeVecRes_BITCAST(SDNode *N);		SDValue ScalarizeVecRes_BITCAST(SDNode *N);
SDValue ScalarizeVecRes_BUILD_VECTOR(SDNode *N);		SDValue ScalarizeVecRes_BUILD_VECTOR(SDNode *N);
SDValue ScalarizeVecRes_EXTRACT_SUBVECTOR(SDNode *N);		SDValue ScalarizeVecRes_EXTRACT_SUBVECTOR(SDNode *N);
SDValue ScalarizeVecRes_FP_ROUND(SDNode *N);		SDValue ScalarizeVecRes_FP_ROUND(SDNode *N);
		SDValue ScalarizeVecRes_STRICT_FP_ROUND(SDNode *N);
SDValue ScalarizeVecRes_FPOWI(SDNode *N);		SDValue ScalarizeVecRes_FPOWI(SDNode *N);
SDValue ScalarizeVecRes_INSERT_VECTOR_ELT(SDNode *N);		SDValue ScalarizeVecRes_INSERT_VECTOR_ELT(SDNode *N);
SDValue ScalarizeVecRes_LOAD(LoadSDNode *N);		SDValue ScalarizeVecRes_LOAD(LoadSDNode *N);
SDValue ScalarizeVecRes_SCALAR_TO_VECTOR(SDNode *N);		SDValue ScalarizeVecRes_SCALAR_TO_VECTOR(SDNode *N);
SDValue ScalarizeVecRes_VSELECT(SDNode *N);		SDValue ScalarizeVecRes_VSELECT(SDNode *N);
SDValue ScalarizeVecRes_SELECT(SDNode *N);		SDValue ScalarizeVecRes_SELECT(SDNode *N);
SDValue ScalarizeVecRes_SELECT_CC(SDNode *N);		SDValue ScalarizeVecRes_SELECT_CC(SDNode *N);
SDValue ScalarizeVecRes_SETCC(SDNode *N);		SDValue ScalarizeVecRes_SETCC(SDNode *N);
SDValue ScalarizeVecRes_UNDEF(SDNode *N);		SDValue ScalarizeVecRes_UNDEF(SDNode *N);
SDValue ScalarizeVecRes_VECTOR_SHUFFLE(SDNode *N);		SDValue ScalarizeVecRes_VECTOR_SHUFFLE(SDNode *N);

SDValue ScalarizeVecRes_MULFIX(SDNode *N);		SDValue ScalarizeVecRes_MULFIX(SDNode *N);

// Vector Operand Scalarization: <1 x ty> -> ty.		// Vector Operand Scalarization: <1 x ty> -> ty.
bool ScalarizeVectorOperand(SDNode *N, unsigned OpNo);		bool ScalarizeVectorOperand(SDNode *N, unsigned OpNo);
SDValue ScalarizeVecOp_BITCAST(SDNode *N);		SDValue ScalarizeVecOp_BITCAST(SDNode *N);
SDValue ScalarizeVecOp_UnaryOp(SDNode *N);		SDValue ScalarizeVecOp_UnaryOp(SDNode *N);
SDValue ScalarizeVecOp_CONCAT_VECTORS(SDNode *N);		SDValue ScalarizeVecOp_CONCAT_VECTORS(SDNode *N);
SDValue ScalarizeVecOp_EXTRACT_VECTOR_ELT(SDNode *N);		SDValue ScalarizeVecOp_EXTRACT_VECTOR_ELT(SDNode *N);
SDValue ScalarizeVecOp_VSELECT(SDNode *N);		SDValue ScalarizeVecOp_VSELECT(SDNode *N);
SDValue ScalarizeVecOp_VSETCC(SDNode *N);		SDValue ScalarizeVecOp_VSETCC(SDNode *N);
SDValue ScalarizeVecOp_STORE(StoreSDNode *N, unsigned OpNo);		SDValue ScalarizeVecOp_STORE(StoreSDNode *N, unsigned OpNo);
SDValue ScalarizeVecOp_FP_ROUND(SDNode *N, unsigned OpNo);		SDValue ScalarizeVecOp_FP_ROUND(SDNode *N, unsigned OpNo);
		SDValue ScalarizeVecOp_STRICT_FP_ROUND(SDNode *N, unsigned OpNo);
SDValue ScalarizeVecOp_VECREDUCE(SDNode *N);		SDValue ScalarizeVecOp_VECREDUCE(SDNode *N);

//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
// Vector Splitting Support: LegalizeVectorTypes.cpp		// Vector Splitting Support: LegalizeVectorTypes.cpp
//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//

/// Given a processed vector Op which was split into vectors of half the size,		/// Given a processed vector Op which was split into vectors of half the size,
/// this method returns the halves. The first elements of Op coincide with the		/// this method returns the halves. The first elements of Op coincide with the
▲ Show 20 Lines • Show All 94 Lines • ▼ Show 20 Lines	private:
SDValue WidenVecRes_VECTOR_SHUFFLE(ShuffleVectorSDNode *N);		SDValue WidenVecRes_VECTOR_SHUFFLE(ShuffleVectorSDNode *N);

SDValue WidenVecRes_Ternary(SDNode *N);		SDValue WidenVecRes_Ternary(SDNode *N);
SDValue WidenVecRes_Binary(SDNode *N);		SDValue WidenVecRes_Binary(SDNode *N);
SDValue WidenVecRes_BinaryCanTrap(SDNode *N);		SDValue WidenVecRes_BinaryCanTrap(SDNode *N);
SDValue WidenVecRes_StrictFP(SDNode *N);		SDValue WidenVecRes_StrictFP(SDNode *N);
SDValue WidenVecRes_OverflowOp(SDNode *N, unsigned ResNo);		SDValue WidenVecRes_OverflowOp(SDNode *N, unsigned ResNo);
SDValue WidenVecRes_Convert(SDNode *N);		SDValue WidenVecRes_Convert(SDNode *N);
		SDValue WidenVecRes_Convert_StrictFP(SDNode *N);
SDValue WidenVecRes_FCOPYSIGN(SDNode *N);		SDValue WidenVecRes_FCOPYSIGN(SDNode *N);
SDValue WidenVecRes_POWI(SDNode *N);		SDValue WidenVecRes_POWI(SDNode *N);
SDValue WidenVecRes_Shift(SDNode *N);		SDValue WidenVecRes_Shift(SDNode *N);
SDValue WidenVecRes_Unary(SDNode *N);		SDValue WidenVecRes_Unary(SDNode *N);
SDValue WidenVecRes_InregOp(SDNode *N);		SDValue WidenVecRes_InregOp(SDNode *N);

// Widen Vector Operand.		// Widen Vector Operand.
bool WidenVectorOperand(SDNode *N, unsigned OpNo);		bool WidenVectorOperand(SDNode *N, unsigned OpNo);
▲ Show 20 Lines • Show All 130 Lines • Show Last 20 Lines

llvm/trunk/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp

Show First 20 Lines • Show All 325 Lines • ▼ Show 20 Lines	SDValue VectorLegalizer::LegalizeOp(SDValue Op) {
case ISD::STRICT_FRINT:		case ISD::STRICT_FRINT:
case ISD::STRICT_FNEARBYINT:		case ISD::STRICT_FNEARBYINT:
case ISD::STRICT_FMAXNUM:		case ISD::STRICT_FMAXNUM:
case ISD::STRICT_FMINNUM:		case ISD::STRICT_FMINNUM:
case ISD::STRICT_FCEIL:		case ISD::STRICT_FCEIL:
case ISD::STRICT_FFLOOR:		case ISD::STRICT_FFLOOR:
case ISD::STRICT_FROUND:		case ISD::STRICT_FROUND:
case ISD::STRICT_FTRUNC:		case ISD::STRICT_FTRUNC:
		case ISD::STRICT_FP_ROUND:
		case ISD::STRICT_FP_EXTEND:
// These pseudo-ops get legalized as if they were their non-strict		// These pseudo-ops get legalized as if they were their non-strict
// equivalent. For instance, if ISD::FSQRT is legal then ISD::STRICT_FSQRT		// equivalent. For instance, if ISD::FSQRT is legal then ISD::STRICT_FSQRT
// is also legal, but if ISD::FSQRT requires expansion then so does		// is also legal, but if ISD::FSQRT requires expansion then so does
// ISD::STRICT_FSQRT.		// ISD::STRICT_FSQRT.
Action = TLI.getStrictFPOperationAction(Node->getOpcode(),		Action = TLI.getStrictFPOperationAction(Node->getOpcode(),
Node->getValueType(0));		Node->getValueType(0));
break;		break;
case ISD::ADD:		case ISD::ADD:
▲ Show 20 Lines • Show All 954 Lines • ▼ Show 20 Lines	for (unsigned i = 0; i < NumElems; ++i) {

// Now process the remaining operands.		// Now process the remaining operands.
for (unsigned j = 1; j < NumOpers; ++j) {		for (unsigned j = 1; j < NumOpers; ++j) {
SDValue Oper = Op.getOperand(j);		SDValue Oper = Op.getOperand(j);
EVT OperVT = Oper.getValueType();		EVT OperVT = Oper.getValueType();

if (OperVT.isVector())		if (OperVT.isVector())
Oper = DAG.getNode(ISD::EXTRACT_VECTOR_ELT, dl,		Oper = DAG.getNode(ISD::EXTRACT_VECTOR_ELT, dl,
EltVT, Oper, Idx);		OperVT.getVectorElementType(), Oper, Idx);

Opers.push_back(Oper);		Opers.push_back(Oper);
}		}

SDValue ScalarOp = DAG.getNode(Op->getOpcode(), dl, ValueVTs, Opers);		SDValue ScalarOp = DAG.getNode(Op->getOpcode(), dl, ValueVTs, Opers);

OpValues.push_back(ScalarOp.getValue(0));		OpValues.push_back(ScalarOp.getValue(0));
OpChains.push_back(ScalarOp.getValue(1));		OpChains.push_back(ScalarOp.getValue(1));
▲ Show 20 Lines • Show All 41 Lines • Show Last 20 Lines

llvm/trunk/lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp

Show First 20 Lines • Show All 44 Lines • ▼ Show 20 Lines
#endif		#endif
report_fatal_error("Do not know how to scalarize the result of this "		report_fatal_error("Do not know how to scalarize the result of this "
"operator!\n");		"operator!\n");

case ISD::MERGE_VALUES: R = ScalarizeVecRes_MERGE_VALUES(N, ResNo);break;		case ISD::MERGE_VALUES: R = ScalarizeVecRes_MERGE_VALUES(N, ResNo);break;
case ISD::BITCAST: R = ScalarizeVecRes_BITCAST(N); break;		case ISD::BITCAST: R = ScalarizeVecRes_BITCAST(N); break;
case ISD::BUILD_VECTOR: R = ScalarizeVecRes_BUILD_VECTOR(N); break;		case ISD::BUILD_VECTOR: R = ScalarizeVecRes_BUILD_VECTOR(N); break;
case ISD::EXTRACT_SUBVECTOR: R = ScalarizeVecRes_EXTRACT_SUBVECTOR(N); break;		case ISD::EXTRACT_SUBVECTOR: R = ScalarizeVecRes_EXTRACT_SUBVECTOR(N); break;
		case ISD::STRICT_FP_ROUND: R = ScalarizeVecRes_STRICT_FP_ROUND(N); break;
case ISD::FP_ROUND: R = ScalarizeVecRes_FP_ROUND(N); break;		case ISD::FP_ROUND: R = ScalarizeVecRes_FP_ROUND(N); break;
case ISD::FP_ROUND_INREG: R = ScalarizeVecRes_InregOp(N); break;		case ISD::FP_ROUND_INREG: R = ScalarizeVecRes_InregOp(N); break;
case ISD::FPOWI: R = ScalarizeVecRes_FPOWI(N); break;		case ISD::FPOWI: R = ScalarizeVecRes_FPOWI(N); break;
case ISD::INSERT_VECTOR_ELT: R = ScalarizeVecRes_INSERT_VECTOR_ELT(N); break;		case ISD::INSERT_VECTOR_ELT: R = ScalarizeVecRes_INSERT_VECTOR_ELT(N); break;
case ISD::LOAD: R = ScalarizeVecRes_LOAD(cast<LoadSDNode>(N));break;		case ISD::LOAD: R = ScalarizeVecRes_LOAD(cast<LoadSDNode>(N));break;
case ISD::SCALAR_TO_VECTOR: R = ScalarizeVecRes_SCALAR_TO_VECTOR(N); break;		case ISD::SCALAR_TO_VECTOR: R = ScalarizeVecRes_SCALAR_TO_VECTOR(N); break;
case ISD::SIGN_EXTEND_INREG: R = ScalarizeVecRes_InregOp(N); break;		case ISD::SIGN_EXTEND_INREG: R = ScalarizeVecRes_InregOp(N); break;
case ISD::VSELECT: R = ScalarizeVecRes_VSELECT(N); break;		case ISD::VSELECT: R = ScalarizeVecRes_VSELECT(N); break;
▲ Show 20 Lines • Show All 104 Lines • ▼ Show 20 Lines	#endif
case ISD::STRICT_FRINT:		case ISD::STRICT_FRINT:
case ISD::STRICT_FNEARBYINT:		case ISD::STRICT_FNEARBYINT:
case ISD::STRICT_FMAXNUM:		case ISD::STRICT_FMAXNUM:
case ISD::STRICT_FMINNUM:		case ISD::STRICT_FMINNUM:
case ISD::STRICT_FCEIL:		case ISD::STRICT_FCEIL:
case ISD::STRICT_FFLOOR:		case ISD::STRICT_FFLOOR:
case ISD::STRICT_FROUND:		case ISD::STRICT_FROUND:
case ISD::STRICT_FTRUNC:		case ISD::STRICT_FTRUNC:
		case ISD::STRICT_FP_EXTEND:
R = ScalarizeVecRes_StrictFPOp(N);		R = ScalarizeVecRes_StrictFPOp(N);
break;		break;
case ISD::UADDO:		case ISD::UADDO:
case ISD::SADDO:		case ISD::SADDO:
case ISD::USUBO:		case ISD::USUBO:
case ISD::SSUBO:		case ISD::SSUBO:
case ISD::UMULO:		case ISD::UMULO:
case ISD::SMULO:		case ISD::SMULO:
▲ Show 20 Lines • Show All 135 Lines • ▼ Show 20 Lines

SDValue DAGTypeLegalizer::ScalarizeVecRes_FP_ROUND(SDNode *N) {		SDValue DAGTypeLegalizer::ScalarizeVecRes_FP_ROUND(SDNode *N) {
EVT NewVT = N->getValueType(0).getVectorElementType();		EVT NewVT = N->getValueType(0).getVectorElementType();
SDValue Op = GetScalarizedVector(N->getOperand(0));		SDValue Op = GetScalarizedVector(N->getOperand(0));
return DAG.getNode(ISD::FP_ROUND, SDLoc(N),		return DAG.getNode(ISD::FP_ROUND, SDLoc(N),
NewVT, Op, N->getOperand(1));		NewVT, Op, N->getOperand(1));
}		}

		SDValue DAGTypeLegalizer::ScalarizeVecRes_STRICT_FP_ROUND(SDNode *N) {
		EVT NewVT = N->getValueType(0).getVectorElementType();
		SDValue Op = GetScalarizedVector(N->getOperand(1));
		SDValue Res = DAG.getNode(ISD::STRICT_FP_ROUND, SDLoc(N),
		{ NewVT, MVT::Other },
		{ N->getOperand(0), Op, N->getOperand(2) });
		// Legalize the chain result - switch anything that used the old chain to
		// use the new one.
		ReplaceValueWith(SDValue(N, 1), Res.getValue(1));
		return Res;
		}

SDValue DAGTypeLegalizer::ScalarizeVecRes_FPOWI(SDNode *N) {		SDValue DAGTypeLegalizer::ScalarizeVecRes_FPOWI(SDNode *N) {
SDValue Op = GetScalarizedVector(N->getOperand(0));		SDValue Op = GetScalarizedVector(N->getOperand(0));
return DAG.getNode(ISD::FPOWI, SDLoc(N),		return DAG.getNode(ISD::FPOWI, SDLoc(N),
Op.getValueType(), Op, N->getOperand(1));		Op.getValueType(), Op, N->getOperand(1));
}		}

SDValue DAGTypeLegalizer::ScalarizeVecRes_INSERT_VECTOR_ELT(SDNode *N) {		SDValue DAGTypeLegalizer::ScalarizeVecRes_INSERT_VECTOR_ELT(SDNode *N) {
// The value to insert may have a wider type than the vector element type,		// The value to insert may have a wider type than the vector element type,
▲ Show 20 Lines • Show All 267 Lines • ▼ Show 20 Lines	case ISD::VSELECT:
Res = ScalarizeVecOp_VSELECT(N);		Res = ScalarizeVecOp_VSELECT(N);
break;		break;
case ISD::SETCC:		case ISD::SETCC:
Res = ScalarizeVecOp_VSETCC(N);		Res = ScalarizeVecOp_VSETCC(N);
break;		break;
case ISD::STORE:		case ISD::STORE:
Res = ScalarizeVecOp_STORE(cast<StoreSDNode>(N), OpNo);		Res = ScalarizeVecOp_STORE(cast<StoreSDNode>(N), OpNo);
break;		break;
		case ISD::STRICT_FP_ROUND:
		Res = ScalarizeVecOp_STRICT_FP_ROUND(N, OpNo);
		break;
case ISD::FP_ROUND:		case ISD::FP_ROUND:
Res = ScalarizeVecOp_FP_ROUND(N, OpNo);		Res = ScalarizeVecOp_FP_ROUND(N, OpNo);
break;		break;
case ISD::VECREDUCE_FADD:		case ISD::VECREDUCE_FADD:
case ISD::VECREDUCE_FMUL:		case ISD::VECREDUCE_FMUL:
case ISD::VECREDUCE_ADD:		case ISD::VECREDUCE_ADD:
case ISD::VECREDUCE_MUL:		case ISD::VECREDUCE_MUL:
case ISD::VECREDUCE_AND:		case ISD::VECREDUCE_AND:
▲ Show 20 Lines • Show All 132 Lines • ▼ Show 20 Lines
SDValue DAGTypeLegalizer::ScalarizeVecOp_FP_ROUND(SDNode *N, unsigned OpNo) {		SDValue DAGTypeLegalizer::ScalarizeVecOp_FP_ROUND(SDNode *N, unsigned OpNo) {
SDValue Elt = GetScalarizedVector(N->getOperand(0));		SDValue Elt = GetScalarizedVector(N->getOperand(0));
SDValue Res = DAG.getNode(ISD::FP_ROUND, SDLoc(N),		SDValue Res = DAG.getNode(ISD::FP_ROUND, SDLoc(N),
N->getValueType(0).getVectorElementType(), Elt,		N->getValueType(0).getVectorElementType(), Elt,
N->getOperand(1));		N->getOperand(1));
return DAG.getNode(ISD::SCALAR_TO_VECTOR, SDLoc(N), N->getValueType(0), Res);		return DAG.getNode(ISD::SCALAR_TO_VECTOR, SDLoc(N), N->getValueType(0), Res);
}		}

		SDValue DAGTypeLegalizer::ScalarizeVecOp_STRICT_FP_ROUND(SDNode *N,
		unsigned OpNo) {
		assert(OpNo == 1 && "Wrong operand for scalarization!");
		SDValue Elt = GetScalarizedVector(N->getOperand(1));
		SDValue Res = DAG.getNode(ISD::STRICT_FP_ROUND, SDLoc(N),
		{ N->getValueType(0).getVectorElementType(),
		MVT::Other },
		{ N->getOperand(0), Elt, N->getOperand(2) });
		// Legalize the chain result - switch anything that used the old chain to
		// use the new one.
		ReplaceValueWith(SDValue(N, 1), Res.getValue(1));
		return DAG.getNode(ISD::SCALAR_TO_VECTOR, SDLoc(N), N->getValueType(0), Res);
		}

SDValue DAGTypeLegalizer::ScalarizeVecOp_VECREDUCE(SDNode *N) {		SDValue DAGTypeLegalizer::ScalarizeVecOp_VECREDUCE(SDNode *N) {
SDValue Res = GetScalarizedVector(N->getOperand(0));		SDValue Res = GetScalarizedVector(N->getOperand(0));
// Result type may be wider than element type.		// Result type may be wider than element type.
if (Res.getValueType() != N->getValueType(0))		if (Res.getValueType() != N->getValueType(0))
Res = DAG.getNode(ISD::ANY_EXTEND, SDLoc(N), N->getValueType(0), Res);		Res = DAG.getNode(ISD::ANY_EXTEND, SDLoc(N), N->getValueType(0), Res);
return Res;		return Res;
}		}

▲ Show 20 Lines • Show All 76 Lines • ▼ Show 20 Lines	#endif
case ISD::FEXP2:		case ISD::FEXP2:
case ISD::FFLOOR:		case ISD::FFLOOR:
case ISD::FLOG:		case ISD::FLOG:
case ISD::FLOG10:		case ISD::FLOG10:
case ISD::FLOG2:		case ISD::FLOG2:
case ISD::FNEARBYINT:		case ISD::FNEARBYINT:
case ISD::FNEG:		case ISD::FNEG:
case ISD::FP_EXTEND:		case ISD::FP_EXTEND:
		case ISD::STRICT_FP_EXTEND:
case ISD::FP_ROUND:		case ISD::FP_ROUND:
		case ISD::STRICT_FP_ROUND:
case ISD::FP_TO_SINT:		case ISD::FP_TO_SINT:
case ISD::FP_TO_UINT:		case ISD::FP_TO_UINT:
case ISD::FRINT:		case ISD::FRINT:
case ISD::FROUND:		case ISD::FROUND:
case ISD::FSIN:		case ISD::FSIN:
case ISD::FSQRT:		case ISD::FSQRT:
case ISD::FTRUNC:		case ISD::FTRUNC:
case ISD::SINT_TO_FP:		case ISD::SINT_TO_FP:
▲ Show 20 Lines • Show All 754 Lines • ▼ Show 20 Lines	void DAGTypeLegalizer::SplitVecRes_UnaryOp(SDNode *N, SDValue &Lo,
SDValue &Hi) {		SDValue &Hi) {
// Get the dest types - they may not match the input types, e.g. int_to_fp.		// Get the dest types - they may not match the input types, e.g. int_to_fp.
EVT LoVT, HiVT;		EVT LoVT, HiVT;
SDLoc dl(N);		SDLoc dl(N);
std::tie(LoVT, HiVT) = DAG.GetSplitDestVTs(N->getValueType(0));		std::tie(LoVT, HiVT) = DAG.GetSplitDestVTs(N->getValueType(0));

// If the input also splits, handle it directly for a compile time speedup.		// If the input also splits, handle it directly for a compile time speedup.
// Otherwise split it by hand.		// Otherwise split it by hand.
EVT InVT = N->getOperand(0).getValueType();		unsigned OpNo = N->isStrictFPOpcode() ? 1 : 0;
		EVT InVT = N->getOperand(OpNo).getValueType();
if (getTypeAction(InVT) == TargetLowering::TypeSplitVector)		if (getTypeAction(InVT) == TargetLowering::TypeSplitVector)
GetSplitVector(N->getOperand(0), Lo, Hi);		GetSplitVector(N->getOperand(OpNo), Lo, Hi);
else		else
std::tie(Lo, Hi) = DAG.SplitVectorOperand(N, 0);		std::tie(Lo, Hi) = DAG.SplitVectorOperand(N, OpNo);

if (N->getOpcode() == ISD::FP_ROUND) {		if (N->getOpcode() == ISD::FP_ROUND) {
Lo = DAG.getNode(N->getOpcode(), dl, LoVT, Lo, N->getOperand(1));		Lo = DAG.getNode(N->getOpcode(), dl, LoVT, Lo, N->getOperand(1));
Hi = DAG.getNode(N->getOpcode(), dl, HiVT, Hi, N->getOperand(1));		Hi = DAG.getNode(N->getOpcode(), dl, HiVT, Hi, N->getOperand(1));
		} else if (N->getOpcode() == ISD::STRICT_FP_ROUND) {
		Lo = DAG.getNode(N->getOpcode(), dl, { LoVT, MVT::Other },
		{ N->getOperand(0), Lo, N->getOperand(2) });
		Hi = DAG.getNode(N->getOpcode(), dl, { HiVT, MVT::Other },
		{ N->getOperand(0), Hi, N->getOperand(2) });
		SDValue NewChain = DAG.getNode(ISD::TokenFactor, dl, MVT::Other,
		Lo.getValue(1), Hi.getValue(1));
		ReplaceValueWith(SDValue(N, 1), NewChain);
		} else if (N->isStrictFPOpcode()) {
		Lo = DAG.getNode(N->getOpcode(), dl, { LoVT, MVT::Other },
		{ N->getOperand(0), Lo });
		Hi = DAG.getNode(N->getOpcode(), dl, { HiVT, MVT::Other },
		{ N->getOperand(0), Hi });
		// Legalize the chain result - switch anything that used the old chain to
		// use the new one.
		SDValue NewChain = DAG.getNode(ISD::TokenFactor, dl, MVT::Other,
		Lo.getValue(1), Hi.getValue(1));
		ReplaceValueWith(SDValue(N, 1), NewChain);
} else {		} else {
Lo = DAG.getNode(N->getOpcode(), dl, LoVT, Lo);		Lo = DAG.getNode(N->getOpcode(), dl, LoVT, Lo);
Hi = DAG.getNode(N->getOpcode(), dl, HiVT, Hi);		Hi = DAG.getNode(N->getOpcode(), dl, HiVT, Hi);
}		}
}		}

void DAGTypeLegalizer::SplitVecRes_ExtendOp(SDNode *N, SDValue &Lo,		void DAGTypeLegalizer::SplitVecRes_ExtendOp(SDNode *N, SDValue &Lo,
SDValue &Hi) {		SDValue &Hi) {
▲ Show 20 Lines • Show All 184 Lines • ▼ Show 20 Lines	#endif
case ISD::SETCC: Res = SplitVecOp_VSETCC(N); break;		case ISD::SETCC: Res = SplitVecOp_VSETCC(N); break;
case ISD::BITCAST: Res = SplitVecOp_BITCAST(N); break;		case ISD::BITCAST: Res = SplitVecOp_BITCAST(N); break;
case ISD::EXTRACT_SUBVECTOR: Res = SplitVecOp_EXTRACT_SUBVECTOR(N); break;		case ISD::EXTRACT_SUBVECTOR: Res = SplitVecOp_EXTRACT_SUBVECTOR(N); break;
case ISD::EXTRACT_VECTOR_ELT:Res = SplitVecOp_EXTRACT_VECTOR_ELT(N); break;		case ISD::EXTRACT_VECTOR_ELT:Res = SplitVecOp_EXTRACT_VECTOR_ELT(N); break;
case ISD::CONCAT_VECTORS: Res = SplitVecOp_CONCAT_VECTORS(N); break;		case ISD::CONCAT_VECTORS: Res = SplitVecOp_CONCAT_VECTORS(N); break;
case ISD::TRUNCATE:		case ISD::TRUNCATE:
Res = SplitVecOp_TruncateHelper(N);		Res = SplitVecOp_TruncateHelper(N);
break;		break;
		case ISD::STRICT_FP_ROUND:
case ISD::FP_ROUND: Res = SplitVecOp_FP_ROUND(N); break;		case ISD::FP_ROUND: Res = SplitVecOp_FP_ROUND(N); break;
case ISD::FCOPYSIGN: Res = SplitVecOp_FCOPYSIGN(N); break;		case ISD::FCOPYSIGN: Res = SplitVecOp_FCOPYSIGN(N); break;
case ISD::STORE:		case ISD::STORE:
Res = SplitVecOp_STORE(cast<StoreSDNode>(N), OpNo);		Res = SplitVecOp_STORE(cast<StoreSDNode>(N), OpNo);
break;		break;
case ISD::MSTORE:		case ISD::MSTORE:
Res = SplitVecOp_MSTORE(cast<MaskedStoreSDNode>(N), OpNo);		Res = SplitVecOp_MSTORE(cast<MaskedStoreSDNode>(N), OpNo);
break;		break;
Show All 13 Lines	case ISD::UINT_TO_FP:
else		else
Res = SplitVecOp_UnaryOp(N);		Res = SplitVecOp_UnaryOp(N);
break;		break;
case ISD::FP_TO_SINT:		case ISD::FP_TO_SINT:
case ISD::FP_TO_UINT:		case ISD::FP_TO_UINT:
case ISD::CTTZ:		case ISD::CTTZ:
case ISD::CTLZ:		case ISD::CTLZ:
case ISD::CTPOP:		case ISD::CTPOP:
		case ISD::STRICT_FP_EXTEND:
case ISD::FP_EXTEND:		case ISD::FP_EXTEND:
case ISD::SIGN_EXTEND:		case ISD::SIGN_EXTEND:
case ISD::ZERO_EXTEND:		case ISD::ZERO_EXTEND:
case ISD::ANY_EXTEND:		case ISD::ANY_EXTEND:
case ISD::FTRUNC:		case ISD::FTRUNC:
case ISD::FCANONICALIZE:		case ISD::FCANONICALIZE:
Res = SplitVecOp_UnaryOp(N);		Res = SplitVecOp_UnaryOp(N);
break;		break;
Show All 25 Lines	#endif
// If the result is null, the sub-method took care of registering results etc.		// If the result is null, the sub-method took care of registering results etc.
if (!Res.getNode()) return false;		if (!Res.getNode()) return false;

// If the result is N, the sub-method updated N in place. Tell the legalizer		// If the result is N, the sub-method updated N in place. Tell the legalizer
// core about this.		// core about this.
if (Res.getNode() == N)		if (Res.getNode() == N)
return true;		return true;

		if (N->isStrictFPOpcode())
		assert(Res.getValueType() == N->getValueType(0) && N->getNumValues() == 2 &&
		"Invalid operand expansion");
		else
assert(Res.getValueType() == N->getValueType(0) && N->getNumValues() == 1 &&		assert(Res.getValueType() == N->getValueType(0) && N->getNumValues() == 1 &&
"Invalid operand expansion");		"Invalid operand expansion");

ReplaceValueWith(SDValue(N, 0), Res);		ReplaceValueWith(SDValue(N, 0), Res);
return false;		return false;
}		}

SDValue DAGTypeLegalizer::SplitVecOp_VSELECT(SDNode *N, unsigned OpNo) {		SDValue DAGTypeLegalizer::SplitVecOp_VSELECT(SDNode *N, unsigned OpNo) {
// The only possibility for an illegal operand is the mask, since result type		// The only possibility for an illegal operand is the mask, since result type
▲ Show 20 Lines • Show All 71 Lines • ▼ Show 20 Lines	SDValue DAGTypeLegalizer::SplitVecOp_VECREDUCE(SDNode *N, unsigned OpNo) {
return DAG.getNode(N->getOpcode(), dl, ResVT, Partial, N->getFlags());		return DAG.getNode(N->getOpcode(), dl, ResVT, Partial, N->getFlags());
}		}

SDValue DAGTypeLegalizer::SplitVecOp_UnaryOp(SDNode *N) {		SDValue DAGTypeLegalizer::SplitVecOp_UnaryOp(SDNode *N) {
// The result has a legal vector type, but the input needs splitting.		// The result has a legal vector type, but the input needs splitting.
EVT ResVT = N->getValueType(0);		EVT ResVT = N->getValueType(0);
SDValue Lo, Hi;		SDValue Lo, Hi;
SDLoc dl(N);		SDLoc dl(N);
GetSplitVector(N->getOperand(0), Lo, Hi);		GetSplitVector(N->getOperand(N->isStrictFPOpcode() ? 1 : 0), Lo, Hi);
EVT InVT = Lo.getValueType();		EVT InVT = Lo.getValueType();

EVT OutVT = EVT::getVectorVT(*DAG.getContext(), ResVT.getVectorElementType(),		EVT OutVT = EVT::getVectorVT(*DAG.getContext(), ResVT.getVectorElementType(),
InVT.getVectorNumElements());		InVT.getVectorNumElements());

		if (N->isStrictFPOpcode()) {
		Lo = DAG.getNode(N->getOpcode(), dl, { OutVT, MVT::Other },
		{ N->getOperand(0), Lo });
		Hi = DAG.getNode(N->getOpcode(), dl, { OutVT, MVT::Other },
		{ N->getOperand(0), Hi });

		// Build a factor node to remember that this operation is independent
		// of the other one.
		SDValue Ch = DAG.getNode(ISD::TokenFactor, dl, MVT::Other, Lo.getValue(1),
		Hi.getValue(1));

		// Legalize the chain result - switch anything that used the old chain to
		// use the new one.
		ReplaceValueWith(SDValue(N, 1), Ch);
		} else {
Lo = DAG.getNode(N->getOpcode(), dl, OutVT, Lo);		Lo = DAG.getNode(N->getOpcode(), dl, OutVT, Lo);
Hi = DAG.getNode(N->getOpcode(), dl, OutVT, Hi);		Hi = DAG.getNode(N->getOpcode(), dl, OutVT, Hi);
		}

return DAG.getNode(ISD::CONCAT_VECTORS, dl, ResVT, Lo, Hi);		return DAG.getNode(ISD::CONCAT_VECTORS, dl, ResVT, Lo, Hi);
}		}

SDValue DAGTypeLegalizer::SplitVecOp_BITCAST(SDNode *N) {		SDValue DAGTypeLegalizer::SplitVecOp_BITCAST(SDNode *N) {
// For example, i64 = BITCAST v4i16 on alpha. Typically the vector will		// For example, i64 = BITCAST v4i16 on alpha. Typically the vector will
// end up being split all the way down to individual components. Convert the		// end up being split all the way down to individual components. Convert the
// split pieces into integers and reassemble.		// split pieces into integers and reassemble.
▲ Show 20 Lines • Show All 455 Lines • ▼ Show 20 Lines
}		}


SDValue DAGTypeLegalizer::SplitVecOp_FP_ROUND(SDNode *N) {		SDValue DAGTypeLegalizer::SplitVecOp_FP_ROUND(SDNode *N) {
// The result has a legal vector type, but the input needs splitting.		// The result has a legal vector type, but the input needs splitting.
EVT ResVT = N->getValueType(0);		EVT ResVT = N->getValueType(0);
SDValue Lo, Hi;		SDValue Lo, Hi;
SDLoc DL(N);		SDLoc DL(N);
GetSplitVector(N->getOperand(0), Lo, Hi);		GetSplitVector(N->getOperand(N->isStrictFPOpcode() ? 1 : 0), Lo, Hi);
EVT InVT = Lo.getValueType();		EVT InVT = Lo.getValueType();

EVT OutVT = EVT::getVectorVT(*DAG.getContext(), ResVT.getVectorElementType(),		EVT OutVT = EVT::getVectorVT(*DAG.getContext(), ResVT.getVectorElementType(),
InVT.getVectorNumElements());		InVT.getVectorNumElements());

		if (N->isStrictFPOpcode()) {
		Lo = DAG.getNode(N->getOpcode(), DL, { OutVT, MVT::Other },
		{ N->getOperand(0), Lo, N->getOperand(2) });
		Hi = DAG.getNode(N->getOpcode(), DL, { OutVT, MVT::Other },
		{ N->getOperand(0), Hi, N->getOperand(2) });
		// Legalize the chain result - switch anything that used the old chain to
		// use the new one.
		SDValue NewChain = DAG.getNode(ISD::TokenFactor, DL, MVT::Other,
		Lo.getValue(1), Hi.getValue(1));
		ReplaceValueWith(SDValue(N, 1), NewChain);
		} else {
Lo = DAG.getNode(ISD::FP_ROUND, DL, OutVT, Lo, N->getOperand(1));		Lo = DAG.getNode(ISD::FP_ROUND, DL, OutVT, Lo, N->getOperand(1));
Hi = DAG.getNode(ISD::FP_ROUND, DL, OutVT, Hi, N->getOperand(1));		Hi = DAG.getNode(ISD::FP_ROUND, DL, OutVT, Hi, N->getOperand(1));
		}

return DAG.getNode(ISD::CONCAT_VECTORS, DL, ResVT, Lo, Hi);		return DAG.getNode(ISD::CONCAT_VECTORS, DL, ResVT, Lo, Hi);
}		}

SDValue DAGTypeLegalizer::SplitVecOp_FCOPYSIGN(SDNode *N) {		SDValue DAGTypeLegalizer::SplitVecOp_FCOPYSIGN(SDNode *N) {
// The result (and the first input) has a legal vector type, but the second		// The result (and the first input) has a legal vector type, but the second
// input needs splitting.		// input needs splitting.
return DAG.UnrollVectorOp(N, N->getValueType(0).getVectorNumElements());		return DAG.UnrollVectorOp(N, N->getValueType(0).getVectorNumElements());
▲ Show 20 Lines • Show All 147 Lines • ▼ Show 20 Lines	#endif
case ISD::SIGN_EXTEND:		case ISD::SIGN_EXTEND:
case ISD::SINT_TO_FP:		case ISD::SINT_TO_FP:
case ISD::TRUNCATE:		case ISD::TRUNCATE:
case ISD::UINT_TO_FP:		case ISD::UINT_TO_FP:
case ISD::ZERO_EXTEND:		case ISD::ZERO_EXTEND:
Res = WidenVecRes_Convert(N);		Res = WidenVecRes_Convert(N);
break;		break;

		case ISD::STRICT_FP_EXTEND:
		case ISD::STRICT_FP_ROUND:
		Res = WidenVecRes_Convert_StrictFP(N);
		break;

case ISD::FABS:		case ISD::FABS:
case ISD::FCEIL:		case ISD::FCEIL:
case ISD::FCOS:		case ISD::FCOS:
case ISD::FEXP:		case ISD::FEXP:
case ISD::FEXP2:		case ISD::FEXP2:
case ISD::FFLOOR:		case ISD::FFLOOR:
case ISD::FLOG:		case ISD::FLOG:
case ISD::FLOG10:		case ISD::FLOG10:
▲ Show 20 Lines • Show All 461 Lines • ▼ Show 20 Lines	if (N->getNumOperands() == 1)
Ops[i] = DAG.getNode(Opcode, DL, EltVT, Val);		Ops[i] = DAG.getNode(Opcode, DL, EltVT, Val);
else		else
Ops[i] = DAG.getNode(Opcode, DL, EltVT, Val, N->getOperand(1), Flags);		Ops[i] = DAG.getNode(Opcode, DL, EltVT, Val, N->getOperand(1), Flags);
}		}

return DAG.getBuildVector(WidenVT, DL, Ops);		return DAG.getBuildVector(WidenVT, DL, Ops);
}		}

		SDValue DAGTypeLegalizer::WidenVecRes_Convert_StrictFP(SDNode *N) {
		SDValue InOp = N->getOperand(1);
		SDLoc DL(N);
		SmallVector<SDValue, 4> NewOps(N->op_begin(), N->op_end());

		EVT WidenVT = TLI.getTypeToTransformTo(*DAG.getContext(), N->getValueType(0));
		unsigned WidenNumElts = WidenVT.getVectorNumElements();
		SmallVector<EVT, 2> WidenVTs = { WidenVT, MVT::Other };

		EVT InVT = InOp.getValueType();
		EVT InEltVT = InVT.getVectorElementType();

		unsigned Opcode = N->getOpcode();

		// FIXME: Optimizations need to be implemented here.

		// Otherwise unroll into some nasty scalar code and rebuild the vector.
		EVT EltVT = WidenVT.getVectorElementType();
		SmallVector<EVT, 2> EltVTs = { EltVT, MVT::Other };
		SmallVector<SDValue, 16> Ops(WidenNumElts, DAG.getUNDEF(EltVT));
		SmallVector<SDValue, 32> OpChains;
		// Use the original element count so we don't do more scalar opts than
		// necessary.
		unsigned MinElts = N->getValueType(0).getVectorNumElements();
		for (unsigned i=0; i < MinElts; ++i) {
		NewOps[1] = DAG.getNode(
		ISD::EXTRACT_VECTOR_ELT, DL, InEltVT, InOp,
		DAG.getConstant(i, DL, TLI.getVectorIdxTy(DAG.getDataLayout())));
		Ops[i] = DAG.getNode(Opcode, DL, EltVTs, NewOps);
		OpChains.push_back(Ops[i].getValue(1));
		}
		SDValue NewChain = DAG.getNode(ISD::TokenFactor, DL, MVT::Other, OpChains);
		ReplaceValueWith(SDValue(N, 1), NewChain);

		return DAG.getBuildVector(WidenVT, DL, Ops);
		}

SDValue DAGTypeLegalizer::WidenVecRes_EXTEND_VECTOR_INREG(SDNode *N) {		SDValue DAGTypeLegalizer::WidenVecRes_EXTEND_VECTOR_INREG(SDNode *N) {
unsigned Opcode = N->getOpcode();		unsigned Opcode = N->getOpcode();
SDValue InOp = N->getOperand(0);		SDValue InOp = N->getOperand(0);
SDLoc DL(N);		SDLoc DL(N);

EVT WidenVT = TLI.getTypeToTransformTo(*DAG.getContext(), N->getValueType(0));		EVT WidenVT = TLI.getTypeToTransformTo(*DAG.getContext(), N->getValueType(0));
EVT WidenSVT = WidenVT.getVectorElementType();		EVT WidenSVT = WidenVT.getVectorElementType();
unsigned WidenNumElts = WidenVT.getVectorNumElements();		unsigned WidenNumElts = WidenVT.getVectorNumElements();
▲ Show 20 Lines • Show All 770 Lines • ▼ Show 20 Lines	#endif

case ISD::ANY_EXTEND:		case ISD::ANY_EXTEND:
case ISD::SIGN_EXTEND:		case ISD::SIGN_EXTEND:
case ISD::ZERO_EXTEND:		case ISD::ZERO_EXTEND:
Res = WidenVecOp_EXTEND(N);		Res = WidenVecOp_EXTEND(N);
break;		break;

case ISD::FP_EXTEND:		case ISD::FP_EXTEND:
		case ISD::STRICT_FP_EXTEND:
case ISD::FP_TO_SINT:		case ISD::FP_TO_SINT:
case ISD::FP_TO_UINT:		case ISD::FP_TO_UINT:
case ISD::SINT_TO_FP:		case ISD::SINT_TO_FP:
case ISD::UINT_TO_FP:		case ISD::UINT_TO_FP:
case ISD::TRUNCATE:		case ISD::TRUNCATE:
Res = WidenVecOp_Convert(N);		Res = WidenVecOp_Convert(N);
break;		break;

Show All 18 Lines	#endif
if (!Res.getNode()) return false;		if (!Res.getNode()) return false;

// If the result is N, the sub-method updated N in place. Tell the legalizer		// If the result is N, the sub-method updated N in place. Tell the legalizer
// core about this.		// core about this.
if (Res.getNode() == N)		if (Res.getNode() == N)
return true;		return true;


		if (N->isStrictFPOpcode())
		assert(Res.getValueType() == N->getValueType(0) && N->getNumValues() == 2 &&
		"Invalid operand expansion");
		else
assert(Res.getValueType() == N->getValueType(0) && N->getNumValues() == 1 &&		assert(Res.getValueType() == N->getValueType(0) && N->getNumValues() == 1 &&
"Invalid operand expansion");		"Invalid operand expansion");

ReplaceValueWith(SDValue(N, 0), Res);		ReplaceValueWith(SDValue(N, 0), Res);
return false;		return false;
}		}

SDValue DAGTypeLegalizer::WidenVecOp_EXTEND(SDNode *N) {		SDValue DAGTypeLegalizer::WidenVecOp_EXTEND(SDNode *N) {
SDLoc DL(N);		SDLoc DL(N);
EVT VT = N->getValueType(0);		EVT VT = N->getValueType(0);
▲ Show 20 Lines • Show All 63 Lines • ▼ Show 20 Lines
}		}

SDValue DAGTypeLegalizer::WidenVecOp_Convert(SDNode *N) {		SDValue DAGTypeLegalizer::WidenVecOp_Convert(SDNode *N) {
// Since the result is legal and the input is illegal.		// Since the result is legal and the input is illegal.
EVT VT = N->getValueType(0);		EVT VT = N->getValueType(0);
EVT EltVT = VT.getVectorElementType();		EVT EltVT = VT.getVectorElementType();
SDLoc dl(N);		SDLoc dl(N);
unsigned NumElts = VT.getVectorNumElements();		unsigned NumElts = VT.getVectorNumElements();
SDValue InOp = N->getOperand(0);		SDValue InOp = N->getOperand(N->isStrictFPOpcode() ? 1 : 0);
assert(getTypeAction(InOp.getValueType()) ==		assert(getTypeAction(InOp.getValueType()) ==
TargetLowering::TypeWidenVector &&		TargetLowering::TypeWidenVector &&
"Unexpected type action");		"Unexpected type action");
InOp = GetWidenedVector(InOp);		InOp = GetWidenedVector(InOp);
EVT InVT = InOp.getValueType();		EVT InVT = InOp.getValueType();
unsigned Opcode = N->getOpcode();		unsigned Opcode = N->getOpcode();

// See if a widened result type would be legal, if so widen the node.		// See if a widened result type would be legal, if so widen the node.
		// FIXME: This isn't safe for StrictFP. Other optimization here is needed.
EVT WideVT = EVT::getVectorVT(*DAG.getContext(), EltVT,		EVT WideVT = EVT::getVectorVT(*DAG.getContext(), EltVT,
InVT.getVectorNumElements());		InVT.getVectorNumElements());
if (TLI.isTypeLegal(WideVT)) {		if (TLI.isTypeLegal(WideVT) && !N->isStrictFPOpcode()) {
SDValue Res = DAG.getNode(Opcode, dl, WideVT, InOp);		SDValue Res;
		if (N->isStrictFPOpcode()) {
		Res = DAG.getNode(Opcode, dl, { WideVT, MVT::Other },
		{ N->getOperand(0), InOp });
		// Legalize the chain result - switch anything that used the old chain to
		// use the new one.
		ReplaceValueWith(SDValue(N, 1), Res.getValue(1));
		} else
		Res = DAG.getNode(Opcode, dl, WideVT, InOp);
return DAG.getNode(		return DAG.getNode(
ISD::EXTRACT_SUBVECTOR, dl, VT, Res,		ISD::EXTRACT_SUBVECTOR, dl, VT, Res,
DAG.getConstant(0, dl, TLI.getVectorIdxTy(DAG.getDataLayout())));		DAG.getConstant(0, dl, TLI.getVectorIdxTy(DAG.getDataLayout())));
}		}

EVT InEltVT = InVT.getVectorElementType();		EVT InEltVT = InVT.getVectorElementType();

// Unroll the convert into some scalar code and create a nasty build vector.		// Unroll the convert into some scalar code and create a nasty build vector.
SmallVector<SDValue, 16> Ops(NumElts);		SmallVector<SDValue, 16> Ops(NumElts);
		if (N->isStrictFPOpcode()) {
		SmallVector<SDValue, 4> NewOps(N->op_begin(), N->op_end());
		SmallVector<SDValue, 32> OpChains;
		for (unsigned i=0; i < NumElts; ++i) {
		NewOps[1] = DAG.getNode(
		ISD::EXTRACT_VECTOR_ELT, dl, InEltVT, InOp,
		DAG.getConstant(i, dl, TLI.getVectorIdxTy(DAG.getDataLayout())));
		Ops[i] = DAG.getNode(Opcode, dl, { EltVT, MVT::Other }, NewOps);
		OpChains.push_back(Ops[i].getValue(1));
		}
		SDValue NewChain = DAG.getNode(ISD::TokenFactor, dl, MVT::Other, OpChains);
		ReplaceValueWith(SDValue(N, 1), NewChain);
		} else {
for (unsigned i=0; i < NumElts; ++i)		for (unsigned i = 0; i < NumElts; ++i)
Ops[i] = DAG.getNode(		Ops[i] = DAG.getNode(
Opcode, dl, EltVT,		Opcode, dl, EltVT,
DAG.getNode(		DAG.getNode(
ISD::EXTRACT_VECTOR_ELT, dl, InEltVT, InOp,		ISD::EXTRACT_VECTOR_ELT, dl, InEltVT, InOp,
DAG.getConstant(i, dl, TLI.getVectorIdxTy(DAG.getDataLayout()))));		DAG.getConstant(i, dl, TLI.getVectorIdxTy(DAG.getDataLayout()))));
		}

return DAG.getBuildVector(VT, dl, Ops);		return DAG.getBuildVector(VT, dl, Ops);
}		}

SDValue DAGTypeLegalizer::WidenVecOp_BITCAST(SDNode *N) {		SDValue DAGTypeLegalizer::WidenVecOp_BITCAST(SDNode *N) {
EVT VT = N->getValueType(0);		EVT VT = N->getValueType(0);
SDValue InOp = GetWidenedVector(N->getOperand(0));		SDValue InOp = GetWidenedVector(N->getOperand(0));
EVT InWidenVT = InOp.getValueType();		EVT InWidenVT = InOp.getValueType();
▲ Show 20 Lines • Show All 760 Lines • Show Last 20 Lines

llvm/trunk/lib/CodeGen/SelectionDAG/SelectionDAG.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 7,605 Lines • ▼ Show 20 Lines	case ISD::STRICT_FNEARBYINT:
IsUnary = true;		IsUnary = true;
break;		break;
case ISD::STRICT_FMAXNUM: NewOpc = ISD::FMAXNUM; break;		case ISD::STRICT_FMAXNUM: NewOpc = ISD::FMAXNUM; break;
case ISD::STRICT_FMINNUM: NewOpc = ISD::FMINNUM; break;		case ISD::STRICT_FMINNUM: NewOpc = ISD::FMINNUM; break;
case ISD::STRICT_FCEIL: NewOpc = ISD::FCEIL; IsUnary = true; break;		case ISD::STRICT_FCEIL: NewOpc = ISD::FCEIL; IsUnary = true; break;
case ISD::STRICT_FFLOOR: NewOpc = ISD::FFLOOR; IsUnary = true; break;		case ISD::STRICT_FFLOOR: NewOpc = ISD::FFLOOR; IsUnary = true; break;
case ISD::STRICT_FROUND: NewOpc = ISD::FROUND; IsUnary = true; break;		case ISD::STRICT_FROUND: NewOpc = ISD::FROUND; IsUnary = true; break;
case ISD::STRICT_FTRUNC: NewOpc = ISD::FTRUNC; IsUnary = true; break;		case ISD::STRICT_FTRUNC: NewOpc = ISD::FTRUNC; IsUnary = true; break;
		// STRICT_FP_ROUND takes an extra argument describing whether or not
		// the value will be changed by this node. See ISDOpcodes.h for details.
		case ISD::STRICT_FP_ROUND: NewOpc = ISD::FP_ROUND; break;
		case ISD::STRICT_FP_EXTEND: NewOpc = ISD::FP_EXTEND; IsUnary = true; break;
}		}

// We're taking this node out of the chain, so we need to re-link things.		// We're taking this node out of the chain, so we need to re-link things.
SDValue InputChain = Node->getOperand(0);		SDValue InputChain = Node->getOperand(0);
SDValue OutputChain = SDValue(Node, 1);		SDValue OutputChain = SDValue(Node, 1);
ReplaceAllUsesOfValueWith(OutputChain, InputChain);		ReplaceAllUsesOfValueWith(OutputChain, InputChain);

SDVTList VTs = getVTList(Node->getOperand(1).getValueType());		SDVTList VTs;
SDNode *Res = nullptr;		SDNode *Res = nullptr;

		switch (OrigOpc) {
		default:
		VTs = getVTList(Node->getOperand(1).getValueType());
		break;
		case ISD::STRICT_FP_ROUND:
		case ISD::STRICT_FP_EXTEND:
		VTs = getVTList(Node->getValueType(0));
		break;
		}

if (IsUnary)		if (IsUnary)
Res = MorphNodeTo(Node, NewOpc, VTs, { Node->getOperand(1) });		Res = MorphNodeTo(Node, NewOpc, VTs, { Node->getOperand(1) });
else if (IsTernary)		else if (IsTernary)
Res = MorphNodeTo(Node, NewOpc, VTs, { Node->getOperand(1),		Res = MorphNodeTo(Node, NewOpc, VTs, { Node->getOperand(1),
Node->getOperand(2),		Node->getOperand(2),
Node->getOperand(3)});		Node->getOperand(3)});
else		else
Res = MorphNodeTo(Node, NewOpc, VTs, { Node->getOperand(1),		Res = MorphNodeTo(Node, NewOpc, VTs, { Node->getOperand(1),
▲ Show 20 Lines • Show All 1,788 Lines • Show Last 20 Lines

llvm/trunk/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 6,072 Lines • ▼ Show 20 Lines	setValue(&I, DAG.getNode(ISD::FMA, sdl,
getValue(I.getArgOperand(2))));		getValue(I.getArgOperand(2))));
return nullptr;		return nullptr;
case Intrinsic::experimental_constrained_fadd:		case Intrinsic::experimental_constrained_fadd:
case Intrinsic::experimental_constrained_fsub:		case Intrinsic::experimental_constrained_fsub:
case Intrinsic::experimental_constrained_fmul:		case Intrinsic::experimental_constrained_fmul:
case Intrinsic::experimental_constrained_fdiv:		case Intrinsic::experimental_constrained_fdiv:
case Intrinsic::experimental_constrained_frem:		case Intrinsic::experimental_constrained_frem:
case Intrinsic::experimental_constrained_fma:		case Intrinsic::experimental_constrained_fma:
		case Intrinsic::experimental_constrained_fptrunc:
		case Intrinsic::experimental_constrained_fpext:
case Intrinsic::experimental_constrained_sqrt:		case Intrinsic::experimental_constrained_sqrt:
case Intrinsic::experimental_constrained_pow:		case Intrinsic::experimental_constrained_pow:
case Intrinsic::experimental_constrained_powi:		case Intrinsic::experimental_constrained_powi:
case Intrinsic::experimental_constrained_sin:		case Intrinsic::experimental_constrained_sin:
case Intrinsic::experimental_constrained_cos:		case Intrinsic::experimental_constrained_cos:
case Intrinsic::experimental_constrained_exp:		case Intrinsic::experimental_constrained_exp:
case Intrinsic::experimental_constrained_exp2:		case Intrinsic::experimental_constrained_exp2:
case Intrinsic::experimental_constrained_log:		case Intrinsic::experimental_constrained_log:
▲ Show 20 Lines • Show All 740 Lines • ▼ Show 20 Lines	case Intrinsic::experimental_constrained_fdiv:
Opcode = ISD::STRICT_FDIV;		Opcode = ISD::STRICT_FDIV;
break;		break;
case Intrinsic::experimental_constrained_frem:		case Intrinsic::experimental_constrained_frem:
Opcode = ISD::STRICT_FREM;		Opcode = ISD::STRICT_FREM;
break;		break;
case Intrinsic::experimental_constrained_fma:		case Intrinsic::experimental_constrained_fma:
Opcode = ISD::STRICT_FMA;		Opcode = ISD::STRICT_FMA;
break;		break;
		case Intrinsic::experimental_constrained_fptrunc:
		Opcode = ISD::STRICT_FP_ROUND;
		break;
		case Intrinsic::experimental_constrained_fpext:
		Opcode = ISD::STRICT_FP_EXTEND;
		break;
case Intrinsic::experimental_constrained_sqrt:		case Intrinsic::experimental_constrained_sqrt:
Opcode = ISD::STRICT_FSQRT;		Opcode = ISD::STRICT_FSQRT;
break;		break;
case Intrinsic::experimental_constrained_pow:		case Intrinsic::experimental_constrained_pow:
Opcode = ISD::STRICT_FPOW;		Opcode = ISD::STRICT_FPOW;
break;		break;
case Intrinsic::experimental_constrained_powi:		case Intrinsic::experimental_constrained_powi:
Opcode = ISD::STRICT_FPOWI;		Opcode = ISD::STRICT_FPOWI;
▲ Show 20 Lines • Show All 47 Lines • ▼ Show 20 Lines	void SelectionDAGBuilder::visitConstrainedFPIntrinsic(
const TargetLowering &TLI = DAG.getTargetLoweringInfo();		const TargetLowering &TLI = DAG.getTargetLoweringInfo();
SDValue Chain = getRoot();		SDValue Chain = getRoot();
SmallVector<EVT, 4> ValueVTs;		SmallVector<EVT, 4> ValueVTs;
ComputeValueVTs(TLI, DAG.getDataLayout(), FPI.getType(), ValueVTs);		ComputeValueVTs(TLI, DAG.getDataLayout(), FPI.getType(), ValueVTs);
ValueVTs.push_back(MVT::Other); // Out chain		ValueVTs.push_back(MVT::Other); // Out chain

SDVTList VTs = DAG.getVTList(ValueVTs);		SDVTList VTs = DAG.getVTList(ValueVTs);
SDValue Result;		SDValue Result;
if (FPI.isUnaryOp())		if (Opcode == ISD::STRICT_FP_ROUND)
		Result = DAG.getNode(Opcode, sdl, VTs,
		{ Chain, getValue(FPI.getArgOperand(0)),
		DAG.getTargetConstant(0, sdl,
		TLI.getPointerTy(DAG.getDataLayout())) });
		else if (FPI.isUnaryOp())
Result = DAG.getNode(Opcode, sdl, VTs,		Result = DAG.getNode(Opcode, sdl, VTs,
{ Chain, getValue(FPI.getArgOperand(0)) });		{ Chain, getValue(FPI.getArgOperand(0)) });
else if (FPI.isTernaryOp())		else if (FPI.isTernaryOp())
Result = DAG.getNode(Opcode, sdl, VTs,		Result = DAG.getNode(Opcode, sdl, VTs,
{ Chain, getValue(FPI.getArgOperand(0)),		{ Chain, getValue(FPI.getArgOperand(0)),
getValue(FPI.getArgOperand(1)),		getValue(FPI.getArgOperand(1)),
getValue(FPI.getArgOperand(2)) });		getValue(FPI.getArgOperand(2)) });
else		else
▲ Show 20 Lines • Show All 3,961 Lines • Show Last 20 Lines

llvm/trunk/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp

Show First 20 Lines • Show All 307 Lines • ▼ Show 20 Lines	#endif
case ISD::ZERO_EXTEND: return "zero_extend";		case ISD::ZERO_EXTEND: return "zero_extend";
case ISD::ANY_EXTEND: return "any_extend";		case ISD::ANY_EXTEND: return "any_extend";
case ISD::SIGN_EXTEND_INREG: return "sign_extend_inreg";		case ISD::SIGN_EXTEND_INREG: return "sign_extend_inreg";
case ISD::ANY_EXTEND_VECTOR_INREG: return "any_extend_vector_inreg";		case ISD::ANY_EXTEND_VECTOR_INREG: return "any_extend_vector_inreg";
case ISD::SIGN_EXTEND_VECTOR_INREG: return "sign_extend_vector_inreg";		case ISD::SIGN_EXTEND_VECTOR_INREG: return "sign_extend_vector_inreg";
case ISD::ZERO_EXTEND_VECTOR_INREG: return "zero_extend_vector_inreg";		case ISD::ZERO_EXTEND_VECTOR_INREG: return "zero_extend_vector_inreg";
case ISD::TRUNCATE: return "truncate";		case ISD::TRUNCATE: return "truncate";
case ISD::FP_ROUND: return "fp_round";		case ISD::FP_ROUND: return "fp_round";
		case ISD::STRICT_FP_ROUND: return "strict_fp_round";
case ISD::FLT_ROUNDS_: return "flt_rounds";		case ISD::FLT_ROUNDS_: return "flt_rounds";
case ISD::FP_ROUND_INREG: return "fp_round_inreg";		case ISD::FP_ROUND_INREG: return "fp_round_inreg";
case ISD::FP_EXTEND: return "fp_extend";		case ISD::FP_EXTEND: return "fp_extend";
		case ISD::STRICT_FP_EXTEND: return "strict_fp_extend";

case ISD::SINT_TO_FP: return "sint_to_fp";		case ISD::SINT_TO_FP: return "sint_to_fp";
case ISD::UINT_TO_FP: return "uint_to_fp";		case ISD::UINT_TO_FP: return "uint_to_fp";
case ISD::FP_TO_SINT: return "fp_to_sint";		case ISD::FP_TO_SINT: return "fp_to_sint";
case ISD::FP_TO_UINT: return "fp_to_uint";		case ISD::FP_TO_UINT: return "fp_to_uint";
case ISD::BITCAST: return "bitcast";		case ISD::BITCAST: return "bitcast";
case ISD::ADDRSPACECAST: return "addrspacecast";		case ISD::ADDRSPACECAST: return "addrspacecast";
case ISD::FP16_TO_FP: return "fp16_to_fp";		case ISD::FP16_TO_FP: return "fp16_to_fp";
▲ Show 20 Lines • Show All 621 Lines • Show Last 20 Lines

llvm/trunk/lib/IR/IntrinsicInst.cpp

Show First 20 Lines • Show All 136 Lines • ▼ Show 20 Lines	return StringSwitch<ExceptionBehavior>(ExceptionArg)
.Case("fpexcept.strict", ebStrict)		.Case("fpexcept.strict", ebStrict)
.Default(ebInvalid);		.Default(ebInvalid);
}		}

bool ConstrainedFPIntrinsic::isUnaryOp() const {		bool ConstrainedFPIntrinsic::isUnaryOp() const {
switch (getIntrinsicID()) {		switch (getIntrinsicID()) {
default:		default:
return false;		return false;
		case Intrinsic::experimental_constrained_fptrunc:
		case Intrinsic::experimental_constrained_fpext:
case Intrinsic::experimental_constrained_sqrt:		case Intrinsic::experimental_constrained_sqrt:
case Intrinsic::experimental_constrained_sin:		case Intrinsic::experimental_constrained_sin:
case Intrinsic::experimental_constrained_cos:		case Intrinsic::experimental_constrained_cos:
case Intrinsic::experimental_constrained_exp:		case Intrinsic::experimental_constrained_exp:
case Intrinsic::experimental_constrained_exp2:		case Intrinsic::experimental_constrained_exp2:
case Intrinsic::experimental_constrained_log:		case Intrinsic::experimental_constrained_log:
case Intrinsic::experimental_constrained_log10:		case Intrinsic::experimental_constrained_log10:
case Intrinsic::experimental_constrained_log2:		case Intrinsic::experimental_constrained_log2:
▲ Show 20 Lines • Show All 52 Lines • Show Last 20 Lines

llvm/trunk/lib/IR/Verifier.cpp

Show First 20 Lines • Show All 4,203 Lines • ▼ Show 20 Lines	case Intrinsic::coro_id: {
break;		break;
}		}
case Intrinsic::experimental_constrained_fadd:		case Intrinsic::experimental_constrained_fadd:
case Intrinsic::experimental_constrained_fsub:		case Intrinsic::experimental_constrained_fsub:
case Intrinsic::experimental_constrained_fmul:		case Intrinsic::experimental_constrained_fmul:
case Intrinsic::experimental_constrained_fdiv:		case Intrinsic::experimental_constrained_fdiv:
case Intrinsic::experimental_constrained_frem:		case Intrinsic::experimental_constrained_frem:
case Intrinsic::experimental_constrained_fma:		case Intrinsic::experimental_constrained_fma:
		case Intrinsic::experimental_constrained_fptrunc:
		case Intrinsic::experimental_constrained_fpext:
case Intrinsic::experimental_constrained_sqrt:		case Intrinsic::experimental_constrained_sqrt:
case Intrinsic::experimental_constrained_pow:		case Intrinsic::experimental_constrained_pow:
case Intrinsic::experimental_constrained_powi:		case Intrinsic::experimental_constrained_powi:
case Intrinsic::experimental_constrained_sin:		case Intrinsic::experimental_constrained_sin:
case Intrinsic::experimental_constrained_cos:		case Intrinsic::experimental_constrained_cos:
case Intrinsic::experimental_constrained_exp:		case Intrinsic::experimental_constrained_exp:
case Intrinsic::experimental_constrained_exp2:		case Intrinsic::experimental_constrained_exp2:
case Intrinsic::experimental_constrained_log:		case Intrinsic::experimental_constrained_log:
▲ Show 20 Lines • Show All 462 Lines • ▼ Show 20 Lines	void Verifier::visitConstrainedFPIntrinsic(ConstrainedFPIntrinsic &FPI) {
case Intrinsic::experimental_constrained_maxnum:		case Intrinsic::experimental_constrained_maxnum:
case Intrinsic::experimental_constrained_minnum:		case Intrinsic::experimental_constrained_minnum:
Assert((NumOperands == 4), "invalid arguments for constrained FP intrinsic",		Assert((NumOperands == 4), "invalid arguments for constrained FP intrinsic",
&FPI);		&FPI);
HasExceptionMD = true;		HasExceptionMD = true;
HasRoundingMD = true;		HasRoundingMD = true;
break;		break;

		case Intrinsic::experimental_constrained_fptrunc:
		case Intrinsic::experimental_constrained_fpext: {
		if (FPI.getIntrinsicID() == Intrinsic::experimental_constrained_fptrunc) {
		Assert((NumOperands == 3),
		"invalid arguments for constrained FP intrinsic", &FPI);
		HasRoundingMD = true;
		} else {
		Assert((NumOperands == 2),
		"invalid arguments for constrained FP intrinsic", &FPI);
		}
		HasExceptionMD = true;

		Value *Operand = FPI.getArgOperand(0);
		Type *OperandTy = Operand->getType();
		Value *Result = &FPI;
		Type *ResultTy = Result->getType();
		Assert(OperandTy->isFPOrFPVectorTy(),
		"Intrinsic first argument must be FP or FP vector", &FPI);
		Assert(ResultTy->isFPOrFPVectorTy(),
		"Intrinsic result must be FP or FP vector", &FPI);
		Assert(OperandTy->isVectorTy() == ResultTy->isVectorTy(),
		"Intrinsic first argument and result disagree on vector use", &FPI);
		if (OperandTy->isVectorTy()) {
		auto *OperandVecTy = cast<VectorType>(OperandTy);
		auto *ResultVecTy = cast<VectorType>(ResultTy);
		Assert(OperandVecTy->getNumElements() == ResultVecTy->getNumElements(),
		"Intrinsic first argument and result vector lengths must be equal",
		&FPI);
		}
		if (FPI.getIntrinsicID() == Intrinsic::experimental_constrained_fptrunc) {
		Assert(OperandTy->getScalarSizeInBits() > ResultTy->getScalarSizeInBits(),
		"Intrinsic first argument's type must be larger than result type",
		&FPI);
		} else {
		Assert(OperandTy->getScalarSizeInBits() < ResultTy->getScalarSizeInBits(),
		"Intrinsic first argument's type must be smaller than result type",
		&FPI);
		}
		}
		break;

default:		default:
llvm_unreachable("Invalid constrained FP intrinsic!");		llvm_unreachable("Invalid constrained FP intrinsic!");
}		}

// If a non-metadata argument is passed in a metadata slot then the		// If a non-metadata argument is passed in a metadata slot then the
// error will be caught earlier when the incorrect argument doesn't		// error will be caught earlier when the incorrect argument doesn't
// match the specification in the intrinsic call table. Thus, no		// match the specification in the intrinsic call table. Thus, no
// argument type check is needed here.		// argument type check is needed here.
▲ Show 20 Lines • Show All 658 Lines • Show Last 20 Lines

llvm/trunk/test/CodeGen/X86/fp-intrinsics.ll

Show First 20 Lines • Show All 280 Lines • ▼ Show 20 Lines	entry:
%rem = call double @llvm.experimental.constrained.frem.f64(		%rem = call double @llvm.experimental.constrained.frem.f64(
double 1.000000e+00,		double 1.000000e+00,
double 1.000000e+01,		double 1.000000e+01,
metadata !"round.dynamic",		metadata !"round.dynamic",
metadata !"fpexcept.strict")		metadata !"fpexcept.strict")
ret double %rem		ret double %rem
}		}

		; Verify that round(42.1) isn't simplified when the rounding mode is
		; unknown.
		; Verify that no gross errors happen.
		; CHECK-LABEL: @f21
		; COMMON: cvtsd2ss
		define float @f21() {
		entry:
		%result = call float @llvm.experimental.constrained.fptrunc.f32.f64(
		double 42.1,
		metadata !"round.dynamic",
		metadata !"fpexcept.strict")
		ret float %result
		}

		; CHECK-LABEL: @f22
		; COMMON: cvtss2sd
		define double @f22(float %x) {
		entry:
		%result = call double @llvm.experimental.constrained.fpext.f64.f32(float %x,
		metadata !"fpexcept.strict")
		ret double %result
		}

@llvm.fp.env = thread_local global i8 zeroinitializer, section "llvm.metadata"		@llvm.fp.env = thread_local global i8 zeroinitializer, section "llvm.metadata"
declare double @llvm.experimental.constrained.fadd.f64(double, double, metadata, metadata)		declare double @llvm.experimental.constrained.fadd.f64(double, double, metadata, metadata)
declare double @llvm.experimental.constrained.fsub.f64(double, double, metadata, metadata)		declare double @llvm.experimental.constrained.fsub.f64(double, double, metadata, metadata)
declare double @llvm.experimental.constrained.fmul.f64(double, double, metadata, metadata)		declare double @llvm.experimental.constrained.fmul.f64(double, double, metadata, metadata)
declare double @llvm.experimental.constrained.fdiv.f64(double, double, metadata, metadata)		declare double @llvm.experimental.constrained.fdiv.f64(double, double, metadata, metadata)
declare double @llvm.experimental.constrained.frem.f64(double, double, metadata, metadata)		declare double @llvm.experimental.constrained.frem.f64(double, double, metadata, metadata)
declare double @llvm.experimental.constrained.sqrt.f64(double, metadata, metadata)		declare double @llvm.experimental.constrained.sqrt.f64(double, metadata, metadata)
declare double @llvm.experimental.constrained.pow.f64(double, double, metadata, metadata)		declare double @llvm.experimental.constrained.pow.f64(double, double, metadata, metadata)
declare double @llvm.experimental.constrained.powi.f64(double, i32, metadata, metadata)		declare double @llvm.experimental.constrained.powi.f64(double, i32, metadata, metadata)
declare double @llvm.experimental.constrained.sin.f64(double, metadata, metadata)		declare double @llvm.experimental.constrained.sin.f64(double, metadata, metadata)
declare double @llvm.experimental.constrained.cos.f64(double, metadata, metadata)		declare double @llvm.experimental.constrained.cos.f64(double, metadata, metadata)
declare double @llvm.experimental.constrained.exp.f64(double, metadata, metadata)		declare double @llvm.experimental.constrained.exp.f64(double, metadata, metadata)
declare double @llvm.experimental.constrained.exp2.f64(double, metadata, metadata)		declare double @llvm.experimental.constrained.exp2.f64(double, metadata, metadata)
declare double @llvm.experimental.constrained.log.f64(double, metadata, metadata)		declare double @llvm.experimental.constrained.log.f64(double, metadata, metadata)
declare double @llvm.experimental.constrained.log10.f64(double, metadata, metadata)		declare double @llvm.experimental.constrained.log10.f64(double, metadata, metadata)
declare double @llvm.experimental.constrained.log2.f64(double, metadata, metadata)		declare double @llvm.experimental.constrained.log2.f64(double, metadata, metadata)
declare double @llvm.experimental.constrained.rint.f64(double, metadata, metadata)		declare double @llvm.experimental.constrained.rint.f64(double, metadata, metadata)
declare double @llvm.experimental.constrained.nearbyint.f64(double, metadata, metadata)		declare double @llvm.experimental.constrained.nearbyint.f64(double, metadata, metadata)
declare float @llvm.experimental.constrained.fma.f32(float, float, float, metadata, metadata)		declare float @llvm.experimental.constrained.fma.f32(float, float, float, metadata, metadata)
declare double @llvm.experimental.constrained.fma.f64(double, double, double, metadata, metadata)		declare double @llvm.experimental.constrained.fma.f64(double, double, double, metadata, metadata)
		declare float @llvm.experimental.constrained.fptrunc.f32.f64(double, metadata, metadata)
		declare double @llvm.experimental.constrained.fpext.f64.f32(float, metadata)

llvm/trunk/test/CodeGen/X86/vector-constrained-fp-intrinsics.ll

Show First 20 Lines • Show All 3,825 Lines • ▼ Show 20 Lines	%min = call <4 x double> @llvm.experimental.constrained.minnum.v4f64(
double 46.0, double 47.0>,		double 46.0, double 47.0>,
<4 x double> <double 40.0, double 41.0,		<4 x double> <double 40.0, double 41.0,
double 42.0, double 43.0>,		double 42.0, double 43.0>,
metadata !"round.dynamic",		metadata !"round.dynamic",
metadata !"fpexcept.strict")		metadata !"fpexcept.strict")
ret <4 x double> %min		ret <4 x double> %min
}		}

		define <1 x float> @constrained_vector_fptrunc_v1f64() {
		; CHECK-LABEL: constrained_vector_fptrunc_v1f64:
		; CHECK: # %bb.0: # %entry
		; CHECK-NEXT: movsd {{.*#+}} xmm0 = mem[0],zero
		; CHECK-NEXT: cvtsd2ss %xmm0, %xmm0
		; CHECK-NEXT: retq
		;
		; AVX-LABEL: constrained_vector_fptrunc_v1f64:
		; AVX: # %bb.0: # %entry
		; AVX-NEXT: vmovsd {{.*#+}} xmm0 = mem[0],zero
		; AVX-NEXT: vcvtsd2ss %xmm0, %xmm0, %xmm0
		; AVX-NEXT: retq
		entry:
		%result = call <1 x float> @llvm.experimental.constrained.fptrunc.v1f32.v1f64(
		<1 x double><double 42.1>,
		metadata !"round.dynamic",
		metadata !"fpexcept.strict")
		ret <1 x float> %result
		}

		define <2 x float> @constrained_vector_fptrunc_v2f64() {
		; CHECK-LABEL: constrained_vector_fptrunc_v2f64:
		; CHECK: # %bb.0: # %entry
		; CHECK-NEXT: movsd {{.*#+}} xmm0 = mem[0],zero
		; CHECK-NEXT: cvtsd2ss %xmm0, %xmm1
		; CHECK-NEXT: movsd {{.*#+}} xmm0 = mem[0],zero
		; CHECK-NEXT: cvtsd2ss %xmm0, %xmm0
		; CHECK-NEXT: unpcklps {{.*#+}} xmm0 = xmm0[0],xmm1[0],xmm0[1],xmm1[1]
		; CHECK-NEXT: retq
		;
		; AVX-LABEL: constrained_vector_fptrunc_v2f64:
		; AVX: # %bb.0: # %entry
		; AVX-NEXT: vmovsd {{.*#+}} xmm0 = mem[0],zero
		; AVX-NEXT: vcvtsd2ss %xmm0, %xmm0, %xmm0
		; AVX-NEXT: vmovsd {{.*#+}} xmm1 = mem[0],zero
		; AVX-NEXT: vcvtsd2ss %xmm1, %xmm1, %xmm1
		; AVX-NEXT: vinsertps {{.*#+}} xmm0 = xmm1[0],xmm0[0],xmm1[2,3]
		; AVX-NEXT: retq
		entry:
		%result = call <2 x float> @llvm.experimental.constrained.fptrunc.v2f32.v2f64(
		<2 x double><double 42.1, double 42.2>,
		metadata !"round.dynamic",
		metadata !"fpexcept.strict")
		ret <2 x float> %result
		}

		define <3 x float> @constrained_vector_fptrunc_v3f64() {
		; CHECK-LABEL: constrained_vector_fptrunc_v3f64:
		; CHECK: # %bb.0: # %entry
		; CHECK-NEXT: movsd {{.*#+}} xmm0 = mem[0],zero
		; CHECK-NEXT: cvtsd2ss %xmm0, %xmm1
		; CHECK-NEXT: movsd {{.*#+}} xmm0 = mem[0],zero
		; CHECK-NEXT: cvtsd2ss %xmm0, %xmm0
		; CHECK-NEXT: unpcklps {{.*#+}} xmm0 = xmm0[0],xmm1[0],xmm0[1],xmm1[1]
		; CHECK-NEXT: movsd {{.*#+}} xmm1 = mem[0],zero
		; CHECK-NEXT: cvtsd2ss %xmm1, %xmm1
		; CHECK-NEXT: movlhps {{.*#+}} xmm0 = xmm0[0],xmm1[0]
		; CHECK-NEXT: retq
		;
		; AVX-LABEL: constrained_vector_fptrunc_v3f64:
		; AVX: # %bb.0: # %entry
		; AVX-NEXT: vmovsd {{.*#+}} xmm0 = mem[0],zero
		; AVX-NEXT: vcvtsd2ss %xmm0, %xmm0, %xmm0
		; AVX-NEXT: vmovsd {{.*#+}} xmm1 = mem[0],zero
		; AVX-NEXT: vcvtsd2ss %xmm1, %xmm1, %xmm1
		; AVX-NEXT: vinsertps {{.*#+}} xmm0 = xmm1[0],xmm0[0],xmm1[2,3]
		; AVX-NEXT: vmovsd {{.*#+}} xmm1 = mem[0],zero
		; AVX-NEXT: vcvtsd2ss %xmm1, %xmm1, %xmm1
		; AVX-NEXT: vinsertps {{.*#+}} xmm0 = xmm0[0,1],xmm1[0],xmm0[3]
		; AVX-NEXT: retq
		entry:
		%result = call <3 x float> @llvm.experimental.constrained.fptrunc.v3f32.v3f64(
		<3 x double><double 42.1, double 42.2,
		double 42.3>,
		metadata !"round.dynamic",
		metadata !"fpexcept.strict")
		ret <3 x float> %result
		}

		define <4 x float> @constrained_vector_fptrunc_v4f64() {
		; CHECK-LABEL: constrained_vector_fptrunc_v4f64:
		; CHECK: # %bb.0: # %entry
		; CHECK-NEXT: movsd {{.*#+}} xmm0 = mem[0],zero
		; CHECK-NEXT: cvtsd2ss %xmm0, %xmm0
		; CHECK-NEXT: movsd {{.*#+}} xmm1 = mem[0],zero
		; CHECK-NEXT: cvtsd2ss %xmm1, %xmm1
		; CHECK-NEXT: unpcklps {{.*#+}} xmm1 = xmm1[0],xmm0[0],xmm1[1],xmm0[1]
		; CHECK-NEXT: movsd {{.*#+}} xmm0 = mem[0],zero
		; CHECK-NEXT: cvtsd2ss %xmm0, %xmm2
		; CHECK-NEXT: movsd {{.*#+}} xmm0 = mem[0],zero
		; CHECK-NEXT: cvtsd2ss %xmm0, %xmm0
		; CHECK-NEXT: unpcklps {{.*#+}} xmm0 = xmm0[0],xmm2[0],xmm0[1],xmm2[1]
		; CHECK-NEXT: movlhps {{.*#+}} xmm0 = xmm0[0],xmm1[0]
		; CHECK-NEXT: retq
		;
		; AVX-LABEL: constrained_vector_fptrunc_v4f64:
		; AVX: # %bb.0: # %entry
		; AVX-NEXT: vcvtpd2psy {{.*}}(%rip), %xmm0
		; AVX-NEXT: retq
		entry:
		%result = call <4 x float> @llvm.experimental.constrained.fptrunc.v4f32.v4f64(
		<4 x double><double 42.1, double 42.2,
		double 42.3, double 42.4>,
		metadata !"round.dynamic",
		metadata !"fpexcept.strict")
		ret <4 x float> %result
		}

		define <1 x double> @constrained_vector_fpext_v1f32() {
		; CHECK-LABEL: constrained_vector_fpext_v1f32:
		; CHECK: # %bb.0: # %entry
		; CHECK-NEXT: movss {{.*#+}} xmm0 = mem[0],zero,zero,zero
		; CHECK-NEXT: cvtss2sd %xmm0, %xmm0
		; CHECK-NEXT: retq
		;
		; AVX-LABEL: constrained_vector_fpext_v1f32:
		; AVX: # %bb.0: # %entry
		; AVX-NEXT: vmovss {{.*#+}} xmm0 = mem[0],zero,zero,zero
		; AVX-NEXT: vcvtss2sd %xmm0, %xmm0, %xmm0
		; AVX-NEXT: retq
		entry:
		%result = call <1 x double> @llvm.experimental.constrained.fpext.v1f64.v1f32(
		<1 x float><float 42.0>,
		metadata !"fpexcept.strict")
		ret <1 x double> %result
		}

		define <2 x double> @constrained_vector_fpext_v2f32() {
		; CHECK-LABEL: constrained_vector_fpext_v2f32:
		; CHECK: # %bb.0: # %entry
		; CHECK-NEXT: movss {{.*#+}} xmm0 = mem[0],zero,zero,zero
		; CHECK-NEXT: cvtss2sd %xmm0, %xmm1
		; CHECK-NEXT: movss {{.*#+}} xmm0 = mem[0],zero,zero,zero
		; CHECK-NEXT: cvtss2sd %xmm0, %xmm0
		; CHECK-NEXT: movlhps {{.*#+}} xmm0 = xmm0[0],xmm1[0]
		; CHECK-NEXT: retq
		;
		; AVX-LABEL: constrained_vector_fpext_v2f32:
		; AVX: # %bb.0: # %entry
		; AVX-NEXT: vmovss {{.*#+}} xmm0 = mem[0],zero,zero,zero
		; AVX-NEXT: vcvtss2sd %xmm0, %xmm0, %xmm0
		; AVX-NEXT: vmovss {{.*#+}} xmm1 = mem[0],zero,zero,zero
		; AVX-NEXT: vcvtss2sd %xmm1, %xmm1, %xmm1
		; AVX-NEXT: vmovlhps {{.*#+}} xmm0 = xmm1[0],xmm0[0]
		; AVX-NEXT: retq
		entry:
		%result = call <2 x double> @llvm.experimental.constrained.fpext.v2f64.v2f32(
		<2 x float><float 42.0, float 43.0>,
		metadata !"fpexcept.strict")
		ret <2 x double> %result
		}

		define <3 x double> @constrained_vector_fpext_v3f32() {
		; CHECK-LABEL: constrained_vector_fpext_v3f32:
		; CHECK: # %bb.0: # %entry
		; CHECK-NEXT: movss {{.*#+}} xmm0 = mem[0],zero,zero,zero
		; CHECK-NEXT: cvtss2sd %xmm0, %xmm0
		; CHECK-NEXT: movss {{.*#+}} xmm1 = mem[0],zero,zero,zero
		; CHECK-NEXT: cvtss2sd %xmm1, %xmm1
		; CHECK-NEXT: movss {{.*#+}} xmm2 = mem[0],zero,zero,zero
		; CHECK-NEXT: cvtss2sd %xmm2, %xmm2
		; CHECK-NEXT: movsd %xmm2, -{{[0-9]+}}(%rsp)
		; CHECK-NEXT: fldl -{{[0-9]+}}(%rsp)
		; CHECK-NEXT: retq
		;
		; AVX-LABEL: constrained_vector_fpext_v3f32:
		; AVX: # %bb.0: # %entry
		; AVX-NEXT: vmovss {{.*#+}} xmm0 = mem[0],zero,zero,zero
		; AVX-NEXT: vcvtss2sd %xmm0, %xmm0, %xmm0
		; AVX-NEXT: vmovss {{.*#+}} xmm1 = mem[0],zero,zero,zero
		; AVX-NEXT: vcvtss2sd %xmm1, %xmm1, %xmm1
		; AVX-NEXT: vmovlhps {{.*#+}} xmm0 = xmm1[0],xmm0[0]
		; AVX-NEXT: vmovss {{.*#+}} xmm1 = mem[0],zero,zero,zero
		; AVX-NEXT: vcvtss2sd %xmm1, %xmm1, %xmm1
		; AVX-NEXT: vinsertf128 $1, %xmm1, %ymm0, %ymm0
		; AVX-NEXT: retq
		entry:
		%result = call <3 x double> @llvm.experimental.constrained.fpext.v3f64.v3f32(
		<3 x float><float 42.0, float 43.0,
		float 44.0>,
		metadata !"fpexcept.strict")
		ret <3 x double> %result
		}

		define <4 x double> @constrained_vector_fpext_v4f32() {
		; CHECK-LABEL: constrained_vector_fpext_v4f32:
		; CHECK: # %bb.0: # %entry
		; CHECK-NEXT: movss {{.*#+}} xmm0 = mem[0],zero,zero,zero
		; CHECK-NEXT: cvtss2sd %xmm0, %xmm1
		; CHECK-NEXT: movss {{.*#+}} xmm0 = mem[0],zero,zero,zero
		; CHECK-NEXT: cvtss2sd %xmm0, %xmm0
		; CHECK-NEXT: movlhps {{.*#+}} xmm0 = xmm0[0],xmm1[0]
		; CHECK-NEXT: movss {{.*#+}} xmm1 = mem[0],zero,zero,zero
		; CHECK-NEXT: cvtss2sd %xmm1, %xmm2
		; CHECK-NEXT: movss {{.*#+}} xmm1 = mem[0],zero,zero,zero
		; CHECK-NEXT: cvtss2sd %xmm1, %xmm1
		; CHECK-NEXT: movlhps {{.*#+}} xmm1 = xmm1[0],xmm2[0]
		; CHECK-NEXT: retq
		;
		; AVX-LABEL: constrained_vector_fpext_v4f32:
		; AVX: # %bb.0: # %entry
		; AVX-NEXT: vcvtps2pd {{.*}}(%rip), %ymm0
		; AVX-NEXT: retq
		entry:
		%result = call <4 x double> @llvm.experimental.constrained.fpext.v4f64.v4f32(
		<4 x float><float 42.0, float 43.0,
		float 44.0, float 45.0>,
		metadata !"fpexcept.strict")
		ret <4 x double> %result
		}

define <1 x float> @constrained_vector_ceil_v1f32() {		define <1 x float> @constrained_vector_ceil_v1f32() {
; CHECK-LABEL: constrained_vector_ceil_v1f32:		; CHECK-LABEL: constrained_vector_ceil_v1f32:
; CHECK: # %bb.0: # %entry		; CHECK: # %bb.0: # %entry
; CHECK-NEXT: pushq %rax		; CHECK-NEXT: pushq %rax
; CHECK-NEXT: .cfi_def_cfa_offset 16		; CHECK-NEXT: .cfi_def_cfa_offset 16
; CHECK-NEXT: movss {{.*#+}} xmm0 = mem[0],zero,zero,zero		; CHECK-NEXT: movss {{.*#+}} xmm0 = mem[0],zero,zero,zero
; CHECK-NEXT: callq ceilf		; CHECK-NEXT: callq ceilf
; CHECK-NEXT: popq %rax		; CHECK-NEXT: popq %rax
▲ Show 20 Lines • Show All 566 Lines • ▼ Show 20 Lines
declare <2 x double> @llvm.experimental.constrained.exp2.v2f64(<2 x double>, metadata, metadata)		declare <2 x double> @llvm.experimental.constrained.exp2.v2f64(<2 x double>, metadata, metadata)
declare <2 x double> @llvm.experimental.constrained.log.v2f64(<2 x double>, metadata, metadata)		declare <2 x double> @llvm.experimental.constrained.log.v2f64(<2 x double>, metadata, metadata)
declare <2 x double> @llvm.experimental.constrained.log10.v2f64(<2 x double>, metadata, metadata)		declare <2 x double> @llvm.experimental.constrained.log10.v2f64(<2 x double>, metadata, metadata)
declare <2 x double> @llvm.experimental.constrained.log2.v2f64(<2 x double>, metadata, metadata)		declare <2 x double> @llvm.experimental.constrained.log2.v2f64(<2 x double>, metadata, metadata)
declare <2 x double> @llvm.experimental.constrained.rint.v2f64(<2 x double>, metadata, metadata)		declare <2 x double> @llvm.experimental.constrained.rint.v2f64(<2 x double>, metadata, metadata)
declare <2 x double> @llvm.experimental.constrained.nearbyint.v2f64(<2 x double>, metadata, metadata)		declare <2 x double> @llvm.experimental.constrained.nearbyint.v2f64(<2 x double>, metadata, metadata)
declare <2 x double> @llvm.experimental.constrained.maxnum.v2f64(<2 x double>, <2 x double>, metadata, metadata)		declare <2 x double> @llvm.experimental.constrained.maxnum.v2f64(<2 x double>, <2 x double>, metadata, metadata)
declare <2 x double> @llvm.experimental.constrained.minnum.v2f64(<2 x double>, <2 x double>, metadata, metadata)		declare <2 x double> @llvm.experimental.constrained.minnum.v2f64(<2 x double>, <2 x double>, metadata, metadata)
		declare <2 x float> @llvm.experimental.constrained.fptrunc.v2f32.v2f64(<2 x double>, metadata, metadata)
		declare <2 x double> @llvm.experimental.constrained.fpext.v2f64.v2f32(<2 x float>, metadata)
declare <2 x double> @llvm.experimental.constrained.ceil.v2f64(<2 x double>, metadata, metadata)		declare <2 x double> @llvm.experimental.constrained.ceil.v2f64(<2 x double>, metadata, metadata)
declare <2 x double> @llvm.experimental.constrained.floor.v2f64(<2 x double>, metadata, metadata)		declare <2 x double> @llvm.experimental.constrained.floor.v2f64(<2 x double>, metadata, metadata)
declare <2 x double> @llvm.experimental.constrained.round.v2f64(<2 x double>, metadata, metadata)		declare <2 x double> @llvm.experimental.constrained.round.v2f64(<2 x double>, metadata, metadata)
declare <2 x double> @llvm.experimental.constrained.trunc.v2f64(<2 x double>, metadata, metadata)		declare <2 x double> @llvm.experimental.constrained.trunc.v2f64(<2 x double>, metadata, metadata)

; Scalar width declarations		; Scalar width declarations
declare <1 x float> @llvm.experimental.constrained.fadd.v1f32(<1 x float>, <1 x float>, metadata, metadata)		declare <1 x float> @llvm.experimental.constrained.fadd.v1f32(<1 x float>, <1 x float>, metadata, metadata)
declare <1 x float> @llvm.experimental.constrained.fsub.v1f32(<1 x float>, <1 x float>, metadata, metadata)		declare <1 x float> @llvm.experimental.constrained.fsub.v1f32(<1 x float>, <1 x float>, metadata, metadata)
Show All 9 Lines
declare <1 x float> @llvm.experimental.constrained.exp2.v1f32(<1 x float>, metadata, metadata)		declare <1 x float> @llvm.experimental.constrained.exp2.v1f32(<1 x float>, metadata, metadata)
declare <1 x float> @llvm.experimental.constrained.log.v1f32(<1 x float>, metadata, metadata)		declare <1 x float> @llvm.experimental.constrained.log.v1f32(<1 x float>, metadata, metadata)
declare <1 x float> @llvm.experimental.constrained.log10.v1f32(<1 x float>, metadata, metadata)		declare <1 x float> @llvm.experimental.constrained.log10.v1f32(<1 x float>, metadata, metadata)
declare <1 x float> @llvm.experimental.constrained.log2.v1f32(<1 x float>, metadata, metadata)		declare <1 x float> @llvm.experimental.constrained.log2.v1f32(<1 x float>, metadata, metadata)
declare <1 x float> @llvm.experimental.constrained.rint.v1f32(<1 x float>, metadata, metadata)		declare <1 x float> @llvm.experimental.constrained.rint.v1f32(<1 x float>, metadata, metadata)
declare <1 x float> @llvm.experimental.constrained.nearbyint.v1f32(<1 x float>, metadata, metadata)		declare <1 x float> @llvm.experimental.constrained.nearbyint.v1f32(<1 x float>, metadata, metadata)
declare <1 x float> @llvm.experimental.constrained.maxnum.v1f32(<1 x float>, <1 x float>, metadata, metadata)		declare <1 x float> @llvm.experimental.constrained.maxnum.v1f32(<1 x float>, <1 x float>, metadata, metadata)
declare <1 x float> @llvm.experimental.constrained.minnum.v1f32(<1 x float>, <1 x float>, metadata, metadata)		declare <1 x float> @llvm.experimental.constrained.minnum.v1f32(<1 x float>, <1 x float>, metadata, metadata)
		declare <1 x float> @llvm.experimental.constrained.fptrunc.v1f32.v1f64(<1 x double>, metadata, metadata)
		declare <1 x double> @llvm.experimental.constrained.fpext.v1f64.v1f32(<1 x float>, metadata)
declare <1 x float> @llvm.experimental.constrained.ceil.v1f32(<1 x float>, metadata, metadata)		declare <1 x float> @llvm.experimental.constrained.ceil.v1f32(<1 x float>, metadata, metadata)
declare <1 x float> @llvm.experimental.constrained.floor.v1f32(<1 x float>, metadata, metadata)		declare <1 x float> @llvm.experimental.constrained.floor.v1f32(<1 x float>, metadata, metadata)
declare <1 x float> @llvm.experimental.constrained.round.v1f32(<1 x float>, metadata, metadata)		declare <1 x float> @llvm.experimental.constrained.round.v1f32(<1 x float>, metadata, metadata)
declare <1 x float> @llvm.experimental.constrained.trunc.v1f32(<1 x float>, metadata, metadata)		declare <1 x float> @llvm.experimental.constrained.trunc.v1f32(<1 x float>, metadata, metadata)

; Illegal width declarations		; Illegal width declarations
declare <3 x float> @llvm.experimental.constrained.fadd.v3f32(<3 x float>, <3 x float>, metadata, metadata)		declare <3 x float> @llvm.experimental.constrained.fadd.v3f32(<3 x float>, <3 x float>, metadata, metadata)
declare <3 x double> @llvm.experimental.constrained.fadd.v3f64(<3 x double>, <3 x double>, metadata, metadata)		declare <3 x double> @llvm.experimental.constrained.fadd.v3f64(<3 x double>, <3 x double>, metadata, metadata)
Show All 28 Lines
declare <3 x float> @llvm.experimental.constrained.rint.v3f32(<3 x float>, metadata, metadata)		declare <3 x float> @llvm.experimental.constrained.rint.v3f32(<3 x float>, metadata, metadata)
declare <3 x double> @llvm.experimental.constrained.rint.v3f64(<3 x double>, metadata, metadata)		declare <3 x double> @llvm.experimental.constrained.rint.v3f64(<3 x double>, metadata, metadata)
declare <3 x float> @llvm.experimental.constrained.nearbyint.v3f32(<3 x float>, metadata, metadata)		declare <3 x float> @llvm.experimental.constrained.nearbyint.v3f32(<3 x float>, metadata, metadata)
declare <3 x double> @llvm.experimental.constrained.nearbyint.v3f64(<3 x double>, metadata, metadata)		declare <3 x double> @llvm.experimental.constrained.nearbyint.v3f64(<3 x double>, metadata, metadata)
declare <3 x float> @llvm.experimental.constrained.maxnum.v3f32(<3 x float>, <3 x float>, metadata, metadata)		declare <3 x float> @llvm.experimental.constrained.maxnum.v3f32(<3 x float>, <3 x float>, metadata, metadata)
declare <3 x double> @llvm.experimental.constrained.maxnum.v3f64(<3 x double>, <3 x double>, metadata, metadata)		declare <3 x double> @llvm.experimental.constrained.maxnum.v3f64(<3 x double>, <3 x double>, metadata, metadata)
declare <3 x float> @llvm.experimental.constrained.minnum.v3f32(<3 x float>, <3 x float>, metadata, metadata)		declare <3 x float> @llvm.experimental.constrained.minnum.v3f32(<3 x float>, <3 x float>, metadata, metadata)
declare <3 x double> @llvm.experimental.constrained.minnum.v3f64(<3 x double>, <3 x double>, metadata, metadata)		declare <3 x double> @llvm.experimental.constrained.minnum.v3f64(<3 x double>, <3 x double>, metadata, metadata)
		declare <3 x float> @llvm.experimental.constrained.fptrunc.v3f32.v3f64(<3 x double>, metadata, metadata)
		declare <3 x double> @llvm.experimental.constrained.fpext.v3f64.v3f32(<3 x float>, metadata)
declare <3 x float> @llvm.experimental.constrained.ceil.v3f32(<3 x float>, metadata, metadata)		declare <3 x float> @llvm.experimental.constrained.ceil.v3f32(<3 x float>, metadata, metadata)
declare <3 x double> @llvm.experimental.constrained.ceil.v3f64(<3 x double>, metadata, metadata)		declare <3 x double> @llvm.experimental.constrained.ceil.v3f64(<3 x double>, metadata, metadata)
declare <3 x float> @llvm.experimental.constrained.floor.v3f32(<3 x float>, metadata, metadata)		declare <3 x float> @llvm.experimental.constrained.floor.v3f32(<3 x float>, metadata, metadata)
declare <3 x double> @llvm.experimental.constrained.floor.v3f64(<3 x double>, metadata, metadata)		declare <3 x double> @llvm.experimental.constrained.floor.v3f64(<3 x double>, metadata, metadata)
declare <3 x float> @llvm.experimental.constrained.round.v3f32(<3 x float>, metadata, metadata)		declare <3 x float> @llvm.experimental.constrained.round.v3f32(<3 x float>, metadata, metadata)
declare <3 x double> @llvm.experimental.constrained.round.v3f64(<3 x double>, metadata, metadata)		declare <3 x double> @llvm.experimental.constrained.round.v3f64(<3 x double>, metadata, metadata)
declare <3 x float> @llvm.experimental.constrained.trunc.v3f32(<3 x float>, metadata, metadata)		declare <3 x float> @llvm.experimental.constrained.trunc.v3f32(<3 x float>, metadata, metadata)
declare <3 x double> @llvm.experimental.constrained.trunc.v3f64(<3 x double>, metadata, metadata)		declare <3 x double> @llvm.experimental.constrained.trunc.v3f64(<3 x double>, metadata, metadata)
Show All 13 Lines
declare <4 x double> @llvm.experimental.constrained.exp2.v4f64(<4 x double>, metadata, metadata)		declare <4 x double> @llvm.experimental.constrained.exp2.v4f64(<4 x double>, metadata, metadata)
declare <4 x double> @llvm.experimental.constrained.log.v4f64(<4 x double>, metadata, metadata)		declare <4 x double> @llvm.experimental.constrained.log.v4f64(<4 x double>, metadata, metadata)
declare <4 x double> @llvm.experimental.constrained.log10.v4f64(<4 x double>, metadata, metadata)		declare <4 x double> @llvm.experimental.constrained.log10.v4f64(<4 x double>, metadata, metadata)
declare <4 x double> @llvm.experimental.constrained.log2.v4f64(<4 x double>, metadata, metadata)		declare <4 x double> @llvm.experimental.constrained.log2.v4f64(<4 x double>, metadata, metadata)
declare <4 x double> @llvm.experimental.constrained.rint.v4f64(<4 x double>, metadata, metadata)		declare <4 x double> @llvm.experimental.constrained.rint.v4f64(<4 x double>, metadata, metadata)
declare <4 x double> @llvm.experimental.constrained.nearbyint.v4f64(<4 x double>, metadata, metadata)		declare <4 x double> @llvm.experimental.constrained.nearbyint.v4f64(<4 x double>, metadata, metadata)
declare <4 x double> @llvm.experimental.constrained.maxnum.v4f64(<4 x double>, <4 x double>, metadata, metadata)		declare <4 x double> @llvm.experimental.constrained.maxnum.v4f64(<4 x double>, <4 x double>, metadata, metadata)
declare <4 x double> @llvm.experimental.constrained.minnum.v4f64(<4 x double>, <4 x double>, metadata, metadata)		declare <4 x double> @llvm.experimental.constrained.minnum.v4f64(<4 x double>, <4 x double>, metadata, metadata)
		declare <4 x float> @llvm.experimental.constrained.fptrunc.v4f32.v4f64(<4 x double>, metadata, metadata)
		declare <4 x double> @llvm.experimental.constrained.fpext.v4f64.v4f32(<4 x float>, metadata)
declare <4 x double> @llvm.experimental.constrained.ceil.v4f64(<4 x double>, metadata, metadata)		declare <4 x double> @llvm.experimental.constrained.ceil.v4f64(<4 x double>, metadata, metadata)
declare <4 x double> @llvm.experimental.constrained.floor.v4f64(<4 x double>, metadata, metadata)		declare <4 x double> @llvm.experimental.constrained.floor.v4f64(<4 x double>, metadata, metadata)
declare <4 x double> @llvm.experimental.constrained.round.v4f64(<4 x double>, metadata, metadata)		declare <4 x double> @llvm.experimental.constrained.round.v4f64(<4 x double>, metadata, metadata)
declare <4 x double> @llvm.experimental.constrained.trunc.v4f64(<4 x double>, metadata, metadata)		declare <4 x double> @llvm.experimental.constrained.trunc.v4f64(<4 x double>, metadata, metadata)

llvm/trunk/test/Feature/fp-intrinsics.ll

	Show First 20 Lines • Show All 236 Lines • ▼ Show 20 Lines
	define double @f17() {			define double @f17() {
	entry:			entry:
	%result = call double @llvm.experimental.constrained.fma.f64(double 42.1, double 42.1, double 42.1,			%result = call double @llvm.experimental.constrained.fma.f64(double 42.1, double 42.1, double 42.1,
	metadata !"round.dynamic",			metadata !"round.dynamic",
	metadata !"fpexcept.strict")			metadata !"fpexcept.strict")
	ret double %result			ret double %result
	}			}

				; Verify that fptrunc(42.1) isn't simplified when the rounding mode is
				; unknown.
				; CHECK-LABEL: f20
				; CHECK: call float @llvm.experimental.constrained.fptrunc
				define float @f20() {
				entry:
				%result = call float @llvm.experimental.constrained.fptrunc.f32.f64(
				double 42.1,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %result
				}

				; Verify that fpext(42.1) isn't simplified when the rounding mode is
				; unknown.
				; CHECK-LABEL: f21
				; CHECK: call double @llvm.experimental.constrained.fpext
				define double @f21() {
				entry:
				%result = call double @llvm.experimental.constrained.fpext.f64.f32(float 42.0,
				metadata !"fpexcept.strict")
				ret double %result
				}

	@llvm.fp.env = thread_local global i8 zeroinitializer, section "llvm.metadata"			@llvm.fp.env = thread_local global i8 zeroinitializer, section "llvm.metadata"
	declare double @llvm.experimental.constrained.fdiv.f64(double, double, metadata, metadata)			declare double @llvm.experimental.constrained.fdiv.f64(double, double, metadata, metadata)
	declare double @llvm.experimental.constrained.fmul.f64(double, double, metadata, metadata)			declare double @llvm.experimental.constrained.fmul.f64(double, double, metadata, metadata)
	declare double @llvm.experimental.constrained.fadd.f64(double, double, metadata, metadata)			declare double @llvm.experimental.constrained.fadd.f64(double, double, metadata, metadata)
	declare double @llvm.experimental.constrained.fsub.f64(double, double, metadata, metadata)			declare double @llvm.experimental.constrained.fsub.f64(double, double, metadata, metadata)
	declare double @llvm.experimental.constrained.sqrt.f64(double, metadata, metadata)			declare double @llvm.experimental.constrained.sqrt.f64(double, metadata, metadata)
	declare double @llvm.experimental.constrained.pow.f64(double, double, metadata, metadata)			declare double @llvm.experimental.constrained.pow.f64(double, double, metadata, metadata)
	declare double @llvm.experimental.constrained.powi.f64(double, i32, metadata, metadata)			declare double @llvm.experimental.constrained.powi.f64(double, i32, metadata, metadata)
	declare double @llvm.experimental.constrained.sin.f64(double, metadata, metadata)			declare double @llvm.experimental.constrained.sin.f64(double, metadata, metadata)
	declare double @llvm.experimental.constrained.cos.f64(double, metadata, metadata)			declare double @llvm.experimental.constrained.cos.f64(double, metadata, metadata)
	declare double @llvm.experimental.constrained.exp.f64(double, metadata, metadata)			declare double @llvm.experimental.constrained.exp.f64(double, metadata, metadata)
	declare double @llvm.experimental.constrained.exp2.f64(double, metadata, metadata)			declare double @llvm.experimental.constrained.exp2.f64(double, metadata, metadata)
	declare double @llvm.experimental.constrained.log.f64(double, metadata, metadata)			declare double @llvm.experimental.constrained.log.f64(double, metadata, metadata)
	declare double @llvm.experimental.constrained.log10.f64(double, metadata, metadata)			declare double @llvm.experimental.constrained.log10.f64(double, metadata, metadata)
	declare double @llvm.experimental.constrained.log2.f64(double, metadata, metadata)			declare double @llvm.experimental.constrained.log2.f64(double, metadata, metadata)
	declare double @llvm.experimental.constrained.rint.f64(double, metadata, metadata)			declare double @llvm.experimental.constrained.rint.f64(double, metadata, metadata)
	declare double @llvm.experimental.constrained.nearbyint.f64(double, metadata, metadata)			declare double @llvm.experimental.constrained.nearbyint.f64(double, metadata, metadata)
	declare double @llvm.experimental.constrained.fma.f64(double, double, double, metadata, metadata)			declare double @llvm.experimental.constrained.fma.f64(double, double, double, metadata, metadata)
				declare float @llvm.experimental.constrained.fptrunc.f32.f64(double, metadata, metadata)
				declare double @llvm.experimental.constrained.fpext.f64.f32(float, metadata)

This is an archive of the discontinued LLVM Phabricator instance.

Add constrained fptrunc and fpext intrinsicsClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 199261

llvm/trunk/docs/LangRef.rst

llvm/trunk/include/llvm/CodeGen/ISDOpcodes.h

llvm/trunk/include/llvm/CodeGen/SelectionDAGNodes.h

llvm/trunk/include/llvm/CodeGen/TargetLowering.h

llvm/trunk/include/llvm/IR/IntrinsicInst.h

llvm/trunk/include/llvm/IR/Intrinsics.td

llvm/trunk/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp

llvm/trunk/lib/CodeGen/SelectionDAG/LegalizeTypes.h

llvm/trunk/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp

llvm/trunk/lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp

llvm/trunk/lib/CodeGen/SelectionDAG/SelectionDAG.cpp

llvm/trunk/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

llvm/trunk/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp

llvm/trunk/lib/IR/IntrinsicInst.cpp

llvm/trunk/lib/IR/Verifier.cpp

llvm/trunk/test/CodeGen/X86/fp-intrinsics.ll

llvm/trunk/test/CodeGen/X86/vector-constrained-fp-intrinsics.ll

llvm/trunk/test/Feature/fp-intrinsics.ll

Add constrained fptrunc and fpext intrinsics
ClosedPublic