This is an archive of the discontinued LLVM Phabricator instance.

Differential D14327

IR: Add llvm.ldexp and llvm.experimental.constrained.ldexp intrinsics
ClosedPublic

Authored by arsenm on Nov 4 2015, 2:54 AM.

Details

Reviewers

tstellarAMD
hfinkel
nhaehnle
jcranmer-intel
kpn
sepavloff
andrew.w.kaylor
spatel
foad

Summary

AMDGPU has native instructions and target intrinsics for this, but
these really should be subject to legalization and generic
optimizations. This will enable legalization of f16->f32 on targets
without f16 support.

Implement a somewhat horrible inline expansion for targets without
libcall support. This could be better if we could introduce control
flow (GlobalISel version not yet implemented). Support for strictfp
legalization is less complete but works for the simple cases.

Event Timeline

nhaehnle updated this revision to Diff 39178. Nov 4 2015, 2:54 AM

nhaehnle retitled this revision to Add llvm.ldexp.* intrinsic, associated SDNode and library calls. Nov 4 2015, 2:54 AM

nhaehnle updated this object.

Herald added subscribers: dsanders, arsenm, jfb. Nov 4 2015, 2:54 AM

nhaehnle added a subscriber: llvm-commits. Nov 4 2015, 2:57 AM

arsenm added subscribers: scanon, resistor. Nov 4 2015, 9:32 AM

This mostly LGTM except for the question of error behavior. There should be a few additions to get more of the benefits of using an intrinsic over a libcall. ldexp should be added to isTriviallyVectorizable and isSafeToSpeculativelyExecute with appropriate tests, provided we can assume it doesn't set errno. This could be a follow-up patch.

docs/LangRef.rst
9889	I don't think this should be defined to handle errors the same way as libm. I think we should say it does not set errno, and then only do the libcall transformation if the call is marked readonly/readnone. This is an area that isn't handled particularly consistently by the existing math intrinsics.
test/CodeGen/AMDGPU/llvm.ldexp.ll
21	Should include vector versions for at least v2f32, v4f32 and v2f64. Also, can you merge the existing llvm.AMDGPU.ldexp.ll test into this one and rename them with a legacy_ prefix?

arsenm added a subscriber: hfinkel. Nov 4 2015, 9:35 AM

Thank you for taking a look! I've made some changes based on your feedback:

AMDGPU: more llvm.ldexp.ll tests and assorted bugfixes
LangRef for llvm.ldexp.*: remove statement about handling error conditions
[VectorUtils] llvm.ldexp.* intrinsic is vectorizable
[ValueTracking] ldexp preserves the sign of its first argument

I agree that the error handling is a problem, and I have to admit that I don't
know what is best. At the time of the libcall transformation, we already have
an SDNode, so I do not know how to tell the attributes of the original call.

It's also some effort to provide an expansion that is guaranteed to never set
errno, because the most straightforward expansion uses exp2, which is in turn
likely to become a library call. I suppose one could write a custom implementation
in compiler-rt, but I don't think that that's the best use of my time.

For now, I have made changes that are in line with the other intrinsics like pow
and powi: those are marked as isTriviallyVectorizable, but *not* as
isSafeToSpeculativelyExecute.

I hope that this is good enough. There are quite a number of TODOs already in
the code regarding these error problems. In any case, I've left those changes
as separate commits locally, so it's easy enough for me to rearrange them.
(Though at least for some of them I believe they should definitely be squashed
before committing to SVN.)

Couldn't the original bug be fixed by marking ldexpf as unavailable for AMDGPU in lib/Analysis/TargetLibraryInfo.cpp ?

In D14327#292904, @tstellarAMD wrote:

Couldn't the original bug be fixed by marking ldexpf as unavailable for AMDGPU in lib/Analysis/TargetLibraryInfo.cpp ?

I think so, yes. Though Matt said that we do want to use the ldexp instruction because it is a full-rate instruction.

In D14327#292935, @nhaehnle wrote:

In D14327#292904, @tstellarAMD wrote:

Couldn't the original bug be fixed by marking ldexpf as unavailable for AMDGPU in lib/Analysis/TargetLibraryInfo.cpp ?

I think so, yes. Though Matt said that we do want to use the ldexp instruction because it is a full-rate instruction.

Ok, so for a temporary solution, rather than changing the intrinsic emitted by Mesa, I think we should mark this libcall as unavailable. This current patch could then be done as a follow up.

arsenm added inline comments. Jan 19 2016, 1:49 PM

test/CodeGen/X86/ldexp.ll
4	Vector tests here are probably a good idea as well

hfinkel added inline comments. Feb 2 2016, 5:03 PM

docs/LangRef.rst
9889	As I recall, we're very consistent about this, with one exception: @llvm.sqrt. And this causes a lot of confusion. That having been said, there is a precedent, and there are good reasons to do it. However, we do need to say what happens if the result is not representable. You really have two choices:
"and handles error conditions in the same way" (i.e. perhaps sets errno)
Has undefined behavior (it needs to be undefined because it might be implemented using libm, and we can't know whether libm will affect errno)
lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
3251	This seems like a great idea if FEXP2 is legal, but otherwise it seems likely to be slower than the original library function call to ldexp. Unless we really know better, we should keep the original call.
lib/CodeGen/TargetLoweringBase.cpp
873	You should add FLDEXP here too.

Rebased on top of current trunk and addressed the various comments.

Since the TargetAction now defaults to Expand (which is actually LibCall
in disguise when available), I have removed several places where targets
redundantly set the action.

Herald added a reviewer: tstellarAMD. Feb 10 2016, 3:16 PM

Herald added a subscriber: mzolotukhin.

I've opted to go the "undefined range error" behaviour route in the revision since that seemed more useful to me given that the LibCallSimplifier is intrinsic->intrinsic and libcall->libcall now.

nhaehnle added reviewers: arsenm, hfinkel. Feb 10 2016, 3:17 PM

Some IR-level places could also be updated to handle it (e.g. TTI, isSafeToSpeculativelyExecute), but that's probably a separate patch.

docs/LangRef.rst
9889	The returned value on underflow is defined to be zero, and HUGE_VAL, which may be infinity, on overflow. I think saying undefined behavior for the case is too strong. Maybe saying just the state of errno is undefined?
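
As a concrete reference point for the return values mentioned above, here is a small sketch that calls the C library's ldexp with exponents far outside the range of double. The struct and function names are hypothetical, and the sketch assumes an IEEE-754 double and the default rounding mode.

```cpp
#include <cmath>

// Results of ldexp when the scaled value is out of range for double.
struct RangeResults {
  double overflow;  // 1.0 * 2^5000, too large for double
  double underflow; // 1.0 * 2^-5000, too small for double
};

RangeResults ldexp_range_results() {
  // volatile keeps the compiler from folding the calls at compile time.
  volatile double one = 1.0;
  volatile int big = 5000;
  return {std::ldexp(one, big), std::ldexp(one, -big)};
}
```

The returned values are well defined here regardless of what happens to errno, which is the distinction the comment above draws.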

hfinkel added inline comments. Apr 26 2016, 6:00 PM

docs/LangRef.rst
9889	We don't have a way to model errno. We need to "prevent" a situation where we're allowed to reorder a call to ldexp in between, for example, a call to open() and a call to perror(). To get the benefits you want, however, you need to mark the function as readnone. However, it might be implemented using the underlying library call, which might set errno. Unless you make that undefined behavior, then the readnone on the intrinsic is wrong. Both overflow and underflow need to be undefined behavior. I realize that this is unfortunate.
lib/Target/PowerPC/PPCISelLowering.cpp
460	Don't do this. Set it to Expand by default (in TargetLoweringBase::initActions). That's our current best practice for new rarely-legal nodes.

arsenm added inline comments. Apr 27 2016, 1:27 PM

docs/LangRef.rst
9889	The converse is we already don't 'correctly' lower the existing intrinsics which are assumed to write errno because errno does not exist on the platform. I'm still generally confused about the inconsistency of errno handling. Why don't we have separate sets of math intrinsics, one respecting errno and one not? Lowering the non-errno version with a library call would be an incorrect lowering for these. Alternatively, why isn't anything that may write errno always a libcall, with the intrinsics reserved for -fno-math-errno? Currently -fno-math-errno adds readnone to the call site of the library call, and allows selecting to the corresponding DAG node. The inconsistency in behavior between the DAG nodes and intrinsics has always confused me. A readnone call to the library function will select to the corresponding chainless node, which could still be lowered to a call to an errno-writing function. In the case of sqrt, this is further confused because < 0 inputs are no longer undefined. I would expect the intrinsics to be the ones for using a native instruction which ignores errno. The current set of math intrinsics, including those that say they handle errors the same way, are already IntrNoMem (e.g. llvm.exp) and say nothing about undefined behavior. The sqrt intrinsic has undefined behavior for < 0, but we are able to fold out an isnan() check before it in the DAG. I'm not sure what an underflow/overflow test for ldexp would look like, but it would be more complicated than the simple compare and select for sqrt.

Regarding errno: it's totally valid to ignore if the implementation sets math_errhandling & MATH_ERRNO to zero. Of course, you need to know the C library to make that choice, but its value never changes at runtime. See C11 section 7.12, as well as the soon-to-be-published C++ paper p0108r1 which you can preview here.
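
The `math_errhandling` check described above can be queried directly. The sketch below, with hypothetical function names, reports whether the C library in use signals math errors through errno, and probes what errno holds after an overflowing ldexp call that follows an unrelated errno store (the reordering hazard discussed earlier in this thread). What the probe returns depends on the C library and compiler flags, so no particular result is claimed.

```cpp
#include <cerrno>
#include <cmath>

// True if this C library reports math errors through errno.
bool libm_uses_errno() {
  return (math_errhandling & MATH_ERRNO) != 0;
}

// Hypothetical probe: store `before` in errno, perform an overflowing
// ldexp, and return whatever errno holds afterwards.
int errno_after_overflowing_ldexp(int before) {
  // volatile keeps the compiler from folding the call at compile time.
  volatile double x = 1.0;
  volatile int n = 5000;
  errno = before;
  volatile double r = std::ldexp(x, n);
  (void)r;
  return errno;
}
```

If the probe returns ERANGE instead of the value passed in, a call such as perror() placed after the ldexp would report the wrong error, which is why a readnone intrinsic cannot be lowered blindly to such a libcall.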

spatel mentioned this in rG62a0a1b9eea7: [InstCombine] avoid crashing in exp2->ldexp. Feb 10 2023, 4:36 AM

spatel mentioned this in rG9dcd7195a21c: [InstCombine] avoid crashing in pow->ldexp. Feb 10 2023, 5:04 AM

arsenm commandeered this revision. May 1 2023, 7:24 AM

arsenm edited reviewers, added: nhaehnle; removed: arsenm.

Herald added a project: Restricted Project. May 1 2023, 7:24 AM

Herald added subscribers: hoy, pcwang-thead, kosarev and 8 others.

Rebase forward 7 years. Add constrained version and GlobalISel support. Fix promotion for f16->f32. Replace fpow2-based legalization with an integer expansion which actually passes OpenCL conformance. Drop some redundant checks for the libcall signature.

Also fix treating the second operand as a fixed scalar value instead of a vector.

Herald added a project: Restricted Project. May 1 2023, 7:31 AM

Herald added subscribers: foad, atanasyan, jrtc27 and 2 others.

arsenm added reviewers: jcranmer-intel, kpn, sepavloff, andrew.w.kaylor, spatel. May 1 2023, 7:33 AM

Herald added a subscriber: StephenFan. May 1 2023, 7:33 AM

arsenm added a reviewer: foad. May 1 2023, 7:33 AM

arsenm added a child revision: D149587: InstSimplify: Simplifications for ldexp. May 1 2023, 7:36 AM

arsenm added a child revision: D149588: clang: Start emitting intrinsic for __builtin_ldexp*.

arsenm added a child revision: D149589: AMDGPU: Drop and auto-upgrade llvm.amdgcn.ldexp to llvm.ldexp. May 1 2023, 7:40 AM

arsenm added a child revision: D149590: ValueTracking: Implement computeKnownFPClass for ldexp.

arsenm mentioned this in D149589: AMDGPU: Drop and auto-upgrade llvm.amdgcn.ldexp to llvm.ldexp. May 1 2023, 8:35 AM

The constrained intrinsic version is not documented here.

Harbormaster completed remote builds in B229225: Diff 518436. May 1 2023, 9:06 AM

It would be great to add some tests for NVPTX as the patch may hit some corner cases there. NVPTX has no libcalls and fp16 support depends on the GPU variant (no fp16 before sm_60).

In D14327#4310305, @tra wrote:

NVPTX has no libcalls

The TargetLibraryInfo query says there is ldexp and then it doesn't work

Copy-paste docs like the other constrained intrinsics (is there a reason we don't just document them all as pairs?)

Harbormaster completed remote builds in B229405: Diff 518687. May 2 2023, 9:33 AM

In D14327#4312048, @arsenm wrote:

The TargetLibraryInfo query says there is ldexp and then it doesn't work

Interesting. How exactly does it fail? I'm pretty sure we used to make libcalls unavailable in the past (I think we could not lower the calls to them), but I'm having a hard time finding that code now. It may have changed when we've improved handling the unsupported libcalls in NVPTX.

foad added inline comments. May 3 2023, 9:01 AM

llvm/test/CodeGen/AMDGPU/llvm.ldexp.ll
259 ↗	(On Diff #518687)	This doesn't quite work because the instruction truncates v1 to 16 bits, so if you wanted ldexp(1.0, 0x10000) aka +inf you'll actually get ldexp(1.0, 0) aka 1.0.

In D14327#4313028, @tra wrote:

In D14327#4312048, @arsenm wrote:

The TargetLibraryInfo query says there is ldexp and then it doesn't work

Interesting. How exactly does it fail?

LLVM ERROR: Undefined external symbol "ldexpf"

llvm/test/CodeGen/AMDGPU/llvm.ldexp.ll
259 ↗	(On Diff #518687)	Ugh, the library does have clamp code for this. The tablegen definition claims this is VOP_F16_F16_I32 though

Clamp when truncating exp
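
The difference between plain truncation and clamping before truncation, as raised in the inline comments above, can be sketched as follows. The function names are hypothetical and this is not the code from the patch; it only illustrates the integer arithmetic involved.

```cpp
#include <algorithm>
#include <cstdint>

// Plain truncation keeps only the low 16 bits of the exponent.
// (The final conversion assumes two's complement, as on all common targets.)
std::int16_t truncate_exp(std::int32_t e) {
  return static_cast<std::int16_t>(static_cast<std::uint16_t>(e));
}

// Clamp to the i16 range first, so out-of-range exponents saturate
// instead of wrapping around.
std::int16_t clamp_then_truncate_exp(std::int32_t e) {
  std::int32_t lo = INT16_MIN;
  std::int32_t hi = INT16_MAX;
  return static_cast<std::int16_t>(std::max(lo, std::min(e, hi)));
}
```

With an exponent of 0x10000, truncation yields 0 while clamping yields 32767. Because an exponent that large in magnitude already drives any finite nonzero f16 value to infinity or zero, the saturated exponent should give the same result as the full 32-bit one.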

Harbormaster completed remote builds in B229872: Diff 519328. May 3 2023, 9:38 PM

ping

This mostly LGTM, but it looks like some GlobalISel legalization is missing relative to SelectionDAG?

In D14327#4341602, @nhaehnle wrote:

This mostly LGTM, but it looks like some GlobalISel legalization is missing relative to SelectionDAG?

Yes. The full legalization expansion should be different, since it's possible to introduce control flow. I didn't see the point handling it right now since the only case I'm sure that expands now is x86 windows, which isn't complete enough to write an end to end test for.

Okay, makes sense.

This revision is now accepted and ready to land. May 15 2023, 5:59 AM

arsenm added a child revision: D150765: InstCombine: Fold select of ldexp to ldexp of select. May 17 2023, 2:50 AM

Joe_Nash added a subscriber: Joe_Nash. May 18 2023, 7:44 AM

Joe_Nash added inline comments.

llvm/test/CodeGen/AMDGPU/llvm.ldexp.ll
5 ↗	(On Diff #519328)	Typo GFX1

arsenm marked an inline comment as done. May 18 2023, 9:20 AM

Update some MC tests for operand change. Disassembler seems to have a bizarre behavior where it takes invalid instructions and prints invalid instructions with larger encodings than they started with

Harbormaster completed remote builds in B233144: Diff 523725. May 19 2023, 6:30 AM

jcranmer-intel added inline comments. May 25 2023, 10:26 AM

llvm/docs/ReleaseNotes.rst
59 ↗	(On Diff #523725)	Nit: mention constrained version as well?

release notes

Harbormaster completed remote builds in B234645: Diff 525794. May 25 2023, 6:23 PM

eece6ba283bd763e6d7109ae9e155e81cfee0651

foad added inline comments. Jun 7 2023, 6:31 AM

llvm/test/MC/Disassembler/AMDGPU/gfx10_vop3.txt
7523 ↗	(On Diff #525794)	What caused this change in the assembler/disassembler behaviour? It looks like it has broken round-tripping, since the "encoding" output is longer than the input.

arsenm added inline comments. Jun 7 2023, 6:40 AM

llvm/test/MC/Disassembler/AMDGPU/gfx10_vop3.txt
7523 ↗	(On Diff #525794)	The exp operand was incorrectly marked as i32 when it's really i16. The inline immediate values are then different

Joe_Nash added inline comments. Jun 7 2023, 7:00 AM

llvm/test/MC/Disassembler/AMDGPU/gfx10_vop3.txt
7523 ↗	(On Diff #525794)	I believe that operand should be f16. We still want to be able to assemble inline fp constants. From a semantic point of view, these are i16 constants, but from an encoding point of view they are f16. In the True16 support downstream I have been treating that argument as f16. If you want it to be i16 yet still support inline fp constants, we need to effectively revert 5f5f566b265db00f577ead268400d99f34ba9cdd

arsenm added inline comments. Jun 7 2023, 7:16 AM

llvm/test/MC/Disassembler/AMDGPU/gfx10_vop3.txt
7523 ↗	(On Diff #525794)	It is an i16 operand. In the broken hardware handling of the f16 inline immediates, +- 0.5/1.0/2.0/4.0 are all effectively aliases for 0. The assembler now rejects these as invalid literals. I don't really understand the disassembler's handling of this invalid case

foad added inline comments. Jun 7 2023, 7:29 AM

llvm/test/MC/AMDGPU/gfx10_asm_vop2.s
12937 ↗	(On Diff #525794)	"The assembler now rejects these as invalid literals." Looks like it is still accepting -4.0 here?

arsenm added inline comments. Jun 7 2023, 7:41 AM

llvm/test/MC/AMDGPU/gfx10_asm_vop2.s
12937 ↗	(On Diff #525794)	It's being accepted as a 32-bit literal, which is valid on gfx10

Revision Contents

Path	Size
docs/LangRef.rst	36 lines
include/llvm/Analysis/TargetLibraryInfo.h	1 line
include/llvm/CodeGen/ISDOpcodes.h	3 lines
include/llvm/CodeGen/RuntimeLibcalls.h	5 lines
include/llvm/IR/Intrinsics.td	1 line
include/llvm/Target/TargetSelectionDAG.td	4 lines
lib/CodeGen/SelectionDAG/LegalizeDAG.cpp	16 lines
lib/CodeGen/SelectionDAG/LegalizeTypes.h	6 lines
lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp	1 line
lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp	20 lines
lib/CodeGen/SelectionDAG/SelectionDAGBuilder.h	1 line
lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp	32 lines
lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp	1 line
lib/CodeGen/TargetLoweringBase.cpp	12 lines
lib/Target/AArch64/AArch64ISelLowering.cpp	6 lines
lib/Target/AMDGPU/AMDGPUISelLowering.h	1 line
lib/Target/AMDGPU/AMDGPUISelLowering.cpp	7 lines
lib/Target/AMDGPU/AMDGPUInstrInfo.td	2 lines
lib/Target/AMDGPU/SIInstructions.td	4 lines
lib/Target/ARM/ARMISelLowering.cpp	6 lines
lib/Target/Hexagon/HexagonISelLowering.cpp	4 lines
lib/Target/Mips/MipsISelLowering.cpp	2 lines
lib/Target/PowerPC/PPCISelLowering.cpp	5 lines
lib/Target/WebAssembly/WebAssemblyISelLowering.cpp	3 lines
lib/Target/X86/X86ISelLowering.cpp	11 lines
lib/Transforms/Utils/SimplifyLibCalls.cpp	37 lines
test/CodeGen/AMDGPU/llvm.ldexp.ll	26 lines
test/CodeGen/X86/ldexp.ll	30 lines
test/Transforms/InstCombine/exp2-1.ll	8 lines

Diff 39178

docs/LangRef.rst

[9,846 lines not shown]
  type.

  Semantics:
  """"""""""

  This function returns the same values as the libm ``exp2`` functions
  would, and handles error conditions in the same way.

+ '``llvm.ldexp.*``' Intrinsic
+ ^^^^^^^^^^^^^^^^^^^^^^^^^^^
+
+ Syntax:
+ """""""
+
+ This is an overloaded intrinsic. You can use ``llvm.ldexp`` on any
+ floating point or vector of floating point type. Not all targets support
+ all types however.
+
+ ::
+
+       declare float @llvm.ldexp.f32(float %Val, i32 %Exp)
+       declare double @llvm.ldexp.f64(double %Val, i32 %Exp)
+       declare x86_fp80 @llvm.ldexp.f80(x86_fp80 %Val, i32 %Exp)
+       declare fp128 @llvm.ldexp.f128(fp128 %Val, i32 %Exp)
+       declare ppc_fp128 @llvm.ldexp.ppcf128(ppc_fp128 %Val, i32 %Exp)
+
+ Overview:
+ """""""""
+
+ The '``llvm.ldexp.*``' intrinsics perform the ldexp function.
+
+ Arguments:
+ """"""""""
+
+ The first argument and the return value are floating point numbers of the same
+ type. The second argument is an integer.
+
+ Semantics:
+ """"""""""
+
+ This function multiplies the first argument by 2 raised to the second argument's
+ power, thus returning the same values as the libm ``ldexp`` functions
+ would, and handles error conditions in the same way.

  '``llvm.log.*``' Intrinsic
  ^^^^^^^^^^^^^^^^^^^^^^^^^^

  Syntax:
  """""""

  This is an overloaded intrinsic. You can use ``llvm.log`` on any
  floating point or vector of floating point type. Not all targets support
[2,162 lines not shown]

include/llvm/Analysis/TargetLibraryInfo.h

[227 lines not shown] bool hasOptimizedCodeGen(LibFunc::Func F) const {
      case LibFunc::floor: case LibFunc::floorf: case LibFunc::floorl:
      case LibFunc::nearbyint: case LibFunc::nearbyintf: case LibFunc::nearbyintl:
      case LibFunc::ceil: case LibFunc::ceilf: case LibFunc::ceill:
      case LibFunc::rint: case LibFunc::rintf: case LibFunc::rintl:
      case LibFunc::round: case LibFunc::roundf: case LibFunc::roundl:
      case LibFunc::trunc: case LibFunc::truncf: case LibFunc::truncl:
      case LibFunc::log2: case LibFunc::log2f: case LibFunc::log2l:
      case LibFunc::exp2: case LibFunc::exp2f: case LibFunc::exp2l:
+     case LibFunc::ldexp: case LibFunc::ldexpf: case LibFunc::ldexpl:
      case LibFunc::memcmp: case LibFunc::strcmp: case LibFunc::strcpy:
      case LibFunc::stpcpy: case LibFunc::strlen: case LibFunc::strnlen:
      case LibFunc::memchr:
        return true;
      }
      return false;
    }

[85 lines not shown]

include/llvm/CodeGen/ISDOpcodes.h

[518 lines not shown] enum NodeType {
    /// In the case where a single input is NaN, the non-NaN input is returned.
    ///
    /// The return value of (FMINNUM 0.0, -0.0) could be either 0.0 or -0.0.
    FMINNUM, FMAXNUM,
    /// FMINNAN/FMAXNAN - Behave identically to FMINNUM/FMAXNUM, except that
    /// when a single input is NaN, NaN is returned.
    FMINNAN, FMAXNAN,

+   /// FLDEXP - ldexp from libm (op0 * 2**op1).
+   FLDEXP,
+
    /// FSINCOS - Compute both fsin and fcos as a single operation.
    FSINCOS,

    /// LOAD and STORE have token chains as their first operand, then the same
    /// operands as an LLVM load/store instruction, then an offset node that
    /// is added / subtracted from the base pointer to form the address (for
    /// indexed memory ops).
    LOAD, STORE,
[395 lines not shown]

include/llvm/CodeGen/RuntimeLibcalls.h

[142 lines not shown] enum Libcall {
    EXP_F80,
    EXP_F128,
    EXP_PPCF128,
    EXP2_F32,
    EXP2_F64,
    EXP2_F80,
    EXP2_F128,
    EXP2_PPCF128,
+   LDEXP_F32,
+   LDEXP_F64,
+   LDEXP_F80,
+   LDEXP_F128,
+   LDEXP_PPCF128,
    SIN_F32,
    SIN_F64,
    SIN_F80,
    SIN_F128,
    SIN_PPCF128,
    COS_F32,
    COS_F64,
    COS_F80,
[277 lines not shown]

include/llvm/IR/Intrinsics.td

[354 lines not shown] let Properties = [IntrNoMem] in {
    def int_cos : Intrinsic<[llvm_anyfloat_ty], [LLVMMatchType<0>]>;
    def int_pow : Intrinsic<[llvm_anyfloat_ty],
                            [LLVMMatchType<0>, LLVMMatchType<0>]>;
    def int_log : Intrinsic<[llvm_anyfloat_ty], [LLVMMatchType<0>]>;
    def int_log10: Intrinsic<[llvm_anyfloat_ty], [LLVMMatchType<0>]>;
    def int_log2 : Intrinsic<[llvm_anyfloat_ty], [LLVMMatchType<0>]>;
    def int_exp : Intrinsic<[llvm_anyfloat_ty], [LLVMMatchType<0>]>;
    def int_exp2 : Intrinsic<[llvm_anyfloat_ty], [LLVMMatchType<0>]>;
+   def int_ldexp : Intrinsic<[llvm_anyfloat_ty], [LLVMMatchType<0>, llvm_i32_ty]>;
    def int_fabs : Intrinsic<[llvm_anyfloat_ty], [LLVMMatchType<0>]>;
    def int_minnum : Intrinsic<[llvm_anyfloat_ty], [LLVMMatchType<0>, LLVMMatchType<0>]>;
    def int_maxnum : Intrinsic<[llvm_anyfloat_ty], [LLVMMatchType<0>, LLVMMatchType<0>]>;
    def int_copysign : Intrinsic<[llvm_anyfloat_ty],
                                 [LLVMMatchType<0>, LLVMMatchType<0>]>;
    def int_floor : Intrinsic<[llvm_anyfloat_ty], [LLVMMatchType<0>]>;
    def int_ceil : Intrinsic<[llvm_anyfloat_ty], [LLVMMatchType<0>]>;
    def int_trunc : Intrinsic<[llvm_anyfloat_ty], [LLVMMatchType<0>]>;
[289 lines not shown]

include/llvm/Target/TargetSelectionDAG.td

Show First 20 Lines • Show All 142 Lines • ▼ Show 20 Lines	def SDTFPExtendOp : SDTypeProfile<1, 1, [ // fextend
SDTCisFP<0>, SDTCisFP<1>, SDTCisOpSmallerThanOp<1, 0>		SDTCisFP<0>, SDTCisFP<1>, SDTCisOpSmallerThanOp<1, 0>
]>;		]>;
def SDTIntToFPOp : SDTypeProfile<1, 1, [ // [su]int_to_fp		def SDTIntToFPOp : SDTypeProfile<1, 1, [ // [su]int_to_fp
SDTCisFP<0>, SDTCisInt<1>		SDTCisFP<0>, SDTCisInt<1>
]>;		]>;
def SDTFPToIntOp : SDTypeProfile<1, 1, [ // fp_to_[su]int		def SDTFPToIntOp : SDTypeProfile<1, 1, [ // fp_to_[su]int
SDTCisInt<0>, SDTCisFP<1>		SDTCisInt<0>, SDTCisFP<1>
]>;		]>;
		def SDTFPExpOp : SDTypeProfile<1, 2, [ // ldexp
		SDTCisSameAs<0, 1>, SDTCisFP<0>, SDTCisInt<2>
		]>;
def SDTExtInreg : SDTypeProfile<1, 2, [ // sext_inreg		def SDTExtInreg : SDTypeProfile<1, 2, [ // sext_inreg
SDTCisSameAs<0, 1>, SDTCisInt<0>, SDTCisVT<2, OtherVT>,		SDTCisSameAs<0, 1>, SDTCisInt<0>, SDTCisVT<2, OtherVT>,
SDTCisVTSmallerThanOp<2, 1>		SDTCisVTSmallerThanOp<2, 1>
]>;		]>;

def SDTSetCC : SDTypeProfile<1, 3, [ // setcc		def SDTSetCC : SDTypeProfile<1, 3, [ // setcc
SDTCisInt<0>, SDTCisSameAs<1, 2>, SDTCisVT<3, OtherVT>		SDTCisInt<0>, SDTCisSameAs<1, 2>, SDTCisVT<3, OtherVT>
]>;		]>;
▲ Show 20 Lines • Show All 270 Lines • ▼ Show 20 Lines
def fpow : SDNode<"ISD::FPOW" , SDTFPBinOp>;		def fpow : SDNode<"ISD::FPOW" , SDTFPBinOp>;
def flog2 : SDNode<"ISD::FLOG2" , SDTFPUnaryOp>;		def flog2 : SDNode<"ISD::FLOG2" , SDTFPUnaryOp>;
def frint : SDNode<"ISD::FRINT" , SDTFPUnaryOp>;		def frint : SDNode<"ISD::FRINT" , SDTFPUnaryOp>;
def ftrunc : SDNode<"ISD::FTRUNC" , SDTFPUnaryOp>;		def ftrunc : SDNode<"ISD::FTRUNC" , SDTFPUnaryOp>;
def fceil : SDNode<"ISD::FCEIL" , SDTFPUnaryOp>;		def fceil : SDNode<"ISD::FCEIL" , SDTFPUnaryOp>;
def ffloor : SDNode<"ISD::FFLOOR" , SDTFPUnaryOp>;		def ffloor : SDNode<"ISD::FFLOOR" , SDTFPUnaryOp>;
def fnearbyint : SDNode<"ISD::FNEARBYINT" , SDTFPUnaryOp>;		def fnearbyint : SDNode<"ISD::FNEARBYINT" , SDTFPUnaryOp>;
def frnd : SDNode<"ISD::FROUND" , SDTFPUnaryOp>;		def frnd : SDNode<"ISD::FROUND" , SDTFPUnaryOp>;
		def fldexp : SDNode<"ISD::FLDEXP" , SDTFPExpOp>;

def fround : SDNode<"ISD::FP_ROUND" , SDTFPRoundOp>;		def fround : SDNode<"ISD::FP_ROUND" , SDTFPRoundOp>;
def fextend : SDNode<"ISD::FP_EXTEND" , SDTFPExtendOp>;		def fextend : SDNode<"ISD::FP_EXTEND" , SDTFPExtendOp>;
def fcopysign : SDNode<"ISD::FCOPYSIGN" , SDTFPSignOp>;		def fcopysign : SDNode<"ISD::FCOPYSIGN" , SDTFPSignOp>;

def sint_to_fp : SDNode<"ISD::SINT_TO_FP" , SDTIntToFPOp>;		def sint_to_fp : SDNode<"ISD::SINT_TO_FP" , SDTIntToFPOp>;
def uint_to_fp : SDNode<"ISD::UINT_TO_FP" , SDTIntToFPOp>;		def uint_to_fp : SDNode<"ISD::UINT_TO_FP" , SDTIntToFPOp>;
def fp_to_sint : SDNode<"ISD::FP_TO_SINT" , SDTFPToIntOp>;		def fp_to_sint : SDNode<"ISD::FP_TO_SINT" , SDTFPToIntOp>;

lib/CodeGen/SelectionDAG/LegalizeDAG.cpp

... if ((TLI.isOperationLegalOrCustom(ISD::FSINCOS, VT) ||
SDVTList VTs = DAG.getVTList(VT, VT);		SDVTList VTs = DAG.getVTList(VT, VT);
Tmp1 = DAG.getNode(ISD::FSINCOS, dl, VTs, Node->getOperand(0));		Tmp1 = DAG.getNode(ISD::FSINCOS, dl, VTs, Node->getOperand(0));
if (Node->getOpcode() == ISD::FCOS)		if (Node->getOpcode() == ISD::FCOS)
Tmp1 = Tmp1.getValue(1);		Tmp1 = Tmp1.getValue(1);
Results.push_back(Tmp1);		Results.push_back(Tmp1);
}		}
break;		break;
}		}

		case ISD::FLDEXP: {
		EVT VT = Node->getValueType(0);
		Tmp1 = DAG.getNode(ISD::SINT_TO_FP, dl, VT, Node->getOperand(1));
		Tmp2 = DAG.getNode(ISD::FEXP2, dl, VT, Tmp1);
hfinkel: This seems like a great idea if FEXP2 is legal, but otherwise it seems likely slower than the original library function call to ldexp. Unless we really know better, we should keep the original call.
		Tmp3 = DAG.getNode(ISD::FMUL, dl, VT, Node->getOperand(0), Tmp2);
		Results.push_back(Tmp3);
		break;
		}

case ISD::FMAD:		case ISD::FMAD:
llvm_unreachable("Illegal fmad should never be formed");		llvm_unreachable("Illegal fmad should never be formed");

case ISD::FP16_TO_FP:		case ISD::FP16_TO_FP:
if (Node->getValueType(0) != MVT::f32) {		if (Node->getValueType(0) != MVT::f32) {
// We can extend to types bigger than f32 in two steps without changing		// We can extend to types bigger than f32 in two steps without changing
// the result. Since "f16 -> f32" is much more commonly available, give		// the result. Since "f16 -> f32" is much more commonly available, give
// CodeGen the option of emitting that before resorting to a libcall.		// CodeGen the option of emitting that before resorting to a libcall.
... case ISD::FNEARBYINT:
break;		break;
case ISD::FROUND:		case ISD::FROUND:
Results.push_back(ExpandFPLibCall(Node, RTLIB::ROUND_F32,		Results.push_back(ExpandFPLibCall(Node, RTLIB::ROUND_F32,
RTLIB::ROUND_F64,		RTLIB::ROUND_F64,
RTLIB::ROUND_F80,		RTLIB::ROUND_F80,
RTLIB::ROUND_F128,		RTLIB::ROUND_F128,
RTLIB::ROUND_PPCF128));		RTLIB::ROUND_PPCF128));
break;		break;
		case ISD::FLDEXP:
		Results.push_back(ExpandFPLibCall(Node, RTLIB::LDEXP_F32, RTLIB::LDEXP_F64,
		RTLIB::LDEXP_F80, RTLIB::LDEXP_F128,
		RTLIB::LDEXP_PPCF128));
		break;
case ISD::FPOWI:		case ISD::FPOWI:
Results.push_back(ExpandFPLibCall(Node, RTLIB::POWI_F32, RTLIB::POWI_F64,		Results.push_back(ExpandFPLibCall(Node, RTLIB::POWI_F32, RTLIB::POWI_F64,
RTLIB::POWI_F80, RTLIB::POWI_F128,		RTLIB::POWI_F80, RTLIB::POWI_F128,
RTLIB::POWI_PPCF128));		RTLIB::POWI_PPCF128));
break;		break;
case ISD::FPOW:		case ISD::FPOW:
Results.push_back(ExpandFPLibCall(Node, RTLIB::POW_F32, RTLIB::POW_F64,		Results.push_back(ExpandFPLibCall(Node, RTLIB::POW_F32, RTLIB::POW_F64,
RTLIB::POW_F80, RTLIB::POW_F128,		RTLIB::POW_F80, RTLIB::POW_F128,
... case ISD::FMA: {
Tmp3 = DAG.getNode(ISD::FP_EXTEND, dl, NVT, Node->getOperand(2));		Tmp3 = DAG.getNode(ISD::FP_EXTEND, dl, NVT, Node->getOperand(2));
Results.push_back(		Results.push_back(
DAG.getNode(ISD::FP_ROUND, dl, OVT,		DAG.getNode(ISD::FP_ROUND, dl, OVT,
DAG.getNode(Node->getOpcode(), dl, NVT, Tmp1, Tmp2, Tmp3),		DAG.getNode(Node->getOpcode(), dl, NVT, Tmp1, Tmp2, Tmp3),
DAG.getIntPtrConstant(0, dl)));		DAG.getIntPtrConstant(0, dl)));
break;		break;
}		}
case ISD::FCOPYSIGN:		case ISD::FCOPYSIGN:
		case ISD::FLDEXP:
case ISD::FPOWI: {		case ISD::FPOWI: {
Tmp1 = DAG.getNode(ISD::FP_EXTEND, dl, NVT, Node->getOperand(0));		Tmp1 = DAG.getNode(ISD::FP_EXTEND, dl, NVT, Node->getOperand(0));
Tmp2 = Node->getOperand(1);		Tmp2 = Node->getOperand(1);
Tmp3 = DAG.getNode(Node->getOpcode(), dl, NVT, Tmp1, Tmp2);		Tmp3 = DAG.getNode(Node->getOpcode(), dl, NVT, Tmp1, Tmp2);

// fcopysign doesn't change anything but the sign bit, so		// fcopysign doesn't change anything but the sign bit, so
// (fp_round (fcopysign (fpext a), b))		// (fp_round (fcopysign (fpext a), b))
// is as precise as		// is as precise as
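The inline FLDEXP expansion in this file (SINT_TO_FP, then FEXP2, then FMUL) amounts to computing x * exp2(n). A minimal sketch of that arithmetic in plain C++ follows. The helper names are hypothetical and not part of the patch; the sketch only illustrates that the product form matches ldexp for ordinary inputs but can differ when exp2(n) itself overflows, even though the true result is representable.

```cpp
#include <cassert>
#include <cmath>

// Hypothetical helper mirroring the inline expansion:
//   Tmp1 = sint_to_fp(n); Tmp2 = fexp2(Tmp1); result = fmul(x, Tmp2)
double ldexp_via_exp2(double x, int n) {
  double scale = std::exp2(static_cast<double>(n)); // may overflow to +inf
  return x * scale;
}

// Hypothetical self-check: for small exponents both forms agree.
void check_small_exponent() {
  assert(ldexp_via_exp2(1.5, 4) == std::ldexp(1.5, 4));
}
```

For a very small x and a large n (for example x = 2^-1000 and n = 1030), std::ldexp returns the finite value 2^30, while the product form goes through exp2(1030), which is not representable in double.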

lib/CodeGen/SelectionDAG/LegalizeTypes.h

... private:

// Vector Result Scalarization: <1 x ty> -> ty.		// Vector Result Scalarization: <1 x ty> -> ty.
void ScalarizeVectorResult(SDNode *N, unsigned OpNo);		void ScalarizeVectorResult(SDNode *N, unsigned OpNo);
SDValue ScalarizeVecRes_MERGE_VALUES(SDNode *N, unsigned ResNo);		SDValue ScalarizeVecRes_MERGE_VALUES(SDNode *N, unsigned ResNo);
SDValue ScalarizeVecRes_BinOp(SDNode *N);		SDValue ScalarizeVecRes_BinOp(SDNode *N);
SDValue ScalarizeVecRes_TernaryOp(SDNode *N);		SDValue ScalarizeVecRes_TernaryOp(SDNode *N);
SDValue ScalarizeVecRes_UnaryOp(SDNode *N);		SDValue ScalarizeVecRes_UnaryOp(SDNode *N);
SDValue ScalarizeVecRes_InregOp(SDNode *N);		SDValue ScalarizeVecRes_InregOp(SDNode *N);
		SDValue ScalarizeVecRes_ExpOp(SDNode *N);

SDValue ScalarizeVecRes_BITCAST(SDNode *N);		SDValue ScalarizeVecRes_BITCAST(SDNode *N);
SDValue ScalarizeVecRes_BUILD_VECTOR(SDNode *N);		SDValue ScalarizeVecRes_BUILD_VECTOR(SDNode *N);
SDValue ScalarizeVecRes_CONVERT_RNDSAT(SDNode *N);		SDValue ScalarizeVecRes_CONVERT_RNDSAT(SDNode *N);
SDValue ScalarizeVecRes_EXTRACT_SUBVECTOR(SDNode *N);		SDValue ScalarizeVecRes_EXTRACT_SUBVECTOR(SDNode *N);
SDValue ScalarizeVecRes_FP_ROUND(SDNode *N);		SDValue ScalarizeVecRes_FP_ROUND(SDNode *N);
SDValue ScalarizeVecRes_FPOWI(SDNode *N);
SDValue ScalarizeVecRes_INSERT_VECTOR_ELT(SDNode *N);		SDValue ScalarizeVecRes_INSERT_VECTOR_ELT(SDNode *N);
SDValue ScalarizeVecRes_LOAD(LoadSDNode *N);		SDValue ScalarizeVecRes_LOAD(LoadSDNode *N);
SDValue ScalarizeVecRes_SCALAR_TO_VECTOR(SDNode *N);		SDValue ScalarizeVecRes_SCALAR_TO_VECTOR(SDNode *N);
SDValue ScalarizeVecRes_VSELECT(SDNode *N);		SDValue ScalarizeVecRes_VSELECT(SDNode *N);
SDValue ScalarizeVecRes_SELECT(SDNode *N);		SDValue ScalarizeVecRes_SELECT(SDNode *N);
SDValue ScalarizeVecRes_SELECT_CC(SDNode *N);		SDValue ScalarizeVecRes_SELECT_CC(SDNode *N);
SDValue ScalarizeVecRes_SETCC(SDNode *N);		SDValue ScalarizeVecRes_SETCC(SDNode *N);
SDValue ScalarizeVecRes_UNDEF(SDNode *N);		SDValue ScalarizeVecRes_UNDEF(SDNode *N);
... private:

// Vector Result Splitting: <128 x ty> -> 2 x <64 x ty>.		// Vector Result Splitting: <128 x ty> -> 2 x <64 x ty>.
void SplitVectorResult(SDNode *N, unsigned OpNo);		void SplitVectorResult(SDNode *N, unsigned OpNo);
void SplitVecRes_BinOp(SDNode *N, SDValue &Lo, SDValue &Hi);		void SplitVecRes_BinOp(SDNode *N, SDValue &Lo, SDValue &Hi);
void SplitVecRes_TernaryOp(SDNode *N, SDValue &Lo, SDValue &Hi);		void SplitVecRes_TernaryOp(SDNode *N, SDValue &Lo, SDValue &Hi);
void SplitVecRes_UnaryOp(SDNode *N, SDValue &Lo, SDValue &Hi);		void SplitVecRes_UnaryOp(SDNode *N, SDValue &Lo, SDValue &Hi);
void SplitVecRes_ExtendOp(SDNode *N, SDValue &Lo, SDValue &Hi);		void SplitVecRes_ExtendOp(SDNode *N, SDValue &Lo, SDValue &Hi);
void SplitVecRes_InregOp(SDNode *N, SDValue &Lo, SDValue &Hi);		void SplitVecRes_InregOp(SDNode *N, SDValue &Lo, SDValue &Hi);
		void SplitVecRes_ExpOp(SDNode *N, SDValue &Lo, SDValue &Hi);

void SplitVecRes_BITCAST(SDNode *N, SDValue &Lo, SDValue &Hi);		void SplitVecRes_BITCAST(SDNode *N, SDValue &Lo, SDValue &Hi);
void SplitVecRes_BUILD_VECTOR(SDNode *N, SDValue &Lo, SDValue &Hi);		void SplitVecRes_BUILD_VECTOR(SDNode *N, SDValue &Lo, SDValue &Hi);
void SplitVecRes_CONCAT_VECTORS(SDNode *N, SDValue &Lo, SDValue &Hi);		void SplitVecRes_CONCAT_VECTORS(SDNode *N, SDValue &Lo, SDValue &Hi);
void SplitVecRes_EXTRACT_SUBVECTOR(SDNode *N, SDValue &Lo, SDValue &Hi);		void SplitVecRes_EXTRACT_SUBVECTOR(SDNode *N, SDValue &Lo, SDValue &Hi);
void SplitVecRes_INSERT_SUBVECTOR(SDNode *N, SDValue &Lo, SDValue &Hi);		void SplitVecRes_INSERT_SUBVECTOR(SDNode *N, SDValue &Lo, SDValue &Hi);
void SplitVecRes_FPOWI(SDNode *N, SDValue &Lo, SDValue &Hi);
void SplitVecRes_FCOPYSIGN(SDNode *N, SDValue &Lo, SDValue &Hi);		void SplitVecRes_FCOPYSIGN(SDNode *N, SDValue &Lo, SDValue &Hi);
void SplitVecRes_INSERT_VECTOR_ELT(SDNode *N, SDValue &Lo, SDValue &Hi);		void SplitVecRes_INSERT_VECTOR_ELT(SDNode *N, SDValue &Lo, SDValue &Hi);
void SplitVecRes_LOAD(LoadSDNode *N, SDValue &Lo, SDValue &Hi);		void SplitVecRes_LOAD(LoadSDNode *N, SDValue &Lo, SDValue &Hi);
void SplitVecRes_MLOAD(MaskedLoadSDNode *N, SDValue &Lo, SDValue &Hi);		void SplitVecRes_MLOAD(MaskedLoadSDNode *N, SDValue &Lo, SDValue &Hi);
void SplitVecRes_MGATHER(MaskedGatherSDNode *N, SDValue &Lo, SDValue &Hi);		void SplitVecRes_MGATHER(MaskedGatherSDNode *N, SDValue &Lo, SDValue &Hi);
void SplitVecRes_SCALAR_TO_VECTOR(SDNode *N, SDValue &Lo, SDValue &Hi);		void SplitVecRes_SCALAR_TO_VECTOR(SDNode *N, SDValue &Lo, SDValue &Hi);
void SplitVecRes_SETCC(SDNode *N, SDValue &Lo, SDValue &Hi);		void SplitVecRes_SETCC(SDNode *N, SDValue &Lo, SDValue &Hi);
void SplitVecRes_VECTOR_SHUFFLE(ShuffleVectorSDNode *N, SDValue &Lo,		void SplitVecRes_VECTOR_SHUFFLE(ShuffleVectorSDNode *N, SDValue &Lo,
... private:
SDValue WidenVecRes_VECTOR_SHUFFLE(ShuffleVectorSDNode *N);		SDValue WidenVecRes_VECTOR_SHUFFLE(ShuffleVectorSDNode *N);
SDValue WidenVecRes_VSETCC(SDNode* N);		SDValue WidenVecRes_VSETCC(SDNode* N);

SDValue WidenVecRes_Ternary(SDNode *N);		SDValue WidenVecRes_Ternary(SDNode *N);
SDValue WidenVecRes_Binary(SDNode *N);		SDValue WidenVecRes_Binary(SDNode *N);
SDValue WidenVecRes_BinaryCanTrap(SDNode *N);		SDValue WidenVecRes_BinaryCanTrap(SDNode *N);
SDValue WidenVecRes_Convert(SDNode *N);		SDValue WidenVecRes_Convert(SDNode *N);
SDValue WidenVecRes_FCOPYSIGN(SDNode *N);		SDValue WidenVecRes_FCOPYSIGN(SDNode *N);
SDValue WidenVecRes_POWI(SDNode *N);		SDValue WidenVecRes_ExpOp(SDNode *N);
SDValue WidenVecRes_Shift(SDNode *N);		SDValue WidenVecRes_Shift(SDNode *N);
SDValue WidenVecRes_Unary(SDNode *N);		SDValue WidenVecRes_Unary(SDNode *N);
SDValue WidenVecRes_InregOp(SDNode *N);		SDValue WidenVecRes_InregOp(SDNode *N);

// Widen Vector Operand.		// Widen Vector Operand.
bool WidenVectorOperand(SDNode *N, unsigned OpNo);		bool WidenVectorOperand(SDNode *N, unsigned OpNo);
SDValue WidenVecOp_BITCAST(SDNode *N);		SDValue WidenVecOp_BITCAST(SDNode *N);
SDValue WidenVecOp_CONCAT_VECTORS(SDNode *N);		SDValue WidenVecOp_CONCAT_VECTORS(SDNode *N);

lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp

... SDValue VectorLegalizer::LegalizeOp(SDValue Op) {
case ISD::FMINNUM:		case ISD::FMINNUM:
case ISD::FMAXNUM:		case ISD::FMAXNUM:
case ISD::FMINNAN:		case ISD::FMINNAN:
case ISD::FMAXNAN:		case ISD::FMAXNAN:
case ISD::FCOPYSIGN:		case ISD::FCOPYSIGN:
case ISD::FSQRT:		case ISD::FSQRT:
case ISD::FSIN:		case ISD::FSIN:
case ISD::FCOS:		case ISD::FCOS:
		case ISD::FLDEXP:
case ISD::FPOWI:		case ISD::FPOWI:
case ISD::FPOW:		case ISD::FPOW:
case ISD::FLOG:		case ISD::FLOG:
case ISD::FLOG2:		case ISD::FLOG2:
case ISD::FLOG10:		case ISD::FLOG10:
case ISD::FEXP:		case ISD::FEXP:
case ISD::FEXP2:		case ISD::FEXP2:
case ISD::FCEIL:		case ISD::FCEIL:

lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp

... #endif

case ISD::MERGE_VALUES: R = ScalarizeVecRes_MERGE_VALUES(N, ResNo);break;		case ISD::MERGE_VALUES: R = ScalarizeVecRes_MERGE_VALUES(N, ResNo);break;
case ISD::BITCAST: R = ScalarizeVecRes_BITCAST(N); break;		case ISD::BITCAST: R = ScalarizeVecRes_BITCAST(N); break;
case ISD::BUILD_VECTOR: R = ScalarizeVecRes_BUILD_VECTOR(N); break;		case ISD::BUILD_VECTOR: R = ScalarizeVecRes_BUILD_VECTOR(N); break;
case ISD::CONVERT_RNDSAT: R = ScalarizeVecRes_CONVERT_RNDSAT(N); break;		case ISD::CONVERT_RNDSAT: R = ScalarizeVecRes_CONVERT_RNDSAT(N); break;
case ISD::EXTRACT_SUBVECTOR: R = ScalarizeVecRes_EXTRACT_SUBVECTOR(N); break;		case ISD::EXTRACT_SUBVECTOR: R = ScalarizeVecRes_EXTRACT_SUBVECTOR(N); break;
case ISD::FP_ROUND: R = ScalarizeVecRes_FP_ROUND(N); break;		case ISD::FP_ROUND: R = ScalarizeVecRes_FP_ROUND(N); break;
case ISD::FP_ROUND_INREG: R = ScalarizeVecRes_InregOp(N); break;		case ISD::FP_ROUND_INREG: R = ScalarizeVecRes_InregOp(N); break;
case ISD::FPOWI: R = ScalarizeVecRes_FPOWI(N); break;		case ISD::FLDEXP:
		case ISD::FPOWI: R = ScalarizeVecRes_ExpOp(N); break;
case ISD::INSERT_VECTOR_ELT: R = ScalarizeVecRes_INSERT_VECTOR_ELT(N); break;		case ISD::INSERT_VECTOR_ELT: R = ScalarizeVecRes_INSERT_VECTOR_ELT(N); break;
case ISD::LOAD: R = ScalarizeVecRes_LOAD(cast<LoadSDNode>(N));break;		case ISD::LOAD: R = ScalarizeVecRes_LOAD(cast<LoadSDNode>(N));break;
case ISD::SCALAR_TO_VECTOR: R = ScalarizeVecRes_SCALAR_TO_VECTOR(N); break;		case ISD::SCALAR_TO_VECTOR: R = ScalarizeVecRes_SCALAR_TO_VECTOR(N); break;
case ISD::SIGN_EXTEND_INREG: R = ScalarizeVecRes_InregOp(N); break;		case ISD::SIGN_EXTEND_INREG: R = ScalarizeVecRes_InregOp(N); break;
case ISD::VSELECT: R = ScalarizeVecRes_VSELECT(N); break;		case ISD::VSELECT: R = ScalarizeVecRes_VSELECT(N); break;
case ISD::SELECT: R = ScalarizeVecRes_SELECT(N); break;		case ISD::SELECT: R = ScalarizeVecRes_SELECT(N); break;
case ISD::SELECT_CC: R = ScalarizeVecRes_SELECT_CC(N); break;		case ISD::SELECT_CC: R = ScalarizeVecRes_SELECT_CC(N); break;
case ISD::SETCC: R = ScalarizeVecRes_SETCC(N); break;		case ISD::SETCC: R = ScalarizeVecRes_SETCC(N); break;
...

SDValue DAGTypeLegalizer::ScalarizeVecRes_FP_ROUND(SDNode *N) {		SDValue DAGTypeLegalizer::ScalarizeVecRes_FP_ROUND(SDNode *N) {
EVT NewVT = N->getValueType(0).getVectorElementType();		EVT NewVT = N->getValueType(0).getVectorElementType();
SDValue Op = GetScalarizedVector(N->getOperand(0));		SDValue Op = GetScalarizedVector(N->getOperand(0));
return DAG.getNode(ISD::FP_ROUND, SDLoc(N),		return DAG.getNode(ISD::FP_ROUND, SDLoc(N),
NewVT, Op, N->getOperand(1));		NewVT, Op, N->getOperand(1));
}		}

SDValue DAGTypeLegalizer::ScalarizeVecRes_FPOWI(SDNode *N) {		SDValue DAGTypeLegalizer::ScalarizeVecRes_ExpOp(SDNode *N) {
SDValue Op = GetScalarizedVector(N->getOperand(0));		SDValue Op = GetScalarizedVector(N->getOperand(0));
return DAG.getNode(ISD::FPOWI, SDLoc(N),		return DAG.getNode(N->getOpcode(), SDLoc(N), Op.getValueType(), Op,
Op.getValueType(), Op, N->getOperand(1));		N->getOperand(1));
}		}

SDValue DAGTypeLegalizer::ScalarizeVecRes_INSERT_VECTOR_ELT(SDNode *N) {		SDValue DAGTypeLegalizer::ScalarizeVecRes_INSERT_VECTOR_ELT(SDNode *N) {
// The value to insert may have a wider type than the vector element type,		// The value to insert may have a wider type than the vector element type,
// so be sure to truncate it to the element type if necessary.		// so be sure to truncate it to the element type if necessary.
SDValue Op = N->getOperand(1);		SDValue Op = N->getOperand(1);
EVT EltVT = N->getValueType(0).getVectorElementType();		EVT EltVT = N->getValueType(0).getVectorElementType();
if (Op.getValueType() != EltVT)		if (Op.getValueType() != EltVT)
... #endif
case ISD::SELECT_CC: SplitRes_SELECT_CC(N, Lo, Hi); break;		case ISD::SELECT_CC: SplitRes_SELECT_CC(N, Lo, Hi); break;
case ISD::UNDEF: SplitRes_UNDEF(N, Lo, Hi); break;		case ISD::UNDEF: SplitRes_UNDEF(N, Lo, Hi); break;
case ISD::BITCAST: SplitVecRes_BITCAST(N, Lo, Hi); break;		case ISD::BITCAST: SplitVecRes_BITCAST(N, Lo, Hi); break;
case ISD::BUILD_VECTOR: SplitVecRes_BUILD_VECTOR(N, Lo, Hi); break;		case ISD::BUILD_VECTOR: SplitVecRes_BUILD_VECTOR(N, Lo, Hi); break;
case ISD::CONCAT_VECTORS: SplitVecRes_CONCAT_VECTORS(N, Lo, Hi); break;		case ISD::CONCAT_VECTORS: SplitVecRes_CONCAT_VECTORS(N, Lo, Hi); break;
case ISD::EXTRACT_SUBVECTOR: SplitVecRes_EXTRACT_SUBVECTOR(N, Lo, Hi); break;		case ISD::EXTRACT_SUBVECTOR: SplitVecRes_EXTRACT_SUBVECTOR(N, Lo, Hi); break;
case ISD::INSERT_SUBVECTOR: SplitVecRes_INSERT_SUBVECTOR(N, Lo, Hi); break;		case ISD::INSERT_SUBVECTOR: SplitVecRes_INSERT_SUBVECTOR(N, Lo, Hi); break;
case ISD::FP_ROUND_INREG: SplitVecRes_InregOp(N, Lo, Hi); break;		case ISD::FP_ROUND_INREG: SplitVecRes_InregOp(N, Lo, Hi); break;
case ISD::FPOWI: SplitVecRes_FPOWI(N, Lo, Hi); break;		case ISD::FLDEXP:
		case ISD::FPOWI: SplitVecRes_ExpOp(N, Lo, Hi); break;
case ISD::FCOPYSIGN: SplitVecRes_FCOPYSIGN(N, Lo, Hi); break;		case ISD::FCOPYSIGN: SplitVecRes_FCOPYSIGN(N, Lo, Hi); break;
case ISD::INSERT_VECTOR_ELT: SplitVecRes_INSERT_VECTOR_ELT(N, Lo, Hi); break;		case ISD::INSERT_VECTOR_ELT: SplitVecRes_INSERT_VECTOR_ELT(N, Lo, Hi); break;
case ISD::SCALAR_TO_VECTOR: SplitVecRes_SCALAR_TO_VECTOR(N, Lo, Hi); break;		case ISD::SCALAR_TO_VECTOR: SplitVecRes_SCALAR_TO_VECTOR(N, Lo, Hi); break;
case ISD::SIGN_EXTEND_INREG: SplitVecRes_InregOp(N, Lo, Hi); break;		case ISD::SIGN_EXTEND_INREG: SplitVecRes_InregOp(N, Lo, Hi); break;
case ISD::LOAD:		case ISD::LOAD:
SplitVecRes_LOAD(cast<LoadSDNode>(N), Lo, Hi);		SplitVecRes_LOAD(cast<LoadSDNode>(N), Lo, Hi);
break;		break;
case ISD::MLOAD:		case ISD::MLOAD:
... StackPtr =
DAG.getNode(ISD::ADD, dl, StackPtr.getValueType(), StackPtr,		DAG.getNode(ISD::ADD, dl, StackPtr.getValueType(), StackPtr,
DAG.getConstant(IncrementSize, dl, StackPtr.getValueType()));		DAG.getConstant(IncrementSize, dl, StackPtr.getValueType()));

// Load the Hi part from the stack slot.		// Load the Hi part from the stack slot.
Hi = DAG.getLoad(Hi.getValueType(), dl, Store, StackPtr, MachinePointerInfo(),		Hi = DAG.getLoad(Hi.getValueType(), dl, Store, StackPtr, MachinePointerInfo(),
false, false, false, MinAlign(Alignment, IncrementSize));		false, false, false, MinAlign(Alignment, IncrementSize));
}		}

void DAGTypeLegalizer::SplitVecRes_FPOWI(SDNode *N, SDValue &Lo,		void DAGTypeLegalizer::SplitVecRes_ExpOp(SDNode *N, SDValue &Lo, SDValue &Hi) {
SDValue &Hi) {
SDLoc dl(N);		SDLoc dl(N);
GetSplitVector(N->getOperand(0), Lo, Hi);		GetSplitVector(N->getOperand(0), Lo, Hi);
Lo = DAG.getNode(ISD::FPOWI, dl, Lo.getValueType(), Lo, N->getOperand(1));		Lo = DAG.getNode(N->getOpcode(), dl, Lo.getValueType(), Lo, N->getOperand(1));
Hi = DAG.getNode(ISD::FPOWI, dl, Hi.getValueType(), Hi, N->getOperand(1));		Hi = DAG.getNode(N->getOpcode(), dl, Hi.getValueType(), Hi, N->getOperand(1));
}		}

void DAGTypeLegalizer::SplitVecRes_FCOPYSIGN(SDNode *N, SDValue &Lo,		void DAGTypeLegalizer::SplitVecRes_FCOPYSIGN(SDNode *N, SDValue &Lo,
SDValue &Hi) {		SDValue &Hi) {
... #endif
case ISD::UREM:		case ISD::UREM:
Res = WidenVecRes_BinaryCanTrap(N);		Res = WidenVecRes_BinaryCanTrap(N);
break;		break;

case ISD::FCOPYSIGN:		case ISD::FCOPYSIGN:
Res = WidenVecRes_FCOPYSIGN(N);		Res = WidenVecRes_FCOPYSIGN(N);
break;		break;

		case ISD::FLDEXP:
case ISD::FPOWI:		case ISD::FPOWI:
Res = WidenVecRes_POWI(N);		Res = WidenVecRes_ExpOp(N);
break;		break;

case ISD::SHL:		case ISD::SHL:
case ISD::SRA:		case ISD::SRA:
case ISD::SRL:		case ISD::SRL:
Res = WidenVecRes_Shift(N);		Res = WidenVecRes_Shift(N);
break;		break;

... SDValue DAGTypeLegalizer::WidenVecRes_FCOPYSIGN(SDNode *N) {
if (N->getOperand(0).getValueType() == N->getOperand(1).getValueType())		if (N->getOperand(0).getValueType() == N->getOperand(1).getValueType())
return WidenVecRes_BinaryCanTrap(N);		return WidenVecRes_BinaryCanTrap(N);

// If the types are different, fall back to unrolling.		// If the types are different, fall back to unrolling.
EVT WidenVT = TLI.getTypeToTransformTo(*DAG.getContext(), N->getValueType(0));		EVT WidenVT = TLI.getTypeToTransformTo(*DAG.getContext(), N->getValueType(0));
return DAG.UnrollVectorOp(N, WidenVT.getVectorNumElements());		return DAG.UnrollVectorOp(N, WidenVT.getVectorNumElements());
}		}

SDValue DAGTypeLegalizer::WidenVecRes_POWI(SDNode *N) {		SDValue DAGTypeLegalizer::WidenVecRes_ExpOp(SDNode *N) {
EVT WidenVT = TLI.getTypeToTransformTo(*DAG.getContext(), N->getValueType(0));		EVT WidenVT = TLI.getTypeToTransformTo(*DAG.getContext(), N->getValueType(0));
SDValue InOp = GetWidenedVector(N->getOperand(0));		SDValue InOp = GetWidenedVector(N->getOperand(0));
SDValue ShOp = N->getOperand(1);		SDValue ShOp = N->getOperand(1);
return DAG.getNode(N->getOpcode(), SDLoc(N), WidenVT, InOp, ShOp);		return DAG.getNode(N->getOpcode(), SDLoc(N), WidenVT, InOp, ShOp);
}		}

SDValue DAGTypeLegalizer::WidenVecRes_Shift(SDNode *N) {		SDValue DAGTypeLegalizer::WidenVecRes_Shift(SDNode *N) {
EVT WidenVT = TLI.getTypeToTransformTo(*DAG.getContext(), N->getValueType(0));		EVT WidenVT = TLI.getTypeToTransformTo(*DAG.getContext(), N->getValueType(0));

lib/CodeGen/SelectionDAG/SelectionDAGBuilder.h

... private:
bool visitMemCmpCall(const CallInst &I);		bool visitMemCmpCall(const CallInst &I);
bool visitMemChrCall(const CallInst &I);		bool visitMemChrCall(const CallInst &I);
bool visitStrCpyCall(const CallInst &I, bool isStpcpy);		bool visitStrCpyCall(const CallInst &I, bool isStpcpy);
bool visitStrCmpCall(const CallInst &I);		bool visitStrCmpCall(const CallInst &I);
bool visitStrLenCall(const CallInst &I);		bool visitStrLenCall(const CallInst &I);
bool visitStrNLenCall(const CallInst &I);		bool visitStrNLenCall(const CallInst &I);
bool visitUnaryFloatCall(const CallInst &I, unsigned Opcode);		bool visitUnaryFloatCall(const CallInst &I, unsigned Opcode);
bool visitBinaryFloatCall(const CallInst &I, unsigned Opcode);		bool visitBinaryFloatCall(const CallInst &I, unsigned Opcode);
		bool visitLdExpCall(const CallInst &I);
void visitAtomicLoad(const LoadInst &I);		void visitAtomicLoad(const LoadInst &I);
void visitAtomicStore(const StoreInst &I);		void visitAtomicStore(const StoreInst &I);

void visitInlineAsm(ImmutableCallSite CS);		void visitInlineAsm(ImmutableCallSite CS);
const char *visitIntrinsicCall(const CallInst &I, unsigned Intrinsic);		const char *visitIntrinsicCall(const CallInst &I, unsigned Intrinsic);
void visitTargetIntrinsic(const CallInst &I, unsigned Intrinsic);		void visitTargetIntrinsic(const CallInst &I, unsigned Intrinsic);

void visitVAStart(const CallInst &I);		void visitVAStart(const CallInst &I);

lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

... if (!F->optForSize() ||
return Res;		return Res;
}		}
}		}

// Otherwise, expand to a libcall.		// Otherwise, expand to a libcall.
return DAG.getNode(ISD::FPOWI, DL, LHS.getValueType(), LHS, RHS);		return DAG.getNode(ISD::FPOWI, DL, LHS.getValueType(), LHS, RHS);
}		}

		/// ExpandLdExp - Expand a llvm.ldexp intrinsic.
		static SDValue ExpandLdExp(SDLoc DL, SDValue Op1, SDValue Op2,
		SelectionDAG &DAG) {
		return DAG.getNode(ISD::FLDEXP, DL, Op1.getValueType(), Op1, Op2);
		}

// getUnderlyingArgReg - Find underlying register used for a truncated or		// getUnderlyingArgReg - Find underlying register used for a truncated or
// bitcasted argument.		// bitcasted argument.
static unsigned getUnderlyingArgReg(const SDValue &N) {		static unsigned getUnderlyingArgReg(const SDValue &N) {
switch (N.getOpcode()) {		switch (N.getOpcode()) {
case ISD::CopyFromReg:		case ISD::CopyFromReg:
return cast<RegisterSDNode>(N.getOperand(1))->getReg();		return cast<RegisterSDNode>(N.getOperand(1))->getReg();
case ISD::BITCAST:		case ISD::BITCAST:
case ISD::AssertZext:		case ISD::AssertZext:
... Res = DAG.getConvertRndSat(DestVT, sdl, getValue(Op1),
Code);		Code);
setValue(&I, Res);		setValue(&I, Res);
return nullptr;		return nullptr;
}		}
case Intrinsic::powi:		case Intrinsic::powi:
setValue(&I, ExpandPowI(sdl, getValue(I.getArgOperand(0)),		setValue(&I, ExpandPowI(sdl, getValue(I.getArgOperand(0)),
getValue(I.getArgOperand(1)), DAG));		getValue(I.getArgOperand(1)), DAG));
return nullptr;		return nullptr;
		case Intrinsic::ldexp:
		setValue(&I, ExpandLdExp(sdl, getValue(I.getArgOperand(0)),
		getValue(I.getArgOperand(1)), DAG));
		return nullptr;
case Intrinsic::log:		case Intrinsic::log:
setValue(&I, expandLog(sdl, getValue(I.getArgOperand(0)), DAG, TLI));		setValue(&I, expandLog(sdl, getValue(I.getArgOperand(0)), DAG, TLI));
return nullptr;		return nullptr;
case Intrinsic::log2:		case Intrinsic::log2:
setValue(&I, expandLog2(sdl, getValue(I.getArgOperand(0)), DAG, TLI));		setValue(&I, expandLog2(sdl, getValue(I.getArgOperand(0)), DAG, TLI));
return nullptr;		return nullptr;
case Intrinsic::log10:		case Intrinsic::log10:
setValue(&I, expandLog10(sdl, getValue(I.getArgOperand(0)), DAG, TLI));		setValue(&I, expandLog10(sdl, getValue(I.getArgOperand(0)), DAG, TLI));
... bool SelectionDAGBuilder::visitBinaryFloatCall(const CallInst &I,

SDValue Tmp0 = getValue(I.getArgOperand(0));		SDValue Tmp0 = getValue(I.getArgOperand(0));
SDValue Tmp1 = getValue(I.getArgOperand(1));		SDValue Tmp1 = getValue(I.getArgOperand(1));
EVT VT = Tmp0.getValueType();		EVT VT = Tmp0.getValueType();
setValue(&I, DAG.getNode(Opcode, getCurSDLoc(), VT, Tmp0, Tmp1));		setValue(&I, DAG.getNode(Opcode, getCurSDLoc(), VT, Tmp0, Tmp1));
return true;		return true;
}		}

		/// visitLdExpCall - If a call instruction fits a ldexp call (as expected),
		/// translate it to an SDNode with opcode FLDEXP and return true.
		bool SelectionDAGBuilder::visitLdExpCall(const CallInst &I) {
		if (I.getNumArgOperands() != 2 ||
		!I.getArgOperand(0)->getType()->isFloatingPointTy() ||
		!I.getArgOperand(1)->getType()->isIntegerTy() ||
		I.getType() != I.getArgOperand(0)->getType() || !I.onlyReadsMemory())
		return false;

		SDValue Tmp0 = getValue(I.getArgOperand(0));
		SDValue Tmp1 = getValue(I.getArgOperand(1));
		EVT VT = Tmp0.getValueType();
		setValue(&I, DAG.getNode(ISD::FLDEXP, getCurSDLoc(), VT, Tmp0, Tmp1));
		return true;
		}

void SelectionDAGBuilder::visitCall(const CallInst &I) {		void SelectionDAGBuilder::visitCall(const CallInst &I) {
// Handle inline assembly differently.		// Handle inline assembly differently.
if (isa<InlineAsm>(I.getCalledValue())) {		if (isa<InlineAsm>(I.getCalledValue())) {
visitInlineAsm(&I);		visitInlineAsm(&I);
return;		return;
}		}

MachineModuleInfo &MMI = DAG.getMachineFunction().getMMI();		MachineModuleInfo &MMI = DAG.getMachineFunction().getMMI();
... if (!F->hasLocalLinkage() && F->hasName() &&
return;		return;
break;		break;
case LibFunc::exp2:		case LibFunc::exp2:
case LibFunc::exp2f:		case LibFunc::exp2f:
case LibFunc::exp2l:		case LibFunc::exp2l:
if (visitUnaryFloatCall(I, ISD::FEXP2))		if (visitUnaryFloatCall(I, ISD::FEXP2))
return;		return;
break;		break;
		case LibFunc::ldexp:
		case LibFunc::ldexpf:
		case LibFunc::ldexpl:
		if (visitLdExpCall(I))
		return;
		break;
case LibFunc::memcmp:		case LibFunc::memcmp:
if (visitMemCmpCall(I))		if (visitMemCmpCall(I))
return;		return;
break;		break;
case LibFunc::memchr:		case LibFunc::memchr:
if (visitMemChrCall(I))		if (visitMemChrCall(I))
return;		return;
break;		break;

lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp

... #endif
case ISD::FCOPYSIGN: return "fcopysign";		case ISD::FCOPYSIGN: return "fcopysign";
case ISD::FGETSIGN: return "fgetsign";		case ISD::FGETSIGN: return "fgetsign";
case ISD::FPOW: return "fpow";		case ISD::FPOW: return "fpow";
case ISD::SMIN: return "smin";		case ISD::SMIN: return "smin";
case ISD::SMAX: return "smax";		case ISD::SMAX: return "smax";
case ISD::UMIN: return "umin";		case ISD::UMIN: return "umin";
case ISD::UMAX: return "umax";		case ISD::UMAX: return "umax";

		case ISD::FLDEXP: return "fldexp";
case ISD::FPOWI: return "fpowi";		case ISD::FPOWI: return "fpowi";
case ISD::SETCC: return "setcc";		case ISD::SETCC: return "setcc";
case ISD::SELECT: return "select";		case ISD::SELECT: return "select";
case ISD::VSELECT: return "vselect";		case ISD::VSELECT: return "vselect";
case ISD::SELECT_CC: return "select_cc";		case ISD::SELECT_CC: return "select_cc";
case ISD::INSERT_VECTOR_ELT: return "insert_vector_elt";		case ISD::INSERT_VECTOR_ELT: return "insert_vector_elt";
case ISD::EXTRACT_VECTOR_ELT: return "extract_vector_elt";		case ISD::EXTRACT_VECTOR_ELT: return "extract_vector_elt";
case ISD::CONCAT_VECTORS: return "concat_vectors";		case ISD::CONCAT_VECTORS: return "concat_vectors";

lib/CodeGen/TargetLoweringBase.cpp

... static void InitLibcallNames(const char **Names, const Triple &TT) {
Names[RTLIB::EXP_F80] = "expl";		Names[RTLIB::EXP_F80] = "expl";
Names[RTLIB::EXP_F128] = "expl";		Names[RTLIB::EXP_F128] = "expl";
Names[RTLIB::EXP_PPCF128] = "expl";		Names[RTLIB::EXP_PPCF128] = "expl";
Names[RTLIB::EXP2_F32] = "exp2f";		Names[RTLIB::EXP2_F32] = "exp2f";
Names[RTLIB::EXP2_F64] = "exp2";		Names[RTLIB::EXP2_F64] = "exp2";
Names[RTLIB::EXP2_F80] = "exp2l";		Names[RTLIB::EXP2_F80] = "exp2l";
Names[RTLIB::EXP2_F128] = "exp2l";		Names[RTLIB::EXP2_F128] = "exp2l";
Names[RTLIB::EXP2_PPCF128] = "exp2l";		Names[RTLIB::EXP2_PPCF128] = "exp2l";
		Names[RTLIB::LDEXP_F32] = "ldexpf";
		Names[RTLIB::LDEXP_F64] = "ldexp";
		Names[RTLIB::LDEXP_F80] = "ldexpl";
		Names[RTLIB::LDEXP_F128] = "ldexpl";
		Names[RTLIB::LDEXP_PPCF128] = "ldexpl";
Names[RTLIB::SIN_F32] = "sinf";		Names[RTLIB::SIN_F32] = "sinf";
Names[RTLIB::SIN_F64] = "sin";		Names[RTLIB::SIN_F64] = "sin";
Names[RTLIB::SIN_F80] = "sinl";		Names[RTLIB::SIN_F80] = "sinl";
Names[RTLIB::SIN_F128] = "sinl";		Names[RTLIB::SIN_F128] = "sinl";
Names[RTLIB::SIN_PPCF128] = "sinl";		Names[RTLIB::SIN_PPCF128] = "sinl";
Names[RTLIB::COS_F32] = "cosf";		Names[RTLIB::COS_F32] = "cosf";
Names[RTLIB::COS_F64] = "cos";		Names[RTLIB::COS_F64] = "cos";
Names[RTLIB::COS_F80] = "cosl";		Names[RTLIB::COS_F80] = "cosl";
... static void InitLibcallNames(const char **Names, const Triple &TT) {

// For f16/f32 conversions, Darwin uses the standard naming scheme, instead		// For f16/f32 conversions, Darwin uses the standard naming scheme, instead
// of the gnueabi-style __gnu_*_ieee.		// of the gnueabi-style __gnu_*_ieee.
// FIXME: What about other targets?		// FIXME: What about other targets?
if (TT.isOSDarwin()) {		if (TT.isOSDarwin()) {
Names[RTLIB::FPEXT_F16_F32] = "__extendhfsf2";		Names[RTLIB::FPEXT_F16_F32] = "__extendhfsf2";
Names[RTLIB::FPROUND_F32_F16] = "__truncsfhf2";		Names[RTLIB::FPROUND_F32_F16] = "__truncsfhf2";
}		}

		if (TT.isOSWindows() && !TT.isOSCygMing()) {
		Names[RTLIB::LDEXP_F32] = nullptr;
		Names[RTLIB::LDEXP_F80] = nullptr;
		Names[RTLIB::LDEXP_F128] = nullptr;
		Names[RTLIB::LDEXP_PPCF128] = nullptr;
		}
}		}
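
The Windows override above clears some of the `ldexp` libcall names. The following is a minimal, self-contained sketch of that pattern, not the patch's code: the `Libcall` enum, `initLibcallNames`, `hasLibcall`, and the `IsMSVCWindows` flag are hypothetical stand-ins for `RTLIB::Libcall`, `InitLibcallNames`, and the `TT.isOSWindows() && !TT.isOSCygMing()` check. It assumes a null name is how "no libcall available" is signalled, in which case legalization would presumably have to fall back to the inline expansion mentioned in the summary.

```cpp
#include <cstddef>

// Hypothetical stand-in for the RTLIB::LDEXP_* entries added by the patch.
enum Libcall { LDEXP_F32, LDEXP_F64, LDEXP_F80, NUM_LIBCALLS };

// Fill in the default names, then clear the ones assumed to be missing
// when targeting Windows without a Cygwin/MinGW environment.
void initLibcallNames(const char **Names, bool IsMSVCWindows) {
  Names[LDEXP_F32] = "ldexpf";
  Names[LDEXP_F64] = "ldexp";
  Names[LDEXP_F80] = "ldexpl";

  if (IsMSVCWindows) {
    // Only the f64 variant keeps its name, mirroring the diff above.
    Names[LDEXP_F32] = nullptr;
    Names[LDEXP_F80] = nullptr;
  }
}

// A null entry is treated as "no library routine to call".
bool hasLibcall(const char *const *Names, Libcall LC) {
  return Names[LC] != nullptr;
}
```

A caller would check `hasLibcall` before emitting a call and choose another expansion when it returns false.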

/// InitLibcallCallingConvs - Set default libcall CallingConvs.		/// InitLibcallCallingConvs - Set default libcall CallingConvs.
///		///
static void InitLibcallCallingConvs(CallingConv::ID *CCs) {		static void InitLibcallCallingConvs(CallingConv::ID *CCs) {
for (int i = 0; i < RTLIB::UNKNOWN_LIBCALL; ++i) {		for (int i = 0; i < RTLIB::UNKNOWN_LIBCALL; ++i) {
CCs[i] = CallingConv::C;		CCs[i] = CallingConv::C;
}		}
[...]	void TargetLoweringBase::initActions() {
// to optimize expansions for certain constants.		// to optimize expansions for certain constants.
setOperationAction(ISD::ConstantFP, MVT::f16, Expand);		setOperationAction(ISD::ConstantFP, MVT::f16, Expand);
setOperationAction(ISD::ConstantFP, MVT::f32, Expand);		setOperationAction(ISD::ConstantFP, MVT::f32, Expand);
setOperationAction(ISD::ConstantFP, MVT::f64, Expand);		setOperationAction(ISD::ConstantFP, MVT::f64, Expand);
setOperationAction(ISD::ConstantFP, MVT::f80, Expand);		setOperationAction(ISD::ConstantFP, MVT::f80, Expand);
setOperationAction(ISD::ConstantFP, MVT::f128, Expand);		setOperationAction(ISD::ConstantFP, MVT::f128, Expand);

// These library functions default to expand.		// These library functions default to expand.
for (MVT VT : {MVT::f32, MVT::f64, MVT::f128}) {		for (MVT VT : {MVT::f32, MVT::f64, MVT::f128}) {
		hfinkel: You should add FLDEXP here too.
setOperationAction(ISD::FLOG , VT, Expand);		setOperationAction(ISD::FLOG , VT, Expand);
setOperationAction(ISD::FLOG2, VT, Expand);		setOperationAction(ISD::FLOG2, VT, Expand);
setOperationAction(ISD::FLOG10, VT, Expand);		setOperationAction(ISD::FLOG10, VT, Expand);
setOperationAction(ISD::FEXP , VT, Expand);		setOperationAction(ISD::FEXP , VT, Expand);
setOperationAction(ISD::FEXP2, VT, Expand);		setOperationAction(ISD::FEXP2, VT, Expand);
setOperationAction(ISD::FFLOOR, VT, Expand);		setOperationAction(ISD::FFLOOR, VT, Expand);
setOperationAction(ISD::FMINNUM, VT, Expand);		setOperationAction(ISD::FMINNUM, VT, Expand);
setOperationAction(ISD::FMAXNUM, VT, Expand);		setOperationAction(ISD::FMAXNUM, VT, Expand);

lib/Target/AArch64/AArch64ISelLowering.cpp

[...]	AArch64TargetLowering::AArch64TargetLowering(const TargetMachine &TM,
setOperationAction(ISD::UMULO, MVT::i64, Custom);		setOperationAction(ISD::UMULO, MVT::i64, Custom);

setOperationAction(ISD::FSIN, MVT::f32, Expand);		setOperationAction(ISD::FSIN, MVT::f32, Expand);
setOperationAction(ISD::FSIN, MVT::f64, Expand);		setOperationAction(ISD::FSIN, MVT::f64, Expand);
setOperationAction(ISD::FCOS, MVT::f32, Expand);		setOperationAction(ISD::FCOS, MVT::f32, Expand);
setOperationAction(ISD::FCOS, MVT::f64, Expand);		setOperationAction(ISD::FCOS, MVT::f64, Expand);
setOperationAction(ISD::FPOW, MVT::f32, Expand);		setOperationAction(ISD::FPOW, MVT::f32, Expand);
setOperationAction(ISD::FPOW, MVT::f64, Expand);		setOperationAction(ISD::FPOW, MVT::f64, Expand);
		setOperationAction(ISD::FLDEXP, MVT::f32, Expand);
		setOperationAction(ISD::FLDEXP, MVT::f64, Expand);
setOperationAction(ISD::FCOPYSIGN, MVT::f64, Custom);		setOperationAction(ISD::FCOPYSIGN, MVT::f64, Custom);
setOperationAction(ISD::FCOPYSIGN, MVT::f32, Custom);		setOperationAction(ISD::FCOPYSIGN, MVT::f32, Custom);

// f16 is a storage-only type, always promote it to f32.		// f16 is a storage-only type, always promote it to f32.
setOperationAction(ISD::SETCC, MVT::f16, Promote);		setOperationAction(ISD::SETCC, MVT::f16, Promote);
setOperationAction(ISD::BR_CC, MVT::f16, Promote);		setOperationAction(ISD::BR_CC, MVT::f16, Promote);
setOperationAction(ISD::SELECT_CC, MVT::f16, Promote);		setOperationAction(ISD::SELECT_CC, MVT::f16, Promote);
setOperationAction(ISD::SELECT, MVT::f16, Promote);		setOperationAction(ISD::SELECT, MVT::f16, Promote);
setOperationAction(ISD::FADD, MVT::f16, Promote);		setOperationAction(ISD::FADD, MVT::f16, Promote);
setOperationAction(ISD::FSUB, MVT::f16, Promote);		setOperationAction(ISD::FSUB, MVT::f16, Promote);
setOperationAction(ISD::FMUL, MVT::f16, Promote);		setOperationAction(ISD::FMUL, MVT::f16, Promote);
setOperationAction(ISD::FDIV, MVT::f16, Promote);		setOperationAction(ISD::FDIV, MVT::f16, Promote);
setOperationAction(ISD::FREM, MVT::f16, Promote);		setOperationAction(ISD::FREM, MVT::f16, Promote);
setOperationAction(ISD::FMA, MVT::f16, Promote);		setOperationAction(ISD::FMA, MVT::f16, Promote);
setOperationAction(ISD::FNEG, MVT::f16, Promote);		setOperationAction(ISD::FNEG, MVT::f16, Promote);
setOperationAction(ISD::FABS, MVT::f16, Promote);		setOperationAction(ISD::FABS, MVT::f16, Promote);
setOperationAction(ISD::FCEIL, MVT::f16, Promote);		setOperationAction(ISD::FCEIL, MVT::f16, Promote);
setOperationAction(ISD::FCOPYSIGN, MVT::f16, Promote);		setOperationAction(ISD::FCOPYSIGN, MVT::f16, Promote);
setOperationAction(ISD::FCOS, MVT::f16, Promote);		setOperationAction(ISD::FCOS, MVT::f16, Promote);
setOperationAction(ISD::FFLOOR, MVT::f16, Promote);		setOperationAction(ISD::FFLOOR, MVT::f16, Promote);
setOperationAction(ISD::FNEARBYINT, MVT::f16, Promote);		setOperationAction(ISD::FNEARBYINT, MVT::f16, Promote);
setOperationAction(ISD::FPOW, MVT::f16, Promote);		setOperationAction(ISD::FPOW, MVT::f16, Promote);
setOperationAction(ISD::FPOWI, MVT::f16, Promote);		setOperationAction(ISD::FPOWI, MVT::f16, Promote);
		setOperationAction(ISD::FLDEXP, MVT::f16, Promote);
setOperationAction(ISD::FRINT, MVT::f16, Promote);		setOperationAction(ISD::FRINT, MVT::f16, Promote);
setOperationAction(ISD::FSIN, MVT::f16, Promote);		setOperationAction(ISD::FSIN, MVT::f16, Promote);
setOperationAction(ISD::FSINCOS, MVT::f16, Promote);		setOperationAction(ISD::FSINCOS, MVT::f16, Promote);
setOperationAction(ISD::FSQRT, MVT::f16, Promote);		setOperationAction(ISD::FSQRT, MVT::f16, Promote);
setOperationAction(ISD::FEXP, MVT::f16, Promote);		setOperationAction(ISD::FEXP, MVT::f16, Promote);
setOperationAction(ISD::FEXP2, MVT::f16, Promote);		setOperationAction(ISD::FEXP2, MVT::f16, Promote);
setOperationAction(ISD::FLOG, MVT::f16, Promote);		setOperationAction(ISD::FLOG, MVT::f16, Promote);
setOperationAction(ISD::FLOG2, MVT::f16, Promote);		setOperationAction(ISD::FLOG2, MVT::f16, Promote);
[...]	AArch64TargetLowering::AArch64TargetLowering(const TargetMachine &TM,
setOperationAction(ISD::FCOPYSIGN, MVT::v4f16, Expand);		setOperationAction(ISD::FCOPYSIGN, MVT::v4f16, Expand);
setOperationAction(ISD::FCOS, MVT::v4f16, Expand);		setOperationAction(ISD::FCOS, MVT::v4f16, Expand);
setOperationAction(ISD::FFLOOR, MVT::v4f16, Expand);		setOperationAction(ISD::FFLOOR, MVT::v4f16, Expand);
setOperationAction(ISD::FMA, MVT::v4f16, Expand);		setOperationAction(ISD::FMA, MVT::v4f16, Expand);
setOperationAction(ISD::FNEARBYINT, MVT::v4f16, Expand);		setOperationAction(ISD::FNEARBYINT, MVT::v4f16, Expand);
setOperationAction(ISD::FNEG, MVT::v4f16, Expand);		setOperationAction(ISD::FNEG, MVT::v4f16, Expand);
setOperationAction(ISD::FPOW, MVT::v4f16, Expand);		setOperationAction(ISD::FPOW, MVT::v4f16, Expand);
setOperationAction(ISD::FPOWI, MVT::v4f16, Expand);		setOperationAction(ISD::FPOWI, MVT::v4f16, Expand);
		setOperationAction(ISD::FLDEXP, MVT::v4f16, Expand);
setOperationAction(ISD::FREM, MVT::v4f16, Expand);		setOperationAction(ISD::FREM, MVT::v4f16, Expand);
setOperationAction(ISD::FROUND, MVT::v4f16, Expand);		setOperationAction(ISD::FROUND, MVT::v4f16, Expand);
setOperationAction(ISD::FRINT, MVT::v4f16, Expand);		setOperationAction(ISD::FRINT, MVT::v4f16, Expand);
setOperationAction(ISD::FSIN, MVT::v4f16, Expand);		setOperationAction(ISD::FSIN, MVT::v4f16, Expand);
setOperationAction(ISD::FSINCOS, MVT::v4f16, Expand);		setOperationAction(ISD::FSINCOS, MVT::v4f16, Expand);
setOperationAction(ISD::FSQRT, MVT::v4f16, Expand);		setOperationAction(ISD::FSQRT, MVT::v4f16, Expand);
setOperationAction(ISD::FTRUNC, MVT::v4f16, Expand);		setOperationAction(ISD::FTRUNC, MVT::v4f16, Expand);
setOperationAction(ISD::SETCC, MVT::v4f16, Expand);		setOperationAction(ISD::SETCC, MVT::v4f16, Expand);
[...]	AArch64TargetLowering::AArch64TargetLowering(const TargetMachine &TM,
setOperationAction(ISD::FDIV, MVT::v8f16, Expand);		setOperationAction(ISD::FDIV, MVT::v8f16, Expand);
setOperationAction(ISD::FFLOOR, MVT::v8f16, Expand);		setOperationAction(ISD::FFLOOR, MVT::v8f16, Expand);
setOperationAction(ISD::FMA, MVT::v8f16, Expand);		setOperationAction(ISD::FMA, MVT::v8f16, Expand);
setOperationAction(ISD::FMUL, MVT::v8f16, Expand);		setOperationAction(ISD::FMUL, MVT::v8f16, Expand);
setOperationAction(ISD::FNEARBYINT, MVT::v8f16, Expand);		setOperationAction(ISD::FNEARBYINT, MVT::v8f16, Expand);
setOperationAction(ISD::FNEG, MVT::v8f16, Expand);		setOperationAction(ISD::FNEG, MVT::v8f16, Expand);
setOperationAction(ISD::FPOW, MVT::v8f16, Expand);		setOperationAction(ISD::FPOW, MVT::v8f16, Expand);
setOperationAction(ISD::FPOWI, MVT::v8f16, Expand);		setOperationAction(ISD::FPOWI, MVT::v8f16, Expand);
		setOperationAction(ISD::FLDEXP, MVT::v8f16, Expand);
setOperationAction(ISD::FREM, MVT::v8f16, Expand);		setOperationAction(ISD::FREM, MVT::v8f16, Expand);
setOperationAction(ISD::FROUND, MVT::v8f16, Expand);		setOperationAction(ISD::FROUND, MVT::v8f16, Expand);
setOperationAction(ISD::FRINT, MVT::v8f16, Expand);		setOperationAction(ISD::FRINT, MVT::v8f16, Expand);
setOperationAction(ISD::FSIN, MVT::v8f16, Expand);		setOperationAction(ISD::FSIN, MVT::v8f16, Expand);
setOperationAction(ISD::FSINCOS, MVT::v8f16, Expand);		setOperationAction(ISD::FSINCOS, MVT::v8f16, Expand);
setOperationAction(ISD::FSQRT, MVT::v8f16, Expand);		setOperationAction(ISD::FSQRT, MVT::v8f16, Expand);
setOperationAction(ISD::FSUB, MVT::v8f16, Expand);		setOperationAction(ISD::FSUB, MVT::v8f16, Expand);
setOperationAction(ISD::FTRUNC, MVT::v8f16, Expand);		setOperationAction(ISD::FTRUNC, MVT::v8f16, Expand);
[...]	void AArch64TargetLowering::addTypeForNEON(EVT VT, EVT PromotedBitwiseVT) {
}		}

// Mark vector float intrinsics as expand.		// Mark vector float intrinsics as expand.
if (VT == MVT::v2f32 || VT == MVT::v4f32 || VT == MVT::v2f64) {		if (VT == MVT::v2f32 || VT == MVT::v4f32 || VT == MVT::v2f64) {
setOperationAction(ISD::FSIN, VT.getSimpleVT(), Expand);		setOperationAction(ISD::FSIN, VT.getSimpleVT(), Expand);
setOperationAction(ISD::FCOS, VT.getSimpleVT(), Expand);		setOperationAction(ISD::FCOS, VT.getSimpleVT(), Expand);
setOperationAction(ISD::FPOWI, VT.getSimpleVT(), Expand);		setOperationAction(ISD::FPOWI, VT.getSimpleVT(), Expand);
setOperationAction(ISD::FPOW, VT.getSimpleVT(), Expand);		setOperationAction(ISD::FPOW, VT.getSimpleVT(), Expand);
		setOperationAction(ISD::FLDEXP, VT.getSimpleVT(), Expand);
setOperationAction(ISD::FLOG, VT.getSimpleVT(), Expand);		setOperationAction(ISD::FLOG, VT.getSimpleVT(), Expand);
setOperationAction(ISD::FLOG2, VT.getSimpleVT(), Expand);		setOperationAction(ISD::FLOG2, VT.getSimpleVT(), Expand);
setOperationAction(ISD::FLOG10, VT.getSimpleVT(), Expand);		setOperationAction(ISD::FLOG10, VT.getSimpleVT(), Expand);
setOperationAction(ISD::FEXP, VT.getSimpleVT(), Expand);		setOperationAction(ISD::FEXP, VT.getSimpleVT(), Expand);
setOperationAction(ISD::FEXP2, VT.getSimpleVT(), Expand);		setOperationAction(ISD::FEXP2, VT.getSimpleVT(), Expand);

// But we do support custom-lowering for FCOPYSIGN.		// But we do support custom-lowering for FCOPYSIGN.
setOperationAction(ISD::FCOPYSIGN, VT.getSimpleVT(), Custom);		setOperationAction(ISD::FCOPYSIGN, VT.getSimpleVT(), Custom);

lib/Target/AMDGPU/AMDGPUISelLowering.h

[...]	enum NodeType : unsigned {
TRIG_PREOP, // 1 ULP max error for f64		TRIG_PREOP, // 1 ULP max error for f64

// RCP, RSQ - For f32, 1 ULP max error, no denormal handling.		// RCP, RSQ - For f32, 1 ULP max error, no denormal handling.
// For f64, max error 2^29 ULP, handles denormals.		// For f64, max error 2^29 ULP, handles denormals.
RCP,		RCP,
RSQ,		RSQ,
RSQ_LEGACY,		RSQ_LEGACY,
RSQ_CLAMPED,		RSQ_CLAMPED,
LDEXP,
FP_CLASS,		FP_CLASS,
DOT4,		DOT4,
CARRY,		CARRY,
BORROW,		BORROW,
BFE_U32, // Extract range of bits with zero extension to 32-bits.		BFE_U32, // Extract range of bits with zero extension to 32-bits.
BFE_I32, // Extract range of bits with sign extension to 32-bits.		BFE_I32, // Extract range of bits with sign extension to 32-bits.
BFI, // (src0 & src1) | (~src0 & src2)		BFI, // (src0 & src1) | (~src0 & src2)
BFM, // Insert a range of bits into a 32-bit word.		BFM, // Insert a range of bits into a 32-bit word.

lib/Target/AMDGPU/AMDGPUISelLowering.cpp

[...]	case Intrinsic::AMDGPU_rsq_clamped:
DAG.getConstantFP(Max, DL, VT));		DAG.getConstantFP(Max, DL, VT));
return DAG.getNode(ISD::FMAXNUM, DL, VT, Tmp,		return DAG.getNode(ISD::FMAXNUM, DL, VT, Tmp,
DAG.getConstantFP(Min, DL, VT));		DAG.getConstantFP(Min, DL, VT));
} else {		} else {
return DAG.getNode(AMDGPUISD::RSQ_CLAMPED, DL, VT, Op.getOperand(1));		return DAG.getNode(AMDGPUISD::RSQ_CLAMPED, DL, VT, Op.getOperand(1));
}		}

case Intrinsic::AMDGPU_ldexp:		case Intrinsic::AMDGPU_ldexp:
return DAG.getNode(AMDGPUISD::LDEXP, DL, VT, Op.getOperand(1),		return DAG.getNode(ISD::FLDEXP, DL, VT, Op.getOperand(1),
Op.getOperand(2));		Op.getOperand(2));

case AMDGPUIntrinsic::AMDGPU_imax:		case AMDGPUIntrinsic::AMDGPU_imax:
return DAG.getNode(ISD::SMAX, DL, VT, Op.getOperand(1),		return DAG.getNode(ISD::SMAX, DL, VT, Op.getOperand(1),
Op.getOperand(2));		Op.getOperand(2));
case AMDGPUIntrinsic::AMDGPU_umax:		case AMDGPUIntrinsic::AMDGPU_umax:
return DAG.getNode(ISD::UMAX, DL, VT, Op.getOperand(1),		return DAG.getNode(ISD::UMAX, DL, VT, Op.getOperand(1),
Op.getOperand(2));		Op.getOperand(2));
case AMDGPUIntrinsic::AMDGPU_imin:		case AMDGPUIntrinsic::AMDGPU_imin:
[...]	SDValue AMDGPUTargetLowering::LowerINT_TO_FP64(SDValue Op, SelectionDAG &DAG,
SDValue Hi = DAG.getNode(ISD::EXTRACT_VECTOR_ELT, SL, MVT::i32, BC,		SDValue Hi = DAG.getNode(ISD::EXTRACT_VECTOR_ELT, SL, MVT::i32, BC,
DAG.getConstant(1, SL, MVT::i32));		DAG.getConstant(1, SL, MVT::i32));

SDValue CvtHi = DAG.getNode(Signed ? ISD::SINT_TO_FP : ISD::UINT_TO_FP,		SDValue CvtHi = DAG.getNode(Signed ? ISD::SINT_TO_FP : ISD::UINT_TO_FP,
SL, MVT::f64, Hi);		SL, MVT::f64, Hi);

SDValue CvtLo = DAG.getNode(ISD::UINT_TO_FP, SL, MVT::f64, Lo);		SDValue CvtLo = DAG.getNode(ISD::UINT_TO_FP, SL, MVT::f64, Lo);

SDValue LdExp = DAG.getNode(AMDGPUISD::LDEXP, SL, MVT::f64, CvtHi,		SDValue LdExp = DAG.getNode(ISD::FLDEXP, SL, MVT::f64, CvtHi,
DAG.getConstant(32, SL, MVT::i32));		DAG.getConstant(32, SL, MVT::i32));
// TODO: Should this propagate fast-math-flags?		// TODO: Should this propagate fast-math-flags?
return DAG.getNode(ISD::FADD, SL, MVT::f64, LdExp, CvtLo);		return DAG.getNode(ISD::FADD, SL, MVT::f64, LdExp, CvtLo);
}		}
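
The arithmetic behind `LowerINT_TO_FP64` can be sketched on the host with `std::ldexp`. This is only an illustration under stated assumptions, not the lowering itself: `uint64ToDouble` and `int64ToDouble` are hypothetical helper names, and the sketch assumes the default round-to-nearest mode and two's-complement narrowing. The high 32 bits are converted (signed or unsigned), scaled by 2^32 with `ldexp`, and added to the unsigned conversion of the low 32 bits. Both partial values are exactly representable in a double, so only the final add rounds.

```cpp
#include <cmath>
#include <cstdint>

// Unsigned case: Hi and Lo are both converted as unsigned 32-bit values.
double uint64ToDouble(std::uint64_t X) {
  std::uint32_t Lo = static_cast<std::uint32_t>(X);
  std::uint32_t Hi = static_cast<std::uint32_t>(X >> 32);
  double CvtHi = static_cast<double>(Hi);   // UINT_TO_FP on the high half
  double CvtLo = static_cast<double>(Lo);   // UINT_TO_FP on the low half
  double LdExp = std::ldexp(CvtHi, 32);     // CvtHi * 2^32, exact
  return LdExp + CvtLo;                     // the single rounding step
}

// Signed case: only the high half carries the sign; the low half stays
// unsigned, matching the Signed ? SINT_TO_FP : UINT_TO_FP choice above.
double int64ToDouble(std::int64_t X) {
  std::uint64_t Bits = static_cast<std::uint64_t>(X);
  std::uint32_t Lo = static_cast<std::uint32_t>(Bits);
  // Assumes the usual modular (two's-complement) narrowing conversion.
  std::int32_t Hi = static_cast<std::int32_t>(
      static_cast<std::uint32_t>(Bits >> 32));
  double CvtHi = static_cast<double>(Hi);   // SINT_TO_FP on the high half
  double CvtLo = static_cast<double>(Lo);   // UINT_TO_FP on the low half
  return std::ldexp(CvtHi, 32) + CvtLo;
}
```

For example, `int64ToDouble(-1)` splits into `Hi = -1` and `Lo = 4294967295`, giving `-4294967296.0 + 4294967295.0`.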

SDValue AMDGPUTargetLowering::LowerUINT_TO_FP(SDValue Op,		SDValue AMDGPUTargetLowering::LowerUINT_TO_FP(SDValue Op,
SelectionDAG &DAG) const {		SelectionDAG &DAG) const {
SDValue S0 = Op.getOperand(0);		SDValue S0 = Op.getOperand(0);
[...]	const char* AMDGPUTargetLowering::getTargetNodeName(unsigned Opcode) const {
NODE_NAME_CASE(DIV_SCALE)		NODE_NAME_CASE(DIV_SCALE)
NODE_NAME_CASE(DIV_FMAS)		NODE_NAME_CASE(DIV_FMAS)
NODE_NAME_CASE(DIV_FIXUP)		NODE_NAME_CASE(DIV_FIXUP)
NODE_NAME_CASE(TRIG_PREOP)		NODE_NAME_CASE(TRIG_PREOP)
NODE_NAME_CASE(RCP)		NODE_NAME_CASE(RCP)
NODE_NAME_CASE(RSQ)		NODE_NAME_CASE(RSQ)
NODE_NAME_CASE(RSQ_LEGACY)		NODE_NAME_CASE(RSQ_LEGACY)
NODE_NAME_CASE(RSQ_CLAMPED)		NODE_NAME_CASE(RSQ_CLAMPED)
NODE_NAME_CASE(LDEXP)
NODE_NAME_CASE(FP_CLASS)		NODE_NAME_CASE(FP_CLASS)
NODE_NAME_CASE(DOT4)		NODE_NAME_CASE(DOT4)
NODE_NAME_CASE(CARRY)		NODE_NAME_CASE(CARRY)
NODE_NAME_CASE(BORROW)		NODE_NAME_CASE(BORROW)
NODE_NAME_CASE(BFE_U32)		NODE_NAME_CASE(BFE_U32)
NODE_NAME_CASE(BFE_I32)		NODE_NAME_CASE(BFE_I32)
NODE_NAME_CASE(BFI)		NODE_NAME_CASE(BFI)
NODE_NAME_CASE(BFM)		NODE_NAME_CASE(BFM)

lib/Target/AMDGPU/AMDGPUInstrInfo.td

	[...]
	def AMDGPUrsq : SDNode<"AMDGPUISD::RSQ", SDTFPUnaryOp>;			def AMDGPUrsq : SDNode<"AMDGPUISD::RSQ", SDTFPUnaryOp>;

	// out = 1.0 / sqrt(a)			// out = 1.0 / sqrt(a)
	def AMDGPUrsq_legacy : SDNode<"AMDGPUISD::RSQ_LEGACY", SDTFPUnaryOp>;			def AMDGPUrsq_legacy : SDNode<"AMDGPUISD::RSQ_LEGACY", SDTFPUnaryOp>;

	// out = 1.0 / sqrt(a) result clamped to +/- max_float.			// out = 1.0 / sqrt(a) result clamped to +/- max_float.
	def AMDGPUrsq_clamped : SDNode<"AMDGPUISD::RSQ_CLAMPED", SDTFPUnaryOp>;			def AMDGPUrsq_clamped : SDNode<"AMDGPUISD::RSQ_CLAMPED", SDTFPUnaryOp>;

	def AMDGPUldexp : SDNode<"AMDGPUISD::LDEXP", AMDGPULdExpOp>;

	def AMDGPUfp_class : SDNode<"AMDGPUISD::FP_CLASS", AMDGPUFPClassOp>;			def AMDGPUfp_class : SDNode<"AMDGPUISD::FP_CLASS", AMDGPUFPClassOp>;

	// out = max(a, b) a and b are floats, where a nan comparison fails.			// out = max(a, b) a and b are floats, where a nan comparison fails.
	// This is not commutative because this gives the second operand:			// This is not commutative because this gives the second operand:
	// x < nan ? x : nan -> nan			// x < nan ? x : nan -> nan
	// nan < x ? nan : x -> x			// nan < x ? nan : x -> x
	def AMDGPUfmax_legacy : SDNode<"AMDGPUISD::FMAX_LEGACY", SDTFPBinOp,			def AMDGPUfmax_legacy : SDNode<"AMDGPUISD::FMAX_LEGACY", SDTFPBinOp,
	[]			[]

lib/Target/AMDGPU/SIInstructions.td

	[...]
	>;			>;
	defm V_MBCNT_LO_U32_B32 : VOP2_VI3_Inst <vop23<0x23, 0x28c>, "v_mbcnt_lo_u32_b32",			defm V_MBCNT_LO_U32_B32 : VOP2_VI3_Inst <vop23<0x23, 0x28c>, "v_mbcnt_lo_u32_b32",
	VOP_I32_I32_I32			VOP_I32_I32_I32
	>;			>;
	defm V_MBCNT_HI_U32_B32 : VOP2_VI3_Inst <vop23<0x24, 0x28d>, "v_mbcnt_hi_u32_b32",			defm V_MBCNT_HI_U32_B32 : VOP2_VI3_Inst <vop23<0x24, 0x28d>, "v_mbcnt_hi_u32_b32",
	VOP_I32_I32_I32			VOP_I32_I32_I32
	>;			>;
	defm V_LDEXP_F32 : VOP2_VI3_Inst <vop23<0x2b, 0x288>, "v_ldexp_f32",			defm V_LDEXP_F32 : VOP2_VI3_Inst <vop23<0x2b, 0x288>, "v_ldexp_f32",
	VOP_F32_F32_I32, AMDGPUldexp			VOP_F32_F32_I32, fldexp
	>;			>;

	defm V_CVT_PKACCUM_U8_F32 : VOP2_VI3_Inst <vop23<0x2c, 0x1f0>, "v_cvt_pkaccum_u8_f32",			defm V_CVT_PKACCUM_U8_F32 : VOP2_VI3_Inst <vop23<0x2c, 0x1f0>, "v_cvt_pkaccum_u8_f32",
	VOP_I32_F32_I32>; // TODO: set "Uses = dst"			VOP_I32_F32_I32>; // TODO: set "Uses = dst"

	defm V_CVT_PKNORM_I16_F32 : VOP2_VI3_Inst <vop23<0x2d, 0x294>, "v_cvt_pknorm_i16_f32",			defm V_CVT_PKNORM_I16_F32 : VOP2_VI3_Inst <vop23<0x2d, 0x294>, "v_cvt_pknorm_i16_f32",
	VOP_I32_F32_F32			VOP_I32_F32_F32
	>;			>;
	[...]
	>;			>;
	defm V_MAX_F64 : VOP3Inst <vop3<0x167, 0x283>, "v_max_f64",			defm V_MAX_F64 : VOP3Inst <vop3<0x167, 0x283>, "v_max_f64",
	VOP_F64_F64_F64, fmaxnum			VOP_F64_F64_F64, fmaxnum
	>;			>;

	} // isCommutable = 1			} // isCommutable = 1

	defm V_LDEXP_F64 : VOP3Inst <vop3<0x168, 0x284>, "v_ldexp_f64",			defm V_LDEXP_F64 : VOP3Inst <vop3<0x168, 0x284>, "v_ldexp_f64",
	VOP_F64_F64_I32, AMDGPUldexp			VOP_F64_F64_I32, fldexp
	>;			>;

	} // let SchedRW = [WriteDoubleAdd]			} // let SchedRW = [WriteDoubleAdd]

	let isCommutable = 1, SchedRW = [WriteQuarterRate32] in {			let isCommutable = 1, SchedRW = [WriteQuarterRate32] in {

	defm V_MUL_LO_U32 : VOP3Inst <vop3<0x169, 0x285>, "v_mul_lo_u32",			defm V_MUL_LO_U32 : VOP3Inst <vop3<0x169, 0x285>, "v_mul_lo_u32",
	VOP_I32_I32_I32			VOP_I32_I32_I32

lib/Target/ARM/ARMISelLowering.cpp

[...]	if (Subtarget->hasNEON()) {
// ARMTargetLowering::addTypeForNEON method for details.		// ARMTargetLowering::addTypeForNEON method for details.
setOperationAction(ISD::SETCC, MVT::v2f64, Expand);		setOperationAction(ISD::SETCC, MVT::v2f64, Expand);
// FIXME: Create unittest for FNEG and for FABS.		// FIXME: Create unittest for FNEG and for FABS.
setOperationAction(ISD::FNEG, MVT::v2f64, Expand);		setOperationAction(ISD::FNEG, MVT::v2f64, Expand);
setOperationAction(ISD::FABS, MVT::v2f64, Expand);		setOperationAction(ISD::FABS, MVT::v2f64, Expand);
setOperationAction(ISD::FSQRT, MVT::v2f64, Expand);		setOperationAction(ISD::FSQRT, MVT::v2f64, Expand);
setOperationAction(ISD::FSIN, MVT::v2f64, Expand);		setOperationAction(ISD::FSIN, MVT::v2f64, Expand);
setOperationAction(ISD::FCOS, MVT::v2f64, Expand);		setOperationAction(ISD::FCOS, MVT::v2f64, Expand);
		setOperationAction(ISD::FLDEXP, MVT::v2f64, Expand);
setOperationAction(ISD::FPOWI, MVT::v2f64, Expand);		setOperationAction(ISD::FPOWI, MVT::v2f64, Expand);
setOperationAction(ISD::FPOW, MVT::v2f64, Expand);		setOperationAction(ISD::FPOW, MVT::v2f64, Expand);
setOperationAction(ISD::FLOG, MVT::v2f64, Expand);		setOperationAction(ISD::FLOG, MVT::v2f64, Expand);
setOperationAction(ISD::FLOG2, MVT::v2f64, Expand);		setOperationAction(ISD::FLOG2, MVT::v2f64, Expand);
setOperationAction(ISD::FLOG10, MVT::v2f64, Expand);		setOperationAction(ISD::FLOG10, MVT::v2f64, Expand);
setOperationAction(ISD::FEXP, MVT::v2f64, Expand);		setOperationAction(ISD::FEXP, MVT::v2f64, Expand);
setOperationAction(ISD::FEXP2, MVT::v2f64, Expand);		setOperationAction(ISD::FEXP2, MVT::v2f64, Expand);
// FIXME: Create unittest for FCEIL, FTRUNC, FRINT, FNEARBYINT, FFLOOR.		// FIXME: Create unittest for FCEIL, FTRUNC, FRINT, FNEARBYINT, FFLOOR.
setOperationAction(ISD::FCEIL, MVT::v2f64, Expand);		setOperationAction(ISD::FCEIL, MVT::v2f64, Expand);
setOperationAction(ISD::FTRUNC, MVT::v2f64, Expand);		setOperationAction(ISD::FTRUNC, MVT::v2f64, Expand);
setOperationAction(ISD::FRINT, MVT::v2f64, Expand);		setOperationAction(ISD::FRINT, MVT::v2f64, Expand);
setOperationAction(ISD::FNEARBYINT, MVT::v2f64, Expand);		setOperationAction(ISD::FNEARBYINT, MVT::v2f64, Expand);
setOperationAction(ISD::FFLOOR, MVT::v2f64, Expand);		setOperationAction(ISD::FFLOOR, MVT::v2f64, Expand);
setOperationAction(ISD::FMA, MVT::v2f64, Expand);		setOperationAction(ISD::FMA, MVT::v2f64, Expand);

setOperationAction(ISD::FSQRT, MVT::v4f32, Expand);		setOperationAction(ISD::FSQRT, MVT::v4f32, Expand);
setOperationAction(ISD::FSIN, MVT::v4f32, Expand);		setOperationAction(ISD::FSIN, MVT::v4f32, Expand);
setOperationAction(ISD::FCOS, MVT::v4f32, Expand);		setOperationAction(ISD::FCOS, MVT::v4f32, Expand);
		setOperationAction(ISD::FLDEXP, MVT::v4f32, Expand);
setOperationAction(ISD::FPOWI, MVT::v4f32, Expand);		setOperationAction(ISD::FPOWI, MVT::v4f32, Expand);
setOperationAction(ISD::FPOW, MVT::v4f32, Expand);		setOperationAction(ISD::FPOW, MVT::v4f32, Expand);
setOperationAction(ISD::FLOG, MVT::v4f32, Expand);		setOperationAction(ISD::FLOG, MVT::v4f32, Expand);
setOperationAction(ISD::FLOG2, MVT::v4f32, Expand);		setOperationAction(ISD::FLOG2, MVT::v4f32, Expand);
setOperationAction(ISD::FLOG10, MVT::v4f32, Expand);		setOperationAction(ISD::FLOG10, MVT::v4f32, Expand);
setOperationAction(ISD::FEXP, MVT::v4f32, Expand);		setOperationAction(ISD::FEXP, MVT::v4f32, Expand);
setOperationAction(ISD::FEXP2, MVT::v4f32, Expand);		setOperationAction(ISD::FEXP2, MVT::v4f32, Expand);
setOperationAction(ISD::FCEIL, MVT::v4f32, Expand);		setOperationAction(ISD::FCEIL, MVT::v4f32, Expand);
setOperationAction(ISD::FTRUNC, MVT::v4f32, Expand);		setOperationAction(ISD::FTRUNC, MVT::v4f32, Expand);
setOperationAction(ISD::FRINT, MVT::v4f32, Expand);		setOperationAction(ISD::FRINT, MVT::v4f32, Expand);
setOperationAction(ISD::FNEARBYINT, MVT::v4f32, Expand);		setOperationAction(ISD::FNEARBYINT, MVT::v4f32, Expand);
setOperationAction(ISD::FFLOOR, MVT::v4f32, Expand);		setOperationAction(ISD::FFLOOR, MVT::v4f32, Expand);

// Mark v2f32 intrinsics.		// Mark v2f32 intrinsics.
setOperationAction(ISD::FSQRT, MVT::v2f32, Expand);		setOperationAction(ISD::FSQRT, MVT::v2f32, Expand);
setOperationAction(ISD::FSIN, MVT::v2f32, Expand);		setOperationAction(ISD::FSIN, MVT::v2f32, Expand);
setOperationAction(ISD::FCOS, MVT::v2f32, Expand);		setOperationAction(ISD::FCOS, MVT::v2f32, Expand);
		setOperationAction(ISD::FLDEXP, MVT::v2f32, Expand);
setOperationAction(ISD::FPOWI, MVT::v2f32, Expand);		setOperationAction(ISD::FPOWI, MVT::v2f32, Expand);
setOperationAction(ISD::FPOW, MVT::v2f32, Expand);		setOperationAction(ISD::FPOW, MVT::v2f32, Expand);
setOperationAction(ISD::FLOG, MVT::v2f32, Expand);		setOperationAction(ISD::FLOG, MVT::v2f32, Expand);
setOperationAction(ISD::FLOG2, MVT::v2f32, Expand);		setOperationAction(ISD::FLOG2, MVT::v2f32, Expand);
setOperationAction(ISD::FLOG10, MVT::v2f32, Expand);		setOperationAction(ISD::FLOG10, MVT::v2f32, Expand);
setOperationAction(ISD::FEXP, MVT::v2f32, Expand);		setOperationAction(ISD::FEXP, MVT::v2f32, Expand);
setOperationAction(ISD::FEXP2, MVT::v2f32, Expand);		setOperationAction(ISD::FEXP2, MVT::v2f32, Expand);
setOperationAction(ISD::FCEIL, MVT::v2f32, Expand);		setOperationAction(ISD::FCEIL, MVT::v2f32, Expand);
[...]	if (Subtarget->isFPOnlySP()) {
setOperationAction(ISD::FREM, MVT::f64, Expand);		setOperationAction(ISD::FREM, MVT::f64, Expand);
setOperationAction(ISD::FCOPYSIGN, MVT::f64, Expand);		setOperationAction(ISD::FCOPYSIGN, MVT::f64, Expand);
setOperationAction(ISD::FGETSIGN, MVT::f64, Expand);		setOperationAction(ISD::FGETSIGN, MVT::f64, Expand);
setOperationAction(ISD::FNEG, MVT::f64, Expand);		setOperationAction(ISD::FNEG, MVT::f64, Expand);
setOperationAction(ISD::FABS, MVT::f64, Expand);		setOperationAction(ISD::FABS, MVT::f64, Expand);
setOperationAction(ISD::FSQRT, MVT::f64, Expand);		setOperationAction(ISD::FSQRT, MVT::f64, Expand);
setOperationAction(ISD::FSIN, MVT::f64, Expand);		setOperationAction(ISD::FSIN, MVT::f64, Expand);
setOperationAction(ISD::FCOS, MVT::f64, Expand);		setOperationAction(ISD::FCOS, MVT::f64, Expand);
		setOperationAction(ISD::FLDEXP, MVT::f64, Expand);
setOperationAction(ISD::FPOWI, MVT::f64, Expand);		setOperationAction(ISD::FPOWI, MVT::f64, Expand);
setOperationAction(ISD::FPOW, MVT::f64, Expand);		setOperationAction(ISD::FPOW, MVT::f64, Expand);
setOperationAction(ISD::FLOG, MVT::f64, Expand);		setOperationAction(ISD::FLOG, MVT::f64, Expand);
setOperationAction(ISD::FLOG2, MVT::f64, Expand);		setOperationAction(ISD::FLOG2, MVT::f64, Expand);
setOperationAction(ISD::FLOG10, MVT::f64, Expand);		setOperationAction(ISD::FLOG10, MVT::f64, Expand);
setOperationAction(ISD::FEXP, MVT::f64, Expand);		setOperationAction(ISD::FEXP, MVT::f64, Expand);
setOperationAction(ISD::FEXP2, MVT::f64, Expand);		setOperationAction(ISD::FEXP2, MVT::f64, Expand);
setOperationAction(ISD::FCEIL, MVT::f64, Expand);		setOperationAction(ISD::FCEIL, MVT::f64, Expand);
[...]	ARMTargetLowering::ARMTargetLowering(const TargetMachine &TM,
setOperationAction(ISD::FREM, MVT::f32, Expand);		setOperationAction(ISD::FREM, MVT::f32, Expand);
if (!Subtarget->useSoftFloat() && Subtarget->hasVFP2() &&		if (!Subtarget->useSoftFloat() && Subtarget->hasVFP2() &&
!Subtarget->isThumb1Only()) {		!Subtarget->isThumb1Only()) {
setOperationAction(ISD::FCOPYSIGN, MVT::f64, Custom);		setOperationAction(ISD::FCOPYSIGN, MVT::f64, Custom);
setOperationAction(ISD::FCOPYSIGN, MVT::f32, Custom);		setOperationAction(ISD::FCOPYSIGN, MVT::f32, Custom);
}		}
setOperationAction(ISD::FPOW, MVT::f64, Expand);		setOperationAction(ISD::FPOW, MVT::f64, Expand);
setOperationAction(ISD::FPOW, MVT::f32, Expand);		setOperationAction(ISD::FPOW, MVT::f32, Expand);
		setOperationAction(ISD::FLDEXP, MVT::f32, Expand);
		setOperationAction(ISD::FLDEXP, MVT::f64, Expand);

if (!Subtarget->hasVFP4()) {		if (!Subtarget->hasVFP4()) {
setOperationAction(ISD::FMA, MVT::f64, Expand);		setOperationAction(ISD::FMA, MVT::f64, Expand);
setOperationAction(ISD::FMA, MVT::f32, Expand);		setOperationAction(ISD::FMA, MVT::f32, Expand);
}		}

// Various VFP goodness		// Various VFP goodness
if (!Subtarget->useSoftFloat() && !Subtarget->isThumb1Only()) {		if (!Subtarget->useSoftFloat() && !Subtarget->isThumb1Only()) {

lib/Target/Hexagon/HexagonISelLowering.cpp

[...]	for (unsigned IntExpOp :
ISD::ROTL, ISD::ROTR, ISD::BSWAP, ISD::SHL_PARTS, ISD::SRA_PARTS,		ISD::ROTL, ISD::ROTR, ISD::BSWAP, ISD::SHL_PARTS, ISD::SRA_PARTS,
ISD::SRL_PARTS, ISD::SMUL_LOHI, ISD::UMUL_LOHI}) {		ISD::SRL_PARTS, ISD::SMUL_LOHI, ISD::UMUL_LOHI}) {
setOperationAction(IntExpOp, MVT::i32, Expand);		setOperationAction(IntExpOp, MVT::i32, Expand);
setOperationAction(IntExpOp, MVT::i64, Expand);		setOperationAction(IntExpOp, MVT::i64, Expand);
}		}

for (unsigned FPExpOp :		for (unsigned FPExpOp :
{ISD::FDIV, ISD::FREM, ISD::FSQRT, ISD::FSIN, ISD::FCOS, ISD::FSINCOS,		{ISD::FDIV, ISD::FREM, ISD::FSQRT, ISD::FSIN, ISD::FCOS, ISD::FSINCOS,
ISD::FPOW, ISD::FCOPYSIGN}) {		ISD::FPOW, ISD::FLDEXP, ISD::FCOPYSIGN}) {
setOperationAction(FPExpOp, MVT::f32, Expand);		setOperationAction(FPExpOp, MVT::f32, Expand);
setOperationAction(FPExpOp, MVT::f64, Expand);		setOperationAction(FPExpOp, MVT::f64, Expand);
}		}

// No extending loads from i32.		// No extending loads from i32.
for (MVT VT : MVT::integer_valuetypes()) {		for (MVT VT : MVT::integer_valuetypes()) {
setLoadExtAction(ISD::ZEXTLOAD, VT, MVT::i32, Expand);		setLoadExtAction(ISD::ZEXTLOAD, VT, MVT::i32, Expand);
setLoadExtAction(ISD::SEXTLOAD, VT, MVT::i32, Expand);		setLoadExtAction(ISD::SEXTLOAD, VT, MVT::i32, Expand);
[...]	static const unsigned VectExpOps[] = {
ISD::CTPOP, ISD::CTLZ, ISD::CTTZ, ISD::CTLZ_ZERO_UNDEF,		ISD::CTPOP, ISD::CTLZ, ISD::CTTZ, ISD::CTLZ_ZERO_UNDEF,
ISD::CTTZ_ZERO_UNDEF,		ISD::CTTZ_ZERO_UNDEF,
// Floating point arithmetic/math functions:		// Floating point arithmetic/math functions:
ISD::FADD, ISD::FSUB, ISD::FMUL, ISD::FMA, ISD::FDIV,		ISD::FADD, ISD::FSUB, ISD::FMUL, ISD::FMA, ISD::FDIV,
ISD::FREM, ISD::FNEG, ISD::FABS, ISD::FSQRT, ISD::FSIN,		ISD::FREM, ISD::FNEG, ISD::FABS, ISD::FSQRT, ISD::FSIN,
ISD::FCOS, ISD::FPOWI, ISD::FPOW, ISD::FLOG, ISD::FLOG2,		ISD::FCOS, ISD::FPOWI, ISD::FPOW, ISD::FLOG, ISD::FLOG2,
ISD::FLOG10, ISD::FEXP, ISD::FEXP2, ISD::FCEIL, ISD::FTRUNC,		ISD::FLOG10, ISD::FEXP, ISD::FEXP2, ISD::FCEIL, ISD::FTRUNC,
ISD::FRINT, ISD::FNEARBYINT, ISD::FROUND, ISD::FFLOOR,		ISD::FRINT, ISD::FNEARBYINT, ISD::FROUND, ISD::FFLOOR,
ISD::FMINNUM, ISD::FMAXNUM, ISD::FSINCOS,		ISD::FMINNUM, ISD::FMAXNUM, ISD::FSINCOS, ISD::FLDEXP,
// Misc:		// Misc:
ISD::SELECT, ISD::ConstantPool,		ISD::SELECT, ISD::ConstantPool,
// Vector:		// Vector:
ISD::BUILD_VECTOR, ISD::SCALAR_TO_VECTOR,		ISD::BUILD_VECTOR, ISD::SCALAR_TO_VECTOR,
ISD::EXTRACT_VECTOR_ELT, ISD::INSERT_VECTOR_ELT,		ISD::EXTRACT_VECTOR_ELT, ISD::INSERT_VECTOR_ELT,
ISD::EXTRACT_SUBVECTOR, ISD::INSERT_SUBVECTOR,		ISD::EXTRACT_SUBVECTOR, ISD::INSERT_SUBVECTOR,
ISD::CONCAT_VECTORS, ISD::VECTOR_SHUFFLE		ISD::CONCAT_VECTORS, ISD::VECTOR_SHUFFLE
};		};

lib/Target/Mips/MipsISelLowering.cpp

if (!Subtarget.hasMips64r2())
setOperationAction(ISD::ROTR, MVT::i64, Expand);		setOperationAction(ISD::ROTR, MVT::i64, Expand);

setOperationAction(ISD::FSIN, MVT::f32, Expand);		setOperationAction(ISD::FSIN, MVT::f32, Expand);
setOperationAction(ISD::FSIN, MVT::f64, Expand);		setOperationAction(ISD::FSIN, MVT::f64, Expand);
setOperationAction(ISD::FCOS, MVT::f32, Expand);		setOperationAction(ISD::FCOS, MVT::f32, Expand);
setOperationAction(ISD::FCOS, MVT::f64, Expand);		setOperationAction(ISD::FCOS, MVT::f64, Expand);
setOperationAction(ISD::FSINCOS, MVT::f32, Expand);		setOperationAction(ISD::FSINCOS, MVT::f32, Expand);
setOperationAction(ISD::FSINCOS, MVT::f64, Expand);		setOperationAction(ISD::FSINCOS, MVT::f64, Expand);
		setOperationAction(ISD::FLDEXP, MVT::f32, Expand);
		setOperationAction(ISD::FLDEXP, MVT::f64, Expand);
setOperationAction(ISD::FPOWI, MVT::f32, Expand);		setOperationAction(ISD::FPOWI, MVT::f32, Expand);
setOperationAction(ISD::FPOW, MVT::f32, Expand);		setOperationAction(ISD::FPOW, MVT::f32, Expand);
setOperationAction(ISD::FPOW, MVT::f64, Expand);		setOperationAction(ISD::FPOW, MVT::f64, Expand);
setOperationAction(ISD::FLOG, MVT::f32, Expand);		setOperationAction(ISD::FLOG, MVT::f32, Expand);
setOperationAction(ISD::FLOG2, MVT::f32, Expand);		setOperationAction(ISD::FLOG2, MVT::f32, Expand);
setOperationAction(ISD::FLOG10, MVT::f32, Expand);		setOperationAction(ISD::FLOG10, MVT::f32, Expand);
setOperationAction(ISD::FEXP, MVT::f32, Expand);		setOperationAction(ISD::FEXP, MVT::f32, Expand);
setOperationAction(ISD::FMA, MVT::f32, Expand);		setOperationAction(ISD::FMA, MVT::f32, Expand);

lib/Target/PowerPC/PPCISelLowering.cpp

PPCTargetLowering::PPCTargetLowering(const PPCTargetMachine &TM,
setOperationAction(ISD::SDIVREM, MVT::i64, Expand);		setOperationAction(ISD::SDIVREM, MVT::i64, Expand);

// We don't support sin/cos/sqrt/fmod/pow		// We don't support sin/cos/sqrt/fmod/pow
setOperationAction(ISD::FSIN , MVT::f64, Expand);		setOperationAction(ISD::FSIN , MVT::f64, Expand);
setOperationAction(ISD::FCOS , MVT::f64, Expand);		setOperationAction(ISD::FCOS , MVT::f64, Expand);
setOperationAction(ISD::FSINCOS, MVT::f64, Expand);		setOperationAction(ISD::FSINCOS, MVT::f64, Expand);
setOperationAction(ISD::FREM , MVT::f64, Expand);		setOperationAction(ISD::FREM , MVT::f64, Expand);
setOperationAction(ISD::FPOW , MVT::f64, Expand);		setOperationAction(ISD::FPOW , MVT::f64, Expand);
		setOperationAction(ISD::FLDEXP, MVT::f64, Expand);
setOperationAction(ISD::FMA , MVT::f64, Legal);		setOperationAction(ISD::FMA , MVT::f64, Legal);
setOperationAction(ISD::FSIN , MVT::f32, Expand);		setOperationAction(ISD::FSIN , MVT::f32, Expand);
setOperationAction(ISD::FCOS , MVT::f32, Expand);		setOperationAction(ISD::FCOS , MVT::f32, Expand);
setOperationAction(ISD::FSINCOS, MVT::f32, Expand);		setOperationAction(ISD::FSINCOS, MVT::f32, Expand);
setOperationAction(ISD::FREM , MVT::f32, Expand);		setOperationAction(ISD::FREM , MVT::f32, Expand);
setOperationAction(ISD::FPOW , MVT::f32, Expand);		setOperationAction(ISD::FPOW , MVT::f32, Expand);
		setOperationAction(ISD::FLDEXP, MVT::f32, Expand);
setOperationAction(ISD::FMA , MVT::f32, Legal);		setOperationAction(ISD::FMA , MVT::f32, Legal);

setOperationAction(ISD::FLT_ROUNDS_, MVT::i32, Custom);		setOperationAction(ISD::FLT_ROUNDS_, MVT::i32, Custom);

// If we're enabling GP optimizations, use hardware square root		// If we're enabling GP optimizations, use hardware square root
if (!Subtarget.hasFSQRT() &&		if (!Subtarget.hasFSQRT() &&
!(TM.Options.UnsafeFPMath && Subtarget.hasFRSQRTE() &&		!(TM.Options.UnsafeFPMath && Subtarget.hasFRSQRTE() &&
Subtarget.hasFRE()))		Subtarget.hasFRE()))
for (MVT VT : MVT::vector_valuetypes()) {
setOperationAction(ISD::FLOG10, VT, Expand);		setOperationAction(ISD::FLOG10, VT, Expand);
setOperationAction(ISD::FLOG2, VT, Expand);		setOperationAction(ISD::FLOG2, VT, Expand);
setOperationAction(ISD::FEXP, VT, Expand);		setOperationAction(ISD::FEXP, VT, Expand);
setOperationAction(ISD::FEXP2, VT, Expand);		setOperationAction(ISD::FEXP2, VT, Expand);
setOperationAction(ISD::FSIN, VT, Expand);		setOperationAction(ISD::FSIN, VT, Expand);
setOperationAction(ISD::FCOS, VT, Expand);		setOperationAction(ISD::FCOS, VT, Expand);
setOperationAction(ISD::FABS, VT, Expand);		setOperationAction(ISD::FABS, VT, Expand);
setOperationAction(ISD::FPOWI, VT, Expand);		setOperationAction(ISD::FPOWI, VT, Expand);
		setOperationAction(ISD::FLDEXP, VT, Expand);
hfinkel: Don't do this. Set it to Expand by default (in TargetLoweringBase::initActions). That's our current best practice for new rarely-legal nodes.
setOperationAction(ISD::FFLOOR, VT, Expand);		setOperationAction(ISD::FFLOOR, VT, Expand);
setOperationAction(ISD::FCEIL, VT, Expand);		setOperationAction(ISD::FCEIL, VT, Expand);
setOperationAction(ISD::FTRUNC, VT, Expand);		setOperationAction(ISD::FTRUNC, VT, Expand);
setOperationAction(ISD::FRINT, VT, Expand);		setOperationAction(ISD::FRINT, VT, Expand);
setOperationAction(ISD::FNEARBYINT, VT, Expand);		setOperationAction(ISD::FNEARBYINT, VT, Expand);
setOperationAction(ISD::EXTRACT_VECTOR_ELT, VT, Expand);		setOperationAction(ISD::EXTRACT_VECTOR_ELT, VT, Expand);
setOperationAction(ISD::INSERT_VECTOR_ELT, VT, Expand);		setOperationAction(ISD::INSERT_VECTOR_ELT, VT, Expand);
setOperationAction(ISD::BUILD_VECTOR, VT, Expand);		setOperationAction(ISD::BUILD_VECTOR, VT, Expand);
if (Subtarget.hasQPX()) {
setOperationAction(ISD::FP_EXTEND, MVT::v4f64, Legal);		setOperationAction(ISD::FP_EXTEND, MVT::v4f64, Legal);

setOperationAction(ISD::FNEG , MVT::v4f64, Legal);		setOperationAction(ISD::FNEG , MVT::v4f64, Legal);
setOperationAction(ISD::FABS , MVT::v4f64, Legal);		setOperationAction(ISD::FABS , MVT::v4f64, Legal);
setOperationAction(ISD::FSIN , MVT::v4f64, Expand);		setOperationAction(ISD::FSIN , MVT::v4f64, Expand);
setOperationAction(ISD::FCOS , MVT::v4f64, Expand);		setOperationAction(ISD::FCOS , MVT::v4f64, Expand);
setOperationAction(ISD::FPOWI , MVT::v4f64, Expand);		setOperationAction(ISD::FPOWI , MVT::v4f64, Expand);
setOperationAction(ISD::FPOW , MVT::v4f64, Expand);		setOperationAction(ISD::FPOW , MVT::v4f64, Expand);
		setOperationAction(ISD::FLDEXP, MVT::v4f64, Expand);
setOperationAction(ISD::FLOG , MVT::v4f64, Expand);		setOperationAction(ISD::FLOG , MVT::v4f64, Expand);
setOperationAction(ISD::FLOG2 , MVT::v4f64, Expand);		setOperationAction(ISD::FLOG2 , MVT::v4f64, Expand);
setOperationAction(ISD::FLOG10 , MVT::v4f64, Expand);		setOperationAction(ISD::FLOG10 , MVT::v4f64, Expand);
setOperationAction(ISD::FEXP , MVT::v4f64, Expand);		setOperationAction(ISD::FEXP , MVT::v4f64, Expand);
setOperationAction(ISD::FEXP2 , MVT::v4f64, Expand);		setOperationAction(ISD::FEXP2 , MVT::v4f64, Expand);

setOperationAction(ISD::FMINNUM, MVT::v4f64, Legal);		setOperationAction(ISD::FMINNUM, MVT::v4f64, Legal);
setOperationAction(ISD::FMAXNUM, MVT::v4f64, Legal);		setOperationAction(ISD::FMAXNUM, MVT::v4f64, Legal);
if (Subtarget.hasQPX()) {
setOperationAction(ISD::FP_TO_UINT , MVT::v4f32, Expand);		setOperationAction(ISD::FP_TO_UINT , MVT::v4f32, Expand);

setOperationAction(ISD::FNEG , MVT::v4f32, Legal);		setOperationAction(ISD::FNEG , MVT::v4f32, Legal);
setOperationAction(ISD::FABS , MVT::v4f32, Legal);		setOperationAction(ISD::FABS , MVT::v4f32, Legal);
setOperationAction(ISD::FSIN , MVT::v4f32, Expand);		setOperationAction(ISD::FSIN , MVT::v4f32, Expand);
setOperationAction(ISD::FCOS , MVT::v4f32, Expand);		setOperationAction(ISD::FCOS , MVT::v4f32, Expand);
setOperationAction(ISD::FPOWI , MVT::v4f32, Expand);		setOperationAction(ISD::FPOWI , MVT::v4f32, Expand);
setOperationAction(ISD::FPOW , MVT::v4f32, Expand);		setOperationAction(ISD::FPOW , MVT::v4f32, Expand);
		setOperationAction(ISD::FLDEXP , MVT::v4f32, Expand);
setOperationAction(ISD::FLOG , MVT::v4f32, Expand);		setOperationAction(ISD::FLOG , MVT::v4f32, Expand);
setOperationAction(ISD::FLOG2 , MVT::v4f32, Expand);		setOperationAction(ISD::FLOG2 , MVT::v4f32, Expand);
setOperationAction(ISD::FLOG10 , MVT::v4f32, Expand);		setOperationAction(ISD::FLOG10 , MVT::v4f32, Expand);
setOperationAction(ISD::FEXP , MVT::v4f32, Expand);		setOperationAction(ISD::FEXP , MVT::v4f32, Expand);
setOperationAction(ISD::FEXP2 , MVT::v4f32, Expand);		setOperationAction(ISD::FEXP2 , MVT::v4f32, Expand);

setOperationAction(ISD::FMINNUM, MVT::v4f32, Legal);		setOperationAction(ISD::FMINNUM, MVT::v4f32, Legal);
setOperationAction(ISD::FMAXNUM, MVT::v4f32, Legal);		setOperationAction(ISD::FMAXNUM, MVT::v4f32, Legal);

lib/Target/WebAssembly/WebAssemblyISelLowering.cpp

WebAssemblyTargetLowering::WebAssemblyTargetLowering(
for (auto T : {MVT::f32, MVT::f64}) {		for (auto T : {MVT::f32, MVT::f64}) {
// Don't expand the floating-point types to constant pools.		// Don't expand the floating-point types to constant pools.
setOperationAction(ISD::ConstantFP, T, Legal);		setOperationAction(ISD::ConstantFP, T, Legal);
// Expand floating-point comparisons.		// Expand floating-point comparisons.
for (auto CC : {ISD::SETO, ISD::SETUO, ISD::SETUEQ, ISD::SETONE,		for (auto CC : {ISD::SETO, ISD::SETUO, ISD::SETUEQ, ISD::SETONE,
ISD::SETULT, ISD::SETULE, ISD::SETUGT, ISD::SETUGE})		ISD::SETULT, ISD::SETULE, ISD::SETUGT, ISD::SETUGE})
setCondCodeAction(CC, T, Expand);		setCondCodeAction(CC, T, Expand);
// Expand floating-point library function operators.		// Expand floating-point library function operators.
for (auto Op : {ISD::FSIN, ISD::FCOS, ISD::FSINCOS, ISD::FPOWI, ISD::FPOW})		for (auto Op : {ISD::FSIN, ISD::FCOS, ISD::FSINCOS, ISD::FPOWI, ISD::FPOW,
		ISD::FLDEXP})
setOperationAction(Op, T, Expand);		setOperationAction(Op, T, Expand);
// Note supported floating-point library function operators that otherwise		// Note supported floating-point library function operators that otherwise
// default to expand.		// default to expand.
for (auto Op : {ISD::FCEIL, ISD::FFLOOR, ISD::FTRUNC, ISD::FNEARBYINT,		for (auto Op : {ISD::FCEIL, ISD::FFLOOR, ISD::FTRUNC, ISD::FNEARBYINT,
ISD::FRINT})		ISD::FRINT})
setOperationAction(Op, T, Legal);		setOperationAction(Op, T, Legal);
}		}


lib/Target/X86/X86ISelLowering.cpp

if (!Subtarget->useSoftFloat()) {
setOperationAction(ISD::FMA, MVT::f80, Expand);		setOperationAction(ISD::FMA, MVT::f80, Expand);
}		}

// Always use a library call for pow.		// Always use a library call for pow.
setOperationAction(ISD::FPOW , MVT::f32 , Expand);		setOperationAction(ISD::FPOW , MVT::f32 , Expand);
setOperationAction(ISD::FPOW , MVT::f64 , Expand);		setOperationAction(ISD::FPOW , MVT::f64 , Expand);
setOperationAction(ISD::FPOW , MVT::f80 , Expand);		setOperationAction(ISD::FPOW , MVT::f80 , Expand);

		setOperationAction(ISD::FLDEXP , MVT::f32 , LibCall);
		setOperationAction(ISD::FLDEXP , MVT::f64 , LibCall);
		setOperationAction(ISD::FLDEXP , MVT::f80 , LibCall);

		// These are not available on some Windows configurations
		if (!getLibcallName(RTLIB::LDEXP_F32)) {
		setOperationAction(ISD::FLDEXP, MVT::f32, Expand);
		setOperationAction(ISD::FLDEXP, MVT::f80, Expand);
		}

setOperationAction(ISD::FLOG, MVT::f80, Expand);		setOperationAction(ISD::FLOG, MVT::f80, Expand);
setOperationAction(ISD::FLOG2, MVT::f80, Expand);		setOperationAction(ISD::FLOG2, MVT::f80, Expand);
setOperationAction(ISD::FLOG10, MVT::f80, Expand);		setOperationAction(ISD::FLOG10, MVT::f80, Expand);
setOperationAction(ISD::FEXP, MVT::f80, Expand);		setOperationAction(ISD::FEXP, MVT::f80, Expand);
setOperationAction(ISD::FEXP2, MVT::f80, Expand);		setOperationAction(ISD::FEXP2, MVT::f80, Expand);
setOperationAction(ISD::FMINNUM, MVT::f80, Expand);		setOperationAction(ISD::FMINNUM, MVT::f80, Expand);
setOperationAction(ISD::FMAXNUM, MVT::f80, Expand);		setOperationAction(ISD::FMAXNUM, MVT::f80, Expand);

for (MVT VT : MVT::vector_valuetypes()) {
setOperationAction(ISD::FABS, VT, Expand);		setOperationAction(ISD::FABS, VT, Expand);
setOperationAction(ISD::FSIN, VT, Expand);		setOperationAction(ISD::FSIN, VT, Expand);
setOperationAction(ISD::FSINCOS, VT, Expand);		setOperationAction(ISD::FSINCOS, VT, Expand);
setOperationAction(ISD::FCOS, VT, Expand);		setOperationAction(ISD::FCOS, VT, Expand);
setOperationAction(ISD::FSINCOS, VT, Expand);		setOperationAction(ISD::FSINCOS, VT, Expand);
setOperationAction(ISD::FREM, VT, Expand);		setOperationAction(ISD::FREM, VT, Expand);
setOperationAction(ISD::FMA, VT, Expand);		setOperationAction(ISD::FMA, VT, Expand);
setOperationAction(ISD::FPOWI, VT, Expand);		setOperationAction(ISD::FPOWI, VT, Expand);
		setOperationAction(ISD::FLDEXP, VT, Expand);
setOperationAction(ISD::FSQRT, VT, Expand);		setOperationAction(ISD::FSQRT, VT, Expand);
setOperationAction(ISD::FCOPYSIGN, VT, Expand);		setOperationAction(ISD::FCOPYSIGN, VT, Expand);
setOperationAction(ISD::FFLOOR, VT, Expand);		setOperationAction(ISD::FFLOOR, VT, Expand);
setOperationAction(ISD::FCEIL, VT, Expand);		setOperationAction(ISD::FCEIL, VT, Expand);
setOperationAction(ISD::FTRUNC, VT, Expand);		setOperationAction(ISD::FTRUNC, VT, Expand);
setOperationAction(ISD::FRINT, VT, Expand);		setOperationAction(ISD::FRINT, VT, Expand);
setOperationAction(ISD::FNEARBYINT, VT, Expand);		setOperationAction(ISD::FNEARBYINT, VT, Expand);
setOperationAction(ISD::SMUL_LOHI, VT, Expand);		setOperationAction(ISD::SMUL_LOHI, VT, Expand);

lib/Transforms/Utils/SimplifyLibCalls.cpp

Value *LibCallSimplifier::optimizeExp2(CallInst *CI, IRBuilder<> &B) {
// result type.		// result type.
if (FT->getNumParams() != 1 || FT->getReturnType() != FT->getParamType(0) ||		if (FT->getNumParams() != 1 || FT->getReturnType() != FT->getParamType(0) ||
!FT->getParamType(0)->isFloatingPointTy())		!FT->getParamType(0)->isFloatingPointTy())
return Ret;		return Ret;

Value *Op = CI->getArgOperand(0);		Value *Op = CI->getArgOperand(0);
// Turn exp2(sitofp(x)) -> ldexp(1.0, sext(x)) if sizeof(x) <= 32		// Turn exp2(sitofp(x)) -> ldexp(1.0, sext(x)) if sizeof(x) <= 32
// Turn exp2(uitofp(x)) -> ldexp(1.0, zext(x)) if sizeof(x) < 32		// Turn exp2(uitofp(x)) -> ldexp(1.0, zext(x)) if sizeof(x) < 32
		bool TryLdExp;
LibFunc::Func LdExp = LibFunc::ldexpl;		LibFunc::Func LdExp = LibFunc::ldexpl;
		if (Callee->isIntrinsic()) {
		TryLdExp = true;
		} else {
if (Op->getType()->isFloatTy())		if (Op->getType()->isFloatTy())
LdExp = LibFunc::ldexpf;		LdExp = LibFunc::ldexpf;
else if (Op->getType()->isDoubleTy())		else if (Op->getType()->isDoubleTy())
LdExp = LibFunc::ldexp;		LdExp = LibFunc::ldexp;
		TryLdExp = TLI->has(LdExp);
		}

if (TLI->has(LdExp)) {		if (TryLdExp) {
Value *LdExpArg = nullptr;		Value *LdExpArg = nullptr;
if (SIToFPInst *OpC = dyn_cast<SIToFPInst>(Op)) {		if (SIToFPInst *OpC = dyn_cast<SIToFPInst>(Op)) {
if (OpC->getOperand(0)->getType()->getPrimitiveSizeInBits() <= 32)		if (OpC->getOperand(0)->getType()->getPrimitiveSizeInBits() <= 32)
LdExpArg = B.CreateSExt(OpC->getOperand(0), B.getInt32Ty());		LdExpArg = B.CreateSExt(OpC->getOperand(0), B.getInt32Ty());
} else if (UIToFPInst *OpC = dyn_cast<UIToFPInst>(Op)) {		} else if (UIToFPInst *OpC = dyn_cast<UIToFPInst>(Op)) {
if (OpC->getOperand(0)->getType()->getPrimitiveSizeInBits() < 32)		if (OpC->getOperand(0)->getType()->getPrimitiveSizeInBits() < 32)
LdExpArg = B.CreateZExt(OpC->getOperand(0), B.getInt32Ty());		LdExpArg = B.CreateZExt(OpC->getOperand(0), B.getInt32Ty());
}		}

if (LdExpArg) {		if (LdExpArg) {
Constant *One = ConstantFP::get(CI->getContext(), APFloat(1.0f));		Constant *One = ConstantFP::get(CI->getContext(), APFloat(1.0f));
if (!Op->getType()->isFloatTy())		if (!Op->getType()->isFloatTy())
One = ConstantExpr::getFPExtend(One, Op->getType());		One = ConstantExpr::getFPExtend(One, Op->getType());

Module *M = Caller->getParent();		Module *M = Caller->getParent();
		if (Callee->isIntrinsic()) {
		Function *F =
		Intrinsic::getDeclaration(M, Intrinsic::ldexp, Op->getType());
		return B.CreateCall(F, {One, LdExpArg});
		} else {
Value *Callee =		Value *Callee =
M->getOrInsertFunction(TLI->getName(LdExp), Op->getType(),		M->getOrInsertFunction(TLI->getName(LdExp), Op->getType(),
Op->getType(), B.getInt32Ty(), nullptr);		Op->getType(), B.getInt32Ty(), nullptr);
CallInst *CI = B.CreateCall(Callee, {One, LdExpArg});		CallInst *CI = B.CreateCall(Callee, {One, LdExpArg});
if (const Function *F = dyn_cast<Function>(Callee->stripPointerCasts()))		if (const Function *F = dyn_cast<Function>(Callee->stripPointerCasts()))
CI->setCallingConv(F->getCallingConv());		CI->setCallingConv(F->getCallingConv());

return CI;		return CI;
}		}
}		}
		}
return Ret;		return Ret;
}		}

Value *LibCallSimplifier::optimizeFabs(CallInst *CI, IRBuilder<> &B) {		Value *LibCallSimplifier::optimizeFabs(CallInst *CI, IRBuilder<> &B) {
Function *Callee = CI->getCalledFunction();		Function *Callee = CI->getCalledFunction();

Value *Ret = nullptr;		Value *Ret = nullptr;
if (Callee->getName() == "fabs" && TLI->has(LibFunc::fabsf)) {		if (Callee->getName() == "fabs" && TLI->has(LibFunc::fabsf)) {
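
The exp2 fold in optimizeExp2 above relies on the identity exp2(x) = ldexp(1.0, x) for integer x. The size checks differ because ldexp takes a signed 32-bit exponent: a signed source of up to 32 bits can be sign-extended safely, while an unsigned source must be strictly narrower than 32 bits, otherwise values of 2^31 and above would be read as negative. A small sketch of the same identity in plain C++ follows; the helper names exp2OfSigned and exp2OfUnsigned are made up for illustration and do not appear in the patch.

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>

// exp2(sitofp(x)): the signed source is sign-extended to the int
// exponent that ldexp expects (the CreateSExt path in the patch).
inline double exp2OfSigned(int16_t X) {
  return std::ldexp(1.0, static_cast<int>(X));
}

// exp2(uitofp(x)): the unsigned source is zero-extended (the CreateZExt
// path). A 16-bit source always fits in a signed 32-bit exponent.
inline double exp2OfUnsigned(uint16_t X) {
  return std::ldexp(1.0, static_cast<int>(X));
}
```

For example, exp2OfSigned(10) yields 1024.0 and exp2OfSigned(-1) yields 0.5. Exponents far outside the double range saturate the way ldexp does, to infinity or zero.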

test/CodeGen/AMDGPU/llvm.ldexp.ll

This file was added.

				; RUN: llc -march=amdgcn -mcpu=SI -verify-machineinstrs < %s | FileCheck -check-prefix=SI %s
				; RUN: llc -march=amdgcn -mcpu=tonga -verify-machineinstrs < %s | FileCheck -check-prefix=SI %s

				; SI-LABEL: {{^}}test_ldexp_f32:
				; SI: v_ldexp_f32
				; SI: s_endpgm
				define void @test_ldexp_f32(float addrspace(1)* %out, float %a, i32 %b) nounwind {
				%result = call float @llvm.ldexp.f32(float %a, i32 %b) nounwind readnone
				store float %result, float addrspace(1)* %out, align 4
				ret void
				}

				; SI-LABEL: {{^}}test_ldexp_f64:
				; SI: v_ldexp_f64
				; SI: s_endpgm
				define void @test_ldexp_f64(double addrspace(1)* %out, double %a, i32 %b) nounwind {
				%result = call double @llvm.ldexp.f64(double %a, i32 %b) nounwind readnone
				store double %result, double addrspace(1)* %out, align 8
				ret void
				}

				arsenm: Should include vector versions for at least v2f32, v4f32 and v2f64. Also, can you merge the existing llvm.AMDGPU.ldexp.ll test into this one and rename them with a legacy_ prefix?
				declare float @llvm.ldexp.f32(float, i32) #1
				declare double @llvm.ldexp.f64(double, i32) #1

				attributes #1 = { nounwind readnone }

test/CodeGen/X86/ldexp.ll

This file was added.

				; RUN: llc < %s -mtriple=x86_64-unknown-unknown | FileCheck %s
				; RUN: llc < %s -mtriple=i386-pc-win32 | FileCheck %s -check-prefix=CHECK-WIN

				; CHECK-LABEL: ldexp_f32:
				arsenm: Vector tests here are probably a good idea as well.
				; CHECK-WIN-LABEL: ldexp_f32:
				; CHECK: jmp ldexpf
				; CHECK-WIN-NOT: ldexpf
				define float @ldexp_f32(i8 zeroext %x) {
				%1 = zext i8 %x to i32
				%2 = call float @llvm.ldexp.f32(float 1.000000e+00, i32 %1)
				ret float %2
				}

				; CHECK-LABEL: ldexp_f64:
				; CHECK-WIN-LABEL: ldexp_f64:
				; CHECK: jmp ldexp
				; CHECK-WIN: calll _ldexp
				define double @ldexp_f64(i8 zeroext %x) {
				%1 = zext i8 %x to i32
				%2 = call double @llvm.ldexp.f64(double 1.000000e+00, i32 %1)
				ret double %2
				}

				; Function Attrs: nounwind readnone
				declare double @llvm.ldexp.f64(double, i32) #0

				; Function Attrs: nounwind readnone
				declare float @llvm.ldexp.f32(float, i32) #0

				attributes #0 = { nounwind readnone }

test/Transforms/InstCombine/exp2-1.ll

; Test that the exp2 library call simplifier works correctly.		; Test that the exp2 library call simplifier works correctly.
;		;
; RUN: opt < %s -instcombine -S | FileCheck %s		; RUN: opt < %s -instcombine -S | FileCheck %s
; RUN: opt < %s -instcombine -S -mtriple=i386-pc-win32 | FileCheck %s -check-prefix=CHECK-WIN

target datalayout = "e-p:32:32:32-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:32:64-f32:32:32-f64:32:64-v64:64:64-v128:128:128-a0:0:64-f80:128:128"		target datalayout = "e-p:32:32:32-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:32:64-f32:32:32-f64:32:64-v64:64:64-v128:128:128-a0:0:64-f80:128:128"

declare double @exp2(double)		declare double @exp2(double)
declare float @exp2f(float)		declare float @exp2f(float)

; Check exp2(sitofp(x)) -> ldexp(1.0, sext(x)).		; Check exp2(sitofp(x)) -> ldexp(1.0, sext(x)).

; CHECK: call float @ldexpf
ret float %ret		ret float %ret
}		}

declare double @llvm.exp2.f64(double)		declare double @llvm.exp2.f64(double)
declare float @llvm.exp2.f32(float)		declare float @llvm.exp2.f32(float)

define double @test_simplify9(i8 zeroext %x) {		define double @test_simplify9(i8 zeroext %x) {
; CHECK-LABEL: @test_simplify9(		; CHECK-LABEL: @test_simplify9(
; CHECK-WIN-LABEL: @test_simplify9(
%conv = uitofp i8 %x to double		%conv = uitofp i8 %x to double
%ret = call double @llvm.exp2.f64(double %conv)		%ret = call double @llvm.exp2.f64(double %conv)
; CHECK: call double @ldexp		; CHECK: call double @llvm.ldexp.f64
; CHECK-WIN: call double @ldexp
ret double %ret		ret double %ret
}		}

define float @test_simplify10(i8 zeroext %x) {		define float @test_simplify10(i8 zeroext %x) {
; CHECK-LABEL: @test_simplify10(		; CHECK-LABEL: @test_simplify10(
; CHECK-WIN-LABEL: @test_simplify10(		; CHECK-WIN-LABEL: @test_simplify10(
%conv = uitofp i8 %x to float		%conv = uitofp i8 %x to float
%ret = call float @llvm.exp2.f32(float %conv)		%ret = call float @llvm.exp2.f32(float %conv)
; CHECK: call float @ldexpf		; CHECK: call float @llvm.ldexp.f32
; CHECK-WIN-NOT: call float @ldexpf
ret float %ret		ret float %ret
}		}
