This is an archive of the discontinued LLVM Phabricator instance.

Differential D55506

[RFC v2] Allow target to handle STRICT floating-point nodes
Closed, Public

Authored by uweigand on Dec 10 2018, 4:33 AM.

Details

Reviewers

hfinkel
cameron.mcinally
andrew.w.kaylor
kpn
arsenm
craig.topper
spatel
bogner
kbarton
scanon
mcberg2017

Commits

rG6c5d5ce5517b: Allow target to handle STRICT floating-point nodes
rL362663: Allow target to handle STRICT floating-point nodes

Summary

This is an alternate approach to the RFC in D45576 / D52785 / D52786. The main difference is that we're no longer attempting to use MachineMemOperand structures to capture floating-point exception status.

Instead, we make the MI codegen explicitly aware of the floating-point exceptions by introducing two new concepts:

A new MCID flag "mayRaiseFPException" that the target should set on any instruction that can possibly raise an FP exception according to the architecture definition.
A new MI flag FPExcept that CodeGen/SelectionDAG will set on any MI instruction resulting from expansion of any constrained FP intrinsic.

Any MI instruction that is *both* marked as mayRaiseFPException *and* FPExcept then needs to be considered as raising exceptions by MI-level codegen (e.g. scheduling).
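
The intersection rule can be sketched in a few lines of C++. This is only a minimal model under stated assumptions: `InstrDesc`, `Instr`, and the flag layout are hypothetical stand-ins for MCInstrDesc and MachineInstr, not the classes touched by the patch.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for the static, per-opcode description (MCID).
struct InstrDesc {
  bool MayRaiseFPException; // would be set via TableGen in the .td files
};

// Hypothetical per-instance flags (modelled loosely on MI flags).
enum MIFlag : uint8_t {
  NoFlags = 0,
  FPExcept = 1 << 0, // instance came from a constrained FP intrinsic
};

// Hypothetical stand-in for a machine instruction.
struct Instr {
  const InstrDesc *Desc;
  uint8_t Flags;

  // Treated as raising FP exceptions only if the opcode can raise them
  // AND this particular instance is marked as exception-sensitive.
  bool mayRaiseFPException() const {
    return Desc->MayRaiseFPException && (Flags & FPExcept) != 0;
  }
};
```

With this model, a non-strict FP add (opcode flag set, instance flag clear) and a strict load (opcode flag clear) both remain freely schedulable; only the strict FP add is constrained.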

Now, setting those two new flags is relatively straightforward. The mayRaiseFPException flag is simply set via TableGen by marking all relevant instruction patterns in the .td files.

The FPExcept flag is set in SDNodeFlags when creating the STRICT_ nodes in the SelectionDAG, and gets inherited in the MachineSDNode nodes created from it during instruction selection. The flag is then transferred to an MIFlag when creating the MI from the MachineSDNode. This is handled just like fast-math flags like no-nans are handled today. In a way, we can think of the FPExcept flag like an inverted fast-math flag "no-except" that just defaults to true instead of false.
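
A rough sketch of that hand-off, assuming a simplified node-flags struct and MI flag bits: the names `NodeFlags`, `MI_FmNoNans`, `MI_FPExcept`, and `toMIFlags` are hypothetical and only illustrate that the FP-exception bit travels the same way a fast-math bit does.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical, simplified view of per-node flags in the DAG.
struct NodeFlags {
  bool NoNaNs;   // an existing fast-math flag
  bool FPExcept; // set when the node is created from a constrained intrinsic
};

// Hypothetical MI-level flag bits.
enum : uint16_t {
  MI_FmNoNans = 1 << 0,
  MI_FPExcept = 1 << 1,
};

// Copy node flags to MI flags when the machine instruction is emitted.
// Both bits are transferred by the same mechanism.
uint16_t toMIFlags(const NodeFlags &F) {
  uint16_t Flags = 0;
  if (F.NoNaNs)
    Flags |= MI_FmNoNans;
  if (F.FPExcept)
    Flags |= MI_FPExcept;
  return Flags;
}
```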

This should address the concerns that MachineMemOperands might get dropped accidentally. The new mayRaiseFPException flag is an invariant setting, and the FPExcept flag is an MIFlag, which to my understanding cannot be dropped during MI codegen.

Diff Detail

Repository: rL LLVM

Event Timeline

uweigand created this revision. Dec 10 2018, 4:33 AM

Herald added subscribers: llvm-commits, wdng, qcolombet, MatzeB. Dec 10 2018, 4:33 AM

I like this implementation a lot. Some targets control the FPEnv through a register, not through memory. Modeling instructions with register side-effects through MachineMemOperand wasn't intuitive.

Also, the pattern changes seem concise. This is really good.

This patch does seem FP exception centric and rounding mode agnostic though. Should FPExcept and friends be named something more general to cover both? To be clear, I'm okay with the current naming scheme, so just playing Devil's advocate.

include/llvm/CodeGen/SelectionDAGNodes.h
378 ↗	(On Diff #177482)	Nit-picking: the first sentence was a little hard to parse. One alternative: We assume by default that instructions are considered to not raise Or maybe a rewrite would be cleaner: We assume instructions do not raise floating-point exceptions by default, and only those marked explicitly may do so. Just thinking aloud...

Updated comment.

In D55506#1331281, @cameron.mcinally wrote:

This patch does seem FP exception centric and rounding mode agnostic though. Should FPExcept and friends be named something more general to cover both? To be clear, I'm okay with the current naming scheme, so just playing Devil's advocate.

Well, this is because I'm really only using this feature to handle exceptions (b.t.w. just like the MemOperands in the alternate attempt).

Rounding mode is handled completely in the back-end: we simply make all floating-point instructions (strict or not doesn't even matter here) use the FPC control register, and mark all instructions that change the rounding mode as changing FPC. The one missing piece is that we need to mark all function calls to functions marked as within FENV_ACCESS ON regions as also clobbering FPC -- that can be done easily in the target ABI code as soon as the front-end marks such function calls (which we need anyway).
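
The register-based ordering described here can be illustrated with a small model. It is a sketch under assumptions: `RegInstr` and `mayReorderWrtFPC` are hypothetical names, and only the FPC dependency is modelled (no other operands, memory, or exceptions).

```cpp
#include <cassert>

// Hypothetical instruction summary: does it read and/or write FPC?
struct RegInstr {
  bool UsesFPC; // every FP arithmetic instruction, strict or not
  bool DefsFPC; // instructions that change the rounding mode
};

// Two instructions may be swapped with respect to FPC unless one of them
// writes FPC while the other reads or writes it (RAW, WAR, or WAW hazard).
bool mayReorderWrtFPC(const RegInstr &A, const RegInstr &B) {
  bool AWritesConflict = A.DefsFPC && (B.UsesFPC || B.DefsFPC);
  bool BWritesConflict = B.DefsFPC && (A.UsesFPC || A.DefsFPC);
  return !(AWritesConflict || BWritesConflict);
}
```

Two FP adds only read FPC and so stay reorderable relative to each other, while neither can cross an instruction that sets the rounding mode.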

@uweigand, it looks like you submitted some unintended changes with the last Diff update. Just FYI...

[EDIT: Maybe not. I'm not sure what I was seeing, but it seems to be gone after reloading. Apologies for the noise.]

Well, this is because I'm really only using this feature to handle exceptions (b.t.w. just like the MemOperands in the alternate attempt).

Rounding mode is handled completely in the back-end: we simply make all floating-point instructions (strict or not doesn't even matter here) use the FPC control register, and mark all instructions that change the rounding mode as changing FPC. The one missing piece is that we need to mark all function calls to functions marked as within FENV_ACCESS ON regions as also clobbering FPC -- that can be done easily in the target ABI code as soon as the front-end marks such function calls (which we need anyway).

Hm, I may be thinking about this a little differently. I'll elaborate to make sure we're on the same page...

I was envisioning a -frounding-math-like option that is more than just adhering to the current rounding mode. That option would also suppress optimizations affecting rounding results. This would be very much like how we use STRICT_ nodes to suppress optimizations on instructions that may trap. That's why I suggested that mayRaiseException and friends may be a little too narrow.

Is that how you are envisioning the rounding mode support to work too?

Well, mayRaiseException is purely an MI-level flag. I struggle to see where optimizations on the MI level would ever care about rounding modes in the sense you describe: note that currently, MI optimizations don't even know which operation an MI instruction performs -- if you don't even know whether you're dealing with addition or subtraction, why would you care which rounding mode the operation is performed in? MI transformations instead care about what I'd call "structural" properties of the operation: what are the operands, what is input vs. output, which memory may be accessed, which special registers may be involved, which other side effects may the operation have. This is the type of knowledge you need for the types of transformations that are done on the MI level: mostly about moving instructions around, arriving at an optimal schedule, de-duplicating identical operations performed multiple times etc. (Even things like simply changing a register operand to a memory operand for the same operation cannot be done solely by common MI optimizations but require per-target support.)

On this level, I do believe that my proposed patch captures all relevant "structural" properties of floating-point instructions: dependency on control registers (including controlling and changing rounding modes), and the possibility of floating-point exceptions and traps (affecting whether an instruction may be rescheduled or executed speculatively). If you can think of anything I missed here, I'd certainly appreciate learning more.

Now, of course, earlier passes (on the IR and also SelectionDAG levels) certainly do perform transformations that affect the actual operations performed, and those would certainly care. But at those levels we already have all the extra information we need; at the IR level we have the constrained intrinsics, and at the DAG level we have the STRICT_ nodes. (Currently, those STRICT_ nodes lose a bit of information available at the IR level: we don't specifically distinguish whether a STRICT_ node arose from an IR that is marked as requiring special handling because of just rounding modes, just exceptions, or both. If we ever want to add DAG optimization for STRICT_ node that requires this information, it would certainly be fine with me to add another bit in the SDNodeFlags for example.)

This looks like a promising direction. I particularly like the idea of having a way to intersect information from the backend instruction definitions with the constraints coming from the IR. However, I also have some concerns.

It seems that you're doing two things in this patch -- (1) adding the FPC register to the SystemZ backend, and (2) adding the strict FP handshake mechanism. Could you separate those two change sets?

I'd like the new flags being added to make specific reference to FP. People who don't do a lot of FP-specific work are likely to misunderstand any unqualified term such as mayRaiseException. For the MIFlag I'd prefer something like FPStrict because I expect that we will want to extend this to handling rounding mode issues like constant folding or code motion relative to instructions that change the rounding mode.

The unmodeled side effects approach is a lot stronger than we really need. Ideally, these instructions would only act as a barrier to one another and not to other instructions. My concern is that simply marking them as having unmodeled side effects is going to have a bigger hit on optimizations than we want. Granted, the same thing is happening at the IR level with the intrinsics but I have a rough idea of how we'll be able to move forward in that case without rewriting the mechanism.

What I'd really like to see is a way for the backend to be able to completely model the relevant FP register uses/defs but conditionally strip those uses/defs out for non-strict instructions. I explored this with the X86 backend. I had to create an artificial split between the control and status parts of the MXCSR register. I ran into problems because every FP instruction had to be a use and a def of the status bits. That would be fine for strict mode, but it wasn't really acceptable as a default behavior. I only have a rough idea of how this would work.

Another problem I ran into when I tried to add modeling of the MXCSR register is that the machine verifier didn't like seeing uses of this register without ever having seen a def. That didn't show up in my initial testing and I got as far as committing something that modeled the use of MXCSR by the instructions that explicitly read and write it, but the machine verifier issue appeared in post-commit testing. I think it came up with inline assembly that was using the STMXCSR instruction directly. It seems like this could be an issue with the FPC register you're adding for SystemZ also.

Eventually I'd really like a way to pass the rounding mode argument through to the backend. I've been emphatic in the past that this argument is meant to describe the assumed rounding mode rather than control it, but it occurs to me that for architectures which have FP instructions where the rounding mode can/must be baked into the instruction the backend could use the assumed rounding mode to set the rounding mode operand. In X86 we've only got a few instructions like this, but it's my understanding that some other architectures require it more broadly.

include/llvm/CodeGen/SelectionDAGNodes.h
469 ↗	(On Diff #178261)	There has been some discussion in the past about the possibility of using the fast math flags while still preserving strict FP exceptions. I'm not sure we reached a conclusion.
lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
6520 ↗	(On Diff #178261)	This really shouldn't be unconditional. It's possible to have constrained intrinsics that are preserving the rounding mode but ignoring FP exceptions. At this point we still have that information.

Well, mayRaiseException is purely an MI-level flag. I struggle to see where optimizations on the MI level would ever care about rounding modes in the sense you describe: note that currently, MI optimizations don't even know which operation an MI instruction performs -- if you don't even know whether you're dealing with addition or subtraction, why would you care which rounding mode the operation is performed in? MI transformations instead care about what I'd call "structural" properties of the operation: what are the operands, what is input vs. output, which memory may be accessed, which special registers may be involved, which other side effects may the operation have. This is the type of knowledge you need for the types of transformations that are done on the MI level: mostly about moving instructions around, arriving at an optimal schedule, de-duplicating identical operations performed multiple times etc. (Even things like simply changing a register operand to a memory operand for the same operation cannot be done solely by common MI optimizations but require per-target support.)

Huh, this is interesting and was not clear to me a priori. After digging around a bit, I agree with you that almost all of the MI code is fine wrt rounding.

In D55506#1333753, @andrew.w.kaylor wrote:

This looks like a promising direction. I particularly like the idea of having a way to intersect information from the backend instruction definitions with the constraints coming from the IR. However, I also have some concerns.

Thanks for the review!

It seems that you're doing two things in this patch -- (1) adding the FPC register to the SystemZ backend, and (2) adding the strict FP handshake mechanism. Could you separate those two change sets?

Of course. I'm keeping them as separate patches internally anyway, and was planning to commit them separately. I just wanted to show what the full picture would look like in the end.

I'd like the new flags being added to make specific reference to FP. People who don't do a lot of FP-specific work are likely to misunderstand any unqualified term such as mayRaiseException. For the MIFlag I'd prefer something like FPStrict because I expect that we will want to extend this to handling rounding mode issues like constant folding or code motion relative to instructions that change the rounding mode.

I'd be fine with using mayRaiseFPException instead. I specifically did not want to mix other aspects (like rounding modes) into it, but rather separate the concerns here. See my previous comments to Cameron about rounding modes in general. But if we do need to track those at the MI level, I'd rather prefer to add another bit in MIFlags, and keep the one I add here strictly about exceptions.

Code motion relative to instructions that change the rounding mode is already handled in my patch, via the dependency on FPC that is added to all FP instructions.

The unmodeled side effects approach is a lot stronger than we really need. Ideally, these instructions would only act as a barrier to one another and not to other instructions. My concern is that simply marking them as having unmodeled side effects is going to have a bigger hit on optimizations than we want. Granted, the same thing is happening at the IR level with the intrinsics but I have a rough idea of how we'll be able to move forward in that case without rewriting the mechanism.

Agreed. In fact, that's really the main reason why I want to have the MI flag specifically about exceptions, so that common code can be written to handle the flag exactly as required for FP exceptions and nothing else. I just didn't want to complicate this initial patch, and therefore added only a quick (overly conservative, but correct) check to hasUnmodeledSideEffects. Once this is in, we can refine the handling as a follow-on patch. (For example, handle exactly the necessary dependencies for FP exceptions in ScheduleDAGInstrs::buildSchedGraph, which might e.g. allow two FP instructions to be swapped as long as no speculative execution of FP instructions is introduced.)

What I'd really like to see is a way for the backend to be able to completely model the relevant FP register uses/defs but conditionally strip those uses/defs out for non-strict instructions. I explored this with the X86 backend. I had to create an artificial split between the control and status parts of the MXCSR register. I ran into problems because every FP instruction had to be a use and a def of the status bits. That would be fine for strict mode, but it wasn't really acceptable as a default behavior. I only have a rough idea of how this would work.

That's why I was going away from using registers to model exceptions. As you suggest, I'm also in effect performing a split between the control and status parts of the FPC register: I'm using the register at the MI level to model the control part, and using the mayRaiseException flag to model the status part. The side effect implied by the flag includes both the trap and setting of the status bits. This seems more straightforward than adding register defs, in particular since the latter overly constrain scheduling.

Eventually I'd really like a way to pass the rounding mode argument through to the backend. I've been emphatic in the past that this argument is meant to describe the assumed rounding mode rather than control it, but it occurs to me that for architectures which have FP instructions where the rounding mode can/must be baked into the instruction the backend could use the assumed rounding mode to set the rounding mode operand. In X86 we've only got a few instructions like this, but it's my understanding that some other architectures require it more broadly.

I'm not really sure I see where this can help. On SystemZ there are a few instructions that encode rounding modes, but those are mapped to separate DAG opcodes. For example, frint / fnearbyint / ffloor / fceil / ftrunc / fround all map to the same SystemZ instruction, just with a different encoded rounding mode value. The existing DAG instruction selection mechanism seems fine for that.
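
As an illustration of one instruction serving several DAG opcodes, the mapping can be written as a small table. This is a hedged sketch: `RoundOp`, `RoundEncoding`, and `encodingFor` are hypothetical names, and the mode numbers reflect my reading of the SystemZ rounding-mode field, so they are assumptions to be checked against the architecture documentation rather than taken from this patch.

```cpp
#include <cassert>

// Hypothetical names for the six rounding operations listed above.
enum class RoundOp { FRint, FNearbyInt, FFloor, FCeil, FTrunc, FRound };

// Operands that would be baked into the single "round to integer" instruction.
struct RoundEncoding {
  unsigned Mode;        // assumed rounding-mode field; 0 = use current mode
  bool SuppressInexact; // assumed flag suppressing the inexact exception
};

// Each DAG opcode selects the same instruction with different fixed operands.
RoundEncoding encodingFor(RoundOp Op) {
  switch (Op) {
  case RoundOp::FRint:      return {0, false}; // current mode, may signal inexact
  case RoundOp::FNearbyInt: return {0, true};  // current mode, inexact suppressed
  case RoundOp::FFloor:     return {7, true};  // toward negative infinity
  case RoundOp::FCeil:      return {6, true};  // toward positive infinity
  case RoundOp::FTrunc:     return {5, true};  // toward zero
  case RoundOp::FRound:     return {1, true};  // to nearest, ties away from zero
  }
  return {0, false}; // not reached for valid enumerators
}
```

Because the choice is fully determined by the DAG opcode, ordinary pattern matching is enough and no separate rounding-mode argument has to reach the backend for these cases.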

uweigand marked an inline comment as done. Dec 18 2018, 2:29 AM

uweigand added inline comments.

lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
6520 ↗	(On Diff #178261)	Yes, of course, I'll add that for the final version. That's the main reason why I added the SDNode flag in the first place, instead of just checking for a STRICT_ node later on.
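
A minimal sketch of making the flag conditional, assuming the exception-behavior metadata is available as a string when the node is built. The function name `shouldSetFPExcept` is hypothetical; the three metadata strings are the ones used by the constrained intrinsics.

```cpp
#include <cassert>
#include <string>

// Decide whether a node built from a constrained intrinsic needs the
// FP-exception flag. Only "fpexcept.ignore" lets us leave the flag clear;
// "fpexcept.maytrap", "fpexcept.strict", and anything unrecognized are
// treated conservatively as possibly raising exceptions.
bool shouldSetFPExcept(const std::string &ExceptMetadata) {
  return ExceptMetadata != "fpexcept.ignore";
}
```

Treating unknown strings as strict is a deliberate choice in this sketch: dropping the flag is only safe when the IR explicitly says exceptions are ignored.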

cameron.mcinally mentioned this in D54649: [FPEnv] Rough out constrained FCmp intrinsics. Jan 14 2019, 7:07 AM

In D55506#1332231, @uweigand wrote:

Well, mayRaiseException is purely an MI-level flag. I struggle to see where optimizations on the MI level would ever care about rounding modes in the sense you describe: note that currently, MI optimizations don't even know which operation an MI instruction performs -- if you don't even know whether you're dealing with addition or subtraction, why would you care which rounding mode the operation is performed in? MI transformations instead care about what I'd call "structural" properties of the operation: what are the operands, what is input vs. output, which memory may be accessed, which special registers may be involved, which other side effects may the operation have. This is the type of knowledge you need for the types of transformations that are done on the MI level: mostly about moving instructions around, arriving at an optimal schedule, de-duplicating identical operations performed multiple times etc. (Even things like simply changing a register operand to a memory operand for the same operation cannot be done solely by common MI optimizations but require per-target support.)

On this level, I do believe that my proposed patch captures all relevant "structural" properties of floating-point instructions: dependency on control registers (including controlling and changing rounding modes), and the possibility of floating-point exceptions and traps (affecting whether an instruction may be rescheduled or executed speculatively). If you can think of anything I missed here, I'd certainly appreciate learning more.

Now, of course, earlier passes (on the IR and also SelectionDAG levels) certainly do perform transformations that affect the actual operations performed, and those would certainly care. But at those levels we already have all the extra information we need; at the IR level we have the constrained intrinsics, and at the DAG level we have the STRICT_ nodes. (Currently, those STRICT_ nodes lose a bit of information available at the IR level: we don't specifically distinguish whether a STRICT_ node arose from an IR that is marked as requiring special handling because of just rounding modes, just exceptions, or both. If we ever want to add DAG optimization for STRICT_ node that requires this information, it would certainly be fine with me to add another bit in the SDNodeFlags for example.)

I think we can have a flag to indicate the rounding mode for a floating-point operator such as FADD instead of STRICT_FADD, because STRICT_FADD does have a side effect (the chain), but some targets (e.g. RISCV) have a static rounding mode encoded in the instruction, which has no side effect. Then we can optimize a specific STRICT_FADD node that ignores exceptions into a normal FADD with the corresponding static rounding mode.

Herald added a subscriber: jdoerfert. Feb 24 2019, 11:19 PM

In D55506#1408435, @wuzish wrote:

I think we can have a flag to indicate the rounding mode for a floating-point operator such as FADD instead of STRICT_FADD, because STRICT_FADD does have a side effect (the chain), but some targets (e.g. RISCV) have a static rounding mode encoded in the instruction, which has no side effect. Then we can optimize a specific STRICT_FADD node that ignores exceptions into a normal FADD with the corresponding static rounding mode.

WRT the chain: "That's not a bug, that's a feature!"

The chain prevents reordering of floating point instructions by the SelectionDAG. That makes it required.

Some System/Z floating point instructions can take an optional per-instruction rounding mode encoded in the instruction. This does not eliminate the need for strict ordering of floating point operations.

jsji added a subscriber: jsji. Feb 25 2019, 10:40 AM

In D55506#1408913, @kpn wrote:

In D55506#1408435, @wuzish wrote:

I think we can have a flag to indicate the rounding mode for a floating-point operator such as FADD instead of STRICT_FADD, because STRICT_FADD does have a side effect (the chain), but some targets (e.g. RISCV) have a static rounding mode encoded in the instruction, which has no side effect. Then we can optimize a specific STRICT_FADD node that ignores exceptions into a normal FADD with the corresponding static rounding mode.

WRT the chain: "That's not a bug, that's a feature!"

The chain prevents reordering of floating point instructions by the SelectionDAG. That makes it required.

Some System/Z floating point instructions can take an optional per-instruction rounding mode encoded in the instruction. This does not eliminate the need for strict ordering of floating point operations.

Yes, the chain prevents reordering of floating point instructions by SelectionDAG. But it's fine to reorder instructions that have a static rounding mode (and ignore exceptions), and they need no chain. Because they have no side effects, the result of such instructions depends only on the inputs (e.g. for a float add, the source operands and the mode encoding) and not on outside state such as the current rounding mode. It also means the result is unchanged no matter how many times the instruction executes, so long as the inputs are the same.

In D55506#1410052, @wuzish wrote:

Yes, the chain prevents reordering of floating point instructions by SelectionDAG. But it's fine to reorder instructions that have a static rounding mode (and ignore exceptions), and they need no chain. Because they have no side effects, the result of such instructions depends only on the inputs (e.g. for a float add, the source operands and the mode encoding) and not on outside state such as the current rounding mode. It also means the result is unchanged no matter how many times the instruction executes, so long as the inputs are the same.

I'm not sure exactly what you have in mind here. For instructions which take explicit rounding mode arguments reordering is not always an issue, but when we're building the selection DAG do we know that we will end up with instructions that take an explicit rounding mode argument? For some architectures we do, but not for all. The concern with reordering, in the case where FP status flags are being ignored, is that we need to avoid possibly reordering instructions with respect to instructions that change the rounding mode. If the rounding mode is explicit in the instruction that's not an issue but otherwise it is, and at least in the case of X86 I don't think we can tell ahead of time (at least without doing things we shouldn't be doing) which instructions we'll end up with.

andrew.w.kaylor added a subscriber: pengfei. Mar 21 2019, 5:52 PM

cameron.mcinally mentioned this in D59833: [FPEnv] New document for adding new constrained FP intrinsics. Mar 29 2019, 2:18 PM

mcberg2017 added a subscriber: mcberg2017. Apr 3 2019, 10:52 PM

annita.zhang added a subscriber: annita.zhang. Apr 16 2019, 6:46 PM

Updated to address issues raised during the review:

Separated out the introduction of the SystemZ FPC register -- this has now been committed as rev. 360570. The remaining changes in this patch are now solely related to handling FP exceptions.
Renamed the new flag to mayRaiseFPException to make clear that this is about floating-point.
No longer treat FP exceptions as equivalent to unmodeled side effects, but treat them separately. Specifically, as I mentioned above, I'm now handling instructions that may raise FP exceptions in the scheduler such that they'll only conflict with each other (and global barriers like calls and unmodeled side effects), but not with loads or stores.
Respect the constrained intrinsics metadata to the extent that instructions with fpexcept.ignore no longer are marked as "may raise FP exceptions".
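
The scheduling rule from the third item above can be summarized as a pairwise predicate. This is a simplified sketch, not the code in ScheduleDAGInstrs: `Kind` and `needsFPOrderingEdge` are hypothetical names, and ordinary memory-to-memory dependencies (aliasing) are deliberately left out.

```cpp
#include <cassert>

// Hypothetical classification of instructions for this sketch.
enum class Kind {
  PlainMemory,   // ordinary, non-volatile load or store
  FPMayRaise,    // instruction that may raise an FP exception
  GlobalBarrier, // call, volatile/atomic access, unmodeled side effects
};

// Must the scheduler keep A and B in their original relative order because
// of FP exceptions or global barriers? Memory aliasing is not modelled here.
bool needsFPOrderingEdge(Kind A, Kind B) {
  // Global barriers order against everything.
  if (A == Kind::GlobalBarrier || B == Kind::GlobalBarrier)
    return true;
  // FP-exception instructions conflict only with each other.
  return A == Kind::FPMayRaise && B == Kind::FPMayRaise;
}
```

The key difference from treating FP exceptions as unmodeled side effects is the middle case: a strict FP instruction and a plain load or store get no edge, so they may still be moved past each other.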

This patch, together with the FPC register patch already committed, is to my understanding sufficient to correctly handle the FP exception related aspects of the constrained intrinsics (as to rounding mode, we still need support for function calls that may change the rounding mode). To make progress, I'd like to move forward with committing this piece -- assuming all issues related to this part are now addressed. Comments / reviews welcome!

Aside from two minor comments, I think this looks fine.
However, I don't think I'm qualified to give the final approval for this to land as I'm just starting to learn the background here.

include/llvm/CodeGen/MachineInstr.h
835 ↗	(On Diff #199243)	it -> if
include/llvm/CodeGen/SelectionDAGNodes.h
376 ↗	(On Diff #199243)	Should this be renamed to NoFPExcept also, to remain consistent with the mayRaiseFPException flag?

Why is the FPBarrierChain getting involved with the BarrierChain for memory objects? Even without this patch I think we might be entangling the strict FP nodes with other chained nodes more than we should.

Are there any cases where mayRaiseFPException should not imply hasUnmodeledSideEffects? If we checked mayRaiseFPException() inside the hasUnmodeledSideEffects() implementation a lot of these changes wouldn't be needed, and it might help with people who didn't think about FP exceptions in future changes.

There is no way to distinguish between fpexcept.maytrap and fpexcept.strict after pattern matching. We could probably live with that, but it's less than ideal.

I'd like to see a solution that could handle rounding mode also. For rounding mode it will be even more important to have more than a single on/off flag. Do you have any ideas for that?

In D55506#1527651, @andrew.w.kaylor wrote:

Why is the FPBarrierChain getting involved with the BarrierChain for memory objects? Even without this patch I think we might be entangling the strict FP nodes with other chained nodes more than we should.

The BarrierChain tracks calls, volatile/atomic memory accesses, and UnmodeledSideEffects. I do believe that FP exceptions must indeed be kept stable relative to those.

My patch does *not* entangle FP exceptions with regular memory accesses, which do *not* touch BarrierChain.

Are there any cases where mayRaiseFPException should not imply hasUnmodeledSideEffects? If we checked mayRaiseFPException() inside the hasUnmodeledSideEffects() implementation a lot of these changes wouldn't be needed, and it might help with people who didn't think about FP exceptions in future changes.

Well, my last version did have FP exceptions implying UnmodeledSideEffects, and you didn't like that :-) Specifically, that would mean that FP exceptions could not be moved across any normal memory access (since UnmodeledSideEffect instructions cannot) ...

There is no way to distinguish between fpexcept.maytrap and fpexcept.strict after pattern matching. We could probably live with that, but it's less than ideal.

Well, to be honest I don't really see what the difference between those two at the MI level would be. Can you explain e.g. how scheduling restrictions ought to differ? If we have a need for that, we can just add a second MI flag.

I'd like to see a solution that could handle rounding mode also. For rounding mode it will be even more important to have more than a single on/off flag. Do you have any ideas for that?

As I mentioned in the description, I believe rounding modes can be handled completely in the back-end, by simply modeling the FPC register (which I already committed on SystemZ). Every FP instruction (strict or not) is modeled as a user of FPC, all assembler instructions that modify it (which are only a few special-purpose instructions) are modeled as a definition, and function calls within a FENV_ACCESS section should be marked by the front-end as having a variant ABI, which will cause the back-end to have them marked as clobbering FPC (that last part is still missing, but is completely independent of everything that is done by this patch). What aspect of rounding modes do you think is not covered by this approach?

In D55506#1527992, @uweigand wrote:

In D55506#1527651, @andrew.w.kaylor wrote:

Why is the FPBarrierChain getting involved with the BarrierChain for memory objects? Even without this patch I think we might be entangling the strict FP nodes with other chained nodes more than we should.

The BarrierChain tracks calls, volatile/atomic memory accesses, and UnmodeledSideEffects. I do believe that FP exceptions must indeed be kept stable relative to those.

My patch does *not* entangle FP exceptions with regular memory accesses, which do *not* touch BarrierChain.

I see. Thanks for the explanation.
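
The ordering rule described in this exchange can be pictured with a small toy model. This is only a sketch under stated assumptions: `OtherKind`, `touchesBarrierChain`, and `fpExceptionOrderedAgainst` are hypothetical names invented for illustration and are not part of ScheduleDAGInstrs or of the patch.

```cpp
#include <cassert>

// Hypothetical classification of the "other" instruction; not an LLVM type.
enum class OtherKind {
  Call,
  VolatileOrAtomicMem,
  UnmodeledSideEffects,
  RegularMem,
  PlainArithmetic
};

// Per the explanation above, these are the instructions tracked by
// BarrierChain.
bool touchesBarrierChain(OtherKind K) {
  return K == OtherKind::Call || K == OtherKind::VolatileOrAtomicMem ||
         K == OtherKind::UnmodeledSideEffects;
}

// True if an instruction that may raise an FP exception has to keep its
// position relative to an instruction of kind K.
bool fpExceptionOrderedAgainst(OtherKind K) { return touchesBarrierChain(K); }

void exampleBarrierOrdering() {
  assert(fpExceptionOrderedAgainst(OtherKind::Call));
  assert(fpExceptionOrderedAgainst(OtherKind::VolatileOrAtomicMem));
  // Regular loads and stores do not touch BarrierChain, so no ordering edge.
  assert(!fpExceptionOrderedAgainst(OtherKind::RegularMem));
}
```

In this model an FP-exception instruction can still be scheduled across an ordinary load or store, which is the property the reply above emphasizes.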

Are there any cases where mayRaiseFPException should not imply hasUnmodeledSideEffects? If we checked mayRaiseFPException() inside the hasUnmodeledSideEffects() implementation a lot of these changes wouldn't be needed, and it might help with people who didn't think about FP exceptions in future changes.

Well, my last version did have FP exceptions implying UnmodeledSideEffects, and you didn't like that :-) Specifically, that would mean that FP exceptions could not be moved across any normal memory access (since UnmodeledSideEffect instructions cannot) ...

Sorry about that. I forgot what I said before. I've just re-read my earlier comments, and I think that line of reasoning was sound. As I said before, hasUnmodeledSideEffects is -- at least in theory -- a stronger barrier than mayRaiseFPException. Looking at your current patch, I had the impression that hasUnmodeledSideEffects and mayRaiseFPException were always coinciding, but even if that is currently the case it isn't necessarily required to be so. I apologize for the noise.

There is no way to distinguish between fpexcept.maytrap and fpexcept.strict after pattern matching. We could probably live with that, but it's less than ideal.

Well, to be honest I don't really see what the difference between those two at the MI level would be. Can you explain e.g. how scheduling restrictions ought to differ? If we have a need for that, we can just add a second MI flag.

With regard to scheduling there is no difference between strict and maytrap. The difference comes when we want to eliminate instructions. For example, if we have something like this:

BB1:
  FMP %0, %1
  JCC %BB2, 4
  JMP %BB3
BB2:
  JMP %BB3
BB3:
  ...

Under "fpexcept.strict" we cannot eliminate the compare because it may raise an exception (though we can still get rid of the JCC), but under "fpexcept.maytrap" we can eliminate the compare because our only promise is that we won't raise spurious exceptions.

I'd like to see a solution that could handle rounding mode also. For rounding mode it will be even more important to have more than a single on/off flag. Do you have any ideas for that?

As I mentioned in the description, I believe rounding modes can be handled completely in the back-end, by simply modeling the FPC register (which I already committed on SystemZ). Every FP instruction (strict or not) is modeled as a user of FPC, all assembler instructions that modify it (which are only a few special-purpose instructions) are modeled as a definition, and function calls within a FENV_ACCESS section should be marked by the front-end as having a variant ABI, which will cause the back-end to have them marked as clobbering FPC (that last part is still missing, but is completely independent of everything that is done by this patch). What aspect of rounding modes do you think is not covered by this approach?

First, I'd like to avoid modeling the FP control and status registers when we aren't in a constrained mode. I haven't done any tests to measure the effect of modeling these registers, but it does place some constraints on the backend that don't need to be there in the unconstrained mode and at least has the potential to degrade performance.

Second, there are some additional optimizations we could make in the cases where we know the rounding mode. Constant folding is an example. I'm not sure there are any backend optimizations that do constant folding after instruction selection, but there was a review recently where this exact issue came up for constant folding during ISel and the conclusion was that we could handle it if we knew the rounding mode but if not we'd have to just block the folding.

Third, there are some architectures where the rounding mode can be included as an operand. This is a potentially confusing point so I'm going to be verbose. I've said before that the rounding mode in constrained instructions is only a descriptive hint to the optimizer, declaring what the rounding mode is at that point rather than a prescriptive operator that sets the rounding mode. However, in the case where we do know the rounding mode (i.e. the rounding mode operand is something other than "round.dynamic") we can use that information to select an instruction form that has a rounding mode operand. How important or unimportant this capability is will depend on the architecture, but in some cases I think it could be significant.

With regard to the patch as a whole, let me say that my biggest concern at this point is to make sure we aren't going toward a dead end. If we think that all of the issues I've raised can likely be addressed in some variation of the patch you've proposed here then I would be in favor of committing this patch now without solving all of those problems. I just don't want to put something in place that will need to be ripped out later in order to make progress. Of course I understand that there's a balance to be found here. I know some of what we have already will need to be replaced, and that's just the way things go. That is to say, I just want to make sure we're headed in more or less the right direction.

Thanks for your patience and persistence.

Hopefully modelling all SSE/AVX/AVX512 FP instructions as having an implicit use of the control portion of MXCSR for rounding mode and exception controls won't create a significant constraint on the backend. I think most of the trouble would start if we had it as an implicit def as well. Properly annotating this in tablegen will likely be tedious and error-prone, unfortunately. This is just due to the complexity of the 3 different encoding forms as well as the shared multiclasses in the tablegen files. I can't promise that some integer and FP stuff aren't using the same multiclasses.

In D55506#1528109, @andrew.w.kaylor wrote:

Under "fpexcept.strict" we cannot eliminate the compare because it may raise an exception (though we can still get rid of the JCC), but under "fpexcept.maytrap" we can eliminate the compare because our only promise is that we won't raise spurious exceptions.

Currently, it looks like the MI common optimizers cannot easily make that distinction; for example, the MI-level dead code elimination pass checks "isSafeToMove" to decide whether an instruction may be deleted. Now, instructions that may raise FP exceptions are never safe to move even with fpexcept.maytrap, so that check must always fail. If it ever becomes important to make that distinction, I'd add more fine-grained primitives like isSafeToMove vs. isSafeToDelete, and then add a second MI flag like NoDelete that would be set for fpexcept.strict insns.
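
A rough sketch of the finer-grained primitives suggested above, using a toy model rather than LLVM code. `ToyInstr`, its fields, and both predicates are hypothetical names for illustration only; in particular `NoDelete` is the second flag floated in this comment, which the patch itself does not add.

```cpp
#include <cassert>

// Toy stand-in for a machine instruction; not an LLVM type.
struct ToyInstr {
  bool MayRaiseFPException = false; // plays the role of the FPExcept MI flag
  bool NoDelete = false;            // hypothetical flag for fpexcept.strict
  bool HasOtherSideEffects = false; // calls, stores, and so on
  bool ResultIsUsed = false;
};

// An instruction that may raise an FP exception must not be moved, under
// either fpexcept.strict or fpexcept.maytrap.
bool isSafeToMove(const ToyInstr &I) {
  return !I.MayRaiseFPException && !I.HasOtherSideEffects;
}

// Deleting is a weaker requirement: under fpexcept.maytrap the only promise
// is that no spurious exception is raised, so a dead instruction may go away
// unless it is explicitly marked NoDelete.
bool isSafeToDelete(const ToyInstr &I) {
  if (I.ResultIsUsed || I.HasOtherSideEffects)
    return false;
  return !I.NoDelete;
}

void exampleDeadCompare() {
  ToyInstr MayTrapCmp;
  MayTrapCmp.MayRaiseFPException = true;

  ToyInstr StrictCmp = MayTrapCmp;
  StrictCmp.NoDelete = true;

  assert(!isSafeToMove(MayTrapCmp));   // never moved
  assert(isSafeToDelete(MayTrapCmp));  // dead compare may be removed
  assert(!isSafeToDelete(StrictCmp));  // must stay under fpexcept.strict
}
```

With a split like this, the dead compare from the earlier example could be removed under fpexcept.maytrap while remaining in place under fpexcept.strict.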

As an aside, I'm not sure what this fpexcept.strict semantics is even necessary for; I don't believe it is required for the C standard FENV_ACCESS ON mode ...

First, I'd like to avoid modeling the FP control and status registers when we aren't in a constrained mode. I haven't done any tests to measure the effect of modeling these registers, but it does place some constraints on the backend that don't need to be there in the unconstrained mode and at least has the potential to degrade performance.

As Craig pointed out, just adding a use cannot degrade performance. I've already changed the SystemZ back-end to do this unconditionally, and it generated 100% identical assembler code as before (except in functions that use a built-in to modify the rounding mode, but there we actually want those changes).
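
Why a pure use adds no scheduling constraint can be seen from the usual register dependence rule, sketched here as a toy model. `FPCAccess` and `orderedByFPC` are hypothetical names for illustration; the real dependence tracking lives in the scheduler and is not shown.

```cpp
#include <cassert>

// Toy model of how an instruction touches one physical register (here FPC).
enum class FPCAccess { None, Use, Def };

// Two instructions need to stay in order with respect to FPC only if both
// touch it and at least one of them defines it. Two plain uses can be
// reordered freely.
bool orderedByFPC(FPCAccess A, FPCAccess B) {
  if (A == FPCAccess::None || B == FPCAccess::None)
    return false;
  return A == FPCAccess::Def || B == FPCAccess::Def;
}

void exampleFPCOrdering() {
  // Ordinary FP arithmetic only reads FPC (for the rounding mode).
  assert(!orderedByFPC(FPCAccess::Use, FPCAccess::Use));
  // An instruction that changes the rounding mode defines FPC and therefore
  // pins FP arithmetic on either side of it.
  assert(orderedByFPC(FPCAccess::Def, FPCAccess::Use));
  assert(orderedByFPC(FPCAccess::Use, FPCAccess::Def));
  // Instructions that do not touch FPC are unaffected.
  assert(!orderedByFPC(FPCAccess::None, FPCAccess::Def));
}
```

As long as a function contains no definition of FPC, every pair of FP instructions falls into the use/use case, which is consistent with the observation that the generated code did not change.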

Second, there are some additional optimizations we could make in the cases where we know the rounding mode. Constant folding is an example. I'm not sure there are any backend optimizations that do constant folding after instruction selection, but there was a review recently where this exact issue came up for constant folding during ISel and the conclusion was that we could handle it if we knew the rounding mode but if not we'd have to just block the folding.

I do not believe it is ever possible to have constant folding at the MI level. Common MI passes don't even understand whether an instruction is an add or a subtract (they only understand the general "form" of an instruction, e.g. what are input vs. output operands etc.); given that, it is hard to see where knowledge of a rounding mode could ever be useful at this level. Those optimizations have to be done earlier.

Third, there are some architectures where the rounding mode can be included as an operand. This is a potentially confusing point so I'm going to be verbose. I've said before that the rounding mode in constrained instructions is only a descriptive hint to the optimizer, declaring what the rounding mode is at that point rather than a prescriptive operator that sets the rounding mode. However, in the case where we do know the rounding mode (i.e. the rounding mode operand is something other than "round.dynamic") we can use that information to select an instruction form that has a rounding mode operand. How important or unimportant this capability is will depend on the architecture, but in some cases I think it could be significant.

If platforms want to do that, my suggestion would be to make the rounding mode available during instruction selection (at the ISel DAG level) so that the platform can then select *different* MI instruction opcodes. I think this also wouldn't conflict with anything in this current patch and can be added later. There would then be no need to have the generic MI layer somehow encode rounding modes.

With regard to the patch as a whole, let me say that my biggest concern at this point is to make sure we aren't going toward a dead end. If we think that all of the issues I've raised can likely be addressed in some variation of the patch you've proposed here then I would be in favor of committing this patch now without solving all of those problems. I just don't want to put something in place that will need to be ripped out later in order to make progress. Of course I understand that there's a balance to be found here. I know some of what we have already will need to be replaced, and that's just the way things go. That is to say, I just want to make sure we're headed in more or less the right direction.

Thanks for your patience and persistence.

Thanks for the continued review!

OK. I'm convinced. This should let us get a correct solution in place, and as you say we can add on something to handle rounding modes later if needed.

You should probably address Kit's minor comments, but otherwise I'd be very happy to see this committed.

This revision is now accepted and ready to land. Jun 4 2019, 2:12 PM

Addressed review comments and updated to mainline changes.

Thanks for the review, Andrew!

I've updated the patch to address Kit's comments, and handle recent mainline changes, in particular the new STRICT_FP_ROUND and STRICT_FP_EXTEND nodes. I'll be committing this version shortly.

Closed by commit rL362663: Allow target to handle STRICT floating-point nodes (authored by uweigand). Jun 5 2019, 3:33 PM

This revision was automatically updated to reflect the committed changes.

Herald added a project: Restricted Project. Jun 5 2019, 3:33 PM

Is there any further step to enable the rounding-mode infrastructure to handle rounding mode metadata in intrinsics? For example, convert the rounding mode metadata to a new constant operand of the strict_* series of ops. If some target/platform does not have a related machine instruction for the different static rounding modes, it can then ignore this constant in td selection.

If this is actually needed by some target, I agree that there are further steps required to propagate that information. Instead of using a constant operand, I think it would probably be better to create a new subclass of SDNode (e.g. StrictFPSDNode or the like), and store this information in its associated bits, like e.g. access bits are stored for MemSDNodes.
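
A minimal sketch of what such a node subclass might look like. Everything here is hypothetical and self-contained: `ToyNode` and `ToyStrictFPNode` are stand-ins, not LLVM's SDNode or an existing StrictFPSDNode, and the bit layout is an arbitrary assumption. The enumerators mirror the rounding mode and exception behavior names used by the constrained intrinsics.

```cpp
#include <cassert>
#include <cstdint>

enum class RoundingMode : uint8_t {
  Dynamic, ToNearest, Downward, Upward, TowardZero
};
enum class ExceptionBehavior : uint8_t { Ignore, MayTrap, Strict };

// Toy base class with a small bit field for subclasses; not LLVM's SDNode.
class ToyNode {
protected:
  uint16_t SubclassBits = 0;
};

// Stores the constrained-FP information in the node's own bits, in the way
// memory nodes keep their access bits.
class ToyStrictFPNode : public ToyNode {
  static constexpr unsigned RoundingMask = 0x7; // low 3 bits
  static constexpr unsigned ExceptShift = 3;
  static constexpr unsigned ExceptMask = 0x3;   // next 2 bits

public:
  ToyStrictFPNode(RoundingMode RM, ExceptionBehavior EB) {
    SubclassBits = static_cast<uint16_t>(
        static_cast<unsigned>(RM) |
        (static_cast<unsigned>(EB) << ExceptShift));
  }

  RoundingMode getRoundingMode() const {
    return static_cast<RoundingMode>(SubclassBits & RoundingMask);
  }
  ExceptionBehavior getExceptionBehavior() const {
    return static_cast<ExceptionBehavior>((SubclassBits >> ExceptShift) &
                                          ExceptMask);
  }
  // A target could consult this during selection to pick an opcode with a
  // static rounding mode operand, and ignore it otherwise.
  bool hasStaticRoundingMode() const {
    return getRoundingMode() != RoundingMode::Dynamic;
  }
};
```

A target without static-rounding instructions would simply never query these accessors, which matches the "ignore it during selection" behavior asked about above.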

I have another question about any_*.

For example, I see you use any_fadd to match both fadd and strict_fadd. I guess the SystemZ target is similar to POWER in that there is no FP addition machine instruction that statically specifies round-to-nearest and no exceptions.
But the semantics of the fadd node require this. So theoretically, the fadd node should be mapped to two instructions for correct compilation: one that sets the rounding mode to nearest and clears the exception handler bit (so no exception happens), and the floating-point addition itself.
Do we not handle fadd like this because such fine-grained instruction semantics mostly cause no functional errors, and to avoid the performance penalty? @uweigand

I believe you may be misunderstanding the semantics of the non-strict operations. These do *not* instruct codegen to actively generate code to switch rounding modes and/or clear exception bits. Rather, the use of a non-strict operation like fadd is an *assertion* that tells codegen that it may *assume* that the rounding mode is set to default, trapping exceptions are switched off, and exceptions flags are don't care (i.e. subsequent code will not check them). If you ever use fadd in a context where this assumption is not true, the behaviour is undefined. This means it is perfectly correct to implement fadd using the "normal" floating-point instruction.

Note that similarly, if a strict operation with a given rounding mode flag is used, this likewise does *not* instruct codegen to actively generate code to implement this particular rounding mode. Rather, this is again simply an *assertion* that tells codegen that it may *assume* the current rounding mode is set to this specific value. Again, this means that it is perfectly correct to implement such an operation using the "normal" floating-point instruction that will use the current rounding mode.

Thank you. I think I got what you said. An instruction that uses the dynamic rounding mode can implement a static rounding mode as long as the environment rounding mode has been correctly set by other user/system code beforehand. So to avoid the UB, the initial setup of the environment (clearing exceptions and setting the rounding mode to nearest) can be done at the beginning of the program, maybe in the runtime library.

Yes, that's what the startup code already does on Linux.

qiucf mentioned this in D63916: [PowerPC] Add exception constraint to FP arithmetic. Jan 6 2020, 1:26 AM

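Revision Contents

Path (Size)

llvm/trunk/
  include/llvm/
    CodeGen/
      MachineInstr.h (15 lines)
      SelectionDAGNodes.h (17 lines)
    MC/
      MCInstrDesc.h (6 lines)
    Target/
      Target.td (1 line)
      TargetSelectionDAG.td (115 lines)
  lib/
    CodeGen/
      GlobalISel/
        InstructionSelector.cpp (4 lines)
      ImplicitNullChecks.cpp (3 lines)
      MIRParser/
        MILexer.h (1 line)
        MILexer.cpp (1 line)
        MIParser.cpp (5 lines)
      MIRPrinter.cpp (2 lines)
      MachineCSE.cpp (2 lines)
      MachineInstr.cpp (4 lines)
      MachinePipeliner.cpp (4 lines)
      PeepholeOptimizer.cpp (2 lines)
      ScheduleDAGInstrs.cpp (13 lines)
      SelectionDAG/
        InstrEmitter.cpp (3 lines)
        SelectionDAGBuilder.cpp (7 lines)
        SelectionDAGISel.cpp (18 lines)
      TargetInstrInfo.cpp (3 lines)
      TargetLoweringBase.cpp (28 lines)
    Target/SystemZ/
      SystemZISelLowering.cpp (51 lines)
      SystemZInstrFP.td (187 lines)
      SystemZInstrVector.td (181 lines)
      SystemZOperators.td (20 lines)
  test/CodeGen/SystemZ/
    fp-strict-add-01.ll (173 lines)
    fp-strict-add-02.ll (172 lines)
    fp-strict-add-03.ll (25 lines)
    fp-strict-add-04.ll (22 lines)
    fp-strict-alias.ll (140 lines)
    fp-strict-conv-01.ll (95 lines)
    fp-strict-conv-02.ll (33 lines)
    fp-strict-conv-03.ll (35 lines)
    fp-strict-conv-04.ll (35 lines)
    fp-strict-conv-15.ll (64 lines)
    fp-strict-div-01.ll (173 lines)
    fp-strict-div-02.ll (173 lines)
    fp-strict-div-03.ll (25 lines)
    fp-strict-div-04.ll (22 lines)
    fp-strict-mul-01.ll (173 lines)
    fp-strict-mul-02.ll (283 lines)
    fp-strict-mul-03.ll (173 lines)
    fp-strict-mul-04.ll (314 lines)
    fp-strict-mul-05.ll (25 lines)
    fp-strict-mul-06.ll (137 lines)
    fp-strict-mul-07.ll (130 lines)
    fp-strict-mul-08.ll (145 lines)
    fp-strict-mul-09.ll (138 lines)
    fp-strict-mul-10.ll (55 lines)
    fp-strict-mul-11.ll (40 lines)
    fp-strict-round-01.ll (250 lines)
    fp-strict-round-02.ll (254 lines)
    fp-strict-round-03.ll (262 lines)
    fp-strict-sqrt-01.ll (94 lines)
    fp-strict-sqrt-02.ll (94 lines)
    fp-strict-sqrt-03.ll (23 lines)
    fp-strict-sqrt-04.ll (20 lines)
    fp-strict-sub-01.ll (173 lines)
    fp-strict-sub-02.ll (173 lines)
    fp-strict-sub-03.ll (25 lines)
    fp-strict-sub-04.ll (22 lines)
    vec-strict-add-01.ll (33 lines)
    vec-strict-add-02.ll (33 lines)
    vec-strict-div-01.ll (33 lines)
    vec-strict-div-02.ll (33 lines)
    vec-strict-max-01.ll (80 lines)
    vec-strict-min-01.ll (80 lines)
    vec-strict-mul-01.ll (33 lines)
    vec-strict-mul-02.ll (36 lines)
    vec-strict-mul-03.ll (33 lines)
    vec-strict-mul-04.ll (37 lines)
    vec-strict-mul-05.ll (75 lines)
    vec-strict-round-01.ll (155 lines)
    vec-strict-round-02.ll (154 lines)
    vec-strict-sqrt-01.ll (29 lines)
    vec-strict-sqrt-02.ll (29 lines)
    vec-strict-sub-01.ll (34 lines)
    vec-strict-sub-02.ll (33 lines)
    vector-constrained-fp-intrinsics.ll (334 lines)
  utils/TableGen/
    CodeGenInstruction.h (1 line)
    CodeGenInstruction.cpp (1 line)
    InstrInfoEmitter.cpp (1 line)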

Diff 203262

llvm/trunk/include/llvm/CodeGen/MachineInstr.h

@@ enum MIFlag {
     FmAfn = 1 << 9, // Instruction may map to Fast math
                     // instrinsic approximation.
     FmReassoc = 1 << 10, // Instruction supports Fast math
                          // reassociation of operand order.
     NoUWrap = 1 << 11, // Instruction supports binary operator
                        // no unsigned wrap.
     NoSWrap = 1 << 12, // Instruction supports binary operator
                        // no signed wrap.
-    IsExact = 1 << 13 // Instruction supports division is
+    IsExact = 1 << 13, // Instruction supports division is
                        // known to be exact.
+    FPExcept = 1 << 14, // Instruction may raise floating-point
+                        // exceptions.
   };

 private:
   const MCInstrDesc *MCID; // Instruction descriptor.
   MachineBasicBlock *Parent = nullptr; // Pointer to the owning basic block.

   // Operands are allocated by an ArrayRecycler.
   MachineOperand *Operands = nullptr; // Pointer to the first operand.
@@ bool mayStore(QueryType Type = AnyInBundle) const {
     return hasProperty(MCID::MayStore, Type);
   }

   /// Return true if this instruction could possibly read or modify memory.
   bool mayLoadOrStore(QueryType Type = AnyInBundle) const {
     return mayLoad(Type) || mayStore(Type);
   }

+  /// Return true if this instruction could possibly raise a floating-point
+  /// exception. This is the case if the instruction is a floating-point
+  /// instruction that can in principle raise an exception, as indicated
+  /// by the MCID::MayRaiseFPException property, and at the same time,
+  /// the instruction is used in a context where we expect floating-point
+  /// exceptions might be enabled, as indicated by the FPExcept MI flag.
+  bool mayRaiseFPException() const {
+    return hasProperty(MCID::MayRaiseFPException) &&
+           getFlag(MachineInstr::MIFlag::FPExcept);
+  }
+
   //===--------------------------------------------------------------------===//
   // Flags that indicate whether an instruction can be modified by a method.
   //===--------------------------------------------------------------------===//

   /// Return true if this may be a 2- or 3-address
   /// instruction (of the form "X = op Y, Z, ..."), which produces the same
   /// result if Y and Z are exchanged. If this flag is set, then the
   /// TargetInstrInfo::commuteInstruction method may be used to hack on the
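
The two-level check in mayRaiseFPException() above can be illustrated with a small standalone model. This is a sketch, not the LLVM implementation: `ToyDesc` and `ToyMI` are hypothetical stand-ins for the instruction descriptor and the machine instruction.

```cpp
#include <cassert>

// Static property of an opcode: can this kind of instruction raise an FP
// exception at all? (Stand-in for the MCID::MayRaiseFPException property.)
struct ToyDesc {
  bool MayRaiseFPException;
};

// One concrete instruction: a descriptor plus a per-instance flag saying
// whether it is used where FP exceptions might be enabled. (Stand-in for
// the FPExcept MI flag.)
struct ToyMI {
  const ToyDesc *Desc;
  bool FPExceptFlag;

  // Both conditions must hold, mirroring the && in the method above.
  bool mayRaiseFPException() const {
    return Desc->MayRaiseFPException && FPExceptFlag;
  }
};

void exampleTwoLevelQuery() {
  const ToyDesc FPAdd{true};
  const ToyDesc IntAdd{false};

  // Same FP opcode, selected from a strict and from a non-strict node.
  assert((ToyMI{&FPAdd, true}).mayRaiseFPException());
  assert(!(ToyMI{&FPAdd, false}).mayRaiseFPException());
  // A stray flag on an opcode that cannot raise exceptions has no effect.
  assert(!(ToyMI{&IntAdd, true}).mayRaiseFPException());
}
```

The split lets a target describe its FP opcodes once while non-strict code keeps the same optimization freedom it had before.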

llvm/trunk/include/llvm/CodeGen/SelectionDAGNodes.h

@@ private:
   bool NoInfs : 1;
   bool NoSignedZeros : 1;
   bool AllowReciprocal : 1;
   bool VectorReduction : 1;
   bool AllowContract : 1;
   bool ApproximateFuncs : 1;
   bool AllowReassociation : 1;

+  // We assume instructions do not raise floating-point exceptions by default,
+  // and only those marked explicitly may do so. We could choose to represent
+  // this via a positive "FPExcept" flags like on the MI level, but having a
+  // negative "NoFPExcept" flag here (that defaults to true) makes the flag
+  // intersection logic more straightforward.
+  bool NoFPExcept : 1;
+
 public:
   /// Default constructor turns off all optimization flags.
   SDNodeFlags()
       : AnyDefined(false), NoUnsignedWrap(false), NoSignedWrap(false),
         Exact(false), NoNaNs(false), NoInfs(false),
         NoSignedZeros(false), AllowReciprocal(false), VectorReduction(false),
         AllowContract(false), ApproximateFuncs(false),
-        AllowReassociation(false) {}
+        AllowReassociation(false), NoFPExcept(true) {}

   /// Propagate the fast-math-flags from an IR FPMathOperator.
   void copyFMF(const FPMathOperator &FPMO) {
     setNoNaNs(FPMO.hasNoNaNs());
     setNoInfs(FPMO.hasNoInfs());
     setNoSignedZeros(FPMO.hasNoSignedZeros());
     setAllowReciprocal(FPMO.hasAllowReciprocal());
     setAllowContract(FPMO.hasAllowContract());
@@ public:
   void setApproximateFuncs(bool b) {
     setDefined();
     ApproximateFuncs = b;
   }
   void setAllowReassociation(bool b) {
     setDefined();
     AllowReassociation = b;
   }
+  void setFPExcept(bool b) {
+    setDefined();
+    NoFPExcept = !b;
+  }

   // These are accessors for each flag.
   bool hasNoUnsignedWrap() const { return NoUnsignedWrap; }
   bool hasNoSignedWrap() const { return NoSignedWrap; }
   bool hasExact() const { return Exact; }
   bool hasNoNaNs() const { return NoNaNs; }
   bool hasNoInfs() const { return NoInfs; }
   bool hasNoSignedZeros() const { return NoSignedZeros; }
   bool hasAllowReciprocal() const { return AllowReciprocal; }
   bool hasVectorReduction() const { return VectorReduction; }
   bool hasAllowContract() const { return AllowContract; }
   bool hasApproximateFuncs() const { return ApproximateFuncs; }
   bool hasAllowReassociation() const { return AllowReassociation; }
+  bool hasFPExcept() const { return !NoFPExcept; }

   bool isFast() const {
-    return NoSignedZeros && AllowReciprocal && NoNaNs && NoInfs &&
+    return NoSignedZeros && AllowReciprocal && NoNaNs && NoInfs && NoFPExcept &&
            AllowContract && ApproximateFuncs && AllowReassociation;
   }

   /// Clear any flags in this flag set that aren't also set in Flags.
   /// If the given Flags are undefined then don't do anything.
   void intersectWith(const SDNodeFlags Flags) {
     if (!Flags.isDefined())
       return;
     NoUnsignedWrap &= Flags.NoUnsignedWrap;
     NoSignedWrap &= Flags.NoSignedWrap;
     Exact &= Flags.Exact;
     NoNaNs &= Flags.NoNaNs;
     NoInfs &= Flags.NoInfs;
     NoSignedZeros &= Flags.NoSignedZeros;
     AllowReciprocal &= Flags.AllowReciprocal;
     VectorReduction &= Flags.VectorReduction;
     AllowContract &= Flags.AllowContract;
     ApproximateFuncs &= Flags.ApproximateFuncs;
     AllowReassociation &= Flags.AllowReassociation;
+    NoFPExcept &= Flags.NoFPExcept;
   }
 };

 /// Represents one node in the SelectionDAG.
 ///
 class SDNode : public FoldingSetNode, public ilist_node<SDNode> {
 private:
   /// The operation that this node performs.
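
The comment in the hunk above says the negative NoFPExcept form keeps the intersection logic straightforward. A reduced, standalone model shows why: every flag, including this one, can be combined with the same `&=`. `ToyFlags` is a hypothetical cut-down stand-in for SDNodeFlags with only two flags and without the AnyDefined bookkeeping.

```cpp
#include <cassert>

// Reduced stand-in for SDNodeFlags; not the LLVM class.
struct ToyFlags {
  bool NoNaNs = false;
  bool NoFPExcept = true; // negative form, defaults to "cannot raise"

  void setFPExcept(bool B) { NoFPExcept = !B; }
  bool hasFPExcept() const { return !NoFPExcept; }

  // A flag survives only if both sides have it. Because NoFPExcept is a
  // "permission" like the others, the same &= gives the conservative answer:
  // the result may raise an exception if either input may.
  void intersectWith(const ToyFlags &Other) {
    NoNaNs &= Other.NoNaNs;
    NoFPExcept &= Other.NoFPExcept;
  }
};

void exampleIntersect() {
  ToyFlags Plain;          // ordinary node: no FP exceptions
  ToyFlags Strict;
  Strict.setFPExcept(true); // strict node: may raise

  Plain.intersectWith(Strict);
  assert(Plain.hasFPExcept()); // merged flags stay conservative
}
```

With a positive FPExcept flag, the intersection would have needed an `|=` special case for this one member.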

llvm/trunk/include/llvm/MC/MCInstrDesc.h

Show First 20 Lines • Show All 128 Lines • ▼ Show 20 Lines	enum Flag {
MoveImm,		MoveImm,
MoveReg,		MoveReg,
Bitcast,		Bitcast,
Select,		Select,
DelaySlot,		DelaySlot,
FoldableAsLoad,		FoldableAsLoad,
MayLoad,		MayLoad,
MayStore,		MayStore,
		MayRaiseFPException,
Predicable,		Predicable,
NotDuplicable,		NotDuplicable,
UnmodeledSideEffects,		UnmodeledSideEffects,
Commutable,		Commutable,
ConvertibleTo3Addr,		ConvertibleTo3Addr,
UsesCustomInserter,		UsesCustomInserter,
HasPostISelHook,		HasPostISelHook,
Rematerializable,		Rematerializable,
▲ Show 20 Lines • Show All 253 Lines • ▼ Show 20 Lines	public:
bool mayLoad() const { return Flags & (1ULL << MCID::MayLoad); }		bool mayLoad() const { return Flags & (1ULL << MCID::MayLoad); }

/// Return true if this instruction could possibly modify memory.		/// Return true if this instruction could possibly modify memory.
/// Instructions with this flag set are not necessarily simple store		/// Instructions with this flag set are not necessarily simple store
/// instructions, they may store a modified value based on their operands, or		/// instructions, they may store a modified value based on their operands, or
/// may not actually modify anything, for example.		/// may not actually modify anything, for example.
bool mayStore() const { return Flags & (1ULL << MCID::MayStore); }		bool mayStore() const { return Flags & (1ULL << MCID::MayStore); }

		/// Return true if this instruction may raise a floating-point exception.
		bool mayRaiseFPException() const {
		return Flags & (1ULL << MCID::MayRaiseFPException);
		}

/// Return true if this instruction has side		/// Return true if this instruction has side
/// effects that are not modeled by other flags. This does not return true		/// effects that are not modeled by other flags. This does not return true
/// for instructions whose effects are captured by:		/// for instructions whose effects are captured by:
///		///
/// 1. Their operand list and implicit definition/use list. Register use/def		/// 1. Their operand list and implicit definition/use list. Register use/def
/// info is explicit for instructions.		/// info is explicit for instructions.
/// 2. Memory accesses. Use mayLoad/mayStore.		/// 2. Memory accesses. Use mayLoad/mayStore.
/// 3. Calling, branching, returning: use isCall/isReturn/isBranch.		/// 3. Calling, branching, returning: use isCall/isReturn/isBranch.
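
Reviewer-style note: the new accessor follows the same bit-test pattern as `mayLoad()` and `mayStore()`. A minimal standalone sketch of that pattern, assuming a toy flag enumeration (the names in `sketch::` are hypothetical stand-ins, and the bit positions are not LLVM's):

```cpp
#include <cassert>
#include <cstdint>
#include <initializer_list>

namespace sketch {

// Toy flag indices; the real MCID enumeration has many more entries and
// different numeric values.
enum Flag : unsigned {
  MayLoad,
  MayStore,
  MayRaiseFPException,
  UnmodeledSideEffects,
  NumFlags
};

struct InstrDesc {
  std::uint64_t Flags = 0;

  bool mayLoad() const { return Flags & (1ULL << MayLoad); }
  bool mayStore() const { return Flags & (1ULL << MayStore); }
  bool mayRaiseFPException() const {
    return Flags & (1ULL << MayRaiseFPException);
  }
  bool hasUnmodeledSideEffects() const {
    return Flags & (1ULL << UnmodeledSideEffects);
  }
};

// Hypothetical helper: build a descriptor with the given flags set.
inline InstrDesc makeDesc(std::initializer_list<Flag> Set) {
  InstrDesc D;
  for (Flag F : Set) {
    assert(F < NumFlags && "flag index out of range");
    D.Flags |= 1ULL << F;
  }
  return D;
}

// Mirrors the shape of the "obviously unsafe to move" checks in this patch:
// an instruction that may raise an FP exception is treated like one with
// unmodeled side effects.
inline bool looksSafeToMove(const InstrDesc &D) {
  return !D.mayStore() && !D.mayRaiseFPException() &&
         !D.hasUnmodeledSideEffects();
}

} // namespace sketch
```

With this model, a descriptor built with only `MayRaiseFPException` set is rejected by `looksSafeToMove`, which is the behaviour the CodeGen changes in this patch add to their respective checks.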

llvm/trunk/include/llvm/Target/Target.td

Show First 20 Lines • Show All 450 Lines • ▼ Show 20 Lines	class Instruction {
bit isSelect = 0; // Is this instruction a select instruction?		bit isSelect = 0; // Is this instruction a select instruction?
bit isBarrier = 0; // Can control flow fall through this instruction?		bit isBarrier = 0; // Can control flow fall through this instruction?
bit isCall = 0; // Is this instruction a call instruction?		bit isCall = 0; // Is this instruction a call instruction?
bit isAdd = 0; // Is this instruction an add instruction?		bit isAdd = 0; // Is this instruction an add instruction?
bit isTrap = 0; // Is this instruction a trap instruction?		bit isTrap = 0; // Is this instruction a trap instruction?
bit canFoldAsLoad = 0; // Can this be folded as a simple memory operand?		bit canFoldAsLoad = 0; // Can this be folded as a simple memory operand?
bit mayLoad = ?; // Is it possible for this inst to read memory?		bit mayLoad = ?; // Is it possible for this inst to read memory?
bit mayStore = ?; // Is it possible for this inst to write memory?		bit mayStore = ?; // Is it possible for this inst to write memory?
		bit mayRaiseFPException = 0; // Can this raise a floating-point exception?
bit isConvertibleToThreeAddress = 0; // Can this 2-addr instruction promote?		bit isConvertibleToThreeAddress = 0; // Can this 2-addr instruction promote?
bit isCommutable = 0; // Is this 3 operand instruction commutable?		bit isCommutable = 0; // Is this 3 operand instruction commutable?
bit isTerminator = 0; // Is this part of the terminator for a basic block?		bit isTerminator = 0; // Is this part of the terminator for a basic block?
bit isReMaterializable = 0; // Is this instruction re-materializable?		bit isReMaterializable = 0; // Is this instruction re-materializable?
bit isPredicable = 0; // 1 means this instruction is predicable		bit isPredicable = 0; // 1 means this instruction is predicable
// even if it does not have any operand		// even if it does not have any operand
// tablegen can identify as a predicate		// tablegen can identify as a predicate
bit isUnpredicable = 0; // 1 means this instruction is not predicable		bit isUnpredicable = 0; // 1 means this instruction is not predicable

llvm/trunk/include/llvm/Target/TargetSelectionDAG.td


def sint_to_fp : SDNode<"ISD::SINT_TO_FP" , SDTIntToFPOp>;		def sint_to_fp : SDNode<"ISD::SINT_TO_FP" , SDTIntToFPOp>;
def uint_to_fp : SDNode<"ISD::UINT_TO_FP" , SDTIntToFPOp>;		def uint_to_fp : SDNode<"ISD::UINT_TO_FP" , SDTIntToFPOp>;
def fp_to_sint : SDNode<"ISD::FP_TO_SINT" , SDTFPToIntOp>;		def fp_to_sint : SDNode<"ISD::FP_TO_SINT" , SDTFPToIntOp>;
def fp_to_uint : SDNode<"ISD::FP_TO_UINT" , SDTFPToIntOp>;		def fp_to_uint : SDNode<"ISD::FP_TO_UINT" , SDTFPToIntOp>;
def f16_to_fp : SDNode<"ISD::FP16_TO_FP" , SDTIntToFPOp>;		def f16_to_fp : SDNode<"ISD::FP16_TO_FP" , SDTIntToFPOp>;
def fp_to_f16 : SDNode<"ISD::FP_TO_FP16" , SDTFPToIntOp>;		def fp_to_f16 : SDNode<"ISD::FP_TO_FP16" , SDTFPToIntOp>;

		def strict_fadd : SDNode<"ISD::STRICT_FADD",
		SDTFPBinOp, [SDNPHasChain, SDNPCommutative]>;
		def strict_fsub : SDNode<"ISD::STRICT_FSUB",
		SDTFPBinOp, [SDNPHasChain]>;
		def strict_fmul : SDNode<"ISD::STRICT_FMUL",
		SDTFPBinOp, [SDNPHasChain, SDNPCommutative]>;
		def strict_fdiv : SDNode<"ISD::STRICT_FDIV",
		SDTFPBinOp, [SDNPHasChain]>;
		def strict_frem : SDNode<"ISD::STRICT_FREM",
		SDTFPBinOp, [SDNPHasChain]>;
		def strict_fma : SDNode<"ISD::STRICT_FMA",
		SDTFPTernaryOp, [SDNPHasChain]>;
		def strict_fsqrt : SDNode<"ISD::STRICT_FSQRT",
		SDTFPUnaryOp, [SDNPHasChain]>;
		def strict_fsin : SDNode<"ISD::STRICT_FSIN",
		SDTFPUnaryOp, [SDNPHasChain]>;
		def strict_fcos : SDNode<"ISD::STRICT_FCOS",
		SDTFPUnaryOp, [SDNPHasChain]>;
		def strict_fexp2 : SDNode<"ISD::STRICT_FEXP2",
		SDTFPUnaryOp, [SDNPHasChain]>;
		def strict_fpow : SDNode<"ISD::STRICT_FPOW",
		SDTFPBinOp, [SDNPHasChain]>;
		def strict_flog2 : SDNode<"ISD::STRICT_FLOG2",
		SDTFPUnaryOp, [SDNPHasChain]>;
		def strict_frint : SDNode<"ISD::STRICT_FRINT",
		SDTFPUnaryOp, [SDNPHasChain]>;
		def strict_fnearbyint : SDNode<"ISD::STRICT_FNEARBYINT",
		SDTFPUnaryOp, [SDNPHasChain]>;
		def strict_fceil : SDNode<"ISD::STRICT_FCEIL",
		SDTFPUnaryOp, [SDNPHasChain]>;
		def strict_ffloor : SDNode<"ISD::STRICT_FFLOOR",
		SDTFPUnaryOp, [SDNPHasChain]>;
		def strict_fround : SDNode<"ISD::STRICT_FROUND",
		SDTFPUnaryOp, [SDNPHasChain]>;
		def strict_ftrunc : SDNode<"ISD::STRICT_FTRUNC",
		SDTFPUnaryOp, [SDNPHasChain]>;
		def strict_fminnum : SDNode<"ISD::STRICT_FMINNUM",
		SDTFPBinOp, [SDNPHasChain,
		SDNPCommutative, SDNPAssociative]>;
		def strict_fmaxnum : SDNode<"ISD::STRICT_FMAXNUM",
		SDTFPBinOp, [SDNPHasChain,
		SDNPCommutative, SDNPAssociative]>;
		def strict_fpround : SDNode<"ISD::STRICT_FP_ROUND",
		SDTFPRoundOp, [SDNPHasChain]>;
		def strict_fpextend : SDNode<"ISD::STRICT_FP_EXTEND",
		SDTFPExtendOp, [SDNPHasChain]>;

def setcc : SDNode<"ISD::SETCC" , SDTSetCC>;		def setcc : SDNode<"ISD::SETCC" , SDTSetCC>;
def select : SDNode<"ISD::SELECT" , SDTSelect>;		def select : SDNode<"ISD::SELECT" , SDTSelect>;
def vselect : SDNode<"ISD::VSELECT" , SDTVSelect>;		def vselect : SDNode<"ISD::VSELECT" , SDTVSelect>;
def selectcc : SDNode<"ISD::SELECT_CC" , SDTSelectCC>;		def selectcc : SDNode<"ISD::SELECT_CC" , SDTSelectCC>;

def brcc : SDNode<"ISD::BR_CC" , SDTBrCC, [SDNPHasChain]>;		def brcc : SDNode<"ISD::BR_CC" , SDTBrCC, [SDNPHasChain]>;
def brcond : SDNode<"ISD::BRCOND" , SDTBrcond, [SDNPHasChain]>;		def brcond : SDNode<"ISD::BRCOND" , SDTBrcond, [SDNPHasChain]>;
def brind : SDNode<"ISD::BRIND" , SDTBrind, [SDNPHasChain]>;		def brind : SDNode<"ISD::BRIND" , SDTBrind, [SDNPHasChain]>;
▲ Show 20 Lines • Show All 694 Lines • ▼ Show 20 Lines	def setge : PatFrag<(ops node:$lhs, node:$rhs),
(setcc node:$lhs, node:$rhs, SETGE)>;		(setcc node:$lhs, node:$rhs, SETGE)>;
def setlt : PatFrag<(ops node:$lhs, node:$rhs),		def setlt : PatFrag<(ops node:$lhs, node:$rhs),
(setcc node:$lhs, node:$rhs, SETLT)>;		(setcc node:$lhs, node:$rhs, SETLT)>;
def setle : PatFrag<(ops node:$lhs, node:$rhs),		def setle : PatFrag<(ops node:$lhs, node:$rhs),
(setcc node:$lhs, node:$rhs, SETLE)>;		(setcc node:$lhs, node:$rhs, SETLE)>;
def setne : PatFrag<(ops node:$lhs, node:$rhs),		def setne : PatFrag<(ops node:$lhs, node:$rhs),
(setcc node:$lhs, node:$rhs, SETNE)>;		(setcc node:$lhs, node:$rhs, SETNE)>;

		// Convenience fragments to match both strict and non-strict fp operations
		def any_fadd : PatFrags<(ops node:$lhs, node:$rhs),
		[(strict_fadd node:$lhs, node:$rhs),
		(fadd node:$lhs, node:$rhs)]>;
		def any_fsub : PatFrags<(ops node:$lhs, node:$rhs),
		[(strict_fsub node:$lhs, node:$rhs),
		(fsub node:$lhs, node:$rhs)]>;
		def any_fmul : PatFrags<(ops node:$lhs, node:$rhs),
		[(strict_fmul node:$lhs, node:$rhs),
		(fmul node:$lhs, node:$rhs)]>;
		def any_fdiv : PatFrags<(ops node:$lhs, node:$rhs),
		[(strict_fdiv node:$lhs, node:$rhs),
		(fdiv node:$lhs, node:$rhs)]>;
		def any_frem : PatFrags<(ops node:$lhs, node:$rhs),
		[(strict_frem node:$lhs, node:$rhs),
		(frem node:$lhs, node:$rhs)]>;
		def any_fma : PatFrags<(ops node:$src1, node:$src2, node:$src3),
		[(strict_fma node:$src1, node:$src2, node:$src3),
		(fma node:$src1, node:$src2, node:$src3)]>;
		def any_fsqrt : PatFrags<(ops node:$src),
		[(strict_fsqrt node:$src),
		(fsqrt node:$src)]>;
		def any_fsin : PatFrags<(ops node:$src),
		[(strict_fsin node:$src),
		(fsin node:$src)]>;
		def any_fcos : PatFrags<(ops node:$src),
		[(strict_fcos node:$src),
		(fcos node:$src)]>;
		def any_fexp2 : PatFrags<(ops node:$src),
		[(strict_fexp2 node:$src),
		(fexp2 node:$src)]>;
		def any_fpow : PatFrags<(ops node:$lhs, node:$rhs),
		[(strict_fpow node:$lhs, node:$rhs),
		(fpow node:$lhs, node:$rhs)]>;
		def any_flog2 : PatFrags<(ops node:$src),
		[(strict_flog2 node:$src),
		(flog2 node:$src)]>;
		def any_frint : PatFrags<(ops node:$src),
		[(strict_frint node:$src),
		(frint node:$src)]>;
		def any_fnearbyint : PatFrags<(ops node:$src),
		[(strict_fnearbyint node:$src),
		(fnearbyint node:$src)]>;
		def any_fceil : PatFrags<(ops node:$src),
		[(strict_fceil node:$src),
		(fceil node:$src)]>;
		def any_ffloor : PatFrags<(ops node:$src),
		[(strict_ffloor node:$src),
		(ffloor node:$src)]>;
		def any_fround : PatFrags<(ops node:$src),
		[(strict_fround node:$src),
		(fround node:$src)]>;
		def any_ftrunc : PatFrags<(ops node:$src),
		[(strict_ftrunc node:$src),
		(ftrunc node:$src)]>;
		def any_fmaxnum : PatFrags<(ops node:$lhs, node:$rhs),
		[(strict_fmaxnum node:$lhs, node:$rhs),
		(fmaxnum node:$lhs, node:$rhs)]>;
		def any_fminnum : PatFrags<(ops node:$lhs, node:$rhs),
		[(strict_fminnum node:$lhs, node:$rhs),
		(fminnum node:$lhs, node:$rhs)]>;
		def any_fpround : PatFrags<(ops node:$src),
		[(strict_fpround node:$src),
		(fpround node:$src)]>;
		def any_fpextend : PatFrags<(ops node:$src),
		[(strict_fpextend node:$src),
		(fpextend node:$src)]>;

multiclass binary_atomic_op_ord<SDNode atomic_op> {		multiclass binary_atomic_op_ord<SDNode atomic_op> {
def #NAME#_monotonic : PatFrag<(ops node:$ptr, node:$val),		def #NAME#_monotonic : PatFrag<(ops node:$ptr, node:$val),
(!cast<SDPatternOperator>(#NAME) node:$ptr, node:$val)> {		(!cast<SDPatternOperator>(#NAME) node:$ptr, node:$val)> {
let IsAtomic = 1;		let IsAtomic = 1;
let IsAtomicOrderingMonotonic = 1;		let IsAtomicOrderingMonotonic = 1;
}		}
def #NAME#_acquire : PatFrag<(ops node:$ptr, node:$val),		def #NAME#_acquire : PatFrag<(ops node:$ptr, node:$val),
(!cast<SDPatternOperator>(#NAME) node:$ptr, node:$val)> {		(!cast<SDPatternOperator>(#NAME) node:$ptr, node:$val)> {

llvm/trunk/lib/CodeGen/GlobalISel/InstructionSelector.cpp


	bool InstructionSelector::isObviouslySafeToFold(MachineInstr &MI,			bool InstructionSelector::isObviouslySafeToFold(MachineInstr &MI,
	MachineInstr &IntoMI) const {			MachineInstr &IntoMI) const {
	// Immediate neighbours are already folded.			// Immediate neighbours are already folded.
	if (MI.getParent() == IntoMI.getParent() &&			if (MI.getParent() == IntoMI.getParent() &&
	std::next(MI.getIterator()) == IntoMI.getIterator())			std::next(MI.getIterator()) == IntoMI.getIterator())
	return true;			return true;

	return !MI.mayLoadOrStore() && !MI.hasUnmodeledSideEffects() &&			return !MI.mayLoadOrStore() && !MI.mayRaiseFPException() &&
	empty(MI.implicit_operands());			!MI.hasUnmodeledSideEffects() && empty(MI.implicit_operands());
	}			}

llvm/trunk/lib/CodeGen/ImplicitNullChecks.cpp

Show First 20 Lines • Show All 223 Lines • ▼ Show 20 Lines	MachineFunctionProperties getRequiredProperties() const override {
return MachineFunctionProperties().set(		return MachineFunctionProperties().set(
MachineFunctionProperties::Property::NoVRegs);		MachineFunctionProperties::Property::NoVRegs);
}		}
};		};

} // end anonymous namespace		} // end anonymous namespace

bool ImplicitNullChecks::canHandle(const MachineInstr *MI) {		bool ImplicitNullChecks::canHandle(const MachineInstr *MI) {
if (MI->isCall() || MI->hasUnmodeledSideEffects())		if (MI->isCall() || MI->mayRaiseFPException() ||
		MI->hasUnmodeledSideEffects())
return false;		return false;
auto IsRegMask = [](const MachineOperand &MO) { return MO.isRegMask(); };		auto IsRegMask = [](const MachineOperand &MO) { return MO.isRegMask(); };
(void)IsRegMask;		(void)IsRegMask;

assert(!llvm::any_of(MI->operands(), IsRegMask) &&		assert(!llvm::any_of(MI->operands(), IsRegMask) &&
"Calls were filtered out above!");		"Calls were filtered out above!");

auto IsUnordered = [](MachineMemOperand *MMO) { return MMO->isUnordered(); };		auto IsUnordered = [](MachineMemOperand *MMO) { return MMO->isUnordered(); };

llvm/trunk/lib/CodeGen/MIRParser/MILexer.h

Show First 20 Lines • Show All 67 Lines • ▼ Show 20 Lines	enum TokenKind {
kw_nsz,		kw_nsz,
kw_arcp,		kw_arcp,
kw_contract,		kw_contract,
kw_afn,		kw_afn,
kw_reassoc,		kw_reassoc,
kw_nuw,		kw_nuw,
kw_nsw,		kw_nsw,
kw_exact,		kw_exact,
		kw_fpexcept,
kw_debug_location,		kw_debug_location,
kw_cfi_same_value,		kw_cfi_same_value,
kw_cfi_offset,		kw_cfi_offset,
kw_cfi_rel_offset,		kw_cfi_rel_offset,
kw_cfi_def_cfa_register,		kw_cfi_def_cfa_register,
kw_cfi_def_cfa_offset,		kw_cfi_def_cfa_offset,
kw_cfi_adjust_cfa_offset,		kw_cfi_adjust_cfa_offset,
kw_cfi_escape,		kw_cfi_escape,

llvm/trunk/lib/CodeGen/MIRParser/MILexer.cpp

Show First 20 Lines • Show All 198 Lines • ▼ Show 20 Lines	return StringSwitch<MIToken::TokenKind>(Identifier)
.Case("nsz", MIToken::kw_nsz)		.Case("nsz", MIToken::kw_nsz)
.Case("arcp", MIToken::kw_arcp)		.Case("arcp", MIToken::kw_arcp)
.Case("contract", MIToken::kw_contract)		.Case("contract", MIToken::kw_contract)
.Case("afn", MIToken::kw_afn)		.Case("afn", MIToken::kw_afn)
.Case("reassoc", MIToken::kw_reassoc)		.Case("reassoc", MIToken::kw_reassoc)
.Case("nuw" , MIToken::kw_nuw)		.Case("nuw" , MIToken::kw_nuw)
.Case("nsw" , MIToken::kw_nsw)		.Case("nsw" , MIToken::kw_nsw)
.Case("exact" , MIToken::kw_exact)		.Case("exact" , MIToken::kw_exact)
		.Case("fpexcept", MIToken::kw_fpexcept)
.Case("debug-location", MIToken::kw_debug_location)		.Case("debug-location", MIToken::kw_debug_location)
.Case("same_value", MIToken::kw_cfi_same_value)		.Case("same_value", MIToken::kw_cfi_same_value)
.Case("offset", MIToken::kw_cfi_offset)		.Case("offset", MIToken::kw_cfi_offset)
.Case("rel_offset", MIToken::kw_cfi_rel_offset)		.Case("rel_offset", MIToken::kw_cfi_rel_offset)
.Case("def_cfa_register", MIToken::kw_cfi_def_cfa_register)		.Case("def_cfa_register", MIToken::kw_cfi_def_cfa_register)
.Case("def_cfa_offset", MIToken::kw_cfi_def_cfa_offset)		.Case("def_cfa_offset", MIToken::kw_cfi_def_cfa_offset)
.Case("adjust_cfa_offset", MIToken::kw_cfi_adjust_cfa_offset)		.Case("adjust_cfa_offset", MIToken::kw_cfi_adjust_cfa_offset)
.Case("escape", MIToken::kw_cfi_escape)		.Case("escape", MIToken::kw_cfi_escape)

llvm/trunk/lib/CodeGen/MIRParser/MIParser.cpp

Show First 20 Lines • Show All 1,130 Lines • ▼ Show 20 Lines	while (Token.is(MIToken::kw_frame_setup) ||
Token.is(MIToken::kw_ninf) ||		Token.is(MIToken::kw_ninf) ||
Token.is(MIToken::kw_nsz) ||		Token.is(MIToken::kw_nsz) ||
Token.is(MIToken::kw_arcp) ||		Token.is(MIToken::kw_arcp) ||
Token.is(MIToken::kw_contract) ||		Token.is(MIToken::kw_contract) ||
Token.is(MIToken::kw_afn) ||		Token.is(MIToken::kw_afn) ||
Token.is(MIToken::kw_reassoc) ||		Token.is(MIToken::kw_reassoc) ||
Token.is(MIToken::kw_nuw) ||		Token.is(MIToken::kw_nuw) ||
Token.is(MIToken::kw_nsw) ||		Token.is(MIToken::kw_nsw) ||
Token.is(MIToken::kw_exact)) {		Token.is(MIToken::kw_exact) ||
		Token.is(MIToken::kw_fpexcept)) {
// Mine frame and fast math flags		// Mine frame and fast math flags
if (Token.is(MIToken::kw_frame_setup))		if (Token.is(MIToken::kw_frame_setup))
Flags |= MachineInstr::FrameSetup;		Flags |= MachineInstr::FrameSetup;
if (Token.is(MIToken::kw_frame_destroy))		if (Token.is(MIToken::kw_frame_destroy))
Flags |= MachineInstr::FrameDestroy;		Flags |= MachineInstr::FrameDestroy;
if (Token.is(MIToken::kw_nnan))		if (Token.is(MIToken::kw_nnan))
Flags |= MachineInstr::FmNoNans;		Flags |= MachineInstr::FmNoNans;
if (Token.is(MIToken::kw_ninf))		if (Token.is(MIToken::kw_ninf))
Show All 9 Lines	while (Token.is(MIToken::kw_frame_setup) ||
if (Token.is(MIToken::kw_reassoc))		if (Token.is(MIToken::kw_reassoc))
Flags |= MachineInstr::FmReassoc;		Flags |= MachineInstr::FmReassoc;
if (Token.is(MIToken::kw_nuw))		if (Token.is(MIToken::kw_nuw))
Flags |= MachineInstr::NoUWrap;		Flags |= MachineInstr::NoUWrap;
if (Token.is(MIToken::kw_nsw))		if (Token.is(MIToken::kw_nsw))
Flags |= MachineInstr::NoSWrap;		Flags |= MachineInstr::NoSWrap;
if (Token.is(MIToken::kw_exact))		if (Token.is(MIToken::kw_exact))
Flags |= MachineInstr::IsExact;		Flags |= MachineInstr::IsExact;
		if (Token.is(MIToken::kw_fpexcept))
		Flags |= MachineInstr::FPExcept;

lex();		lex();
}		}
if (Token.isNot(MIToken::Identifier))		if (Token.isNot(MIToken::Identifier))
return error("expected a machine instruction");		return error("expected a machine instruction");
StringRef InstrName = Token.stringValue();		StringRef InstrName = Token.stringValue();
if (PFS.Target.parseInstrName(InstrName, OpCode))		if (PFS.Target.parseInstrName(InstrName, OpCode))
return error(Twine("unknown machine instruction name '") + InstrName + "'");		return error(Twine("unknown machine instruction name '") + InstrName + "'");

llvm/trunk/lib/CodeGen/MIRPrinter.cpp

Show First 20 Lines • Show All 707 Lines • ▼ Show 20 Lines	void MIPrinter::print(const MachineInstr &MI) {
if (MI.getFlag(MachineInstr::FmReassoc))		if (MI.getFlag(MachineInstr::FmReassoc))
OS << "reassoc ";		OS << "reassoc ";
if (MI.getFlag(MachineInstr::NoUWrap))		if (MI.getFlag(MachineInstr::NoUWrap))
OS << "nuw ";		OS << "nuw ";
if (MI.getFlag(MachineInstr::NoSWrap))		if (MI.getFlag(MachineInstr::NoSWrap))
OS << "nsw ";		OS << "nsw ";
if (MI.getFlag(MachineInstr::IsExact))		if (MI.getFlag(MachineInstr::IsExact))
OS << "exact ";		OS << "exact ";
		if (MI.getFlag(MachineInstr::FPExcept))
		OS << "fpexcept ";

OS << TII->getName(MI.getOpcode());		OS << TII->getName(MI.getOpcode());
if (I < E)		if (I < E)
OS << ' ';		OS << ' ';

bool NeedComma = false;		bool NeedComma = false;
for (; I < E; ++I) {		for (; I < E; ++I) {
if (NeedComma)		if (NeedComma)
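
Reviewer-style note: the lexer, parser and printer changes together mean the `fpexcept` keyword should survive a MIR print/parse round trip. A minimal sketch of that round trip, assuming a toy two-flag model (`ToyMIFlag`, `printToyFlags` and `parseToyFlags` are hypothetical names, not LLVM APIs, and the bit values are made up):

```cpp
#include <cassert>
#include <sstream>
#include <string>

// Toy flag bits; not the values used by MachineInstr::MIFlag.
enum ToyMIFlag : unsigned { ToyIsExact = 1u << 0, ToyFPExcept = 1u << 1 };

// Print each set flag as a keyword followed by a space, in the same style
// as the printer hunk above.
inline std::string printToyFlags(unsigned Flags) {
  std::string Out;
  if (Flags & ToyIsExact)
    Out += "exact ";
  if (Flags & ToyFPExcept)
    Out += "fpexcept ";
  return Out;
}

// Consume leading flag keywords and return the accumulated flag bits.
// The first non-flag token (the instruction name) is stored in Rest.
inline unsigned parseToyFlags(const std::string &Text, std::string &Rest) {
  std::istringstream In(Text);
  std::string Tok;
  unsigned Flags = 0;
  Rest.clear();
  while (In >> Tok) {
    if (Tok == "exact") {
      Flags |= ToyIsExact;
    } else if (Tok == "fpexcept") {
      Flags |= ToyFPExcept;
    } else {
      Rest = Tok;
      break;
    }
  }
  return Flags;
}
```

Parsing the output of `printToyFlags` followed by an opcode name recovers the same flag bits, which is the property the MIR changes are meant to preserve for `FPExcept`.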

llvm/trunk/lib/CodeGen/MachineCSE.cpp

Show First 20 Lines • Show All 376 Lines • ▼ Show 20 Lines	if (MI->isPosition() || MI->isPHI() || MI->isImplicitDef() || MI->isKill() ||
return false;		return false;

// Ignore copies.		// Ignore copies.
if (MI->isCopyLike())		if (MI->isCopyLike())
return false;		return false;

// Ignore stuff that we obviously can't move.		// Ignore stuff that we obviously can't move.
if (MI->mayStore() || MI->isCall() || MI->isTerminator() ||		if (MI->mayStore() || MI->isCall() || MI->isTerminator() ||
MI->hasUnmodeledSideEffects())		MI->mayRaiseFPException() || MI->hasUnmodeledSideEffects())
return false;		return false;

if (MI->mayLoad()) {		if (MI->mayLoad()) {
// Okay, this instruction does a load. As a refinement, we allow the target		// Okay, this instruction does a load. As a refinement, we allow the target
// to decide whether the loaded value is actually a constant. If so, we can		// to decide whether the loaded value is actually a constant. If so, we can
// actually use it as a load.		// actually use it as a load.
if (!MI->isDereferenceableInvariantLoad(AA))		if (!MI->isDereferenceableInvariantLoad(AA))
// FIXME: we should be able to hoist loads with no other side effects if		// FIXME: we should be able to hoist loads with no other side effects if

llvm/trunk/lib/CodeGen/MachineInstr.cpp

Show First 20 Lines • Show All 1,172 Lines • ▼ Show 20 Lines	bool MachineInstr::isSafeToMove(AliasAnalysis *AA, bool &SawStore) const {
// a load across an atomic load with Ordering > Monotonic.		// a load across an atomic load with Ordering > Monotonic.
if (mayStore() || isCall() || isPHI() ||		if (mayStore() || isCall() || isPHI() ||
(mayLoad() && hasOrderedMemoryRef())) {		(mayLoad() && hasOrderedMemoryRef())) {
SawStore = true;		SawStore = true;
return false;		return false;
}		}

if (isPosition() || isDebugInstr() || isTerminator() ||		if (isPosition() || isDebugInstr() || isTerminator() ||
hasUnmodeledSideEffects())		mayRaiseFPException() || hasUnmodeledSideEffects())
return false;		return false;

// See if this instruction does a load. If so, we have to guarantee that the		// See if this instruction does a load. If so, we have to guarantee that the
// loaded value doesn't change between the load and the its intended		// loaded value doesn't change between the load and the its intended
// destination. The check for isInvariantLoad gives the targe the chance to		// destination. The check for isInvariantLoad gives the targe the chance to
// classify the load as always returning a constant, e.g. a constant pool		// classify the load as always returning a constant, e.g. a constant pool
// load.		// load.
if (mayLoad() && !isDereferenceableInvariantLoad(AA))		if (mayLoad() && !isDereferenceableInvariantLoad(AA))
▲ Show 20 Lines • Show All 349 Lines • ▼ Show 20 Lines	void MachineInstr::print(raw_ostream &OS, ModuleSlotTracker &MST,
if (getFlag(MachineInstr::FmReassoc))		if (getFlag(MachineInstr::FmReassoc))
OS << "reassoc ";		OS << "reassoc ";
if (getFlag(MachineInstr::NoUWrap))		if (getFlag(MachineInstr::NoUWrap))
OS << "nuw ";		OS << "nuw ";
if (getFlag(MachineInstr::NoSWrap))		if (getFlag(MachineInstr::NoSWrap))
OS << "nsw ";		OS << "nsw ";
if (getFlag(MachineInstr::IsExact))		if (getFlag(MachineInstr::IsExact))
OS << "exact ";		OS << "exact ";
		if (getFlag(MachineInstr::FPExcept))
		OS << "fpexcept ";

// Print the opcode name.		// Print the opcode name.
if (TII)		if (TII)
OS << TII->getName(getOpcode());		OS << TII->getName(getOpcode());
else		else
OS << "UNKNOWN";		OS << "UNKNOWN";

if (SkipOpers)		if (SkipOpers)

llvm/trunk/lib/CodeGen/MachinePipeliner.cpp

Show First 20 Lines • Show All 573 Lines • ▼ Show 20 Lines	while (!Worklist.empty()) {
}		}
}		}
return false;		return false;
}		}

/// Return true if the instruction causes a chain between memory		/// Return true if the instruction causes a chain between memory
/// references before and after it.		/// references before and after it.
static bool isDependenceBarrier(MachineInstr &MI, AliasAnalysis *AA) {		static bool isDependenceBarrier(MachineInstr &MI, AliasAnalysis *AA) {
return MI.isCall() || MI.hasUnmodeledSideEffects() ||		return MI.isCall() || MI.mayRaiseFPException() ||
		MI.hasUnmodeledSideEffects() ||
(MI.hasOrderedMemoryRef() &&		(MI.hasOrderedMemoryRef() &&
(!MI.mayLoad() || !MI.isDereferenceableInvariantLoad(AA)));		(!MI.mayLoad() || !MI.isDereferenceableInvariantLoad(AA)));
}		}

/// Return the underlying objects for the memory references of an instruction.		/// Return the underlying objects for the memory references of an instruction.
/// This function calls the code in ValueTracking, but first checks that the		/// This function calls the code in ValueTracking, but first checks that the
/// instruction has a memory operand.		/// instruction has a memory operand.
static void getUnderlyingObjects(const MachineInstr *MI,		static void getUnderlyingObjects(const MachineInstr *MI,
▲ Show 20 Lines • Show All 2,642 Lines • ▼ Show 20 Lines	bool SwingSchedulerDAG::isLoopCarriedDep(SUnit *Source, const SDep &Dep,
MachineInstr *SI = Source->getInstr();		MachineInstr *SI = Source->getInstr();
MachineInstr *DI = Dep.getSUnit()->getInstr();		MachineInstr *DI = Dep.getSUnit()->getInstr();
if (!isSucc)		if (!isSucc)
std::swap(SI, DI);		std::swap(SI, DI);
assert(SI != nullptr && DI != nullptr && "Expecting SUnit with an MI.");		assert(SI != nullptr && DI != nullptr && "Expecting SUnit with an MI.");

// Assume ordered loads and stores may have a loop carried dependence.		// Assume ordered loads and stores may have a loop carried dependence.
if (SI->hasUnmodeledSideEffects() || DI->hasUnmodeledSideEffects() ||		if (SI->hasUnmodeledSideEffects() || DI->hasUnmodeledSideEffects() ||
		SI->mayRaiseFPException() || DI->mayRaiseFPException() ||
SI->hasOrderedMemoryRef() || DI->hasOrderedMemoryRef())		SI->hasOrderedMemoryRef() || DI->hasOrderedMemoryRef())
return true;		return true;

// Only chain dependences between a load and store can be loop carried.		// Only chain dependences between a load and store can be loop carried.
if (!DI->mayStore() || !SI->mayLoad())		if (!DI->mayStore() || !SI->mayLoad())
return false;		return false;

unsigned DeltaS, DeltaD;		unsigned DeltaS, DeltaD;

llvm/trunk/lib/CodeGen/PeepholeOptimizer.cpp

Show First 20 Lines • Show All 1,819 Lines • ▼ Show 20 Lines	if (Src.isUndef())
return ValueTrackerResult();		return ValueTrackerResult();
return ValueTrackerResult(Src.getReg(), Src.getSubReg());		return ValueTrackerResult(Src.getReg(), Src.getSubReg());
}		}

ValueTrackerResult ValueTracker::getNextSourceFromBitcast() {		ValueTrackerResult ValueTracker::getNextSourceFromBitcast() {
assert(Def->isBitcast() && "Invalid definition");		assert(Def->isBitcast() && "Invalid definition");

// Bail if there are effects that a plain copy will not expose.		// Bail if there are effects that a plain copy will not expose.
if (Def->hasUnmodeledSideEffects())		if (Def->mayRaiseFPException() || Def->hasUnmodeledSideEffects())
return ValueTrackerResult();		return ValueTrackerResult();

// Bitcasts with more than one def are not supported.		// Bitcasts with more than one def are not supported.
if (Def->getDesc().getNumDefs() != 1)		if (Def->getDesc().getNumDefs() != 1)
return ValueTrackerResult();		return ValueTrackerResult();
const MachineOperand DefOp = Def->getOperand(DefIdx);		const MachineOperand DefOp = Def->getOperand(DefIdx);
if (DefOp.getSubReg() != DefSubReg)		if (DefOp.getSubReg() != DefSubReg)
// If we look for a different subreg, it means we want a subreg of the src.		// If we look for a different subreg, it means we want a subreg of the src.

llvm/trunk/lib/CodeGen/ScheduleDAGInstrs.cpp

Show First 20 Lines • Show All 706 Lines • ▼ Show 20 Lines	void ScheduleDAGInstrs::buildSchedGraph(AliasAnalysis *AA,
LiveIntervals *LIS,		LiveIntervals *LIS,
bool TrackLaneMasks) {		bool TrackLaneMasks) {
const TargetSubtargetInfo &ST = MF.getSubtarget();		const TargetSubtargetInfo &ST = MF.getSubtarget();
bool UseAA = EnableAASchedMI.getNumOccurrences() > 0 ? EnableAASchedMI		bool UseAA = EnableAASchedMI.getNumOccurrences() > 0 ? EnableAASchedMI
: ST.useAA();		: ST.useAA();
AAForDep = UseAA ? AA : nullptr;		AAForDep = UseAA ? AA : nullptr;

BarrierChain = nullptr;		BarrierChain = nullptr;
		SUnit *FPBarrierChain = nullptr;

this->TrackLaneMasks = TrackLaneMasks;		this->TrackLaneMasks = TrackLaneMasks;
MISUnitMap.clear();		MISUnitMap.clear();
ScheduleDAG::clearDAG();		ScheduleDAG::clearDAG();

// Create an SUnit for each real instruction.		// Create an SUnit for each real instruction.
initSUnits();		initSUnits();

▲ Show 20 Lines • Show All 143 Lines • ▼ Show 20 Lines	if (isGlobalMemoryObject(AA, &MI)) {
<< BarrierChain->NodeNum << ").\n";);		<< BarrierChain->NodeNum << ").\n";);

// Add dependencies against everything below it and clear maps.		// Add dependencies against everything below it and clear maps.
addBarrierChain(Stores);		addBarrierChain(Stores);
addBarrierChain(Loads);		addBarrierChain(Loads);
addBarrierChain(NonAliasStores);		addBarrierChain(NonAliasStores);
addBarrierChain(NonAliasLoads);		addBarrierChain(NonAliasLoads);

		// Add dependency against previous FP barrier and reset FP barrier.
		if (FPBarrierChain)
		FPBarrierChain->addPredBarrier(BarrierChain);
		FPBarrierChain = BarrierChain;

continue;		continue;
}		}

		// Instructions that may raise FP exceptions depend on each other.
		if (MI.mayRaiseFPException()) {
		if (FPBarrierChain)
		FPBarrierChain->addPredBarrier(SU);
		FPBarrierChain = SU;
		}

// If it's not a store or a variant load, we're done.		// If it's not a store or a variant load, we're done.
if (!MI.mayStore() &&		if (!MI.mayStore() &&
!(MI.mayLoad() && !MI.isDereferenceableInvariantLoad(AA)))		!(MI.mayLoad() && !MI.isDereferenceableInvariantLoad(AA)))
continue;		continue;

// Always add dependecy edge to BarrierChain if present.		// Always add dependecy edge to BarrierChain if present.
if (BarrierChain)		if (BarrierChain)
BarrierChain->addPredBarrier(SU);		BarrierChain->addPredBarrier(SU);
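
Reviewer-style note: the `FPBarrierChain` logic above threads a single ordering chain through every instruction that may raise an FP exception, with global barriers joining the same chain. A minimal standalone sketch of that chaining, assuming the block is walked bottom-up (as the "everything below it" comment suggests) and using hypothetical toy types in place of `SUnit` and `MachineInstr`:

```cpp
#include <cassert>
#include <utility>
#include <vector>

// Toy instruction: only the two properties relevant to the FP chain.
struct ToyInstr {
  bool IsGlobalBarrier;
  bool MayRaiseFPException;
};

// Returns (pred, succ) index pairs: pred must stay before succ.
// Indices are positions in program order.
inline std::vector<std::pair<int, int>>
fpOrderEdges(const std::vector<ToyInstr> &Block) {
  std::vector<std::pair<int, int>> Edges;
  int FPBarrierChain = -1; // Nearest chained instruction below the current one.
  for (int I = static_cast<int>(Block.size()) - 1; I >= 0; --I) {
    const ToyInstr &MI = Block[I];
    // Both global barriers and FP-exception-raising instructions link to the
    // previous chain head and then become the new head.
    if (MI.IsGlobalBarrier || MI.MayRaiseFPException) {
      if (FPBarrierChain >= 0)
        Edges.emplace_back(I, FPBarrierChain);
      FPBarrierChain = I;
    }
  }
  return Edges;
}
```

Instructions that neither raise FP exceptions nor act as global barriers get no edge in this model, so they remain free to move relative to the chain, while each chained instruction is ordered only against its nearest chained neighbour.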

llvm/trunk/lib/CodeGen/SelectionDAG/InstrEmitter.cpp

Show First 20 Lines • Show All 877 Lines • ▼ Show 20 Lines	if (NumResults) {
if (Flags.hasNoUnsignedWrap())		if (Flags.hasNoUnsignedWrap())
MI->setFlag(MachineInstr::MIFlag::NoUWrap);		MI->setFlag(MachineInstr::MIFlag::NoUWrap);

if (Flags.hasNoSignedWrap())		if (Flags.hasNoSignedWrap())
MI->setFlag(MachineInstr::MIFlag::NoSWrap);		MI->setFlag(MachineInstr::MIFlag::NoSWrap);

if (Flags.hasExact())		if (Flags.hasExact())
MI->setFlag(MachineInstr::MIFlag::IsExact);		MI->setFlag(MachineInstr::MIFlag::IsExact);

		if (Flags.hasFPExcept())
		MI->setFlag(MachineInstr::MIFlag::FPExcept);
}		}

// Emit all of the actual operands of this instruction, adding them to the		// Emit all of the actual operands of this instruction, adding them to the
// instruction as appropriate.		// instruction as appropriate.
bool HasOptPRefs = NumDefs > NumResults;		bool HasOptPRefs = NumDefs > NumResults;
assert((!HasOptPRefs || !HasPhysRegOuts) &&		assert((!HasOptPRefs || !HasPhysRegOuts) &&
"Unable to cope with optional defs and phys regs defs!");		"Unable to cope with optional defs and phys regs defs!");
unsigned NumSkip = HasOptPRefs ? NumDefs - NumResults : 0;		unsigned NumSkip = HasOptPRefs ? NumDefs - NumResults : 0;

llvm/trunk/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp


Show First 20 Lines • Show All 6,949 Lines • ▼ Show 20 Lines	Result = DAG.getNode(Opcode, sdl, VTs,
{ Chain, getValue(FPI.getArgOperand(0)),		{ Chain, getValue(FPI.getArgOperand(0)),
getValue(FPI.getArgOperand(1)),		getValue(FPI.getArgOperand(1)),
getValue(FPI.getArgOperand(2)) });		getValue(FPI.getArgOperand(2)) });
else		else
Result = DAG.getNode(Opcode, sdl, VTs,		Result = DAG.getNode(Opcode, sdl, VTs,
{ Chain, getValue(FPI.getArgOperand(0)),		{ Chain, getValue(FPI.getArgOperand(0)),
getValue(FPI.getArgOperand(1)) });		getValue(FPI.getArgOperand(1)) });

		if (FPI.getExceptionBehavior() !=
		ConstrainedFPIntrinsic::ExceptionBehavior::ebIgnore) {
		SDNodeFlags Flags;
		Flags.setFPExcept(true);
		Result->setFlags(Flags);
		}

assert(Result.getNode()->getNumValues() == 2);		assert(Result.getNode()->getNumValues() == 2);
SDValue OutChain = Result.getValue(1);		SDValue OutChain = Result.getValue(1);
DAG.setRoot(OutChain);		DAG.setRoot(OutChain);
SDValue FPResult = Result.getValue(0);		SDValue FPResult = Result.getValue(0);
setValue(&FPI, FPResult);		setValue(&FPI, FPResult);
}		}

std::pair<SDValue, SDValue>		std::pair<SDValue, SDValue>

llvm/trunk/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp

Show First 20 Lines • Show All 1,116 Lines • ▼ Show 20 Lines	#ifndef NDEBUG
assert(Op->getNodeId() != -1 &&		assert(Op->getNodeId() != -1 &&
"Node has already selected predecessor node");		"Node has already selected predecessor node");
}		}
}		}
}		}
#endif		#endif

// When we are using non-default rounding modes or FP exception behavior		// When we are using non-default rounding modes or FP exception behavior
// FP operations are represented by StrictFP pseudo-operations. They		// FP operations are represented by StrictFP pseudo-operations. For
// need to be simplified here so that the target-specific instruction		// targets that do not (yet) understand strict FP operations directly,
// selectors know how to handle them.		// we convert them to normal FP opcodes instead at this point. This
//		// will allow them to be handled by existing target-specific instruction
// If the current node is a strict FP pseudo-op, the isStrictFPOp()		// selectors.
// function will provide the corresponding normal FP opcode to which the		if (Node->isStrictFPOpcode() &&
// node should be mutated.		(TLI->getOperationAction(Node->getOpcode(), Node->getValueType(0))
//		!= TargetLowering::Legal))
// FIXME: The backends need a way to handle FP constraints.
if (Node->isStrictFPOpcode())
Node = CurDAG->mutateStrictFPToFP(Node);		Node = CurDAG->mutateStrictFPToFP(Node);

LLVM_DEBUG(dbgs() << "\nISEL: Starting selection on root node: ";		LLVM_DEBUG(dbgs() << "\nISEL: Starting selection on root node: ";
Node->dump(CurDAG));		Node->dump(CurDAG));

Select(Node);		Select(Node);
}		}

[2,491 lines not shown]
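
The selection-time rule in the new column can be pictured with a small stand-alone model. This is only a sketch under stated assumptions: `Opcode`, `Action`, `ToyTarget`, `toNonStrict`, and `opcodeForSelection` are hypothetical names that do not exist in LLVM. They merely mimic the roles of `isStrictFPOpcode()`, `getOperationAction()`, and `mutateStrictFPToFP()` in the hunk above.

```cpp
#include <cassert>
#include <map>

// Hypothetical stand-ins for ISD opcodes; not LLVM's real enum.
enum class Opcode { FADD, FSQRT, STRICT_FADD, STRICT_FSQRT };

// Hypothetical stand-in for the legalize action of an operation.
enum class Action { Legal, Expand };

bool isStrictFPOpcode(Opcode Op) {
  return Op == Opcode::STRICT_FADD || Op == Opcode::STRICT_FSQRT;
}

// Maps a strict opcode to its non-strict counterpart; others pass through.
Opcode toNonStrict(Opcode Op) {
  switch (Op) {
  case Opcode::STRICT_FADD:
    return Opcode::FADD;
  case Opcode::STRICT_FSQRT:
    return Opcode::FSQRT;
  default:
    return Op;
  }
}

// Toy target: any opcode without an explicit override reports Expand.
struct ToyTarget {
  std::map<Opcode, Action> Overrides;

  Action getOperationAction(Opcode Op) const {
    auto It = Overrides.find(Op);
    return It == Overrides.end() ? Action::Expand : It->second;
  }
};

// Mirrors the new condition: a strict op is rewritten to the normal op
// only when the target has not declared the strict form Legal.
Opcode opcodeForSelection(const ToyTarget &TLI, Opcode Op) {
  if (isStrictFPOpcode(Op) && TLI.getOperationAction(Op) != Action::Legal)
    return toNonStrict(Op);
  return Op;
}
```

In this model a target that marks `STRICT_FADD` as `Legal` sees the strict opcode during selection, while a target that leaves the default in place still sees plain `FADD`, which matches the old unconditional behavior.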

llvm/trunk/lib/CodeGen/TargetInstrInfo.cpp

[893 lines not shown]	bool TargetInstrInfo::isReallyTriviallyReMaterializableGeneric(
// redundant with subsequent checks, but it's target-independent,		// redundant with subsequent checks, but it's target-independent,
// simple, and a common case.		// simple, and a common case.
int FrameIdx = 0;		int FrameIdx = 0;
if (isLoadFromStackSlot(MI, FrameIdx) &&		if (isLoadFromStackSlot(MI, FrameIdx) &&
MF.getFrameInfo().isImmutableObjectIndex(FrameIdx))		MF.getFrameInfo().isImmutableObjectIndex(FrameIdx))
return true;		return true;

// Avoid instructions obviously unsafe for remat.		// Avoid instructions obviously unsafe for remat.
if (MI.isNotDuplicable() || MI.mayStore() || MI.hasUnmodeledSideEffects())		if (MI.isNotDuplicable() || MI.mayStore() || MI.mayRaiseFPException() ||
		MI.hasUnmodeledSideEffects())
return false;		return false;

// Don't remat inline asm. We have no idea how expensive it is		// Don't remat inline asm. We have no idea how expensive it is
// even if it's side effect free.		// even if it's side effect free.
if (MI.isInlineAsm())		if (MI.isInlineAsm())
return false;		return false;

// Avoid instructions which load from potentially varying memory.		// Avoid instructions which load from potentially varying memory.
[311 lines not shown]
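
A minimal sketch of the updated bail-out test, assuming a hypothetical `ToyInstr` struct whose boolean fields stand in for the `MachineInstr` queries used above (`isNotDuplicable()`, `mayStore()`, `mayRaiseFPException()`, `hasUnmodeledSideEffects()`). The likely motivation is that rematerializing an instruction re-executes it at another program point, which could move or repeat a floating-point exception.

```cpp
#include <cassert>

// Hypothetical stand-in for the MachineInstr properties consulted by the
// check; the field names are illustrative, not LLVM API.
struct ToyInstr {
  bool NotDuplicable = false;
  bool MayStore = false;
  bool MayRaiseFPException = false;
  bool UnmodeledSideEffects = false;
};

// Mirrors the updated condition: any single property rules out remat.
bool obviouslyUnsafeForRemat(const ToyInstr &MI) {
  return MI.NotDuplicable || MI.MayStore || MI.MayRaiseFPException ||
         MI.UnmodeledSideEffects;
}
```

Under the old condition an instruction with only `MayRaiseFPException` set would have passed this particular check; with the added term it is rejected early.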

llvm/trunk/lib/CodeGen/TargetLoweringBase.cpp

[657 lines not shown]	for (MVT VT : MVT::all_valuetypes()) {
// These operations default to expand for vector types.		// These operations default to expand for vector types.
if (VT.isVector()) {		if (VT.isVector()) {
setOperationAction(ISD::FCOPYSIGN, VT, Expand);		setOperationAction(ISD::FCOPYSIGN, VT, Expand);
setOperationAction(ISD::ANY_EXTEND_VECTOR_INREG, VT, Expand);		setOperationAction(ISD::ANY_EXTEND_VECTOR_INREG, VT, Expand);
setOperationAction(ISD::SIGN_EXTEND_VECTOR_INREG, VT, Expand);		setOperationAction(ISD::SIGN_EXTEND_VECTOR_INREG, VT, Expand);
setOperationAction(ISD::ZERO_EXTEND_VECTOR_INREG, VT, Expand);		setOperationAction(ISD::ZERO_EXTEND_VECTOR_INREG, VT, Expand);
}		}

		// Constrained floating-point operations default to expand.
		setOperationAction(ISD::STRICT_FADD, VT, Expand);
		setOperationAction(ISD::STRICT_FSUB, VT, Expand);
		setOperationAction(ISD::STRICT_FMUL, VT, Expand);
		setOperationAction(ISD::STRICT_FDIV, VT, Expand);
		setOperationAction(ISD::STRICT_FREM, VT, Expand);
		setOperationAction(ISD::STRICT_FMA, VT, Expand);
		setOperationAction(ISD::STRICT_FSQRT, VT, Expand);
		setOperationAction(ISD::STRICT_FPOW, VT, Expand);
		setOperationAction(ISD::STRICT_FPOWI, VT, Expand);
		setOperationAction(ISD::STRICT_FSIN, VT, Expand);
		setOperationAction(ISD::STRICT_FCOS, VT, Expand);
		setOperationAction(ISD::STRICT_FEXP, VT, Expand);
		setOperationAction(ISD::STRICT_FEXP2, VT, Expand);
		setOperationAction(ISD::STRICT_FLOG, VT, Expand);
		setOperationAction(ISD::STRICT_FLOG10, VT, Expand);
		setOperationAction(ISD::STRICT_FLOG2, VT, Expand);
		setOperationAction(ISD::STRICT_FRINT, VT, Expand);
		setOperationAction(ISD::STRICT_FNEARBYINT, VT, Expand);
		setOperationAction(ISD::STRICT_FCEIL, VT, Expand);
		setOperationAction(ISD::STRICT_FFLOOR, VT, Expand);
		setOperationAction(ISD::STRICT_FROUND, VT, Expand);
		setOperationAction(ISD::STRICT_FTRUNC, VT, Expand);
		setOperationAction(ISD::STRICT_FMAXNUM, VT, Expand);
		setOperationAction(ISD::STRICT_FMINNUM, VT, Expand);
		setOperationAction(ISD::STRICT_FP_ROUND, VT, Expand);
		setOperationAction(ISD::STRICT_FP_EXTEND, VT, Expand);

// For most targets @llvm.get.dynamic.area.offset just returns 0.		// For most targets @llvm.get.dynamic.area.offset just returns 0.
setOperationAction(ISD::GET_DYNAMIC_AREA_OFFSET, VT, Expand);		setOperationAction(ISD::GET_DYNAMIC_AREA_OFFSET, VT, Expand);

// Vector reduction default to expand.		// Vector reduction default to expand.
setOperationAction(ISD::VECREDUCE_FADD, VT, Expand);		setOperationAction(ISD::VECREDUCE_FADD, VT, Expand);
setOperationAction(ISD::VECREDUCE_FMUL, VT, Expand);		setOperationAction(ISD::VECREDUCE_FMUL, VT, Expand);
setOperationAction(ISD::VECREDUCE_ADD, VT, Expand);		setOperationAction(ISD::VECREDUCE_ADD, VT, Expand);
setOperationAction(ISD::VECREDUCE_MUL, VT, Expand);		setOperationAction(ISD::VECREDUCE_MUL, VT, Expand);
[1,224 lines not shown]
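
The two-step configuration (generic defaults set to Expand here, then selected entries overridden to Legal by a target, as the SystemZ hunk in this diff does) can be sketched with a toy table. Everything in the block is a hypothetical model: `ToyLowering`, `makeToyTarget`, and the reduced `Op` and `VT` enums are not LLVM types, and the feature flag only imitates a subtarget check such as `hasFPExtension()`.

```cpp
#include <cassert>
#include <initializer_list>
#include <map>
#include <utility>

// Reduced, hypothetical versions of the opcode and value-type enums.
enum class Op { STRICT_FADD, STRICT_FSQRT, STRICT_FFLOOR };
enum class VT { f32, f64 };
enum class Action { Legal, Expand };

class ToyLowering {
  std::map<std::pair<Op, VT>, Action> Actions;

public:
  // Generic step: every strict op starts out as Expand for every type.
  ToyLowering() {
    for (VT Ty : {VT::f32, VT::f64})
      for (Op O : {Op::STRICT_FADD, Op::STRICT_FSQRT, Op::STRICT_FFLOOR})
        Actions[std::make_pair(O, Ty)] = Action::Expand;
  }

  void setOperationAction(Op O, VT Ty, Action A) {
    Actions[std::make_pair(O, Ty)] = A;
  }

  Action getOperationAction(Op O, VT Ty) const {
    return Actions.at(std::make_pair(O, Ty));
  }
};

// Target step: override the defaults for operations the target can select
// directly; the floor override is gated on a feature flag.
ToyLowering makeToyTarget(bool HasFPExtension) {
  ToyLowering TL;
  for (VT Ty : {VT::f32, VT::f64}) {
    TL.setOperationAction(Op::STRICT_FADD, Ty, Action::Legal);
    TL.setOperationAction(Op::STRICT_FSQRT, Ty, Action::Legal);
    if (HasFPExtension)
      TL.setOperationAction(Op::STRICT_FFLOOR, Ty, Action::Legal);
  }
  return TL;
}
```

With this layout, a target that does nothing keeps Expand everywhere, so the existing fallback to non-strict opcodes continues to apply to it.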

llvm/trunk/lib/Target/SystemZ/SystemZISelLowering.cpp

[395 lines not shown]	if (isTypeLegal(VT)) {
}		}

// No special instructions for these.		// No special instructions for these.
setOperationAction(ISD::FSIN, VT, Expand);		setOperationAction(ISD::FSIN, VT, Expand);
setOperationAction(ISD::FCOS, VT, Expand);		setOperationAction(ISD::FCOS, VT, Expand);
setOperationAction(ISD::FSINCOS, VT, Expand);		setOperationAction(ISD::FSINCOS, VT, Expand);
setOperationAction(ISD::FREM, VT, Expand);		setOperationAction(ISD::FREM, VT, Expand);
setOperationAction(ISD::FPOW, VT, Expand);		setOperationAction(ISD::FPOW, VT, Expand);

		// Handle constrained floating-point operations.
		setOperationAction(ISD::STRICT_FADD, VT, Legal);
		setOperationAction(ISD::STRICT_FSUB, VT, Legal);
		setOperationAction(ISD::STRICT_FMUL, VT, Legal);
		setOperationAction(ISD::STRICT_FDIV, VT, Legal);
		setOperationAction(ISD::STRICT_FMA, VT, Legal);
		setOperationAction(ISD::STRICT_FSQRT, VT, Legal);
		setOperationAction(ISD::STRICT_FRINT, VT, Legal);
		setOperationAction(ISD::STRICT_FP_ROUND, VT, Legal);
		setOperationAction(ISD::STRICT_FP_EXTEND, VT, Legal);
		if (Subtarget.hasFPExtension()) {
		setOperationAction(ISD::STRICT_FNEARBYINT, VT, Legal);
		setOperationAction(ISD::STRICT_FFLOOR, VT, Legal);
		setOperationAction(ISD::STRICT_FCEIL, VT, Legal);
		setOperationAction(ISD::STRICT_FROUND, VT, Legal);
		setOperationAction(ISD::STRICT_FTRUNC, VT, Legal);
		}
}		}
}		}

// Handle floating-point vector types.		// Handle floating-point vector types.
if (Subtarget.hasVector()) {		if (Subtarget.hasVector()) {
// Scalar-to-vector conversion is just a subreg.		// Scalar-to-vector conversion is just a subreg.
setOperationAction(ISD::SCALAR_TO_VECTOR, MVT::v4f32, Legal);		setOperationAction(ISD::SCALAR_TO_VECTOR, MVT::v4f32, Legal);
setOperationAction(ISD::SCALAR_TO_VECTOR, MVT::v2f64, Legal);		setOperationAction(ISD::SCALAR_TO_VECTOR, MVT::v2f64, Legal);
[15 lines not shown]	if (Subtarget.hasVector()) {
setOperationAction(ISD::FABS, MVT::v2f64, Legal);		setOperationAction(ISD::FABS, MVT::v2f64, Legal);
setOperationAction(ISD::FSQRT, MVT::v2f64, Legal);		setOperationAction(ISD::FSQRT, MVT::v2f64, Legal);
setOperationAction(ISD::FRINT, MVT::v2f64, Legal);		setOperationAction(ISD::FRINT, MVT::v2f64, Legal);
setOperationAction(ISD::FNEARBYINT, MVT::v2f64, Legal);		setOperationAction(ISD::FNEARBYINT, MVT::v2f64, Legal);
setOperationAction(ISD::FFLOOR, MVT::v2f64, Legal);		setOperationAction(ISD::FFLOOR, MVT::v2f64, Legal);
setOperationAction(ISD::FCEIL, MVT::v2f64, Legal);		setOperationAction(ISD::FCEIL, MVT::v2f64, Legal);
setOperationAction(ISD::FTRUNC, MVT::v2f64, Legal);		setOperationAction(ISD::FTRUNC, MVT::v2f64, Legal);
setOperationAction(ISD::FROUND, MVT::v2f64, Legal);		setOperationAction(ISD::FROUND, MVT::v2f64, Legal);

		// Handle constrained floating-point operations.
		setOperationAction(ISD::STRICT_FADD, MVT::v2f64, Legal);
		setOperationAction(ISD::STRICT_FSUB, MVT::v2f64, Legal);
		setOperationAction(ISD::STRICT_FMUL, MVT::v2f64, Legal);
		setOperationAction(ISD::STRICT_FMA, MVT::v2f64, Legal);
		setOperationAction(ISD::STRICT_FDIV, MVT::v2f64, Legal);
		setOperationAction(ISD::STRICT_FSQRT, MVT::v2f64, Legal);
		setOperationAction(ISD::STRICT_FRINT, MVT::v2f64, Legal);
		setOperationAction(ISD::STRICT_FNEARBYINT, MVT::v2f64, Legal);
		setOperationAction(ISD::STRICT_FFLOOR, MVT::v2f64, Legal);
		setOperationAction(ISD::STRICT_FCEIL, MVT::v2f64, Legal);
		setOperationAction(ISD::STRICT_FTRUNC, MVT::v2f64, Legal);
		setOperationAction(ISD::STRICT_FROUND, MVT::v2f64, Legal);
}		}

// The vector enhancements facility 1 has instructions for these.		// The vector enhancements facility 1 has instructions for these.
if (Subtarget.hasVectorEnhancements1()) {		if (Subtarget.hasVectorEnhancements1()) {
setOperationAction(ISD::FADD, MVT::v4f32, Legal);		setOperationAction(ISD::FADD, MVT::v4f32, Legal);
setOperationAction(ISD::FNEG, MVT::v4f32, Legal);		setOperationAction(ISD::FNEG, MVT::v4f32, Legal);
setOperationAction(ISD::FSUB, MVT::v4f32, Legal);		setOperationAction(ISD::FSUB, MVT::v4f32, Legal);
setOperationAction(ISD::FMUL, MVT::v4f32, Legal);		setOperationAction(ISD::FMUL, MVT::v4f32, Legal);
[27 lines not shown]	if (Subtarget.hasVectorEnhancements1()) {
setOperationAction(ISD::FMAXIMUM, MVT::v4f32, Legal);		setOperationAction(ISD::FMAXIMUM, MVT::v4f32, Legal);
setOperationAction(ISD::FMINNUM, MVT::v4f32, Legal);		setOperationAction(ISD::FMINNUM, MVT::v4f32, Legal);
setOperationAction(ISD::FMINIMUM, MVT::v4f32, Legal);		setOperationAction(ISD::FMINIMUM, MVT::v4f32, Legal);

setOperationAction(ISD::FMAXNUM, MVT::f128, Legal);		setOperationAction(ISD::FMAXNUM, MVT::f128, Legal);
setOperationAction(ISD::FMAXIMUM, MVT::f128, Legal);		setOperationAction(ISD::FMAXIMUM, MVT::f128, Legal);
setOperationAction(ISD::FMINNUM, MVT::f128, Legal);		setOperationAction(ISD::FMINNUM, MVT::f128, Legal);
setOperationAction(ISD::FMINIMUM, MVT::f128, Legal);		setOperationAction(ISD::FMINIMUM, MVT::f128, Legal);

		// Handle constrained floating-point operations.
		setOperationAction(ISD::STRICT_FADD, MVT::v4f32, Legal);
		setOperationAction(ISD::STRICT_FSUB, MVT::v4f32, Legal);
		setOperationAction(ISD::STRICT_FMUL, MVT::v4f32, Legal);
		setOperationAction(ISD::STRICT_FMA, MVT::v4f32, Legal);
		setOperationAction(ISD::STRICT_FDIV, MVT::v4f32, Legal);
		setOperationAction(ISD::STRICT_FSQRT, MVT::v4f32, Legal);
		setOperationAction(ISD::STRICT_FRINT, MVT::v4f32, Legal);
		setOperationAction(ISD::STRICT_FNEARBYINT, MVT::v4f32, Legal);
		setOperationAction(ISD::STRICT_FFLOOR, MVT::v4f32, Legal);
		setOperationAction(ISD::STRICT_FCEIL, MVT::v4f32, Legal);
		setOperationAction(ISD::STRICT_FROUND, MVT::v4f32, Legal);
		setOperationAction(ISD::STRICT_FTRUNC, MVT::v4f32, Legal);
		for (auto VT : { MVT::f32, MVT::f64, MVT::f128,
		MVT::v4f32, MVT::v2f64 }) {
		setOperationAction(ISD::STRICT_FMAXNUM, VT, Legal);
		setOperationAction(ISD::STRICT_FMINNUM, VT, Legal);
		}
}		}

// We have fused multiply-addition for f32 and f64 but not f128.		// We have fused multiply-addition for f32 and f64 but not f128.
setOperationAction(ISD::FMA, MVT::f32, Legal);		setOperationAction(ISD::FMA, MVT::f32, Legal);
setOperationAction(ISD::FMA, MVT::f64, Legal);		setOperationAction(ISD::FMA, MVT::f64, Legal);
if (Subtarget.hasVectorEnhancements1())		if (Subtarget.hasVectorEnhancements1())
setOperationAction(ISD::FMA, MVT::f128, Legal);		setOperationAction(ISD::FMA, MVT::f128, Legal);
else		else
[6,999 lines not shown]

llvm/trunk/lib/Target/SystemZ/SystemZInstrFP.td

[46 lines not shown]
def LXR : UnaryRRE<"lxr", 0xB365, null_frag, FP128, FP128>;		def LXR : UnaryRRE<"lxr", 0xB365, null_frag, FP128, FP128>;

// For z13 we prefer LDR over LER to avoid partial register dependencies.		// For z13 we prefer LDR over LER to avoid partial register dependencies.
let isCodeGenOnly = 1 in		let isCodeGenOnly = 1 in
def LDR32 : UnaryRR<"ldr", 0x28, null_frag, FP32, FP32>;		def LDR32 : UnaryRR<"ldr", 0x28, null_frag, FP32, FP32>;

// Moves between two floating-point registers that also set the condition		// Moves between two floating-point registers that also set the condition
// codes.		// codes.
let Uses = [FPC], Defs = [CC], CCValues = 0xF, CompareZeroCCMask = 0xF in {		let Uses = [FPC], mayRaiseFPException = 1,
		Defs = [CC], CCValues = 0xF, CompareZeroCCMask = 0xF in {
defm LTEBR : LoadAndTestRRE<"ltebr", 0xB302, FP32>;		defm LTEBR : LoadAndTestRRE<"ltebr", 0xB302, FP32>;
defm LTDBR : LoadAndTestRRE<"ltdbr", 0xB312, FP64>;		defm LTDBR : LoadAndTestRRE<"ltdbr", 0xB312, FP64>;
defm LTXBR : LoadAndTestRRE<"ltxbr", 0xB342, FP128>;		defm LTXBR : LoadAndTestRRE<"ltxbr", 0xB342, FP128>;
}		}
// Note that LTxBRCompare is not available if we have vector support,		// Note that LTxBRCompare is not available if we have vector support,
// since load-and-test instructions will partially clobber the target		// since load-and-test instructions will partially clobber the target
// (vector) register.		// (vector) register.
let Predicates = [FeatureNoVector] in {		let Predicates = [FeatureNoVector] in {
defm : CompareZeroFP<LTEBRCompare, FP32>;		defm : CompareZeroFP<LTEBRCompare, FP32>;
defm : CompareZeroFP<LTDBRCompare, FP64>;		defm : CompareZeroFP<LTDBRCompare, FP64>;
defm : CompareZeroFP<LTXBRCompare, FP128>;		defm : CompareZeroFP<LTXBRCompare, FP128>;
}		}

// Use a normal load-and-test for compare against zero in case of		// Use a normal load-and-test for compare against zero in case of
// vector support (via a pseudo to simplify instruction selection).		// vector support (via a pseudo to simplify instruction selection).
let Uses = [FPC], Defs = [CC], usesCustomInserter = 1, hasNoSchedulingInfo = 1 in {		let Uses = [FPC], mayRaiseFPException = 1,
		Defs = [CC], usesCustomInserter = 1, hasNoSchedulingInfo = 1 in {
def LTEBRCompare_VecPseudo : Pseudo<(outs), (ins FP32:$R1, FP32:$R2), []>;		def LTEBRCompare_VecPseudo : Pseudo<(outs), (ins FP32:$R1, FP32:$R2), []>;
def LTDBRCompare_VecPseudo : Pseudo<(outs), (ins FP64:$R1, FP64:$R2), []>;		def LTDBRCompare_VecPseudo : Pseudo<(outs), (ins FP64:$R1, FP64:$R2), []>;
def LTXBRCompare_VecPseudo : Pseudo<(outs), (ins FP128:$R1, FP128:$R2), []>;		def LTXBRCompare_VecPseudo : Pseudo<(outs), (ins FP128:$R1, FP128:$R2), []>;
}		}
let Predicates = [FeatureVector] in {		let Predicates = [FeatureVector] in {
defm : CompareZeroFP<LTEBRCompare_VecPseudo, FP32>;		defm : CompareZeroFP<LTEBRCompare_VecPseudo, FP32>;
defm : CompareZeroFP<LTDBRCompare_VecPseudo, FP64>;		defm : CompareZeroFP<LTDBRCompare_VecPseudo, FP64>;
}		}
[88 lines not shown]

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Conversion instructions		// Conversion instructions
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

// Convert floating-point values to narrower representations, rounding		// Convert floating-point values to narrower representations, rounding
// according to the current mode. The destination of LEXBR and LDXBR		// according to the current mode. The destination of LEXBR and LDXBR
// is a 128-bit value, but only the first register of the pair is used.		// is a 128-bit value, but only the first register of the pair is used.
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def LEDBR : UnaryRRE<"ledbr", 0xB344, fpround, FP32, FP64>;		def LEDBR : UnaryRRE<"ledbr", 0xB344, any_fpround, FP32, FP64>;
def LEXBR : UnaryRRE<"lexbr", 0xB346, null_frag, FP128, FP128>;		def LEXBR : UnaryRRE<"lexbr", 0xB346, null_frag, FP128, FP128>;
def LDXBR : UnaryRRE<"ldxbr", 0xB345, null_frag, FP128, FP128>;		def LDXBR : UnaryRRE<"ldxbr", 0xB345, null_frag, FP128, FP128>;

def LEDBRA : TernaryRRFe<"ledbra", 0xB344, FP32, FP64>,		def LEDBRA : TernaryRRFe<"ledbra", 0xB344, FP32, FP64>,
Requires<[FeatureFPExtension]>;		Requires<[FeatureFPExtension]>;
def LEXBRA : TernaryRRFe<"lexbra", 0xB346, FP128, FP128>,		def LEXBRA : TernaryRRFe<"lexbra", 0xB346, FP128, FP128>,
Requires<[FeatureFPExtension]>;		Requires<[FeatureFPExtension]>;
def LDXBRA : TernaryRRFe<"ldxbra", 0xB345, FP128, FP128>,		def LDXBRA : TernaryRRFe<"ldxbra", 0xB345, FP128, FP128>,
Requires<[FeatureFPExtension]>;		Requires<[FeatureFPExtension]>;
}		}

let Predicates = [FeatureNoVectorEnhancements1] in {		let Predicates = [FeatureNoVectorEnhancements1] in {
def : Pat<(f32 (fpround FP128:$src)),		def : Pat<(f32 (any_fpround FP128:$src)),
(EXTRACT_SUBREG (LEXBR FP128:$src), subreg_hh32)>;		(EXTRACT_SUBREG (LEXBR FP128:$src), subreg_hh32)>;
def : Pat<(f64 (fpround FP128:$src)),		def : Pat<(f64 (any_fpround FP128:$src)),
(EXTRACT_SUBREG (LDXBR FP128:$src), subreg_h64)>;		(EXTRACT_SUBREG (LDXBR FP128:$src), subreg_h64)>;
}		}

// Extend register floating-point values to wider representations.		// Extend register floating-point values to wider representations.
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def LDEBR : UnaryRRE<"ldebr", 0xB304, fpextend, FP64, FP32>;		def LDEBR : UnaryRRE<"ldebr", 0xB304, any_fpextend, FP64, FP32>;
def LXEBR : UnaryRRE<"lxebr", 0xB306, null_frag, FP128, FP32>;		def LXEBR : UnaryRRE<"lxebr", 0xB306, null_frag, FP128, FP32>;
def LXDBR : UnaryRRE<"lxdbr", 0xB305, null_frag, FP128, FP64>;		def LXDBR : UnaryRRE<"lxdbr", 0xB305, null_frag, FP128, FP64>;
}		}
let Predicates = [FeatureNoVectorEnhancements1] in {		let Predicates = [FeatureNoVectorEnhancements1] in {
def : Pat<(f128 (fpextend (f32 FP32:$src))), (LXEBR FP32:$src)>;		def : Pat<(f128 (any_fpextend (f32 FP32:$src))), (LXEBR FP32:$src)>;
def : Pat<(f128 (fpextend (f64 FP64:$src))), (LXDBR FP64:$src)>;		def : Pat<(f128 (any_fpextend (f64 FP64:$src))), (LXDBR FP64:$src)>;
}		}

// Extend memory floating-point values to wider representations.		// Extend memory floating-point values to wider representations.
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def LDEB : UnaryRXE<"ldeb", 0xED04, extloadf32, FP64, 4>;		def LDEB : UnaryRXE<"ldeb", 0xED04, extloadf32, FP64, 4>;
def LXEB : UnaryRXE<"lxeb", 0xED06, null_frag, FP128, 4>;		def LXEB : UnaryRXE<"lxeb", 0xED06, null_frag, FP128, 4>;
def LXDB : UnaryRXE<"lxdb", 0xED05, null_frag, FP128, 8>;		def LXDB : UnaryRXE<"lxdb", 0xED05, null_frag, FP128, 8>;
}		}
let Predicates = [FeatureNoVectorEnhancements1] in {		let Predicates = [FeatureNoVectorEnhancements1] in {
def : Pat<(f128 (extloadf32 bdxaddr12only:$src)),		def : Pat<(f128 (extloadf32 bdxaddr12only:$src)),
(LXEB bdxaddr12only:$src)>;		(LXEB bdxaddr12only:$src)>;
def : Pat<(f128 (extloadf64 bdxaddr12only:$src)),		def : Pat<(f128 (extloadf64 bdxaddr12only:$src)),
(LXDB bdxaddr12only:$src)>;		(LXDB bdxaddr12only:$src)>;
}		}

// Convert a signed integer register value to a floating-point one.		// Convert a signed integer register value to a floating-point one.
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def CEFBR : UnaryRRE<"cefbr", 0xB394, sint_to_fp, FP32, GR32>;		def CEFBR : UnaryRRE<"cefbr", 0xB394, sint_to_fp, FP32, GR32>;
def CDFBR : UnaryRRE<"cdfbr", 0xB395, sint_to_fp, FP64, GR32>;		def CDFBR : UnaryRRE<"cdfbr", 0xB395, sint_to_fp, FP64, GR32>;
def CXFBR : UnaryRRE<"cxfbr", 0xB396, sint_to_fp, FP128, GR32>;		def CXFBR : UnaryRRE<"cxfbr", 0xB396, sint_to_fp, FP128, GR32>;

def CEGBR : UnaryRRE<"cegbr", 0xB3A4, sint_to_fp, FP32, GR64>;		def CEGBR : UnaryRRE<"cegbr", 0xB3A4, sint_to_fp, FP32, GR64>;
def CDGBR : UnaryRRE<"cdgbr", 0xB3A5, sint_to_fp, FP64, GR64>;		def CDGBR : UnaryRRE<"cdgbr", 0xB3A5, sint_to_fp, FP64, GR64>;
def CXGBR : UnaryRRE<"cxgbr", 0xB3A6, sint_to_fp, FP128, GR64>;		def CXGBR : UnaryRRE<"cxgbr", 0xB3A6, sint_to_fp, FP128, GR64>;
}		}

// The FP extension feature provides versions of the above that allow		// The FP extension feature provides versions of the above that allow
// specifying rounding mode and inexact-exception suppression flags.		// specifying rounding mode and inexact-exception suppression flags.
let Uses = [FPC], Predicates = [FeatureFPExtension] in {		let Uses = [FPC], mayRaiseFPException = 1, Predicates = [FeatureFPExtension] in {
def CEFBRA : TernaryRRFe<"cefbra", 0xB394, FP32, GR32>;		def CEFBRA : TernaryRRFe<"cefbra", 0xB394, FP32, GR32>;
def CDFBRA : TernaryRRFe<"cdfbra", 0xB395, FP64, GR32>;		def CDFBRA : TernaryRRFe<"cdfbra", 0xB395, FP64, GR32>;
def CXFBRA : TernaryRRFe<"cxfbra", 0xB396, FP128, GR32>;		def CXFBRA : TernaryRRFe<"cxfbra", 0xB396, FP128, GR32>;

def CEGBRA : TernaryRRFe<"cegbra", 0xB3A4, FP32, GR64>;		def CEGBRA : TernaryRRFe<"cegbra", 0xB3A4, FP32, GR64>;
def CDGBRA : TernaryRRFe<"cdgbra", 0xB3A5, FP64, GR64>;		def CDGBRA : TernaryRRFe<"cdgbra", 0xB3A5, FP64, GR64>;
def CXGBRA : TernaryRRFe<"cxgbra", 0xB3A6, FP128, GR64>;		def CXGBRA : TernaryRRFe<"cxgbra", 0xB3A6, FP128, GR64>;
}		}

// Convert an unsigned integer register value to a floating-point one.		// Convert an unsigned integer register value to a floating-point one.
let Predicates = [FeatureFPExtension] in {		let Predicates = [FeatureFPExtension] in {
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def CELFBR : TernaryRRFe<"celfbr", 0xB390, FP32, GR32>;		def CELFBR : TernaryRRFe<"celfbr", 0xB390, FP32, GR32>;
def CDLFBR : TernaryRRFe<"cdlfbr", 0xB391, FP64, GR32>;		def CDLFBR : TernaryRRFe<"cdlfbr", 0xB391, FP64, GR32>;
def CXLFBR : TernaryRRFe<"cxlfbr", 0xB392, FP128, GR32>;		def CXLFBR : TernaryRRFe<"cxlfbr", 0xB392, FP128, GR32>;

def CELGBR : TernaryRRFe<"celgbr", 0xB3A0, FP32, GR64>;		def CELGBR : TernaryRRFe<"celgbr", 0xB3A0, FP32, GR64>;
def CDLGBR : TernaryRRFe<"cdlgbr", 0xB3A1, FP64, GR64>;		def CDLGBR : TernaryRRFe<"cdlgbr", 0xB3A1, FP64, GR64>;
def CXLGBR : TernaryRRFe<"cxlgbr", 0xB3A2, FP128, GR64>;		def CXLGBR : TernaryRRFe<"cxlgbr", 0xB3A2, FP128, GR64>;
}		}

def : Pat<(f32 (uint_to_fp GR32:$src)), (CELFBR 0, GR32:$src, 0)>;		def : Pat<(f32 (uint_to_fp GR32:$src)), (CELFBR 0, GR32:$src, 0)>;
def : Pat<(f64 (uint_to_fp GR32:$src)), (CDLFBR 0, GR32:$src, 0)>;		def : Pat<(f64 (uint_to_fp GR32:$src)), (CDLFBR 0, GR32:$src, 0)>;
def : Pat<(f128 (uint_to_fp GR32:$src)), (CXLFBR 0, GR32:$src, 0)>;		def : Pat<(f128 (uint_to_fp GR32:$src)), (CXLFBR 0, GR32:$src, 0)>;

def : Pat<(f32 (uint_to_fp GR64:$src)), (CELGBR 0, GR64:$src, 0)>;		def : Pat<(f32 (uint_to_fp GR64:$src)), (CELGBR 0, GR64:$src, 0)>;
def : Pat<(f64 (uint_to_fp GR64:$src)), (CDLGBR 0, GR64:$src, 0)>;		def : Pat<(f64 (uint_to_fp GR64:$src)), (CDLGBR 0, GR64:$src, 0)>;
def : Pat<(f128 (uint_to_fp GR64:$src)), (CXLGBR 0, GR64:$src, 0)>;		def : Pat<(f128 (uint_to_fp GR64:$src)), (CXLGBR 0, GR64:$src, 0)>;
}		}

// Convert a floating-point register value to a signed integer value,		// Convert a floating-point register value to a signed integer value,
// with the second operand (modifier M3) specifying the rounding mode.		// with the second operand (modifier M3) specifying the rounding mode.
let Uses = [FPC], Defs = [CC] in {		let Uses = [FPC], mayRaiseFPException = 1, Defs = [CC] in {
def CFEBR : BinaryRRFe<"cfebr", 0xB398, GR32, FP32>;		def CFEBR : BinaryRRFe<"cfebr", 0xB398, GR32, FP32>;
def CFDBR : BinaryRRFe<"cfdbr", 0xB399, GR32, FP64>;		def CFDBR : BinaryRRFe<"cfdbr", 0xB399, GR32, FP64>;
def CFXBR : BinaryRRFe<"cfxbr", 0xB39A, GR32, FP128>;		def CFXBR : BinaryRRFe<"cfxbr", 0xB39A, GR32, FP128>;

def CGEBR : BinaryRRFe<"cgebr", 0xB3A8, GR64, FP32>;		def CGEBR : BinaryRRFe<"cgebr", 0xB3A8, GR64, FP32>;
def CGDBR : BinaryRRFe<"cgdbr", 0xB3A9, GR64, FP64>;		def CGDBR : BinaryRRFe<"cgdbr", 0xB3A9, GR64, FP64>;
def CGXBR : BinaryRRFe<"cgxbr", 0xB3AA, GR64, FP128>;		def CGXBR : BinaryRRFe<"cgxbr", 0xB3AA, GR64, FP128>;
}		}

// fp_to_sint always rounds towards zero, which is modifier value 5.		// fp_to_sint always rounds towards zero, which is modifier value 5.
def : Pat<(i32 (fp_to_sint FP32:$src)), (CFEBR 5, FP32:$src)>;		def : Pat<(i32 (fp_to_sint FP32:$src)), (CFEBR 5, FP32:$src)>;
def : Pat<(i32 (fp_to_sint FP64:$src)), (CFDBR 5, FP64:$src)>;		def : Pat<(i32 (fp_to_sint FP64:$src)), (CFDBR 5, FP64:$src)>;
def : Pat<(i32 (fp_to_sint FP128:$src)), (CFXBR 5, FP128:$src)>;		def : Pat<(i32 (fp_to_sint FP128:$src)), (CFXBR 5, FP128:$src)>;

def : Pat<(i64 (fp_to_sint FP32:$src)), (CGEBR 5, FP32:$src)>;		def : Pat<(i64 (fp_to_sint FP32:$src)), (CGEBR 5, FP32:$src)>;
def : Pat<(i64 (fp_to_sint FP64:$src)), (CGDBR 5, FP64:$src)>;		def : Pat<(i64 (fp_to_sint FP64:$src)), (CGDBR 5, FP64:$src)>;
def : Pat<(i64 (fp_to_sint FP128:$src)), (CGXBR 5, FP128:$src)>;		def : Pat<(i64 (fp_to_sint FP128:$src)), (CGXBR 5, FP128:$src)>;

// The FP extension feature provides versions of the above that allow		// The FP extension feature provides versions of the above that allow
// also specifying the inexact-exception suppression flag.		// also specifying the inexact-exception suppression flag.
let Uses = [FPC], Predicates = [FeatureFPExtension], Defs = [CC] in {		let Uses = [FPC], mayRaiseFPException = 1,
		Predicates = [FeatureFPExtension], Defs = [CC] in {
def CFEBRA : TernaryRRFe<"cfebra", 0xB398, GR32, FP32>;		def CFEBRA : TernaryRRFe<"cfebra", 0xB398, GR32, FP32>;
def CFDBRA : TernaryRRFe<"cfdbra", 0xB399, GR32, FP64>;		def CFDBRA : TernaryRRFe<"cfdbra", 0xB399, GR32, FP64>;
def CFXBRA : TernaryRRFe<"cfxbra", 0xB39A, GR32, FP128>;		def CFXBRA : TernaryRRFe<"cfxbra", 0xB39A, GR32, FP128>;

def CGEBRA : TernaryRRFe<"cgebra", 0xB3A8, GR64, FP32>;		def CGEBRA : TernaryRRFe<"cgebra", 0xB3A8, GR64, FP32>;
def CGDBRA : TernaryRRFe<"cgdbra", 0xB3A9, GR64, FP64>;		def CGDBRA : TernaryRRFe<"cgdbra", 0xB3A9, GR64, FP64>;
def CGXBRA : TernaryRRFe<"cgxbra", 0xB3AA, GR64, FP128>;		def CGXBRA : TernaryRRFe<"cgxbra", 0xB3AA, GR64, FP128>;
}		}

// Convert a floating-point register value to an unsigned integer value.		// Convert a floating-point register value to an unsigned integer value.
let Predicates = [FeatureFPExtension] in {		let Predicates = [FeatureFPExtension] in {
let Uses = [FPC], Defs = [CC] in {		let Uses = [FPC], mayRaiseFPException = 1, Defs = [CC] in {
def CLFEBR : TernaryRRFe<"clfebr", 0xB39C, GR32, FP32>;		def CLFEBR : TernaryRRFe<"clfebr", 0xB39C, GR32, FP32>;
def CLFDBR : TernaryRRFe<"clfdbr", 0xB39D, GR32, FP64>;		def CLFDBR : TernaryRRFe<"clfdbr", 0xB39D, GR32, FP64>;
def CLFXBR : TernaryRRFe<"clfxbr", 0xB39E, GR32, FP128>;		def CLFXBR : TernaryRRFe<"clfxbr", 0xB39E, GR32, FP128>;

def CLGEBR : TernaryRRFe<"clgebr", 0xB3AC, GR64, FP32>;		def CLGEBR : TernaryRRFe<"clgebr", 0xB3AC, GR64, FP32>;
def CLGDBR : TernaryRRFe<"clgdbr", 0xB3AD, GR64, FP64>;		def CLGDBR : TernaryRRFe<"clgdbr", 0xB3AD, GR64, FP64>;
def CLGXBR : TernaryRRFe<"clgxbr", 0xB3AE, GR64, FP128>;		def CLGXBR : TernaryRRFe<"clgxbr", 0xB3AE, GR64, FP128>;
}		}
[49 lines not shown]	let Defs = [CC], CCValues = 0xF, CompareZeroCCMask = 0xF in {
def LNXBR : UnaryRRE<"lnxbr", 0xB341, fnabs, FP128, FP128>;		def LNXBR : UnaryRRE<"lnxbr", 0xB341, fnabs, FP128, FP128>;
}		}
// Generic form, which does not set CC.		// Generic form, which does not set CC.
def LNDFR : UnaryRRE<"lndfr", 0xB371, fnabs, FP64, FP64>;		def LNDFR : UnaryRRE<"lndfr", 0xB371, fnabs, FP64, FP64>;
let isCodeGenOnly = 1 in		let isCodeGenOnly = 1 in
def LNDFR_32 : UnaryRRE<"lndfr", 0xB371, fnabs, FP32, FP32>;		def LNDFR_32 : UnaryRRE<"lndfr", 0xB371, fnabs, FP32, FP32>;

// Square root.		// Square root.
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def SQEBR : UnaryRRE<"sqebr", 0xB314, fsqrt, FP32, FP32>;		def SQEBR : UnaryRRE<"sqebr", 0xB314, any_fsqrt, FP32, FP32>;
def SQDBR : UnaryRRE<"sqdbr", 0xB315, fsqrt, FP64, FP64>;		def SQDBR : UnaryRRE<"sqdbr", 0xB315, any_fsqrt, FP64, FP64>;
def SQXBR : UnaryRRE<"sqxbr", 0xB316, fsqrt, FP128, FP128>;		def SQXBR : UnaryRRE<"sqxbr", 0xB316, any_fsqrt, FP128, FP128>;

def SQEB : UnaryRXE<"sqeb", 0xED14, loadu<fsqrt>, FP32, 4>;		def SQEB : UnaryRXE<"sqeb", 0xED14, loadu<any_fsqrt>, FP32, 4>;
def SQDB : UnaryRXE<"sqdb", 0xED15, loadu<fsqrt>, FP64, 8>;		def SQDB : UnaryRXE<"sqdb", 0xED15, loadu<any_fsqrt>, FP64, 8>;
}		}
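
The switch from `fsqrt` to `any_fsqrt` in these definitions can be read as widening what the pattern accepts. The sketch below models that in plain C++, assuming `any_fsqrt` is defined (in the TargetSelectionDAG.td part of this change) to match either the normal or the strict square-root node. The `Node` enum and both predicates are hypothetical and are not generated by TableGen.

```cpp
#include <cassert>

// Hypothetical node kinds; a real DAG has many more.
enum class Node { FSQRT, STRICT_FSQRT, FADD };

// Old operator: only the non-strict node matches.
bool matchesFsqrt(Node N) { return N == Node::FSQRT; }

// New operator: the strict and the non-strict node both match, so one
// instruction definition covers both forms.
bool matchesAnyFsqrt(Node N) {
  return N == Node::FSQRT || N == Node::STRICT_FSQRT;
}
```

Because the same instruction now serves both forms, the enclosing `let` marks it with `mayRaiseFPException = 1` regardless of which node was matched.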

// Round to an integer, with the second operand (modifier M3) specifying		// Round to an integer, with the second operand (modifier M3) specifying
// the rounding mode. These forms always check for inexact conditions.		// the rounding mode. These forms always check for inexact conditions.
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def FIEBR : BinaryRRFe<"fiebr", 0xB357, FP32, FP32>;		def FIEBR : BinaryRRFe<"fiebr", 0xB357, FP32, FP32>;
def FIDBR : BinaryRRFe<"fidbr", 0xB35F, FP64, FP64>;		def FIDBR : BinaryRRFe<"fidbr", 0xB35F, FP64, FP64>;
def FIXBR : BinaryRRFe<"fixbr", 0xB347, FP128, FP128>;		def FIXBR : BinaryRRFe<"fixbr", 0xB347, FP128, FP128>;
}		}

// frint rounds according to the current mode (modifier 0) and detects		// frint rounds according to the current mode (modifier 0) and detects
// inexact conditions.		// inexact conditions.
def : Pat<(frint FP32:$src), (FIEBR 0, FP32:$src)>;		def : Pat<(any_frint FP32:$src), (FIEBR 0, FP32:$src)>;
def : Pat<(frint FP64:$src), (FIDBR 0, FP64:$src)>;		def : Pat<(any_frint FP64:$src), (FIDBR 0, FP64:$src)>;
def : Pat<(frint FP128:$src), (FIXBR 0, FP128:$src)>;		def : Pat<(any_frint FP128:$src), (FIXBR 0, FP128:$src)>;

let Predicates = [FeatureFPExtension] in {		let Predicates = [FeatureFPExtension] in {
// Extended forms of the FIxBR instructions. M4 can be set to 4		// Extended forms of the FIxBR instructions. M4 can be set to 4
// to suppress detection of inexact conditions.		// to suppress detection of inexact conditions.
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def FIEBRA : TernaryRRFe<"fiebra", 0xB357, FP32, FP32>;		def FIEBRA : TernaryRRFe<"fiebra", 0xB357, FP32, FP32>;
def FIDBRA : TernaryRRFe<"fidbra", 0xB35F, FP64, FP64>;		def FIDBRA : TernaryRRFe<"fidbra", 0xB35F, FP64, FP64>;
def FIXBRA : TernaryRRFe<"fixbra", 0xB347, FP128, FP128>;		def FIXBRA : TernaryRRFe<"fixbra", 0xB347, FP128, FP128>;
}		}

// fnearbyint is like frint but does not detect inexact conditions.		// fnearbyint is like frint but does not detect inexact conditions.
def : Pat<(fnearbyint FP32:$src), (FIEBRA 0, FP32:$src, 4)>;		def : Pat<(any_fnearbyint FP32:$src), (FIEBRA 0, FP32:$src, 4)>;
def : Pat<(fnearbyint FP64:$src), (FIDBRA 0, FP64:$src, 4)>;		def : Pat<(any_fnearbyint FP64:$src), (FIDBRA 0, FP64:$src, 4)>;
def : Pat<(fnearbyint FP128:$src), (FIXBRA 0, FP128:$src, 4)>;		def : Pat<(any_fnearbyint FP128:$src), (FIXBRA 0, FP128:$src, 4)>;

// floor is no longer allowed to raise an inexact condition,		// floor is no longer allowed to raise an inexact condition,
// so restrict it to the cases where the condition can be suppressed.		// so restrict it to the cases where the condition can be suppressed.
// Mode 7 is round towards -inf.		// Mode 7 is round towards -inf.
def : Pat<(ffloor FP32:$src), (FIEBRA 7, FP32:$src, 4)>;		def : Pat<(any_ffloor FP32:$src), (FIEBRA 7, FP32:$src, 4)>;
def : Pat<(ffloor FP64:$src), (FIDBRA 7, FP64:$src, 4)>;		def : Pat<(any_ffloor FP64:$src), (FIDBRA 7, FP64:$src, 4)>;
def : Pat<(ffloor FP128:$src), (FIXBRA 7, FP128:$src, 4)>;		def : Pat<(any_ffloor FP128:$src), (FIXBRA 7, FP128:$src, 4)>;

// Same idea for ceil, where mode 6 is round towards +inf.		// Same idea for ceil, where mode 6 is round towards +inf.
def : Pat<(fceil FP32:$src), (FIEBRA 6, FP32:$src, 4)>;		def : Pat<(any_fceil FP32:$src), (FIEBRA 6, FP32:$src, 4)>;
def : Pat<(fceil FP64:$src), (FIDBRA 6, FP64:$src, 4)>;		def : Pat<(any_fceil FP64:$src), (FIDBRA 6, FP64:$src, 4)>;
def : Pat<(fceil FP128:$src), (FIXBRA 6, FP128:$src, 4)>;		def : Pat<(any_fceil FP128:$src), (FIXBRA 6, FP128:$src, 4)>;

// Same idea for trunc, where mode 5 is round towards zero.		// Same idea for trunc, where mode 5 is round towards zero.
def : Pat<(ftrunc FP32:$src), (FIEBRA 5, FP32:$src, 4)>;		def : Pat<(any_ftrunc FP32:$src), (FIEBRA 5, FP32:$src, 4)>;
def : Pat<(ftrunc FP64:$src), (FIDBRA 5, FP64:$src, 4)>;		def : Pat<(any_ftrunc FP64:$src), (FIDBRA 5, FP64:$src, 4)>;
def : Pat<(ftrunc FP128:$src), (FIXBRA 5, FP128:$src, 4)>;		def : Pat<(any_ftrunc FP128:$src), (FIXBRA 5, FP128:$src, 4)>;

// Same idea for round, where mode 1 is round towards nearest with		// Same idea for round, where mode 1 is round towards nearest with
// ties away from zero.		// ties away from zero.
def : Pat<(fround FP32:$src), (FIEBRA 1, FP32:$src, 4)>;		def : Pat<(any_fround FP32:$src), (FIEBRA 1, FP32:$src, 4)>;
def : Pat<(fround FP64:$src), (FIDBRA 1, FP64:$src, 4)>;		def : Pat<(any_fround FP64:$src), (FIDBRA 1, FP64:$src, 4)>;
def : Pat<(fround FP128:$src), (FIXBRA 1, FP128:$src, 4)>;		def : Pat<(any_fround FP128:$src), (FIXBRA 1, FP128:$src, 4)>;
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Binary arithmetic		// Binary arithmetic
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

// Addition.		// Addition.
let Uses = [FPC], Defs = [CC], CCValues = 0xF, CompareZeroCCMask = 0xF in {		let Uses = [FPC], mayRaiseFPException = 1,
		Defs = [CC], CCValues = 0xF, CompareZeroCCMask = 0xF in {
let isCommutable = 1 in {		let isCommutable = 1 in {
def AEBR : BinaryRRE<"aebr", 0xB30A, fadd, FP32, FP32>;		def AEBR : BinaryRRE<"aebr", 0xB30A, any_fadd, FP32, FP32>;
def ADBR : BinaryRRE<"adbr", 0xB31A, fadd, FP64, FP64>;		def ADBR : BinaryRRE<"adbr", 0xB31A, any_fadd, FP64, FP64>;
def AXBR : BinaryRRE<"axbr", 0xB34A, fadd, FP128, FP128>;		def AXBR : BinaryRRE<"axbr", 0xB34A, any_fadd, FP128, FP128>;
}		}
def AEB : BinaryRXE<"aeb", 0xED0A, fadd, FP32, load, 4>;		def AEB : BinaryRXE<"aeb", 0xED0A, any_fadd, FP32, load, 4>;
def ADB : BinaryRXE<"adb", 0xED1A, fadd, FP64, load, 8>;		def ADB : BinaryRXE<"adb", 0xED1A, any_fadd, FP64, load, 8>;
}		}

// Subtraction.		// Subtraction.
let Uses = [FPC], Defs = [CC], CCValues = 0xF, CompareZeroCCMask = 0xF in {		let Uses = [FPC], mayRaiseFPException = 1,
		Defs = [CC], CCValues = 0xF, CompareZeroCCMask = 0xF in {
def SEBR : BinaryRRE<"sebr", 0xB30B, fsub, FP32, FP32>;		def SEBR : BinaryRRE<"sebr", 0xB30B, any_fsub, FP32, FP32>;
def SDBR : BinaryRRE<"sdbr", 0xB31B, fsub, FP64, FP64>;		def SDBR : BinaryRRE<"sdbr", 0xB31B, any_fsub, FP64, FP64>;
def SXBR : BinaryRRE<"sxbr", 0xB34B, fsub, FP128, FP128>;		def SXBR : BinaryRRE<"sxbr", 0xB34B, any_fsub, FP128, FP128>;

def SEB : BinaryRXE<"seb", 0xED0B, fsub, FP32, load, 4>;		def SEB : BinaryRXE<"seb", 0xED0B, any_fsub, FP32, load, 4>;
def SDB : BinaryRXE<"sdb", 0xED1B, fsub, FP64, load, 8>;		def SDB : BinaryRXE<"sdb", 0xED1B, any_fsub, FP64, load, 8>;
}		}

// Multiplication.		// Multiplication.
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
let isCommutable = 1 in {		let isCommutable = 1 in {
def MEEBR : BinaryRRE<"meebr", 0xB317, fmul, FP32, FP32>;		def MEEBR : BinaryRRE<"meebr", 0xB317, any_fmul, FP32, FP32>;
def MDBR : BinaryRRE<"mdbr", 0xB31C, fmul, FP64, FP64>;		def MDBR : BinaryRRE<"mdbr", 0xB31C, any_fmul, FP64, FP64>;
def MXBR : BinaryRRE<"mxbr", 0xB34C, fmul, FP128, FP128>;		def MXBR : BinaryRRE<"mxbr", 0xB34C, any_fmul, FP128, FP128>;
}		}
def MEEB : BinaryRXE<"meeb", 0xED17, fmul, FP32, load, 4>;		def MEEB : BinaryRXE<"meeb", 0xED17, any_fmul, FP32, load, 4>;
def MDB : BinaryRXE<"mdb", 0xED1C, fmul, FP64, load, 8>;		def MDB : BinaryRXE<"mdb", 0xED1C, any_fmul, FP64, load, 8>;
}		}

// f64 multiplication of two FP32 registers.		// f64 multiplication of two FP32 registers.
let Uses = [FPC] in		let Uses = [FPC], mayRaiseFPException = 1 in
def MDEBR : BinaryRRE<"mdebr", 0xB30C, null_frag, FP64, FP32>;		def MDEBR : BinaryRRE<"mdebr", 0xB30C, null_frag, FP64, FP32>;
def : Pat<(fmul (f64 (fpextend FP32:$src1)), (f64 (fpextend FP32:$src2))),		def : Pat<(any_fmul (f64 (fpextend FP32:$src1)),
		(f64 (fpextend FP32:$src2))),
(MDEBR (INSERT_SUBREG (f64 (IMPLICIT_DEF)),		(MDEBR (INSERT_SUBREG (f64 (IMPLICIT_DEF)),
FP32:$src1, subreg_h32), FP32:$src2)>;		FP32:$src1, subreg_h32), FP32:$src2)>;

// f64 multiplication of an FP32 register and an f32 memory.		// f64 multiplication of an FP32 register and an f32 memory.
let Uses = [FPC] in		let Uses = [FPC], mayRaiseFPException = 1 in
def MDEB : BinaryRXE<"mdeb", 0xED0C, null_frag, FP64, load, 4>;		def MDEB : BinaryRXE<"mdeb", 0xED0C, null_frag, FP64, load, 4>;
def : Pat<(fmul (f64 (fpextend FP32:$src1)),		def : Pat<(any_fmul (f64 (fpextend FP32:$src1)),
(f64 (extloadf32 bdxaddr12only:$addr))),		(f64 (extloadf32 bdxaddr12only:$addr))),
(MDEB (INSERT_SUBREG (f64 (IMPLICIT_DEF)), FP32:$src1, subreg_h32),		(MDEB (INSERT_SUBREG (f64 (IMPLICIT_DEF)), FP32:$src1, subreg_h32),
bdxaddr12only:$addr)>;		bdxaddr12only:$addr)>;

// f128 multiplication of two FP64 registers.		// f128 multiplication of two FP64 registers.
let Uses = [FPC] in		let Uses = [FPC], mayRaiseFPException = 1 in
def MXDBR : BinaryRRE<"mxdbr", 0xB307, null_frag, FP128, FP64>;		def MXDBR : BinaryRRE<"mxdbr", 0xB307, null_frag, FP128, FP64>;
let Predicates = [FeatureNoVectorEnhancements1] in		let Predicates = [FeatureNoVectorEnhancements1] in
def : Pat<(fmul (f128 (fpextend FP64:$src1)), (f128 (fpextend FP64:$src2))),		def : Pat<(any_fmul (f128 (fpextend FP64:$src1)),
		(f128 (fpextend FP64:$src2))),
(MXDBR (INSERT_SUBREG (f128 (IMPLICIT_DEF)),		(MXDBR (INSERT_SUBREG (f128 (IMPLICIT_DEF)),
FP64:$src1, subreg_h64), FP64:$src2)>;		FP64:$src1, subreg_h64), FP64:$src2)>;

// f128 multiplication of an FP64 register and an f64 memory.		// f128 multiplication of an FP64 register and an f64 memory.
let Uses = [FPC] in		let Uses = [FPC], mayRaiseFPException = 1 in
def MXDB : BinaryRXE<"mxdb", 0xED07, null_frag, FP128, load, 8>;		def MXDB : BinaryRXE<"mxdb", 0xED07, null_frag, FP128, load, 8>;
let Predicates = [FeatureNoVectorEnhancements1] in		let Predicates = [FeatureNoVectorEnhancements1] in
def : Pat<(fmul (f128 (fpextend FP64:$src1)),		def : Pat<(any_fmul (f128 (fpextend FP64:$src1)),
(f128 (extloadf64 bdxaddr12only:$addr))),		(f128 (extloadf64 bdxaddr12only:$addr))),
(MXDB (INSERT_SUBREG (f128 (IMPLICIT_DEF)), FP64:$src1, subreg_h64),		(MXDB (INSERT_SUBREG (f128 (IMPLICIT_DEF)), FP64:$src1, subreg_h64),
bdxaddr12only:$addr)>;		bdxaddr12only:$addr)>;

// Fused multiply-add.		// Fused multiply-add.
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def MAEBR : TernaryRRD<"maebr", 0xB30E, z_fma, FP32, FP32>;		def MAEBR : TernaryRRD<"maebr", 0xB30E, z_any_fma, FP32, FP32>;
def MADBR : TernaryRRD<"madbr", 0xB31E, z_fma, FP64, FP64>;		def MADBR : TernaryRRD<"madbr", 0xB31E, z_any_fma, FP64, FP64>;

def MAEB : TernaryRXF<"maeb", 0xED0E, z_fma, FP32, FP32, load, 4>;		def MAEB : TernaryRXF<"maeb", 0xED0E, z_any_fma, FP32, FP32, load, 4>;
def MADB : TernaryRXF<"madb", 0xED1E, z_fma, FP64, FP64, load, 8>;		def MADB : TernaryRXF<"madb", 0xED1E, z_any_fma, FP64, FP64, load, 8>;
}		}

// Fused multiply-subtract.		// Fused multiply-subtract.
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def MSEBR : TernaryRRD<"msebr", 0xB30F, z_fms, FP32, FP32>;		def MSEBR : TernaryRRD<"msebr", 0xB30F, z_any_fms, FP32, FP32>;
def MSDBR : TernaryRRD<"msdbr", 0xB31F, z_fms, FP64, FP64>;		def MSDBR : TernaryRRD<"msdbr", 0xB31F, z_any_fms, FP64, FP64>;

def MSEB : TernaryRXF<"mseb", 0xED0F, z_fms, FP32, FP32, load, 4>;		def MSEB : TernaryRXF<"mseb", 0xED0F, z_any_fms, FP32, FP32, load, 4>;
def MSDB : TernaryRXF<"msdb", 0xED1F, z_fms, FP64, FP64, load, 8>;		def MSDB : TernaryRXF<"msdb", 0xED1F, z_any_fms, FP64, FP64, load, 8>;
}		}

// Division.		// Division.
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def DEBR : BinaryRRE<"debr", 0xB30D, fdiv, FP32, FP32>;		def DEBR : BinaryRRE<"debr", 0xB30D, any_fdiv, FP32, FP32>;
def DDBR : BinaryRRE<"ddbr", 0xB31D, fdiv, FP64, FP64>;		def DDBR : BinaryRRE<"ddbr", 0xB31D, any_fdiv, FP64, FP64>;
def DXBR : BinaryRRE<"dxbr", 0xB34D, fdiv, FP128, FP128>;		def DXBR : BinaryRRE<"dxbr", 0xB34D, any_fdiv, FP128, FP128>;

def DEB : BinaryRXE<"deb", 0xED0D, fdiv, FP32, load, 4>;		def DEB : BinaryRXE<"deb", 0xED0D, any_fdiv, FP32, load, 4>;
def DDB : BinaryRXE<"ddb", 0xED1D, fdiv, FP64, load, 8>;		def DDB : BinaryRXE<"ddb", 0xED1D, any_fdiv, FP64, load, 8>;
}		}

// Divide to integer.		// Divide to integer.
let Uses = [FPC], Defs = [CC] in {		let Uses = [FPC], mayRaiseFPException = 1, Defs = [CC] in {
def DIEBR : TernaryRRFb<"diebr", 0xB353, FP32, FP32, FP32>;		def DIEBR : TernaryRRFb<"diebr", 0xB353, FP32, FP32, FP32>;
def DIDBR : TernaryRRFb<"didbr", 0xB35B, FP64, FP64, FP64>;		def DIDBR : TernaryRRFb<"didbr", 0xB35B, FP64, FP64, FP64>;
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Comparisons		// Comparisons
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

let Uses = [FPC], Defs = [CC], CCValues = 0xF in {		let Uses = [FPC], mayRaiseFPException = 1, Defs = [CC], CCValues = 0xF in {
def CEBR : CompareRRE<"cebr", 0xB309, z_fcmp, FP32, FP32>;		def CEBR : CompareRRE<"cebr", 0xB309, z_fcmp, FP32, FP32>;
def CDBR : CompareRRE<"cdbr", 0xB319, z_fcmp, FP64, FP64>;		def CDBR : CompareRRE<"cdbr", 0xB319, z_fcmp, FP64, FP64>;
def CXBR : CompareRRE<"cxbr", 0xB349, z_fcmp, FP128, FP128>;		def CXBR : CompareRRE<"cxbr", 0xB349, z_fcmp, FP128, FP128>;

def CEB : CompareRXE<"ceb", 0xED09, z_fcmp, FP32, load, 4>;		def CEB : CompareRXE<"ceb", 0xED09, z_fcmp, FP32, load, 4>;
def CDB : CompareRXE<"cdb", 0xED19, z_fcmp, FP64, load, 8>;		def CDB : CompareRXE<"cdb", 0xED19, z_fcmp, FP64, load, 8>;

def KEBR : CompareRRE<"kebr", 0xB308, null_frag, FP32, FP32>;		def KEBR : CompareRRE<"kebr", 0xB308, null_frag, FP32, FP32>;
[24 lines not shown]		let mayLoad = 1, mayStore = 1 in {
}		}

let Defs = [FPC] in {		let Defs = [FPC] in {
def SFPC : SideEffectUnaryRRE<"sfpc", 0xB384, GR32, int_s390_sfpc>;		def SFPC : SideEffectUnaryRRE<"sfpc", 0xB384, GR32, int_s390_sfpc>;
def LFPC : SideEffectUnaryS<"lfpc", 0xB29D, loadu<int_s390_sfpc>, 4>;		def LFPC : SideEffectUnaryS<"lfpc", 0xB29D, loadu<int_s390_sfpc>, 4>;
}		}
}		}

let Defs = [FPC] in {		let Defs = [FPC], mayRaiseFPException = 1 in {
def SFASR : SideEffectUnaryRRE<"sfasr", 0xB385, GR32, null_frag>;		def SFASR : SideEffectUnaryRRE<"sfasr", 0xB385, GR32, null_frag>;
def LFAS : SideEffectUnaryS<"lfas", 0xB2BD, null_frag, 4>;		def LFAS : SideEffectUnaryS<"lfas", 0xB2BD, null_frag, 4>;
}		}

let Uses = [FPC], Defs = [FPC] in {		let Uses = [FPC], Defs = [FPC] in {
def SRNMB : SideEffectAddressS<"srnmb", 0xB2B8, null_frag, shift12only>,		def SRNMB : SideEffectAddressS<"srnmb", 0xB2B8, null_frag, shift12only>,
Requires<[FeatureFPExtension]>;		Requires<[FeatureFPExtension]>;
def SRNM : SideEffectAddressS<"srnm", 0xB299, null_frag, shift12only>;		def SRNM : SideEffectAddressS<"srnm", 0xB299, null_frag, shift12only>;
[11 lines not shown]
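
The recurring edit in the hunks above swaps a plain selection-DAG operator such as `fadd` for its `any_` counterpart and adds `mayRaiseFPException = 1` to the enclosing `let`. The fragment below is a minimal sketch of how such an `any_` operator could be written, assuming it is a `PatFrags` that accepts either the constrained (`strict_fadd`) or the ordinary (`fadd`) node. The name `example_any_fadd` is hypothetical and chosen only to avoid clashing with the real definition; the definition that actually ships with this change lives in TargetSelectionDAG.td and may differ.

```tablegen
// Sketch only: assumes the generic SelectionDAG definitions are available.
include "llvm/Target/TargetSelectionDAG.td"

// Hypothetical stand-in for any_fadd. A PatFrags lists several alternative
// DAG shapes, so one instruction pattern covers both the strict node (which
// carries a chain) and the non-strict node.
def example_any_fadd : PatFrags<(ops node:$lhs, node:$rhs),
                                [(strict_fadd node:$lhs, node:$rhs),
                                 (fadd node:$lhs, node:$rhs)]>;
```

With a fragment of this shape, a single definition such as AEBR can select both forms, and the `mayRaiseFPException = 1` flag on the instruction records that it may trap, which is why the two edits appear together throughout the diff.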

llvm/trunk/lib/Target/SystemZ/SystemZInstrVector.td

[918 lines not shown]

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Floating-point arithmetic		// Floating-point arithmetic
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

// See comments in SystemZInstrFP.td for the suppression flags and		// See comments in SystemZInstrFP.td for the suppression flags and
// rounding modes.		// rounding modes.
multiclass VectorRounding<Instruction insn, TypedReg tr> {		multiclass VectorRounding<Instruction insn, TypedReg tr> {
def : FPConversion<insn, frint, tr, tr, 0, 0>;		def : FPConversion<insn, any_frint, tr, tr, 0, 0>;
def : FPConversion<insn, fnearbyint, tr, tr, 4, 0>;		def : FPConversion<insn, any_fnearbyint, tr, tr, 4, 0>;
def : FPConversion<insn, ffloor, tr, tr, 4, 7>;		def : FPConversion<insn, any_ffloor, tr, tr, 4, 7>;
def : FPConversion<insn, fceil, tr, tr, 4, 6>;		def : FPConversion<insn, any_fceil, tr, tr, 4, 6>;
def : FPConversion<insn, ftrunc, tr, tr, 4, 5>;		def : FPConversion<insn, any_ftrunc, tr, tr, 4, 5>;
def : FPConversion<insn, fround, tr, tr, 4, 1>;		def : FPConversion<insn, any_fround, tr, tr, 4, 1>;
}		}

let Predicates = [FeatureVector] in {		let Predicates = [FeatureVector] in {
// Add.		// Add.
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def VFA : BinaryVRRcFloatGeneric<"vfa", 0xE7E3>;		def VFA : BinaryVRRcFloatGeneric<"vfa", 0xE7E3>;
def VFADB : BinaryVRRc<"vfadb", 0xE7E3, fadd, v128db, v128db, 3, 0>;		def VFADB : BinaryVRRc<"vfadb", 0xE7E3, any_fadd, v128db, v128db, 3, 0>;
def WFADB : BinaryVRRc<"wfadb", 0xE7E3, fadd, v64db, v64db, 3, 8>;		def WFADB : BinaryVRRc<"wfadb", 0xE7E3, any_fadd, v64db, v64db, 3, 8>;
let Predicates = [FeatureVectorEnhancements1] in {		let Predicates = [FeatureVectorEnhancements1] in {
def VFASB : BinaryVRRc<"vfasb", 0xE7E3, fadd, v128sb, v128sb, 2, 0>;		def VFASB : BinaryVRRc<"vfasb", 0xE7E3, any_fadd, v128sb, v128sb, 2, 0>;
def WFASB : BinaryVRRc<"wfasb", 0xE7E3, fadd, v32sb, v32sb, 2, 8>;		def WFASB : BinaryVRRc<"wfasb", 0xE7E3, any_fadd, v32sb, v32sb, 2, 8>;
def WFAXB : BinaryVRRc<"wfaxb", 0xE7E3, fadd, v128xb, v128xb, 4, 8>;		def WFAXB : BinaryVRRc<"wfaxb", 0xE7E3, any_fadd, v128xb, v128xb, 4, 8>;
}		}
}		}

// Convert from fixed 64-bit.		// Convert from fixed 64-bit.
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def VCDG : TernaryVRRaFloatGeneric<"vcdg", 0xE7C3>;		def VCDG : TernaryVRRaFloatGeneric<"vcdg", 0xE7C3>;
def VCDGB : TernaryVRRa<"vcdgb", 0xE7C3, null_frag, v128db, v128g, 3, 0>;		def VCDGB : TernaryVRRa<"vcdgb", 0xE7C3, null_frag, v128db, v128g, 3, 0>;
def WCDGB : TernaryVRRa<"wcdgb", 0xE7C3, null_frag, v64db, v64g, 3, 8>;		def WCDGB : TernaryVRRa<"wcdgb", 0xE7C3, null_frag, v64db, v64g, 3, 8>;
}		}
def : FPConversion<VCDGB, sint_to_fp, v128db, v128g, 0, 0>;		def : FPConversion<VCDGB, sint_to_fp, v128db, v128g, 0, 0>;

// Convert from logical 64-bit.		// Convert from logical 64-bit.
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def VCDLG : TernaryVRRaFloatGeneric<"vcdlg", 0xE7C1>;		def VCDLG : TernaryVRRaFloatGeneric<"vcdlg", 0xE7C1>;
def VCDLGB : TernaryVRRa<"vcdlgb", 0xE7C1, null_frag, v128db, v128g, 3, 0>;		def VCDLGB : TernaryVRRa<"vcdlgb", 0xE7C1, null_frag, v128db, v128g, 3, 0>;
def WCDLGB : TernaryVRRa<"wcdlgb", 0xE7C1, null_frag, v64db, v64g, 3, 8>;		def WCDLGB : TernaryVRRa<"wcdlgb", 0xE7C1, null_frag, v64db, v64g, 3, 8>;
}		}
def : FPConversion<VCDLGB, uint_to_fp, v128db, v128g, 0, 0>;		def : FPConversion<VCDLGB, uint_to_fp, v128db, v128g, 0, 0>;

// Convert to fixed 64-bit.		// Convert to fixed 64-bit.
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def VCGD : TernaryVRRaFloatGeneric<"vcgd", 0xE7C2>;		def VCGD : TernaryVRRaFloatGeneric<"vcgd", 0xE7C2>;
def VCGDB : TernaryVRRa<"vcgdb", 0xE7C2, null_frag, v128g, v128db, 3, 0>;		def VCGDB : TernaryVRRa<"vcgdb", 0xE7C2, null_frag, v128g, v128db, 3, 0>;
def WCGDB : TernaryVRRa<"wcgdb", 0xE7C2, null_frag, v64g, v64db, 3, 8>;		def WCGDB : TernaryVRRa<"wcgdb", 0xE7C2, null_frag, v64g, v64db, 3, 8>;
}		}
// Rounding mode should agree with SystemZInstrFP.td.		// Rounding mode should agree with SystemZInstrFP.td.
def : FPConversion<VCGDB, fp_to_sint, v128g, v128db, 0, 5>;		def : FPConversion<VCGDB, fp_to_sint, v128g, v128db, 0, 5>;

// Convert to logical 64-bit.		// Convert to logical 64-bit.
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def VCLGD : TernaryVRRaFloatGeneric<"vclgd", 0xE7C0>;		def VCLGD : TernaryVRRaFloatGeneric<"vclgd", 0xE7C0>;
def VCLGDB : TernaryVRRa<"vclgdb", 0xE7C0, null_frag, v128g, v128db, 3, 0>;		def VCLGDB : TernaryVRRa<"vclgdb", 0xE7C0, null_frag, v128g, v128db, 3, 0>;
def WCLGDB : TernaryVRRa<"wclgdb", 0xE7C0, null_frag, v64g, v64db, 3, 8>;		def WCLGDB : TernaryVRRa<"wclgdb", 0xE7C0, null_frag, v64g, v64db, 3, 8>;
}		}
// Rounding mode should agree with SystemZInstrFP.td.		// Rounding mode should agree with SystemZInstrFP.td.
def : FPConversion<VCLGDB, fp_to_uint, v128g, v128db, 0, 5>;		def : FPConversion<VCLGDB, fp_to_uint, v128g, v128db, 0, 5>;

// Divide.		// Divide.
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def VFD : BinaryVRRcFloatGeneric<"vfd", 0xE7E5>;		def VFD : BinaryVRRcFloatGeneric<"vfd", 0xE7E5>;
def VFDDB : BinaryVRRc<"vfddb", 0xE7E5, fdiv, v128db, v128db, 3, 0>;		def VFDDB : BinaryVRRc<"vfddb", 0xE7E5, any_fdiv, v128db, v128db, 3, 0>;
def WFDDB : BinaryVRRc<"wfddb", 0xE7E5, fdiv, v64db, v64db, 3, 8>;		def WFDDB : BinaryVRRc<"wfddb", 0xE7E5, any_fdiv, v64db, v64db, 3, 8>;
let Predicates = [FeatureVectorEnhancements1] in {		let Predicates = [FeatureVectorEnhancements1] in {
def VFDSB : BinaryVRRc<"vfdsb", 0xE7E5, fdiv, v128sb, v128sb, 2, 0>;		def VFDSB : BinaryVRRc<"vfdsb", 0xE7E5, any_fdiv, v128sb, v128sb, 2, 0>;
def WFDSB : BinaryVRRc<"wfdsb", 0xE7E5, fdiv, v32sb, v32sb, 2, 8>;		def WFDSB : BinaryVRRc<"wfdsb", 0xE7E5, any_fdiv, v32sb, v32sb, 2, 8>;
def WFDXB : BinaryVRRc<"wfdxb", 0xE7E5, fdiv, v128xb, v128xb, 4, 8>;		def WFDXB : BinaryVRRc<"wfdxb", 0xE7E5, any_fdiv, v128xb, v128xb, 4, 8>;
}		}
}		}

// Load FP integer.		// Load FP integer.
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def VFI : TernaryVRRaFloatGeneric<"vfi", 0xE7C7>;		def VFI : TernaryVRRaFloatGeneric<"vfi", 0xE7C7>;
def VFIDB : TernaryVRRa<"vfidb", 0xE7C7, int_s390_vfidb, v128db, v128db, 3, 0>;		def VFIDB : TernaryVRRa<"vfidb", 0xE7C7, int_s390_vfidb, v128db, v128db, 3, 0>;
def WFIDB : TernaryVRRa<"wfidb", 0xE7C7, null_frag, v64db, v64db, 3, 8>;		def WFIDB : TernaryVRRa<"wfidb", 0xE7C7, null_frag, v64db, v64db, 3, 8>;
}		}
defm : VectorRounding<VFIDB, v128db>;		defm : VectorRounding<VFIDB, v128db>;
defm : VectorRounding<WFIDB, v64db>;		defm : VectorRounding<WFIDB, v64db>;
let Predicates = [FeatureVectorEnhancements1] in {		let Predicates = [FeatureVectorEnhancements1] in {
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def VFISB : TernaryVRRa<"vfisb", 0xE7C7, int_s390_vfisb, v128sb, v128sb, 2, 0>;		def VFISB : TernaryVRRa<"vfisb", 0xE7C7, int_s390_vfisb, v128sb, v128sb, 2, 0>;
def WFISB : TernaryVRRa<"wfisb", 0xE7C7, null_frag, v32sb, v32sb, 2, 8>;		def WFISB : TernaryVRRa<"wfisb", 0xE7C7, null_frag, v32sb, v32sb, 2, 8>;
def WFIXB : TernaryVRRa<"wfixb", 0xE7C7, null_frag, v128xb, v128xb, 4, 8>;		def WFIXB : TernaryVRRa<"wfixb", 0xE7C7, null_frag, v128xb, v128xb, 4, 8>;
}		}
defm : VectorRounding<VFISB, v128sb>;		defm : VectorRounding<VFISB, v128sb>;
defm : VectorRounding<WFISB, v32sb>;		defm : VectorRounding<WFISB, v32sb>;
defm : VectorRounding<WFIXB, v128xb>;		defm : VectorRounding<WFIXB, v128xb>;
}		}

// Load lengthened.		// Load lengthened.
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def VLDE : UnaryVRRaFloatGeneric<"vlde", 0xE7C4>;		def VLDE : UnaryVRRaFloatGeneric<"vlde", 0xE7C4>;
def VLDEB : UnaryVRRa<"vldeb", 0xE7C4, z_vextend, v128db, v128sb, 2, 0>;		def VLDEB : UnaryVRRa<"vldeb", 0xE7C4, z_vextend, v128db, v128sb, 2, 0>;
def WLDEB : UnaryVRRa<"wldeb", 0xE7C4, fpextend, v64db, v32sb, 2, 8>;		def WLDEB : UnaryVRRa<"wldeb", 0xE7C4, any_fpextend, v64db, v32sb, 2, 8>;
}		}
let Predicates = [FeatureVectorEnhancements1] in {		let Predicates = [FeatureVectorEnhancements1] in {
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
let isAsmParserOnly = 1 in {		let isAsmParserOnly = 1 in {
def VFLL : UnaryVRRaFloatGeneric<"vfll", 0xE7C4>;		def VFLL : UnaryVRRaFloatGeneric<"vfll", 0xE7C4>;
def VFLLS : UnaryVRRa<"vflls", 0xE7C4, null_frag, v128db, v128sb, 2, 0>;		def VFLLS : UnaryVRRa<"vflls", 0xE7C4, null_frag, v128db, v128sb, 2, 0>;
def WFLLS : UnaryVRRa<"wflls", 0xE7C4, null_frag, v64db, v32sb, 2, 8>;		def WFLLS : UnaryVRRa<"wflls", 0xE7C4, null_frag, v64db, v32sb, 2, 8>;
}		}
def WFLLD : UnaryVRRa<"wflld", 0xE7C4, fpextend, v128xb, v64db, 3, 8>;		def WFLLD : UnaryVRRa<"wflld", 0xE7C4, any_fpextend, v128xb, v64db, 3, 8>;
}		}
def : Pat<(f128 (fpextend (f32 VR32:$src))),		def : Pat<(f128 (any_fpextend (f32 VR32:$src))),
(WFLLD (WLDEB VR32:$src))>;		(WFLLD (WLDEB VR32:$src))>;
}		}

// Load rounded.		// Load rounded.
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def VLED : TernaryVRRaFloatGeneric<"vled", 0xE7C5>;		def VLED : TernaryVRRaFloatGeneric<"vled", 0xE7C5>;
def VLEDB : TernaryVRRa<"vledb", 0xE7C5, null_frag, v128sb, v128db, 3, 0>;		def VLEDB : TernaryVRRa<"vledb", 0xE7C5, null_frag, v128sb, v128db, 3, 0>;
def WLEDB : TernaryVRRa<"wledb", 0xE7C5, null_frag, v32sb, v64db, 3, 8>;		def WLEDB : TernaryVRRa<"wledb", 0xE7C5, null_frag, v32sb, v64db, 3, 8>;
}		}
def : Pat<(v4f32 (z_vround (v2f64 VR128:$src))), (VLEDB VR128:$src, 0, 0)>;		def : Pat<(v4f32 (z_vround (v2f64 VR128:$src))), (VLEDB VR128:$src, 0, 0)>;
def : FPConversion<WLEDB, fpround, v32sb, v64db, 0, 0>;		def : FPConversion<WLEDB, any_fpround, v32sb, v64db, 0, 0>;
let Predicates = [FeatureVectorEnhancements1] in {		let Predicates = [FeatureVectorEnhancements1] in {
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
let isAsmParserOnly = 1 in {		let isAsmParserOnly = 1 in {
def VFLR : TernaryVRRaFloatGeneric<"vflr", 0xE7C5>;		def VFLR : TernaryVRRaFloatGeneric<"vflr", 0xE7C5>;
def VFLRD : TernaryVRRa<"vflrd", 0xE7C5, null_frag, v128sb, v128db, 3, 0>;		def VFLRD : TernaryVRRa<"vflrd", 0xE7C5, null_frag, v128sb, v128db, 3, 0>;
def WFLRD : TernaryVRRa<"wflrd", 0xE7C5, null_frag, v32sb, v64db, 3, 8>;		def WFLRD : TernaryVRRa<"wflrd", 0xE7C5, null_frag, v32sb, v64db, 3, 8>;
}		}
def WFLRX : TernaryVRRa<"wflrx", 0xE7C5, null_frag, v64db, v128xb, 4, 8>;		def WFLRX : TernaryVRRa<"wflrx", 0xE7C5, null_frag, v64db, v128xb, 4, 8>;
}		}
def : FPConversion<WFLRX, fpround, v64db, v128xb, 0, 0>;		def : FPConversion<WFLRX, any_fpround, v64db, v128xb, 0, 0>;
def : Pat<(f32 (fpround (f128 VR128:$src))),		def : Pat<(f32 (any_fpround (f128 VR128:$src))),
(WLEDB (WFLRX VR128:$src, 0, 3), 0, 0)>;		(WLEDB (WFLRX VR128:$src, 0, 3), 0, 0)>;
}		}

// Maximum.		// Maximum.
multiclass VectorMax<Instruction insn, TypedReg tr> {		multiclass VectorMax<Instruction insn, TypedReg tr> {
def : FPMinMax<insn, fmaxnum, tr, 4>;		def : FPMinMax<insn, any_fmaxnum, tr, 4>;
def : FPMinMax<insn, fmaximum, tr, 1>;		def : FPMinMax<insn, fmaximum, tr, 1>;
}		}
let Predicates = [FeatureVectorEnhancements1] in {		let Predicates = [FeatureVectorEnhancements1] in {
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def VFMAX : TernaryVRRcFloatGeneric<"vfmax", 0xE7EF>;		def VFMAX : TernaryVRRcFloatGeneric<"vfmax", 0xE7EF>;
def VFMAXDB : TernaryVRRcFloat<"vfmaxdb", 0xE7EF, int_s390_vfmaxdb,		def VFMAXDB : TernaryVRRcFloat<"vfmaxdb", 0xE7EF, int_s390_vfmaxdb,
v128db, v128db, 3, 0>;		v128db, v128db, 3, 0>;
def WFMAXDB : TernaryVRRcFloat<"wfmaxdb", 0xE7EF, null_frag,		def WFMAXDB : TernaryVRRcFloat<"wfmaxdb", 0xE7EF, null_frag,
v64db, v64db, 3, 8>;		v64db, v64db, 3, 8>;
def VFMAXSB : TernaryVRRcFloat<"vfmaxsb", 0xE7EF, int_s390_vfmaxsb,		def VFMAXSB : TernaryVRRcFloat<"vfmaxsb", 0xE7EF, int_s390_vfmaxsb,
v128sb, v128sb, 2, 0>;		v128sb, v128sb, 2, 0>;
def WFMAXSB : TernaryVRRcFloat<"wfmaxsb", 0xE7EF, null_frag,		def WFMAXSB : TernaryVRRcFloat<"wfmaxsb", 0xE7EF, null_frag,
v32sb, v32sb, 2, 8>;		v32sb, v32sb, 2, 8>;
def WFMAXXB : TernaryVRRcFloat<"wfmaxxb", 0xE7EF, null_frag,		def WFMAXXB : TernaryVRRcFloat<"wfmaxxb", 0xE7EF, null_frag,
v128xb, v128xb, 4, 8>;		v128xb, v128xb, 4, 8>;
}		}
defm : VectorMax<VFMAXDB, v128db>;		defm : VectorMax<VFMAXDB, v128db>;
defm : VectorMax<WFMAXDB, v64db>;		defm : VectorMax<WFMAXDB, v64db>;
defm : VectorMax<VFMAXSB, v128sb>;		defm : VectorMax<VFMAXSB, v128sb>;
defm : VectorMax<WFMAXSB, v32sb>;		defm : VectorMax<WFMAXSB, v32sb>;
defm : VectorMax<WFMAXXB, v128xb>;		defm : VectorMax<WFMAXXB, v128xb>;
}		}

// Minimum.		// Minimum.
multiclass VectorMin<Instruction insn, TypedReg tr> {		multiclass VectorMin<Instruction insn, TypedReg tr> {
def : FPMinMax<insn, fminnum, tr, 4>;		def : FPMinMax<insn, any_fminnum, tr, 4>;
def : FPMinMax<insn, fminimum, tr, 1>;		def : FPMinMax<insn, fminimum, tr, 1>;
}		}
let Predicates = [FeatureVectorEnhancements1] in {		let Predicates = [FeatureVectorEnhancements1] in {
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def VFMIN : TernaryVRRcFloatGeneric<"vfmin", 0xE7EE>;		def VFMIN : TernaryVRRcFloatGeneric<"vfmin", 0xE7EE>;
def VFMINDB : TernaryVRRcFloat<"vfmindb", 0xE7EE, int_s390_vfmindb,		def VFMINDB : TernaryVRRcFloat<"vfmindb", 0xE7EE, int_s390_vfmindb,
v128db, v128db, 3, 0>;		v128db, v128db, 3, 0>;
def WFMINDB : TernaryVRRcFloat<"wfmindb", 0xE7EE, null_frag,		def WFMINDB : TernaryVRRcFloat<"wfmindb", 0xE7EE, null_frag,
v64db, v64db, 3, 8>;		v64db, v64db, 3, 8>;
def VFMINSB : TernaryVRRcFloat<"vfminsb", 0xE7EE, int_s390_vfminsb,		def VFMINSB : TernaryVRRcFloat<"vfminsb", 0xE7EE, int_s390_vfminsb,
v128sb, v128sb, 2, 0>;		v128sb, v128sb, 2, 0>;
def WFMINSB : TernaryVRRcFloat<"wfminsb", 0xE7EE, null_frag,		def WFMINSB : TernaryVRRcFloat<"wfminsb", 0xE7EE, null_frag,
v32sb, v32sb, 2, 8>;		v32sb, v32sb, 2, 8>;
def WFMINXB : TernaryVRRcFloat<"wfminxb", 0xE7EE, null_frag,		def WFMINXB : TernaryVRRcFloat<"wfminxb", 0xE7EE, null_frag,
v128xb, v128xb, 4, 8>;		v128xb, v128xb, 4, 8>;
}		}
defm : VectorMin<VFMINDB, v128db>;		defm : VectorMin<VFMINDB, v128db>;
defm : VectorMin<WFMINDB, v64db>;		defm : VectorMin<WFMINDB, v64db>;
defm : VectorMin<VFMINSB, v128sb>;		defm : VectorMin<VFMINSB, v128sb>;
defm : VectorMin<WFMINSB, v32sb>;		defm : VectorMin<WFMINSB, v32sb>;
defm : VectorMin<WFMINXB, v128xb>;		defm : VectorMin<WFMINXB, v128xb>;
}		}

// Multiply.		// Multiply.
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def VFM : BinaryVRRcFloatGeneric<"vfm", 0xE7E7>;		def VFM : BinaryVRRcFloatGeneric<"vfm", 0xE7E7>;
def VFMDB : BinaryVRRc<"vfmdb", 0xE7E7, fmul, v128db, v128db, 3, 0>;		def VFMDB : BinaryVRRc<"vfmdb", 0xE7E7, any_fmul, v128db, v128db, 3, 0>;
def WFMDB : BinaryVRRc<"wfmdb", 0xE7E7, fmul, v64db, v64db, 3, 8>;		def WFMDB : BinaryVRRc<"wfmdb", 0xE7E7, any_fmul, v64db, v64db, 3, 8>;
let Predicates = [FeatureVectorEnhancements1] in {		let Predicates = [FeatureVectorEnhancements1] in {
def VFMSB : BinaryVRRc<"vfmsb", 0xE7E7, fmul, v128sb, v128sb, 2, 0>;		def VFMSB : BinaryVRRc<"vfmsb", 0xE7E7, any_fmul, v128sb, v128sb, 2, 0>;
def WFMSB : BinaryVRRc<"wfmsb", 0xE7E7, fmul, v32sb, v32sb, 2, 8>;		def WFMSB : BinaryVRRc<"wfmsb", 0xE7E7, any_fmul, v32sb, v32sb, 2, 8>;
def WFMXB : BinaryVRRc<"wfmxb", 0xE7E7, fmul, v128xb, v128xb, 4, 8>;		def WFMXB : BinaryVRRc<"wfmxb", 0xE7E7, any_fmul, v128xb, v128xb, 4, 8>;
}		}
}		}

// Multiply and add.		// Multiply and add.
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def VFMA : TernaryVRReFloatGeneric<"vfma", 0xE78F>;		def VFMA : TernaryVRReFloatGeneric<"vfma", 0xE78F>;
def VFMADB : TernaryVRRe<"vfmadb", 0xE78F, fma, v128db, v128db, 0, 3>;		def VFMADB : TernaryVRRe<"vfmadb", 0xE78F, any_fma, v128db, v128db, 0, 3>;
def WFMADB : TernaryVRRe<"wfmadb", 0xE78F, fma, v64db, v64db, 8, 3>;		def WFMADB : TernaryVRRe<"wfmadb", 0xE78F, any_fma, v64db, v64db, 8, 3>;
let Predicates = [FeatureVectorEnhancements1] in {		let Predicates = [FeatureVectorEnhancements1] in {
def VFMASB : TernaryVRRe<"vfmasb", 0xE78F, fma, v128sb, v128sb, 0, 2>;		def VFMASB : TernaryVRRe<"vfmasb", 0xE78F, any_fma, v128sb, v128sb, 0, 2>;
def WFMASB : TernaryVRRe<"wfmasb", 0xE78F, fma, v32sb, v32sb, 8, 2>;		def WFMASB : TernaryVRRe<"wfmasb", 0xE78F, any_fma, v32sb, v32sb, 8, 2>;
def WFMAXB : TernaryVRRe<"wfmaxb", 0xE78F, fma, v128xb, v128xb, 8, 4>;		def WFMAXB : TernaryVRRe<"wfmaxb", 0xE78F, any_fma, v128xb, v128xb, 8, 4>;
}		}
}		}

// Multiply and subtract.		// Multiply and subtract.
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def VFMS : TernaryVRReFloatGeneric<"vfms", 0xE78E>;		def VFMS : TernaryVRReFloatGeneric<"vfms", 0xE78E>;
def VFMSDB : TernaryVRRe<"vfmsdb", 0xE78E, fms, v128db, v128db, 0, 3>;		def VFMSDB : TernaryVRRe<"vfmsdb", 0xE78E, any_fms, v128db, v128db, 0, 3>;
def WFMSDB : TernaryVRRe<"wfmsdb", 0xE78E, fms, v64db, v64db, 8, 3>;		def WFMSDB : TernaryVRRe<"wfmsdb", 0xE78E, any_fms, v64db, v64db, 8, 3>;
let Predicates = [FeatureVectorEnhancements1] in {		let Predicates = [FeatureVectorEnhancements1] in {
def VFMSSB : TernaryVRRe<"vfmssb", 0xE78E, fms, v128sb, v128sb, 0, 2>;		def VFMSSB : TernaryVRRe<"vfmssb", 0xE78E, any_fms, v128sb, v128sb, 0, 2>;
def WFMSSB : TernaryVRRe<"wfmssb", 0xE78E, fms, v32sb, v32sb, 8, 2>;		def WFMSSB : TernaryVRRe<"wfmssb", 0xE78E, any_fms, v32sb, v32sb, 8, 2>;
def WFMSXB : TernaryVRRe<"wfmsxb", 0xE78E, fms, v128xb, v128xb, 8, 4>;		def WFMSXB : TernaryVRRe<"wfmsxb", 0xE78E, any_fms, v128xb, v128xb, 8, 4>;
}		}
}		}

// Negative multiply and add.		// Negative multiply and add.
let Uses = [FPC], Predicates = [FeatureVectorEnhancements1] in {		let Uses = [FPC], mayRaiseFPException = 1,
		Predicates = [FeatureVectorEnhancements1] in {
def VFNMA : TernaryVRReFloatGeneric<"vfnma", 0xE79F>;		def VFNMA : TernaryVRReFloatGeneric<"vfnma", 0xE79F>;
def VFNMADB : TernaryVRRe<"vfnmadb", 0xE79F, fnma, v128db, v128db, 0, 3>;		def VFNMADB : TernaryVRRe<"vfnmadb", 0xE79F, any_fnma, v128db, v128db, 0, 3>;
def WFNMADB : TernaryVRRe<"wfnmadb", 0xE79F, fnma, v64db, v64db, 8, 3>;		def WFNMADB : TernaryVRRe<"wfnmadb", 0xE79F, any_fnma, v64db, v64db, 8, 3>;
def VFNMASB : TernaryVRRe<"vfnmasb", 0xE79F, fnma, v128sb, v128sb, 0, 2>;		def VFNMASB : TernaryVRRe<"vfnmasb", 0xE79F, any_fnma, v128sb, v128sb, 0, 2>;
def WFNMASB : TernaryVRRe<"wfnmasb", 0xE79F, fnma, v32sb, v32sb, 8, 2>;		def WFNMASB : TernaryVRRe<"wfnmasb", 0xE79F, any_fnma, v32sb, v32sb, 8, 2>;
def WFNMAXB : TernaryVRRe<"wfnmaxb", 0xE79F, fnma, v128xb, v128xb, 8, 4>;		def WFNMAXB : TernaryVRRe<"wfnmaxb", 0xE79F, any_fnma, v128xb, v128xb, 8, 4>;
}		}

// Negative multiply and subtract.		// Negative multiply and subtract.
let Uses = [FPC], Predicates = [FeatureVectorEnhancements1] in {		let Uses = [FPC], mayRaiseFPException = 1,
		Predicates = [FeatureVectorEnhancements1] in {
def VFNMS : TernaryVRReFloatGeneric<"vfnms", 0xE79E>;		def VFNMS : TernaryVRReFloatGeneric<"vfnms", 0xE79E>;
def VFNMSDB : TernaryVRRe<"vfnmsdb", 0xE79E, fnms, v128db, v128db, 0, 3>;		def VFNMSDB : TernaryVRRe<"vfnmsdb", 0xE79E, any_fnms, v128db, v128db, 0, 3>;
def WFNMSDB : TernaryVRRe<"wfnmsdb", 0xE79E, fnms, v64db, v64db, 8, 3>;		def WFNMSDB : TernaryVRRe<"wfnmsdb", 0xE79E, any_fnms, v64db, v64db, 8, 3>;
def VFNMSSB : TernaryVRRe<"vfnmssb", 0xE79E, fnms, v128sb, v128sb, 0, 2>;		def VFNMSSB : TernaryVRRe<"vfnmssb", 0xE79E, any_fnms, v128sb, v128sb, 0, 2>;
def WFNMSSB : TernaryVRRe<"wfnmssb", 0xE79E, fnms, v32sb, v32sb, 8, 2>;		def WFNMSSB : TernaryVRRe<"wfnmssb", 0xE79E, any_fnms, v32sb, v32sb, 8, 2>;
def WFNMSXB : TernaryVRRe<"wfnmsxb", 0xE79E, fnms, v128xb, v128xb, 8, 4>;		def WFNMSXB : TernaryVRRe<"wfnmsxb", 0xE79E, any_fnms, v128xb, v128xb, 8, 4>;
}		}

// Perform sign operation.		// Perform sign operation.
def VFPSO : BinaryVRRaFloatGeneric<"vfpso", 0xE7CC>;		def VFPSO : BinaryVRRaFloatGeneric<"vfpso", 0xE7CC>;
def VFPSODB : BinaryVRRa<"vfpsodb", 0xE7CC, null_frag, v128db, v128db, 3, 0>;		def VFPSODB : BinaryVRRa<"vfpsodb", 0xE7CC, null_frag, v128db, v128db, 3, 0>;
def WFPSODB : BinaryVRRa<"wfpsodb", 0xE7CC, null_frag, v64db, v64db, 3, 8>;		def WFPSODB : BinaryVRRa<"wfpsodb", 0xE7CC, null_frag, v64db, v64db, 3, 8>;
let Predicates = [FeatureVectorEnhancements1] in {		let Predicates = [FeatureVectorEnhancements1] in {
def VFPSOSB : BinaryVRRa<"vfpsosb", 0xE7CC, null_frag, v128sb, v128sb, 2, 0>;		def VFPSOSB : BinaryVRRa<"vfpsosb", 0xE7CC, null_frag, v128sb, v128sb, 2, 0>;
[... 24 lines not shown ...]	let Predicates = [FeatureVector] in {
def WFLPDB : UnaryVRRa<"wflpdb", 0xE7CC, fabs, v64db, v64db, 3, 8, 2>;		def WFLPDB : UnaryVRRa<"wflpdb", 0xE7CC, fabs, v64db, v64db, 3, 8, 2>;
let Predicates = [FeatureVectorEnhancements1] in {		let Predicates = [FeatureVectorEnhancements1] in {
def VFLPSB : UnaryVRRa<"vflpsb", 0xE7CC, fabs, v128sb, v128sb, 2, 0, 2>;		def VFLPSB : UnaryVRRa<"vflpsb", 0xE7CC, fabs, v128sb, v128sb, 2, 0, 2>;
def WFLPSB : UnaryVRRa<"wflpsb", 0xE7CC, fabs, v32sb, v32sb, 2, 8, 2>;		def WFLPSB : UnaryVRRa<"wflpsb", 0xE7CC, fabs, v32sb, v32sb, 2, 8, 2>;
def WFLPXB : UnaryVRRa<"wflpxb", 0xE7CC, fabs, v128xb, v128xb, 4, 8, 2>;		def WFLPXB : UnaryVRRa<"wflpxb", 0xE7CC, fabs, v128xb, v128xb, 4, 8, 2>;
}		}

// Square root.		// Square root.
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def VFSQ : UnaryVRRaFloatGeneric<"vfsq", 0xE7CE>;		def VFSQ : UnaryVRRaFloatGeneric<"vfsq", 0xE7CE>;
def VFSQDB : UnaryVRRa<"vfsqdb", 0xE7CE, fsqrt, v128db, v128db, 3, 0>;		def VFSQDB : UnaryVRRa<"vfsqdb", 0xE7CE, any_fsqrt, v128db, v128db, 3, 0>;
def WFSQDB : UnaryVRRa<"wfsqdb", 0xE7CE, fsqrt, v64db, v64db, 3, 8>;		def WFSQDB : UnaryVRRa<"wfsqdb", 0xE7CE, any_fsqrt, v64db, v64db, 3, 8>;
let Predicates = [FeatureVectorEnhancements1] in {		let Predicates = [FeatureVectorEnhancements1] in {
def VFSQSB : UnaryVRRa<"vfsqsb", 0xE7CE, fsqrt, v128sb, v128sb, 2, 0>;		def VFSQSB : UnaryVRRa<"vfsqsb", 0xE7CE, any_fsqrt, v128sb, v128sb, 2, 0>;
def WFSQSB : UnaryVRRa<"wfsqsb", 0xE7CE, fsqrt, v32sb, v32sb, 2, 8>;		def WFSQSB : UnaryVRRa<"wfsqsb", 0xE7CE, any_fsqrt, v32sb, v32sb, 2, 8>;
def WFSQXB : UnaryVRRa<"wfsqxb", 0xE7CE, fsqrt, v128xb, v128xb, 4, 8>;		def WFSQXB : UnaryVRRa<"wfsqxb", 0xE7CE, any_fsqrt, v128xb, v128xb, 4, 8>;
}		}
}		}

// Subtract.		// Subtract.
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def VFS : BinaryVRRcFloatGeneric<"vfs", 0xE7E2>;		def VFS : BinaryVRRcFloatGeneric<"vfs", 0xE7E2>;
def VFSDB : BinaryVRRc<"vfsdb", 0xE7E2, fsub, v128db, v128db, 3, 0>;		def VFSDB : BinaryVRRc<"vfsdb", 0xE7E2, any_fsub, v128db, v128db, 3, 0>;
def WFSDB : BinaryVRRc<"wfsdb", 0xE7E2, fsub, v64db, v64db, 3, 8>;		def WFSDB : BinaryVRRc<"wfsdb", 0xE7E2, any_fsub, v64db, v64db, 3, 8>;
let Predicates = [FeatureVectorEnhancements1] in {		let Predicates = [FeatureVectorEnhancements1] in {
def VFSSB : BinaryVRRc<"vfssb", 0xE7E2, fsub, v128sb, v128sb, 2, 0>;		def VFSSB : BinaryVRRc<"vfssb", 0xE7E2, any_fsub, v128sb, v128sb, 2, 0>;
def WFSSB : BinaryVRRc<"wfssb", 0xE7E2, fsub, v32sb, v32sb, 2, 8>;		def WFSSB : BinaryVRRc<"wfssb", 0xE7E2, any_fsub, v32sb, v32sb, 2, 8>;
def WFSXB : BinaryVRRc<"wfsxb", 0xE7E2, fsub, v128xb, v128xb, 4, 8>;		def WFSXB : BinaryVRRc<"wfsxb", 0xE7E2, any_fsub, v128xb, v128xb, 4, 8>;
}		}
}		}

// Test data class immediate.		// Test data class immediate.
let Defs = [CC] in {		let Defs = [CC] in {
def VFTCI : BinaryVRIeFloatGeneric<"vftci", 0xE74A>;		def VFTCI : BinaryVRIeFloatGeneric<"vftci", 0xE74A>;
def VFTCIDB : BinaryVRIe<"vftcidb", 0xE74A, z_vftci, v128g, v128db, 3, 0>;		def VFTCIDB : BinaryVRIe<"vftcidb", 0xE74A, z_vftci, v128g, v128db, 3, 0>;
def WFTCIDB : BinaryVRIe<"wftcidb", 0xE74A, null_frag, v64g, v64db, 3, 8>;		def WFTCIDB : BinaryVRIe<"wftcidb", 0xE74A, null_frag, v64g, v64db, 3, 8>;
let Predicates = [FeatureVectorEnhancements1] in {		let Predicates = [FeatureVectorEnhancements1] in {
def VFTCISB : BinaryVRIe<"vftcisb", 0xE74A, z_vftci, v128f, v128sb, 2, 0>;		def VFTCISB : BinaryVRIe<"vftcisb", 0xE74A, z_vftci, v128f, v128sb, 2, 0>;
def WFTCISB : BinaryVRIe<"wftcisb", 0xE74A, null_frag, v32f, v32sb, 2, 8>;		def WFTCISB : BinaryVRIe<"wftcisb", 0xE74A, null_frag, v32f, v32sb, 2, 8>;
def WFTCIXB : BinaryVRIe<"wftcixb", 0xE74A, null_frag, v128q, v128xb, 4, 8>;		def WFTCIXB : BinaryVRIe<"wftcixb", 0xE74A, null_frag, v128q, v128xb, 4, 8>;
}		}
}		}
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Floating-point comparison		// Floating-point comparison
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

let Predicates = [FeatureVector] in {		let Predicates = [FeatureVector] in {
// Compare scalar.		// Compare scalar.
let Uses = [FPC], Defs = [CC] in {		let Uses = [FPC], mayRaiseFPException = 1, Defs = [CC] in {
def WFC : CompareVRRaFloatGeneric<"wfc", 0xE7CB>;		def WFC : CompareVRRaFloatGeneric<"wfc", 0xE7CB>;
def WFCDB : CompareVRRa<"wfcdb", 0xE7CB, z_fcmp, v64db, 3>;		def WFCDB : CompareVRRa<"wfcdb", 0xE7CB, z_fcmp, v64db, 3>;
let Predicates = [FeatureVectorEnhancements1] in {		let Predicates = [FeatureVectorEnhancements1] in {
def WFCSB : CompareVRRa<"wfcsb", 0xE7CB, z_fcmp, v32sb, 2>;		def WFCSB : CompareVRRa<"wfcsb", 0xE7CB, z_fcmp, v32sb, 2>;
def WFCXB : CompareVRRa<"wfcxb", 0xE7CB, z_fcmp, v128xb, 4>;		def WFCXB : CompareVRRa<"wfcxb", 0xE7CB, z_fcmp, v128xb, 4>;
}		}
}		}

// Compare and signal scalar.		// Compare and signal scalar.
let Uses = [FPC], Defs = [CC] in {		let Uses = [FPC], mayRaiseFPException = 1, Defs = [CC] in {
def WFK : CompareVRRaFloatGeneric<"wfk", 0xE7CA>;		def WFK : CompareVRRaFloatGeneric<"wfk", 0xE7CA>;
def WFKDB : CompareVRRa<"wfkdb", 0xE7CA, null_frag, v64db, 3>;		def WFKDB : CompareVRRa<"wfkdb", 0xE7CA, null_frag, v64db, 3>;
let Predicates = [FeatureVectorEnhancements1] in {		let Predicates = [FeatureVectorEnhancements1] in {
def WFKSB : CompareVRRa<"wfksb", 0xE7CA, null_frag, v32sb, 2>;		def WFKSB : CompareVRRa<"wfksb", 0xE7CA, null_frag, v32sb, 2>;
def WFKXB : CompareVRRa<"wfkxb", 0xE7CA, null_frag, v128xb, 4>;		def WFKXB : CompareVRRa<"wfkxb", 0xE7CA, null_frag, v128xb, 4>;
}		}
}		}

// Compare equal.		// Compare equal.
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def VFCE : BinaryVRRcSPairFloatGeneric<"vfce", 0xE7E8>;		def VFCE : BinaryVRRcSPairFloatGeneric<"vfce", 0xE7E8>;
defm VFCEDB : BinaryVRRcSPair<"vfcedb", 0xE7E8, z_vfcmpe, z_vfcmpes,		defm VFCEDB : BinaryVRRcSPair<"vfcedb", 0xE7E8, z_vfcmpe, z_vfcmpes,
v128g, v128db, 3, 0>;		v128g, v128db, 3, 0>;
defm WFCEDB : BinaryVRRcSPair<"wfcedb", 0xE7E8, null_frag, null_frag,		defm WFCEDB : BinaryVRRcSPair<"wfcedb", 0xE7E8, null_frag, null_frag,
v64g, v64db, 3, 8>;		v64g, v64db, 3, 8>;
let Predicates = [FeatureVectorEnhancements1] in {		let Predicates = [FeatureVectorEnhancements1] in {
defm VFCESB : BinaryVRRcSPair<"vfcesb", 0xE7E8, z_vfcmpe, z_vfcmpes,		defm VFCESB : BinaryVRRcSPair<"vfcesb", 0xE7E8, z_vfcmpe, z_vfcmpes,
v128f, v128sb, 2, 0>;		v128f, v128sb, 2, 0>;
defm WFCESB : BinaryVRRcSPair<"wfcesb", 0xE7E8, null_frag, null_frag,		defm WFCESB : BinaryVRRcSPair<"wfcesb", 0xE7E8, null_frag, null_frag,
v32f, v32sb, 2, 8>;		v32f, v32sb, 2, 8>;
defm WFCEXB : BinaryVRRcSPair<"wfcexb", 0xE7E8, null_frag, null_frag,		defm WFCEXB : BinaryVRRcSPair<"wfcexb", 0xE7E8, null_frag, null_frag,
v128q, v128xb, 4, 8>;		v128q, v128xb, 4, 8>;
}		}
}		}

// Compare and signal equal.		// Compare and signal equal.
let Uses = [FPC], Predicates = [FeatureVectorEnhancements1] in {		let Uses = [FPC], mayRaiseFPException = 1,
		Predicates = [FeatureVectorEnhancements1] in {
defm VFKEDB : BinaryVRRcSPair<"vfkedb", 0xE7E8, null_frag, null_frag,		defm VFKEDB : BinaryVRRcSPair<"vfkedb", 0xE7E8, null_frag, null_frag,
v128g, v128db, 3, 4>;		v128g, v128db, 3, 4>;
defm WFKEDB : BinaryVRRcSPair<"wfkedb", 0xE7E8, null_frag, null_frag,		defm WFKEDB : BinaryVRRcSPair<"wfkedb", 0xE7E8, null_frag, null_frag,
v64g, v64db, 3, 12>;		v64g, v64db, 3, 12>;
defm VFKESB : BinaryVRRcSPair<"vfkesb", 0xE7E8, null_frag, null_frag,		defm VFKESB : BinaryVRRcSPair<"vfkesb", 0xE7E8, null_frag, null_frag,
v128f, v128sb, 2, 4>;		v128f, v128sb, 2, 4>;
defm WFKESB : BinaryVRRcSPair<"wfkesb", 0xE7E8, null_frag, null_frag,		defm WFKESB : BinaryVRRcSPair<"wfkesb", 0xE7E8, null_frag, null_frag,
v32f, v32sb, 2, 12>;		v32f, v32sb, 2, 12>;
defm WFKEXB : BinaryVRRcSPair<"wfkexb", 0xE7E8, null_frag, null_frag,		defm WFKEXB : BinaryVRRcSPair<"wfkexb", 0xE7E8, null_frag, null_frag,
v128q, v128xb, 4, 12>;		v128q, v128xb, 4, 12>;
}		}

// Compare high.		// Compare high.
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def VFCH : BinaryVRRcSPairFloatGeneric<"vfch", 0xE7EB>;		def VFCH : BinaryVRRcSPairFloatGeneric<"vfch", 0xE7EB>;
defm VFCHDB : BinaryVRRcSPair<"vfchdb", 0xE7EB, z_vfcmph, z_vfcmphs,		defm VFCHDB : BinaryVRRcSPair<"vfchdb", 0xE7EB, z_vfcmph, z_vfcmphs,
v128g, v128db, 3, 0>;		v128g, v128db, 3, 0>;
defm WFCHDB : BinaryVRRcSPair<"wfchdb", 0xE7EB, null_frag, null_frag,		defm WFCHDB : BinaryVRRcSPair<"wfchdb", 0xE7EB, null_frag, null_frag,
v64g, v64db, 3, 8>;		v64g, v64db, 3, 8>;
let Predicates = [FeatureVectorEnhancements1] in {		let Predicates = [FeatureVectorEnhancements1] in {
defm VFCHSB : BinaryVRRcSPair<"vfchsb", 0xE7EB, z_vfcmph, z_vfcmphs,		defm VFCHSB : BinaryVRRcSPair<"vfchsb", 0xE7EB, z_vfcmph, z_vfcmphs,
v128f, v128sb, 2, 0>;		v128f, v128sb, 2, 0>;
defm WFCHSB : BinaryVRRcSPair<"wfchsb", 0xE7EB, null_frag, null_frag,		defm WFCHSB : BinaryVRRcSPair<"wfchsb", 0xE7EB, null_frag, null_frag,
v32f, v32sb, 2, 8>;		v32f, v32sb, 2, 8>;
defm WFCHXB : BinaryVRRcSPair<"wfchxb", 0xE7EB, null_frag, null_frag,		defm WFCHXB : BinaryVRRcSPair<"wfchxb", 0xE7EB, null_frag, null_frag,
v128q, v128xb, 4, 8>;		v128q, v128xb, 4, 8>;
}		}
}		}

// Compare and signal high.		// Compare and signal high.
let Uses = [FPC], Predicates = [FeatureVectorEnhancements1] in {		let Uses = [FPC], mayRaiseFPException = 1,
		Predicates = [FeatureVectorEnhancements1] in {
defm VFKHDB : BinaryVRRcSPair<"vfkhdb", 0xE7EB, null_frag, null_frag,		defm VFKHDB : BinaryVRRcSPair<"vfkhdb", 0xE7EB, null_frag, null_frag,
v128g, v128db, 3, 4>;		v128g, v128db, 3, 4>;
defm WFKHDB : BinaryVRRcSPair<"wfkhdb", 0xE7EB, null_frag, null_frag,		defm WFKHDB : BinaryVRRcSPair<"wfkhdb", 0xE7EB, null_frag, null_frag,
v64g, v64db, 3, 12>;		v64g, v64db, 3, 12>;
defm VFKHSB : BinaryVRRcSPair<"vfkhsb", 0xE7EB, null_frag, null_frag,		defm VFKHSB : BinaryVRRcSPair<"vfkhsb", 0xE7EB, null_frag, null_frag,
v128f, v128sb, 2, 4>;		v128f, v128sb, 2, 4>;
defm WFKHSB : BinaryVRRcSPair<"wfkhsb", 0xE7EB, null_frag, null_frag,		defm WFKHSB : BinaryVRRcSPair<"wfkhsb", 0xE7EB, null_frag, null_frag,
v32f, v32sb, 2, 12>;		v32f, v32sb, 2, 12>;
defm WFKHXB : BinaryVRRcSPair<"wfkhxb", 0xE7EB, null_frag, null_frag,		defm WFKHXB : BinaryVRRcSPair<"wfkhxb", 0xE7EB, null_frag, null_frag,
v128q, v128xb, 4, 12>;		v128q, v128xb, 4, 12>;
}		}

// Compare high or equal.		// Compare high or equal.
let Uses = [FPC] in {		let Uses = [FPC], mayRaiseFPException = 1 in {
def VFCHE : BinaryVRRcSPairFloatGeneric<"vfche", 0xE7EA>;		def VFCHE : BinaryVRRcSPairFloatGeneric<"vfche", 0xE7EA>;
defm VFCHEDB : BinaryVRRcSPair<"vfchedb", 0xE7EA, z_vfcmphe, z_vfcmphes,		defm VFCHEDB : BinaryVRRcSPair<"vfchedb", 0xE7EA, z_vfcmphe, z_vfcmphes,
v128g, v128db, 3, 0>;		v128g, v128db, 3, 0>;
defm WFCHEDB : BinaryVRRcSPair<"wfchedb", 0xE7EA, null_frag, null_frag,		defm WFCHEDB : BinaryVRRcSPair<"wfchedb", 0xE7EA, null_frag, null_frag,
v64g, v64db, 3, 8>;		v64g, v64db, 3, 8>;
let Predicates = [FeatureVectorEnhancements1] in {		let Predicates = [FeatureVectorEnhancements1] in {
defm VFCHESB : BinaryVRRcSPair<"vfchesb", 0xE7EA, z_vfcmphe, z_vfcmphes,		defm VFCHESB : BinaryVRRcSPair<"vfchesb", 0xE7EA, z_vfcmphe, z_vfcmphes,
v128f, v128sb, 2, 0>;		v128f, v128sb, 2, 0>;
defm WFCHESB : BinaryVRRcSPair<"wfchesb", 0xE7EA, null_frag, null_frag,		defm WFCHESB : BinaryVRRcSPair<"wfchesb", 0xE7EA, null_frag, null_frag,
v32f, v32sb, 2, 8>;		v32f, v32sb, 2, 8>;
defm WFCHEXB : BinaryVRRcSPair<"wfchexb", 0xE7EA, null_frag, null_frag,		defm WFCHEXB : BinaryVRRcSPair<"wfchexb", 0xE7EA, null_frag, null_frag,
v128q, v128xb, 4, 8>;		v128q, v128xb, 4, 8>;
}		}
}		}

// Compare and signal high or equal.		// Compare and signal high or equal.
let Uses = [FPC], Predicates = [FeatureVectorEnhancements1] in {		let Uses = [FPC], mayRaiseFPException = 1,
		Predicates = [FeatureVectorEnhancements1] in {
defm VFKHEDB : BinaryVRRcSPair<"vfkhedb", 0xE7EA, null_frag, null_frag,		defm VFKHEDB : BinaryVRRcSPair<"vfkhedb", 0xE7EA, null_frag, null_frag,
v128g, v128db, 3, 4>;		v128g, v128db, 3, 4>;
defm WFKHEDB : BinaryVRRcSPair<"wfkhedb", 0xE7EA, null_frag, null_frag,		defm WFKHEDB : BinaryVRRcSPair<"wfkhedb", 0xE7EA, null_frag, null_frag,
v64g, v64db, 3, 12>;		v64g, v64db, 3, 12>;
defm VFKHESB : BinaryVRRcSPair<"vfkhesb", 0xE7EA, null_frag, null_frag,		defm VFKHESB : BinaryVRRcSPair<"vfkhesb", 0xE7EA, null_frag, null_frag,
v128f, v128sb, 2, 4>;		v128f, v128sb, 2, 4>;
defm WFKHESB : BinaryVRRcSPair<"wfkhesb", 0xE7EA, null_frag, null_frag,		defm WFKHESB : BinaryVRRcSPair<"wfkhesb", 0xE7EA, null_frag, null_frag,
v32f, v32sb, 2, 12>;		v32f, v32sb, 2, 12>;
[... 249 lines not shown ...]

llvm/trunk/lib/Target/SystemZ/SystemZOperators.td

	[... 656 lines not shown ...]
	def z_ssub : PatFrags<(ops node:$src1, node:$src2),			def z_ssub : PatFrags<(ops node:$src1, node:$src2),
	[(z_ssubo node:$src1, node:$src2),			[(z_ssubo node:$src1, node:$src2),
	(sub node:$src1, node:$src2)]>;			(sub node:$src1, node:$src2)]>;
	def z_usub : PatFrags<(ops node:$src1, node:$src2),			def z_usub : PatFrags<(ops node:$src1, node:$src2),
	[(z_usubo node:$src1, node:$src2),			[(z_usubo node:$src1, node:$src2),
	(sub node:$src1, node:$src2)]>;			(sub node:$src1, node:$src2)]>;

	// Fused multiply-subtract, using the natural operand order.			// Fused multiply-subtract, using the natural operand order.
	def fms : PatFrag<(ops node:$src1, node:$src2, node:$src3),			def any_fms : PatFrag<(ops node:$src1, node:$src2, node:$src3),
	(fma node:$src1, node:$src2, (fneg node:$src3))>;			(any_fma node:$src1, node:$src2, (fneg node:$src3))>;

	// Fused multiply-add and multiply-subtract, but with the order of the			// Fused multiply-add and multiply-subtract, but with the order of the
	// operands matching SystemZ's MA and MS instructions.			// operands matching SystemZ's MA and MS instructions.
	def z_fma : PatFrag<(ops node:$src1, node:$src2, node:$src3),			def z_any_fma : PatFrag<(ops node:$src1, node:$src2, node:$src3),
	(fma node:$src2, node:$src3, node:$src1)>;			(any_fma node:$src2, node:$src3, node:$src1)>;
	def z_fms : PatFrag<(ops node:$src1, node:$src2, node:$src3),			def z_any_fms : PatFrag<(ops node:$src1, node:$src2, node:$src3),
	(fma node:$src2, node:$src3, (fneg node:$src1))>;			(any_fma node:$src2, node:$src3, (fneg node:$src1))>;

	// Negative fused multiply-add and multiply-subtract.			// Negative fused multiply-add and multiply-subtract.
	def fnma : PatFrag<(ops node:$src1, node:$src2, node:$src3),			def any_fnma : PatFrag<(ops node:$src1, node:$src2, node:$src3),
	(fneg (fma node:$src1, node:$src2, node:$src3))>;			(fneg (any_fma node:$src1, node:$src2, node:$src3))>;
	def fnms : PatFrag<(ops node:$src1, node:$src2, node:$src3),			def any_fnms : PatFrag<(ops node:$src1, node:$src2, node:$src3),
	(fneg (fms node:$src1, node:$src2, node:$src3))>;			(fneg (any_fms node:$src1, node:$src2, node:$src3))>;

	// Floating-point negative absolute.			// Floating-point negative absolute.
	def fnabs : PatFrag<(ops node:$ptr), (fneg (fabs node:$ptr))>;			def fnabs : PatFrag<(ops node:$ptr), (fneg (fabs node:$ptr))>;

	// Create a unary operator that loads from memory and then performs			// Create a unary operator that loads from memory and then performs
	// the given operation on it.			// the given operation on it.
	class loadu<SDPatternOperator operator, SDPatternOperator load = load>			class loadu<SDPatternOperator operator, SDPatternOperator load = load>
	: PatFrag<(ops node:$addr), (operator (load node:$addr))>;			: PatFrag<(ops node:$addr), (operator (load node:$addr))>;
	[... 144 lines not shown ...]
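
As a reading aid for the operand orders in the fragments above, here is a minimal Python sketch that models each one over exact rationals. It is an illustration only: `fma` is a hypothetical stand-in that computes a * b + c exactly, and the model ignores rounding and floating-point exceptions, which is the behaviour the `any_` variants are concerned with.

```python
from fractions import Fraction


def fma(a, b, c):
    """Hypothetical exact model of a fused multiply-add: a * b + c."""
    return a * b + c


def any_fms(src1, src2, src3):
    # Natural operand order: src1 * src2 - src3.
    return fma(src1, src2, -src3)


def z_any_fma(src1, src2, src3):
    # SystemZ MA order: the addend comes first, so src2 * src3 + src1.
    return fma(src2, src3, src1)


def z_any_fms(src1, src2, src3):
    # SystemZ MS order: src2 * src3 - src1.
    return fma(src2, src3, -src1)


def any_fnma(src1, src2, src3):
    # Negated multiply-add: -(src1 * src2 + src3).
    return -fma(src1, src2, src3)


def any_fnms(src1, src2, src3):
    # Negated multiply-subtract: -(src1 * src2 - src3).
    return -any_fms(src1, src2, src3)


a, b, c = Fraction(3), Fraction(5), Fraction(7)
```

With these inputs, `any_fms(a, b, c)` and `z_any_fms(a, b, c)` differ only in which operand is treated as the value being subtracted.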

llvm/trunk/test/CodeGen/SystemZ/fp-strict-add-01.ll

				; Test 32-bit floating-point strict addition.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z10 \
				; RUN:   | FileCheck -check-prefix=CHECK -check-prefix=CHECK-SCALAR %s
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z14 | FileCheck %s

				declare float @foo()
				declare float @llvm.experimental.constrained.fadd.f32(float, float, metadata, metadata)

				; Check register addition.
				define float @f1(float %f1, float %f2) {
				; CHECK-LABEL: f1:
				; CHECK: aebr %f0, %f2
				; CHECK: br %r14
				%res = call float @llvm.experimental.constrained.fadd.f32(
				float %f1, float %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Check the low end of the AEB range.
				define float @f2(float %f1, float *%ptr) {
				; CHECK-LABEL: f2:
				; CHECK: aeb %f0, 0(%r2)
				; CHECK: br %r14
				%f2 = load float, float *%ptr
				%res = call float @llvm.experimental.constrained.fadd.f32(
				float %f1, float %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Check the high end of the aligned AEB range.
				define float @f3(float %f1, float *%base) {
				; CHECK-LABEL: f3:
				; CHECK: aeb %f0, 4092(%r2)
				; CHECK: br %r14
				%ptr = getelementptr float, float *%base, i64 1023
				%f2 = load float, float *%ptr
				%res = call float @llvm.experimental.constrained.fadd.f32(
				float %f1, float %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Check the next word up, which needs separate address logic.
				; Other sequences besides this one would be OK.
				define float @f4(float %f1, float *%base) {
				; CHECK-LABEL: f4:
				; CHECK: aghi %r2, 4096
				; CHECK: aeb %f0, 0(%r2)
				; CHECK: br %r14
				%ptr = getelementptr float, float *%base, i64 1024
				%f2 = load float, float *%ptr
				%res = call float @llvm.experimental.constrained.fadd.f32(
				float %f1, float %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Check negative displacements, which also need separate address logic.
				define float @f5(float %f1, float *%base) {
				; CHECK-LABEL: f5:
				; CHECK: aghi %r2, -4
				; CHECK: aeb %f0, 0(%r2)
				; CHECK: br %r14
				%ptr = getelementptr float, float *%base, i64 -1
				%f2 = load float, float *%ptr
				%res = call float @llvm.experimental.constrained.fadd.f32(
				float %f1, float %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Check that AEB allows indices.
				define float @f6(float %f1, float *%base, i64 %index) {
				; CHECK-LABEL: f6:
				; CHECK: sllg %r1, %r3, 2
				; CHECK: aeb %f0, 400(%r1,%r2)
				; CHECK: br %r14
				%ptr1 = getelementptr float, float *%base, i64 %index
				%ptr2 = getelementptr float, float *%ptr1, i64 100
				%f2 = load float, float *%ptr2
				%res = call float @llvm.experimental.constrained.fadd.f32(
				float %f1, float %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Check that additions of spilled values can use AEB rather than AEBR.
				define float @f7(float *%ptr0) {
				; CHECK-LABEL: f7:
				; CHECK: brasl %r14, foo@PLT
				; CHECK-SCALAR: aeb %f0, 16{{[04]}}(%r15)
				; CHECK: br %r14
				%ptr1 = getelementptr float, float *%ptr0, i64 2
				%ptr2 = getelementptr float, float *%ptr0, i64 4
				%ptr3 = getelementptr float, float *%ptr0, i64 6
				%ptr4 = getelementptr float, float *%ptr0, i64 8
				%ptr5 = getelementptr float, float *%ptr0, i64 10
				%ptr6 = getelementptr float, float *%ptr0, i64 12
				%ptr7 = getelementptr float, float *%ptr0, i64 14
				%ptr8 = getelementptr float, float *%ptr0, i64 16
				%ptr9 = getelementptr float, float *%ptr0, i64 18
				%ptr10 = getelementptr float, float *%ptr0, i64 20

				%val0 = load float, float *%ptr0
				%val1 = load float, float *%ptr1
				%val2 = load float, float *%ptr2
				%val3 = load float, float *%ptr3
				%val4 = load float, float *%ptr4
				%val5 = load float, float *%ptr5
				%val6 = load float, float *%ptr6
				%val7 = load float, float *%ptr7
				%val8 = load float, float *%ptr8
				%val9 = load float, float *%ptr9
				%val10 = load float, float *%ptr10

				%ret = call float @foo()

				%add0 = call float @llvm.experimental.constrained.fadd.f32(
				float %ret, float %val0,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%add1 = call float @llvm.experimental.constrained.fadd.f32(
				float %add0, float %val1,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%add2 = call float @llvm.experimental.constrained.fadd.f32(
				float %add1, float %val2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%add3 = call float @llvm.experimental.constrained.fadd.f32(
				float %add2, float %val3,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%add4 = call float @llvm.experimental.constrained.fadd.f32(
				float %add3, float %val4,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%add5 = call float @llvm.experimental.constrained.fadd.f32(
				float %add4, float %val5,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%add6 = call float @llvm.experimental.constrained.fadd.f32(
				float %add5, float %val6,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%add7 = call float @llvm.experimental.constrained.fadd.f32(
				float %add6, float %val7,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%add8 = call float @llvm.experimental.constrained.fadd.f32(
				float %add7, float %val8,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%add9 = call float @llvm.experimental.constrained.fadd.f32(
				float %add8, float %val9,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%add10 = call float @llvm.experimental.constrained.fadd.f32(
				float %add9, float %val10,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")

				ret float %add10
				}
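
The address checks in f2 through f6 follow from AEB taking an unsigned 12-bit displacement (0 to 4095). The arithmetic behind the expected displacements can be sketched as follows; the helper names are hypothetical and 4-byte float elements are assumed.

```python
FLOAT_SIZE = 4      # bytes per float element (assumption for this test file)
DISP12_MAX = 0xFFF  # largest unsigned 12-bit displacement, i.e. 4095


def byte_offset(index, elem_size=FLOAT_SIZE):
    """Byte offset produced by a getelementptr with the given element index."""
    return index * elem_size


def fits_disp12(offset):
    """True if the offset can be encoded directly as an unsigned 12-bit displacement."""
    return 0 <= offset <= DISP12_MAX
```

Index 1023 gives offset 4092, which fits and is folded into the AEB operand (f3). Index 1024 gives 4096 and index -1 gives -4; neither fits, so the tests expect a separate `aghi` on the base register (f4, f5).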

llvm/trunk/test/CodeGen/SystemZ/fp-strict-add-02.ll

				; Test strict 64-bit floating-point addition.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z10 \
				; RUN:   | FileCheck -check-prefix=CHECK -check-prefix=CHECK-SCALAR %s
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z13 -verify-machineinstrs | FileCheck %s
				declare double @foo()
				declare double @llvm.experimental.constrained.fadd.f64(double, double, metadata, metadata)

				; Check register addition.
				define double @f1(double %f1, double %f2) {
				; CHECK-LABEL: f1:
				; CHECK: adbr %f0, %f2
				; CHECK: br %r14
				%res = call double @llvm.experimental.constrained.fadd.f64(
				double %f1, double %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check the low end of the ADB range.
				define double @f2(double %f1, double *%ptr) {
				; CHECK-LABEL: f2:
				; CHECK: adb %f0, 0(%r2)
				; CHECK: br %r14
				%f2 = load double, double *%ptr
				%res = call double @llvm.experimental.constrained.fadd.f64(
				double %f1, double %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check the high end of the aligned ADB range.
				define double @f3(double %f1, double *%base) {
				; CHECK-LABEL: f3:
				; CHECK: adb %f0, 4088(%r2)
				; CHECK: br %r14
				%ptr = getelementptr double, double *%base, i64 511
				%f2 = load double, double *%ptr
				%res = call double @llvm.experimental.constrained.fadd.f64(
				double %f1, double %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check the next doubleword up, which needs separate address logic.
				; Other sequences besides this one would be OK.
				define double @f4(double %f1, double *%base) {
				; CHECK-LABEL: f4:
				; CHECK: aghi %r2, 4096
				; CHECK: adb %f0, 0(%r2)
				; CHECK: br %r14
				%ptr = getelementptr double, double *%base, i64 512
				%f2 = load double, double *%ptr
				%res = call double @llvm.experimental.constrained.fadd.f64(
				double %f1, double %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check negative displacements, which also need separate address logic.
				define double @f5(double %f1, double *%base) {
				; CHECK-LABEL: f5:
				; CHECK: aghi %r2, -8
				; CHECK: adb %f0, 0(%r2)
				; CHECK: br %r14
				%ptr = getelementptr double, double *%base, i64 -1
				%f2 = load double, double *%ptr
				%res = call double @llvm.experimental.constrained.fadd.f64(
				double %f1, double %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check that ADB allows indices.
				define double @f6(double %f1, double *%base, i64 %index) {
				; CHECK-LABEL: f6:
				; CHECK: sllg %r1, %r3, 3
				; CHECK: adb %f0, 800(%r1,%r2)
				; CHECK: br %r14
				%ptr1 = getelementptr double, double *%base, i64 %index
				%ptr2 = getelementptr double, double *%ptr1, i64 100
				%f2 = load double, double *%ptr2
				%res = call double @llvm.experimental.constrained.fadd.f64(
				double %f1, double %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check that additions of spilled values can use ADB rather than ADBR.
				define double @f7(double *%ptr0) {
				; CHECK-LABEL: f7:
				; CHECK: brasl %r14, foo@PLT
				; CHECK-SCALAR: adb %f0, 160(%r15)
				; CHECK: br %r14
				%ptr1 = getelementptr double, double *%ptr0, i64 2
				%ptr2 = getelementptr double, double *%ptr0, i64 4
				%ptr3 = getelementptr double, double *%ptr0, i64 6
				%ptr4 = getelementptr double, double *%ptr0, i64 8
				%ptr5 = getelementptr double, double *%ptr0, i64 10
				%ptr6 = getelementptr double, double *%ptr0, i64 12
				%ptr7 = getelementptr double, double *%ptr0, i64 14
				%ptr8 = getelementptr double, double *%ptr0, i64 16
				%ptr9 = getelementptr double, double *%ptr0, i64 18
				%ptr10 = getelementptr double, double *%ptr0, i64 20

				%val0 = load double, double *%ptr0
				%val1 = load double, double *%ptr1
				%val2 = load double, double *%ptr2
				%val3 = load double, double *%ptr3
				%val4 = load double, double *%ptr4
				%val5 = load double, double *%ptr5
				%val6 = load double, double *%ptr6
				%val7 = load double, double *%ptr7
				%val8 = load double, double *%ptr8
				%val9 = load double, double *%ptr9
				%val10 = load double, double *%ptr10

				%ret = call double @foo()

				%add0 = call double @llvm.experimental.constrained.fadd.f64(
				double %ret, double %val0,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%add1 = call double @llvm.experimental.constrained.fadd.f64(
				double %add0, double %val1,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%add2 = call double @llvm.experimental.constrained.fadd.f64(
				double %add1, double %val2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%add3 = call double @llvm.experimental.constrained.fadd.f64(
				double %add2, double %val3,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%add4 = call double @llvm.experimental.constrained.fadd.f64(
				double %add3, double %val4,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%add5 = call double @llvm.experimental.constrained.fadd.f64(
				double %add4, double %val5,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%add6 = call double @llvm.experimental.constrained.fadd.f64(
				double %add5, double %val6,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%add7 = call double @llvm.experimental.constrained.fadd.f64(
				double %add6, double %val7,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%add8 = call double @llvm.experimental.constrained.fadd.f64(
				double %add7, double %val8,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%add9 = call double @llvm.experimental.constrained.fadd.f64(
				double %add8, double %val9,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%add10 = call double @llvm.experimental.constrained.fadd.f64(
				double %add9, double %val10,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")

				ret double %add10
				}

llvm/trunk/test/CodeGen/SystemZ/fp-strict-add-03.ll

				; Test strict 128-bit floating-point addition.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu | FileCheck %s

				declare fp128 @llvm.experimental.constrained.fadd.f128(fp128, fp128, metadata, metadata)

				; There is no memory form of 128-bit addition.
				define void @f1(fp128 *%ptr, float %f2) {
				; CHECK-LABEL: f1:
				; CHECK-DAG: lxebr %f0, %f0
				; CHECK-DAG: ld %f1, 0(%r2)
				; CHECK-DAG: ld %f3, 8(%r2)
				; CHECK: axbr %f0, %f1
				; CHECK: std %f0, 0(%r2)
				; CHECK: std %f2, 8(%r2)
				; CHECK: br %r14
				%f1 = load fp128, fp128 *%ptr
				%f2x = fpext float %f2 to fp128
				%sum = call fp128 @llvm.experimental.constrained.fadd.f128(
				fp128 %f1, fp128 %f2x,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %sum, fp128 *%ptr
				ret void
				}

llvm/trunk/test/CodeGen/SystemZ/fp-strict-add-04.ll

				; Test strict 128-bit floating-point addition on z14.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z14 | FileCheck %s

				declare fp128 @llvm.experimental.constrained.fadd.f128(fp128, fp128, metadata, metadata)

				define void @f1(fp128 *%ptr1, fp128 *%ptr2) {
				; CHECK-LABEL: f1:
				; CHECK-DAG: vl [[REG1:%v[0-9]+]], 0(%r2)
				; CHECK-DAG: vl [[REG2:%v[0-9]+]], 0(%r3)
				; CHECK: wfaxb [[RES:%v[0-9]+]], [[REG1]], [[REG2]]
				; CHECK: vst [[RES]], 0(%r2)
				; CHECK: br %r14
				%f1 = load fp128, fp128 *%ptr1
				%f2 = load fp128, fp128 *%ptr2
				%sum = call fp128 @llvm.experimental.constrained.fadd.f128(
				fp128 %f1, fp128 %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %sum, fp128 *%ptr1
				ret void
				}

llvm/trunk/test/CodeGen/SystemZ/fp-strict-alias.ll

				; Verify that strict FP operations are not rescheduled
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z13 | FileCheck %s

				declare float @llvm.experimental.constrained.fadd.f32(float, float, metadata, metadata)
				declare float @llvm.experimental.constrained.fsub.f32(float, float, metadata, metadata)
				declare float @llvm.experimental.constrained.sqrt.f32(float, metadata, metadata)
				declare float @llvm.sqrt.f32(float)
				declare void @llvm.s390.sfpc(i32)

				; For non-strict operations, we expect the post-RA scheduler to
				; separate the two square root instructions on z13.
				define void @f1(float %f1, float %f2, float %f3, float %f4, float *%ptr0) {
				; CHECK-LABEL: f1:
				; CHECK: sqebr
				; CHECK: {{aebr|sebr}}
				; CHECK: sqebr
				; CHECK: br %r14

				%add = fadd float %f1, %f2
				%sub = fsub float %f3, %f4
				%sqrt1 = call float @llvm.sqrt.f32(float %f2)
				%sqrt2 = call float @llvm.sqrt.f32(float %f4)

				%ptr1 = getelementptr float, float *%ptr0, i64 1
				%ptr2 = getelementptr float, float *%ptr0, i64 2
				%ptr3 = getelementptr float, float *%ptr0, i64 3

				store float %add, float *%ptr0
				store float %sub, float *%ptr1
				store float %sqrt1, float *%ptr2
				store float %sqrt2, float *%ptr3

				ret void
				}

				; But for strict operations, this must not happen.
				define void @f2(float %f1, float %f2, float %f3, float %f4, float *%ptr0) {
				; CHECK-LABEL: f2:
				; CHECK: {{aebr|sebr}}
				; CHECK: {{aebr|sebr}}
				; CHECK: sqebr
				; CHECK: sqebr
				; CHECK: br %r14

				%add = call float @llvm.experimental.constrained.fadd.f32(
				float %f1, float %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%sub = call float @llvm.experimental.constrained.fsub.f32(
				float %f3, float %f4,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%sqrt1 = call float @llvm.experimental.constrained.sqrt.f32(
				float %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%sqrt2 = call float @llvm.experimental.constrained.sqrt.f32(
				float %f4,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")

				%ptr1 = getelementptr float, float *%ptr0, i64 1
				%ptr2 = getelementptr float, float *%ptr0, i64 2
				%ptr3 = getelementptr float, float *%ptr0, i64 3

				store float %add, float *%ptr0
				store float %sub, float *%ptr1
				store float %sqrt1, float *%ptr2
				store float %sqrt2, float *%ptr3

				ret void
				}

				; On the other hand, strict operations that use the fpexcept.ignore
				; exception behaviour should be scheduled freely.
				define void @f3(float %f1, float %f2, float %f3, float %f4, float *%ptr0) {
				; CHECK-LABEL: f3:
				; CHECK: sqebr
				; CHECK: {{aebr|sebr}}
				; CHECK: sqebr
				; CHECK: br %r14

				%add = call float @llvm.experimental.constrained.fadd.f32(
				float %f1, float %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.ignore")
				%sub = call float @llvm.experimental.constrained.fsub.f32(
				float %f3, float %f4,
				metadata !"round.dynamic",
				metadata !"fpexcept.ignore")
				%sqrt1 = call float @llvm.experimental.constrained.sqrt.f32(
				float %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.ignore")
				%sqrt2 = call float @llvm.experimental.constrained.sqrt.f32(
				float %f4,
				metadata !"round.dynamic",
				metadata !"fpexcept.ignore")

				%ptr1 = getelementptr float, float *%ptr0, i64 1
				%ptr2 = getelementptr float, float *%ptr0, i64 2
				%ptr3 = getelementptr float, float *%ptr0, i64 3

				store float %add, float *%ptr0
				store float %sub, float *%ptr1
				store float %sqrt1, float *%ptr2
				store float %sqrt2, float *%ptr3

				ret void
				}

				; However, even non-strict operations must not be scheduled across an SFPC.
				define void @f4(float %f1, float %f2, float %f3, float %f4, float *%ptr0) {
				; CHECK-LABEL: f4:
				; CHECK: {{aebr|sebr}}
				; CHECK: {{aebr|sebr}}
				; CHECK: sfpc
				; CHECK: sqebr
				; CHECK: sqebr
				; CHECK: br %r14

				%add = fadd float %f1, %f2
				%sub = fsub float %f3, %f4
				call void @llvm.s390.sfpc(i32 0)
				%sqrt1 = call float @llvm.sqrt.f32(float %f2)
				%sqrt2 = call float @llvm.sqrt.f32(float %f4)

				%ptr1 = getelementptr float, float *%ptr0, i64 1
				%ptr2 = getelementptr float, float *%ptr0, i64 2
				%ptr3 = getelementptr float, float *%ptr0, i64 3

				store float %add, float *%ptr0
				store float %sub, float *%ptr1
				store float %sqrt1, float *%ptr2
				store float %sqrt2, float *%ptr3

				ret void
				}
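
The four functions above depend on the order in which FileCheck matches their CHECK lines. As a rough illustration only, the Python sketch below models plain `CHECK:` directives as regular expressions that must match on successive lines, in order. It is a simplified model, not FileCheck itself: it ignores `CHECK-LABEL`, `CHECK-DAG` and pattern variables, and the assembly listings (`interleaved`, `grouped`) are hypothetical, not actual llc output.

```python
import re

def matches_in_order(patterns, asm_lines):
    """Simplified model of consecutive CHECK: lines.

    Each regex must match some line located after the line matched by
    the previous pattern. Assumes one instruction per line.
    """
    pos = 0
    for pat in patterns:
        rx = re.compile(pat)
        for i in range(pos, len(asm_lines)):
            if rx.search(asm_lines[i]):
                pos = i + 1  # the next pattern must match further down
                break
        else:
            return False  # pattern not found after the previous match
    return True

# Hypothetical schedules (register choices are made up for illustration).
interleaved = [
    "sqebr %f1, %f2",
    "aebr %f0, %f2",
    "sqebr %f3, %f6",
    "sebr %f4, %f6",
    "br %r14",
]
grouped = [
    "aebr %f0, %f2",
    "sebr %f4, %f6",
    "sqebr %f1, %f2",
    "sqebr %f3, %f6",
    "br %r14",
]

# CHECK sequences used by f1/f3 (free scheduling) and f2 (strict order).
free_checks = ["sqebr", "aebr|sebr", "sqebr", r"br %r14"]
strict_checks = ["aebr|sebr", "aebr|sebr", "sqebr", "sqebr", r"br %r14"]

print(matches_in_order(free_checks, interleaved))    # schedule with separated sqebr
print(matches_in_order(strict_checks, grouped))      # schedule kept in source order
print(matches_in_order(strict_checks, interleaved))  # strict checks reject interleaving
```

Under this model the f2 sequence only accepts a listing in which both add/subtract instructions precede both square roots, which is how the test detects that strict operations were not reordered.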

llvm/trunk/test/CodeGen/SystemZ/fp-strict-conv-01.ll

				; Test strict floating-point truncations.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z10 \
				; RUN:   | FileCheck -check-prefix=CHECK -check-prefix=CHECK-SCALAR %s
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z13 \
				; RUN:   | FileCheck -check-prefix=CHECK -check-prefix=CHECK-VECTOR %s

				declare float @llvm.experimental.constrained.fptrunc.f32.f64(double, metadata, metadata)
				declare float @llvm.experimental.constrained.fptrunc.f32.f128(fp128, metadata, metadata)
				declare double @llvm.experimental.constrained.fptrunc.f64.f128(fp128, metadata, metadata)

				declare float @llvm.experimental.constrained.fadd.f32(float, float, metadata, metadata)
				declare double @llvm.experimental.constrained.fadd.f64(double, double, metadata, metadata)

				; Test f64->f32.
				define float @f1(double %d1, double %d2) {
				; CHECK-LABEL: f1:
				; CHECK-SCALAR: ledbr %f0, %f2
				; CHECK-VECTOR: ledbra %f0, 0, %f2, 0
				; CHECK: br %r14
				%res = call float @llvm.experimental.constrained.fptrunc.f32.f64(
				double %d2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Test f128->f32.
				define float @f2(fp128 *%ptr) {
				; CHECK-LABEL: f2:
				; CHECK: lexbr %f0, %f0
				; CHECK: br %r14
				%val = load fp128, fp128 *%ptr
				%res = call float @llvm.experimental.constrained.fptrunc.f32.f128(
				fp128 %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Make sure that we don't use %f0 as the destination of LEXBR when %f2
				; is still live.
				define void @f3(float *%dst, fp128 *%ptr, float %d1, float %d2) {
				; CHECK-LABEL: f3:
				; CHECK: lexbr %f1, %f1
				; CHECK: aebr %f1, %f2
				; CHECK: ste %f1, 0(%r2)
				; CHECK: br %r14
				%val = load fp128, fp128 *%ptr
				%conv = call float @llvm.experimental.constrained.fptrunc.f32.f128(
				fp128 %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%res = call float @llvm.experimental.constrained.fadd.f32(
				float %conv, float %d2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store float %res, float *%dst
				ret void
				}

				; Test f128->f64.
				define double @f4(fp128 *%ptr) {
				; CHECK-LABEL: f4:
				; CHECK: ldxbr %f0, %f0
				; CHECK: br %r14
				%val = load fp128, fp128 *%ptr
				%res = call double @llvm.experimental.constrained.fptrunc.f64.f128(
				fp128 %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Like f3, but for f128->f64.
				define void @f5(double *%dst, fp128 *%ptr, double %d1, double %d2) {
				; CHECK-LABEL: f5:
				; CHECK: ldxbr %f1, %f1
				; CHECK-SCALAR: adbr %f1, %f2
				; CHECK-SCALAR: std %f1, 0(%r2)
				; CHECK-VECTOR: wfadb [[REG:%f[0-9]+]], %f1, %f2
				; CHECK-VECTOR: std [[REG]], 0(%r2)
				; CHECK: br %r14
				%val = load fp128, fp128 *%ptr
				%conv = call double @llvm.experimental.constrained.fptrunc.f64.f128(
				fp128 %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%res = call double @llvm.experimental.constrained.fadd.f64(
				double %conv, double %d2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store double %res, double *%dst
				ret void
				}

llvm/trunk/test/CodeGen/SystemZ/fp-strict-conv-02.ll

				; Test strict extensions of f32 to f64.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z10 \
				; RUN:   | FileCheck -check-prefix=CHECK -check-prefix=CHECK-SCALAR %s
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z13 \
				; RUN:   | FileCheck -check-prefix=CHECK -check-prefix=CHECK-VECTOR %s

				declare double @llvm.experimental.constrained.fpext.f64.f32(float, metadata)

				; Check register extension.
				define double @f1(float %val) {
				; CHECK-LABEL: f1:
				; CHECK: ldebr %f0, %f0
				; CHECK: br %r14
				%res = call double @llvm.experimental.constrained.fpext.f64.f32(float %val,
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check extension from memory.
				; FIXME: This should really use LDEB, but there is no strict "extload" yet.
				define double @f2(float *%ptr) {
				; CHECK-LABEL: f2:
				; CHECK-SCALAR: le %f0, 0(%r2)
				; CHECK-VECTOR: lde %f0, 0(%r2)
				; CHECK: ldebr %f0, %f0
				; CHECK: br %r14
				%val = load float, float *%ptr
				%res = call double @llvm.experimental.constrained.fpext.f64.f32(float %val,
				metadata !"fpexcept.strict")
				ret double %res
				}

llvm/trunk/test/CodeGen/SystemZ/fp-strict-conv-03.ll

				; Test strict extensions of f32 to f128.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu | FileCheck %s

				declare fp128 @llvm.experimental.constrained.fpext.f128.f32(float, metadata)

				; Check register extension.
				define void @f1(fp128 *%dst, float %val) {
				; CHECK-LABEL: f1:
				; CHECK: lxebr %f0, %f0
				; CHECK: std %f0, 0(%r2)
				; CHECK: std %f2, 8(%r2)
				; CHECK: br %r14
				%res = call fp128 @llvm.experimental.constrained.fpext.f128.f32(float %val,
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%dst
				ret void
				}

				; Check extension from memory.
				; FIXME: This should really use LXEB, but there is no strict "extload" yet.
				define void @f2(fp128 *%dst, float *%ptr) {
				; CHECK-LABEL: f2:
				; CHECK: le %f0, 0(%r3)
				; CHECK: lxebr %f0, %f0
				; CHECK: std %f0, 0(%r2)
				; CHECK: std %f2, 8(%r2)
				; CHECK: br %r14
				%val = load float, float *%ptr
				%res = call fp128 @llvm.experimental.constrained.fpext.f128.f32(float %val,
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%dst
				ret void
				}

llvm/trunk/test/CodeGen/SystemZ/fp-strict-conv-04.ll

				; Test strict extensions of f64 to f128.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu | FileCheck %s

				declare fp128 @llvm.experimental.constrained.fpext.f128.f64(double, metadata)

				; Check register extension.
				define void @f1(fp128 *%dst, double %val) {
				; CHECK-LABEL: f1:
				; CHECK: lxdbr %f0, %f0
				; CHECK: std %f0, 0(%r2)
				; CHECK: std %f2, 8(%r2)
				; CHECK: br %r14
				%res = call fp128 @llvm.experimental.constrained.fpext.f128.f64(double %val,
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%dst
				ret void
				}

				; Check extension from memory.
				; FIXME: This should really use LXDB, but there is no strict "extload" yet.
				define void @f2(fp128 *%dst, double *%ptr) {
				; CHECK-LABEL: f2:
				; CHECK: ld %f0, 0(%r3)
				; CHECK: lxdbr %f0, %f0
				; CHECK: std %f0, 0(%r2)
				; CHECK: std %f2, 8(%r2)
				; CHECK: br %r14
				%val = load double, double *%ptr
				%res = call fp128 @llvm.experimental.constrained.fpext.f128.f64(double %val,
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%dst
				ret void
				}

llvm/trunk/test/CodeGen/SystemZ/fp-strict-conv-15.ll

				; Test f128 floating-point strict truncations/extensions on z14.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z14 | FileCheck %s

				declare float @llvm.experimental.constrained.fptrunc.f32.f128(fp128, metadata, metadata)
				declare double @llvm.experimental.constrained.fptrunc.f64.f128(fp128, metadata, metadata)

				declare fp128 @llvm.experimental.constrained.fpext.f128.f32(float, metadata)
				declare fp128 @llvm.experimental.constrained.fpext.f128.f64(double, metadata)

				; Test f128->f64.
				define double @f1(fp128 *%ptr) {
				; CHECK-LABEL: f1:
				; CHECK: vl [[REG:%v[0-9]+]], 0(%r2)
				; CHECK: wflrx %f0, [[REG]], 0, 0
				; CHECK: br %r14
				%val = load fp128, fp128 *%ptr
				%res = call double @llvm.experimental.constrained.fptrunc.f64.f128(
				fp128 %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Test f128->f32.
				define float @f2(fp128 *%ptr) {
				; CHECK-LABEL: f2:
				; CHECK: vl [[REG:%v[0-9]+]], 0(%r2)
				; CHECK: wflrx %f0, [[REG]], 0, 3
				; CHECK: ledbra %f0, 0, %f0, 0
				; CHECK: br %r14
				%val = load fp128, fp128 *%ptr
				%res = call float @llvm.experimental.constrained.fptrunc.f32.f128(
				fp128 %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Test f64->f128.
				define void @f3(fp128 *%dst, double %val) {
				; CHECK-LABEL: f3:
				; CHECK: wflld [[RES:%v[0-9]+]], %f0
				; CHECK: vst [[RES]], 0(%r2)
				; CHECK: br %r14
				%res = call fp128 @llvm.experimental.constrained.fpext.f128.f64(double %val,
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%dst
				ret void
				}

				; Test f32->f128.
				define void @f4(fp128 *%dst, float %val) {
				; CHECK-LABEL: f4:
				; CHECK: ldebr %f0, %f0
				; CHECK: wflld [[RES:%v[0-9]+]], %f0
				; CHECK: vst [[RES]], 0(%r2)
				; CHECK: br %r14
				%res = call fp128 @llvm.experimental.constrained.fpext.f128.f32(float %val,
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%dst
				ret void
				}

llvm/trunk/test/CodeGen/SystemZ/fp-strict-div-01.ll

				; Test strict 32-bit floating-point division.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z10 \
				; RUN:   | FileCheck -check-prefix=CHECK -check-prefix=CHECK-SCALAR %s
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z14 | FileCheck %s

				declare float @foo()
				declare float @llvm.experimental.constrained.fdiv.f32(float, float, metadata, metadata)

				; Check register division.
				define float @f1(float %f1, float %f2) {
				; CHECK-LABEL: f1:
				; CHECK: debr %f0, %f2
				; CHECK: br %r14
				%res = call float @llvm.experimental.constrained.fdiv.f32(
				float %f1, float %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Check the low end of the DEB range.
				define float @f2(float %f1, float *%ptr) {
				; CHECK-LABEL: f2:
				; CHECK: deb %f0, 0(%r2)
				; CHECK: br %r14
				%f2 = load float, float *%ptr
				%res = call float @llvm.experimental.constrained.fdiv.f32(
				float %f1, float %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Check the high end of the aligned DEB range.
				define float @f3(float %f1, float *%base) {
				; CHECK-LABEL: f3:
				; CHECK: deb %f0, 4092(%r2)
				; CHECK: br %r14
				%ptr = getelementptr float, float *%base, i64 1023
				%f2 = load float, float *%ptr
				%res = call float @llvm.experimental.constrained.fdiv.f32(
				float %f1, float %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Check the next word up, which needs separate address logic.
				; Other sequences besides this one would be OK.
				define float @f4(float %f1, float *%base) {
				; CHECK-LABEL: f4:
				; CHECK: aghi %r2, 4096
				; CHECK: deb %f0, 0(%r2)
				; CHECK: br %r14
				%ptr = getelementptr float, float *%base, i64 1024
				%f2 = load float, float *%ptr
				%res = call float @llvm.experimental.constrained.fdiv.f32(
				float %f1, float %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Check negative displacements, which also need separate address logic.
				define float @f5(float %f1, float *%base) {
				; CHECK-LABEL: f5:
				; CHECK: aghi %r2, -4
				; CHECK: deb %f0, 0(%r2)
				; CHECK: br %r14
				%ptr = getelementptr float, float *%base, i64 -1
				%f2 = load float, float *%ptr
				%res = call float @llvm.experimental.constrained.fdiv.f32(
				float %f1, float %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Check that DEB allows indices.
				define float @f6(float %f1, float *%base, i64 %index) {
				; CHECK-LABEL: f6:
				; CHECK: sllg %r1, %r3, 2
				; CHECK: deb %f0, 400(%r1,%r2)
				; CHECK: br %r14
				%ptr1 = getelementptr float, float *%base, i64 %index
				%ptr2 = getelementptr float, float *%ptr1, i64 100
				%f2 = load float, float *%ptr2
				%res = call float @llvm.experimental.constrained.fdiv.f32(
				float %f1, float %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Check that divisions of spilled values can use DEB rather than DEBR.
				define float @f7(float *%ptr0) {
				; CHECK-LABEL: f7:
				; CHECK: brasl %r14, foo@PLT
				; CHECK-SCALAR: deb %f0, 16{{[04]}}(%r15)
				; CHECK: br %r14
				%ptr1 = getelementptr float, float *%ptr0, i64 2
				%ptr2 = getelementptr float, float *%ptr0, i64 4
				%ptr3 = getelementptr float, float *%ptr0, i64 6
				%ptr4 = getelementptr float, float *%ptr0, i64 8
				%ptr5 = getelementptr float, float *%ptr0, i64 10
				%ptr6 = getelementptr float, float *%ptr0, i64 12
				%ptr7 = getelementptr float, float *%ptr0, i64 14
				%ptr8 = getelementptr float, float *%ptr0, i64 16
				%ptr9 = getelementptr float, float *%ptr0, i64 18
				%ptr10 = getelementptr float, float *%ptr0, i64 20

				%val0 = load float, float *%ptr0
				%val1 = load float, float *%ptr1
				%val2 = load float, float *%ptr2
				%val3 = load float, float *%ptr3
				%val4 = load float, float *%ptr4
				%val5 = load float, float *%ptr5
				%val6 = load float, float *%ptr6
				%val7 = load float, float *%ptr7
				%val8 = load float, float *%ptr8
				%val9 = load float, float *%ptr9
				%val10 = load float, float *%ptr10

				%ret = call float @foo()

				%div0 = call float @llvm.experimental.constrained.fdiv.f32(
				float %ret, float %val0,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%div1 = call float @llvm.experimental.constrained.fdiv.f32(
				float %div0, float %val1,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%div2 = call float @llvm.experimental.constrained.fdiv.f32(
				float %div1, float %val2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%div3 = call float @llvm.experimental.constrained.fdiv.f32(
				float %div2, float %val3,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%div4 = call float @llvm.experimental.constrained.fdiv.f32(
				float %div3, float %val4,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%div5 = call float @llvm.experimental.constrained.fdiv.f32(
				float %div4, float %val5,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%div6 = call float @llvm.experimental.constrained.fdiv.f32(
				float %div5, float %val6,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%div7 = call float @llvm.experimental.constrained.fdiv.f32(
				float %div6, float %val7,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%div8 = call float @llvm.experimental.constrained.fdiv.f32(
				float %div7, float %val8,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%div9 = call float @llvm.experimental.constrained.fdiv.f32(
				float %div8, float %val9,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%div10 = call float @llvm.experimental.constrained.fdiv.f32(
				float %div9, float %val10,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")

				ret float %div10
				}
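
The offsets checked in f3 through f6 above follow from simple arithmetic on the element index. The Python sketch below works through it under one assumption that is not stated in the test itself: that the memory form (DEB here) encodes a 12-bit unsigned displacement, i.e. byte offsets 0 to 4095. That assumption is consistent with the 4092 and 4096 boundary the tests probe. The helper names are made up for illustration.

```python
# Assumed range of a 12-bit unsigned displacement field (an assumption,
# chosen to agree with the 4092 / 4096 boundary checked by the tests).
DISP12_MIN, DISP12_MAX = 0, 4095

def byte_offset(index, elem_size):
    """Byte offset of element `index` for elements of `elem_size` bytes."""
    return index * elem_size

def fits_disp12(index, elem_size):
    """True if the element's byte offset can be encoded as the displacement."""
    return DISP12_MIN <= byte_offset(index, elem_size) <= DISP12_MAX

# (type name, element size in bytes, getelementptr index) taken from the tests.
cases = [
    ("float", 4, 1023),   # f3: high end of the range
    ("float", 4, 1024),   # f4: next word up
    ("float", 4, -1),     # f5: negative displacement
    ("double", 8, 511),   # 64-bit variant, high end
    ("double", 8, 512),   # 64-bit variant, next doubleword up
]

for name, size, idx in cases:
    off = byte_offset(idx, size)
    if fits_disp12(idx, size):
        how = "folded into the memory operand"
    else:
        how = "needs separate address arithmetic (e.g. aghi)"
    print(f"{name}[{idx}] -> byte offset {off}: {how}")

# f6: a constant index of 100 floats on top of a variable index gives
# a displacement of 100 * 4 = 400, with the variable index shifted by 2.
print(byte_offset(100, 4))
```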

llvm/trunk/test/CodeGen/SystemZ/fp-strict-div-02.ll

				; Test strict 64-bit floating-point division.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z10 \
				; RUN:   | FileCheck -check-prefix=CHECK -check-prefix=CHECK-SCALAR %s
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z13 | FileCheck %s

				declare double @foo()
				declare double @llvm.experimental.constrained.fdiv.f64(double, double, metadata, metadata)

				; Check register division.
				define double @f1(double %f1, double %f2) {
				; CHECK-LABEL: f1:
				; CHECK: ddbr %f0, %f2
				; CHECK: br %r14
				%res = call double @llvm.experimental.constrained.fdiv.f64(
				double %f1, double %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check the low end of the DDB range.
				define double @f2(double %f1, double *%ptr) {
				; CHECK-LABEL: f2:
				; CHECK: ddb %f0, 0(%r2)
				; CHECK: br %r14
				%f2 = load double, double *%ptr
				%res = call double @llvm.experimental.constrained.fdiv.f64(
				double %f1, double %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check the high end of the aligned DDB range.
				define double @f3(double %f1, double *%base) {
				; CHECK-LABEL: f3:
				; CHECK: ddb %f0, 4088(%r2)
				; CHECK: br %r14
				%ptr = getelementptr double, double *%base, i64 511
				%f2 = load double, double *%ptr
				%res = call double @llvm.experimental.constrained.fdiv.f64(
				double %f1, double %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check the next doubleword up, which needs separate address logic.
				; Other sequences besides this one would be OK.
				define double @f4(double %f1, double *%base) {
				; CHECK-LABEL: f4:
				; CHECK: aghi %r2, 4096
				; CHECK: ddb %f0, 0(%r2)
				; CHECK: br %r14
				%ptr = getelementptr double, double *%base, i64 512
				%f2 = load double, double *%ptr
				%res = call double @llvm.experimental.constrained.fdiv.f64(
				double %f1, double %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check negative displacements, which also need separate address logic.
				define double @f5(double %f1, double *%base) {
				; CHECK-LABEL: f5:
				; CHECK: aghi %r2, -8
				; CHECK: ddb %f0, 0(%r2)
				; CHECK: br %r14
				%ptr = getelementptr double, double *%base, i64 -1
				%f2 = load double, double *%ptr
				%res = call double @llvm.experimental.constrained.fdiv.f64(
				double %f1, double %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check that DDB allows indices.
				define double @f6(double %f1, double *%base, i64 %index) {
				; CHECK-LABEL: f6:
				; CHECK: sllg %r1, %r3, 3
				; CHECK: ddb %f0, 800(%r1,%r2)
				; CHECK: br %r14
				%ptr1 = getelementptr double, double *%base, i64 %index
				%ptr2 = getelementptr double, double *%ptr1, i64 100
				%f2 = load double, double *%ptr2
				%res = call double @llvm.experimental.constrained.fdiv.f64(
				double %f1, double %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check that divisions of spilled values can use DDB rather than DDBR.
				define double @f7(double *%ptr0) {
				; CHECK-LABEL: f7:
				; CHECK: brasl %r14, foo@PLT
				; CHECK-SCALAR: ddb %f0, 160(%r15)
				; CHECK: br %r14
				%ptr1 = getelementptr double, double *%ptr0, i64 2
				%ptr2 = getelementptr double, double *%ptr0, i64 4
				%ptr3 = getelementptr double, double *%ptr0, i64 6
				%ptr4 = getelementptr double, double *%ptr0, i64 8
				%ptr5 = getelementptr double, double *%ptr0, i64 10
				%ptr6 = getelementptr double, double *%ptr0, i64 12
				%ptr7 = getelementptr double, double *%ptr0, i64 14
				%ptr8 = getelementptr double, double *%ptr0, i64 16
				%ptr9 = getelementptr double, double *%ptr0, i64 18
				%ptr10 = getelementptr double, double *%ptr0, i64 20

				%val0 = load double, double *%ptr0
				%val1 = load double, double *%ptr1
				%val2 = load double, double *%ptr2
				%val3 = load double, double *%ptr3
				%val4 = load double, double *%ptr4
				%val5 = load double, double *%ptr5
				%val6 = load double, double *%ptr6
				%val7 = load double, double *%ptr7
				%val8 = load double, double *%ptr8
				%val9 = load double, double *%ptr9
				%val10 = load double, double *%ptr10

				%ret = call double @foo()

				%div0 = call double @llvm.experimental.constrained.fdiv.f64(
				double %ret, double %val0,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%div1 = call double @llvm.experimental.constrained.fdiv.f64(
				double %div0, double %val1,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%div2 = call double @llvm.experimental.constrained.fdiv.f64(
				double %div1, double %val2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%div3 = call double @llvm.experimental.constrained.fdiv.f64(
				double %div2, double %val3,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%div4 = call double @llvm.experimental.constrained.fdiv.f64(
				double %div3, double %val4,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%div5 = call double @llvm.experimental.constrained.fdiv.f64(
				double %div4, double %val5,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%div6 = call double @llvm.experimental.constrained.fdiv.f64(
				double %div5, double %val6,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%div7 = call double @llvm.experimental.constrained.fdiv.f64(
				double %div6, double %val7,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%div8 = call double @llvm.experimental.constrained.fdiv.f64(
				double %div7, double %val8,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%div9 = call double @llvm.experimental.constrained.fdiv.f64(
				double %div8, double %val9,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%div10 = call double @llvm.experimental.constrained.fdiv.f64(
				double %div9, double %val10,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")

				ret double %div10
				}

llvm/trunk/test/CodeGen/SystemZ/fp-strict-div-03.ll

				; Test strict 128-bit floating-point division.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu | FileCheck %s

				declare fp128 @llvm.experimental.constrained.fdiv.f128(fp128, fp128, metadata, metadata)

				; There is no memory form of 128-bit division.
				define void @f1(fp128 *%ptr, float %f2) {
				; CHECK-LABEL: f1:
				; CHECK-DAG: lxebr %f0, %f0
				; CHECK-DAG: ld %f1, 0(%r2)
				; CHECK-DAG: ld %f3, 8(%r2)
				; CHECK: dxbr %f1, %f0
				; CHECK: std %f1, 0(%r2)
				; CHECK: std %f3, 8(%r2)
				; CHECK: br %r14
				%f1 = load fp128, fp128 *%ptr
				%f2x = fpext float %f2 to fp128
				%sum = call fp128 @llvm.experimental.constrained.fdiv.f128(
				fp128 %f1, fp128 %f2x,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %sum, fp128 *%ptr
				ret void
				}

llvm/trunk/test/CodeGen/SystemZ/fp-strict-div-04.ll

				; Test strict 128-bit floating-point division on z14.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z14 | FileCheck %s

				declare fp128 @llvm.experimental.constrained.fdiv.f128(fp128, fp128, metadata, metadata)

				define void @f1(fp128 *%ptr1, fp128 *%ptr2) {
				; CHECK-LABEL: f1:
				; CHECK-DAG: vl [[REG1:%v[0-9]+]], 0(%r2)
				; CHECK-DAG: vl [[REG2:%v[0-9]+]], 0(%r3)
				; CHECK: wfdxb [[RES:%v[0-9]+]], [[REG1]], [[REG2]]
				; CHECK: vst [[RES]], 0(%r2)
				; CHECK: br %r14
				%f1 = load fp128, fp128 *%ptr1
				%f2 = load fp128, fp128 *%ptr2
				%sum = call fp128 @llvm.experimental.constrained.fdiv.f128(
				fp128 %f1, fp128 %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %sum, fp128 *%ptr1
				ret void
				}

llvm/trunk/test/CodeGen/SystemZ/fp-strict-mul-01.ll

				; Test strict multiplication of two f32s, producing an f32 result.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z10 \
				; RUN:   | FileCheck -check-prefix=CHECK -check-prefix=CHECK-SCALAR %s
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z14 | FileCheck %s

				declare float @foo()
				declare float @llvm.experimental.constrained.fmul.f32(float, float, metadata, metadata)

				; Check register multiplication.
				define float @f1(float %f1, float %f2) {
				; CHECK-LABEL: f1:
				; CHECK: meebr %f0, %f2
				; CHECK: br %r14
				%res = call float @llvm.experimental.constrained.fmul.f32(
				float %f1, float %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Check the low end of the MEEB range.
				define float @f2(float %f1, float *%ptr) {
				; CHECK-LABEL: f2:
				; CHECK: meeb %f0, 0(%r2)
				; CHECK: br %r14
				%f2 = load float, float *%ptr
				%res = call float @llvm.experimental.constrained.fmul.f32(
				float %f1, float %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Check the high end of the aligned MEEB range.
				define float @f3(float %f1, float *%base) {
				; CHECK-LABEL: f3:
				; CHECK: meeb %f0, 4092(%r2)
				; CHECK: br %r14
				%ptr = getelementptr float, float *%base, i64 1023
				%f2 = load float, float *%ptr
				%res = call float @llvm.experimental.constrained.fmul.f32(
				float %f1, float %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Check the next word up, which needs separate address logic.
				; Other sequences besides this one would be OK.
				define float @f4(float %f1, float *%base) {
				; CHECK-LABEL: f4:
				; CHECK: aghi %r2, 4096
				; CHECK: meeb %f0, 0(%r2)
				; CHECK: br %r14
				%ptr = getelementptr float, float *%base, i64 1024
				%f2 = load float, float *%ptr
				%res = call float @llvm.experimental.constrained.fmul.f32(
				float %f1, float %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Check negative displacements, which also need separate address logic.
				define float @f5(float %f1, float *%base) {
				; CHECK-LABEL: f5:
				; CHECK: aghi %r2, -4
				; CHECK: meeb %f0, 0(%r2)
				; CHECK: br %r14
				%ptr = getelementptr float, float *%base, i64 -1
				%f2 = load float, float *%ptr
				%res = call float @llvm.experimental.constrained.fmul.f32(
				float %f1, float %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Check that MEEB allows indices.
				define float @f6(float %f1, float *%base, i64 %index) {
				; CHECK-LABEL: f6:
				; CHECK: sllg %r1, %r3, 2
				; CHECK: meeb %f0, 400(%r1,%r2)
				; CHECK: br %r14
				%ptr1 = getelementptr float, float *%base, i64 %index
				%ptr2 = getelementptr float, float *%ptr1, i64 100
				%f2 = load float, float *%ptr2
				%res = call float @llvm.experimental.constrained.fmul.f32(
				float %f1, float %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Check that multiplications of spilled values can use MEEB rather than MEEBR.
				define float @f7(float *%ptr0) {
				; CHECK-LABEL: f7:
				; CHECK: brasl %r14, foo@PLT
				; CHECK-SCALAR: meeb %f0, 16{{[04]}}(%r15)
				; CHECK: br %r14
				%ptr1 = getelementptr float, float *%ptr0, i64 2
				%ptr2 = getelementptr float, float *%ptr0, i64 4
				%ptr3 = getelementptr float, float *%ptr0, i64 6
				%ptr4 = getelementptr float, float *%ptr0, i64 8
				%ptr5 = getelementptr float, float *%ptr0, i64 10
				%ptr6 = getelementptr float, float *%ptr0, i64 12
				%ptr7 = getelementptr float, float *%ptr0, i64 14
				%ptr8 = getelementptr float, float *%ptr0, i64 16
				%ptr9 = getelementptr float, float *%ptr0, i64 18
				%ptr10 = getelementptr float, float *%ptr0, i64 20

				%val0 = load float, float *%ptr0
				%val1 = load float, float *%ptr1
				%val2 = load float, float *%ptr2
				%val3 = load float, float *%ptr3
				%val4 = load float, float *%ptr4
				%val5 = load float, float *%ptr5
				%val6 = load float, float *%ptr6
				%val7 = load float, float *%ptr7
				%val8 = load float, float *%ptr8
				%val9 = load float, float *%ptr9
				%val10 = load float, float *%ptr10

				%ret = call float @foo()

				%mul0 = call float @llvm.experimental.constrained.fmul.f32(
				float %ret, float %val0,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%mul1 = call float @llvm.experimental.constrained.fmul.f32(
				float %mul0, float %val1,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%mul2 = call float @llvm.experimental.constrained.fmul.f32(
				float %mul1, float %val2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%mul3 = call float @llvm.experimental.constrained.fmul.f32(
				float %mul2, float %val3,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%mul4 = call float @llvm.experimental.constrained.fmul.f32(
				float %mul3, float %val4,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%mul5 = call float @llvm.experimental.constrained.fmul.f32(
				float %mul4, float %val5,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%mul6 = call float @llvm.experimental.constrained.fmul.f32(
				float %mul5, float %val6,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%mul7 = call float @llvm.experimental.constrained.fmul.f32(
				float %mul6, float %val7,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%mul8 = call float @llvm.experimental.constrained.fmul.f32(
				float %mul7, float %val8,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%mul9 = call float @llvm.experimental.constrained.fmul.f32(
				float %mul8, float %val9,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%mul10 = call float @llvm.experimental.constrained.fmul.f32(
				float %mul9, float %val10,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")

				ret float %mul10
				}

llvm/trunk/test/CodeGen/SystemZ/fp-strict-mul-02.ll

				; Test strict multiplication of two f32s, producing an f64 result.
				; FIXME: we do not have a strict version of fpext yet
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu | FileCheck %s

				declare float @foo()
				declare double @llvm.experimental.constrained.fmul.f64(double, double, metadata, metadata)

				; Check register multiplication.
				define double @f1(float %f1, float %f2) {
				; CHECK-LABEL: f1:
				; CHECK: mdebr %f0, %f2
				; CHECK: br %r14
				%f1x = fpext float %f1 to double
				%f2x = fpext float %f2 to double
				%res = call double @llvm.experimental.constrained.fmul.f64(
				double %f1x, double %f2x,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check the low end of the MDEB range.
				define double @f2(float %f1, float *%ptr) {
				; CHECK-LABEL: f2:
				; CHECK: mdeb %f0, 0(%r2)
				; CHECK: br %r14
				%f2 = load float, float *%ptr
				%f1x = fpext float %f1 to double
				%f2x = fpext float %f2 to double
				%res = call double @llvm.experimental.constrained.fmul.f64(
				double %f1x, double %f2x,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check the high end of the aligned MDEB range.
				define double @f3(float %f1, float *%base) {
				; CHECK-LABEL: f3:
				; CHECK: mdeb %f0, 4092(%r2)
				; CHECK: br %r14
				%ptr = getelementptr float, float *%base, i64 1023
				%f2 = load float, float *%ptr
				%f1x = fpext float %f1 to double
				%f2x = fpext float %f2 to double
				%res = call double @llvm.experimental.constrained.fmul.f64(
				double %f1x, double %f2x,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check the next word up, which needs separate address logic.
				; Other sequences besides this one would be OK.
				define double @f4(float %f1, float *%base) {
				; CHECK-LABEL: f4:
				; CHECK: aghi %r2, 4096
				; CHECK: mdeb %f0, 0(%r2)
				; CHECK: br %r14
				%ptr = getelementptr float, float *%base, i64 1024
				%f2 = load float, float *%ptr
				%f1x = fpext float %f1 to double
				%f2x = fpext float %f2 to double
				%res = call double @llvm.experimental.constrained.fmul.f64(
				double %f1x, double %f2x,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check negative displacements, which also need separate address logic.
				define double @f5(float %f1, float *%base) {
				; CHECK-LABEL: f5:
				; CHECK: aghi %r2, -4
				; CHECK: mdeb %f0, 0(%r2)
				; CHECK: br %r14
				%ptr = getelementptr float, float *%base, i64 -1
				%f2 = load float, float *%ptr
				%f1x = fpext float %f1 to double
				%f2x = fpext float %f2 to double
				%res = call double @llvm.experimental.constrained.fmul.f64(
				double %f1x, double %f2x,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check that MDEB allows indices.
				define double @f6(float %f1, float *%base, i64 %index) {
				; CHECK-LABEL: f6:
				; CHECK: sllg %r1, %r3, 2
				; CHECK: mdeb %f0, 400(%r1,%r2)
				; CHECK: br %r14
				%ptr1 = getelementptr float, float *%base, i64 %index
				%ptr2 = getelementptr float, float *%ptr1, i64 100
				%f2 = load float, float *%ptr2
				%f1x = fpext float %f1 to double
				%f2x = fpext float %f2 to double
				%res = call double @llvm.experimental.constrained.fmul.f64(
				double %f1x, double %f2x,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check that multiplications of spilled values can use MDEB rather than MDEBR.
				define float @f7(float *%ptr0) {
				; CHECK-LABEL: f7:
				; CHECK: brasl %r14, foo@PLT
				; CHECK: mdeb %f0, 16{{[04]}}(%r15)
				; CHECK: br %r14
				%ptr1 = getelementptr float, float *%ptr0, i64 2
				%ptr2 = getelementptr float, float *%ptr0, i64 4
				%ptr3 = getelementptr float, float *%ptr0, i64 6
				%ptr4 = getelementptr float, float *%ptr0, i64 8
				%ptr5 = getelementptr float, float *%ptr0, i64 10
				%ptr6 = getelementptr float, float *%ptr0, i64 12
				%ptr7 = getelementptr float, float *%ptr0, i64 14
				%ptr8 = getelementptr float, float *%ptr0, i64 16
				%ptr9 = getelementptr float, float *%ptr0, i64 18
				%ptr10 = getelementptr float, float *%ptr0, i64 20

				%val0 = load float, float *%ptr0
				%val1 = load float, float *%ptr1
				%val2 = load float, float *%ptr2
				%val3 = load float, float *%ptr3
				%val4 = load float, float *%ptr4
				%val5 = load float, float *%ptr5
				%val6 = load float, float *%ptr6
				%val7 = load float, float *%ptr7
				%val8 = load float, float *%ptr8
				%val9 = load float, float *%ptr9
				%val10 = load float, float *%ptr10

				%frob0 = fadd float %val0, %val0
				%frob1 = fadd float %val1, %val1
				%frob2 = fadd float %val2, %val2
				%frob3 = fadd float %val3, %val3
				%frob4 = fadd float %val4, %val4
				%frob5 = fadd float %val5, %val5
				%frob6 = fadd float %val6, %val6
				%frob7 = fadd float %val7, %val7
				%frob8 = fadd float %val8, %val8
				%frob9 = fadd float %val9, %val9
				%frob10 = fadd float %val9, %val10

				store float %frob0, float *%ptr0
				store float %frob1, float *%ptr1
				store float %frob2, float *%ptr2
				store float %frob3, float *%ptr3
				store float %frob4, float *%ptr4
				store float %frob5, float *%ptr5
				store float %frob6, float *%ptr6
				store float %frob7, float *%ptr7
				store float %frob8, float *%ptr8
				store float %frob9, float *%ptr9
				store float %frob10, float *%ptr10

				%ret = call float @foo()

				%accext0 = fpext float %ret to double
				%ext0 = fpext float %frob0 to double
				%mul0 = call double @llvm.experimental.constrained.fmul.f64(
				double %accext0, double %ext0,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%extra0 = call double @llvm.experimental.constrained.fmul.f64(
				double %mul0, double 1.01,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%trunc0 = fptrunc double %extra0 to float

				%accext1 = fpext float %trunc0 to double
				%ext1 = fpext float %frob1 to double
				%mul1 = call double @llvm.experimental.constrained.fmul.f64(
				double %accext1, double %ext1,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%extra1 = call double @llvm.experimental.constrained.fmul.f64(
				double %mul1, double 1.11,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%trunc1 = fptrunc double %extra1 to float

				%accext2 = fpext float %trunc1 to double
				%ext2 = fpext float %frob2 to double
				%mul2 = call double @llvm.experimental.constrained.fmul.f64(
				double %accext2, double %ext2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%extra2 = call double @llvm.experimental.constrained.fmul.f64(
				double %mul2, double 1.21,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%trunc2 = fptrunc double %extra2 to float

				%accext3 = fpext float %trunc2 to double
				%ext3 = fpext float %frob3 to double
				%mul3 = call double @llvm.experimental.constrained.fmul.f64(
				double %accext3, double %ext3,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%extra3 = call double @llvm.experimental.constrained.fmul.f64(
				double %mul3, double 1.31,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%trunc3 = fptrunc double %extra3 to float

				%accext4 = fpext float %trunc3 to double
				%ext4 = fpext float %frob4 to double
				%mul4 = call double @llvm.experimental.constrained.fmul.f64(
				double %accext4, double %ext4,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%extra4 = call double @llvm.experimental.constrained.fmul.f64(
				double %mul4, double 1.41,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%trunc4 = fptrunc double %extra4 to float

				%accext5 = fpext float %trunc4 to double
				%ext5 = fpext float %frob5 to double
				%mul5 = call double @llvm.experimental.constrained.fmul.f64(
				double %accext5, double %ext5,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%extra5 = call double @llvm.experimental.constrained.fmul.f64(
				double %mul5, double 1.51,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%trunc5 = fptrunc double %extra5 to float

				%accext6 = fpext float %trunc5 to double
				%ext6 = fpext float %frob6 to double
				%mul6 = call double @llvm.experimental.constrained.fmul.f64(
				double %accext6, double %ext6,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%extra6 = call double @llvm.experimental.constrained.fmul.f64(
				double %mul6, double 1.61,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%trunc6 = fptrunc double %extra6 to float

				%accext7 = fpext float %trunc6 to double
				%ext7 = fpext float %frob7 to double
				%mul7 = call double @llvm.experimental.constrained.fmul.f64(
				double %accext7, double %ext7,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%extra7 = call double @llvm.experimental.constrained.fmul.f64(
				double %mul7, double 1.71,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%trunc7 = fptrunc double %extra7 to float

				%accext8 = fpext float %trunc7 to double
				%ext8 = fpext float %frob8 to double
				%mul8 = call double @llvm.experimental.constrained.fmul.f64(
				double %accext8, double %ext8,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%extra8 = call double @llvm.experimental.constrained.fmul.f64(
				double %mul8, double 1.81,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%trunc8 = fptrunc double %extra8 to float

				%accext9 = fpext float %trunc8 to double
				%ext9 = fpext float %frob9 to double
				%mul9 = call double @llvm.experimental.constrained.fmul.f64(
				double %accext9, double %ext9,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%extra9 = call double @llvm.experimental.constrained.fmul.f64(
				double %mul9, double 1.91,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%trunc9 = fptrunc double %extra9 to float

				ret float %trunc9
				}

llvm/trunk/test/CodeGen/SystemZ/fp-strict-mul-03.ll

				; Test strict multiplication of two f64s, producing an f64 result.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z10 \
				; RUN: | FileCheck -check-prefix=CHECK -check-prefix=CHECK-SCALAR %s
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z13 | FileCheck %s

				declare double @foo()
				declare double @llvm.experimental.constrained.fmul.f64(double, double, metadata, metadata)

				; Check register multiplication.
				define double @f1(double %f1, double %f2) {
				; CHECK-LABEL: f1:
				; CHECK: mdbr %f0, %f2
				; CHECK: br %r14
				%res = call double @llvm.experimental.constrained.fmul.f64(
				double %f1, double %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check the low end of the MDB range.
				define double @f2(double %f1, double *%ptr) {
				; CHECK-LABEL: f2:
				; CHECK: mdb %f0, 0(%r2)
				; CHECK: br %r14
				%f2 = load double, double *%ptr
				%res = call double @llvm.experimental.constrained.fmul.f64(
				double %f1, double %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check the high end of the aligned MDB range.
				define double @f3(double %f1, double *%base) {
				; CHECK-LABEL: f3:
				; CHECK: mdb %f0, 4088(%r2)
				; CHECK: br %r14
				%ptr = getelementptr double, double *%base, i64 511
				%f2 = load double, double *%ptr
				%res = call double @llvm.experimental.constrained.fmul.f64(
				double %f1, double %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check the next doubleword up, which needs separate address logic.
				; Other sequences besides this one would be OK.
				define double @f4(double %f1, double *%base) {
				; CHECK-LABEL: f4:
				; CHECK: aghi %r2, 4096
				; CHECK: mdb %f0, 0(%r2)
				; CHECK: br %r14
				%ptr = getelementptr double, double *%base, i64 512
				%f2 = load double, double *%ptr
				%res = call double @llvm.experimental.constrained.fmul.f64(
				double %f1, double %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check negative displacements, which also need separate address logic.
				define double @f5(double %f1, double *%base) {
				; CHECK-LABEL: f5:
				; CHECK: aghi %r2, -8
				; CHECK: mdb %f0, 0(%r2)
				; CHECK: br %r14
				%ptr = getelementptr double, double *%base, i64 -1
				%f2 = load double, double *%ptr
				%res = call double @llvm.experimental.constrained.fmul.f64(
				double %f1, double %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check that MDB allows indices.
				define double @f6(double %f1, double *%base, i64 %index) {
				; CHECK-LABEL: f6:
				; CHECK: sllg %r1, %r3, 3
				; CHECK: mdb %f0, 800(%r1,%r2)
				; CHECK: br %r14
				%ptr1 = getelementptr double, double *%base, i64 %index
				%ptr2 = getelementptr double, double *%ptr1, i64 100
				%f2 = load double, double *%ptr2
				%res = call double @llvm.experimental.constrained.fmul.f64(
				double %f1, double %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check that multiplications of spilled values can use MDB rather than MDBR.
				define double @f7(double *%ptr0) {
				; CHECK-LABEL: f7:
				; CHECK: brasl %r14, foo@PLT
				; CHECK-SCALAR: mdb %f0, 160(%r15)
				; CHECK: br %r14
				%ptr1 = getelementptr double, double *%ptr0, i64 2
				%ptr2 = getelementptr double, double *%ptr0, i64 4
				%ptr3 = getelementptr double, double *%ptr0, i64 6
				%ptr4 = getelementptr double, double *%ptr0, i64 8
				%ptr5 = getelementptr double, double *%ptr0, i64 10
				%ptr6 = getelementptr double, double *%ptr0, i64 12
				%ptr7 = getelementptr double, double *%ptr0, i64 14
				%ptr8 = getelementptr double, double *%ptr0, i64 16
				%ptr9 = getelementptr double, double *%ptr0, i64 18
				%ptr10 = getelementptr double, double *%ptr0, i64 20

				%val0 = load double, double *%ptr0
				%val1 = load double, double *%ptr1
				%val2 = load double, double *%ptr2
				%val3 = load double, double *%ptr3
				%val4 = load double, double *%ptr4
				%val5 = load double, double *%ptr5
				%val6 = load double, double *%ptr6
				%val7 = load double, double *%ptr7
				%val8 = load double, double *%ptr8
				%val9 = load double, double *%ptr9
				%val10 = load double, double *%ptr10

				%ret = call double @foo()

				%mul0 = call double @llvm.experimental.constrained.fmul.f64(
				double %ret, double %val0,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%mul1 = call double @llvm.experimental.constrained.fmul.f64(
				double %mul0, double %val1,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%mul2 = call double @llvm.experimental.constrained.fmul.f64(
				double %mul1, double %val2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%mul3 = call double @llvm.experimental.constrained.fmul.f64(
				double %mul2, double %val3,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%mul4 = call double @llvm.experimental.constrained.fmul.f64(
				double %mul3, double %val4,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%mul5 = call double @llvm.experimental.constrained.fmul.f64(
				double %mul4, double %val5,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%mul6 = call double @llvm.experimental.constrained.fmul.f64(
				double %mul5, double %val6,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%mul7 = call double @llvm.experimental.constrained.fmul.f64(
				double %mul6, double %val7,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%mul8 = call double @llvm.experimental.constrained.fmul.f64(
				double %mul7, double %val8,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%mul9 = call double @llvm.experimental.constrained.fmul.f64(
				double %mul8, double %val9,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%mul10 = call double @llvm.experimental.constrained.fmul.f64(
				double %mul9, double %val10,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")

				ret double %mul10
				}
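
The f3, f4, and f5 cases in the files above probe the edges of the addressable displacement range: element 1023 of a float array (or element 511 of a double array) still folds into the memory operand, while the next element up and any negative index need a separate `aghi`. A minimal sketch of that arithmetic follows, assuming an unsigned 12-bit displacement field (0 to 4095 bytes), which matches the CHECK lines. The helper names `byte_offset` and `fits_disp12` are hypothetical and are not part of LLVM.

```python
# Hypothetical model, not LLVM code: an unsigned 12-bit displacement field.
DISP12_MAX = 4095  # assumption: valid displacements span 0..4095 bytes


def byte_offset(index: int, elem_size: int) -> int:
    """Byte offset produced by a getelementptr with a constant index."""
    return index * elem_size


def fits_disp12(index: int, elem_size: int) -> bool:
    """True if the offset can be encoded directly in the memory operand."""
    return 0 <= byte_offset(index, elem_size) <= DISP12_MAX


if __name__ == "__main__":
    # Indices used by f3, f4, and f5 in the f64 tests (8-byte elements).
    for name, index in (("f3", 511), ("f4", 512), ("f5", -1)):
        print(name, byte_offset(index, 8), fits_disp12(index, 8))
```

When `fits_disp12` is false, the tests accept any sequence that first adjusts the base register, which is why their comments note that other sequences would also be OK.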

llvm/trunk/test/CodeGen/SystemZ/fp-strict-mul-04.ll

				; Test strict multiplication of two f64s, producing an f128 result.
				; FIXME: we do not have a strict version of fpext yet
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu | FileCheck %s

				declare fp128 @llvm.experimental.constrained.fmul.f128(fp128, fp128, metadata, metadata)

				declare double @foo()

				; Check register multiplication. "mxdbr %f0, %f2" is not valid from LLVM's
				; point of view, because %f2 is the low register of the FP128 %f0. Pass the
				; multiplier in %f4 instead.
				define void @f1(double %f1, double %dummy, double %f2, fp128 *%dst) {
				; CHECK-LABEL: f1:
				; CHECK: mxdbr %f0, %f4
				; CHECK: std %f0, 0(%r2)
				; CHECK: std %f2, 8(%r2)
				; CHECK: br %r14
				%f1x = fpext double %f1 to fp128
				%f2x = fpext double %f2 to fp128
				%res = call fp128 @llvm.experimental.constrained.fmul.f128(
				fp128 %f1x, fp128 %f2x,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%dst
				ret void
				}

				; Check the low end of the MXDB range.
				define void @f2(double %f1, double *%ptr, fp128 *%dst) {
				; CHECK-LABEL: f2:
				; CHECK: mxdb %f0, 0(%r2)
				; CHECK: std %f0, 0(%r3)
				; CHECK: std %f2, 8(%r3)
				; CHECK: br %r14
				%f2 = load double, double *%ptr
				%f1x = fpext double %f1 to fp128
				%f2x = fpext double %f2 to fp128
				%res = call fp128 @llvm.experimental.constrained.fmul.f128(
				fp128 %f1x, fp128 %f2x,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%dst
				ret void
				}

				; Check the high end of the aligned MXDB range.
				define void @f3(double %f1, double *%base, fp128 *%dst) {
				; CHECK-LABEL: f3:
				; CHECK: mxdb %f0, 4088(%r2)
				; CHECK: std %f0, 0(%r3)
				; CHECK: std %f2, 8(%r3)
				; CHECK: br %r14
				%ptr = getelementptr double, double *%base, i64 511
				%f2 = load double, double *%ptr
				%f1x = fpext double %f1 to fp128
				%f2x = fpext double %f2 to fp128
				%res = call fp128 @llvm.experimental.constrained.fmul.f128(
				fp128 %f1x, fp128 %f2x,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%dst
				ret void
				}

				; Check the next doubleword up, which needs separate address logic.
				; Other sequences besides this one would be OK.
				define void @f4(double %f1, double *%base, fp128 *%dst) {
				; CHECK-LABEL: f4:
				; CHECK: aghi %r2, 4096
				; CHECK: mxdb %f0, 0(%r2)
				; CHECK: std %f0, 0(%r3)
				; CHECK: std %f2, 8(%r3)
				; CHECK: br %r14
				%ptr = getelementptr double, double *%base, i64 512
				%f2 = load double, double *%ptr
				%f1x = fpext double %f1 to fp128
				%f2x = fpext double %f2 to fp128
				%res = call fp128 @llvm.experimental.constrained.fmul.f128(
				fp128 %f1x, fp128 %f2x,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%dst
				ret void
				}

				; Check negative displacements, which also need separate address logic.
				define void @f5(double %f1, double *%base, fp128 *%dst) {
				; CHECK-LABEL: f5:
				; CHECK: aghi %r2, -8
				; CHECK: mxdb %f0, 0(%r2)
				; CHECK: std %f0, 0(%r3)
				; CHECK: std %f2, 8(%r3)
				; CHECK: br %r14
				%ptr = getelementptr double, double *%base, i64 -1
				%f2 = load double, double *%ptr
				%f1x = fpext double %f1 to fp128
				%f2x = fpext double %f2 to fp128
				%res = call fp128 @llvm.experimental.constrained.fmul.f128(
				fp128 %f1x, fp128 %f2x,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%dst
				ret void
				}

				; Check that MXDB allows indices.
				define void @f6(double %f1, double *%base, i64 %index, fp128 *%dst) {
				; CHECK-LABEL: f6:
				; CHECK: sllg %r1, %r3, 3
				; CHECK: mxdb %f0, 800(%r1,%r2)
				; CHECK: std %f0, 0(%r4)
				; CHECK: std %f2, 8(%r4)
				; CHECK: br %r14
				%ptr1 = getelementptr double, double *%base, i64 %index
				%ptr2 = getelementptr double, double *%ptr1, i64 100
				%f2 = load double, double *%ptr2
				%f1x = fpext double %f1 to fp128
				%f2x = fpext double %f2 to fp128
				%res = call fp128 @llvm.experimental.constrained.fmul.f128(
				fp128 %f1x, fp128 %f2x,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%dst
				ret void
				}

				; Check that multiplications of spilled values can use MXDB rather than MXDBR.
				define double @f7(double *%ptr0) {
				; CHECK-LABEL: f7:
				; CHECK: brasl %r14, foo@PLT
				; CHECK: mxdb %f0, 160(%r15)
				; CHECK: br %r14
				%ptr1 = getelementptr double, double *%ptr0, i64 2
				%ptr2 = getelementptr double, double *%ptr0, i64 4
				%ptr3 = getelementptr double, double *%ptr0, i64 6
				%ptr4 = getelementptr double, double *%ptr0, i64 8
				%ptr5 = getelementptr double, double *%ptr0, i64 10
				%ptr6 = getelementptr double, double *%ptr0, i64 12
				%ptr7 = getelementptr double, double *%ptr0, i64 14
				%ptr8 = getelementptr double, double *%ptr0, i64 16
				%ptr9 = getelementptr double, double *%ptr0, i64 18
				%ptr10 = getelementptr double, double *%ptr0, i64 20

				%val0 = load double, double *%ptr0
				%val1 = load double, double *%ptr1
				%val2 = load double, double *%ptr2
				%val3 = load double, double *%ptr3
				%val4 = load double, double *%ptr4
				%val5 = load double, double *%ptr5
				%val6 = load double, double *%ptr6
				%val7 = load double, double *%ptr7
				%val8 = load double, double *%ptr8
				%val9 = load double, double *%ptr9
				%val10 = load double, double *%ptr10

				%frob0 = fadd double %val0, %val0
				%frob1 = fadd double %val1, %val1
				%frob2 = fadd double %val2, %val2
				%frob3 = fadd double %val3, %val3
				%frob4 = fadd double %val4, %val4
				%frob5 = fadd double %val5, %val5
				%frob6 = fadd double %val6, %val6
				%frob7 = fadd double %val7, %val7
				%frob8 = fadd double %val8, %val8
				%frob9 = fadd double %val9, %val9
				%frob10 = fadd double %val9, %val10

				store double %frob0, double *%ptr0
				store double %frob1, double *%ptr1
				store double %frob2, double *%ptr2
				store double %frob3, double *%ptr3
				store double %frob4, double *%ptr4
				store double %frob5, double *%ptr5
				store double %frob6, double *%ptr6
				store double %frob7, double *%ptr7
				store double %frob8, double *%ptr8
				store double %frob9, double *%ptr9
				store double %frob10, double *%ptr10

				%ret = call double @foo()

				%accext0 = fpext double %ret to fp128
				%ext0 = fpext double %frob0 to fp128
				%mul0 = call fp128 @llvm.experimental.constrained.fmul.f128(
				fp128 %accext0, fp128 %ext0,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%const0 = fpext double 1.01 to fp128
				%extra0 = call fp128 @llvm.experimental.constrained.fmul.f128(
				fp128 %mul0, fp128 %const0,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%trunc0 = fptrunc fp128 %extra0 to double

				%accext1 = fpext double %trunc0 to fp128
				%ext1 = fpext double %frob1 to fp128
				%mul1 = call fp128 @llvm.experimental.constrained.fmul.f128(
				fp128 %accext1, fp128 %ext1,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%const1 = fpext double 1.11 to fp128
				%extra1 = call fp128 @llvm.experimental.constrained.fmul.f128(
				fp128 %mul1, fp128 %const1,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%trunc1 = fptrunc fp128 %extra1 to double

				%accext2 = fpext double %trunc1 to fp128
				%ext2 = fpext double %frob2 to fp128
				%mul2 = call fp128 @llvm.experimental.constrained.fmul.f128(
				fp128 %accext2, fp128 %ext2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%const2 = fpext double 1.21 to fp128
				%extra2 = call fp128 @llvm.experimental.constrained.fmul.f128(
				fp128 %mul2, fp128 %const2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%trunc2 = fptrunc fp128 %extra2 to double

				%accext3 = fpext double %trunc2 to fp128
				%ext3 = fpext double %frob3 to fp128
				%mul3 = call fp128 @llvm.experimental.constrained.fmul.f128(
				fp128 %accext3, fp128 %ext3,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%const3 = fpext double 1.31 to fp128
				%extra3 = call fp128 @llvm.experimental.constrained.fmul.f128(
				fp128 %mul3, fp128 %const3,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%trunc3 = fptrunc fp128 %extra3 to double

				%accext4 = fpext double %trunc3 to fp128
				%ext4 = fpext double %frob4 to fp128
				%mul4 = call fp128 @llvm.experimental.constrained.fmul.f128(
				fp128 %accext4, fp128 %ext4,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%const4 = fpext double 1.41 to fp128
				%extra4 = call fp128 @llvm.experimental.constrained.fmul.f128(
				fp128 %mul4, fp128 %const4,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%trunc4 = fptrunc fp128 %extra4 to double

				%accext5 = fpext double %trunc4 to fp128
				%ext5 = fpext double %frob5 to fp128
				%mul5 = call fp128 @llvm.experimental.constrained.fmul.f128(
				fp128 %accext5, fp128 %ext5,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%const5 = fpext double 1.51 to fp128
				%extra5 = call fp128 @llvm.experimental.constrained.fmul.f128(
				fp128 %mul5, fp128 %const5,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%trunc5 = fptrunc fp128 %extra5 to double

				%accext6 = fpext double %trunc5 to fp128
				%ext6 = fpext double %frob6 to fp128
				%mul6 = call fp128 @llvm.experimental.constrained.fmul.f128(
				fp128 %accext6, fp128 %ext6,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%const6 = fpext double 1.61 to fp128
				%extra6 = call fp128 @llvm.experimental.constrained.fmul.f128(
				fp128 %mul6, fp128 %const6,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%trunc6 = fptrunc fp128 %extra6 to double

				%accext7 = fpext double %trunc6 to fp128
				%ext7 = fpext double %frob7 to fp128
				%mul7 = call fp128 @llvm.experimental.constrained.fmul.f128(
				fp128 %accext7, fp128 %ext7,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%const7 = fpext double 1.71 to fp128
				%extra7 = call fp128 @llvm.experimental.constrained.fmul.f128(
				fp128 %mul7, fp128 %const7,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%trunc7 = fptrunc fp128 %extra7 to double

				%accext8 = fpext double %trunc7 to fp128
				%ext8 = fpext double %frob8 to fp128
				%mul8 = call fp128 @llvm.experimental.constrained.fmul.f128(
				fp128 %accext8, fp128 %ext8,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%const8 = fpext double 1.81 to fp128
				%extra8 = call fp128 @llvm.experimental.constrained.fmul.f128(
				fp128 %mul8, fp128 %const8,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%trunc8 = fptrunc fp128 %extra8 to double

				%accext9 = fpext double %trunc8 to fp128
				%ext9 = fpext double %frob9 to fp128
				%mul9 = call fp128 @llvm.experimental.constrained.fmul.f128(
				fp128 %accext9, fp128 %ext9,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%const9 = fpext double 1.91 to fp128
				%extra9 = call fp128 @llvm.experimental.constrained.fmul.f128(
				fp128 %mul9, fp128 %const9,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%trunc9 = fptrunc fp128 %extra9 to double

				ret double %trunc9
				}

llvm/trunk/test/CodeGen/SystemZ/fp-strict-mul-05.ll

				; Test strict multiplication of two f128s.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu | FileCheck %s

				declare fp128 @llvm.experimental.constrained.fmul.f128(fp128, fp128, metadata, metadata)

				; There is no memory form of 128-bit multiplication.
				define void @f1(fp128 *%ptr, float %f2) {
				; CHECK-LABEL: f1:
				; CHECK-DAG: lxebr %f0, %f0
				; CHECK-DAG: ld %f1, 0(%r2)
				; CHECK-DAG: ld %f3, 8(%r2)
				; CHECK: mxbr %f0, %f1
				; CHECK: std %f0, 0(%r2)
				; CHECK: std %f2, 8(%r2)
				; CHECK: br %r14
				%f1 = load fp128, fp128 *%ptr
				%f2x = fpext float %f2 to fp128
				%diff = call fp128 @llvm.experimental.constrained.fmul.f128(
				fp128 %f1, fp128 %f2x,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %diff, fp128 *%ptr
				ret void
				}
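
The f128 tests above rely on register pairing: the comment in fp-strict-mul-04.ll notes that `%f2` is the low register of the FP128 value in `%f0`, and the CHECK lines here load `%f1`/`%f3` and store `%f0`/`%f2`. The sketch below models that pairing, assuming the convention that a 128-bit value occupies floating-point registers n and n+2. The function names are hypothetical and do not correspond to anything in LLVM.

```python
# Hypothetical model, not LLVM code: FP128 values live in register pairs
# (n, n + 2), e.g. %f0/%f2, %f1/%f3, %f4/%f6.

def fp128_low_half(high: int) -> int:
    """Return the register holding the low half of the pair starting at `high`."""
    if not 0 <= high <= 13 or high & 2:
        raise ValueError(f"%f{high} cannot start an FP128 register pair")
    return high + 2


def overlaps_pair(high: int, reg: int) -> bool:
    """True if scalar register `reg` is one half of the pair starting at `high`."""
    return reg in (high, fp128_low_half(high))


if __name__ == "__main__":
    # "mxdbr %f0, %f2" is rejected because %f2 belongs to the %f0 pair;
    # passing the multiplier in %f4 avoids the overlap.
    print(overlaps_pair(0, 2), overlaps_pair(0, 4))
```

This is why f1 in fp-strict-mul-04.ll takes a `%dummy` argument: it pushes the second multiplier out of `%f2` and into `%f4`.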

llvm/trunk/test/CodeGen/SystemZ/fp-strict-mul-06.ll

				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z10 \
				; RUN: | FileCheck -check-prefix=CHECK -check-prefix=CHECK-SCALAR %s
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z14 \
				; RUN: | FileCheck -check-prefix=CHECK -check-prefix=CHECK-VECTOR %s

				declare float @llvm.experimental.constrained.fma.f32(float, float, float, metadata, metadata)

				define float @f1(float %f1, float %f2, float %acc) {
				; CHECK-LABEL: f1:
				; CHECK-SCALAR: maebr %f4, %f0, %f2
				; CHECK-SCALAR: ler %f0, %f4
				; CHECK-VECTOR: wfmasb %f0, %f0, %f2, %f4
				; CHECK: br %r14
				%res = call float @llvm.experimental.constrained.fma.f32 (
				float %f1, float %f2, float %acc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				define float @f2(float %f1, float *%ptr, float %acc) {
				; CHECK-LABEL: f2:
				; CHECK: maeb %f2, %f0, 0(%r2)
				; CHECK-SCALAR: ler %f0, %f2
				; CHECK-VECTOR: ldr %f0, %f2
				; CHECK: br %r14
				%f2 = load float, float *%ptr
				%res = call float @llvm.experimental.constrained.fma.f32 (
				float %f1, float %f2, float %acc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				define float @f3(float %f1, float *%base, float %acc) {
				; CHECK-LABEL: f3:
				; CHECK: maeb %f2, %f0, 4092(%r2)
				; CHECK-SCALAR: ler %f0, %f2
				; CHECK-VECTOR: ldr %f0, %f2
				; CHECK: br %r14
				%ptr = getelementptr float, float *%base, i64 1023
				%f2 = load float, float *%ptr
				%res = call float @llvm.experimental.constrained.fma.f32 (
				float %f1, float %f2, float %acc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				define float @f4(float %f1, float *%base, float %acc) {
				; The important thing here is that we don't generate an out-of-range
				; displacement. Other sequences besides this one would be OK.
				;
				; CHECK-LABEL: f4:
				; CHECK: aghi %r2, 4096
				; CHECK: maeb %f2, %f0, 0(%r2)
				; CHECK-SCALAR: ler %f0, %f2
				; CHECK-VECTOR: ldr %f0, %f2
				; CHECK: br %r14
				%ptr = getelementptr float, float *%base, i64 1024
				%f2 = load float, float *%ptr
				%res = call float @llvm.experimental.constrained.fma.f32 (
				float %f1, float %f2, float %acc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				define float @f5(float %f1, float *%base, float %acc) {
				; Here too the important thing is that we don't generate an out-of-range
				; displacement. Other sequences besides this one would be OK.
				;
				; CHECK-LABEL: f5:
				; CHECK: aghi %r2, -4
				; CHECK: maeb %f2, %f0, 0(%r2)
				; CHECK-SCALAR: ler %f0, %f2
				; CHECK-VECTOR: ldr %f0, %f2
				; CHECK: br %r14
				%ptr = getelementptr float, float *%base, i64 -1
				%f2 = load float, float *%ptr
				%res = call float @llvm.experimental.constrained.fma.f32 (
				float %f1, float %f2, float %acc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				define float @f6(float %f1, float *%base, i64 %index, float %acc) {
				; CHECK-LABEL: f6:
				; CHECK: sllg %r1, %r3, 2
				; CHECK: maeb %f2, %f0, 0(%r1,%r2)
				; CHECK-SCALAR: ler %f0, %f2
				; CHECK-VECTOR: ldr %f0, %f2
				; CHECK: br %r14
				%ptr = getelementptr float, float *%base, i64 %index
				%f2 = load float, float *%ptr
				%res = call float @llvm.experimental.constrained.fma.f32 (
				float %f1, float %f2, float %acc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				define float @f7(float %f1, float *%base, i64 %index, float %acc) {
				; CHECK-LABEL: f7:
				; CHECK: sllg %r1, %r3, 2
				; CHECK: maeb %f2, %f0, 4092({{%r1,%r2|%r2,%r1}})
				; CHECK-SCALAR: ler %f0, %f2
				; CHECK-VECTOR: ldr %f0, %f2
				; CHECK: br %r14
				%index2 = add i64 %index, 1023
				%ptr = getelementptr float, float *%base, i64 %index2
				%f2 = load float, float *%ptr
				%res = call float @llvm.experimental.constrained.fma.f32 (
				float %f1, float %f2, float %acc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				define float @f8(float %f1, float *%base, i64 %index, float %acc) {
				; CHECK-LABEL: f8:
				; CHECK: sllg %r1, %r3, 2
				; CHECK: lay %r1, 4096({{%r1,%r2|%r2,%r1}})
				; CHECK: maeb %f2, %f0, 0(%r1)
				; CHECK-SCALAR: ler %f0, %f2
				; CHECK-VECTOR: ldr %f0, %f2
				; CHECK: br %r14
				%index2 = add i64 %index, 1024
				%ptr = getelementptr float, float *%base, i64 %index2
				%f2 = load float, float *%ptr
				%res = call float @llvm.experimental.constrained.fma.f32 (
				float %f1, float %f2, float %acc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

llvm/trunk/test/CodeGen/SystemZ/fp-strict-mul-07.ll

				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z10 \
				; RUN: | FileCheck -check-prefix=CHECK -check-prefix=CHECK-SCALAR %s
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z13 \
				; RUN: | FileCheck -check-prefix=CHECK -check-prefix=CHECK-VECTOR %s

				declare double @llvm.experimental.constrained.fma.f64(double %f1, double %f2, double %f3, metadata, metadata)

				define double @f1(double %f1, double %f2, double %acc) {
				; CHECK-LABEL: f1:
				; CHECK-SCALAR: madbr %f4, %f0, %f2
				; CHECK-SCALAR: ldr %f0, %f4
				; CHECK-VECTOR: wfmadb %f0, %f0, %f2, %f4
				; CHECK: br %r14
				%res = call double @llvm.experimental.constrained.fma.f64 (
				double %f1, double %f2, double %acc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				define double @f2(double %f1, double *%ptr, double %acc) {
				; CHECK-LABEL: f2:
				; CHECK: madb %f2, %f0, 0(%r2)
				; CHECK: ldr %f0, %f2
				; CHECK: br %r14
				%f2 = load double, double *%ptr
				%res = call double @llvm.experimental.constrained.fma.f64 (
				double %f1, double %f2, double %acc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				define double @f3(double %f1, double *%base, double %acc) {
				; CHECK-LABEL: f3:
				; CHECK: madb %f2, %f0, 4088(%r2)
				; CHECK: ldr %f0, %f2
				; CHECK: br %r14
				%ptr = getelementptr double, double *%base, i64 511
				%f2 = load double, double *%ptr
				%res = call double @llvm.experimental.constrained.fma.f64 (
				double %f1, double %f2, double %acc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				define double @f4(double %f1, double *%base, double %acc) {
				; The important thing here is that we don't generate an out-of-range
				; displacement. Other sequences besides this one would be OK.
				;
				; CHECK-LABEL: f4:
				; CHECK: aghi %r2, 4096
				; CHECK: madb %f2, %f0, 0(%r2)
				; CHECK: ldr %f0, %f2
				; CHECK: br %r14
				%ptr = getelementptr double, double *%base, i64 512
				%f2 = load double, double *%ptr
				%res = call double @llvm.experimental.constrained.fma.f64 (
				double %f1, double %f2, double %acc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				define double @f5(double %f1, double *%base, double %acc) {
				; Here too the important thing is that we don't generate an out-of-range
				; displacement. Other sequences besides this one would be OK.
				;
				; CHECK-LABEL: f5:
				; CHECK: aghi %r2, -8
				; CHECK: madb %f2, %f0, 0(%r2)
				; CHECK: ldr %f0, %f2
				; CHECK: br %r14
				%ptr = getelementptr double, double *%base, i64 -1
				%f2 = load double, double *%ptr
				%res = call double @llvm.experimental.constrained.fma.f64 (
				double %f1, double %f2, double %acc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				define double @f6(double %f1, double *%base, i64 %index, double %acc) {
				; CHECK-LABEL: f6:
				; CHECK: sllg %r1, %r3, 3
				; CHECK: madb %f2, %f0, 0(%r1,%r2)
				; CHECK: ldr %f0, %f2
				; CHECK: br %r14
				%ptr = getelementptr double, double *%base, i64 %index
				%f2 = load double, double *%ptr
				%res = call double @llvm.experimental.constrained.fma.f64 (
				double %f1, double %f2, double %acc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				define double @f7(double %f1, double *%base, i64 %index, double %acc) {
				; CHECK-LABEL: f7:
				; CHECK: sllg %r1, %r3, 3
				; CHECK: madb %f2, %f0, 4088({{%r1,%r2|%r2,%r1}})
				; CHECK: ldr %f0, %f2
				; CHECK: br %r14
				%index2 = add i64 %index, 511
				%ptr = getelementptr double, double *%base, i64 %index2
				%f2 = load double, double *%ptr
				%res = call double @llvm.experimental.constrained.fma.f64 (
				double %f1, double %f2, double %acc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				define double @f8(double %f1, double *%base, i64 %index, double %acc) {
				; CHECK-LABEL: f8:
				; CHECK: sllg %r1, %r3, 3
				; CHECK: lay %r1, 4096({{%r1,%r2|%r2,%r1}})
				; CHECK: madb %f2, %f0, 0(%r1)
				; CHECK: ldr %f0, %f2
				; CHECK: br %r14
				%index2 = add i64 %index, 512
				%ptr = getelementptr double, double *%base, i64 %index2
				%f2 = load double, double *%ptr
				%res = call double @llvm.experimental.constrained.fma.f64 (
				double %f1, double %f2, double %acc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

llvm/trunk/test/CodeGen/SystemZ/fp-strict-mul-08.ll

				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z10 \
				; RUN: | FileCheck -check-prefix=CHECK -check-prefix=CHECK-SCALAR %s
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z14 \
				; RUN: | FileCheck -check-prefix=CHECK -check-prefix=CHECK-VECTOR %s

				declare float @llvm.experimental.constrained.fma.f32(float %f1, float %f2, float %f3, metadata, metadata)

				define float @f1(float %f1, float %f2, float %acc) {
				; CHECK-LABEL: f1:
				; CHECK-SCALAR: msebr %f4, %f0, %f2
				; CHECK-SCALAR: ler %f0, %f4
				; CHECK-VECTOR: wfmssb %f0, %f0, %f2, %f4
				; CHECK: br %r14
				%negacc = fsub float -0.0, %acc
				%res = call float @llvm.experimental.constrained.fma.f32 (
				float %f1, float %f2, float %negacc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				define float @f2(float %f1, float *%ptr, float %acc) {
				; CHECK-LABEL: f2:
				; CHECK: mseb %f2, %f0, 0(%r2)
				; CHECK-SCALAR: ler %f0, %f2
				; CHECK-VECTOR: ldr %f0, %f2
				; CHECK: br %r14
				%f2 = load float, float *%ptr
				%negacc = fsub float -0.0, %acc
				%res = call float @llvm.experimental.constrained.fma.f32 (
				float %f1, float %f2, float %negacc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				define float @f3(float %f1, float *%base, float %acc) {
				; CHECK-LABEL: f3:
				; CHECK: mseb %f2, %f0, 4092(%r2)
				; CHECK-SCALAR: ler %f0, %f2
				; CHECK-VECTOR: ldr %f0, %f2
				; CHECK: br %r14
				%ptr = getelementptr float, float *%base, i64 1023
				%f2 = load float, float *%ptr
				%negacc = fsub float -0.0, %acc
				%res = call float @llvm.experimental.constrained.fma.f32 (
				float %f1, float %f2, float %negacc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				define float @f4(float %f1, float *%base, float %acc) {
				; The important thing here is that we don't generate an out-of-range
				; displacement. Other sequences besides this one would be OK.
				;
				; CHECK-LABEL: f4:
				; CHECK: aghi %r2, 4096
				; CHECK: mseb %f2, %f0, 0(%r2)
				; CHECK-SCALAR: ler %f0, %f2
				; CHECK-VECTOR: ldr %f0, %f2
				; CHECK: br %r14
				%ptr = getelementptr float, float *%base, i64 1024
				%f2 = load float, float *%ptr
				%negacc = fsub float -0.0, %acc
				%res = call float @llvm.experimental.constrained.fma.f32 (
				float %f1, float %f2, float %negacc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				define float @f5(float %f1, float *%base, float %acc) {
				; Here too the important thing is that we don't generate an out-of-range
				; displacement. Other sequences besides this one would be OK.
				;
				; CHECK-LABEL: f5:
				; CHECK: aghi %r2, -4
				; CHECK: mseb %f2, %f0, 0(%r2)
				; CHECK-SCALAR: ler %f0, %f2
				; CHECK-VECTOR: ldr %f0, %f2
				; CHECK: br %r14
				%ptr = getelementptr float, float *%base, i64 -1
				%f2 = load float, float *%ptr
				%negacc = fsub float -0.0, %acc
				%res = call float @llvm.experimental.constrained.fma.f32 (
				float %f1, float %f2, float %negacc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				define float @f6(float %f1, float *%base, i64 %index, float %acc) {
				; CHECK-LABEL: f6:
				; CHECK: sllg %r1, %r3, 2
				; CHECK: mseb %f2, %f0, 0(%r1,%r2)
				; CHECK-SCALAR: ler %f0, %f2
				; CHECK-VECTOR: ldr %f0, %f2
				; CHECK: br %r14
				%ptr = getelementptr float, float *%base, i64 %index
				%f2 = load float, float *%ptr
				%negacc = fsub float -0.0, %acc
				%res = call float @llvm.experimental.constrained.fma.f32 (
				float %f1, float %f2, float %negacc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				define float @f7(float %f1, float *%base, i64 %index, float %acc) {
				; CHECK-LABEL: f7:
				; CHECK: sllg %r1, %r3, 2
				; CHECK: mseb %f2, %f0, 4092({{%r1,%r2|%r2,%r1}})
				; CHECK-SCALAR: ler %f0, %f2
				; CHECK-VECTOR: ldr %f0, %f2
				; CHECK: br %r14
				%index2 = add i64 %index, 1023
				%ptr = getelementptr float, float *%base, i64 %index2
				%f2 = load float, float *%ptr
				%negacc = fsub float -0.0, %acc
				%res = call float @llvm.experimental.constrained.fma.f32 (
				float %f1, float %f2, float %negacc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				define float @f8(float %f1, float *%base, i64 %index, float %acc) {
				; CHECK-LABEL: f8:
				; CHECK: sllg %r1, %r3, 2
				; CHECK: lay %r1, 4096({{%r1,%r2|%r2,%r1}})
				; CHECK: mseb %f2, %f0, 0(%r1)
				; CHECK-SCALAR: ler %f0, %f2
				; CHECK-VECTOR: ldr %f0, %f2
				; CHECK: br %r14
				%index2 = add i64 %index, 1024
				%ptr = getelementptr float, float *%base, i64 %index2
				%f2 = load float, float *%ptr
				%negacc = fsub float -0.0, %acc
				%res = call float @llvm.experimental.constrained.fma.f32 (
				float %f1, float %f2, float %negacc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

llvm/trunk/test/CodeGen/SystemZ/fp-strict-mul-09.ll

				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z10 \
				; RUN: | FileCheck -check-prefix=CHECK -check-prefix=CHECK-SCALAR %s
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z13 \
				; RUN: | FileCheck -check-prefix=CHECK -check-prefix=CHECK-VECTOR %s

				declare double @llvm.experimental.constrained.fma.f64(double %f1, double %f2, double %f3, metadata, metadata)

				define double @f1(double %f1, double %f2, double %acc) {
				; CHECK-LABEL: f1:
				; CHECK-SCALAR: msdbr %f4, %f0, %f2
				; CHECK-SCALAR: ldr %f0, %f4
				; CHECK-VECTOR: wfmsdb %f0, %f0, %f2, %f4
				; CHECK: br %r14
				%negacc = fsub double -0.0, %acc
				%res = call double @llvm.experimental.constrained.fma.f64 (
				double %f1, double %f2, double %negacc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				define double @f2(double %f1, double *%ptr, double %acc) {
				; CHECK-LABEL: f2:
				; CHECK: msdb %f2, %f0, 0(%r2)
				; CHECK: ldr %f0, %f2
				; CHECK: br %r14
				%f2 = load double, double *%ptr
				%negacc = fsub double -0.0, %acc
				%res = call double @llvm.experimental.constrained.fma.f64 (
				double %f1, double %f2, double %negacc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				define double @f3(double %f1, double *%base, double %acc) {
				; CHECK-LABEL: f3:
				; CHECK: msdb %f2, %f0, 4088(%r2)
				; CHECK: ldr %f0, %f2
				; CHECK: br %r14
				%ptr = getelementptr double, double *%base, i64 511
				%f2 = load double, double *%ptr
				%negacc = fsub double -0.0, %acc
				%res = call double @llvm.experimental.constrained.fma.f64 (
				double %f1, double %f2, double %negacc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				define double @f4(double %f1, double *%base, double %acc) {
				; The important thing here is that we don't generate an out-of-range
				; displacement. Other sequences besides this one would be OK.
				;
				; CHECK-LABEL: f4:
				; CHECK: aghi %r2, 4096
				; CHECK: msdb %f2, %f0, 0(%r2)
				; CHECK: ldr %f0, %f2
				; CHECK: br %r14
				%ptr = getelementptr double, double *%base, i64 512
				%f2 = load double, double *%ptr
				%negacc = fsub double -0.0, %acc
				%res = call double @llvm.experimental.constrained.fma.f64 (
				double %f1, double %f2, double %negacc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				define double @f5(double %f1, double *%base, double %acc) {
				; Here too the important thing is that we don't generate an out-of-range
				; displacement. Other sequences besides this one would be OK.
				;
				; CHECK-LABEL: f5:
				; CHECK: aghi %r2, -8
				; CHECK: msdb %f2, %f0, 0(%r2)
				; CHECK: ldr %f0, %f2
				; CHECK: br %r14
				%ptr = getelementptr double, double *%base, i64 -1
				%f2 = load double, double *%ptr
				%negacc = fsub double -0.0, %acc
				%res = call double @llvm.experimental.constrained.fma.f64 (
				double %f1, double %f2, double %negacc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				define double @f6(double %f1, double *%base, i64 %index, double %acc) {
				; CHECK-LABEL: f6:
				; CHECK: sllg %r1, %r3, 3
				; CHECK: msdb %f2, %f0, 0(%r1,%r2)
				; CHECK: ldr %f0, %f2
				; CHECK: br %r14
				%ptr = getelementptr double, double *%base, i64 %index
				%f2 = load double, double *%ptr
				%negacc = fsub double -0.0, %acc
				%res = call double @llvm.experimental.constrained.fma.f64 (
				double %f1, double %f2, double %negacc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				define double @f7(double %f1, double *%base, i64 %index, double %acc) {
				; CHECK-LABEL: f7:
				; CHECK: sllg %r1, %r3, 3
				; CHECK: msdb %f2, %f0, 4088({{%r1,%r2|%r2,%r1}})
				; CHECK: ldr %f0, %f2
				; CHECK: br %r14
				%index2 = add i64 %index, 511
				%ptr = getelementptr double, double *%base, i64 %index2
				%f2 = load double, double *%ptr
				%negacc = fsub double -0.0, %acc
				%res = call double @llvm.experimental.constrained.fma.f64 (
				double %f1, double %f2, double %negacc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				define double @f8(double %f1, double *%base, i64 %index, double %acc) {
				; CHECK-LABEL: f8:
				; CHECK: sllg %r1, %r3, 3
				; CHECK: lay %r1, 4096({{%r1,%r2|%r2,%r1}})
				; CHECK: msdb %f2, %f0, 0(%r1)
				; CHECK: ldr %f0, %f2
				; CHECK: br %r14
				%index2 = add i64 %index, 512
				%ptr = getelementptr double, double *%base, i64 %index2
				%f2 = load double, double *%ptr
				%negacc = fsub double -0.0, %acc
				%res = call double @llvm.experimental.constrained.fma.f64 (
				double %f1, double %f2, double %negacc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

llvm/trunk/test/CodeGen/SystemZ/fp-strict-mul-10.ll

				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z14 | FileCheck %s

				declare double @llvm.experimental.constrained.fma.f64(double %f1, double %f2, double %f3, metadata, metadata)
				declare float @llvm.experimental.constrained.fma.f32(float %f1, float %f2, float %f3, metadata, metadata)

				define double @f1(double %f1, double %f2, double %acc) {
				; CHECK-LABEL: f1:
				; CHECK: wfnmadb %f0, %f0, %f2, %f4
				; CHECK: br %r14
				%res = call double @llvm.experimental.constrained.fma.f64 (
				double %f1, double %f2, double %acc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%negres = fsub double -0.0, %res
				ret double %negres
				}

				define double @f2(double %f1, double %f2, double %acc) {
				; CHECK-LABEL: f2:
				; CHECK: wfnmsdb %f0, %f0, %f2, %f4
				; CHECK: br %r14
				%negacc = fsub double -0.0, %acc
				%res = call double @llvm.experimental.constrained.fma.f64 (
				double %f1, double %f2, double %negacc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%negres = fsub double -0.0, %res
				ret double %negres
				}

				define float @f3(float %f1, float %f2, float %acc) {
				; CHECK-LABEL: f3:
				; CHECK: wfnmasb %f0, %f0, %f2, %f4
				; CHECK: br %r14
				%res = call float @llvm.experimental.constrained.fma.f32 (
				float %f1, float %f2, float %acc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%negres = fsub float -0.0, %res
				ret float %negres
				}

				define float @f4(float %f1, float %f2, float %acc) {
				; CHECK-LABEL: f4:
				; CHECK: wfnmssb %f0, %f0, %f2, %f4
				; CHECK: br %r14
				%negacc = fsub float -0.0, %acc
				%res = call float @llvm.experimental.constrained.fma.f32 (
				float %f1, float %f2, float %negacc,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%negres = fsub float -0.0, %res
				ret float %negres
				}

llvm/trunk/test/CodeGen/SystemZ/fp-strict-mul-11.ll

				; Test strict 128-bit floating-point multiplication on z14.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z14 | FileCheck %s

				declare fp128 @llvm.experimental.constrained.fmul.f128(fp128, fp128, metadata, metadata)

				define void @f1(fp128 *%ptr1, fp128 *%ptr2) {
				; CHECK-LABEL: f1:
				; CHECK-DAG: vl [[REG1:%v[0-9]+]], 0(%r2)
				; CHECK-DAG: vl [[REG2:%v[0-9]+]], 0(%r3)
				; CHECK: wfmxb [[RES:%v[0-9]+]], [[REG1]], [[REG2]]
				; CHECK: vst [[RES]], 0(%r2)
				; CHECK: br %r14
				%f1 = load fp128, fp128 *%ptr1
				%f2 = load fp128, fp128 *%ptr2
				%sum = call fp128 @llvm.experimental.constrained.fmul.f128(
				fp128 %f1, fp128 %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %sum, fp128 *%ptr1
				ret void
				}

				define void @f2(double %f1, double %f2, fp128 *%dst) {
				; CHECK-LABEL: f2:
				; CHECK-DAG: wflld [[REG1:%v[0-9]+]], %f0
				; CHECK-DAG: wflld [[REG2:%v[0-9]+]], %f2
				; CHECK: wfmxb [[RES:%v[0-9]+]], [[REG1]], [[REG2]]
				; CHECK: vst [[RES]], 0(%r2)
				; CHECK: br %r14
				%f1x = fpext double %f1 to fp128
				%f2x = fpext double %f2 to fp128
				%res = call fp128 @llvm.experimental.constrained.fmul.f128(
				fp128 %f1x, fp128 %f2x,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%dst
				ret void
				}

llvm/trunk/test/CodeGen/SystemZ/fp-strict-round-01.ll

				; Test strict rounding functions for z10.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z10 | FileCheck %s

				; Test rint for f32.
				declare float @llvm.experimental.constrained.rint.f32(float, metadata, metadata)
				define float @f1(float %f) {
				; CHECK-LABEL: f1:
				; CHECK: fiebr %f0, 0, %f0
				; CHECK: br %r14
				%res = call float @llvm.experimental.constrained.rint.f32(
				float %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Test rint for f64.
				declare double @llvm.experimental.constrained.rint.f64(double, metadata, metadata)
				define double @f2(double %f) {
				; CHECK-LABEL: f2:
				; CHECK: fidbr %f0, 0, %f0
				; CHECK: br %r14
				%res = call double @llvm.experimental.constrained.rint.f64(
				double %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Test rint for f128.
				declare fp128 @llvm.experimental.constrained.rint.f128(fp128, metadata, metadata)
				define void @f3(fp128 *%ptr) {
				; CHECK-LABEL: f3:
				; CHECK: fixbr %f0, 0, %f0
				; CHECK: br %r14
				%src = load fp128, fp128 *%ptr
				%res = call fp128 @llvm.experimental.constrained.rint.f128(
				fp128 %src,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%ptr
				ret void
				}

				; Test nearbyint for f32.
				declare float @llvm.experimental.constrained.nearbyint.f32(float, metadata, metadata)
				define float @f4(float %f) {
				; CHECK-LABEL: f4:
				; CHECK: brasl %r14, nearbyintf@PLT
				; CHECK: br %r14
				%res = call float @llvm.experimental.constrained.nearbyint.f32(
				float %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Test nearbyint for f64.
				declare double @llvm.experimental.constrained.nearbyint.f64(double, metadata, metadata)
				define double @f5(double %f) {
				; CHECK-LABEL: f5:
				; CHECK: brasl %r14, nearbyint@PLT
				; CHECK: br %r14
				%res = call double @llvm.experimental.constrained.nearbyint.f64(
				double %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Test nearbyint for f128.
				declare fp128 @llvm.experimental.constrained.nearbyint.f128(fp128, metadata, metadata)
				define void @f6(fp128 *%ptr) {
				; CHECK-LABEL: f6:
				; CHECK: brasl %r14, nearbyintl@PLT
				; CHECK: br %r14
				%src = load fp128, fp128 *%ptr
				%res = call fp128 @llvm.experimental.constrained.nearbyint.f128(
				fp128 %src,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%ptr
				ret void
				}

				; Test floor for f32.
				declare float @llvm.experimental.constrained.floor.f32(float, metadata, metadata)
				define float @f7(float %f) {
				; CHECK-LABEL: f7:
				; CHECK: brasl %r14, floorf@PLT
				; CHECK: br %r14
				%res = call float @llvm.experimental.constrained.floor.f32(
				float %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Test floor for f64.
				declare double @llvm.experimental.constrained.floor.f64(double, metadata, metadata)
				define double @f8(double %f) {
				; CHECK-LABEL: f8:
				; CHECK: brasl %r14, floor@PLT
				; CHECK: br %r14
				%res = call double @llvm.experimental.constrained.floor.f64(
				double %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Test floor for f128.
				declare fp128 @llvm.experimental.constrained.floor.f128(fp128, metadata, metadata)
				define void @f9(fp128 *%ptr) {
				; CHECK-LABEL: f9:
				; CHECK: brasl %r14, floorl@PLT
				; CHECK: br %r14
				%src = load fp128, fp128 *%ptr
				%res = call fp128 @llvm.experimental.constrained.floor.f128(
				fp128 %src,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%ptr
				ret void
				}

				; Test ceil for f32.
				declare float @llvm.experimental.constrained.ceil.f32(float, metadata, metadata)
				define float @f10(float %f) {
				; CHECK-LABEL: f10:
				; CHECK: brasl %r14, ceilf@PLT
				; CHECK: br %r14
				%res = call float @llvm.experimental.constrained.ceil.f32(
				float %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Test ceil for f64.
				declare double @llvm.experimental.constrained.ceil.f64(double, metadata, metadata)
				define double @f11(double %f) {
				; CHECK-LABEL: f11:
				; CHECK: brasl %r14, ceil@PLT
				; CHECK: br %r14
				%res = call double @llvm.experimental.constrained.ceil.f64(
				double %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Test ceil for f128.
				declare fp128 @llvm.experimental.constrained.ceil.f128(fp128, metadata, metadata)
				define void @f12(fp128 *%ptr) {
				; CHECK-LABEL: f12:
				; CHECK: brasl %r14, ceill@PLT
				; CHECK: br %r14
				%src = load fp128, fp128 *%ptr
				%res = call fp128 @llvm.experimental.constrained.ceil.f128(
				fp128 %src,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%ptr
				ret void
				}

				; Test trunc for f32.
				declare float @llvm.experimental.constrained.trunc.f32(float, metadata, metadata)
				define float @f13(float %f) {
				; CHECK-LABEL: f13:
				; CHECK: brasl %r14, truncf@PLT
				; CHECK: br %r14
				%res = call float @llvm.experimental.constrained.trunc.f32(
				float %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Test trunc for f64.
				declare double @llvm.experimental.constrained.trunc.f64(double, metadata, metadata)
				define double @f14(double %f) {
				; CHECK-LABEL: f14:
				; CHECK: brasl %r14, trunc@PLT
				; CHECK: br %r14
				%res = call double @llvm.experimental.constrained.trunc.f64(
				double %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Test trunc for f128.
				declare fp128 @llvm.experimental.constrained.trunc.f128(fp128, metadata, metadata)
				define void @f15(fp128 *%ptr) {
				; CHECK-LABEL: f15:
				; CHECK: brasl %r14, truncl@PLT
				; CHECK: br %r14
				%src = load fp128, fp128 *%ptr
				%res = call fp128 @llvm.experimental.constrained.trunc.f128(
				fp128 %src,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%ptr
				ret void
				}

				; Test round for f32.
				declare float @llvm.experimental.constrained.round.f32(float, metadata, metadata)
				define float @f16(float %f) {
				; CHECK-LABEL: f16:
				; CHECK: brasl %r14, roundf@PLT
				; CHECK: br %r14
				%res = call float @llvm.experimental.constrained.round.f32(
				float %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Test round for f64.
				declare double @llvm.experimental.constrained.round.f64(double, metadata, metadata)
				define double @f17(double %f) {
				; CHECK-LABEL: f17:
				; CHECK: brasl %r14, round@PLT
				; CHECK: br %r14
				%res = call double @llvm.experimental.constrained.round.f64(
				double %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Test round for f128.
				declare fp128 @llvm.experimental.constrained.round.f128(fp128, metadata, metadata)
				define void @f18(fp128 *%ptr) {
				; CHECK-LABEL: f18:
				; CHECK: brasl %r14, roundl@PLT
				; CHECK: br %r14
				%src = load fp128, fp128 *%ptr
				%res = call fp128 @llvm.experimental.constrained.round.f128(
				fp128 %src,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%ptr
				ret void
				}

llvm/trunk/test/CodeGen/SystemZ/fp-strict-round-02.ll

				; Test strict rounding functions for z196 and above.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z196 \
				; RUN: | FileCheck -check-prefix=CHECK -check-prefix=CHECK-SCALAR %s
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z13 \
				; RUN: | FileCheck -check-prefix=CHECK -check-prefix=CHECK-VECTOR %s

				; Test rint for f32.
				declare float @llvm.experimental.constrained.rint.f32(float, metadata, metadata)
				define float @f1(float %f) {
				; CHECK-LABEL: f1:
				; CHECK: fiebr %f0, 0, %f0
				; CHECK: br %r14
				%res = call float @llvm.experimental.constrained.rint.f32(
				float %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Test rint for f64.
				declare double @llvm.experimental.constrained.rint.f64(double, metadata, metadata)
				define double @f2(double %f) {
				; CHECK-LABEL: f2:
				; CHECK-SCALAR: fidbr %f0, 0, %f0
				; CHECK-VECTOR: fidbra %f0, 0, %f0, 0
				; CHECK: br %r14
				%res = call double @llvm.experimental.constrained.rint.f64(
				double %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Test rint for f128.
				declare fp128 @llvm.experimental.constrained.rint.f128(fp128, metadata, metadata)
				define void @f3(fp128 *%ptr) {
				; CHECK-LABEL: f3:
				; CHECK: fixbr %f0, 0, %f0
				; CHECK: br %r14
				%src = load fp128, fp128 *%ptr
				%res = call fp128 @llvm.experimental.constrained.rint.f128(
				fp128 %src,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%ptr
				ret void
				}

				; Test nearbyint for f32.
				declare float @llvm.experimental.constrained.nearbyint.f32(float, metadata, metadata)
				define float @f4(float %f) {
				; CHECK-LABEL: f4:
				; CHECK: fiebra %f0, 0, %f0, 4
				; CHECK: br %r14
				%res = call float @llvm.experimental.constrained.nearbyint.f32(
				float %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Test nearbyint for f64.
				declare double @llvm.experimental.constrained.nearbyint.f64(double, metadata, metadata)
				define double @f5(double %f) {
				; CHECK-LABEL: f5:
				; CHECK: fidbra %f0, 0, %f0, 4
				; CHECK: br %r14
				%res = call double @llvm.experimental.constrained.nearbyint.f64(
				double %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Test nearbyint for f128.
				declare fp128 @llvm.experimental.constrained.nearbyint.f128(fp128, metadata, metadata)
				define void @f6(fp128 *%ptr) {
				; CHECK-LABEL: f6:
				; CHECK: fixbra %f0, 0, %f0, 4
				; CHECK: br %r14
				%src = load fp128, fp128 *%ptr
				%res = call fp128 @llvm.experimental.constrained.nearbyint.f128(
				fp128 %src,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%ptr
				ret void
				}

				; Test floor for f32.
				declare float @llvm.experimental.constrained.floor.f32(float, metadata, metadata)
				define float @f7(float %f) {
				; CHECK-LABEL: f7:
				; CHECK: fiebra %f0, 7, %f0, 4
				; CHECK: br %r14
				%res = call float @llvm.experimental.constrained.floor.f32(
				float %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Test floor for f64.
				declare double @llvm.experimental.constrained.floor.f64(double, metadata, metadata)
				define double @f8(double %f) {
				; CHECK-LABEL: f8:
				; CHECK: fidbra %f0, 7, %f0, 4
				; CHECK: br %r14
				%res = call double @llvm.experimental.constrained.floor.f64(
				double %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Test floor for f128.
				declare fp128 @llvm.experimental.constrained.floor.f128(fp128, metadata, metadata)
				define void @f9(fp128 *%ptr) {
				; CHECK-LABEL: f9:
				; CHECK: fixbra %f0, 7, %f0, 4
				; CHECK: br %r14
				%src = load fp128, fp128 *%ptr
				%res = call fp128 @llvm.experimental.constrained.floor.f128(
				fp128 %src,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%ptr
				ret void
				}

				; Test ceil for f32.
				declare float @llvm.experimental.constrained.ceil.f32(float, metadata, metadata)
				define float @f10(float %f) {
				; CHECK-LABEL: f10:
				; CHECK: fiebra %f0, 6, %f0, 4
				; CHECK: br %r14
				%res = call float @llvm.experimental.constrained.ceil.f32(
				float %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Test ceil for f64.
				declare double @llvm.experimental.constrained.ceil.f64(double, metadata, metadata)
				define double @f11(double %f) {
				; CHECK-LABEL: f11:
				; CHECK: fidbra %f0, 6, %f0, 4
				; CHECK: br %r14
				%res = call double @llvm.experimental.constrained.ceil.f64(
				double %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Test ceil for f128.
				declare fp128 @llvm.experimental.constrained.ceil.f128(fp128, metadata, metadata)
				define void @f12(fp128 *%ptr) {
				; CHECK-LABEL: f12:
				; CHECK: fixbra %f0, 6, %f0, 4
				; CHECK: br %r14
				%src = load fp128, fp128 *%ptr
				%res = call fp128 @llvm.experimental.constrained.ceil.f128(
				fp128 %src,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%ptr
				ret void
				}

				; Test trunc for f32.
				declare float @llvm.experimental.constrained.trunc.f32(float, metadata, metadata)
				define float @f13(float %f) {
				; CHECK-LABEL: f13:
				; CHECK: fiebra %f0, 5, %f0, 4
				; CHECK: br %r14
				%res = call float @llvm.experimental.constrained.trunc.f32(
				float %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Test trunc for f64.
				declare double @llvm.experimental.constrained.trunc.f64(double, metadata, metadata)
				define double @f14(double %f) {
				; CHECK-LABEL: f14:
				; CHECK: fidbra %f0, 5, %f0, 4
				; CHECK: br %r14
				%res = call double @llvm.experimental.constrained.trunc.f64(
				double %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Test trunc for f128.
				declare fp128 @llvm.experimental.constrained.trunc.f128(fp128, metadata, metadata)
				define void @f15(fp128 *%ptr) {
				; CHECK-LABEL: f15:
				; CHECK: fixbra %f0, 5, %f0, 4
				; CHECK: br %r14
				%src = load fp128, fp128 *%ptr
				%res = call fp128 @llvm.experimental.constrained.trunc.f128(
				fp128 %src,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%ptr
				ret void
				}

				; Test round for f32.
				declare float @llvm.experimental.constrained.round.f32(float, metadata, metadata)
				define float @f16(float %f) {
				; CHECK-LABEL: f16:
				; CHECK: fiebra %f0, 1, %f0, 4
				; CHECK: br %r14
				%res = call float @llvm.experimental.constrained.round.f32(
				float %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Test round for f64.
				declare double @llvm.experimental.constrained.round.f64(double, metadata, metadata)
				define double @f17(double %f) {
				; CHECK-LABEL: f17:
				; CHECK: fidbra %f0, 1, %f0, 4
				; CHECK: br %r14
				%res = call double @llvm.experimental.constrained.round.f64(
				double %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Test round for f128.
				declare fp128 @llvm.experimental.constrained.round.f128(fp128, metadata, metadata)
				define void @f18(fp128 *%ptr) {
				; CHECK-LABEL: f18:
				; CHECK: fixbra %f0, 1, %f0, 4
				; CHECK: br %r14
				%src = load fp128, fp128 *%ptr
				%res = call fp128 @llvm.experimental.constrained.round.f128(
				fp128 %src,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%ptr
				ret void
				}

llvm/trunk/test/CodeGen/SystemZ/fp-strict-round-03.ll

				; Test strict rounding functions for z14 and above.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z14 | FileCheck %s

				; Test rint for f32.
				declare float @llvm.experimental.constrained.rint.f32(float, metadata, metadata)
				define float @f1(float %f) {
				; CHECK-LABEL: f1:
				; CHECK: fiebra %f0, 0, %f0, 0
				; CHECK: br %r14
				%res = call float @llvm.experimental.constrained.rint.f32(
				float %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Test rint for f64.
				declare double @llvm.experimental.constrained.rint.f64(double, metadata, metadata)
				define double @f2(double %f) {
				; CHECK-LABEL: f2:
				; CHECK: fidbra %f0, 0, %f0, 0
				; CHECK: br %r14
				%res = call double @llvm.experimental.constrained.rint.f64(
				double %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Test rint for f128.
				declare fp128 @llvm.experimental.constrained.rint.f128(fp128, metadata, metadata)
				define void @f3(fp128 *%ptr) {
				; CHECK-LABEL: f3:
				; CHECK: vl [[REG:%v[0-9]+]], 0(%r2)
				; CHECK: wfixb [[RES:%v[0-9]+]], [[REG]], 0, 0
				; CHECK: vst [[RES]], 0(%r2)
				; CHECK: br %r14
				%src = load fp128, fp128 *%ptr
				%res = call fp128 @llvm.experimental.constrained.rint.f128(
				fp128 %src,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%ptr
				ret void
				}

				; Test nearbyint for f32.
				declare float @llvm.experimental.constrained.nearbyint.f32(float, metadata, metadata)
				define float @f4(float %f) {
				; CHECK-LABEL: f4:
				; CHECK: fiebra %f0, 0, %f0, 4
				; CHECK: br %r14
				%res = call float @llvm.experimental.constrained.nearbyint.f32(
				float %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Test nearbyint for f64.
				declare double @llvm.experimental.constrained.nearbyint.f64(double, metadata, metadata)
				define double @f5(double %f) {
				; CHECK-LABEL: f5:
				; CHECK: fidbra %f0, 0, %f0, 4
				; CHECK: br %r14
				%res = call double @llvm.experimental.constrained.nearbyint.f64(
				double %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Test nearbyint for f128.
				declare fp128 @llvm.experimental.constrained.nearbyint.f128(fp128, metadata, metadata)
				define void @f6(fp128 *%ptr) {
				; CHECK-LABEL: f6:
				; CHECK: vl [[REG:%v[0-9]+]], 0(%r2)
				; CHECK: wfixb [[RES:%v[0-9]+]], [[REG]], 4, 0
				; CHECK: vst [[RES]], 0(%r2)
				; CHECK: br %r14
				%src = load fp128, fp128 *%ptr
				%res = call fp128 @llvm.experimental.constrained.nearbyint.f128(
				fp128 %src,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%ptr
				ret void
				}

				; Test floor for f32.
				declare float @llvm.experimental.constrained.floor.f32(float, metadata, metadata)
				define float @f7(float %f) {
				; CHECK-LABEL: f7:
				; CHECK: fiebra %f0, 7, %f0, 4
				; CHECK: br %r14
				%res = call float @llvm.experimental.constrained.floor.f32(
				float %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Test floor for f64.
				declare double @llvm.experimental.constrained.floor.f64(double, metadata, metadata)
				define double @f8(double %f) {
				; CHECK-LABEL: f8:
				; CHECK: fidbra %f0, 7, %f0, 4
				; CHECK: br %r14
				%res = call double @llvm.experimental.constrained.floor.f64(
				double %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Test floor for f128.
				declare fp128 @llvm.experimental.constrained.floor.f128(fp128, metadata, metadata)
				define void @f9(fp128 *%ptr) {
				; CHECK-LABEL: f9:
				; CHECK: vl [[REG:%v[0-9]+]], 0(%r2)
				; CHECK: wfixb [[RES:%v[0-9]+]], [[REG]], 4, 7
				; CHECK: vst [[RES]], 0(%r2)
				; CHECK: br %r14
				%src = load fp128, fp128 *%ptr
				%res = call fp128 @llvm.experimental.constrained.floor.f128(
				fp128 %src,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%ptr
				ret void
				}

				; Test ceil for f32.
				declare float @llvm.experimental.constrained.ceil.f32(float, metadata, metadata)
				define float @f10(float %f) {
				; CHECK-LABEL: f10:
				; CHECK: fiebra %f0, 6, %f0, 4
				; CHECK: br %r14
				%res = call float @llvm.experimental.constrained.ceil.f32(
				float %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Test ceil for f64.
				declare double @llvm.experimental.constrained.ceil.f64(double, metadata, metadata)
				define double @f11(double %f) {
				; CHECK-LABEL: f11:
				; CHECK: fidbra %f0, 6, %f0, 4
				; CHECK: br %r14
				%res = call double @llvm.experimental.constrained.ceil.f64(
				double %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Test ceil for f128.
				declare fp128 @llvm.experimental.constrained.ceil.f128(fp128, metadata, metadata)
				define void @f12(fp128 *%ptr) {
				; CHECK-LABEL: f12:
				; CHECK: vl [[REG:%v[0-9]+]], 0(%r2)
				; CHECK: wfixb [[RES:%v[0-9]+]], [[REG]], 4, 6
				; CHECK: vst [[RES]], 0(%r2)
				; CHECK: br %r14
				%src = load fp128, fp128 *%ptr
				%res = call fp128 @llvm.experimental.constrained.ceil.f128(
				fp128 %src,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%ptr
				ret void
				}

				; Test trunc for f32.
				declare float @llvm.experimental.constrained.trunc.f32(float, metadata, metadata)
				define float @f13(float %f) {
				; CHECK-LABEL: f13:
				; CHECK: fiebra %f0, 5, %f0, 4
				; CHECK: br %r14
				%res = call float @llvm.experimental.constrained.trunc.f32(
				float %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Test trunc for f64.
				declare double @llvm.experimental.constrained.trunc.f64(double, metadata, metadata)
				define double @f14(double %f) {
				; CHECK-LABEL: f14:
				; CHECK: fidbra %f0, 5, %f0, 4
				; CHECK: br %r14
				%res = call double @llvm.experimental.constrained.trunc.f64(
				double %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Test trunc for f128.
				declare fp128 @llvm.experimental.constrained.trunc.f128(fp128, metadata, metadata)
				define void @f15(fp128 *%ptr) {
				; CHECK-LABEL: f15:
				; CHECK: vl [[REG:%v[0-9]+]], 0(%r2)
				; CHECK: wfixb [[RES:%v[0-9]+]], [[REG]], 4, 5
				; CHECK: vst [[RES]], 0(%r2)
				; CHECK: br %r14
				%src = load fp128, fp128 *%ptr
				%res = call fp128 @llvm.experimental.constrained.trunc.f128(
				fp128 %src,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%ptr
				ret void
				}

				; Test round for f32.
				declare float @llvm.experimental.constrained.round.f32(float, metadata, metadata)
				define float @f16(float %f) {
				; CHECK-LABEL: f16:
				; CHECK: fiebra %f0, 1, %f0, 4
				; CHECK: br %r14
				%res = call float @llvm.experimental.constrained.round.f32(
				float %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Test round for f64.
				declare double @llvm.experimental.constrained.round.f64(double, metadata, metadata)
				define double @f17(double %f) {
				; CHECK-LABEL: f17:
				; CHECK: fidbra %f0, 1, %f0, 4
				; CHECK: br %r14
				%res = call double @llvm.experimental.constrained.round.f64(
				double %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Test round for f128.
				declare fp128 @llvm.experimental.constrained.round.f128(fp128, metadata, metadata)
				define void @f18(fp128 *%ptr) {
				; CHECK-LABEL: f18:
				; CHECK: vl [[REG:%v[0-9]+]], 0(%r2)
				; CHECK: wfixb [[RES:%v[0-9]+]], [[REG]], 4, 1
				; CHECK: vst [[RES]], 0(%r2)
				; CHECK: br %r14
				%src = load fp128, fp128 *%ptr
				%res = call fp128 @llvm.experimental.constrained.round.f128(
				fp128 %src,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%ptr
				ret void
				}
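The CHECK lines in the rounding tests above follow a regular pattern: each constrained intrinsic maps to one rounding-mode operand, and only rint leaves the final flag operand at 0 (the others use 4, which presumably suppresses the inexact exception, as nearbyint and friends require). The Python sketch below merely tabulates the values read from those CHECK lines. The names `ROUNDING_OPERANDS`, `expected_scalar_check`, and `expected_wfixb_operands` are hypothetical helpers, not part of LLVM.

```python
# Operand values copied from the CHECK lines above:
# intrinsic name -> (rounding-mode operand, flag operand).
ROUNDING_OPERANDS = {
    "rint": (0, 0),
    "nearbyint": (0, 4),
    "floor": (7, 4),
    "ceil": (6, 4),
    "trunc": (5, 4),
    "round": (1, 4),
}

# Scalar mnemonics seen in the tests, keyed by the IR type suffix.
SCALAR_MNEMONIC = {"f32": "fiebra", "f64": "fidbra", "f128": "fixbra"}


def expected_scalar_check(op, ty):
    """Build the scalar CHECK text, e.g. 'fiebra %f0, 7, %f0, 4' for floor.f32."""
    mode, flag = ROUNDING_OPERANDS[op]
    return f"{SCALAR_MNEMONIC[ty]} %f0, {mode}, %f0, {flag}"


def expected_wfixb_operands(op):
    """The z14 wfixb CHECK lines list the flag operand before the rounding mode."""
    mode, flag = ROUNDING_OPERANDS[op]
    return f"{flag}, {mode}"


if __name__ == "__main__":
    for op in ROUNDING_OPERANDS:
        print(op, "|", expected_scalar_check(op, "f64"), "|", expected_wfixb_operands(op))
```

Note the operand order: the scalar instructions place the rounding mode before the source register and the flag last, while the `wfixb` lines in the z14 test place the flag first.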

llvm/trunk/test/CodeGen/SystemZ/fp-strict-sqrt-01.ll

				; Test strict 32-bit square root.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z10 \
				; RUN: | FileCheck -check-prefix=CHECK -check-prefix=CHECK-SCALAR %s
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z14 | FileCheck %s

				declare float @llvm.experimental.constrained.sqrt.f32(float, metadata, metadata)

				; Check register square root.
				define float @f1(float %val) {
				; CHECK-LABEL: f1:
				; CHECK: sqebr %f0, %f0
				; CHECK: br %r14
				%res = call float @llvm.experimental.constrained.sqrt.f32(
				float %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Check the low end of the SQEB range.
				define float @f2(float *%ptr) {
				; CHECK-LABEL: f2:
				; CHECK: sqeb %f0, 0(%r2)
				; CHECK: br %r14
				%val = load float, float *%ptr
				%res = call float @llvm.experimental.constrained.sqrt.f32(
				float %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Check the high end of the aligned SQEB range.
				define float @f3(float *%base) {
				; CHECK-LABEL: f3:
				; CHECK: sqeb %f0, 4092(%r2)
				; CHECK: br %r14
				%ptr = getelementptr float, float *%base, i64 1023
				%val = load float, float *%ptr
				%res = call float @llvm.experimental.constrained.sqrt.f32(
				float %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Check the next word up, which needs separate address logic.
				; Other sequences besides this one would be OK.
				define float @f4(float *%base) {
				; CHECK-LABEL: f4:
				; CHECK: aghi %r2, 4096
				; CHECK: sqeb %f0, 0(%r2)
				; CHECK: br %r14
				%ptr = getelementptr float, float *%base, i64 1024
				%val = load float, float *%ptr
				%res = call float @llvm.experimental.constrained.sqrt.f32(
				float %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Check negative displacements, which also need separate address logic.
				define float @f5(float *%base) {
				; CHECK-LABEL: f5:
				; CHECK: aghi %r2, -4
				; CHECK: sqeb %f0, 0(%r2)
				; CHECK: br %r14
				%ptr = getelementptr float, float *%base, i64 -1
				%val = load float, float *%ptr
				%res = call float @llvm.experimental.constrained.sqrt.f32(
				float %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Check that SQEB allows indices.
				define float @f6(float *%base, i64 %index) {
				; CHECK-LABEL: f6:
				; CHECK: sllg %r1, %r3, 2
				; CHECK: sqeb %f0, 400(%r1,%r2)
				; CHECK: br %r14
				%ptr1 = getelementptr float, float *%base, i64 %index
				%ptr2 = getelementptr float, float *%ptr1, i64 100
				%val = load float, float *%ptr2
				%res = call float @llvm.experimental.constrained.sqrt.f32(
				float %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

llvm/trunk/test/CodeGen/SystemZ/fp-strict-sqrt-02.ll

				; Test strict 64-bit square root.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z10 \
				; RUN: | FileCheck -check-prefix=CHECK -check-prefix=CHECK-SCALAR %s
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z13 | FileCheck %s

				declare double @llvm.experimental.constrained.sqrt.f64(double, metadata, metadata)

				; Check register square root.
				define double @f1(double %val) {
				; CHECK-LABEL: f1:
				; CHECK: sqdbr %f0, %f0
				; CHECK: br %r14
				%res = call double @llvm.experimental.constrained.sqrt.f64(
				double %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check the low end of the SQDB range.
				define double @f2(double *%ptr) {
				; CHECK-LABEL: f2:
				; CHECK: sqdb %f0, 0(%r2)
				; CHECK: br %r14
				%val = load double, double *%ptr
				%res = call double @llvm.experimental.constrained.sqrt.f64(
				double %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check the high end of the aligned SQDB range.
				define double @f3(double *%base) {
				; CHECK-LABEL: f3:
				; CHECK: sqdb %f0, 4088(%r2)
				; CHECK: br %r14
				%ptr = getelementptr double, double *%base, i64 511
				%val = load double, double *%ptr
				%res = call double @llvm.experimental.constrained.sqrt.f64(
				double %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check the next doubleword up, which needs separate address logic.
				; Other sequences besides this one would be OK.
				define double @f4(double *%base) {
				; CHECK-LABEL: f4:
				; CHECK: aghi %r2, 4096
				; CHECK: sqdb %f0, 0(%r2)
				; CHECK: br %r14
				%ptr = getelementptr double, double *%base, i64 512
				%val = load double, double *%ptr
				%res = call double @llvm.experimental.constrained.sqrt.f64(
				double %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check negative displacements, which also need separate address logic.
				define double @f5(double *%base) {
				; CHECK-LABEL: f5:
				; CHECK: aghi %r2, -8
				; CHECK: sqdb %f0, 0(%r2)
				; CHECK: br %r14
				%ptr = getelementptr double, double *%base, i64 -1
				%val = load double, double *%ptr
				%res = call double @llvm.experimental.constrained.sqrt.f64(
				double %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check that SQDB allows indices.
				define double @f6(double *%base, i64 %index) {
				; CHECK-LABEL: f6:
				; CHECK: sllg %r1, %r3, 3
				; CHECK: sqdb %f0, 800(%r1,%r2)
				; CHECK: br %r14
				%ptr1 = getelementptr double, double *%base, i64 %index
				%ptr2 = getelementptr double, double *%ptr1, i64 100
				%val = load double, double *%ptr2
				%res = call double @llvm.experimental.constrained.sqrt.f64(
				double %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}
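The f3, f4, and f5 tests in the two square-root files above probe the edges of the displacement field. The expected output is consistent with a 12-bit unsigned displacement (0 to 4095 bytes): an offset inside that range is folded into the memory operand, while anything above it or below zero needs a separate `aghi`. The sketch below works through that arithmetic. The 12-bit range is an assumption inferred from the tests, and `fits_disp12` is a hypothetical helper.

```python
MAX_DISP12 = 4095  # assumed largest 12-bit unsigned displacement, in bytes


def fits_disp12(index, elem_size):
    """Return True if element `index` of size `elem_size` bytes can be
    addressed by folding index * elem_size into the displacement field."""
    offset = index * elem_size
    return 0 <= offset <= MAX_DISP12


if __name__ == "__main__":
    # (index, element size) pairs taken from the getelementptr lines above.
    cases = [
        (1023, 4),  # float f3: 4092, folded as sqeb %f0, 4092(%r2)
        (1024, 4),  # float f4: 4096, needs aghi %r2, 4096
        (-1, 4),    # float f5: -4, needs aghi %r2, -4
        (511, 8),   # double f3: 4088, folded as sqdb %f0, 4088(%r2)
        (512, 8),   # double f4: 4096, needs aghi %r2, 4096
    ]
    for index, size in cases:
        print(index, size, index * size, fits_disp12(index, size))
```

The same boundaries recur in the subtraction tests for SEB and SDB, which use the identical index values.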

llvm/trunk/test/CodeGen/SystemZ/fp-strict-sqrt-03.ll

				; Test strict 128-bit square root.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu | FileCheck %s

				declare fp128 @llvm.experimental.constrained.sqrt.f128(fp128, metadata, metadata)

				; There's no memory form of SQXBR.
				define void @f1(fp128 *%ptr) {
				; CHECK-LABEL: f1:
				; CHECK: ld %f0, 0(%r2)
				; CHECK: ld %f2, 8(%r2)
				; CHECK: sqxbr %f0, %f0
				; CHECK: std %f0, 0(%r2)
				; CHECK: std %f2, 8(%r2)
				; CHECK: br %r14
				%orig = load fp128, fp128 *%ptr
				%sqrt = call fp128 @llvm.experimental.constrained.sqrt.f128(
				fp128 %orig,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %sqrt, fp128 *%ptr
				ret void
				}

llvm/trunk/test/CodeGen/SystemZ/fp-strict-sqrt-04.ll

				; Test strict 128-bit floating-point square root on z14.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z14 | FileCheck %s

				declare fp128 @llvm.experimental.constrained.sqrt.f128(fp128, metadata, metadata)

				define void @f1(fp128 *%ptr) {
				; CHECK-LABEL: f1:
				; CHECK-DAG: vl [[REG:%v[0-9]+]], 0(%r2)
				; CHECK: wfsqxb [[RES:%v[0-9]+]], [[REG]]
				; CHECK: vst [[RES]], 0(%r2)
				; CHECK: br %r14
				%f = load fp128, fp128 *%ptr
				%res = call fp128 @llvm.experimental.constrained.sqrt.f128(
				fp128 %f,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %res, fp128 *%ptr
				ret void
				}

llvm/trunk/test/CodeGen/SystemZ/fp-strict-sub-01.ll

				; Test 32-bit floating-point strict subtraction.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z10 \
				; RUN: | FileCheck -check-prefix=CHECK -check-prefix=CHECK-SCALAR %s
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z14 | FileCheck %s

				declare float @foo()
				declare float @llvm.experimental.constrained.fsub.f32(float, float, metadata, metadata)

				; Check register subtraction.
				define float @f1(float %f1, float %f2) {
				; CHECK-LABEL: f1:
				; CHECK: sebr %f0, %f2
				; CHECK: br %r14
				%res = call float @llvm.experimental.constrained.fsub.f32(
				float %f1, float %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Check the low end of the SEB range.
				define float @f2(float %f1, float *%ptr) {
				; CHECK-LABEL: f2:
				; CHECK: seb %f0, 0(%r2)
				; CHECK: br %r14
				%f2 = load float, float *%ptr
				%res = call float @llvm.experimental.constrained.fsub.f32(
				float %f1, float %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Check the high end of the aligned SEB range.
				define float @f3(float %f1, float *%base) {
				; CHECK-LABEL: f3:
				; CHECK: seb %f0, 4092(%r2)
				; CHECK: br %r14
				%ptr = getelementptr float, float *%base, i64 1023
				%f2 = load float, float *%ptr
				%res = call float @llvm.experimental.constrained.fsub.f32(
				float %f1, float %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Check the next word up, which needs separate address logic.
				; Other sequences besides this one would be OK.
				define float @f4(float %f1, float *%base) {
				; CHECK-LABEL: f4:
				; CHECK: aghi %r2, 4096
				; CHECK: seb %f0, 0(%r2)
				; CHECK: br %r14
				%ptr = getelementptr float, float *%base, i64 1024
				%f2 = load float, float *%ptr
				%res = call float @llvm.experimental.constrained.fsub.f32(
				float %f1, float %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Check negative displacements, which also need separate address logic.
				define float @f5(float %f1, float *%base) {
				; CHECK-LABEL: f5:
				; CHECK: aghi %r2, -4
				; CHECK: seb %f0, 0(%r2)
				; CHECK: br %r14
				%ptr = getelementptr float, float *%base, i64 -1
				%f2 = load float, float *%ptr
				%res = call float @llvm.experimental.constrained.fsub.f32(
				float %f1, float %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Check that SEB allows indices.
				define float @f6(float %f1, float *%base, i64 %index) {
				; CHECK-LABEL: f6:
				; CHECK: sllg %r1, %r3, 2
				; CHECK: seb %f0, 400(%r1,%r2)
				; CHECK: br %r14
				%ptr1 = getelementptr float, float *%base, i64 %index
				%ptr2 = getelementptr float, float *%ptr1, i64 100
				%f2 = load float, float *%ptr2
				%res = call float @llvm.experimental.constrained.fsub.f32(
				float %f1, float %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				; Check that subtractions of spilled values can use SEB rather than SEBR.
				define float @f7(float *%ptr0) {
				; CHECK-LABEL: f7:
				; CHECK: brasl %r14, foo@PLT
				; CHECK-SCALAR: seb %f0, 16{{[04]}}(%r15)
				; CHECK: br %r14
				%ptr1 = getelementptr float, float *%ptr0, i64 2
				%ptr2 = getelementptr float, float *%ptr0, i64 4
				%ptr3 = getelementptr float, float *%ptr0, i64 6
				%ptr4 = getelementptr float, float *%ptr0, i64 8
				%ptr5 = getelementptr float, float *%ptr0, i64 10
				%ptr6 = getelementptr float, float *%ptr0, i64 12
				%ptr7 = getelementptr float, float *%ptr0, i64 14
				%ptr8 = getelementptr float, float *%ptr0, i64 16
				%ptr9 = getelementptr float, float *%ptr0, i64 18
				%ptr10 = getelementptr float, float *%ptr0, i64 20

				%val0 = load float, float *%ptr0
				%val1 = load float, float *%ptr1
				%val2 = load float, float *%ptr2
				%val3 = load float, float *%ptr3
				%val4 = load float, float *%ptr4
				%val5 = load float, float *%ptr5
				%val6 = load float, float *%ptr6
				%val7 = load float, float *%ptr7
				%val8 = load float, float *%ptr8
				%val9 = load float, float *%ptr9
				%val10 = load float, float *%ptr10

				%ret = call float @foo()

				%sub0 = call float @llvm.experimental.constrained.fsub.f32(
				float %ret, float %val0,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%sub1 = call float @llvm.experimental.constrained.fsub.f32(
				float %sub0, float %val1,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%sub2 = call float @llvm.experimental.constrained.fsub.f32(
				float %sub1, float %val2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%sub3 = call float @llvm.experimental.constrained.fsub.f32(
				float %sub2, float %val3,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%sub4 = call float @llvm.experimental.constrained.fsub.f32(
				float %sub3, float %val4,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%sub5 = call float @llvm.experimental.constrained.fsub.f32(
				float %sub4, float %val5,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%sub6 = call float @llvm.experimental.constrained.fsub.f32(
				float %sub5, float %val6,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%sub7 = call float @llvm.experimental.constrained.fsub.f32(
				float %sub6, float %val7,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%sub8 = call float @llvm.experimental.constrained.fsub.f32(
				float %sub7, float %val8,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%sub9 = call float @llvm.experimental.constrained.fsub.f32(
				float %sub8, float %val9,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%sub10 = call float @llvm.experimental.constrained.fsub.f32(
				float %sub9, float %val10,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")

				ret float %sub10
				}
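The two RUN lines of the test above enable different check prefixes: the z10 run passes both `CHECK` and `CHECK-SCALAR`, while the z14 run uses only the default `CHECK`. As a result, the spill check in f7 (`CHECK-SCALAR: seb ...`) is verified only for z10. The sketch below is a deliberately simplified model of that prefix selection, not a reimplementation of FileCheck. `active_directives` and the suffix list are hypothetical and cover only the directive forms used in these tests.

```python
import re

# Directive suffixes handled by this simplified model (an assumption; the
# real tool supports more forms and matches prefixes differently).
SUFFIXES = ("", "-LABEL", "-DAG", "-NEXT", "-NOT", "-SAME")


def active_directives(lines, prefixes=("CHECK",)):
    """Return the comment lines whose directive uses an enabled prefix."""
    active = []
    for line in lines:
        match = re.search(r";\s*([A-Z][A-Z0-9-]*):", line)
        if match is None:
            continue
        directive = match.group(1)
        if any(directive == p + s for p in prefixes for s in SUFFIXES):
            active.append(line.strip())
    return active


if __name__ == "__main__":
    f7_checks = [
        "; CHECK-LABEL: f7:",
        "; CHECK: brasl %r14, foo@PLT",
        "; CHECK-SCALAR: seb %f0, 16{{[04]}}(%r15)",
        "; CHECK: br %r14",
    ]
    print("z14 run:", active_directives(f7_checks))
    print("z10 run:", active_directives(f7_checks, ("CHECK", "CHECK-SCALAR")))
```

Under this model the z14 run sees three of the four f7 directives and the z10 run sees all four.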

llvm/trunk/test/CodeGen/SystemZ/fp-strict-sub-02.ll

				; Test strict 64-bit floating-point subtraction.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z10 \
				; RUN: | FileCheck -check-prefix=CHECK -check-prefix=CHECK-SCALAR %s
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z13 | FileCheck %s

				declare double @foo()
				declare double @llvm.experimental.constrained.fsub.f64(double, double, metadata, metadata)

				; Check register subtraction.
				define double @f1(double %f1, double %f2) {
				; CHECK-LABEL: f1:
				; CHECK: sdbr %f0, %f2
				; CHECK: br %r14
				%res = call double @llvm.experimental.constrained.fsub.f64(
				double %f1, double %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check the low end of the SDB range.
				define double @f2(double %f1, double *%ptr) {
				; CHECK-LABEL: f2:
				; CHECK: sdb %f0, 0(%r2)
				; CHECK: br %r14
				%f2 = load double, double *%ptr
				%res = call double @llvm.experimental.constrained.fsub.f64(
				double %f1, double %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check the high end of the aligned SDB range.
				define double @f3(double %f1, double *%base) {
				; CHECK-LABEL: f3:
				; CHECK: sdb %f0, 4088(%r2)
				; CHECK: br %r14
				%ptr = getelementptr double, double *%base, i64 511
				%f2 = load double, double *%ptr
				%res = call double @llvm.experimental.constrained.fsub.f64(
				double %f1, double %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check the next doubleword up, which needs separate address logic.
				; Other sequences besides this one would be OK.
				define double @f4(double %f1, double *%base) {
				; CHECK-LABEL: f4:
				; CHECK: aghi %r2, 4096
				; CHECK: sdb %f0, 0(%r2)
				; CHECK: br %r14
				%ptr = getelementptr double, double *%base, i64 512
				%f2 = load double, double *%ptr
				%res = call double @llvm.experimental.constrained.fsub.f64(
				double %f1, double %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check negative displacements, which also need separate address logic.
				define double @f5(double %f1, double *%base) {
				; CHECK-LABEL: f5:
				; CHECK: aghi %r2, -8
				; CHECK: sdb %f0, 0(%r2)
				; CHECK: br %r14
				%ptr = getelementptr double, double *%base, i64 -1
				%f2 = load double, double *%ptr
				%res = call double @llvm.experimental.constrained.fsub.f64(
				double %f1, double %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check that SDB allows indices.
				define double @f6(double %f1, double *%base, i64 %index) {
				; CHECK-LABEL: f6:
				; CHECK: sllg %r1, %r3, 3
				; CHECK: sdb %f0, 800(%r1,%r2)
				; CHECK: br %r14
				%ptr1 = getelementptr double, double *%base, i64 %index
				%ptr2 = getelementptr double, double *%ptr1, i64 100
				%f2 = load double, double *%ptr2
				%res = call double @llvm.experimental.constrained.fsub.f64(
				double %f1, double %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				; Check that subtractions of spilled values can use SDB rather than SDBR.
				define double @f7(double *%ptr0) {
				; CHECK-LABEL: f7:
				; CHECK: brasl %r14, foo@PLT
				; CHECK-SCALAR: sdb %f0, 16{{[04]}}(%r15)
				; CHECK: br %r14
				%ptr1 = getelementptr double, double *%ptr0, i64 2
				%ptr2 = getelementptr double, double *%ptr0, i64 4
				%ptr3 = getelementptr double, double *%ptr0, i64 6
				%ptr4 = getelementptr double, double *%ptr0, i64 8
				%ptr5 = getelementptr double, double *%ptr0, i64 10
				%ptr6 = getelementptr double, double *%ptr0, i64 12
				%ptr7 = getelementptr double, double *%ptr0, i64 14
				%ptr8 = getelementptr double, double *%ptr0, i64 16
				%ptr9 = getelementptr double, double *%ptr0, i64 18
				%ptr10 = getelementptr double, double *%ptr0, i64 20

				%val0 = load double, double *%ptr0
				%val1 = load double, double *%ptr1
				%val2 = load double, double *%ptr2
				%val3 = load double, double *%ptr3
				%val4 = load double, double *%ptr4
				%val5 = load double, double *%ptr5
				%val6 = load double, double *%ptr6
				%val7 = load double, double *%ptr7
				%val8 = load double, double *%ptr8
				%val9 = load double, double *%ptr9
				%val10 = load double, double *%ptr10

				%ret = call double @foo()

				%sub0 = call double @llvm.experimental.constrained.fsub.f64(
				double %ret, double %val0,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%sub1 = call double @llvm.experimental.constrained.fsub.f64(
				double %sub0, double %val1,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%sub2 = call double @llvm.experimental.constrained.fsub.f64(
				double %sub1, double %val2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%sub3 = call double @llvm.experimental.constrained.fsub.f64(
				double %sub2, double %val3,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%sub4 = call double @llvm.experimental.constrained.fsub.f64(
				double %sub3, double %val4,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%sub5 = call double @llvm.experimental.constrained.fsub.f64(
				double %sub4, double %val5,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%sub6 = call double @llvm.experimental.constrained.fsub.f64(
				double %sub5, double %val6,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%sub7 = call double @llvm.experimental.constrained.fsub.f64(
				double %sub6, double %val7,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%sub8 = call double @llvm.experimental.constrained.fsub.f64(
				double %sub7, double %val8,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%sub9 = call double @llvm.experimental.constrained.fsub.f64(
				double %sub8, double %val9,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%sub10 = call double @llvm.experimental.constrained.fsub.f64(
				double %sub9, double %val10,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")

				ret double %sub10
				}

llvm/trunk/test/CodeGen/SystemZ/fp-strict-sub-03.ll

				; Test strict 128-bit floating-point subtraction.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu | FileCheck %s

				declare fp128 @llvm.experimental.constrained.fsub.f128(fp128, fp128, metadata, metadata)

				; There is no memory form of 128-bit subtraction.
				define void @f1(fp128 *%ptr, float %f2) {
				; CHECK-LABEL: f1:
				; CHECK-DAG: lxebr %f0, %f0
				; CHECK-DAG: ld %f1, 0(%r2)
				; CHECK-DAG: ld %f3, 8(%r2)
				; CHECK: sxbr %f1, %f0
				; CHECK: std %f1, 0(%r2)
				; CHECK: std %f3, 8(%r2)
				; CHECK: br %r14
				%f1 = load fp128, fp128 *%ptr
				%f2x = fpext float %f2 to fp128
				%sum = call fp128 @llvm.experimental.constrained.fsub.f128(
				fp128 %f1, fp128 %f2x,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %sum, fp128 *%ptr
				ret void
				}

llvm/trunk/test/CodeGen/SystemZ/fp-strict-sub-04.ll

				; Test strict 128-bit floating-point subtraction on z14.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z14 | FileCheck %s

				declare fp128 @llvm.experimental.constrained.fsub.f128(fp128, fp128, metadata, metadata)

				define void @f1(fp128 *%ptr1, fp128 *%ptr2) {
				; CHECK-LABEL: f1:
				; CHECK-DAG: vl [[REG1:%v[0-9]+]], 0(%r2)
				; CHECK-DAG: vl [[REG2:%v[0-9]+]], 0(%r3)
				; CHECK: wfsxb [[RES:%v[0-9]+]], [[REG1]], [[REG2]]
				; CHECK: vst [[RES]], 0(%r2)
				; CHECK: br %r14
				%f1 = load fp128, fp128 *%ptr1
				%f2 = load fp128, fp128 *%ptr2
				%sum = call fp128 @llvm.experimental.constrained.fsub.f128(
				fp128 %f1, fp128 %f2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %sum, fp128 *%ptr1
				ret void
				}

llvm/trunk/test/CodeGen/SystemZ/vec-strict-add-01.ll

				; Test strict vector addition.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z13 | FileCheck %s

				declare double @llvm.experimental.constrained.fadd.f64(double, double, metadata, metadata)
				declare <2 x double> @llvm.experimental.constrained.fadd.v2f64(<2 x double>, <2 x double>, metadata, metadata)

				; Test a v2f64 addition.
				define <2 x double> @f5(<2 x double> %dummy, <2 x double> %val1,
				<2 x double> %val2) {
				; CHECK-LABEL: f5:
				; CHECK: vfadb %v24, %v26, %v28
				; CHECK: br %r14
				%ret = call <2 x double> @llvm.experimental.constrained.fadd.v2f64(
				<2 x double> %val1, <2 x double> %val2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret <2 x double> %ret
				}

				; Test an f64 addition that uses vector registers.
				define double @f6(<2 x double> %val1, <2 x double> %val2) {
				; CHECK-LABEL: f6:
				; CHECK: wfadb %f0, %v24, %v26
				; CHECK: br %r14
				%scalar1 = extractelement <2 x double> %val1, i32 0
				%scalar2 = extractelement <2 x double> %val2, i32 0
				%ret = call double @llvm.experimental.constrained.fadd.f64(
				double %scalar1, double %scalar2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %ret
				}

llvm/trunk/test/CodeGen/SystemZ/vec-strict-add-02.ll

				; Test strict vector addition on z14.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z14 | FileCheck %s

				declare float @llvm.experimental.constrained.fadd.f32(float, float, metadata, metadata)
				declare <4 x float> @llvm.experimental.constrained.fadd.v4f32(<4 x float>, <4 x float>, metadata, metadata)

				; Test a v4f32 addition.
				define <4 x float> @f1(<4 x float> %dummy, <4 x float> %val1,
				<4 x float> %val2) {
				; CHECK-LABEL: f1:
				; CHECK: vfasb %v24, %v26, %v28
				; CHECK: br %r14
				%ret = call <4 x float> @llvm.experimental.constrained.fadd.v4f32(
				<4 x float> %val1, <4 x float> %val2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret <4 x float> %ret
				}

				; Test an f32 addition that uses vector registers.
				define float @f2(<4 x float> %val1, <4 x float> %val2) {
				; CHECK-LABEL: f2:
				; CHECK: wfasb %f0, %v24, %v26
				; CHECK: br %r14
				%scalar1 = extractelement <4 x float> %val1, i32 0
				%scalar2 = extractelement <4 x float> %val2, i32 0
				%ret = call float @llvm.experimental.constrained.fadd.f32(
				float %scalar1, float %scalar2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %ret
				}

llvm/trunk/test/CodeGen/SystemZ/vec-strict-div-01.ll

				; Test strict vector division.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z13 | FileCheck %s

				declare double @llvm.experimental.constrained.fdiv.f64(double, double, metadata, metadata)
				declare <2 x double> @llvm.experimental.constrained.fdiv.v2f64(<2 x double>, <2 x double>, metadata, metadata)

				; Test a v2f64 division.
				define <2 x double> @f5(<2 x double> %dummy, <2 x double> %val1,
				<2 x double> %val2) {
				; CHECK-LABEL: f5:
				; CHECK: vfddb %v24, %v26, %v28
				; CHECK: br %r14
				%ret = call <2 x double> @llvm.experimental.constrained.fdiv.v2f64(
				<2 x double> %val1, <2 x double> %val2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret <2 x double> %ret
				}

				; Test an f64 division that uses vector registers.
				define double @f6(<2 x double> %val1, <2 x double> %val2) {
				; CHECK-LABEL: f6:
				; CHECK: wfddb %f0, %v24, %v26
				; CHECK: br %r14
				%scalar1 = extractelement <2 x double> %val1, i32 0
				%scalar2 = extractelement <2 x double> %val2, i32 0
				%ret = call double @llvm.experimental.constrained.fdiv.f64(
				double %scalar1, double %scalar2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %ret
				}

llvm/trunk/test/CodeGen/SystemZ/vec-strict-div-02.ll

				; Test strict vector division on z14.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z14 | FileCheck %s

				declare float @llvm.experimental.constrained.fdiv.f32(float, float, metadata, metadata)
				declare <4 x float> @llvm.experimental.constrained.fdiv.v4f32(<4 x float>, <4 x float>, metadata, metadata)

				; Test a v4f32 division.
				define <4 x float> @f1(<4 x float> %dummy, <4 x float> %val1,
				<4 x float> %val2) {
				; CHECK-LABEL: f1:
				; CHECK: vfdsb %v24, %v26, %v28
				; CHECK: br %r14
				%ret = call <4 x float> @llvm.experimental.constrained.fdiv.v4f32(
				<4 x float> %val1, <4 x float> %val2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret <4 x float> %ret
				}

				; Test an f32 division that uses vector registers.
				define float @f2(<4 x float> %val1, <4 x float> %val2) {
				; CHECK-LABEL: f2:
				; CHECK: wfdsb %f0, %v24, %v26
				; CHECK: br %r14
				%scalar1 = extractelement <4 x float> %val1, i32 0
				%scalar2 = extractelement <4 x float> %val2, i32 0
				%ret = call float @llvm.experimental.constrained.fdiv.f32(
				float %scalar1, float %scalar2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %ret
				}

llvm/trunk/test/CodeGen/SystemZ/vec-strict-max-01.ll

				; Test strict vector maximum on z14.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z14 | FileCheck %s

				declare double @llvm.experimental.constrained.maxnum.f64(double, double, metadata, metadata)
				declare <2 x double> @llvm.experimental.constrained.maxnum.v2f64(<2 x double>, <2 x double>, metadata, metadata)

				declare float @llvm.experimental.constrained.maxnum.f32(float, float, metadata, metadata)
				declare <4 x float> @llvm.experimental.constrained.maxnum.v4f32(<4 x float>, <4 x float>, metadata, metadata)

				declare fp128 @llvm.experimental.constrained.maxnum.f128(fp128, fp128, metadata, metadata)

				; Test the f64 maxnum intrinsic.
				define double @f1(double %dummy, double %val1, double %val2) {
				; CHECK-LABEL: f1:
				; CHECK: wfmaxdb %f0, %f2, %f4, 4
				; CHECK: br %r14
				%ret = call double @llvm.experimental.constrained.maxnum.f64(
				double %val1, double %val2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %ret
				}

				; Test the v2f64 maxnum intrinsic.
				define <2 x double> @f2(<2 x double> %dummy, <2 x double> %val1,
				<2 x double> %val2) {
				; CHECK-LABEL: f2:
				; CHECK: vfmaxdb %v24, %v26, %v28, 4
				; CHECK: br %r14
				%ret = call <2 x double> @llvm.experimental.constrained.maxnum.v2f64(
				<2 x double> %val1, <2 x double> %val2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret <2 x double> %ret
				}

				; Test the f32 maxnum intrinsic.
				define float @f3(float %dummy, float %val1, float %val2) {
				; CHECK-LABEL: f3:
				; CHECK: wfmaxsb %f0, %f2, %f4, 4
				; CHECK: br %r14
				%ret = call float @llvm.experimental.constrained.maxnum.f32(
				float %val1, float %val2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %ret
				}

				; Test the v4f32 maxnum intrinsic.
				define <4 x float> @f4(<4 x float> %dummy, <4 x float> %val1,
				<4 x float> %val2) {
				; CHECK-LABEL: f4:
				; CHECK: vfmaxsb %v24, %v26, %v28, 4
				; CHECK: br %r14
				%ret = call <4 x float> @llvm.experimental.constrained.maxnum.v4f32(
				<4 x float> %val1, <4 x float> %val2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret <4 x float> %ret
				}

				; Test the f128 maxnum intrinsic.
				define void @f5(fp128 *%ptr1, fp128 *%ptr2, fp128 *%dst) {
				; CHECK-LABEL: f5:
				; CHECK-DAG: vl [[REG1:%v[0-9]+]], 0(%r2)
				; CHECK-DAG: vl [[REG2:%v[0-9]+]], 0(%r3)
				; CHECK: wfmaxxb [[RES:%v[0-9]+]], [[REG1]], [[REG2]], 4
				; CHECK: vst [[RES]], 0(%r4)
				; CHECK: br %r14
				%val1 = load fp128, fp128* %ptr1
				%val2 = load fp128, fp128* %ptr2
				%res = call fp128 @llvm.experimental.constrained.maxnum.f128(
				fp128 %val1, fp128 %val2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %res, fp128* %dst
				ret void
				}

llvm/trunk/test/CodeGen/SystemZ/vec-strict-min-01.ll

				; Test strict vector minimum on z14.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z14 | FileCheck %s

				declare double @llvm.experimental.constrained.minnum.f64(double, double, metadata, metadata)
				declare <2 x double> @llvm.experimental.constrained.minnum.v2f64(<2 x double>, <2 x double>, metadata, metadata)

				declare float @llvm.experimental.constrained.minnum.f32(float, float, metadata, metadata)
				declare <4 x float> @llvm.experimental.constrained.minnum.v4f32(<4 x float>, <4 x float>, metadata, metadata)

				declare fp128 @llvm.experimental.constrained.minnum.f128(fp128, fp128, metadata, metadata)

				; Test the f64 minnum intrinsic.
				define double @f1(double %dummy, double %val1, double %val2) {
				; CHECK-LABEL: f1:
				; CHECK: wfmindb %f0, %f2, %f4, 4
				; CHECK: br %r14
				%ret = call double @llvm.experimental.constrained.minnum.f64(
				double %val1, double %val2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %ret
				}

				; Test the v2f64 minnum intrinsic.
				define <2 x double> @f2(<2 x double> %dummy, <2 x double> %val1,
				<2 x double> %val2) {
				; CHECK-LABEL: f2:
				; CHECK: vfmindb %v24, %v26, %v28, 4
				; CHECK: br %r14
				%ret = call <2 x double> @llvm.experimental.constrained.minnum.v2f64(
				<2 x double> %val1, <2 x double> %val2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret <2 x double> %ret
				}

				; Test the f32 minnum intrinsic.
				define float @f3(float %dummy, float %val1, float %val2) {
				; CHECK-LABEL: f3:
				; CHECK: wfminsb %f0, %f2, %f4, 4
				; CHECK: br %r14
				%ret = call float @llvm.experimental.constrained.minnum.f32(
				float %val1, float %val2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %ret
				}

				; Test the v4f32 minnum intrinsic.
				define <4 x float> @f4(<4 x float> %dummy, <4 x float> %val1,
				<4 x float> %val2) {
				; CHECK-LABEL: f4:
				; CHECK: vfminsb %v24, %v26, %v28, 4
				; CHECK: br %r14
				%ret = call <4 x float> @llvm.experimental.constrained.minnum.v4f32(
				<4 x float> %val1, <4 x float> %val2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret <4 x float> %ret
				}

				; Test the f128 minnum intrinsic.
				define void @f5(fp128 *%ptr1, fp128 *%ptr2, fp128 *%dst) {
				; CHECK-LABEL: f5:
				; CHECK-DAG: vl [[REG1:%v[0-9]+]], 0(%r2)
				; CHECK-DAG: vl [[REG2:%v[0-9]+]], 0(%r3)
				; CHECK: wfminxb [[RES:%v[0-9]+]], [[REG1]], [[REG2]], 4
				; CHECK: vst [[RES]], 0(%r4)
				; CHECK: br %r14
				%val1 = load fp128, fp128* %ptr1
				%val2 = load fp128, fp128* %ptr2
				%res = call fp128 @llvm.experimental.constrained.minnum.f128(
				fp128 %val1, fp128 %val2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				store fp128 %res, fp128* %dst
				ret void
				}

llvm/trunk/test/CodeGen/SystemZ/vec-strict-mul-01.ll

				; Test strict vector multiplication.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z13 | FileCheck %s

				declare double @llvm.experimental.constrained.fmul.f64(double, double, metadata, metadata)
				declare <2 x double> @llvm.experimental.constrained.fmul.v2f64(<2 x double>, <2 x double>, metadata, metadata)

				; Test a v2f64 multiplication.
				define <2 x double> @f5(<2 x double> %dummy, <2 x double> %val1,
				<2 x double> %val2) {
				; CHECK-LABEL: f5:
				; CHECK: vfmdb %v24, %v26, %v28
				; CHECK: br %r14
				%ret = call <2 x double> @llvm.experimental.constrained.fmul.v2f64(
				<2 x double> %val1, <2 x double> %val2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret <2 x double> %ret
				}

				; Test an f64 multiplication that uses vector registers.
				define double @f6(<2 x double> %val1, <2 x double> %val2) {
				; CHECK-LABEL: f6:
				; CHECK: wfmdb %f0, %v24, %v26
				; CHECK: br %r14
				%scalar1 = extractelement <2 x double> %val1, i32 0
				%scalar2 = extractelement <2 x double> %val2, i32 0
				%ret = call double @llvm.experimental.constrained.fmul.f64(
				double %scalar1, double %scalar2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %ret
				}

llvm/trunk/test/CodeGen/SystemZ/vec-strict-mul-02.ll

				; Test strict vector multiply-and-add.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z13 | FileCheck %s

				declare <2 x double> @llvm.experimental.constrained.fma.v2f64(<2 x double>, <2 x double>, <2 x double>, metadata, metadata)

				; Test a v2f64 multiply-and-add.
				define <2 x double> @f4(<2 x double> %dummy, <2 x double> %val1,
				<2 x double> %val2, <2 x double> %val3) {
				; CHECK-LABEL: f4:
				; CHECK: vfmadb %v24, %v26, %v28, %v30
				; CHECK: br %r14
				%ret = call <2 x double> @llvm.experimental.constrained.fma.v2f64 (
				<2 x double> %val1,
				<2 x double> %val2,
				<2 x double> %val3,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret <2 x double> %ret
				}

				; Test a v2f64 multiply-and-subtract.
				define <2 x double> @f5(<2 x double> %dummy, <2 x double> %val1,
				<2 x double> %val2, <2 x double> %val3) {
				; CHECK-LABEL: f5:
				; CHECK: vfmsdb %v24, %v26, %v28, %v30
				; CHECK: br %r14
				%negval3 = fsub <2 x double> <double -0.0, double -0.0>, %val3
				%ret = call <2 x double> @llvm.experimental.constrained.fma.v2f64 (
				<2 x double> %val1,
				<2 x double> %val2,
				<2 x double> %negval3,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret <2 x double> %ret
				}

llvm/trunk/test/CodeGen/SystemZ/vec-strict-mul-03.ll

				; Test strict vector multiplication on z14.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z14 | FileCheck %s

				declare float @llvm.experimental.constrained.fmul.f32(float, float, metadata, metadata)
				declare <4 x float> @llvm.experimental.constrained.fmul.v4f32(<4 x float>, <4 x float>, metadata, metadata)

				; Test a v4f32 multiplication.
				define <4 x float> @f1(<4 x float> %dummy, <4 x float> %val1,
				<4 x float> %val2) {
				; CHECK-LABEL: f1:
				; CHECK: vfmsb %v24, %v26, %v28
				; CHECK: br %r14
				%ret = call <4 x float> @llvm.experimental.constrained.fmul.v4f32(
				<4 x float> %val1, <4 x float> %val2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret <4 x float> %ret
				}

				; Test an f32 multiplication that uses vector registers.
				define float @f2(<4 x float> %val1, <4 x float> %val2) {
				; CHECK-LABEL: f2:
				; CHECK: wfmsb %f0, %v24, %v26
				; CHECK: br %r14
				%scalar1 = extractelement <4 x float> %val1, i32 0
				%scalar2 = extractelement <4 x float> %val2, i32 0
				%ret = call float @llvm.experimental.constrained.fmul.f32(
				float %scalar1, float %scalar2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %ret
				}

llvm/trunk/test/CodeGen/SystemZ/vec-strict-mul-04.ll

				; Test strict vector multiply-and-add on z14.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z14 | FileCheck %s

				declare <4 x float> @llvm.experimental.constrained.fma.v4f32(<4 x float>, <4 x float>, <4 x float>, metadata, metadata)

				; Test a v4f32 multiply-and-add.
				define <4 x float> @f1(<4 x float> %dummy, <4 x float> %val1,
				<4 x float> %val2, <4 x float> %val3) {
				; CHECK-LABEL: f1:
				; CHECK: vfmasb %v24, %v26, %v28, %v30
				; CHECK: br %r14
				%ret = call <4 x float> @llvm.experimental.constrained.fma.v4f32 (
				<4 x float> %val1,
				<4 x float> %val2,
				<4 x float> %val3,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret <4 x float> %ret
				}

				; Test a v4f32 multiply-and-subtract.
				define <4 x float> @f2(<4 x float> %dummy, <4 x float> %val1,
				<4 x float> %val2, <4 x float> %val3) {
				; CHECK-LABEL: f2:
				; CHECK: vfmssb %v24, %v26, %v28, %v30
				; CHECK: br %r14
				%negval3 = fsub <4 x float> <float -0.0, float -0.0,
				float -0.0, float -0.0>, %val3
				%ret = call <4 x float> @llvm.experimental.constrained.fma.v4f32 (
				<4 x float> %val1,
				<4 x float> %val2,
				<4 x float> %negval3,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret <4 x float> %ret
				}

llvm/trunk/test/CodeGen/SystemZ/vec-strict-mul-05.ll

				; Test strict vector negative multiply-and-add on z14.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z14 | FileCheck %s

				declare <2 x double> @llvm.experimental.constrained.fma.v2f64(<2 x double>, <2 x double>, <2 x double>, metadata, metadata)
				declare <4 x float> @llvm.experimental.constrained.fma.v4f32(<4 x float>, <4 x float>, <4 x float>, metadata, metadata)

				; Test a v2f64 negative multiply-and-add.
				define <2 x double> @f1(<2 x double> %dummy, <2 x double> %val1,
				<2 x double> %val2, <2 x double> %val3) {
				; CHECK-LABEL: f1:
				; CHECK: vfnmadb %v24, %v26, %v28, %v30
				; CHECK: br %r14
				%ret = call <2 x double> @llvm.experimental.constrained.fma.v2f64 (
				<2 x double> %val1,
				<2 x double> %val2,
				<2 x double> %val3,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%negret = fsub <2 x double> <double -0.0, double -0.0>, %ret
				ret <2 x double> %negret
				}

				; Test a v2f64 negative multiply-and-subtract.
				define <2 x double> @f2(<2 x double> %dummy, <2 x double> %val1,
				<2 x double> %val2, <2 x double> %val3) {
				; CHECK-LABEL: f2:
				; CHECK: vfnmsdb %v24, %v26, %v28, %v30
				; CHECK: br %r14
				%negval3 = fsub <2 x double> <double -0.0, double -0.0>, %val3
				%ret = call <2 x double> @llvm.experimental.constrained.fma.v2f64 (
				<2 x double> %val1,
				<2 x double> %val2,
				<2 x double> %negval3,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%negret = fsub <2 x double> <double -0.0, double -0.0>, %ret
				ret <2 x double> %negret
				}

				; Test a v4f32 negative multiply-and-add.
				define <4 x float> @f3(<4 x float> %dummy, <4 x float> %val1,
				<4 x float> %val2, <4 x float> %val3) {
				; CHECK-LABEL: f3:
				; CHECK: vfnmasb %v24, %v26, %v28, %v30
				; CHECK: br %r14
				%ret = call <4 x float> @llvm.experimental.constrained.fma.v4f32 (
				<4 x float> %val1,
				<4 x float> %val2,
				<4 x float> %val3,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%negret = fsub <4 x float> <float -0.0, float -0.0,
				float -0.0, float -0.0>, %ret
				ret <4 x float> %negret
				}

				; Test a v4f32 negative multiply-and-subtract.
				define <4 x float> @f4(<4 x float> %dummy, <4 x float> %val1,
				<4 x float> %val2, <4 x float> %val3) {
				; CHECK-LABEL: f4:
				; CHECK: vfnmssb %v24, %v26, %v28, %v30
				; CHECK: br %r14
				%negval3 = fsub <4 x float> <float -0.0, float -0.0,
				float -0.0, float -0.0>, %val3
				%ret = call <4 x float> @llvm.experimental.constrained.fma.v4f32 (
				<4 x float> %val1,
				<4 x float> %val2,
				<4 x float> %negval3,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				%negret = fsub <4 x float> <float -0.0, float -0.0,
				float -0.0, float -0.0>, %ret
				ret <4 x float> %negret
				}

llvm/trunk/test/CodeGen/SystemZ/vec-strict-round-01.ll

				; Test strict v2f64 rounding.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z13 | FileCheck %s

				declare double @llvm.experimental.constrained.rint.f64(double, metadata, metadata)
				declare double @llvm.experimental.constrained.nearbyint.f64(double, metadata, metadata)
				declare double @llvm.experimental.constrained.floor.f64(double, metadata, metadata)
				declare double @llvm.experimental.constrained.ceil.f64(double, metadata, metadata)
				declare double @llvm.experimental.constrained.trunc.f64(double, metadata, metadata)
				declare double @llvm.experimental.constrained.round.f64(double, metadata, metadata)
				declare <2 x double> @llvm.experimental.constrained.rint.v2f64(<2 x double>, metadata, metadata)
				declare <2 x double> @llvm.experimental.constrained.nearbyint.v2f64(<2 x double>, metadata, metadata)
				declare <2 x double> @llvm.experimental.constrained.floor.v2f64(<2 x double>, metadata, metadata)
				declare <2 x double> @llvm.experimental.constrained.ceil.v2f64(<2 x double>, metadata, metadata)
				declare <2 x double> @llvm.experimental.constrained.trunc.v2f64(<2 x double>, metadata, metadata)
				declare <2 x double> @llvm.experimental.constrained.round.v2f64(<2 x double>, metadata, metadata)

				define <2 x double> @f1(<2 x double> %val) {
				; CHECK-LABEL: f1:
				; CHECK: vfidb %v24, %v24, 0, 0
				; CHECK: br %r14
				%res = call <2 x double> @llvm.experimental.constrained.rint.v2f64(
				<2 x double> %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret <2 x double> %res
				}

				define <2 x double> @f2(<2 x double> %val) {
				; CHECK-LABEL: f2:
				; CHECK: vfidb %v24, %v24, 4, 0
				; CHECK: br %r14
				%res = call <2 x double> @llvm.experimental.constrained.nearbyint.v2f64(
				<2 x double> %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret <2 x double> %res
				}

				define <2 x double> @f3(<2 x double> %val) {
				; CHECK-LABEL: f3:
				; CHECK: vfidb %v24, %v24, 4, 7
				; CHECK: br %r14
				%res = call <2 x double> @llvm.experimental.constrained.floor.v2f64(
				<2 x double> %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret <2 x double> %res
				}

				define <2 x double> @f4(<2 x double> %val) {
				; CHECK-LABEL: f4:
				; CHECK: vfidb %v24, %v24, 4, 6
				; CHECK: br %r14
				%res = call <2 x double> @llvm.experimental.constrained.ceil.v2f64(
				<2 x double> %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret <2 x double> %res
				}

				define <2 x double> @f5(<2 x double> %val) {
				; CHECK-LABEL: f5:
				; CHECK: vfidb %v24, %v24, 4, 5
				; CHECK: br %r14
				%res = call <2 x double> @llvm.experimental.constrained.trunc.v2f64(
				<2 x double> %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret <2 x double> %res
				}

				define <2 x double> @f6(<2 x double> %val) {
				; CHECK-LABEL: f6:
				; CHECK: vfidb %v24, %v24, 4, 1
				; CHECK: br %r14
				%res = call <2 x double> @llvm.experimental.constrained.round.v2f64(
				<2 x double> %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret <2 x double> %res
				}

				define double @f7(<2 x double> %val) {
				; CHECK-LABEL: f7:
				; CHECK: wfidb %f0, %v24, 0, 0
				; CHECK: br %r14
				%scalar = extractelement <2 x double> %val, i32 0
				%res = call double @llvm.experimental.constrained.rint.f64(
				double %scalar,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				define double @f8(<2 x double> %val) {
				; CHECK-LABEL: f8:
				; CHECK: wfidb %f0, %v24, 4, 0
				; CHECK: br %r14
				%scalar = extractelement <2 x double> %val, i32 0
				%res = call double @llvm.experimental.constrained.nearbyint.f64(
				double %scalar,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				define double @f9(<2 x double> %val) {
				; CHECK-LABEL: f9:
				; CHECK: wfidb %f0, %v24, 4, 7
				; CHECK: br %r14
				%scalar = extractelement <2 x double> %val, i32 0
				%res = call double @llvm.experimental.constrained.floor.f64(
				double %scalar,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}


				define double @f10(<2 x double> %val) {
				; CHECK-LABEL: f10:
				; CHECK: wfidb %f0, %v24, 4, 6
				; CHECK: br %r14
				%scalar = extractelement <2 x double> %val, i32 0
				%res = call double @llvm.experimental.constrained.ceil.f64(
				double %scalar,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				define double @f11(<2 x double> %val) {
				; CHECK-LABEL: f11:
				; CHECK: wfidb %f0, %v24, 4, 5
				; CHECK: br %r14
				%scalar = extractelement <2 x double> %val, i32 0
				%res = call double @llvm.experimental.constrained.trunc.f64(
				double %scalar,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

				define double @f12(<2 x double> %val) {
				; CHECK-LABEL: f12:
				; CHECK: wfidb %f0, %v24, 4, 1
				; CHECK: br %r14
				%scalar = extractelement <2 x double> %val, i32 0
				%res = call double @llvm.experimental.constrained.round.f64(
				double %scalar,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %res
				}

llvm/trunk/test/CodeGen/SystemZ/vec-strict-round-02.ll

				; Test strict v4f32 rounding on z14.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z14 | FileCheck %s

				declare float @llvm.experimental.constrained.rint.f32(float, metadata, metadata)
				declare float @llvm.experimental.constrained.nearbyint.f32(float, metadata, metadata)
				declare float @llvm.experimental.constrained.floor.f32(float, metadata, metadata)
				declare float @llvm.experimental.constrained.ceil.f32(float, metadata, metadata)
				declare float @llvm.experimental.constrained.trunc.f32(float, metadata, metadata)
				declare float @llvm.experimental.constrained.round.f32(float, metadata, metadata)
				declare <4 x float> @llvm.experimental.constrained.rint.v4f32(<4 x float>, metadata, metadata)
				declare <4 x float> @llvm.experimental.constrained.nearbyint.v4f32(<4 x float>, metadata, metadata)
				declare <4 x float> @llvm.experimental.constrained.floor.v4f32(<4 x float>, metadata, metadata)
				declare <4 x float> @llvm.experimental.constrained.ceil.v4f32(<4 x float>, metadata, metadata)
				declare <4 x float> @llvm.experimental.constrained.trunc.v4f32(<4 x float>, metadata, metadata)
				declare <4 x float> @llvm.experimental.constrained.round.v4f32(<4 x float>, metadata, metadata)

				define <4 x float> @f1(<4 x float> %val) {
				; CHECK-LABEL: f1:
				; CHECK: vfisb %v24, %v24, 0, 0
				; CHECK: br %r14
				%res = call <4 x float> @llvm.experimental.constrained.rint.v4f32(
				<4 x float> %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret <4 x float> %res
				}

				define <4 x float> @f2(<4 x float> %val) {
				; CHECK-LABEL: f2:
				; CHECK: vfisb %v24, %v24, 4, 0
				; CHECK: br %r14
				%res = call <4 x float> @llvm.experimental.constrained.nearbyint.v4f32(
				<4 x float> %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret <4 x float> %res
				}

				define <4 x float> @f3(<4 x float> %val) {
				; CHECK-LABEL: f3:
				; CHECK: vfisb %v24, %v24, 4, 7
				; CHECK: br %r14
				%res = call <4 x float> @llvm.experimental.constrained.floor.v4f32(
				<4 x float> %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret <4 x float> %res
				}

				define <4 x float> @f4(<4 x float> %val) {
				; CHECK-LABEL: f4:
				; CHECK: vfisb %v24, %v24, 4, 6
				; CHECK: br %r14
				%res = call <4 x float> @llvm.experimental.constrained.ceil.v4f32(
				<4 x float> %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret <4 x float> %res
				}

				define <4 x float> @f5(<4 x float> %val) {
				; CHECK-LABEL: f5:
				; CHECK: vfisb %v24, %v24, 4, 5
				; CHECK: br %r14
				%res = call <4 x float> @llvm.experimental.constrained.trunc.v4f32(
				<4 x float> %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret <4 x float> %res
				}

				define <4 x float> @f6(<4 x float> %val) {
				; CHECK-LABEL: f6:
				; CHECK: vfisb %v24, %v24, 4, 1
				; CHECK: br %r14
				%res = call <4 x float> @llvm.experimental.constrained.round.v4f32(
				<4 x float> %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret <4 x float> %res
				}

				define float @f7(<4 x float> %val) {
				; CHECK-LABEL: f7:
				; CHECK: wfisb %f0, %v24, 0, 0
				; CHECK: br %r14
				%scalar = extractelement <4 x float> %val, i32 0
				%res = call float @llvm.experimental.constrained.rint.f32(
				float %scalar,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				define float @f8(<4 x float> %val) {
				; CHECK-LABEL: f8:
				; CHECK: wfisb %f0, %v24, 4, 0
				; CHECK: br %r14
				%scalar = extractelement <4 x float> %val, i32 0
				%res = call float @llvm.experimental.constrained.nearbyint.f32(
				float %scalar,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				define float @f9(<4 x float> %val) {
				; CHECK-LABEL: f9:
				; CHECK: wfisb %f0, %v24, 4, 7
				; CHECK: br %r14
				%scalar = extractelement <4 x float> %val, i32 0
				%res = call float @llvm.experimental.constrained.floor.f32(
				float %scalar,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				define float @f10(<4 x float> %val) {
				; CHECK-LABEL: f10:
				; CHECK: wfisb %f0, %v24, 4, 6
				; CHECK: br %r14
				%scalar = extractelement <4 x float> %val, i32 0
				%res = call float @llvm.experimental.constrained.ceil.f32(
				float %scalar,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				define float @f11(<4 x float> %val) {
				; CHECK-LABEL: f11:
				; CHECK: wfisb %f0, %v24, 4, 5
				; CHECK: br %r14
				%scalar = extractelement <4 x float> %val, i32 0
				%res = call float @llvm.experimental.constrained.trunc.f32(
				float %scalar,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

				define float @f12(<4 x float> %val) {
				; CHECK-LABEL: f12:
				; CHECK: wfisb %f0, %v24, 4, 1
				; CHECK: br %r14
				%scalar = extractelement <4 x float> %val, i32 0
				%res = call float @llvm.experimental.constrained.round.f32(
				float %scalar,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %res
				}

llvm/trunk/test/CodeGen/SystemZ/vec-strict-sqrt-01.ll

				; Test strict f64 and v2f64 square root.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z13 | FileCheck %s

				declare double @llvm.experimental.constrained.sqrt.f64(double, metadata, metadata)
				declare <2 x double> @llvm.experimental.constrained.sqrt.v2f64(<2 x double>, metadata, metadata)

				define <2 x double> @f1(<2 x double> %val) {
				; CHECK-LABEL: f1:
				; CHECK: vfsqdb %v24, %v24
				; CHECK: br %r14
				%ret = call <2 x double> @llvm.experimental.constrained.sqrt.v2f64(
				<2 x double> %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret <2 x double> %ret
				}

				define double @f2(<2 x double> %val) {
				; CHECK-LABEL: f2:
				; CHECK: wfsqdb %f0, %v24
				; CHECK: br %r14
				%scalar = extractelement <2 x double> %val, i32 0
				%ret = call double @llvm.experimental.constrained.sqrt.f64(
				double %scalar,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %ret
				}

llvm/trunk/test/CodeGen/SystemZ/vec-strict-sqrt-02.ll

				; Test strict f32 and v4f32 square root on z14.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z14 | FileCheck %s

				declare float @llvm.experimental.constrained.sqrt.f32(float, metadata, metadata)
				declare <4 x float> @llvm.experimental.constrained.sqrt.v4f32(<4 x float>, metadata, metadata)

				define <4 x float> @f1(<4 x float> %val) {
				; CHECK-LABEL: f1:
				; CHECK: vfsqsb %v24, %v24
				; CHECK: br %r14
				%ret = call <4 x float> @llvm.experimental.constrained.sqrt.v4f32(
				<4 x float> %val,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret <4 x float> %ret
				}

				define float @f2(<4 x float> %val) {
				; CHECK-LABEL: f2:
				; CHECK: wfsqsb %f0, %v24
				; CHECK: br %r14
				%scalar = extractelement <4 x float> %val, i32 0
				%ret = call float @llvm.experimental.constrained.sqrt.f32(
				float %scalar,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %ret
				}

llvm/trunk/test/CodeGen/SystemZ/vec-strict-sub-01.ll

				; Test strict vector subtraction.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z13 | FileCheck %s

				declare double @llvm.experimental.constrained.fsub.f64(double, double, metadata, metadata)
				declare <2 x double> @llvm.experimental.constrained.fsub.v2f64(<2 x double>, <2 x double>, metadata, metadata)

				; Test a v2f64 subtraction.
				define <2 x double> @f6(<2 x double> %dummy, <2 x double> %val1,
				<2 x double> %val2) {
				; CHECK-LABEL: f6:
				; CHECK: vfsdb %v24, %v26, %v28
				; CHECK: br %r14
				%ret = call <2 x double> @llvm.experimental.constrained.fsub.v2f64(
				<2 x double> %val1, <2 x double> %val2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret <2 x double> %ret
				}

				; Test an f64 subtraction that uses vector registers.
				define double @f7(<2 x double> %val1, <2 x double> %val2) {
				; CHECK-LABEL: f7:
				; CHECK: wfsdb %f0, %v24, %v26
				; CHECK: br %r14
				%scalar1 = extractelement <2 x double> %val1, i32 0
				%scalar2 = extractelement <2 x double> %val2, i32 0
				%ret = call double @llvm.experimental.constrained.fsub.f64(
				double %scalar1, double %scalar2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret double %ret
				}

llvm/trunk/test/CodeGen/SystemZ/vec-strict-sub-02.ll

				; Test strict vector subtraction on z14.
				;
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z14 | FileCheck %s

				declare float @llvm.experimental.constrained.fsub.f32(float, float, metadata, metadata)
				declare <4 x float> @llvm.experimental.constrained.fsub.v4f32(<4 x float>, <4 x float>, metadata, metadata)

				; Test a v4f32 subtraction.
				define <4 x float> @f6(<4 x float> %dummy, <4 x float> %val1,
				<4 x float> %val2) {
				; CHECK-LABEL: f6:
				; CHECK: vfssb %v24, %v26, %v28
				; CHECK: br %r14
				%ret = call <4 x float> @llvm.experimental.constrained.fsub.v4f32(
				<4 x float> %val1, <4 x float> %val2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret <4 x float> %ret
				}

				; Test an f32 subtraction that uses vector registers.
				define float @f7(<4 x float> %val1, <4 x float> %val2) {
				; CHECK-LABEL: f7:
				; CHECK: wfssb %f0, %v24, %v26
				; CHECK: br %r14
				%scalar1 = extractelement <4 x float> %val1, i32 0
				%scalar2 = extractelement <4 x float> %val2, i32 0
				%ret = call float @llvm.experimental.constrained.fsub.f32(
				float %scalar1, float %scalar2,
				metadata !"round.dynamic",
				metadata !"fpexcept.strict")
				ret float %ret
				}

llvm/trunk/test/CodeGen/SystemZ/vector-constrained-fp-intrinsics.ll

[27 lines not shown]
}		}

define <2 x double> @constrained_vector_fdiv_v2f64() {		define <2 x double> @constrained_vector_fdiv_v2f64() {
; S390X-LABEL: constrained_vector_fdiv_v2f64:		; S390X-LABEL: constrained_vector_fdiv_v2f64:
; S390X: # %bb.0: # %entry		; S390X: # %bb.0: # %entry
; S390X-NEXT: larl %r1, .LCPI1_0		; S390X-NEXT: larl %r1, .LCPI1_0
; S390X-NEXT: ldeb %f1, 0(%r1)		; S390X-NEXT: ldeb %f1, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI1_1		; S390X-NEXT: larl %r1, .LCPI1_1
; S390X-NEXT: ldeb %f0, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI1_2
; S390X-NEXT: ldeb %f2, 0(%r1)		; S390X-NEXT: ldeb %f2, 0(%r1)
; S390X-NEXT: ddbr %f0, %f1		; S390X-NEXT: larl %r1, .LCPI1_2
		; S390X-NEXT: ldeb %f0, 0(%r1)
; S390X-NEXT: ddbr %f2, %f1		; S390X-NEXT: ddbr %f2, %f1
		; S390X-NEXT: ddbr %f0, %f1
; S390X-NEXT: br %r14		; S390X-NEXT: br %r14
;		;
; SZ13-LABEL: constrained_vector_fdiv_v2f64:		; SZ13-LABEL: constrained_vector_fdiv_v2f64:
; SZ13: # %bb.0: # %entry		; SZ13: # %bb.0: # %entry
; SZ13-NEXT: larl %r1, .LCPI1_0		; SZ13-NEXT: larl %r1, .LCPI1_0
; SZ13-NEXT: vl %v0, 0(%r1)		; SZ13-NEXT: vl %v0, 0(%r1)
; SZ13-NEXT: larl %r1, .LCPI1_1		; SZ13-NEXT: larl %r1, .LCPI1_1
; SZ13-NEXT: vl %v1, 0(%r1)		; SZ13-NEXT: vl %v1, 0(%r1)
[9 lines not shown]
}		}

define <3 x float> @constrained_vector_fdiv_v3f32() {		define <3 x float> @constrained_vector_fdiv_v3f32() {
; S390X-LABEL: constrained_vector_fdiv_v3f32:		; S390X-LABEL: constrained_vector_fdiv_v3f32:
; S390X: # %bb.0: # %entry		; S390X: # %bb.0: # %entry
; S390X-NEXT: larl %r1, .LCPI2_0		; S390X-NEXT: larl %r1, .LCPI2_0
; S390X-NEXT: le %f1, 0(%r1)		; S390X-NEXT: le %f1, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI2_1		; S390X-NEXT: larl %r1, .LCPI2_1
; S390X-NEXT: le %f0, 0(%r1)		; S390X-NEXT: le %f4, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI2_2		; S390X-NEXT: larl %r1, .LCPI2_2
; S390X-NEXT: le %f2, 0(%r1)		; S390X-NEXT: le %f2, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI2_3		; S390X-NEXT: larl %r1, .LCPI2_3
; S390X-NEXT: le %f4, 0(%r1)		; S390X-NEXT: le %f0, 0(%r1)
; S390X-NEXT: debr %f0, %f1
; S390X-NEXT: debr %f2, %f1
; S390X-NEXT: debr %f4, %f1		; S390X-NEXT: debr %f4, %f1
		; S390X-NEXT: debr %f2, %f1
		; S390X-NEXT: debr %f0, %f1
; S390X-NEXT: br %r14		; S390X-NEXT: br %r14
;		;
; SZ13-LABEL: constrained_vector_fdiv_v3f32:		; SZ13-LABEL: constrained_vector_fdiv_v3f32:
; SZ13: # %bb.0: # %entry		; SZ13: # %bb.0: # %entry
; SZ13-NEXT: larl %r1, .LCPI2_0		; SZ13-NEXT: larl %r1, .LCPI2_0
; SZ13-NEXT: lde %f0, 0(%r1)		; SZ13-NEXT: lde %f0, 0(%r1)
; SZ13-NEXT: larl %r1, .LCPI2_1		; SZ13-NEXT: larl %r1, .LCPI2_1
; SZ13-NEXT: lde %f1, 0(%r1)		; SZ13-NEXT: lde %f1, 0(%r1)
[13 lines not shown]	%div = call <3 x float> @llvm.experimental.constrained.fdiv.v3f32(
metadata !"round.dynamic",		metadata !"round.dynamic",
metadata !"fpexcept.strict")		metadata !"fpexcept.strict")
ret <3 x float> %div		ret <3 x float> %div
}		}

define void @constrained_vector_fdiv_v3f64(<3 x double>* %a) {		define void @constrained_vector_fdiv_v3f64(<3 x double>* %a) {
; S390X-LABEL: constrained_vector_fdiv_v3f64:		; S390X-LABEL: constrained_vector_fdiv_v3f64:
; S390X: # %bb.0: # %entry		; S390X: # %bb.0: # %entry
; S390X-NEXT: larl %r1, .LCPI3_1		; S390X-NEXT: ld %f0, 16(%r2)
; S390X-NEXT: ldeb %f0, 0(%r1)		; S390X-NEXT: ld %f1, 8(%r2)
; S390X-NEXT: larl %r1, .LCPI3_2
; S390X-NEXT: ldeb %f1, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI3_0		; S390X-NEXT: larl %r1, .LCPI3_0
; S390X-NEXT: ldeb %f2, 0(%r1)		; S390X-NEXT: ldeb %f2, 0(%r1)
; S390X-NEXT: ddb %f1, 16(%r2)		; S390X-NEXT: larl %r1, .LCPI3_1
; S390X-NEXT: ddb %f0, 8(%r2)		; S390X-NEXT: ldeb %f3, 0(%r1)
		; S390X-NEXT: larl %r1, .LCPI3_2
		; S390X-NEXT: ldeb %f4, 0(%r1)
; S390X-NEXT: ddb %f2, 0(%r2)		; S390X-NEXT: ddb %f2, 0(%r2)
; S390X-NEXT: std %f1, 16(%r2)		; S390X-NEXT: ddbr %f3, %f1
; S390X-NEXT: std %f0, 8(%r2)		; S390X-NEXT: ddbr %f4, %f0
		; S390X-NEXT: std %f4, 16(%r2)
		; S390X-NEXT: std %f3, 8(%r2)
; S390X-NEXT: std %f2, 0(%r2)		; S390X-NEXT: std %f2, 0(%r2)
; S390X-NEXT: br %r14		; S390X-NEXT: br %r14
;		;
; SZ13-LABEL: constrained_vector_fdiv_v3f64:		; SZ13-LABEL: constrained_vector_fdiv_v3f64:
; SZ13: # %bb.0: # %entry		; SZ13: # %bb.0: # %entry
; SZ13-NEXT: larl %r1, .LCPI3_0		; SZ13-NEXT: larl %r1, .LCPI3_0
; SZ13-NEXT: vl %v0, 0(%r2)
; SZ13-NEXT: vl %v1, 0(%r1)
; SZ13-NEXT: vfddb %v0, %v1, %v0
; SZ13-NEXT: larl %r1, .LCPI3_1
; SZ13-NEXT: ldeb %f1, 0(%r1)		; SZ13-NEXT: ldeb %f1, 0(%r1)
; SZ13-NEXT: ddb %f1, 16(%r2)		; SZ13-NEXT: ddb %f1, 16(%r2)
		; SZ13-NEXT: larl %r1, .LCPI3_1
		; SZ13-NEXT: vl %v0, 0(%r2)
		; SZ13-NEXT: vl %v2, 0(%r1)
; SZ13-NEXT: std %f1, 16(%r2)		; SZ13-NEXT: std %f1, 16(%r2)
		; SZ13-NEXT: vfddb %v0, %v2, %v0
; SZ13-NEXT: vst %v0, 0(%r2)		; SZ13-NEXT: vst %v0, 0(%r2)
; SZ13-NEXT: br %r14		; SZ13-NEXT: br %r14
entry:		entry:
%b = load <3 x double>, <3 x double>* %a		%b = load <3 x double>, <3 x double>* %a
%div = call <3 x double> @llvm.experimental.constrained.fdiv.v3f64(		%div = call <3 x double> @llvm.experimental.constrained.fdiv.v3f64(
<3 x double> <double 1.000000e+00, double 2.000000e+00, double 3.000000e+00>,		<3 x double> <double 1.000000e+00, double 2.000000e+00, double 3.000000e+00>,
<3 x double> %b,		<3 x double> %b,
metadata !"round.dynamic",		metadata !"round.dynamic",
metadata !"fpexcept.strict")		metadata !"fpexcept.strict")
store <3 x double> %div, <3 x double>* %a		store <3 x double> %div, <3 x double>* %a
ret void		ret void
}		}

define <4 x double> @constrained_vector_fdiv_v4f64() {		define <4 x double> @constrained_vector_fdiv_v4f64() {
; S390X-LABEL: constrained_vector_fdiv_v4f64:		; S390X-LABEL: constrained_vector_fdiv_v4f64:
; S390X: # %bb.0: # %entry		; S390X: # %bb.0: # %entry
; S390X-NEXT: larl %r1, .LCPI4_0		; S390X-NEXT: larl %r1, .LCPI4_0
; S390X-NEXT: ldeb %f1, 0(%r1)		; S390X-NEXT: ldeb %f1, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI4_1		; S390X-NEXT: larl %r1, .LCPI4_1
; S390X-NEXT: ldeb %f0, 0(%r1)		; S390X-NEXT: ldeb %f6, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI4_2		; S390X-NEXT: larl %r1, .LCPI4_2
; S390X-NEXT: ldeb %f2, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI4_3
; S390X-NEXT: ldeb %f4, 0(%r1)		; S390X-NEXT: ldeb %f4, 0(%r1)
		; S390X-NEXT: larl %r1, .LCPI4_3
		; S390X-NEXT: ldeb %f2, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI4_4		; S390X-NEXT: larl %r1, .LCPI4_4
; S390X-NEXT: ldeb %f6, 0(%r1)		; S390X-NEXT: ldeb %f0, 0(%r1)
; S390X-NEXT: ddbr %f0, %f1
; S390X-NEXT: ddbr %f2, %f1
; S390X-NEXT: ddbr %f4, %f1
; S390X-NEXT: ddbr %f6, %f1		; S390X-NEXT: ddbr %f6, %f1
		; S390X-NEXT: ddbr %f4, %f1
		; S390X-NEXT: ddbr %f2, %f1
		; S390X-NEXT: ddbr %f0, %f1
; S390X-NEXT: br %r14		; S390X-NEXT: br %r14
;		;
; SZ13-LABEL: constrained_vector_fdiv_v4f64:		; SZ13-LABEL: constrained_vector_fdiv_v4f64:
; SZ13: # %bb.0: # %entry		; SZ13: # %bb.0: # %entry
; SZ13-NEXT: larl %r1, .LCPI4_0		; SZ13-NEXT: larl %r1, .LCPI4_0
; SZ13-NEXT: vl %v0, 0(%r1)		; SZ13-NEXT: vl %v0, 0(%r1)
; SZ13-NEXT: larl %r1, .LCPI4_1		; SZ13-NEXT: larl %r1, .LCPI4_1
; SZ13-NEXT: vl %v1, 0(%r1)		; SZ13-NEXT: vl %v1, 0(%r1)
; SZ13-NEXT: vfddb %v24, %v1, %v0		; SZ13-NEXT: vfddb %v26, %v1, %v0
; SZ13-NEXT: larl %r1, .LCPI4_2		; SZ13-NEXT: larl %r1, .LCPI4_2
; SZ13-NEXT: vl %v1, 0(%r1)		; SZ13-NEXT: vl %v1, 0(%r1)
; SZ13-NEXT: vfddb %v26, %v1, %v0		; SZ13-NEXT: vfddb %v24, %v1, %v0
; SZ13-NEXT: br %r14		; SZ13-NEXT: br %r14
entry:		entry:
%div = call <4 x double> @llvm.experimental.constrained.fdiv.v4f64(		%div = call <4 x double> @llvm.experimental.constrained.fdiv.v4f64(
<4 x double> <double 1.000000e+00, double 2.000000e+00,		<4 x double> <double 1.000000e+00, double 2.000000e+00,
double 3.000000e+00, double 4.000000e+00>,		double 3.000000e+00, double 4.000000e+00>,
<4 x double> <double 1.000000e+01, double 1.000000e+01,		<4 x double> <double 1.000000e+01, double 1.000000e+01,
double 1.000000e+01, double 1.000000e+01>,		double 1.000000e+01, double 1.000000e+01>,
metadata !"round.dynamic",		metadata !"round.dynamic",
[407 lines not shown]	%mul = call <1 x float> @llvm.experimental.constrained.fmul.v1f32(
metadata !"fpexcept.strict")		metadata !"fpexcept.strict")
ret <1 x float> %mul		ret <1 x float> %mul
}		}

define <2 x double> @constrained_vector_fmul_v2f64() {		define <2 x double> @constrained_vector_fmul_v2f64() {
; S390X-LABEL: constrained_vector_fmul_v2f64:		; S390X-LABEL: constrained_vector_fmul_v2f64:
; S390X: # %bb.0: # %entry		; S390X: # %bb.0: # %entry
; S390X-NEXT: larl %r1, .LCPI11_0		; S390X-NEXT: larl %r1, .LCPI11_0
; S390X-NEXT: ldeb %f0, 0(%r1)		; S390X-NEXT: ldeb %f2, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI11_1		; S390X-NEXT: larl %r1, .LCPI11_1
; S390X-NEXT: ld %f1, 0(%r1)		; S390X-NEXT: ld %f1, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI11_2		; S390X-NEXT: larl %r1, .LCPI11_2
; S390X-NEXT: ldeb %f2, 0(%r1)		; S390X-NEXT: ldeb %f0, 0(%r1)
; S390X-NEXT: mdbr %f0, %f1
; S390X-NEXT: mdbr %f2, %f1		; S390X-NEXT: mdbr %f2, %f1
		; S390X-NEXT: mdbr %f0, %f1
; S390X-NEXT: br %r14		; S390X-NEXT: br %r14
;		;
; SZ13-LABEL: constrained_vector_fmul_v2f64:		; SZ13-LABEL: constrained_vector_fmul_v2f64:
; SZ13: # %bb.0: # %entry		; SZ13: # %bb.0: # %entry
; SZ13-NEXT: larl %r1, .LCPI11_0		; SZ13-NEXT: larl %r1, .LCPI11_0
; SZ13-NEXT: vl %v0, 0(%r1)		; SZ13-NEXT: vl %v0, 0(%r1)
; SZ13-NEXT: larl %r1, .LCPI11_1		; SZ13-NEXT: larl %r1, .LCPI11_1
; SZ13-NEXT: vl %v1, 0(%r1)		; SZ13-NEXT: vl %v1, 0(%r1)
; SZ13-NEXT: vfmdb %v24, %v1, %v0		; SZ13-NEXT: vfmdb %v24, %v1, %v0
; SZ13-NEXT: br %r14		; SZ13-NEXT: br %r14
entry:		entry:
%mul = call <2 x double> @llvm.experimental.constrained.fmul.v2f64(		%mul = call <2 x double> @llvm.experimental.constrained.fmul.v2f64(
<2 x double> <double 0x7FEFFFFFFFFFFFFF, double 0x7FEFFFFFFFFFFFFF>,		<2 x double> <double 0x7FEFFFFFFFFFFFFF, double 0x7FEFFFFFFFFFFFFF>,
<2 x double> <double 2.000000e+00, double 3.000000e+00>,		<2 x double> <double 2.000000e+00, double 3.000000e+00>,
metadata !"round.dynamic",		metadata !"round.dynamic",
metadata !"fpexcept.strict")		metadata !"fpexcept.strict")
ret <2 x double> %mul		ret <2 x double> %mul
}		}

define <3 x float> @constrained_vector_fmul_v3f32() {		define <3 x float> @constrained_vector_fmul_v3f32() {
; S390X-LABEL: constrained_vector_fmul_v3f32:		; S390X-LABEL: constrained_vector_fmul_v3f32:
; S390X: # %bb.0: # %entry		; S390X: # %bb.0: # %entry
; S390X-NEXT: larl %r1, .LCPI12_0		; S390X-NEXT: larl %r1, .LCPI12_0
; S390X-NEXT: le %f4, 0(%r1)		; S390X-NEXT: le %f0, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI12_1		; S390X-NEXT: larl %r1, .LCPI12_1
; S390X-NEXT: ler %f0, %f4		; S390X-NEXT: ler %f4, %f0
; S390X-NEXT: meeb %f0, 0(%r1)		; S390X-NEXT: meeb %f4, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI12_2		; S390X-NEXT: larl %r1, .LCPI12_2
; S390X-NEXT: ler %f2, %f4		; S390X-NEXT: ler %f2, %f0
; S390X-NEXT: meeb %f2, 0(%r1)		; S390X-NEXT: meeb %f2, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI12_3		; S390X-NEXT: larl %r1, .LCPI12_3
; S390X-NEXT: meeb %f4, 0(%r1)		; S390X-NEXT: meeb %f0, 0(%r1)
; S390X-NEXT: br %r14		; S390X-NEXT: br %r14
;		;
; SZ13-LABEL: constrained_vector_fmul_v3f32:		; SZ13-LABEL: constrained_vector_fmul_v3f32:
; SZ13: # %bb.0: # %entry		; SZ13: # %bb.0: # %entry
; SZ13-NEXT: vgmf %v0, 1, 8		; SZ13-NEXT: vgmf %v0, 1, 8
; SZ13-NEXT: larl %r1, .LCPI12_0		; SZ13-NEXT: larl %r1, .LCPI12_0
; SZ13-NEXT: vgmf %v2, 2, 8		; SZ13-NEXT: vgmf %v2, 2, 8
; SZ13-NEXT: vgmf %v1, 1, 8		; SZ13-NEXT: vgmf %v1, 1, 8
[15 lines not shown]	entry:
ret <3 x float> %mul		ret <3 x float> %mul
}		}

define void @constrained_vector_fmul_v3f64(<3 x double>* %a) {		define void @constrained_vector_fmul_v3f64(<3 x double>* %a) {
; S390X-LABEL: constrained_vector_fmul_v3f64:		; S390X-LABEL: constrained_vector_fmul_v3f64:
; S390X: # %bb.0: # %entry		; S390X: # %bb.0: # %entry
; S390X-NEXT: larl %r1, .LCPI13_0		; S390X-NEXT: larl %r1, .LCPI13_0
; S390X-NEXT: ld %f0, 0(%r1)		; S390X-NEXT: ld %f0, 0(%r1)
; S390X-NEXT: ldr %f1, %f0		; S390X-NEXT: ld %f1, 8(%r2)
; S390X-NEXT: ldr %f2, %f0		; S390X-NEXT: ld %f2, 16(%r2)
; S390X-NEXT: mdb %f0, 16(%r2)		; S390X-NEXT: ldr %f3, %f0
; S390X-NEXT: mdb %f2, 8(%r2)		; S390X-NEXT: mdb %f3, 0(%r2)
; S390X-NEXT: mdb %f1, 0(%r2)		; S390X-NEXT: mdbr %f1, %f0
; S390X-NEXT: std %f0, 16(%r2)		; S390X-NEXT: mdbr %f2, %f0
; S390X-NEXT: std %f2, 8(%r2)		; S390X-NEXT: std %f2, 16(%r2)
; S390X-NEXT: std %f1, 0(%r2)		; S390X-NEXT: std %f1, 8(%r2)
		; S390X-NEXT: std %f3, 0(%r2)
; S390X-NEXT: br %r14		; S390X-NEXT: br %r14
;		;
; SZ13-LABEL: constrained_vector_fmul_v3f64:		; SZ13-LABEL: constrained_vector_fmul_v3f64:
; SZ13: # %bb.0: # %entry		; SZ13: # %bb.0: # %entry
; SZ13-NEXT: larl %r1, .LCPI13_0		; SZ13-NEXT: larl %r1, .LCPI13_0
; SZ13-NEXT: vl %v0, 0(%r2)
; SZ13-NEXT: vl %v1, 0(%r1)
; SZ13-NEXT: larl %r1, .LCPI13_1
; SZ13-NEXT: vfmdb %v0, %v1, %v0
; SZ13-NEXT: ld %f1, 0(%r1)		; SZ13-NEXT: ld %f1, 0(%r1)
		; SZ13-NEXT: larl %r1, .LCPI13_1
		; SZ13-NEXT: vl %v0, 0(%r2)
		; SZ13-NEXT: vl %v2, 0(%r1)
; SZ13-NEXT: mdb %f1, 16(%r2)		; SZ13-NEXT: mdb %f1, 16(%r2)
		; SZ13-NEXT: vfmdb %v0, %v2, %v0
; SZ13-NEXT: std %f1, 16(%r2)		; SZ13-NEXT: std %f1, 16(%r2)
; SZ13-NEXT: vst %v0, 0(%r2)		; SZ13-NEXT: vst %v0, 0(%r2)
; SZ13-NEXT: br %r14		; SZ13-NEXT: br %r14
entry:		entry:
%b = load <3 x double>, <3 x double>* %a		%b = load <3 x double>, <3 x double>* %a
%mul = call <3 x double> @llvm.experimental.constrained.fmul.v3f64(		%mul = call <3 x double> @llvm.experimental.constrained.fmul.v3f64(
<3 x double> <double 0x7FEFFFFFFFFFFFFF, double 0x7FEFFFFFFFFFFFFF,		<3 x double> <double 0x7FEFFFFFFFFFFFFF, double 0x7FEFFFFFFFFFFFFF,
double 0x7FEFFFFFFFFFFFFF>,		double 0x7FEFFFFFFFFFFFFF>,
<3 x double> %b,		<3 x double> %b,
metadata !"round.dynamic",		metadata !"round.dynamic",
metadata !"fpexcept.strict")		metadata !"fpexcept.strict")
store <3 x double> %mul, <3 x double>* %a		store <3 x double> %mul, <3 x double>* %a
ret void		ret void
}		}

define <4 x double> @constrained_vector_fmul_v4f64() {		define <4 x double> @constrained_vector_fmul_v4f64() {
; S390X-LABEL: constrained_vector_fmul_v4f64:		; S390X-LABEL: constrained_vector_fmul_v4f64:
; S390X: # %bb.0: # %entry		; S390X: # %bb.0: # %entry
; S390X-NEXT: larl %r1, .LCPI14_0		; S390X-NEXT: larl %r1, .LCPI14_0
; S390X-NEXT: ldeb %f0, 0(%r1)		; S390X-NEXT: ldeb %f6, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI14_1		; S390X-NEXT: larl %r1, .LCPI14_1
; S390X-NEXT: ld %f1, 0(%r1)		; S390X-NEXT: ld %f1, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI14_2		; S390X-NEXT: larl %r1, .LCPI14_2
; S390X-NEXT: ldeb %f2, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI14_3
; S390X-NEXT: ldeb %f4, 0(%r1)		; S390X-NEXT: ldeb %f4, 0(%r1)
		; S390X-NEXT: larl %r1, .LCPI14_3
		; S390X-NEXT: ldeb %f2, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI14_4		; S390X-NEXT: larl %r1, .LCPI14_4
; S390X-NEXT: ldeb %f6, 0(%r1)		; S390X-NEXT: ldeb %f0, 0(%r1)
; S390X-NEXT: mdbr %f0, %f1
; S390X-NEXT: mdbr %f2, %f1
; S390X-NEXT: mdbr %f4, %f1
; S390X-NEXT: mdbr %f6, %f1		; S390X-NEXT: mdbr %f6, %f1
		; S390X-NEXT: mdbr %f4, %f1
		; S390X-NEXT: mdbr %f2, %f1
		; S390X-NEXT: mdbr %f0, %f1
; S390X-NEXT: br %r14		; S390X-NEXT: br %r14
;		;
; SZ13-LABEL: constrained_vector_fmul_v4f64:		; SZ13-LABEL: constrained_vector_fmul_v4f64:
; SZ13: # %bb.0: # %entry		; SZ13: # %bb.0: # %entry
; SZ13-NEXT: larl %r1, .LCPI14_0		; SZ13-NEXT: larl %r1, .LCPI14_0
; SZ13-NEXT: vl %v0, 0(%r1)		; SZ13-NEXT: vl %v0, 0(%r1)
; SZ13-NEXT: larl %r1, .LCPI14_1		; SZ13-NEXT: larl %r1, .LCPI14_1
; SZ13-NEXT: vl %v1, 0(%r1)		; SZ13-NEXT: vl %v1, 0(%r1)
; SZ13-NEXT: larl %r1, .LCPI14_2		; SZ13-NEXT: larl %r1, .LCPI14_2
; SZ13-NEXT: vfmdb %v24, %v1, %v0
; SZ13-NEXT: vl %v0, 0(%r1)
; SZ13-NEXT: vfmdb %v26, %v1, %v0		; SZ13-NEXT: vfmdb %v26, %v1, %v0
		; SZ13-NEXT: vl %v0, 0(%r1)
		; SZ13-NEXT: vfmdb %v24, %v1, %v0
; SZ13-NEXT: br %r14		; SZ13-NEXT: br %r14
entry:		entry:
%mul = call <4 x double> @llvm.experimental.constrained.fmul.v4f64(		%mul = call <4 x double> @llvm.experimental.constrained.fmul.v4f64(
<4 x double> <double 0x7FEFFFFFFFFFFFFF, double 0x7FEFFFFFFFFFFFFF,		<4 x double> <double 0x7FEFFFFFFFFFFFFF, double 0x7FEFFFFFFFFFFFFF,
double 0x7FEFFFFFFFFFFFFF, double 0x7FEFFFFFFFFFFFFF>,		double 0x7FEFFFFFFFFFFFFF, double 0x7FEFFFFFFFFFFFFF>,
<4 x double> <double 2.000000e+00, double 3.000000e+00,		<4 x double> <double 2.000000e+00, double 3.000000e+00,
double 4.000000e+00, double 5.000000e+00>,		double 4.000000e+00, double 5.000000e+00>,
metadata !"round.dynamic",		metadata !"round.dynamic",
[25 lines not shown]	%add = call <1 x float> @llvm.experimental.constrained.fadd.v1f32(
metadata !"fpexcept.strict")		metadata !"fpexcept.strict")
ret <1 x float> %add		ret <1 x float> %add
}		}

define <2 x double> @constrained_vector_fadd_v2f64() {		define <2 x double> @constrained_vector_fadd_v2f64() {
; S390X-LABEL: constrained_vector_fadd_v2f64:		; S390X-LABEL: constrained_vector_fadd_v2f64:
; S390X: # %bb.0: # %entry		; S390X: # %bb.0: # %entry
; S390X-NEXT: larl %r1, .LCPI16_0		; S390X-NEXT: larl %r1, .LCPI16_0
		; S390X-NEXT: ld %f1, 0(%r1)
		; S390X-NEXT: larl %r1, .LCPI16_2
; S390X-NEXT: ldeb %f0, 0(%r1)		; S390X-NEXT: ldeb %f0, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI16_1		; S390X-NEXT: larl %r1, .LCPI16_1
; S390X-NEXT: ld %f2, 0(%r1)		; S390X-NEXT: ldr %f2, %f1
; S390X-NEXT: adbr %f0, %f2
; S390X-NEXT: larl %r1, .LCPI16_2
; S390X-NEXT: adb %f2, 0(%r1)		; S390X-NEXT: adb %f2, 0(%r1)
		; S390X-NEXT: adbr %f0, %f1
; S390X-NEXT: br %r14		; S390X-NEXT: br %r14
;		;
; SZ13-LABEL: constrained_vector_fadd_v2f64:		; SZ13-LABEL: constrained_vector_fadd_v2f64:
; SZ13: # %bb.0: # %entry		; SZ13: # %bb.0: # %entry
; SZ13-NEXT: larl %r1, .LCPI16_0		; SZ13-NEXT: larl %r1, .LCPI16_0
; SZ13-NEXT: vl %v0, 0(%r1)		; SZ13-NEXT: vl %v0, 0(%r1)
; SZ13-NEXT: larl %r1, .LCPI16_1		; SZ13-NEXT: larl %r1, .LCPI16_1
; SZ13-NEXT: vl %v1, 0(%r1)		; SZ13-NEXT: vl %v1, 0(%r1)
; SZ13-NEXT: vfadb %v24, %v1, %v0		; SZ13-NEXT: vfadb %v24, %v1, %v0
; SZ13-NEXT: br %r14		; SZ13-NEXT: br %r14
entry:		entry:
%add = call <2 x double> @llvm.experimental.constrained.fadd.v2f64(		%add = call <2 x double> @llvm.experimental.constrained.fadd.v2f64(
<2 x double> <double 0x7FEFFFFFFFFFFFFF, double 0x7FEFFFFFFFFFFFFF>,		<2 x double> <double 0x7FEFFFFFFFFFFFFF, double 0x7FEFFFFFFFFFFFFF>,
<2 x double> <double 1.000000e+00, double 1.000000e-01>,		<2 x double> <double 1.000000e+00, double 1.000000e-01>,
metadata !"round.dynamic",		metadata !"round.dynamic",
metadata !"fpexcept.strict")		metadata !"fpexcept.strict")
ret <2 x double> %add		ret <2 x double> %add
}		}

define <3 x float> @constrained_vector_fadd_v3f32() {		define <3 x float> @constrained_vector_fadd_v3f32() {
; S390X-LABEL: constrained_vector_fadd_v3f32:		; S390X-LABEL: constrained_vector_fadd_v3f32:
; S390X: # %bb.0: # %entry		; S390X: # %bb.0: # %entry
; S390X-NEXT: larl %r1, .LCPI17_0		; S390X-NEXT: larl %r1, .LCPI17_0
; S390X-NEXT: le %f1, 0(%r1)		; S390X-NEXT: le %f0, 0(%r1)
		; S390X-NEXT: lzer %f4
		; S390X-NEXT: aebr %f4, %f0
; S390X-NEXT: larl %r1, .LCPI17_1		; S390X-NEXT: larl %r1, .LCPI17_1
; S390X-NEXT: ler %f2, %f1		; S390X-NEXT: ler %f2, %f0
; S390X-NEXT: ler %f0, %f1
; S390X-NEXT: aeb %f0, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI17_2
; S390X-NEXT: aeb %f2, 0(%r1)		; S390X-NEXT: aeb %f2, 0(%r1)
; S390X-NEXT: lzer %f4		; S390X-NEXT: larl %r1, .LCPI17_2
; S390X-NEXT: aebr %f4, %f1		; S390X-NEXT: aeb %f0, 0(%r1)
; S390X-NEXT: br %r14		; S390X-NEXT: br %r14
;		;
; SZ13-LABEL: constrained_vector_fadd_v3f32:		; SZ13-LABEL: constrained_vector_fadd_v3f32:
; SZ13: # %bb.0: # %entry		; SZ13: # %bb.0: # %entry
; SZ13-NEXT: vgbm %v0, 15		; SZ13-NEXT: vgbm %v0, 15
; SZ13-NEXT: vgmf %v2, 1, 1		; SZ13-NEXT: vgmf %v2, 1, 1
; SZ13-NEXT: vgmf %v3, 2, 8		; SZ13-NEXT: vgmf %v3, 2, 8
; SZ13-NEXT: lzer %f1		; SZ13-NEXT: lzer %f1
[14 lines not shown]	entry:
ret <3 x float> %add		ret <3 x float> %add
}		}

define void @constrained_vector_fadd_v3f64(<3 x double>* %a) {		define void @constrained_vector_fadd_v3f64(<3 x double>* %a) {
; S390X-LABEL: constrained_vector_fadd_v3f64:		; S390X-LABEL: constrained_vector_fadd_v3f64:
; S390X: # %bb.0: # %entry		; S390X: # %bb.0: # %entry
; S390X-NEXT: larl %r1, .LCPI18_0		; S390X-NEXT: larl %r1, .LCPI18_0
; S390X-NEXT: ld %f0, 0(%r1)		; S390X-NEXT: ld %f0, 0(%r1)
; S390X-NEXT: ldr %f1, %f0		; S390X-NEXT: ld %f1, 8(%r2)
; S390X-NEXT: ldr %f2, %f0		; S390X-NEXT: ld %f2, 16(%r2)
; S390X-NEXT: adb %f0, 16(%r2)		; S390X-NEXT: ldr %f3, %f0
; S390X-NEXT: adb %f2, 8(%r2)		; S390X-NEXT: adb %f3, 0(%r2)
; S390X-NEXT: adb %f1, 0(%r2)		; S390X-NEXT: adbr %f1, %f0
; S390X-NEXT: std %f0, 16(%r2)		; S390X-NEXT: adbr %f2, %f0
; S390X-NEXT: std %f2, 8(%r2)		; S390X-NEXT: std %f2, 16(%r2)
; S390X-NEXT: std %f1, 0(%r2)		; S390X-NEXT: std %f1, 8(%r2)
		; S390X-NEXT: std %f3, 0(%r2)
; S390X-NEXT: br %r14		; S390X-NEXT: br %r14
;		;
; SZ13-LABEL: constrained_vector_fadd_v3f64:		; SZ13-LABEL: constrained_vector_fadd_v3f64:
; SZ13: # %bb.0: # %entry		; SZ13: # %bb.0: # %entry
; SZ13-NEXT: larl %r1, .LCPI18_0		; SZ13-NEXT: larl %r1, .LCPI18_0
; SZ13-NEXT: vl %v0, 0(%r2)
; SZ13-NEXT: vl %v1, 0(%r1)
; SZ13-NEXT: larl %r1, .LCPI18_1
; SZ13-NEXT: vfadb %v0, %v1, %v0
; SZ13-NEXT: ld %f1, 0(%r1)		; SZ13-NEXT: ld %f1, 0(%r1)
		; SZ13-NEXT: larl %r1, .LCPI18_1
		; SZ13-NEXT: vl %v0, 0(%r2)
		; SZ13-NEXT: vl %v2, 0(%r1)
; SZ13-NEXT: adb %f1, 16(%r2)		; SZ13-NEXT: adb %f1, 16(%r2)
		; SZ13-NEXT: vfadb %v0, %v2, %v0
; SZ13-NEXT: std %f1, 16(%r2)		; SZ13-NEXT: std %f1, 16(%r2)
; SZ13-NEXT: vst %v0, 0(%r2)		; SZ13-NEXT: vst %v0, 0(%r2)
; SZ13-NEXT: br %r14		; SZ13-NEXT: br %r14
entry:		entry:
%b = load <3 x double>, <3 x double>* %a		%b = load <3 x double>, <3 x double>* %a
%add = call <3 x double> @llvm.experimental.constrained.fadd.v3f64(		%add = call <3 x double> @llvm.experimental.constrained.fadd.v3f64(
<3 x double> <double 0x7FEFFFFFFFFFFFFF, double 0x7FEFFFFFFFFFFFFF,		<3 x double> <double 0x7FEFFFFFFFFFFFFF, double 0x7FEFFFFFFFFFFFFF,
double 0x7FEFFFFFFFFFFFFF>,		double 0x7FEFFFFFFFFFFFFF>,
<3 x double> %b,		<3 x double> %b,
metadata !"round.dynamic",		metadata !"round.dynamic",
metadata !"fpexcept.strict")		metadata !"fpexcept.strict")
store <3 x double> %add, <3 x double>* %a		store <3 x double> %add, <3 x double>* %a
ret void		ret void
}		}

define <4 x double> @constrained_vector_fadd_v4f64() {		define <4 x double> @constrained_vector_fadd_v4f64() {
; S390X-LABEL: constrained_vector_fadd_v4f64:		; S390X-LABEL: constrained_vector_fadd_v4f64:
; S390X: # %bb.0: # %entry		; S390X: # %bb.0: # %entry
; S390X-NEXT: larl %r1, .LCPI19_0		; S390X-NEXT: larl %r1, .LCPI19_0
; S390X-NEXT: ldeb %f0, 0(%r1)		; S390X-NEXT: ld %f1, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI19_1		; S390X-NEXT: larl %r1, .LCPI19_1
; S390X-NEXT: ld %f6, 0(%r1)		; S390X-NEXT: ldr %f2, %f1
; S390X-NEXT: larl %r1, .LCPI19_3		; S390X-NEXT: ldr %f6, %f1
; S390X-NEXT: ldeb %f4, 0(%r1)		; S390X-NEXT: adb %f6, 0(%r1)
; S390X-NEXT: adbr %f0, %f6
; S390X-NEXT: larl %r1, .LCPI19_2		; S390X-NEXT: larl %r1, .LCPI19_2
; S390X-NEXT: ldr %f2, %f6		; S390X-NEXT: ldeb %f4, 0(%r1)
; S390X-NEXT: adb %f2, 0(%r1)
; S390X-NEXT: adbr %f4, %f6
; S390X-NEXT: larl %r1, .LCPI19_4		; S390X-NEXT: larl %r1, .LCPI19_4
; S390X-NEXT: adb %f6, 0(%r1)		; S390X-NEXT: ldeb %f0, 0(%r1)
		; S390X-NEXT: larl %r1, .LCPI19_3
		; S390X-NEXT: adb %f2, 0(%r1)
		; S390X-NEXT: adbr %f4, %f1
		; S390X-NEXT: adbr %f0, %f1
; S390X-NEXT: br %r14		; S390X-NEXT: br %r14
;		;
; SZ13-LABEL: constrained_vector_fadd_v4f64:		; SZ13-LABEL: constrained_vector_fadd_v4f64:
; SZ13: # %bb.0: # %entry		; SZ13: # %bb.0: # %entry
; SZ13-NEXT: larl %r1, .LCPI19_0		; SZ13-NEXT: larl %r1, .LCPI19_0
; SZ13-NEXT: vl %v0, 0(%r1)		; SZ13-NEXT: vl %v0, 0(%r1)
; SZ13-NEXT: larl %r1, .LCPI19_1		; SZ13-NEXT: larl %r1, .LCPI19_1
; SZ13-NEXT: vl %v1, 0(%r1)		; SZ13-NEXT: vl %v1, 0(%r1)
; SZ13-NEXT: larl %r1, .LCPI19_2		; SZ13-NEXT: larl %r1, .LCPI19_2
; SZ13-NEXT: vfadb %v24, %v1, %v0
; SZ13-NEXT: vl %v0, 0(%r1)
; SZ13-NEXT: vfadb %v26, %v1, %v0		; SZ13-NEXT: vfadb %v26, %v1, %v0
		; SZ13-NEXT: vl %v0, 0(%r1)
		; SZ13-NEXT: vfadb %v24, %v1, %v0
; SZ13-NEXT: br %r14		; SZ13-NEXT: br %r14
entry:		entry:
%add = call <4 x double> @llvm.experimental.constrained.fadd.v4f64(		%add = call <4 x double> @llvm.experimental.constrained.fadd.v4f64(
<4 x double> <double 0x7FEFFFFFFFFFFFFF, double 0x7FEFFFFFFFFFFFFF,		<4 x double> <double 0x7FEFFFFFFFFFFFFF, double 0x7FEFFFFFFFFFFFFF,
double 0x7FEFFFFFFFFFFFFF, double 0x7FEFFFFFFFFFFFFF>,		double 0x7FEFFFFFFFFFFFFF, double 0x7FEFFFFFFFFFFFFF>,
<4 x double> <double 1.000000e+00, double 1.000000e-01,		<4 x double> <double 1.000000e+00, double 1.000000e-01,
double 2.000000e+00, double 2.000000e-01>,		double 2.000000e+00, double 2.000000e-01>,
metadata !"round.dynamic",		metadata !"round.dynamic",
[24 lines not shown]	%sub = call <1 x float> @llvm.experimental.constrained.fsub.v1f32(
metadata !"round.dynamic",		metadata !"round.dynamic",
metadata !"fpexcept.strict")		metadata !"fpexcept.strict")
ret <1 x float> %sub		ret <1 x float> %sub
}		}

define <2 x double> @constrained_vector_fsub_v2f64() {		define <2 x double> @constrained_vector_fsub_v2f64() {
; S390X-LABEL: constrained_vector_fsub_v2f64:		; S390X-LABEL: constrained_vector_fsub_v2f64:
; S390X: # %bb.0: # %entry		; S390X: # %bb.0: # %entry
; S390X-NEXT: larl %r1, .LCPI21_1
; S390X-NEXT: ld %f2, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI21_0		; S390X-NEXT: larl %r1, .LCPI21_0
; S390X-NEXT: ldeb %f1, 0(%r1)		; S390X-NEXT: ld %f0, 0(%r1)
; S390X-NEXT: ldr %f0, %f2
; S390X-NEXT: larl %r1, .LCPI21_2		; S390X-NEXT: larl %r1, .LCPI21_2
		; S390X-NEXT: ldeb %f1, 0(%r1)
		; S390X-NEXT: larl %r1, .LCPI21_1
		; S390X-NEXT: ldr %f2, %f0
; S390X-NEXT: sdb %f2, 0(%r1)		; S390X-NEXT: sdb %f2, 0(%r1)
; S390X-NEXT: sdbr %f0, %f1		; S390X-NEXT: sdbr %f0, %f1
; S390X-NEXT: br %r14		; S390X-NEXT: br %r14
;		;
; SZ13-LABEL: constrained_vector_fsub_v2f64:		; SZ13-LABEL: constrained_vector_fsub_v2f64:
; SZ13: # %bb.0: # %entry		; SZ13: # %bb.0: # %entry
; SZ13-NEXT: larl %r1, .LCPI21_0		; SZ13-NEXT: larl %r1, .LCPI21_0
; SZ13-NEXT: vl %v0, 0(%r1)		; SZ13-NEXT: vl %v0, 0(%r1)
; SZ13-NEXT: vgmg %v1, 12, 10		; SZ13-NEXT: vgmg %v1, 12, 10
; SZ13-NEXT: vfsdb %v24, %v1, %v0		; SZ13-NEXT: vfsdb %v24, %v1, %v0
; SZ13-NEXT: br %r14		; SZ13-NEXT: br %r14
entry:		entry:
%sub = call <2 x double> @llvm.experimental.constrained.fsub.v2f64(		%sub = call <2 x double> @llvm.experimental.constrained.fsub.v2f64(
<2 x double> <double 0xFFEFFFFFFFFFFFFF, double 0xFFEFFFFFFFFFFFFF>,		<2 x double> <double 0xFFEFFFFFFFFFFFFF, double 0xFFEFFFFFFFFFFFFF>,
<2 x double> <double 1.000000e+00, double 1.000000e-01>,		<2 x double> <double 1.000000e+00, double 1.000000e-01>,
metadata !"round.dynamic",		metadata !"round.dynamic",
metadata !"fpexcept.strict")		metadata !"fpexcept.strict")
ret <2 x double> %sub		ret <2 x double> %sub
}		}

define <3 x float> @constrained_vector_fsub_v3f32() {		define <3 x float> @constrained_vector_fsub_v3f32() {
; S390X-LABEL: constrained_vector_fsub_v3f32:		; S390X-LABEL: constrained_vector_fsub_v3f32:
; S390X: # %bb.0: # %entry		; S390X: # %bb.0: # %entry
; S390X-NEXT: larl %r1, .LCPI22_0		; S390X-NEXT: larl %r1, .LCPI22_0
; S390X-NEXT: le %f4, 0(%r1)		; S390X-NEXT: le %f0, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI22_1
; S390X-NEXT: ler %f0, %f4
; S390X-NEXT: seb %f0, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI22_2
; S390X-NEXT: ler %f2, %f4
; S390X-NEXT: seb %f2, 0(%r1)
; S390X-NEXT: lzer %f1		; S390X-NEXT: lzer %f1
		; S390X-NEXT: ler %f4, %f0
; S390X-NEXT: sebr %f4, %f1		; S390X-NEXT: sebr %f4, %f1
		; S390X-NEXT: larl %r1, .LCPI22_1
		; S390X-NEXT: ler %f2, %f0
		; S390X-NEXT: seb %f2, 0(%r1)
		; S390X-NEXT: larl %r1, .LCPI22_2
		; S390X-NEXT: seb %f0, 0(%r1)
; S390X-NEXT: br %r14		; S390X-NEXT: br %r14
;		;
; SZ13-LABEL: constrained_vector_fsub_v3f32:		; SZ13-LABEL: constrained_vector_fsub_v3f32:
; SZ13: # %bb.0: # %entry		; SZ13: # %bb.0: # %entry
; SZ13-NEXT: vgbm %v2, 15		; SZ13-NEXT: vgbm %v2, 15
; SZ13-NEXT: lzer %f1		; SZ13-NEXT: lzer %f1
; SZ13-NEXT: sebr %f2, %f1		; SZ13-NEXT: sebr %f2, %f1
; SZ13-NEXT: vgmf %v1, 1, 1		; SZ13-NEXT: vgmf %v1, 1, 1
Show All 16 Lines	entry:
ret <3 x float> %sub		ret <3 x float> %sub
}		}

define void @constrained_vector_fsub_v3f64(<3 x double>* %a) {		define void @constrained_vector_fsub_v3f64(<3 x double>* %a) {
; S390X-LABEL: constrained_vector_fsub_v3f64:		; S390X-LABEL: constrained_vector_fsub_v3f64:
; S390X: # %bb.0: # %entry		; S390X: # %bb.0: # %entry
; S390X-NEXT: larl %r1, .LCPI23_0		; S390X-NEXT: larl %r1, .LCPI23_0
; S390X-NEXT: ld %f0, 0(%r1)		; S390X-NEXT: ld %f0, 0(%r1)
; S390X-NEXT: ldr %f1, %f0		; S390X-NEXT: ld %f1, 8(%r2)
; S390X-NEXT: ldr %f2, %f0		; S390X-NEXT: ld %f2, 16(%r2)
; S390X-NEXT: sdb %f0, 16(%r2)		; S390X-NEXT: ldr %f3, %f0
; S390X-NEXT: sdb %f2, 8(%r2)		; S390X-NEXT: sdb %f3, 0(%r2)
; S390X-NEXT: sdb %f1, 0(%r2)		; S390X-NEXT: ldr %f4, %f0
		; S390X-NEXT: sdbr %f4, %f1
		; S390X-NEXT: sdbr %f0, %f2
; S390X-NEXT: std %f0, 16(%r2)		; S390X-NEXT: std %f0, 16(%r2)
; S390X-NEXT: std %f2, 8(%r2)		; S390X-NEXT: std %f4, 8(%r2)
; S390X-NEXT: std %f1, 0(%r2)		; S390X-NEXT: std %f3, 0(%r2)
; S390X-NEXT: br %r14		; S390X-NEXT: br %r14
;		;
; SZ13-LABEL: constrained_vector_fsub_v3f64:		; SZ13-LABEL: constrained_vector_fsub_v3f64:
; SZ13: # %bb.0: # %entry		; SZ13: # %bb.0: # %entry
; SZ13-NEXT: vl %v0, 0(%r2)		; SZ13-NEXT: vl %v0, 0(%r2)
		; SZ13-NEXT: vgmg %v2, 12, 10
		; SZ13-NEXT: sdb %f2, 16(%r2)
; SZ13-NEXT: vgmg %v1, 12, 10		; SZ13-NEXT: vgmg %v1, 12, 10
; SZ13-NEXT: vfsdb %v0, %v1, %v0		; SZ13-NEXT: vfsdb %v0, %v1, %v0
; SZ13-NEXT: sdb %f1, 16(%r2)		; SZ13-NEXT: std %f2, 16(%r2)
; SZ13-NEXT: std %f1, 16(%r2)
; SZ13-NEXT: vst %v0, 0(%r2)		; SZ13-NEXT: vst %v0, 0(%r2)
; SZ13-NEXT: br %r14		; SZ13-NEXT: br %r14
entry:		entry:
%b = load <3 x double>, <3 x double>* %a		%b = load <3 x double>, <3 x double>* %a
%sub = call <3 x double> @llvm.experimental.constrained.fsub.v3f64(		%sub = call <3 x double> @llvm.experimental.constrained.fsub.v3f64(
<3 x double> <double 0xFFEFFFFFFFFFFFFF, double 0xFFEFFFFFFFFFFFFF,		<3 x double> <double 0xFFEFFFFFFFFFFFFF, double 0xFFEFFFFFFFFFFFFF,
double 0xFFEFFFFFFFFFFFFF>,		double 0xFFEFFFFFFFFFFFFF>,
<3 x double> %b,		<3 x double> %b,
metadata !"round.dynamic",		metadata !"round.dynamic",
metadata !"fpexcept.strict")		metadata !"fpexcept.strict")
store <3 x double> %sub, <3 x double>* %a		store <3 x double> %sub, <3 x double>* %a
ret void		ret void
}		}

define <4 x double> @constrained_vector_fsub_v4f64() {		define <4 x double> @constrained_vector_fsub_v4f64() {
; S390X-LABEL: constrained_vector_fsub_v4f64:		; S390X-LABEL: constrained_vector_fsub_v4f64:
; S390X: # %bb.0: # %entry		; S390X: # %bb.0: # %entry
; S390X-NEXT: larl %r1, .LCPI24_1
; S390X-NEXT: ld %f6, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI24_0		; S390X-NEXT: larl %r1, .LCPI24_0
; S390X-NEXT: ldeb %f1, 0(%r1)		; S390X-NEXT: ld %f0, 0(%r1)
; S390X-NEXT: ldr %f0, %f6		; S390X-NEXT: larl %r1, .LCPI24_1
		; S390X-NEXT: ldr %f6, %f0
		; S390X-NEXT: sdb %f6, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI24_2		; S390X-NEXT: larl %r1, .LCPI24_2
; S390X-NEXT: ldr %f2, %f6		; S390X-NEXT: ldeb %f1, 0(%r1)
; S390X-NEXT: sdb %f2, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI24_3
; S390X-NEXT: ldeb %f3, 0(%r1)
; S390X-NEXT: ldr %f4, %f6
; S390X-NEXT: larl %r1, .LCPI24_4		; S390X-NEXT: larl %r1, .LCPI24_4
; S390X-NEXT: sdb %f6, 0(%r1)		; S390X-NEXT: ldeb %f3, 0(%r1)
; S390X-NEXT: sdbr %f0, %f1		; S390X-NEXT: larl %r1, .LCPI24_3
; S390X-NEXT: sdbr %f4, %f3		; S390X-NEXT: ldr %f2, %f0
		; S390X-NEXT: sdb %f2, 0(%r1)
		; S390X-NEXT: ldr %f4, %f0
		; S390X-NEXT: sdbr %f4, %f1
		; S390X-NEXT: sdbr %f0, %f3
; S390X-NEXT: br %r14		; S390X-NEXT: br %r14
;		;
; SZ13-LABEL: constrained_vector_fsub_v4f64:		; SZ13-LABEL: constrained_vector_fsub_v4f64:
; SZ13: # %bb.0: # %entry		; SZ13: # %bb.0: # %entry
; SZ13-NEXT: larl %r1, .LCPI24_0		; SZ13-NEXT: larl %r1, .LCPI24_0
; SZ13-NEXT: vl %v0, 0(%r1)		; SZ13-NEXT: vl %v0, 0(%r1)
; SZ13-NEXT: vgmg %v1, 12, 10		; SZ13-NEXT: vgmg %v1, 12, 10
; SZ13-NEXT: larl %r1, .LCPI24_1		; SZ13-NEXT: larl %r1, .LCPI24_1
; SZ13-NEXT: vfsdb %v24, %v1, %v0
; SZ13-NEXT: vl %v0, 0(%r1)
; SZ13-NEXT: vfsdb %v26, %v1, %v0		; SZ13-NEXT: vfsdb %v26, %v1, %v0
		; SZ13-NEXT: vl %v0, 0(%r1)
		; SZ13-NEXT: vfsdb %v24, %v1, %v0
; SZ13-NEXT: br %r14		; SZ13-NEXT: br %r14
entry:		entry:
%sub = call <4 x double> @llvm.experimental.constrained.fsub.v4f64(		%sub = call <4 x double> @llvm.experimental.constrained.fsub.v4f64(
<4 x double> <double 0xFFEFFFFFFFFFFFFF, double 0xFFEFFFFFFFFFFFFF,		<4 x double> <double 0xFFEFFFFFFFFFFFFF, double 0xFFEFFFFFFFFFFFFF,
double 0xFFEFFFFFFFFFFFFF, double 0xFFEFFFFFFFFFFFFF>,		double 0xFFEFFFFFFFFFFFFF, double 0xFFEFFFFFFFFFFFFF>,
<4 x double> <double 1.000000e+00, double 1.000000e-01,		<4 x double> <double 1.000000e+00, double 1.000000e-01,
double 2.000000e+00, double 2.000000e-01>,		double 2.000000e+00, double 2.000000e-01>,
metadata !"round.dynamic",		metadata !"round.dynamic",
▲ Show 20 Lines • Show All 45 Lines • ▼ Show 20 Lines	%sqrt = call <2 x double> @llvm.experimental.constrained.sqrt.v2f64(
metadata !"fpexcept.strict")		metadata !"fpexcept.strict")
ret <2 x double> %sqrt		ret <2 x double> %sqrt
}		}

define <3 x float> @constrained_vector_sqrt_v3f32() {		define <3 x float> @constrained_vector_sqrt_v3f32() {
; S390X-LABEL: constrained_vector_sqrt_v3f32:		; S390X-LABEL: constrained_vector_sqrt_v3f32:
; S390X: # %bb.0: # %entry		; S390X: # %bb.0: # %entry
; S390X-NEXT: larl %r1, .LCPI27_0		; S390X-NEXT: larl %r1, .LCPI27_0
; S390X-NEXT: sqeb %f0, 0(%r1)		; S390X-NEXT: sqeb %f4, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI27_1		; S390X-NEXT: larl %r1, .LCPI27_1
; S390X-NEXT: sqeb %f2, 0(%r1)		; S390X-NEXT: sqeb %f2, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI27_2		; S390X-NEXT: larl %r1, .LCPI27_2
; S390X-NEXT: sqeb %f4, 0(%r1)		; S390X-NEXT: sqeb %f0, 0(%r1)
; S390X-NEXT: br %r14		; S390X-NEXT: br %r14
;		;
; SZ13-LABEL: constrained_vector_sqrt_v3f32:		; SZ13-LABEL: constrained_vector_sqrt_v3f32:
; SZ13: # %bb.0: # %entry		; SZ13: # %bb.0: # %entry
; SZ13-NEXT: larl %r1, .LCPI27_0		; SZ13-NEXT: larl %r1, .LCPI27_0
; SZ13-NEXT: sqeb %f0, 0(%r1)		; SZ13-NEXT: sqeb %f0, 0(%r1)
; SZ13-NEXT: larl %r1, .LCPI27_1		; SZ13-NEXT: larl %r1, .LCPI27_1
; SZ13-NEXT: vrepf %v0, %v0, 0		; SZ13-NEXT: vrepf %v0, %v0, 0
Show All 9 Lines	%sqrt = call <3 x float> @llvm.experimental.constrained.sqrt.v3f32(
metadata !"round.dynamic",		metadata !"round.dynamic",
metadata !"fpexcept.strict")		metadata !"fpexcept.strict")
ret <3 x float> %sqrt		ret <3 x float> %sqrt
}		}

define void @constrained_vector_sqrt_v3f64(<3 x double>* %a) {		define void @constrained_vector_sqrt_v3f64(<3 x double>* %a) {
; S390X-LABEL: constrained_vector_sqrt_v3f64:		; S390X-LABEL: constrained_vector_sqrt_v3f64:
; S390X: # %bb.0: # %entry		; S390X: # %bb.0: # %entry
; S390X-NEXT: sqdb %f0, 16(%r2)		; S390X-NEXT: ld %f0, 8(%r2)
; S390X-NEXT: sqdb %f1, 8(%r2)		; S390X-NEXT: ld %f1, 16(%r2)
; S390X-NEXT: sqdb %f2, 0(%r2)		; S390X-NEXT: sqdb %f2, 0(%r2)
; S390X-NEXT: std %f0, 16(%r2)		; S390X-NEXT: sqdbr %f0, %f0
; S390X-NEXT: std %f1, 8(%r2)		; S390X-NEXT: sqdbr %f1, %f1
		; S390X-NEXT: std %f1, 16(%r2)
		; S390X-NEXT: std %f0, 8(%r2)
; S390X-NEXT: std %f2, 0(%r2)		; S390X-NEXT: std %f2, 0(%r2)
; S390X-NEXT: br %r14		; S390X-NEXT: br %r14
;		;
; SZ13-LABEL: constrained_vector_sqrt_v3f64:		; SZ13-LABEL: constrained_vector_sqrt_v3f64:
; SZ13: # %bb.0: # %entry		; SZ13: # %bb.0: # %entry
; SZ13-NEXT: sqdb %f1, 16(%r2)		; SZ13-NEXT: sqdb %f1, 16(%r2)
; SZ13-NEXT: vl %v0, 0(%r2)		; SZ13-NEXT: vl %v0, 0(%r2)
; SZ13-NEXT: std %f1, 16(%r2)		; SZ13-NEXT: std %f1, 16(%r2)
Show All 9 Lines	entry:
store <3 x double> %sqrt, <3 x double>* %a		store <3 x double> %sqrt, <3 x double>* %a
ret void		ret void
}		}

define <4 x double> @constrained_vector_sqrt_v4f64() {		define <4 x double> @constrained_vector_sqrt_v4f64() {
; S390X-LABEL: constrained_vector_sqrt_v4f64:		; S390X-LABEL: constrained_vector_sqrt_v4f64:
; S390X: # %bb.0: # %entry		; S390X: # %bb.0: # %entry
; S390X-NEXT: larl %r1, .LCPI29_0		; S390X-NEXT: larl %r1, .LCPI29_0
; S390X-NEXT: sqdb %f2, 0(%r1)		; S390X-NEXT: sqdb %f6, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI29_1		; S390X-NEXT: larl %r1, .LCPI29_1
; S390X-NEXT: sqdb %f4, 0(%r1)		; S390X-NEXT: sqdb %f4, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI29_3		; S390X-NEXT: larl %r1, .LCPI29_3
; S390X-NEXT: ldeb %f0, 0(%r1)		; S390X-NEXT: ldeb %f0, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI29_2		; S390X-NEXT: larl %r1, .LCPI29_2
; S390X-NEXT: sqdb %f6, 0(%r1)		; S390X-NEXT: sqdb %f2, 0(%r1)
; S390X-NEXT: sqdbr %f0, %f0		; S390X-NEXT: sqdbr %f0, %f0
; S390X-NEXT: br %r14		; S390X-NEXT: br %r14
;		;
; SZ13-LABEL: constrained_vector_sqrt_v4f64:		; SZ13-LABEL: constrained_vector_sqrt_v4f64:
; SZ13: # %bb.0: # %entry		; SZ13: # %bb.0: # %entry
; SZ13-NEXT: larl %r1, .LCPI29_0		; SZ13-NEXT: larl %r1, .LCPI29_0
; SZ13-NEXT: vl %v0, 0(%r1)		; SZ13-NEXT: vl %v0, 0(%r1)
; SZ13-NEXT: vfsqdb %v24, %v0		; SZ13-NEXT: vfsqdb %v26, %v0
; SZ13-NEXT: larl %r1, .LCPI29_1		; SZ13-NEXT: larl %r1, .LCPI29_1
; SZ13-NEXT: vl %v0, 0(%r1)		; SZ13-NEXT: vl %v0, 0(%r1)
; SZ13-NEXT: vfsqdb %v26, %v0		; SZ13-NEXT: vfsqdb %v24, %v0
; SZ13-NEXT: br %r14		; SZ13-NEXT: br %r14
entry:		entry:
%sqrt = call <4 x double> @llvm.experimental.constrained.sqrt.v4f64(		%sqrt = call <4 x double> @llvm.experimental.constrained.sqrt.v4f64(
<4 x double> <double 42.0, double 42.1,		<4 x double> <double 42.0, double 42.1,
double 42.2, double 42.3>,		double 42.2, double 42.3>,
metadata !"round.dynamic",		metadata !"round.dynamic",
metadata !"fpexcept.strict")		metadata !"fpexcept.strict")
ret <4 x double> %sqrt		ret <4 x double> %sqrt
▲ Show 20 Lines • Show All 3,001 Lines • ▼ Show 20 Lines	%rint = call <1 x float> @llvm.experimental.constrained.rint.v1f32(
metadata !"fpexcept.strict")		metadata !"fpexcept.strict")
ret <1 x float> %rint		ret <1 x float> %rint
}		}

define <2 x double> @constrained_vector_rint_v2f64() {		define <2 x double> @constrained_vector_rint_v2f64() {
; S390X-LABEL: constrained_vector_rint_v2f64:		; S390X-LABEL: constrained_vector_rint_v2f64:
; S390X: # %bb.0: # %entry		; S390X: # %bb.0: # %entry
; S390X-NEXT: larl %r1, .LCPI76_0		; S390X-NEXT: larl %r1, .LCPI76_0
; S390X-NEXT: ld %f0, 0(%r1)		; S390X-NEXT: ldeb %f0, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI76_1		; S390X-NEXT: larl %r1, .LCPI76_1
; S390X-NEXT: ldeb %f1, 0(%r1)		; S390X-NEXT: ld %f1, 0(%r1)
; S390X-NEXT: fidbr %f0, 0, %f0		; S390X-NEXT: fidbr %f2, 0, %f0
; S390X-NEXT: fidbr %f2, 0, %f1		; S390X-NEXT: fidbr %f0, 0, %f1
; S390X-NEXT: br %r14		; S390X-NEXT: br %r14
;		;
; SZ13-LABEL: constrained_vector_rint_v2f64:		; SZ13-LABEL: constrained_vector_rint_v2f64:
; SZ13: # %bb.0: # %entry		; SZ13: # %bb.0: # %entry
; SZ13-NEXT: larl %r1, .LCPI76_0		; SZ13-NEXT: larl %r1, .LCPI76_0
; SZ13-NEXT: vl %v0, 0(%r1)		; SZ13-NEXT: vl %v0, 0(%r1)
; SZ13-NEXT: vfidb %v24, %v0, 0, 0		; SZ13-NEXT: vfidb %v24, %v0, 0, 0
; SZ13-NEXT: br %r14		; SZ13-NEXT: br %r14
Show All 9 Lines
; S390X-LABEL: constrained_vector_rint_v3f32:		; S390X-LABEL: constrained_vector_rint_v3f32:
; S390X: # %bb.0: # %entry		; S390X: # %bb.0: # %entry
; S390X-NEXT: larl %r1, .LCPI77_0		; S390X-NEXT: larl %r1, .LCPI77_0
; S390X-NEXT: le %f0, 0(%r1)		; S390X-NEXT: le %f0, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI77_1		; S390X-NEXT: larl %r1, .LCPI77_1
; S390X-NEXT: le %f1, 0(%r1)		; S390X-NEXT: le %f1, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI77_2		; S390X-NEXT: larl %r1, .LCPI77_2
; S390X-NEXT: le %f3, 0(%r1)		; S390X-NEXT: le %f3, 0(%r1)
; S390X-NEXT: fiebr %f0, 0, %f0		; S390X-NEXT: fiebr %f4, 0, %f0
; S390X-NEXT: fiebr %f2, 0, %f1		; S390X-NEXT: fiebr %f2, 0, %f1
; S390X-NEXT: fiebr %f4, 0, %f3		; S390X-NEXT: fiebr %f0, 0, %f3
; S390X-NEXT: br %r14		; S390X-NEXT: br %r14
;		;
; SZ13-LABEL: constrained_vector_rint_v3f32:		; SZ13-LABEL: constrained_vector_rint_v3f32:
; SZ13: # %bb.0: # %entry		; SZ13: # %bb.0: # %entry
; SZ13-NEXT: larl %r1, .LCPI77_0		; SZ13-NEXT: larl %r1, .LCPI77_0
; SZ13-NEXT: lde %f0, 0(%r1)		; SZ13-NEXT: lde %f0, 0(%r1)
; SZ13-NEXT: larl %r1, .LCPI77_1		; SZ13-NEXT: larl %r1, .LCPI77_1
; SZ13-NEXT: lde %f1, 0(%r1)		; SZ13-NEXT: lde %f1, 0(%r1)
▲ Show 20 Lines • Show All 50 Lines • ▼ Show 20 Lines
define <4 x double> @constrained_vector_rint_v4f64() {		define <4 x double> @constrained_vector_rint_v4f64() {
; S390X-LABEL: constrained_vector_rint_v4f64:		; S390X-LABEL: constrained_vector_rint_v4f64:
; S390X: # %bb.0: # %entry		; S390X: # %bb.0: # %entry
; S390X-NEXT: larl %r1, .LCPI79_0		; S390X-NEXT: larl %r1, .LCPI79_0
; S390X-NEXT: ld %f0, 0(%r1)		; S390X-NEXT: ld %f0, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI79_1		; S390X-NEXT: larl %r1, .LCPI79_1
; S390X-NEXT: ld %f1, 0(%r1)		; S390X-NEXT: ld %f1, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI79_2		; S390X-NEXT: larl %r1, .LCPI79_2
; S390X-NEXT: ld %f3, 0(%r1)		; S390X-NEXT: ld %f2, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI79_3		; S390X-NEXT: larl %r1, .LCPI79_3
; S390X-NEXT: ld %f5, 0(%r1)		; S390X-NEXT: ld %f3, 0(%r1)
; S390X-NEXT: fidbr %f0, 0, %f0		; S390X-NEXT: fidbr %f6, 0, %f0
; S390X-NEXT: fidbr %f2, 0, %f1		; S390X-NEXT: fidbr %f4, 0, %f1
; S390X-NEXT: fidbr %f4, 0, %f3		; S390X-NEXT: fidbr %f2, 0, %f2
; S390X-NEXT: fidbr %f6, 0, %f5		; S390X-NEXT: fidbr %f0, 0, %f3
; S390X-NEXT: br %r14		; S390X-NEXT: br %r14
;		;
; SZ13-LABEL: constrained_vector_rint_v4f64:		; SZ13-LABEL: constrained_vector_rint_v4f64:
; SZ13: # %bb.0: # %entry		; SZ13: # %bb.0: # %entry
; SZ13-NEXT: larl %r1, .LCPI79_0		; SZ13-NEXT: larl %r1, .LCPI79_0
; SZ13-NEXT: vl %v0, 0(%r1)		; SZ13-NEXT: vl %v0, 0(%r1)
; SZ13-NEXT: larl %r1, .LCPI79_1		; SZ13-NEXT: larl %r1, .LCPI79_1
; SZ13-NEXT: vfidb %v24, %v0, 0, 0		; SZ13-NEXT: vfidb %v24, %v0, 0, 0
▲ Show 20 Lines • Show All 1,040 Lines • ▼ Show 20 Lines

define <2 x float> @constrained_vector_fptrunc_v2f64() {		define <2 x float> @constrained_vector_fptrunc_v2f64() {
; S390X-LABEL: constrained_vector_fptrunc_v2f64:		; S390X-LABEL: constrained_vector_fptrunc_v2f64:
; S390X: # %bb.0: # %entry		; S390X: # %bb.0: # %entry
; S390X-NEXT: larl %r1, .LCPI96_0		; S390X-NEXT: larl %r1, .LCPI96_0
; S390X-NEXT: ld %f0, 0(%r1)		; S390X-NEXT: ld %f0, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI96_1		; S390X-NEXT: larl %r1, .LCPI96_1
; S390X-NEXT: ld %f1, 0(%r1)		; S390X-NEXT: ld %f1, 0(%r1)
; S390X-NEXT: ledbr %f0, %f0		; S390X-NEXT: ledbr %f2, %f0
; S390X-NEXT: ledbr %f2, %f1		; S390X-NEXT: ledbr %f0, %f1
; S390X-NEXT: br %r14		; S390X-NEXT: br %r14
;		;
; SZ13-LABEL: constrained_vector_fptrunc_v2f64:		; SZ13-LABEL: constrained_vector_fptrunc_v2f64:
; SZ13: # %bb.0: # %entry		; SZ13: # %bb.0: # %entry
; SZ13-NEXT: larl %r1, .LCPI96_0		; SZ13-NEXT: larl %r1, .LCPI96_0
; SZ13-NEXT: ld %f0, 0(%r1)		; SZ13-NEXT: ld %f0, 0(%r1)
; SZ13-NEXT: larl %r1, .LCPI96_1		; SZ13-NEXT: larl %r1, .LCPI96_1
; SZ13-NEXT: ld %f1, 0(%r1)		; SZ13-NEXT: ld %f1, 0(%r1)
▲ Show 20 Lines • Show All 54 Lines • ▼ Show 20 Lines
define <4 x float> @constrained_vector_fptrunc_v4f64() {		define <4 x float> @constrained_vector_fptrunc_v4f64() {
; S390X-LABEL: constrained_vector_fptrunc_v4f64:		; S390X-LABEL: constrained_vector_fptrunc_v4f64:
; S390X: # %bb.0: # %entry		; S390X: # %bb.0: # %entry
; S390X-NEXT: larl %r1, .LCPI98_0		; S390X-NEXT: larl %r1, .LCPI98_0
; S390X-NEXT: ld %f0, 0(%r1)		; S390X-NEXT: ld %f0, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI98_1		; S390X-NEXT: larl %r1, .LCPI98_1
; S390X-NEXT: ld %f1, 0(%r1)		; S390X-NEXT: ld %f1, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI98_2		; S390X-NEXT: larl %r1, .LCPI98_2
; S390X-NEXT: ld %f3, 0(%r1)		; S390X-NEXT: ld %f2, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI98_3		; S390X-NEXT: larl %r1, .LCPI98_3
; S390X-NEXT: ld %f5, 0(%r1)		; S390X-NEXT: ld %f3, 0(%r1)
; S390X-NEXT: ledbr %f0, %f0		; S390X-NEXT: ledbr %f6, %f0
; S390X-NEXT: ledbr %f2, %f1		; S390X-NEXT: ledbr %f4, %f1
; S390X-NEXT: ledbr %f4, %f3		; S390X-NEXT: ledbr %f2, %f2
; S390X-NEXT: ledbr %f6, %f5		; S390X-NEXT: ledbr %f0, %f3
; S390X-NEXT: br %r14		; S390X-NEXT: br %r14
;		;
; SZ13-LABEL: constrained_vector_fptrunc_v4f64:		; SZ13-LABEL: constrained_vector_fptrunc_v4f64:
; SZ13: # %bb.0: # %entry		; SZ13: # %bb.0: # %entry
; SZ13-NEXT: larl %r1, .LCPI98_0		; SZ13-NEXT: larl %r1, .LCPI98_0
; SZ13-NEXT: ld %f0, 0(%r1)		; SZ13-NEXT: ld %f0, 0(%r1)
; SZ13-NEXT: larl %r1, .LCPI98_1		; SZ13-NEXT: larl %r1, .LCPI98_1
; SZ13-NEXT: ld %f1, 0(%r1)		; SZ13-NEXT: ld %f1, 0(%r1)
▲ Show 20 Lines • Show All 41 Lines • ▼ Show 20 Lines

define <2 x double> @constrained_vector_fpext_v2f32() {		define <2 x double> @constrained_vector_fpext_v2f32() {
; S390X-LABEL: constrained_vector_fpext_v2f32:		; S390X-LABEL: constrained_vector_fpext_v2f32:
; S390X: # %bb.0: # %entry		; S390X: # %bb.0: # %entry
; S390X-NEXT: larl %r1, .LCPI100_0		; S390X-NEXT: larl %r1, .LCPI100_0
; S390X-NEXT: le %f0, 0(%r1)		; S390X-NEXT: le %f0, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI100_1		; S390X-NEXT: larl %r1, .LCPI100_1
; S390X-NEXT: le %f1, 0(%r1)		; S390X-NEXT: le %f1, 0(%r1)
; S390X-NEXT: ldebr %f0, %f0		; S390X-NEXT: ldebr %f2, %f0
; S390X-NEXT: ldebr %f2, %f1		; S390X-NEXT: ldebr %f0, %f1
; S390X-NEXT: br %r14		; S390X-NEXT: br %r14
;		;
; SZ13-LABEL: constrained_vector_fpext_v2f32:		; SZ13-LABEL: constrained_vector_fpext_v2f32:
; SZ13: # %bb.0: # %entry		; SZ13: # %bb.0: # %entry
; SZ13-NEXT: larl %r1, .LCPI100_0		; SZ13-NEXT: larl %r1, .LCPI100_0
; SZ13-NEXT: lde %f0, 0(%r1)		; SZ13-NEXT: lde %f0, 0(%r1)
; SZ13-NEXT: larl %r1, .LCPI100_1		; SZ13-NEXT: larl %r1, .LCPI100_1
; SZ13-NEXT: lde %f1, 0(%r1)		; SZ13-NEXT: lde %f1, 0(%r1)
▲ Show 20 Lines • Show All 49 Lines • ▼ Show 20 Lines
define <4 x double> @constrained_vector_fpext_v4f32() {		define <4 x double> @constrained_vector_fpext_v4f32() {
; S390X-LABEL: constrained_vector_fpext_v4f32:		; S390X-LABEL: constrained_vector_fpext_v4f32:
; S390X: # %bb.0: # %entry		; S390X: # %bb.0: # %entry
; S390X-NEXT: larl %r1, .LCPI102_0		; S390X-NEXT: larl %r1, .LCPI102_0
; S390X-NEXT: le %f0, 0(%r1)		; S390X-NEXT: le %f0, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI102_1		; S390X-NEXT: larl %r1, .LCPI102_1
; S390X-NEXT: le %f1, 0(%r1)		; S390X-NEXT: le %f1, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI102_2		; S390X-NEXT: larl %r1, .LCPI102_2
; S390X-NEXT: le %f3, 0(%r1)		; S390X-NEXT: le %f2, 0(%r1)
; S390X-NEXT: larl %r1, .LCPI102_3		; S390X-NEXT: larl %r1, .LCPI102_3
; S390X-NEXT: le %f5, 0(%r1)		; S390X-NEXT: le %f3, 0(%r1)
; S390X-NEXT: ldebr %f0, %f0		; S390X-NEXT: ldebr %f6, %f0
; S390X-NEXT: ldebr %f2, %f1		; S390X-NEXT: ldebr %f4, %f1
; S390X-NEXT: ldebr %f4, %f3		; S390X-NEXT: ldebr %f2, %f2
; S390X-NEXT: ldebr %f6, %f5		; S390X-NEXT: ldebr %f0, %f3
; S390X-NEXT: br %r14		; S390X-NEXT: br %r14
;		;
; SZ13-LABEL: constrained_vector_fpext_v4f32:		; SZ13-LABEL: constrained_vector_fpext_v4f32:
; SZ13: # %bb.0: # %entry		; SZ13: # %bb.0: # %entry
; SZ13-NEXT: larl %r1, .LCPI102_0		; SZ13-NEXT: larl %r1, .LCPI102_0
; SZ13-NEXT: lde %f0, 0(%r1)		; SZ13-NEXT: lde %f0, 0(%r1)
; SZ13-NEXT: larl %r1, .LCPI102_1		; SZ13-NEXT: larl %r1, .LCPI102_1
; SZ13-NEXT: lde %f1, 0(%r1)		; SZ13-NEXT: lde %f1, 0(%r1)
▲ Show 20 Lines • Show All 847 Lines • Show Last 20 Lines

llvm/trunk/utils/TableGen/CodeGenInstruction.h

Show First 20 Lines • Show All 243 Lines • ▼ Show 20 Lines	public:
bool isCall : 1;		bool isCall : 1;
bool isAdd : 1;		bool isAdd : 1;
bool isTrap : 1;		bool isTrap : 1;
bool canFoldAsLoad : 1;		bool canFoldAsLoad : 1;
bool mayLoad : 1;		bool mayLoad : 1;
bool mayLoad_Unset : 1;		bool mayLoad_Unset : 1;
bool mayStore : 1;		bool mayStore : 1;
bool mayStore_Unset : 1;		bool mayStore_Unset : 1;
		bool mayRaiseFPException : 1;
bool isPredicable : 1;		bool isPredicable : 1;
bool isConvertibleToThreeAddress : 1;		bool isConvertibleToThreeAddress : 1;
bool isCommutable : 1;		bool isCommutable : 1;
bool isTerminator : 1;		bool isTerminator : 1;
bool isReMaterializable : 1;		bool isReMaterializable : 1;
bool hasDelaySlot : 1;		bool hasDelaySlot : 1;
bool usesCustomInserter : 1;		bool usesCustomInserter : 1;
bool hasPostISelHook : 1;		bool hasPostISelHook : 1;
▲ Show 20 Lines • Show All 119 Lines • Show Last 20 Lines

llvm/trunk/utils/TableGen/CodeGenInstruction.cpp

Show First 20 Lines • Show All 395 Lines • ▼ Show 20 Lines	CodeGenInstruction::CodeGenInstruction(Record *R)
FastISelShouldIgnore = R->getValueAsBit("FastISelShouldIgnore");		FastISelShouldIgnore = R->getValueAsBit("FastISelShouldIgnore");
variadicOpsAreDefs = R->getValueAsBit("variadicOpsAreDefs");		variadicOpsAreDefs = R->getValueAsBit("variadicOpsAreDefs");

bool Unset;		bool Unset;
mayLoad = R->getValueAsBitOrUnset("mayLoad", Unset);		mayLoad = R->getValueAsBitOrUnset("mayLoad", Unset);
mayLoad_Unset = Unset;		mayLoad_Unset = Unset;
mayStore = R->getValueAsBitOrUnset("mayStore", Unset);		mayStore = R->getValueAsBitOrUnset("mayStore", Unset);
mayStore_Unset = Unset;		mayStore_Unset = Unset;
		mayRaiseFPException = R->getValueAsBit("mayRaiseFPException");
hasSideEffects = R->getValueAsBitOrUnset("hasSideEffects", Unset);		hasSideEffects = R->getValueAsBitOrUnset("hasSideEffects", Unset);
hasSideEffects_Unset = Unset;		hasSideEffects_Unset = Unset;

isAsCheapAsAMove = R->getValueAsBit("isAsCheapAsAMove");		isAsCheapAsAMove = R->getValueAsBit("isAsCheapAsAMove");
hasExtraSrcRegAllocReq = R->getValueAsBit("hasExtraSrcRegAllocReq");		hasExtraSrcRegAllocReq = R->getValueAsBit("hasExtraSrcRegAllocReq");
hasExtraDefRegAllocReq = R->getValueAsBit("hasExtraDefRegAllocReq");		hasExtraDefRegAllocReq = R->getValueAsBit("hasExtraDefRegAllocReq");
isCodeGenOnly = R->getValueAsBit("isCodeGenOnly");		isCodeGenOnly = R->getValueAsBit("isCodeGenOnly");
isPseudo = R->getValueAsBit("isPseudo");		isPseudo = R->getValueAsBit("isPseudo");
▲ Show 20 Lines • Show All 365 Lines • Show Last 20 Lines

llvm/trunk/utils/TableGen/InstrInfoEmitter.cpp

Show First 20 Lines • Show All 597 Lines • ▼ Show 20 Lines	void InstrInfoEmitter::emitRecord(const CodeGenInstruction &Inst, unsigned Num,
if (Inst.isTrap) OS << "|(1ULL<<MCID::Trap)";		if (Inst.isTrap) OS << "|(1ULL<<MCID::Trap)";
if (Inst.isSelect) OS << "|(1ULL<<MCID::Select)";		if (Inst.isSelect) OS << "|(1ULL<<MCID::Select)";
if (Inst.isBarrier) OS << "|(1ULL<<MCID::Barrier)";		if (Inst.isBarrier) OS << "|(1ULL<<MCID::Barrier)";
if (Inst.hasDelaySlot) OS << "|(1ULL<<MCID::DelaySlot)";		if (Inst.hasDelaySlot) OS << "|(1ULL<<MCID::DelaySlot)";
if (Inst.isCall) OS << "|(1ULL<<MCID::Call)";		if (Inst.isCall) OS << "|(1ULL<<MCID::Call)";
if (Inst.canFoldAsLoad) OS << "|(1ULL<<MCID::FoldableAsLoad)";		if (Inst.canFoldAsLoad) OS << "|(1ULL<<MCID::FoldableAsLoad)";
if (Inst.mayLoad) OS << "|(1ULL<<MCID::MayLoad)";		if (Inst.mayLoad) OS << "|(1ULL<<MCID::MayLoad)";
if (Inst.mayStore) OS << "|(1ULL<<MCID::MayStore)";		if (Inst.mayStore) OS << "|(1ULL<<MCID::MayStore)";
		if (Inst.mayRaiseFPException) OS << "|(1ULL<<MCID::MayRaiseFPException)";
if (Inst.isPredicable) OS << "|(1ULL<<MCID::Predicable)";		if (Inst.isPredicable) OS << "|(1ULL<<MCID::Predicable)";
if (Inst.isConvertibleToThreeAddress) OS << "|(1ULL<<MCID::ConvertibleTo3Addr)";		if (Inst.isConvertibleToThreeAddress) OS << "|(1ULL<<MCID::ConvertibleTo3Addr)";
if (Inst.isCommutable) OS << "|(1ULL<<MCID::Commutable)";		if (Inst.isCommutable) OS << "|(1ULL<<MCID::Commutable)";
if (Inst.isTerminator) OS << "|(1ULL<<MCID::Terminator)";		if (Inst.isTerminator) OS << "|(1ULL<<MCID::Terminator)";
if (Inst.isReMaterializable) OS << "|(1ULL<<MCID::Rematerializable)";		if (Inst.isReMaterializable) OS << "|(1ULL<<MCID::Rematerializable)";
if (Inst.isNotDuplicable) OS << "|(1ULL<<MCID::NotDuplicable)";		if (Inst.isNotDuplicable) OS << "|(1ULL<<MCID::NotDuplicable)";
if (Inst.Operands.hasOptionalDef) OS << "|(1ULL<<MCID::HasOptionalDef)";		if (Inst.Operands.hasOptionalDef) OS << "|(1ULL<<MCID::HasOptionalDef)";
if (Inst.usesCustomInserter) OS << "|(1ULL<<MCID::UsesCustomInserter)";		if (Inst.usesCustomInserter) OS << "|(1ULL<<MCID::UsesCustomInserter)";
▲ Show 20 Lines • Show All 117 Lines • Show Last 20 Lines
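
The emitter hunk above appends one `|(1ULL<<MCID::...)` term per property that is set, so the new `mayRaiseFPException` bit ends up as one more bit in the generated flags word. The following is a minimal standalone sketch of that pattern, not the LLVM implementation: the `sketch` namespace, `InstProps`, `emitFlagExpr`, `encodeFlags`, `hasFlag`, the bit positions in `Flag`, and the leading `"0"` term are all assumptions made for illustration. The real enumerators live in `llvm/MC/MCInstrDesc.h`.

```cpp
#include <cassert>
#include <cstdint>
#include <string>

namespace sketch {

// Hypothetical bit positions. The real MCID enumerator values are defined in
// llvm/MC/MCInstrDesc.h and are not shown in this diff.
enum Flag : unsigned { MayLoad = 0, MayStore = 1, MayRaiseFPException = 2 };

// Hypothetical stand-in for the handful of CodeGenInstruction bits used here.
struct InstProps {
  bool mayLoad = false;
  bool mayStore = false;
  bool mayRaiseFPException = false;
};

// Builds the textual flag expression, one "|(1ULL<<MCID::X)" term per set
// property. Starting from "0" is an assumption of this sketch.
inline std::string emitFlagExpr(const InstProps &P) {
  std::string S = "0";
  if (P.mayLoad) S += "|(1ULL<<MCID::MayLoad)";
  if (P.mayStore) S += "|(1ULL<<MCID::MayStore)";
  if (P.mayRaiseFPException) S += "|(1ULL<<MCID::MayRaiseFPException)";
  return S;
}

// Computes the value the emitted expression would have under the bit
// positions assumed in Flag.
inline std::uint64_t encodeFlags(const InstProps &P) {
  std::uint64_t F = 0;
  if (P.mayLoad) F |= 1ULL << MayLoad;
  if (P.mayStore) F |= 1ULL << MayStore;
  if (P.mayRaiseFPException) F |= 1ULL << MayRaiseFPException;
  return F;
}

// Tests a single bit of an encoded flags word.
inline bool hasFlag(std::uint64_t Flags, Flag F) {
  return ((Flags >> F) & 1ULL) != 0;
}

} // namespace sketch
```

A caller would fill in an `InstProps`, pass it to `sketch::encodeFlags`, and query the result with `sketch::hasFlag(Flags, sketch::MayRaiseFPException)`. Keeping each property as an independent bit is what lets a new flag be added without disturbing the existing ones.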

[RFC v2] Allow target to handle STRICT floating-point nodes (Closed, Public)

Revision Contents

Diff 203262

llvm/trunk/include/llvm/CodeGen/MachineInstr.h

llvm/trunk/include/llvm/CodeGen/SelectionDAGNodes.h

llvm/trunk/include/llvm/MC/MCInstrDesc.h

llvm/trunk/include/llvm/Target/Target.td

llvm/trunk/include/llvm/Target/TargetSelectionDAG.td

llvm/trunk/lib/CodeGen/GlobalISel/InstructionSelector.cpp

llvm/trunk/lib/CodeGen/ImplicitNullChecks.cpp

llvm/trunk/lib/CodeGen/MIRParser/MILexer.h

llvm/trunk/lib/CodeGen/MIRParser/MILexer.cpp

llvm/trunk/lib/CodeGen/MIRParser/MIParser.cpp

llvm/trunk/lib/CodeGen/MIRPrinter.cpp

llvm/trunk/lib/CodeGen/MachineCSE.cpp

llvm/trunk/lib/CodeGen/MachineInstr.cpp

llvm/trunk/lib/CodeGen/MachinePipeliner.cpp

llvm/trunk/lib/CodeGen/PeepholeOptimizer.cpp

llvm/trunk/lib/CodeGen/ScheduleDAGInstrs.cpp

llvm/trunk/lib/CodeGen/SelectionDAG/InstrEmitter.cpp

llvm/trunk/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

llvm/trunk/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp

llvm/trunk/lib/CodeGen/TargetInstrInfo.cpp

llvm/trunk/lib/CodeGen/TargetLoweringBase.cpp

llvm/trunk/lib/Target/SystemZ/SystemZISelLowering.cpp

llvm/trunk/lib/Target/SystemZ/SystemZInstrFP.td

llvm/trunk/lib/Target/SystemZ/SystemZInstrVector.td

llvm/trunk/lib/Target/SystemZ/SystemZOperators.td

llvm/trunk/test/CodeGen/SystemZ/fp-strict-add-01.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-add-02.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-add-03.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-add-04.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-alias.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-conv-01.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-conv-02.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-conv-03.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-conv-04.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-conv-15.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-div-01.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-div-02.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-div-03.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-div-04.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-mul-01.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-mul-02.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-mul-03.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-mul-04.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-mul-05.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-mul-06.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-mul-07.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-mul-08.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-mul-09.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-mul-10.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-mul-11.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-round-01.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-round-02.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-round-03.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-sqrt-01.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-sqrt-02.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-sqrt-03.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-sqrt-04.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-sub-01.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-sub-02.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-sub-03.ll

llvm/trunk/test/CodeGen/SystemZ/fp-strict-sub-04.ll

llvm/trunk/test/CodeGen/SystemZ/vec-strict-add-01.ll

llvm/trunk/test/CodeGen/SystemZ/vec-strict-add-02.ll

llvm/trunk/test/CodeGen/SystemZ/vec-strict-div-01.ll

llvm/trunk/test/CodeGen/SystemZ/vec-strict-div-02.ll

llvm/trunk/test/CodeGen/SystemZ/vec-strict-max-01.ll

llvm/trunk/test/CodeGen/SystemZ/vec-strict-min-01.ll

llvm/trunk/test/CodeGen/SystemZ/vec-strict-mul-01.ll

llvm/trunk/test/CodeGen/SystemZ/vec-strict-mul-02.ll

llvm/trunk/test/CodeGen/SystemZ/vec-strict-mul-03.ll

llvm/trunk/test/CodeGen/SystemZ/vec-strict-mul-04.ll

llvm/trunk/test/CodeGen/SystemZ/vec-strict-mul-05.ll

llvm/trunk/test/CodeGen/SystemZ/vec-strict-round-01.ll
