This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/CodeGen/
-
llvm/
-
CodeGen/
-
TargetLowering.h
-
lib/
-
CodeGen/
-
SelectionDAG/
2/10
LegalizeDAG.cpp
-
LegalizeVectorOps.cpp
1
SelectionDAGISel.cpp
-
TargetLoweringBase.cpp
-
Target/SystemZ/
-
SystemZ/
1/2
SystemZISelLowering.cpp

Differential D70226

Add an option to disable strict float node mutating to an normal float node
ClosedPublic

Authored by LiuChen3 on Nov 14 2019, 2:39 AM.

Download Raw Diff

Details

Reviewers

craig.topper
pengfei
uweigand
RKSimon

Commits

rG22a0edd070e4: [FPEnv] Add an option to disable strict float node mutating to an normal float…

Summary

This patch add an option 'disable-strictnode-mutation' to prevent strict node mutating to an normal node.
So we can make sure that the patch which sets strict-node as legal works correctly.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

LiuChen3 created this revision.Nov 14 2019, 2:39 AM

Herald added a project: Restricted Project. · View Herald TranscriptNov 14 2019, 2:39 AM

Herald added subscribers: llvm-commits, hiraditya. · View Herald Transcript

There are some tests fail on SystemZ and I simply transferred
thoses testcases which operations are not legal to a new file.

I don't believe those tests ought to fail. They're all cases where a FP operation is implemented via a library call. Those should be fine for strict FP semantics, so we should get this even with -disable-strictnode-mutation.

I think the option should only disable the call to mutateStrictFPToFP in SelectionDAGISel. The calls in SelectionDAGLegalize::ExpandFPLibCall and SelectionDAGLegalize::ExpandArgFPLibCall are fine IMO.

However, I think the option should in addition disable the special handling of strict FP operations in SelectionDAGLegalize::ExpandNode. Those are only valid because the code expects the mutateStrictFPToFP call to happen in SelectionDAGISel. If this won't happen, the special handling during Expand shouldn't happen either.

Also, I think it would be nice if a target could default to having -disable-strictnode-mutation always on. We'd want to do that on SystemZ. (Then you wouldn't have to change all the test cases either.)

This patch add an variable EnableStrictNode to the TargetLoweringBase to determine whether the target already supports the strict float operation.
Modify the patch @uweigand's suggestion.

@uweigand Thanks for your review. I have added information to see if the target supports strict float, so systemZ can disable strictnode mutation as default.

pengfei added inline comments.Nov 15 2019, 1:08 AM

llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
3716	Is it better to use `!(TLI.isStrictFPEnabled() \|\| DisableStrictNodeMutation)` ?
llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp
1160	Same as above.
llvm/lib/Target/SystemZ/SystemZISelLowering.cpp
639	Extra blank.

Thanks! I just noticed there are two more places where we need to be more careful when strict FP mode is enforced:

In SelectionDAGLegalize::ExpandNode, the implementation of the switch cases for ISD::STRICT_FP_ROUND and ISD::STRICT_FP_EXTEND does not respect strict FP mode, and therefore should be skipped if strict mode is enforced. So you might want to change

// This expansion does not honor the "strict" properties anyway,
// so prefer falling back to the non-strict operation if legal.
if (TLI.getStrictFPOperationAction(Node->getOpcode(),
                                   Node->getValueType(0))
    == TargetLowering::Legal)
  break;

to something like

// This expansion does not honor the "strict" properties,
// so we cannot use it if strict mode is enforced.
if (DisableStrictNodeMutation || TLI.isStrictFPEnabled())
  break;
// If strict mode is not enforced, and the non-strict operation
// is legal, we might as well fall back to that.
if (TLI.getStrictFPOperationAction(Node->getOpcode(),
                                   Node->getValueType(0))
    == TargetLowering::Legal)
  break;

The second place is this shortcut in VectorLegalizer::LegalizeOp

// If we're asked to expand a strict vector floating-point operation,
// by default we're going to simply unroll it.  That is usually the
// best approach, except in the case where the resulting strict (scalar)
// operations would themselves use the fallback mutation to non-strict.
// In that specific case, just do the fallback on the vector op.
if (Action == TargetLowering::Expand &&
    TLI.getStrictFPOperationAction(Node->getOpcode(),
                                   Node->getValueType(0))
    == TargetLowering::Legal) {
  EVT EltVT = Node->getValueType(0).getVectorElementType();
  if (TLI.getOperationAction(Node->getOpcode(), EltVT)
      == TargetLowering::Expand &&
      TLI.getStrictFPOperationAction(Node->getOpcode(), EltVT)
      == TargetLowering::Legal)
    Action = TargetLowering::Legal;
}

This whole logic is only valid if we are allowed to fall back to non-strict expansion, so it should also be guarded by a !StrictFPEnabled check.

Finally, just a minor nit pick: it seems odd to always have to check both DisableStrictNodeMutation and TLI.isStrictFPEnabled(). Can't we incorporate the command line override check into the TLI callback (e.g. by setting the default value of the flag depending on the command line variable)?

LiuChen3 added a subscriber: LuoYuanke.Nov 17 2019, 9:25 PM

Binding the value of DisableStrictNodeMutation to flag IsStrictFPEnabled, updating as comments.

LiuChen3 marked an inline comment as done.Nov 18 2019, 4:11 AM

LiuChen3 added inline comments.

llvm/lib/Target/SystemZ/SystemZISelLowering.cpp
639	Thanks for your review, there is no blank here original.

I'm not sure if there's a possibility: some normal float operations of a backend is not Expand,
but they may think expanding strict-float operation can meet their requirements.
Although I haven't found a case yet.

uweigand added inline comments.Nov 18 2019, 4:33 AM

llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
2658	I'm not sure this is always correct, I think it might be possible that a target might want to select Expand for a strict operation even if they use e.g. Custom for the non-strict version (obviously, that would have to be an operation where common code implements an Expand algorithm that respects the constrained FP semantics). More importantly, even if you do this, you still need to add the checks in STRICT_FP_ROUND and STRICT_FP_EXTEND I mentioned in my earlier comment: note that in those cases, even if the target uses Expand for both the strict and non-strict operation, the code below still cannot be used if isStrictFPEnabled is true (since it does not respect constrained FP semantics).

LiuChen3 marked an inline comment as done.Nov 18 2019, 5:08 AM

LiuChen3 added inline comments.

llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
2658	Thanks. I think I misunderstood what you meant before. You actually mean is if the backend has supported strict float, it can never expand STRICT_FP_ROUND and STRICT_FP_EXTEND operations. We don't expand it not because we setOperationAction wrong or something else, because it isn't 'strict float' at all. I'll only add judgment based on the previous patch and delete this.

pengfei added a subscriber: kpn.Nov 18 2019, 6:56 AM

pengfei added inline comments.

llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
2658	In my opinion, the behavior currently is reasonable. I don't think there's a way in common code can handle an expand strict node if its non-strict node is custom. Otherwise, its non-strict node isn't necessarily to be custom. For STRICT_FP_ROUND and STRICT_FP_EXTEND, I reviewed the discussion between you and @kpn in D65226. I think it's equal to your change if the action of the target's non-strict nodes is legal. And for target that isStrictFPEnabled, the action of the strict nodes can not be set to expand if the expansion does not respect constrained FP semantics.

In my opinion, the behavior currently is reasonable. I don't think there's a way in common code can handle an expand strict node if its non-strict node is custom. Otherwise, its non-strict node isn't necessarily to be custom.

What I was thinking about is: there are some operations where we can have an Expand implementation that correctly respects strict semantics, typically by mapping on top of (other) strict operations. E.g. an implementation of STRICT_UINT_TO_FP in terms of STRICT_SINT_TO_FP (see discussion in D69275). In those cases, the target should be able to select that expansion, even if it has a UINT_TO_FP custom handler (for whatever reason, maybe for some optimizations that aren't possible in the strict case). There's no particular reason to disallow this.

And for target that isStrictFPEnabled, the action of the strict nodes can not be set to expand if the expansion does not respect constrained FP semantics.

But it may still be possible to respect strict semantics by expanding to a libcall -- and that is what should happen in those cases, I think.

In summary, there are four potential cases how Expand of a STRICT node could be implemented:

A custom expansion sequence that respects constrained semantics
Expansion to libcall (where the library is assumed to respect constrained semantics)
A custom expansion sequence that does not respect constrained semantics
"Fake" expansion to the non-strict node

If isStrictFPEnabled is true, then cases 3) and 4) above are forbidden, but cases 1) and 2) are still OK.

My main point is that the difference between 1) and 3) has to be determined on a case-by-case basis by inspecting the particular expansion sequence, and therefore this should be checked in-line in each expansion sequence. This is why I'd prefer to have each affected custom expansion sequence implementation directly the flag check whether or not strict semantics must be enforced or not; I don't believe this can be a single check just at the top of ExpandNode.

My main point is that the difference between 1) and 3) has to be determined on a case-by-case basis by inspecting the particular expansion sequence, and therefore this should be checked in-line in each expansion sequence. This is why I'd prefer to have each affected custom expansion sequence implementation directly the flag check whether or not strict semantics must be enforced or not; I don't believe this can be a single check just at the top of ExpandNode.

Thanks for the explanation! It very helpful for us to understand the workflow of strict semantics.

Update as comments.

Fix format problem.

Thanks! Just one minor nit about the comment inline, otherwise this patch LGTM.

llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
2808–2813	This comment is now duplicated; it would be better to change the comment above to something along the lines of what I suggested earlier, e.g. // This expansion does not honor the "strict" properties, // so we cannot use it if strict mode is enforced.
2834–2839	See above.

I'm strongly considering making mutation a true operation action like Expand, Legal, Custom, etc. So we can distinquish Expand from "my target doesn't support strict FP yet". The checks for "legal" on the non-strict nodes to guess what we should do, don't work for X86 where we have a lot of Custom handling. The strict fp operations would default to this new operation action instead of Expand.

Modify comment

@uweigand Thanks for your help.

In D70226#1752432, @craig.topper wrote:

I'm strongly considering making mutation a true operation action like Expand, Legal, Custom, etc. So we can distinquish Expand from "my target doesn't support strict FP yet". The checks for "legal" on the non-strict nodes to guess what we should do, don't work for X86 where we have a lot of Custom handling. The strict fp operations would default to this new operation action instead of Expand.

Hmm. I'd actually consider this a step in the wrong direction, since the mutation operation is really wrong, it doesn't actually respect strict fp semantics. So I'd rather have it go away completely as soon as possible -- once we have enough target coverage (e.g. X86, Arm, Power, SystemZ?). That's why I like the approach in this patch: all the mutation-related stuff is now behind the isStrictFPEnabledCheck, and soon as we decide target support is sufficient, we just remove all that code. (For targets that still don't support strict FP natively, the intrinsics would then generally map to libcalls.)

uweigand added inline comments.Nov 20 2019, 4:25 AM

llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
2814	Sorry, this is still not quite what I expected: now you've removed the second comment, which was actually correct (and necessary) ... My point is that the two "if" statements implement two very different things, the first is a correctness issue, the second is just a performance optimization. So we really ought to have two different comments explaining the two different purposes of those if statements, as I had in my original suggestion. In your first patch that I commented upon earlier, you had two comments, but both were talking about the performance optimization -- this is wrong for the first if, which is all about correctness. Now you fixed the comment before the first if to talk about correctness, but you removed the second comment completely, which gives the impression that the second if is also about correctness, which it is not ...

LiuChen3 added a comment.Nov 20 2019, 6:58 AM

This comment was removed by LiuChen3.

update the comments

LiuChen3 marked an inline comment as done.Nov 20 2019, 7:20 AM

LiuChen3 added inline comments.

llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
2814	Thanks for your explanation. But why this is a performance optimization? I thought this conversion was just to allow the backend to make the correct instruction selection without supporting strict-float. The performance optimization means by the promotion of the legal instruction compared to the converting to statck operation? Or I misunderstand something?

uweigand added inline comments.Nov 20 2019, 8:20 AM

llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
2814	OK, so when we get here, the back-end has asked common code to "Expand" the STRICT_FP_ROUND operation. Common code has three options to do so: Emit a libcall Replace it with a FP_ROUND -- only possible if FP_ROUND is "Legal" Replace it with a stack operation (truncating store followed by load) If we must enforce strict FP semantics, then only option 1) is allowed, since both options 2) and 3) do not respect that semantics. That is the correctness property that is enforced by the first "if". Now, if we do not have to enfore strict FP semantics, then either option 1), 2) or 3) would be allowed. So in case, we make the decision on the relative efficiency of those options, where we'd usually have 2) the fastest, followed by 3), and then 1) as the slowest. Since 2) is not always possible, we'd choose 2) when it is available, and 3) otherwise. This is what the second "if" achieves. Does this make it clearer? If you find some other wording for those comments that convey that explanation in a better way, feel free to update them :-)

craig.topper added inline comments.Nov 20 2019, 8:59 AM

llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
2814	X86 has FP_ROUND marked Custom, but most type combinations are Legal. I had to mark STRICT_FP_ROUND as Custom to get it past this code. But now I can’t get it past the mutation code in SelectionDAGIsel because it’s not “Legal”. Scalar FADD on X86 is also marked Custom but most cases go through unmodified. STRICT_FADD is marked Expand currently. And only doesn’t get turned into a lib call because I don’t think there is STRICT_FADD libcall support yet. But that needs to be added to support strict ops on f128 for X86-64. The moment that happens then every other target that hasn’t implemented strict fp yet will generate a libcall for STRICT_FADD.

X86 has FP_ROUND marked Custom, but most type combinations are Legal. I had to mark STRICT_FP_ROUND as Custom to get it past this code. But now I can’t get it past the mutation code in SelectionDAGIsel because it’s not “Legal”.

Ah, so you mark STRICT_FP_ROUND Custom, but in the custom expander still leave it as STRICT_FP_ROUND, expecting it to be matched? I see. I believe this code in SelectionDAGISel:

if (Node->isStrictFPOpcode() &&
    (TLI->getOperationAction(Node->getOpcode(), Node->getValueType(0))
     != TargetLowering::Legal))

should really be:

if (Node->isStrictFPOpcode() &&
    (TLI->getOperationAction(Node->getOpcode(), Node->getValueType(0))
     == TargetLowering::Expand))

That should fix your problem. (Or else, once we get this patch committed, you could also set isStrictFPEnabled for your target, and the problem would also be gone.)

Scalar FADD on X86 is also marked Custom but most cases go through unmodified. STRICT_FADD is marked Expand currently. And only doesn’t get turned into a lib call because I don’t think there is STRICT_FADD libcall support yet. But that needs to be added to support strict ops on f128 for X86-64. The moment that happens then every other target that hasn’t implemented strict fp yet will generate a libcall for STRICT_FADD.

Right. But as I said, I personally would prefer this behavior: at least the compiler doesn't silently ignore strict semantics that it promised to implement ...

update comments and fix a bug

@uweigand Thank you, now I get clear of the logic there. I add some of your comment to the original comments in the code, I think it may helps us understand this code better. Or we still keep the previous comments?

Thanks again! As far as I'm concerned, this now looks good and I'd like to see it go in -- @craig.topper: given your comments above, do you still have any objections?

I'm fine with this. I changed the code in SelectionDAGIsel yesterday to use Expand. So this will need to be rebased.

OK, great. LGTM.

This revision is now accepted and ready to land.Nov 21 2019, 10:45 AM

Thanks for all of your help.

Closed by commit rG22a0edd070e4: [FPEnv] Add an option to disable strict float node mutating to an normal float… (authored by Pengfei Wang <pengfei.wang@intel.com>). · Explain WhyNov 21 2019, 6:08 PM

This revision was automatically updated to reflect the committed changes.

qiucf mentioned this in D87222: [PowerPC] [FPEnv] Disable strict FP mutation by default for PowerPC.Sep 6 2020, 10:09 PM

Revision Contents

Path

Size

llvm/

include/

llvm/

CodeGen/

TargetLowering.h

7 lines

lib/

CodeGen/

SelectionDAG/

LegalizeDAG.cpp

22 lines

LegalizeVectorOps.cpp

4 lines

SelectionDAGISel.cpp

2 lines

TargetLoweringBase.cpp

9 lines

Target/

SystemZ/

SystemZISelLowering.cpp

3 lines

Diff 230574

llvm/include/llvm/CodeGen/TargetLowering.h

Show First 20 Lines • Show All 225 Lines • ▼ Show 20 Lines	public:
}		}

/// NOTE: The TargetMachine owns TLOF.		/// NOTE: The TargetMachine owns TLOF.
explicit TargetLoweringBase(const TargetMachine &TM);		explicit TargetLoweringBase(const TargetMachine &TM);
TargetLoweringBase(const TargetLoweringBase &) = delete;		TargetLoweringBase(const TargetLoweringBase &) = delete;
TargetLoweringBase &operator=(const TargetLoweringBase &) = delete;		TargetLoweringBase &operator=(const TargetLoweringBase &) = delete;
virtual ~TargetLoweringBase() = default;		virtual ~TargetLoweringBase() = default;

		/// Return true if the target support strict float operation
		bool isStrictFPEnabled() const {
		return IsStrictFPEnabled;
		}

protected:		protected:
/// Initialize all of the actions to default values.		/// Initialize all of the actions to default values.
void initActions();		void initActions();

public:		public:
const TargetMachine &getTargetMachine() const { return TM; }		const TargetMachine &getTargetMachine() const { return TM; }

virtual bool useSoftFloat() const { return false; }		virtual bool useSoftFloat() const { return false; }
▲ Show 20 Lines • Show All 2,662 Lines • ▼ Show 20 Lines	protected:
/// details.		/// details.
MachineBasicBlock *emitXRayCustomEvent(MachineInstr &MI,		MachineBasicBlock *emitXRayCustomEvent(MachineInstr &MI,
MachineBasicBlock *MBB) const;		MachineBasicBlock *MBB) const;

/// Replace/modify the XRay typed event operands with target-dependent		/// Replace/modify the XRay typed event operands with target-dependent
/// details.		/// details.
MachineBasicBlock *emitXRayTypedEvent(MachineInstr &MI,		MachineBasicBlock *emitXRayTypedEvent(MachineInstr &MI,
MachineBasicBlock *MBB) const;		MachineBasicBlock *MBB) const;

		bool IsStrictFPEnabled;
};		};

/// This class defines information used to lower LLVM code to legal SelectionDAG		/// This class defines information used to lower LLVM code to legal SelectionDAG
/// operators that the target instruction selector can accept natively.		/// operators that the target instruction selector can accept natively.
///		///
/// This class also defines callbacks that targets must implement to lower		/// This class also defines callbacks that targets must implement to lower
/// target-specific constructs to SelectionDAG operators.		/// target-specific constructs to SelectionDAG operators.
class TargetLowering : public TargetLoweringBase {		class TargetLowering : public TargetLoweringBase {
▲ Show 20 Lines • Show All 1,345 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp

Show First 20 Lines • Show All 2,649 Lines • ▼ Show 20 Lines	bool SelectionDAGLegalize::ExpandNode(SDNode *Node) {
SDValue Tmp1, Tmp2, Tmp3, Tmp4;		SDValue Tmp1, Tmp2, Tmp3, Tmp4;
bool NeedInvert;		bool NeedInvert;
switch (Node->getOpcode()) {		switch (Node->getOpcode()) {
case ISD::ABS:		case ISD::ABS:
if (TLI.expandABS(Node, Tmp1, DAG))		if (TLI.expandABS(Node, Tmp1, DAG))
Results.push_back(Tmp1);		Results.push_back(Tmp1);
break;		break;
case ISD::CTPOP:		case ISD::CTPOP:
if (TLI.expandCTPOP(Node, Tmp1, DAG))		if (TLI.expandCTPOP(Node, Tmp1, DAG))
		uweigandUnsubmitted Not Done Reply Inline Actions I'm not sure this is always correct, I think it might be possible that a target might want to select Expand for a strict operation even if they use e.g. Custom for the non-strict version (obviously, that would have to be an operation where common code implements an Expand algorithm that respects the constrained FP semantics). More importantly, even if you do this, you still need to add the checks in STRICT_FP_ROUND and STRICT_FP_EXTEND I mentioned in my earlier comment: note that in those cases, even if the target uses Expand for both the strict and non-strict operation, the code below still cannot be used if isStrictFPEnabled is true (since it does not respect constrained FP semantics). uweigand: I'm not sure this is always correct, I think it might be possible that a target might want to…
		LiuChen3AuthorUnsubmitted Done Reply Inline Actions Thanks. I think I misunderstood what you meant before. You actually mean is if the backend has supported strict float, it can never expand STRICT_FP_ROUND and STRICT_FP_EXTEND operations. We don't expand it not because we setOperationAction wrong or something else, because it isn't 'strict float' at all. I'll only add judgment based on the previous patch and delete this. LiuChen3: Thanks. I think I misunderstood what you meant before. You actually mean is if the backend has…
		pengfeiUnsubmitted Not Done Reply Inline Actions In my opinion, the behavior currently is reasonable. I don't think there's a way in common code can handle an expand strict node if its non-strict node is custom. Otherwise, its non-strict node isn't necessarily to be custom. For STRICT_FP_ROUND and STRICT_FP_EXTEND, I reviewed the discussion between you and @kpn in D65226. I think it's equal to your change if the action of the target's non-strict nodes is legal. And for target that isStrictFPEnabled, the action of the strict nodes can not be set to expand if the expansion does not respect constrained FP semantics. pengfei: In my opinion, the behavior currently is reasonable. I don't think there's a way in common code…
Results.push_back(Tmp1);		Results.push_back(Tmp1);
break;		break;
case ISD::CTLZ:		case ISD::CTLZ:
case ISD::CTLZ_ZERO_UNDEF:		case ISD::CTLZ_ZERO_UNDEF:
if (TLI.expandCTLZ(Node, Tmp1, DAG))		if (TLI.expandCTLZ(Node, Tmp1, DAG))
Results.push_back(Tmp1);		Results.push_back(Tmp1);
break;		break;
case ISD::CTTZ:		case ISD::CTTZ:
▲ Show 20 Lines • Show All 133 Lines • ▼ Show 20 Lines	if (VT.isInteger())
Results.push_back(DAG.getConstant(0, dl, VT));		Results.push_back(DAG.getConstant(0, dl, VT));
else {		else {
assert(VT.isFloatingPoint() && "Unknown value type!");		assert(VT.isFloatingPoint() && "Unknown value type!");
Results.push_back(DAG.getConstantFP(0, dl, VT));		Results.push_back(DAG.getConstantFP(0, dl, VT));
}		}
break;		break;
}		}
case ISD::STRICT_FP_ROUND:		case ISD::STRICT_FP_ROUND:
// This expansion does not honor the "strict" properties anyway,		// When strict mode is enforced we can't do expansion because it
// so prefer falling back to the non-strict operation if legal.		// does not honor the "strict" properties. Only libcall is allowed.
		if (TLI.isStrictFPEnabled())
		break;
		// We might as well mutate to FP_ROUND when FP_ROUND operation is legal
		// since this operation is more efficient than stack operation.
		uweigandUnsubmitted Not Done Reply Inline Actions This comment is now duplicated; it would be better to change the comment above to something along the lines of what I suggested earlier, e.g. // This expansion does not honor the "strict" properties, // so we cannot use it if strict mode is enforced. uweigand: This comment is now duplicated; it would be better to change the comment above to something…
if (TLI.getStrictFPOperationAction(Node->getOpcode(),		if (TLI.getStrictFPOperationAction(Node->getOpcode(),
		uweigandUnsubmitted Not Done Reply Inline Actions Sorry, this is still not quite what I expected: now you've removed the second comment, which was actually correct (and necessary) ... My point is that the two "if" statements implement two very different things, the first is a correctness issue, the second is just a performance optimization. So we really ought to have two different comments explaining the two different purposes of those if statements, as I had in my original suggestion. In your first patch that I commented upon earlier, you had two comments, but both were talking about the performance optimization -- this is wrong for the first if, which is all about correctness. Now you fixed the comment before the first if to talk about correctness, but you removed the second comment completely, which gives the impression that the second if is also about correctness, which it is not ... uweigand: Sorry, this is still not quite what I expected: now you've removed the second comment, which…
		LiuChen3AuthorUnsubmitted Done Reply Inline Actions Thanks for your explanation. But why this is a performance optimization? I thought this conversion was just to allow the backend to make the correct instruction selection without supporting strict-float. The performance optimization means by the promotion of the legal instruction compared to the converting to statck operation? Or I misunderstand something? LiuChen3: Thanks for your explanation. But why this is a performance optimization? I thought this…
		uweigandUnsubmitted Not Done Reply Inline Actions OK, so when we get here, the back-end has asked common code to "Expand" the STRICT_FP_ROUND operation. Common code has three options to do so: Emit a libcall Replace it with a FP_ROUND -- only possible if FP_ROUND is "Legal" Replace it with a stack operation (truncating store followed by load) If we must enforce strict FP semantics, then only option 1) is allowed, since both options 2) and 3) do not respect that semantics. That is the correctness property that is enforced by the first "if". Now, if we do not have to enfore strict FP semantics, then either option 1), 2) or 3) would be allowed. So in case, we make the decision on the relative efficiency of those options, where we'd usually have 2) the fastest, followed by 3), and then 1) as the slowest. Since 2) is not always possible, we'd choose 2) when it is available, and 3) otherwise. This is what the second "if" achieves. Does this make it clearer? If you find some other wording for those comments that convey that explanation in a better way, feel free to update them :-) uweigand: OK, so when we get here, the back-end has asked common code to "Expand" the STRICT_FP_ROUND…
		craig.topperUnsubmitted Not Done Reply Inline Actions X86 has FP_ROUND marked Custom, but most type combinations are Legal. I had to mark STRICT_FP_ROUND as Custom to get it past this code. But now I can’t get it past the mutation code in SelectionDAGIsel because it’s not “Legal”. Scalar FADD on X86 is also marked Custom but most cases go through unmodified. STRICT_FADD is marked Expand currently. And only doesn’t get turned into a lib call because I don’t think there is STRICT_FADD libcall support yet. But that needs to be added to support strict ops on f128 for X86-64. The moment that happens then every other target that hasn’t implemented strict fp yet will generate a libcall for STRICT_FADD. craig.topper: X86 has FP_ROUND marked Custom, but most type combinations are Legal. I had to mark…
Node->getValueType(0))		Node->getValueType(0))
== TargetLowering::Legal)		== TargetLowering::Legal)
break;		break;
		// We fall back to use stack operation when the FP_ROUND operation
		// isn't available.
Tmp1 = EmitStackConvert(Node->getOperand(1),		Tmp1 = EmitStackConvert(Node->getOperand(1),
Node->getValueType(0),		Node->getValueType(0),
Node->getValueType(0), dl, Node->getOperand(0));		Node->getValueType(0), dl, Node->getOperand(0));
ReplaceNode(Node, Tmp1.getNode());		ReplaceNode(Node, Tmp1.getNode());
LLVM_DEBUG(dbgs() << "Successfully expanded STRICT_FP_ROUND node\n");		LLVM_DEBUG(dbgs() << "Successfully expanded STRICT_FP_ROUND node\n");
return true;		return true;
case ISD::FP_ROUND:		case ISD::FP_ROUND:
case ISD::BITCAST:		case ISD::BITCAST:
Tmp1 = EmitStackConvert(Node->getOperand(0),		Tmp1 = EmitStackConvert(Node->getOperand(0),
Node->getValueType(0),		Node->getValueType(0),
Node->getValueType(0), dl);		Node->getValueType(0), dl);
Results.push_back(Tmp1);		Results.push_back(Tmp1);
break;		break;
case ISD::STRICT_FP_EXTEND:		case ISD::STRICT_FP_EXTEND:
// This expansion does not honor the "strict" properties anyway,		// When strict mode is enforced we can't do expansion because it
// so prefer falling back to the non-strict operation if legal.		// does not honor the "strict" properties. Only libcall is allowed.
		if (TLI.isStrictFPEnabled())
		break;
		// We might as well mutate to FP_EXTEND when FP_EXTEND operation is legal
		// since this operation is more efficient than stack operation.
		uweigandUnsubmitted Not Done Reply Inline Actions See above. uweigand: See above.
if (TLI.getStrictFPOperationAction(Node->getOpcode(),		if (TLI.getStrictFPOperationAction(Node->getOpcode(),
Node->getValueType(0))		Node->getValueType(0))
== TargetLowering::Legal)		== TargetLowering::Legal)
break;		break;
		// We fall back to use stack operation when the FP_EXTEND operation
		// isn't available.
Tmp1 = EmitStackConvert(Node->getOperand(1),		Tmp1 = EmitStackConvert(Node->getOperand(1),
Node->getOperand(1).getValueType(),		Node->getOperand(1).getValueType(),
Node->getValueType(0), dl, Node->getOperand(0));		Node->getValueType(0), dl, Node->getOperand(0));
ReplaceNode(Node, Tmp1.getNode());		ReplaceNode(Node, Tmp1.getNode());
LLVM_DEBUG(dbgs() << "Successfully expanded STRICT_FP_EXTEND node\n");		LLVM_DEBUG(dbgs() << "Successfully expanded STRICT_FP_EXTEND node\n");
return true;		return true;
case ISD::FP_EXTEND:		case ISD::FP_EXTEND:
Tmp1 = EmitStackConvert(Node->getOperand(0),		Tmp1 = EmitStackConvert(Node->getOperand(0),
▲ Show 20 Lines • Show All 854 Lines • ▼ Show 20 Lines
case ISD::JumpTable:		case ISD::JumpTable:
case ISD::INTRINSIC_W_CHAIN:		case ISD::INTRINSIC_W_CHAIN:
case ISD::INTRINSIC_WO_CHAIN:		case ISD::INTRINSIC_WO_CHAIN:
case ISD::INTRINSIC_VOID:		case ISD::INTRINSIC_VOID:
// FIXME: Custom lowering for these operations shouldn't return null!		// FIXME: Custom lowering for these operations shouldn't return null!
break;		break;
}		}

if (Results.empty() && Node->isStrictFPOpcode()) {		if (!TLI.isStrictFPEnabled() && Results.empty() && Node->isStrictFPOpcode()) {
		pengfeiUnsubmitted Not Done Reply Inline Actions Is it better to use `!(TLI.isStrictFPEnabled() \|\| DisableStrictNodeMutation)` ? pengfei: Is it better to use `!(TLI.isStrictFPEnabled() \|\| DisableStrictNodeMutation)` ?
// FIXME: We were asked to expand a strict floating-point operation,		// FIXME: We were asked to expand a strict floating-point operation,
// but there is currently no expansion implemented that would preserve		// but there is currently no expansion implemented that would preserve
// the "strict" properties. For now, we just fall back to the non-strict		// the "strict" properties. For now, we just fall back to the non-strict
// version if that is legal on the target. The actual mutation of the		// version if that is legal on the target. The actual mutation of the
// operation will happen in SelectionDAGISel::DoInstructionSelection.		// operation will happen in SelectionDAGISel::DoInstructionSelection.
switch (Node->getOpcode()) {		switch (Node->getOpcode()) {
default:		default:
if (TLI.getStrictFPOperationAction(Node->getOpcode(),		if (TLI.getStrictFPOperationAction(Node->getOpcode(),
▲ Show 20 Lines • Show All 952 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp

Show First 20 Lines • Show All 313 Lines • ▼ Show 20 Lines	#define INSTRUCTION(NAME, NARG, ROUND_MODE, INTRINSIC, DAGN) \
case ISD::STRICT_##DAGN:		case ISD::STRICT_##DAGN:
#include "llvm/IR/ConstrainedOps.def"		#include "llvm/IR/ConstrainedOps.def"
Action = TLI.getOperationAction(Node->getOpcode(), Node->getValueType(0));		Action = TLI.getOperationAction(Node->getOpcode(), Node->getValueType(0));
// If we're asked to expand a strict vector floating-point operation,		// If we're asked to expand a strict vector floating-point operation,
// by default we're going to simply unroll it. That is usually the		// by default we're going to simply unroll it. That is usually the
// best approach, except in the case where the resulting strict (scalar)		// best approach, except in the case where the resulting strict (scalar)
// operations would themselves use the fallback mutation to non-strict.		// operations would themselves use the fallback mutation to non-strict.
// In that specific case, just do the fallback on the vector op.		// In that specific case, just do the fallback on the vector op.
if (Action == TargetLowering::Expand &&		if (Action == TargetLowering::Expand && !TLI.isStrictFPEnabled() &&
TLI.getStrictFPOperationAction(Node->getOpcode(),		TLI.getStrictFPOperationAction(Node->getOpcode(),
Node->getValueType(0))		Node->getValueType(0))
== TargetLowering::Legal) {		== TargetLowering::Legal) {
EVT EltVT = Node->getValueType(0).getVectorElementType();		EVT EltVT = Node->getValueType(0).getVectorElementType();
if (TLI.getOperationAction(Node->getOpcode(), EltVT)		if (TLI.getOperationAction(Node->getOpcode(), EltVT)
== TargetLowering::Expand &&		== TargetLowering::Expand &&
TLI.getStrictFPOperationAction(Node->getOpcode(), EltVT)		TLI.getStrictFPOperationAction(Node->getOpcode(), EltVT)
== TargetLowering::Legal)		== TargetLowering::Legal)
Action = TargetLowering::Legal;		Action = TargetLowering::Legal;
}		}
▲ Show 20 Lines • Show All 1,064 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp

	Show First 20 Lines • Show All 1,150 Lines • ▼ Show 20 Lines
	#endif			#endif

	// When we are using non-default rounding modes or FP exception behavior			// When we are using non-default rounding modes or FP exception behavior
	// FP operations are represented by StrictFP pseudo-operations. For			// FP operations are represented by StrictFP pseudo-operations. For
	// targets that do not (yet) understand strict FP operations directly,			// targets that do not (yet) understand strict FP operations directly,
	// we convert them to normal FP opcodes instead at this point. This			// we convert them to normal FP opcodes instead at this point. This
	// will allow them to be handled by existing target-specific instruction			// will allow them to be handled by existing target-specific instruction
	// selectors.			// selectors.
	if (Node->isStrictFPOpcode() &&			if (!TLI->isStrictFPEnabled() && Node->isStrictFPOpcode() &&
	(TLI->getOperationAction(Node->getOpcode(), Node->getValueType(0))			(TLI->getOperationAction(Node->getOpcode(), Node->getValueType(0))
				pengfeiUnsubmitted Not Done Reply Inline Actions Same as above. pengfei: Same as above.
	== TargetLowering::Expand))			== TargetLowering::Expand))
	Node = CurDAG->mutateStrictFPToFP(Node);			Node = CurDAG->mutateStrictFPToFP(Node);

	LLVM_DEBUG(dbgs() << "\nISEL: Starting selection on root node: ";			LLVM_DEBUG(dbgs() << "\nISEL: Starting selection on root node: ";
	Node->dump(CurDAG));			Node->dump(CurDAG));

	Select(Node);			Select(Node);
	}			}
	▲ Show 20 Lines • Show All 2,494 Lines • Show Last 20 Lines

llvm/lib/CodeGen/TargetLoweringBase.cpp

Show First 20 Lines • Show All 82 Lines • ▼ Show 20 Lines	JumpTableDensity("jump-table-density", cl::init(10), cl::Hidden,
"a normal function"));		"a normal function"));

/// Minimum jump table density for -Os or -Oz functions.		/// Minimum jump table density for -Os or -Oz functions.
static cl::opt<unsigned> OptsizeJumpTableDensity(		static cl::opt<unsigned> OptsizeJumpTableDensity(
"optsize-jump-table-density", cl::init(40), cl::Hidden,		"optsize-jump-table-density", cl::init(40), cl::Hidden,
cl::desc("Minimum density for building a jump table in "		cl::desc("Minimum density for building a jump table in "
"an optsize function"));		"an optsize function"));

		// FIXME: This option is only to test if the strict fp operation processed
		// correctly by preventing mutating strict fp operation to normal fp operation
		// during development. When the backend supports strict float operation, this
		// option will be meaningless.
		static cl::opt<bool> DisableStrictNodeMutation("disable-strictnode-mutation",
		cl::desc("Don't mutate strict-float node to a legalize node"),
		cl::init(false), cl::Hidden);

static bool darwinHasSinCos(const Triple &TT) {		static bool darwinHasSinCos(const Triple &TT) {
assert(TT.isOSDarwin() && "should be called with darwin triple");		assert(TT.isOSDarwin() && "should be called with darwin triple");
// Don't bother with 32 bit x86.		// Don't bother with 32 bit x86.
if (TT.getArch() == Triple::x86)		if (TT.getArch() == Triple::x86)
return false;		return false;
// Macos < 10.9 has no sincos_stret.		// Macos < 10.9 has no sincos_stret.
if (TT.isMacOSX())		if (TT.isMacOSX())
return !TT.isMacOSXVersionLT(10, 9) && TT.isArch64Bit();		return !TT.isMacOSXVersionLT(10, 9) && TT.isArch64Bit();
▲ Show 20 Lines • Show All 481 Lines • ▼ Show 20 Lines	TargetLoweringBase::TargetLoweringBase(const TargetMachine &tm) : TM(tm) {
PredictableSelectIsExpensive = false;		PredictableSelectIsExpensive = false;
EnableExtLdPromotion = false;		EnableExtLdPromotion = false;
StackPointerRegisterToSaveRestore = 0;		StackPointerRegisterToSaveRestore = 0;
BooleanContents = UndefinedBooleanContent;		BooleanContents = UndefinedBooleanContent;
BooleanFloatContents = UndefinedBooleanContent;		BooleanFloatContents = UndefinedBooleanContent;
BooleanVectorContents = UndefinedBooleanContent;		BooleanVectorContents = UndefinedBooleanContent;
SchedPreferenceInfo = Sched::ILP;		SchedPreferenceInfo = Sched::ILP;
GatherAllAliasesMaxDepth = 18;		GatherAllAliasesMaxDepth = 18;
		IsStrictFPEnabled = DisableStrictNodeMutation;
// TODO: the default will be switched to 0 in the next commit, along		// TODO: the default will be switched to 0 in the next commit, along
// with the Target-specific changes necessary.		// with the Target-specific changes necessary.
MaxAtomicSizeInBitsSupported = 1024;		MaxAtomicSizeInBitsSupported = 1024;

MinCmpXchgSizeInBits = 0;		MinCmpXchgSizeInBits = 0;
SupportsUnalignedAtomics = false;		SupportsUnalignedAtomics = false;

std::fill(std::begin(LibcallRoutineNames), std::end(LibcallRoutineNames), nullptr);		std::fill(std::begin(LibcallRoutineNames), std::end(LibcallRoutineNames), nullptr);
▲ Show 20 Lines • Show All 1,405 Lines • Show Last 20 Lines

llvm/lib/Target/SystemZ/SystemZISelLowering.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 628 Lines • ▼ Show 20 Lines	SystemZTargetLowering::SystemZTargetLowering(const TargetMachine &TM,

// The main memset sequence is a byte store followed by an MVC.		// The main memset sequence is a byte store followed by an MVC.
// Two STC or MV..I stores win over that, but the kind of fused stores		// Two STC or MV..I stores win over that, but the kind of fused stores
// generated by target-independent code don't when the byte value is		// generated by target-independent code don't when the byte value is
// variable. E.g. "STC <reg>;MHI <reg>,257;STH <reg>" is not better		// variable. E.g. "STC <reg>;MHI <reg>,257;STH <reg>" is not better
// than "STC;MVC". Handle the choice in target-specific code instead.		// than "STC;MVC". Handle the choice in target-specific code instead.
MaxStoresPerMemset = 0;		MaxStoresPerMemset = 0;
MaxStoresPerMemsetOptSize = 0;		MaxStoresPerMemsetOptSize = 0;

		// Default to having -disable-strictnode-mutation on
		IsStrictFPEnabled = true;
		pengfeiUnsubmitted Not Done Reply Inline Actions Extra blank. pengfei: Extra blank.
		LiuChen3AuthorUnsubmitted Done Reply Inline Actions Thanks for your review, there is no blank here original. LiuChen3: Thanks for your review, there is no blank here original.
}		}

EVT SystemZTargetLowering::getSetCCResultType(const DataLayout &DL,		EVT SystemZTargetLowering::getSetCCResultType(const DataLayout &DL,
LLVMContext &, EVT VT) const {		LLVMContext &, EVT VT) const {
if (!VT.isVector())		if (!VT.isVector())
return MVT::i32;		return MVT::i32;
return VT.changeVectorElementTypeToInteger();		return VT.changeVectorElementTypeToInteger();
}		}
▲ Show 20 Lines • Show All 7,192 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

Add an option to disable strict float node mutating to an normal float nodeClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 230574

llvm/include/llvm/CodeGen/TargetLowering.h

llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp

llvm/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp

llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp

llvm/lib/CodeGen/TargetLoweringBase.cpp

llvm/lib/Target/SystemZ/SystemZISelLowering.cpp

Add an option to disable strict float node mutating to an normal float node
ClosedPublic