This is an archive of the discontinued LLVM Phabricator instance.

[DAG] Update foldSelectWithIdentityConstant to use llvm::isNeutralConstant
ClosedPublic

Authored by RKSimon on Sep 30 2022, 7:41 AM.

Download Raw Diff

Details

Reviewers

fakepaper56
pengfei
spatel
craig.topper

Commits

rG61dc5014acf1: [DAG] Update foldSelectWithIdentityConstant to use llvm::isNeutralConstant

Summary

D133866 added the llvm::isNeutralConstant helper to track neutral/passthrough constants

This patch updates foldSelectWithIdentityConstant to use the helper instead of maintaining its own opcode handling

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

RKSimon created this revision.Sep 30 2022, 7:41 AM

Herald added a project: Restricted Project. · View Herald TranscriptSep 30 2022, 7:41 AM

Herald added subscribers: StephenFan, ecnelises, hiraditya. · View Herald Transcript

RKSimon requested review of this revision.Sep 30 2022, 7:41 AM

Herald added a project: Restricted Project. · View Herald TranscriptSep 30 2022, 7:41 AM

Harbormaster completed remote builds in B189695: Diff 464278.Sep 30 2022, 8:22 AM

It looks to me equals to the old code.

llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp
10753–10754	Are these two comments needed?

This revision is now accepted and ready to land.Sep 30 2022, 9:07 AM

RKSimon added inline comments.Sep 30 2022, 9:14 AM

llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp
10753–10754	The NSZ TODO can certainly go - I haven't come across any real world cases where target opcodes would be useful yet, but theoretically they could be, so I might keep that one. The getBinOpIdentity TODO should probably be a NOTE instead

This revision was landed with ongoing or failed builds.Sep 30 2022, 9:47 AM

Closed by commit rG61dc5014acf1: [DAG] Update foldSelectWithIdentityConstant to use llvm::isNeutralConstant (authored by RKSimon). · Explain Why

This revision was automatically updated to reflect the committed changes.

RKSimon added a commit: rG61dc5014acf1: [DAG] Update foldSelectWithIdentityConstant to use llvm::isNeutralConstant.

as a headsup: I see this change causes failures in Halide https://github.com/halide/Halide/blob/main/test/correctness/div_round_to_zero.cpp on some architectures

In D134966#3833518, @goncharov wrote:

as a headsup: I see this change causes failures in Halide https://github.com/halide/Halide/blob/main/test/correctness/div_round_to_zero.cpp on some architectures

Any chance you can post the IR for that code? My first guess is that we've hoisted a div/rem, and that's illegal for this transform because those are not speculatable ops. Looks like this would only happen with a 512-bit vector on an AVX512VL target if I'm seeing the target hook restriction correctly.

spatel mentioned this in rG17dcbd816547: [SDAG] don't hoist div/rem through a select with neutral constant.Oct 4 2022, 10:21 AM

Looks like the Halide failure is a SIGFPE on an idiv instruction... we are doing idiv %r15b with edx=0xff, eax=0x8382ff80, r15=0xffffffff... i.e., we are dividing by -1. The result won't fit into 32 bits, so we fail.

In D134966#3834797, @srj wrote:

Looks like the Halide failure is a SIGFPE on an idiv instruction... we are doing idiv %r15b with edx=0xff, eax=0x8382ff80, r15=0xffffffff... i.e., we are dividing by -1. The result won't fit into 32 bits, so we fail.

I'm guessing we had (sdiv %x, (select %c, %y, 1)) and turned it into (select %c, (sdiv %x, %y), %x). The original code may have been selecting a safe divisor. The new code is not. What happens if we remove SDIV and UDIV from isNeutralConstant?

Update: confirmed that https://reviews.llvm.org/rG17dcbd8165479d5b2d7f827bfcb271b50ee03872 appears to fix the injection in Halide

It seems this patch cause some performance regression on X86. I created D141782 to fix it.

Revision Contents

Path

Size

llvm/

lib/

CodeGen/

SelectionDAG/

DAGCombiner.cpp

35 lines

SelectionDAG.cpp

6 lines

Diff 464306

llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 2,196 Lines • ▼ Show 20 Lines	if (N1.getOpcode() != ISD::VSELECT \|\| !N1.hasOneUse())
return SDValue();		return SDValue();

unsigned Opcode = N->getOpcode();		unsigned Opcode = N->getOpcode();
EVT VT = N->getValueType(0);		EVT VT = N->getValueType(0);
SDValue Cond = N1.getOperand(0);		SDValue Cond = N1.getOperand(0);
SDValue TVal = N1.getOperand(1);		SDValue TVal = N1.getOperand(1);
SDValue FVal = N1.getOperand(2);		SDValue FVal = N1.getOperand(2);

// TODO: The cases should match with IR's ConstantExpr::getBinOpIdentity().
// TODO: Target-specific opcodes could be added. Ex: "isCommutativeBinOp()".
// TODO: With fast-math (NSZ), allow the opposite-sign form of zero?
auto isIdentityConstantForOpcode = [](unsigned Opcode, SDValue V) {
if (ConstantFPSDNode *C = isConstOrConstSplatFP(V)) {
switch (Opcode) {
case ISD::FADD: // X + -0.0 --> X
return C->isZero() && C->isNegative();
case ISD::FSUB: // X - 0.0 --> X
return C->isZero() && !C->isNegative();
case ISD::FMUL: // X * 1.0 --> X
case ISD::FDIV: // X / 1.0 --> X
return C->isExactlyValue(1.0);
}
}
if (ConstantSDNode *C = isConstOrConstSplat(V)) {
switch (Opcode) {
case ISD::ADD: // X + 0 --> X
case ISD::SUB: // X - 0 --> X
case ISD::SHL: // X << 0 --> X
case ISD::SRA: // X s>> 0 --> X
case ISD::SRL: // X u>> 0 --> X
return C->isZero();
case ISD::MUL: // X * 1 --> X
return C->isOne();
}
}
return false;
};

// This transform increases uses of N0, so freeze it to be safe.		// This transform increases uses of N0, so freeze it to be safe.
// binop N0, (vselect Cond, IDC, FVal) --> vselect Cond, N0, (binop N0, FVal)		// binop N0, (vselect Cond, IDC, FVal) --> vselect Cond, N0, (binop N0, FVal)
if (isIdentityConstantForOpcode(Opcode, TVal)) {		unsigned OpNo = ShouldCommuteOperands ? 0 : 1;
		if (isNeutralConstant(Opcode, N->getFlags(), TVal, OpNo)) {
SDValue F0 = DAG.getFreeze(N0);		SDValue F0 = DAG.getFreeze(N0);
SDValue NewBO = DAG.getNode(Opcode, SDLoc(N), VT, F0, FVal, N->getFlags());		SDValue NewBO = DAG.getNode(Opcode, SDLoc(N), VT, F0, FVal, N->getFlags());
return DAG.getSelect(SDLoc(N), VT, Cond, F0, NewBO);		return DAG.getSelect(SDLoc(N), VT, Cond, F0, NewBO);
}		}
// binop N0, (vselect Cond, TVal, IDC) --> vselect Cond, (binop N0, TVal), N0		// binop N0, (vselect Cond, TVal, IDC) --> vselect Cond, (binop N0, TVal), N0
if (isIdentityConstantForOpcode(Opcode, FVal)) {		if (isNeutralConstant(Opcode, N->getFlags(), FVal, OpNo)) {
SDValue F0 = DAG.getFreeze(N0);		SDValue F0 = DAG.getFreeze(N0);
SDValue NewBO = DAG.getNode(Opcode, SDLoc(N), VT, F0, TVal, N->getFlags());		SDValue NewBO = DAG.getNode(Opcode, SDLoc(N), VT, F0, TVal, N->getFlags());
return DAG.getSelect(SDLoc(N), VT, Cond, NewBO, F0);		return DAG.getSelect(SDLoc(N), VT, Cond, NewBO, F0);
}		}

return SDValue();		return SDValue();
}		}

▲ Show 20 Lines • Show All 23,075 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

	Show First 20 Lines • Show All 10,743 Lines • ▼ Show 20 Lines

	bool llvm::isMinSignedConstant(SDValue V) {			bool llvm::isMinSignedConstant(SDValue V) {
	ConstantSDNode *Const = dyn_cast<ConstantSDNode>(V);			ConstantSDNode *Const = dyn_cast<ConstantSDNode>(V);
	return Const != nullptr && Const->isMinSignedValue();			return Const != nullptr && Const->isMinSignedValue();
	}			}

	bool llvm::isNeutralConstant(unsigned Opcode, SDNodeFlags Flags, SDValue V,			bool llvm::isNeutralConstant(unsigned Opcode, SDNodeFlags Flags, SDValue V,
	unsigned OperandNo) {			unsigned OperandNo) {
	if (auto *Const = dyn_cast<ConstantSDNode>(V)) {			// NOTE: The cases should match with IR's ConstantExpr::getBinOpIdentity().
				// TODO: Target-specific opcodes could be added.
				if (auto *Const = isConstOrConstSplat(V)) {
				pengfeiUnsubmitted Not Done Reply Inline Actions Are these two comments needed? pengfei: Are these two comments needed?
				RKSimonAuthorUnsubmitted Not Done Reply Inline Actions The NSZ TODO can certainly go - I haven't come across any real world cases where target opcodes would be useful yet, but theoretically they could be, so I might keep that one. The getBinOpIdentity TODO should probably be a NOTE instead RKSimon: The NSZ TODO can certainly go - I haven't come across any real world cases where target opcodes…
	switch (Opcode) {			switch (Opcode) {
	case ISD::ADD:			case ISD::ADD:
	case ISD::OR:			case ISD::OR:
	case ISD::XOR:			case ISD::XOR:
	case ISD::UMAX:			case ISD::UMAX:
	return Const->isZero();			return Const->isZero();
	case ISD::MUL:			case ISD::MUL:
	return Const->isOne();			return Const->isOne();
	case ISD::AND:			case ISD::AND:
	case ISD::UMIN:			case ISD::UMIN:
	return Const->isAllOnes();			return Const->isAllOnes();
	case ISD::SMAX:			case ISD::SMAX:
	return Const->isMinSignedValue();			return Const->isMinSignedValue();
	case ISD::SMIN:			case ISD::SMIN:
	return Const->isMaxSignedValue();			return Const->isMaxSignedValue();
	case ISD::SUB:			case ISD::SUB:
	case ISD::SHL:			case ISD::SHL:
	case ISD::SRA:			case ISD::SRA:
	case ISD::SRL:			case ISD::SRL:
	return OperandNo == 1 && Const->isZero();			return OperandNo == 1 && Const->isZero();
	case ISD::UDIV:			case ISD::UDIV:
	case ISD::SDIV:			case ISD::SDIV:
	return OperandNo == 1 && Const->isOne();			return OperandNo == 1 && Const->isOne();
	}			}
	} else if (auto *ConstFP = dyn_cast<ConstantFPSDNode>(V)) {			} else if (auto *ConstFP = isConstOrConstSplatFP(V)) {
	switch (Opcode) {			switch (Opcode) {
	case ISD::FADD:			case ISD::FADD:
	return ConstFP->isZero() &&			return ConstFP->isZero() &&
	(Flags.hasNoSignedZeros() \|\| ConstFP->isNegative());			(Flags.hasNoSignedZeros() \|\| ConstFP->isNegative());
	case ISD::FSUB:			case ISD::FSUB:
	return OperandNo == 1 && ConstFP->isZero() &&			return OperandNo == 1 && ConstFP->isZero() &&
	(Flags.hasNoSignedZeros() \|\| !ConstFP->isNegative());			(Flags.hasNoSignedZeros() \|\| !ConstFP->isNegative());
	case ISD::FMUL:			case ISD::FMUL:
	▲ Show 20 Lines • Show All 1,288 Lines • Show Last 20 Lines