This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/
-
llvm/
-
CodeGen/
-
ISDOpcodes.h
-
SelectionDAG.h
-
SelectionDAGNodes.h
-
Target/
-
TargetSelectionDAG.td
-
lib/
-
CodeGen/SelectionDAG/
-
SelectionDAG/
-
DAGCombiner.cpp
4/5
SelectionDAG.cpp
1/2
SelectionDAGBuilder.cpp
-
SelectionDAGDumper.cpp
-
SelectionDAGISel.cpp
-
Target/AMDGPU/
-
AMDGPU/
3/6
AMDGPUISelDAGToDAG.cpp
-
test/CodeGen/AMDGPU/
-
CodeGen/
-
AMDGPU/
-
store-weird-sizes.ll

Differential D81711

[SDAG] Add new AssertAlign ISD node.
ClosedPublic

Authored by hliao on Jun 11 2020, 8:28 PM.

Download Raw Diff

Details

Reviewers

arsenm
bogner
rampitec

Commits

rGb1360caa823d: [SDAG] Add new AssertAlign ISD node.

Summary

AssertAlign node records the guaranteed alignment on its source node, where these alignments are retrieved from alignment attributes in LLVM IR. These tracked alignments could help DAG combining and lowering generating efficient code.
In this patch, the basic support of AssertAlign node is added. So far, we only generate AssertAlign nodes on return values from intrinsic calls.
Addressing selection in AMDGPU is revised accordingly to capture the new (base + offset) patterns.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

hliao created this revision.Jun 11 2020, 8:28 PM

Herald added a project: Restricted Project. · View Herald TranscriptJun 11 2020, 8:28 PM

Herald added subscribers: llvm-commits, kerbowa, hiraditya and 4 others. · View Herald Transcript

Harbormaster failed remote builds in B60067: Diff 270295!Jun 11 2020, 9:25 PM

craig.topper added a subscriber: craig.topper.Jun 11 2020, 11:24 PM

craig.topper added inline comments.

llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp
3175	Known.One should already be all 0s. I don't think you need to clear it. Though maybe you should call computeKnownBits to propagate from the input? Then you would need to clear it.

hliao marked an inline comment as done.Jun 12 2020, 8:23 AM

hliao added inline comments.

llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp
3175	`AssertAlign` is intended to be used with `align` function attributes, which could be used to annotate alignment on return value or arguments. Most of them are lowered into register copies. It's difficult and almost no way to trace back the original alignment embedded in LLVM IR. `AssertAlign` is added to propagate the alignment info from IR into DAG. It helps `computeKnowBits` instead of relying on it.

hliao added a reviewer: rampitec.Jun 12 2020, 12:28 PM

Rebase to trunk.

Harbormaster failed remote builds in B60200: Diff 270562!Jun 12 2020, 9:33 PM

arsenm added inline comments.Jun 15 2020, 5:04 AM

llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp
5201–5204	This check is unnecessary, a lower alignment isn't even representable in Align because it stores the log2 of the alignment
5201–5204	Nevermind, I misread this. I think it would be clearer as if (A == Align(1)) return Val
llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
138	Not sure we really need this?

hliao marked an inline comment as done.Jun 15 2020, 7:46 AM

hliao added inline comments.

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
138	That's a hidden option disabling `AssertAlign` inserting to make the debug of regressions easier.

ping

arsenm added inline comments.Jun 17 2020, 5:17 PM

llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp
5201–5204	I think == Align(1) would be clearer
llvm/lib/Target/AMDGPU/AMDGPUISelDAGToDAG.cpp
1635	Can you add a dag style comment for the pattern this matches? This could also use some early returns

Revise following reviewer's comments.

hliao marked 4 inline comments as done.Jun 18 2020, 11:34 AM

LGTM minus the random formatting changes

llvm/lib/Target/AMDGPU/AMDGPUISelDAGToDAG.cpp
1723–1737	Lots of unrelated formatting changes?

Harbormaster completed remote builds in B60870: Diff 271797.Jun 18 2020, 1:07 PM

hliao marked an inline comment as done.Jun 18 2020, 2:02 PM

hliao added inline comments.

llvm/lib/Target/AMDGPU/AMDGPUISelDAGToDAG.cpp
1723–1737	That's due to the extra indent added from L1680. Also, the lint progress in arc review tries to re-formatting all changed code.

arsenm added inline comments.Jun 18 2020, 2:16 PM

llvm/lib/Target/AMDGPU/AMDGPUISelDAGToDAG.cpp
1723–1737	Could switch to early return and avoid it?

hliao marked an inline comment as done.Jun 18 2020, 5:36 PM

hliao added inline comments.

llvm/lib/Target/AMDGPU/AMDGPUISelDAGToDAG.cpp
1723–1737	That code just conditionally refine `Addr` and, eventually, all paths need to join L1760 to prepare all the return values. Unless `goto` is used or code duplication, early return cannot be used here.

PING

ping for code review

arsenm accepted this revision.Jun 22 2020, 12:46 PM

arsenm added inline comments.

llvm/lib/Target/AMDGPU/AMDGPUISelDAGToDAG.cpp
1723–1737	Duplicating the trivial case isn't a big deal

This revision is now accepted and ready to land.Jun 22 2020, 12:46 PM

Closed by commit rGb1360caa823d: [SDAG] Add new AssertAlign ISD node. (authored by hliao). · Explain WhyJun 22 2020, 10:01 PM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

llvm/

include/

llvm/

CodeGen/

ISDOpcodes.h

1 line

SelectionDAG.h

3 lines

SelectionDAGNodes.h

16 lines

Target/

TargetSelectionDAG.td

7 lines

lib/

CodeGen/

SelectionDAG/

DAGCombiner.cpp

41 lines

SelectionDAG.cpp

37 lines

SelectionDAGBuilder.cpp

14 lines

SelectionDAGDumper.cpp

1 line

SelectionDAGISel.cpp

1 line

Target/

AMDGPU/

AMDGPUISelDAGToDAG.cpp

192 lines

test/

CodeGen/

AMDGPU/

store-weird-sizes.ll

32 lines

Diff 272603

llvm/include/llvm/CodeGen/ISDOpcodes.h

Show First 20 Lines • Show All 52 Lines • ▼ Show 20 Lines	enum NodeType {

/// AssertSext, AssertZext - These nodes record if a register contains a		/// AssertSext, AssertZext - These nodes record if a register contains a
/// value that has already been zero or sign extended from a narrower type.		/// value that has already been zero or sign extended from a narrower type.
/// These nodes take two operands. The first is the node that has already		/// These nodes take two operands. The first is the node that has already
/// been extended, and the second is a value type node indicating the width		/// been extended, and the second is a value type node indicating the width
/// of the extension		/// of the extension
AssertSext,		AssertSext,
AssertZext,		AssertZext,
		AssertAlign,

/// Various leaf nodes.		/// Various leaf nodes.
BasicBlock,		BasicBlock,
VALUETYPE,		VALUETYPE,
CONDCODE,		CONDCODE,
Register,		Register,
RegisterMask,		RegisterMask,
Constant,		Constant,
▲ Show 20 Lines • Show All 1,222 Lines • Show Last 20 Lines

llvm/include/llvm/CodeGen/SelectionDAG.h

Show First 20 Lines • Show All 1,336 Lines • ▼ Show 20 Lines	#endif

/// Return an AddrSpaceCastSDNode.		/// Return an AddrSpaceCastSDNode.
SDValue getAddrSpaceCast(const SDLoc &dl, EVT VT, SDValue Ptr, unsigned SrcAS,		SDValue getAddrSpaceCast(const SDLoc &dl, EVT VT, SDValue Ptr, unsigned SrcAS,
unsigned DestAS);		unsigned DestAS);

/// Return a freeze using the SDLoc of the value operand.		/// Return a freeze using the SDLoc of the value operand.
SDValue getFreeze(SDValue V);		SDValue getFreeze(SDValue V);

		/// Return an AssertAlignSDNode.
		SDValue getAssertAlign(const SDLoc &DL, SDValue V, Align A);

/// Return the specified value casted to		/// Return the specified value casted to
/// the target's desired shift amount type.		/// the target's desired shift amount type.
SDValue getShiftAmountOperand(EVT LHSTy, SDValue Op);		SDValue getShiftAmountOperand(EVT LHSTy, SDValue Op);

/// Expand the specified \c ISD::VAARG node as the Legalize pass would.		/// Expand the specified \c ISD::VAARG node as the Legalize pass would.
SDValue expandVAArg(SDNode *Node);		SDValue expandVAArg(SDNode *Node);

/// Expand the specified \c ISD::VACOPY node as the Legalize pass would.		/// Expand the specified \c ISD::VACOPY node as the Legalize pass would.
▲ Show 20 Lines • Show All 661 Lines • Show Last 20 Lines

llvm/include/llvm/CodeGen/SelectionDAGNodes.h

Show First 20 Lines • Show All 2,520 Lines • ▼ Show 20 Lines	void clearMemRefs() {
NumMemRefs = 0;		NumMemRefs = 0;
}		}

static bool classof(const SDNode *N) {		static bool classof(const SDNode *N) {
return N->isMachineOpcode();		return N->isMachineOpcode();
}		}
};		};

		/// An SDNode that records if a register contains a value that is guaranteed to
		/// be aligned accordingly.
		class AssertAlignSDNode : public SDNode {
		Align Alignment;

		public:
		AssertAlignSDNode(unsigned Order, const DebugLoc &DL, EVT VT, Align A)
		: SDNode(ISD::AssertAlign, Order, DL, getSDVTList(VT)), Alignment(A) {}

		Align getAlign() const { return Alignment; }

		static bool classof(const SDNode *N) {
		return N->getOpcode() == ISD::AssertAlign;
		}
		};

class SDNodeIterator : public std::iterator<std::forward_iterator_tag,		class SDNodeIterator : public std::iterator<std::forward_iterator_tag,
SDNode, ptrdiff_t> {		SDNode, ptrdiff_t> {
const SDNode *Node;		const SDNode *Node;
unsigned Operand;		unsigned Operand;

SDNodeIterator(const SDNode *N, unsigned Op) : Node(N), Operand(Op) {}		SDNodeIterator(const SDNode *N, unsigned Op) : Node(N), Operand(Op) {}

public:		public:
▲ Show 20 Lines • Show All 151 Lines • Show Last 20 Lines

llvm/include/llvm/Target/TargetSelectionDAG.td

Show First 20 Lines • Show All 661 Lines • ▼ Show 20 Lines	def intrinsic_void : SDNode<"ISD::INTRINSIC_VOID",
SDTypeProfile<0, -1, [SDTCisPtrTy<0>]>,		SDTypeProfile<0, -1, [SDTCisPtrTy<0>]>,
[SDNPHasChain]>;		[SDNPHasChain]>;
def intrinsic_w_chain : SDNode<"ISD::INTRINSIC_W_CHAIN",		def intrinsic_w_chain : SDNode<"ISD::INTRINSIC_W_CHAIN",
SDTypeProfile<1, -1, [SDTCisPtrTy<1>]>,		SDTypeProfile<1, -1, [SDTCisPtrTy<1>]>,
[SDNPHasChain]>;		[SDNPHasChain]>;
def intrinsic_wo_chain : SDNode<"ISD::INTRINSIC_WO_CHAIN",		def intrinsic_wo_chain : SDNode<"ISD::INTRINSIC_WO_CHAIN",
SDTypeProfile<1, -1, [SDTCisPtrTy<1>]>, []>;		SDTypeProfile<1, -1, [SDTCisPtrTy<1>]>, []>;

def SDT_assertext : SDTypeProfile<1, 1,		def SDT_assert : SDTypeProfile<1, 1,
[SDTCisInt<0>, SDTCisInt<1>, SDTCisSameAs<1, 0>]>;		[SDTCisInt<0>, SDTCisInt<1>, SDTCisSameAs<1, 0>]>;
def assertsext : SDNode<"ISD::AssertSext", SDT_assertext>;		def assertsext : SDNode<"ISD::AssertSext", SDT_assert>;
def assertzext : SDNode<"ISD::AssertZext", SDT_assertext>;		def assertzext : SDNode<"ISD::AssertZext", SDT_assert>;
		def assertalign : SDNode<"ISD::AssertAlign", SDT_assert>;


//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Selection DAG Condition Codes		// Selection DAG Condition Codes

class CondCode<string fcmpName = "", string icmpName = ""> {		class CondCode<string fcmpName = "", string icmpName = ""> {
string ICmpPredicate = icmpName;		string ICmpPredicate = icmpName;
string FCmpPredicate = fcmpName;		string FCmpPredicate = fcmpName;
▲ Show 20 Lines • Show All 935 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 449 Lines • ▼ Show 20 Lines	private:
SDValue visitVSELECT(SDNode *N);		SDValue visitVSELECT(SDNode *N);
SDValue visitSELECT_CC(SDNode *N);		SDValue visitSELECT_CC(SDNode *N);
SDValue visitSETCC(SDNode *N);		SDValue visitSETCC(SDNode *N);
SDValue visitSETCCCARRY(SDNode *N);		SDValue visitSETCCCARRY(SDNode *N);
SDValue visitSIGN_EXTEND(SDNode *N);		SDValue visitSIGN_EXTEND(SDNode *N);
SDValue visitZERO_EXTEND(SDNode *N);		SDValue visitZERO_EXTEND(SDNode *N);
SDValue visitANY_EXTEND(SDNode *N);		SDValue visitANY_EXTEND(SDNode *N);
SDValue visitAssertExt(SDNode *N);		SDValue visitAssertExt(SDNode *N);
		SDValue visitAssertAlign(SDNode *N);
SDValue visitSIGN_EXTEND_INREG(SDNode *N);		SDValue visitSIGN_EXTEND_INREG(SDNode *N);
SDValue visitSIGN_EXTEND_VECTOR_INREG(SDNode *N);		SDValue visitSIGN_EXTEND_VECTOR_INREG(SDNode *N);
SDValue visitZERO_EXTEND_VECTOR_INREG(SDNode *N);		SDValue visitZERO_EXTEND_VECTOR_INREG(SDNode *N);
SDValue visitTRUNCATE(SDNode *N);		SDValue visitTRUNCATE(SDNode *N);
SDValue visitBITCAST(SDNode *N);		SDValue visitBITCAST(SDNode *N);
SDValue visitFREEZE(SDNode *N);		SDValue visitFREEZE(SDNode *N);
SDValue visitBUILD_PAIR(SDNode *N);		SDValue visitBUILD_PAIR(SDNode *N);
SDValue visitFADD(SDNode *N);		SDValue visitFADD(SDNode *N);
▲ Show 20 Lines • Show All 1,130 Lines • ▼ Show 20 Lines	SDValue DAGCombiner::visit(SDNode *N) {
case ISD::SELECT_CC: return visitSELECT_CC(N);		case ISD::SELECT_CC: return visitSELECT_CC(N);
case ISD::SETCC: return visitSETCC(N);		case ISD::SETCC: return visitSETCC(N);
case ISD::SETCCCARRY: return visitSETCCCARRY(N);		case ISD::SETCCCARRY: return visitSETCCCARRY(N);
case ISD::SIGN_EXTEND: return visitSIGN_EXTEND(N);		case ISD::SIGN_EXTEND: return visitSIGN_EXTEND(N);
case ISD::ZERO_EXTEND: return visitZERO_EXTEND(N);		case ISD::ZERO_EXTEND: return visitZERO_EXTEND(N);
case ISD::ANY_EXTEND: return visitANY_EXTEND(N);		case ISD::ANY_EXTEND: return visitANY_EXTEND(N);
case ISD::AssertSext:		case ISD::AssertSext:
case ISD::AssertZext: return visitAssertExt(N);		case ISD::AssertZext: return visitAssertExt(N);
		case ISD::AssertAlign: return visitAssertAlign(N);
case ISD::SIGN_EXTEND_INREG: return visitSIGN_EXTEND_INREG(N);		case ISD::SIGN_EXTEND_INREG: return visitSIGN_EXTEND_INREG(N);
case ISD::SIGN_EXTEND_VECTOR_INREG: return visitSIGN_EXTEND_VECTOR_INREG(N);		case ISD::SIGN_EXTEND_VECTOR_INREG: return visitSIGN_EXTEND_VECTOR_INREG(N);
case ISD::ZERO_EXTEND_VECTOR_INREG: return visitZERO_EXTEND_VECTOR_INREG(N);		case ISD::ZERO_EXTEND_VECTOR_INREG: return visitZERO_EXTEND_VECTOR_INREG(N);
case ISD::TRUNCATE: return visitTRUNCATE(N);		case ISD::TRUNCATE: return visitTRUNCATE(N);
case ISD::BITCAST: return visitBITCAST(N);		case ISD::BITCAST: return visitBITCAST(N);
case ISD::BUILD_PAIR: return visitBUILD_PAIR(N);		case ISD::BUILD_PAIR: return visitBUILD_PAIR(N);
case ISD::FADD: return visitFADD(N);		case ISD::FADD: return visitFADD(N);
case ISD::FSUB: return visitFSUB(N);		case ISD::FSUB: return visitFSUB(N);
▲ Show 20 Lines • Show All 9,079 Lines • ▼ Show 20 Lines	if (AssertVT.bitsLT(BigA_AssertVT)) {
BigA.getOperand(0), N1);		BigA.getOperand(0), N1);
return DAG.getNode(ISD::TRUNCATE, DL, N->getValueType(0), NewAssert);		return DAG.getNode(ISD::TRUNCATE, DL, N->getValueType(0), NewAssert);
}		}
}		}

return SDValue();		return SDValue();
}		}

		SDValue DAGCombiner::visitAssertAlign(SDNode *N) {
		SDLoc DL(N);

		Align AL = cast<AssertAlignSDNode>(N)->getAlign();
		SDValue N0 = N->getOperand(0);

		// Fold (assertalign (assertalign x, AL0), AL1) ->
		// (assertalign x, max(AL0, AL1))
		if (auto *AAN = dyn_cast<AssertAlignSDNode>(N0))
		return DAG.getAssertAlign(DL, N0.getOperand(0),
		std::max(AL, AAN->getAlign()));

		// In rare cases, there are trivial arithmetic ops in source operands. Sink
		// this assert down to source operands so that those arithmetic ops could be
		// exposed to the DAG combining.
		switch (N0.getOpcode()) {
		default:
		break;
		case ISD::ADD:
		case ISD::SUB: {
		unsigned AlignShift = Log2(AL);
		SDValue LHS = N0.getOperand(0);
		SDValue RHS = N0.getOperand(1);
		unsigned LHSAlignShift = DAG.computeKnownBits(LHS).countMinTrailingZeros();
		unsigned RHSAlignShift = DAG.computeKnownBits(RHS).countMinTrailingZeros();
		if (LHSAlignShift >= AlignShift \|\| RHSAlignShift >= AlignShift) {
		if (LHSAlignShift < AlignShift)
		LHS = DAG.getAssertAlign(DL, LHS, AL);
		if (RHSAlignShift < AlignShift)
		RHS = DAG.getAssertAlign(DL, RHS, AL);
		return DAG.getNode(N0.getOpcode(), DL, N0.getValueType(), LHS, RHS);
		}
		break;
		}
		}

		return SDValue();
		}

/// If the result of a wider load is shifted to right of N bits and then		/// If the result of a wider load is shifted to right of N bits and then
/// truncated to a narrower type and where N is a multiple of number of bits of		/// truncated to a narrower type and where N is a multiple of number of bits of
/// the narrower type, transform it to a narrower load from address + N / num of		/// the narrower type, transform it to a narrower load from address + N / num of
/// bits of new type. Also narrow the load if the result is masked with an AND		/// bits of new type. Also narrow the load if the result is masked with an AND
/// to effectively produce a smaller type. If the result is to be extended, also		/// to effectively produce a smaller type. If the result is to be extended, also
/// fold the extension to form a extending load.		/// fold the extension to form a extending load.
SDValue DAGCombiner::ReduceLoadWidth(SDNode *N) {		SDValue DAGCombiner::ReduceLoadWidth(SDNode *N) {
unsigned Opc = N->getOpcode();		unsigned Opc = N->getOpcode();
▲ Show 20 Lines • Show All 11,244 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 3,160 Lines • ▼ Show 20 Lines	KnownBits SelectionDAG::computeKnownBits(SDValue Op, const APInt &DemandedElts,
case ISD::AssertZext: {		case ISD::AssertZext: {
EVT VT = cast<VTSDNode>(Op.getOperand(1))->getVT();		EVT VT = cast<VTSDNode>(Op.getOperand(1))->getVT();
APInt InMask = APInt::getLowBitsSet(BitWidth, VT.getSizeInBits());		APInt InMask = APInt::getLowBitsSet(BitWidth, VT.getSizeInBits());
Known = computeKnownBits(Op.getOperand(0), Depth+1);		Known = computeKnownBits(Op.getOperand(0), Depth+1);
Known.Zero \|= (~InMask);		Known.Zero \|= (~InMask);
Known.One &= (~Known.Zero);		Known.One &= (~Known.Zero);
break;		break;
}		}
		case ISD::AssertAlign: {
		unsigned LogOfAlign = Log2(cast<AssertAlignSDNode>(Op)->getAlign());
		assert(LogOfAlign != 0);
		// If a node is guaranteed to be aligned, set low zero bits accordingly as
		// well as clearing one bits.
		Known.Zero.setLowBits(LogOfAlign);
		Known.One.clearLowBits(LogOfAlign);
		craig.topperUnsubmitted Not Done Reply Inline Actions Known.One should already be all 0s. I don't think you need to clear it. Though maybe you should call computeKnownBits to propagate from the input? Then you would need to clear it. craig.topper: Known.One should already be all 0s. I don't think you need to clear it. Though maybe you…
		hliaoAuthorUnsubmitted Done Reply Inline Actions `AssertAlign` is intended to be used with `align` function attributes, which could be used to annotate alignment on return value or arguments. Most of them are lowered into register copies. It's difficult and almost no way to trace back the original alignment embedded in LLVM IR. `AssertAlign` is added to propagate the alignment info from IR into DAG. It helps `computeKnowBits` instead of relying on it. hliao: `AssertAlign` is intended to be used with `align` function attributes, which could be used to…
		break;
		}
case ISD::FGETSIGN:		case ISD::FGETSIGN:
// All bits are zero except the low bit.		// All bits are zero except the low bit.
Known.Zero.setBitsFrom(1);		Known.Zero.setBitsFrom(1);
break;		break;
case ISD::USUBO:		case ISD::USUBO:
case ISD::SSUBO:		case ISD::SSUBO:
if (Op.getResNo() == 1) {		if (Op.getResNo() == 1) {
// If we know the result of a setcc has the top bits zero, use this info.		// If we know the result of a setcc has the top bits zero, use this info.
▲ Show 20 Lines • Show All 2,004 Lines • ▼ Show 20 Lines	case ISD::FREM:
if (N1.isUndef() && N2.isUndef())		if (N1.isUndef() && N2.isUndef())
return getUNDEF(VT);		return getUNDEF(VT);
if (N1.isUndef() \|\| N2.isUndef())		if (N1.isUndef() \|\| N2.isUndef())
return getConstantFP(APFloat::getNaN(EVTToAPFloatSemantics(VT)), DL, VT);		return getConstantFP(APFloat::getNaN(EVTToAPFloatSemantics(VT)), DL, VT);
}		}
return SDValue();		return SDValue();
}		}

		SDValue SelectionDAG::getAssertAlign(const SDLoc &DL, SDValue Val, Align A) {
		assert(Val.getValueType().isInteger() && "Invalid AssertAlign!");

		// There's no need to assert on a byte-aligned pointer. All pointers are at
		// least byte aligned.
		if (A == Align(1))
		return Val;
		arsenmUnsubmitted Done Reply Inline Actions This check is unnecessary, a lower alignment isn't even representable in Align because it stores the log2 of the alignment arsenm: This check is unnecessary, a lower alignment isn't even representable in Align because it…
		arsenmUnsubmitted Done Reply Inline Actions Nevermind, I misread this. I think it would be clearer as if (A == Align(1)) return Val arsenm: Nevermind, I misread this. I think it would be clearer as if (A == Align(1)) return Val
		arsenmUnsubmitted Done Reply Inline Actions I think == Align(1) would be clearer arsenm: I think == Align(1) would be clearer

		FoldingSetNodeID ID;
		AddNodeIDNode(ID, ISD::AssertAlign, getVTList(Val.getValueType()), {Val});
		ID.AddInteger(A.value());

		void *IP = nullptr;
		if (SDNode *E = FindNodeOrInsertPos(ID, DL, IP))
		return SDValue(E, 0);

		auto *N = newSDNode<AssertAlignSDNode>(DL.getIROrder(), DL.getDebugLoc(),
		Val.getValueType(), A);
		createOperands(N, {Val});

		CSEMap.InsertNode(N, IP);
		InsertNode(N);

		SDValue V(N, 0);
		NewSDValueDbgMsg(V, "Creating new node: ", this);
		return V;
		}

SDValue SelectionDAG::getNode(unsigned Opcode, const SDLoc &DL, EVT VT,		SDValue SelectionDAG::getNode(unsigned Opcode, const SDLoc &DL, EVT VT,
SDValue N1, SDValue N2, const SDNodeFlags Flags) {		SDValue N1, SDValue N2, const SDNodeFlags Flags) {
ConstantSDNode *N1C = dyn_cast<ConstantSDNode>(N1);		ConstantSDNode *N1C = dyn_cast<ConstantSDNode>(N1);
ConstantSDNode *N2C = dyn_cast<ConstantSDNode>(N2);		ConstantSDNode *N2C = dyn_cast<ConstantSDNode>(N2);
ConstantFPSDNode *N1CFP = dyn_cast<ConstantFPSDNode>(N1);		ConstantFPSDNode *N1CFP = dyn_cast<ConstantFPSDNode>(N1);
ConstantFPSDNode *N2CFP = dyn_cast<ConstantFPSDNode>(N2);		ConstantFPSDNode *N2CFP = dyn_cast<ConstantFPSDNode>(N2);

// Canonicalize constant to RHS if commutative.		// Canonicalize constant to RHS if commutative.
▲ Show 20 Lines • Show All 4,713 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 129 Lines • ▼ Show 20 Lines
using namespace SwitchCG;		using namespace SwitchCG;

#define DEBUG_TYPE "isel"		#define DEBUG_TYPE "isel"

/// LimitFloatPrecision - Generate low-precision inline sequences for		/// LimitFloatPrecision - Generate low-precision inline sequences for
/// some float libcalls (6, 8 or 12 bits).		/// some float libcalls (6, 8 or 12 bits).
static unsigned LimitFloatPrecision;		static unsigned LimitFloatPrecision;

		static cl::opt<bool>
		arsenmUnsubmitted Not Done Reply Inline Actions Not sure we really need this? arsenm: Not sure we really need this?
		hliaoAuthorUnsubmitted Done Reply Inline Actions That's a hidden option disabling `AssertAlign` inserting to make the debug of regressions easier. hliao: That's a hidden option disabling `AssertAlign` inserting to make the debug of regressions…
		InsertAssertAlign("insert-assert-align", cl::init(true),
		cl::desc("Insert the experimental `assertalign` node."),
		cl::ReallyHidden);

static cl::opt<unsigned, true>		static cl::opt<unsigned, true>
LimitFPPrecision("limit-float-precision",		LimitFPPrecision("limit-float-precision",
cl::desc("Generate low-precision inline sequences "		cl::desc("Generate low-precision inline sequences "
"for some float libcalls"),		"for some float libcalls"),
cl::location(LimitFloatPrecision), cl::Hidden,		cl::location(LimitFloatPrecision), cl::Hidden,
cl::init(0));		cl::init(0));

static cl::opt<unsigned> SwitchPeelThreshold(		static cl::opt<unsigned> SwitchPeelThreshold(
▲ Show 20 Lines • Show All 4,596 Lines • ▼ Show 20 Lines	void SelectionDAGBuilder::visitTargetIntrinsic(const CallInst &I,

if (!I.getType()->isVoidTy()) {		if (!I.getType()->isVoidTy()) {
if (VectorType *PTy = dyn_cast<VectorType>(I.getType())) {		if (VectorType *PTy = dyn_cast<VectorType>(I.getType())) {
EVT VT = TLI.getValueType(DAG.getDataLayout(), PTy);		EVT VT = TLI.getValueType(DAG.getDataLayout(), PTy);
Result = DAG.getNode(ISD::BITCAST, getCurSDLoc(), VT, Result);		Result = DAG.getNode(ISD::BITCAST, getCurSDLoc(), VT, Result);
} else		} else
Result = lowerRangeToAssertZExt(DAG, I, Result);		Result = lowerRangeToAssertZExt(DAG, I, Result);

		MaybeAlign Alignment = I.getRetAlign();
		if (!Alignment)
		Alignment = F->getAttributes().getRetAlignment();
		// Insert `assertalign` node if there's an alignment.
		if (InsertAssertAlign && Alignment) {
		Result =
		DAG.getAssertAlign(getCurSDLoc(), Result, Alignment.valueOrOne());
		}

setValue(&I, Result);		setValue(&I, Result);
}		}
}		}

/// GetSignificand - Get the significand and build it into a floating-point		/// GetSignificand - Get the significand and build it into a floating-point
/// number with exponent of 1:		/// number with exponent of 1:
///		///
/// Op = (Op & 0x007fffff) \| 0x3f800000;		/// Op = (Op & 0x007fffff) \| 0x3f800000;
▲ Show 20 Lines • Show All 5,892 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp

Show First 20 Lines • Show All 100 Lines • ▼ Show 20 Lines	#endif
case ISD::PCMARKER: return "PCMarker";		case ISD::PCMARKER: return "PCMarker";
case ISD::READCYCLECOUNTER: return "ReadCycleCounter";		case ISD::READCYCLECOUNTER: return "ReadCycleCounter";
case ISD::SRCVALUE: return "SrcValue";		case ISD::SRCVALUE: return "SrcValue";
case ISD::MDNODE_SDNODE: return "MDNode";		case ISD::MDNODE_SDNODE: return "MDNode";
case ISD::EntryToken: return "EntryToken";		case ISD::EntryToken: return "EntryToken";
case ISD::TokenFactor: return "TokenFactor";		case ISD::TokenFactor: return "TokenFactor";
case ISD::AssertSext: return "AssertSext";		case ISD::AssertSext: return "AssertSext";
case ISD::AssertZext: return "AssertZext";		case ISD::AssertZext: return "AssertZext";
		case ISD::AssertAlign: return "AssertAlign";

case ISD::BasicBlock: return "BasicBlock";		case ISD::BasicBlock: return "BasicBlock";
case ISD::VALUETYPE: return "ValueType";		case ISD::VALUETYPE: return "ValueType";
case ISD::Register: return "Register";		case ISD::Register: return "Register";
case ISD::RegisterMask: return "RegisterMask";		case ISD::RegisterMask: return "RegisterMask";
case ISD::Constant:		case ISD::Constant:
if (cast<ConstantSDNode>(this)->isOpaque())		if (cast<ConstantSDNode>(this)->isOpaque())
return "OpaqueConstant";		return "OpaqueConstant";
▲ Show 20 Lines • Show All 876 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp

Show First 20 Lines • Show All 2,814 Lines • ▼ Show 20 Lines	void SelectionDAGISel::SelectCodeCommon(SDNode *NodeToMatch,
case ISD::EH_LABEL:		case ISD::EH_LABEL:
case ISD::ANNOTATION_LABEL:		case ISD::ANNOTATION_LABEL:
case ISD::LIFETIME_START:		case ISD::LIFETIME_START:
case ISD::LIFETIME_END:		case ISD::LIFETIME_END:
NodeToMatch->setNodeId(-1); // Mark selected.		NodeToMatch->setNodeId(-1); // Mark selected.
return;		return;
case ISD::AssertSext:		case ISD::AssertSext:
case ISD::AssertZext:		case ISD::AssertZext:
		case ISD::AssertAlign:
ReplaceUses(SDValue(NodeToMatch, 0), NodeToMatch->getOperand(0));		ReplaceUses(SDValue(NodeToMatch, 0), NodeToMatch->getOperand(0));
CurDAG->RemoveDeadNode(NodeToMatch);		CurDAG->RemoveDeadNode(NodeToMatch);
return;		return;
case ISD::INLINEASM:		case ISD::INLINEASM:
case ISD::INLINEASM_BR:		case ISD::INLINEASM_BR:
Select_INLINEASM(NodeToMatch);		Select_INLINEASM(NodeToMatch);
return;		return;
case ISD::READ_REGISTER:		case ISD::READ_REGISTER:
▲ Show 20 Lines • Show All 917 Lines • Show Last 20 Lines

llvm/lib/Target/AMDGPU/AMDGPUISelDAGToDAG.cpp

Show First 20 Lines • Show All 1,622 Lines • ▼ Show 20 Lines	static MemSDNode* findMemSDNode(SDNode *N) {
assert(isa<BuildVectorSDNode>(N));		assert(isa<BuildVectorSDNode>(N));
for (SDValue V : N->op_values())		for (SDValue V : N->op_values())
if (MemSDNode *MN =		if (MemSDNode *MN =
dyn_cast<MemSDNode>(AMDGPUTargetLowering::stripBitcast(V)))		dyn_cast<MemSDNode>(AMDGPUTargetLowering::stripBitcast(V)))
return MN;		return MN;
llvm_unreachable("cannot find MemSDNode in the pattern!");		llvm_unreachable("cannot find MemSDNode in the pattern!");
}		}

		static bool getBaseWithOffsetUsingSplitOR(SelectionDAG &DAG, SDValue Addr,
		SDValue &N0, SDValue &N1) {
		if (Addr.getValueType() == MVT::i64 && Addr.getOpcode() == ISD::BITCAST &&
		Addr.getOperand(0).getOpcode() == ISD::BUILD_VECTOR) {
		// As we split 64-bit `or` earlier, it's complicated pattern to match, i.e.
		arsenmUnsubmitted Done Reply Inline Actions Can you add a dag style comment for the pattern this matches? This could also use some early returns arsenm: Can you add a dag style comment for the pattern this matches? This could also use some early…
		// (i64 (bitcast (v2i32 (build_vector
		// (or (extract_vector_elt V, 0), OFFSET),
		// (extract_vector_elt V, 1)))))
		SDValue Lo = Addr.getOperand(0).getOperand(0);
		if (Lo.getOpcode() == ISD::OR && DAG.isBaseWithConstantOffset(Lo)) {
		SDValue BaseLo = Lo.getOperand(0);
		SDValue BaseHi = Addr.getOperand(0).getOperand(1);
		// Check that split base (Lo and Hi) are extracted from the same one.
		if (BaseLo.getOpcode() == ISD::EXTRACT_VECTOR_ELT &&
		BaseHi.getOpcode() == ISD::EXTRACT_VECTOR_ELT &&
		BaseLo.getOperand(0) == BaseHi.getOperand(0) &&
		// Lo is statically extracted from index 0.
		isa<ConstantSDNode>(BaseLo.getOperand(1)) &&
		BaseLo.getConstantOperandVal(1) == 0 &&
		// Hi is statically extracted from index 0.
		isa<ConstantSDNode>(BaseHi.getOperand(1)) &&
		BaseHi.getConstantOperandVal(1) == 1) {
		N0 = BaseLo.getOperand(0).getOperand(0);
		N1 = Lo.getOperand(1);
		return true;
		}
		}
		}
		return false;
		}

template <bool IsSigned>		template <bool IsSigned>
bool AMDGPUDAGToDAGISel::SelectFlatOffset(SDNode *N,		bool AMDGPUDAGToDAGISel::SelectFlatOffset(SDNode *N,
SDValue Addr,		SDValue Addr,
SDValue &VAddr,		SDValue &VAddr,
SDValue &Offset,		SDValue &Offset,
SDValue &SLC) const {		SDValue &SLC) const {
int64_t OffsetVal = 0;		int64_t OffsetVal = 0;

if (Subtarget->hasFlatInstOffsets() &&		if (Subtarget->hasFlatInstOffsets() &&
(!Subtarget->hasFlatSegmentOffsetBug() \|\|		(!Subtarget->hasFlatSegmentOffsetBug() \|\|
findMemSDNode(N)->getAddressSpace() != AMDGPUAS::FLAT_ADDRESS) &&		findMemSDNode(N)->getAddressSpace() != AMDGPUAS::FLAT_ADDRESS)) {
CurDAG->isBaseWithConstantOffset(Addr)) {		SDValue N0, N1;
SDValue N0 = Addr.getOperand(0);		if (CurDAG->isBaseWithConstantOffset(Addr)) {
SDValue N1 = Addr.getOperand(1);		N0 = Addr.getOperand(0);
		N1 = Addr.getOperand(1);
		} else if (getBaseWithOffsetUsingSplitOR(*CurDAG, Addr, N0, N1)) {
		assert(N0 && N1 && isa<ConstantSDNode>(N1));
		}
		if (N0 && N1) {
uint64_t COffsetVal = cast<ConstantSDNode>(N1)->getSExtValue();		uint64_t COffsetVal = cast<ConstantSDNode>(N1)->getSExtValue();

const SIInstrInfo *TII = Subtarget->getInstrInfo();		const SIInstrInfo *TII = Subtarget->getInstrInfo();
unsigned AS = findMemSDNode(N)->getAddressSpace();		unsigned AS = findMemSDNode(N)->getAddressSpace();
if (TII->isLegalFLATOffset(COffsetVal, AS, IsSigned)) {		if (TII->isLegalFLATOffset(COffsetVal, AS, IsSigned)) {
Addr = N0;		Addr = N0;
OffsetVal = COffsetVal;		OffsetVal = COffsetVal;
} else {		} else {
// If the offset doesn't fit, put the low bits into the offset field and		// If the offset doesn't fit, put the low bits into the offset field and
// add the rest.		// add the rest.

SDLoc DL(N);		SDLoc DL(N);
uint64_t ImmField;		uint64_t ImmField;
const unsigned NumBits = TII->getNumFlatOffsetBits(AS, IsSigned);		const unsigned NumBits = TII->getNumFlatOffsetBits(AS, IsSigned);
if (IsSigned) {		if (IsSigned) {
ImmField = SignExtend64(COffsetVal, NumBits);		ImmField = SignExtend64(COffsetVal, NumBits);

// Don't use a negative offset field if the base offset is positive.		// Don't use a negative offset field if the base offset is positive.
// Since the scheduler currently relies on the offset field, doing so		// Since the scheduler currently relies on the offset field, doing so
// could result in strange scheduling decisions.		// could result in strange scheduling decisions.

// TODO: Should we not do this in the opposite direction as well?		// TODO: Should we not do this in the opposite direction as well?
if (static_cast<int64_t>(COffsetVal) > 0) {		if (static_cast<int64_t>(COffsetVal) > 0) {
if (static_cast<int64_t>(ImmField) < 0) {		if (static_cast<int64_t>(ImmField) < 0) {
const uint64_t OffsetMask = maskTrailingOnes<uint64_t>(NumBits - 1);		const uint64_t OffsetMask =
		maskTrailingOnes<uint64_t>(NumBits - 1);
ImmField = COffsetVal & OffsetMask;		ImmField = COffsetVal & OffsetMask;
}		}
}		}
} else {		} else {
// TODO: Should we do this for a negative offset?		// TODO: Should we do this for a negative offset?
const uint64_t OffsetMask = maskTrailingOnes<uint64_t>(NumBits);		const uint64_t OffsetMask = maskTrailingOnes<uint64_t>(NumBits);
ImmField = COffsetVal & OffsetMask;		ImmField = COffsetVal & OffsetMask;
}		}

uint64_t RemainderOffset = COffsetVal - ImmField;		uint64_t RemainderOffset = COffsetVal - ImmField;

assert(TII->isLegalFLATOffset(ImmField, AS, IsSigned));		assert(TII->isLegalFLATOffset(ImmField, AS, IsSigned));
assert(RemainderOffset + ImmField == COffsetVal);		assert(RemainderOffset + ImmField == COffsetVal);

OffsetVal = ImmField;		OffsetVal = ImmField;

// TODO: Should this try to use a scalar add pseudo if the base address is		// TODO: Should this try to use a scalar add pseudo if the base address
// uniform and saddr is usable?		// is uniform and saddr is usable?
SDValue Sub0 = CurDAG->getTargetConstant(AMDGPU::sub0, DL, MVT::i32);		SDValue Sub0 = CurDAG->getTargetConstant(AMDGPU::sub0, DL, MVT::i32);
SDValue Sub1 = CurDAG->getTargetConstant(AMDGPU::sub1, DL, MVT::i32);		SDValue Sub1 = CurDAG->getTargetConstant(AMDGPU::sub1, DL, MVT::i32);

SDNode *N0Lo = CurDAG->getMachineNode(TargetOpcode::EXTRACT_SUBREG,		SDNode *N0Lo = CurDAG->getMachineNode(TargetOpcode::EXTRACT_SUBREG, DL,
DL, MVT::i32, N0, Sub0);		MVT::i32, N0, Sub0);
SDNode *N0Hi = CurDAG->getMachineNode(TargetOpcode::EXTRACT_SUBREG,		SDNode *N0Hi = CurDAG->getMachineNode(TargetOpcode::EXTRACT_SUBREG, DL,
DL, MVT::i32, N0, Sub1);		MVT::i32, N0, Sub1);

SDValue AddOffsetLo		SDValue AddOffsetLo =
= getMaterializedScalarImm32(Lo_32(RemainderOffset), DL);		getMaterializedScalarImm32(Lo_32(RemainderOffset), DL);
SDValue AddOffsetHi		SDValue AddOffsetHi =
= getMaterializedScalarImm32(Hi_32(RemainderOffset), DL);		getMaterializedScalarImm32(Hi_32(RemainderOffset), DL);

		arsenmUnsubmitted Not Done Reply Inline Actions Lots of unrelated formatting changes? arsenm: Lots of unrelated formatting changes?
		hliaoAuthorUnsubmitted Done Reply Inline Actions That's due to the extra indent added from L1680. Also, the lint progress in arc review tries to re-formatting all changed code. hliao: That's due to the extra indent added from L1680. Also, the lint progress in arc review tries to…
		arsenmUnsubmitted Not Done Reply Inline Actions Could switch to early return and avoid it? arsenm: Could switch to early return and avoid it?
		hliaoAuthorUnsubmitted Done Reply Inline Actions That code just conditionally refine `Addr` and, eventually, all paths need to join L1760 to prepare all the return values. Unless `goto` is used or code duplication, early return cannot be used here. hliao: That code just conditionally refine `Addr` and, eventually, all paths need to join L1760 to…
		arsenmUnsubmitted Not Done Reply Inline Actions Duplicating the trivial case isn't a big deal arsenm: Duplicating the trivial case isn't a big deal
SDVTList VTs = CurDAG->getVTList(MVT::i32, MVT::i1);		SDVTList VTs = CurDAG->getVTList(MVT::i32, MVT::i1);
SDValue Clamp = CurDAG->getTargetConstant(0, DL, MVT::i1);		SDValue Clamp = CurDAG->getTargetConstant(0, DL, MVT::i1);

SDNode *Add = CurDAG->getMachineNode(		SDNode *Add =
AMDGPU::V_ADD_I32_e64, DL, VTs,		CurDAG->getMachineNode(AMDGPU::V_ADD_I32_e64, DL, VTs,
{AddOffsetLo, SDValue(N0Lo, 0), Clamp});		{AddOffsetLo, SDValue(N0Lo, 0), Clamp});

SDNode *Addc = CurDAG->getMachineNode(		SDNode *Addc = CurDAG->getMachineNode(
AMDGPU::V_ADDC_U32_e64, DL, VTs,		AMDGPU::V_ADDC_U32_e64, DL, VTs,
{AddOffsetHi, SDValue(N0Hi, 0), SDValue(Add, 1), Clamp});		{AddOffsetHi, SDValue(N0Hi, 0), SDValue(Add, 1), Clamp});

SDValue RegSequenceArgs[] = {		SDValue RegSequenceArgs[] = {
CurDAG->getTargetConstant(AMDGPU::VReg_64RegClassID, DL, MVT::i32),		CurDAG->getTargetConstant(AMDGPU::VReg_64RegClassID, DL, MVT::i32),
SDValue(Add, 0), Sub0, SDValue(Addc, 0), Sub1		SDValue(Add, 0), Sub0, SDValue(Addc, 0), Sub1};
};

Addr = SDValue(CurDAG->getMachineNode(AMDGPU::REG_SEQUENCE, DL,		Addr = SDValue(CurDAG->getMachineNode(AMDGPU::REG_SEQUENCE, DL,
MVT::i64, RegSequenceArgs), 0);		MVT::i64, RegSequenceArgs),
		0);
		}
}		}
}		}

VAddr = Addr;		VAddr = Addr;
Offset = CurDAG->getTargetConstant(OffsetVal, SDLoc(), MVT::i16);		Offset = CurDAG->getTargetConstant(OffsetVal, SDLoc(), MVT::i16);
SLC = CurDAG->getTargetConstant(0, SDLoc(), MVT::i1);		SLC = CurDAG->getTargetConstant(0, SDLoc(), MVT::i1);
return true;		return true;
}		}
▲ Show 20 Lines • Show All 92 Lines • ▼ Show 20 Lines

bool AMDGPUDAGToDAGISel::SelectSMRD(SDValue Addr, SDValue &SBase,		bool AMDGPUDAGToDAGISel::SelectSMRD(SDValue Addr, SDValue &SBase,
SDValue &Offset, bool &Imm) const {		SDValue &Offset, bool &Imm) const {
SDLoc SL(Addr);		SDLoc SL(Addr);

// A 32-bit (address + offset) should not cause unsigned 32-bit integer		// A 32-bit (address + offset) should not cause unsigned 32-bit integer
// wraparound, because s_load instructions perform the addition in 64 bits.		// wraparound, because s_load instructions perform the addition in 64 bits.
if ((Addr.getValueType() != MVT::i32 \|\|		if ((Addr.getValueType() != MVT::i32 \|\|
Addr->getFlags().hasNoUnsignedWrap()) &&		Addr->getFlags().hasNoUnsignedWrap())) {
(CurDAG->isBaseWithConstantOffset(Addr) \|\|		SDValue N0, N1;
Addr.getOpcode() == ISD::ADD)) {		// Extract the base and offset if possible.
SDValue N0 = Addr.getOperand(0);		if (CurDAG->isBaseWithConstantOffset(Addr) \|\|
SDValue N1 = Addr.getOperand(1);		Addr.getOpcode() == ISD::ADD) {
		N0 = Addr.getOperand(0);
		N1 = Addr.getOperand(1);
		} else if (getBaseWithOffsetUsingSplitOR(*CurDAG, Addr, N0, N1)) {
		assert(N0 && N1 && isa<ConstantSDNode>(N1));
		}
		if (N0 && N1) {
if (SelectSMRDOffset(N1, Offset, Imm)) {		if (SelectSMRDOffset(N1, Offset, Imm)) {
SBase = Expand32BitAddress(N0);		SBase = Expand32BitAddress(N0);
return true;		return true;
}		}
}		}
		}
SBase = Expand32BitAddress(Addr);		SBase = Expand32BitAddress(Addr);
Offset = CurDAG->getTargetConstant(0, SL, MVT::i32);		Offset = CurDAG->getTargetConstant(0, SL, MVT::i32);
Imm = true;		Imm = true;
return true;		return true;
}		}

bool AMDGPUDAGToDAGISel::SelectSMRDImm(SDValue Addr, SDValue &SBase,		bool AMDGPUDAGToDAGISel::SelectSMRDImm(SDValue Addr, SDValue &SBase,
SDValue &Offset) const {		SDValue &Offset) const {
▲ Show 20 Lines • Show All 1,051 Lines • Show Last 20 Lines

llvm/test/CodeGen/AMDGPU/store-weird-sizes.ll

	Show All 24 Lines
	; GFX9-NEXT: s_setpc_b64 s[30:31]			; GFX9-NEXT: s_setpc_b64 s[30:31]
	store i56 %arg, i56 addrspace(3)* %ptr, align 8			store i56 %arg, i56 addrspace(3)* %ptr, align 8
	ret void			ret void
	}			}

	define amdgpu_kernel void @local_store_i55(i55 addrspace(3)* %ptr, i55 %arg) #0 {			define amdgpu_kernel void @local_store_i55(i55 addrspace(3)* %ptr, i55 %arg) #0 {
	; HAWAII-LABEL: local_store_i55:			; HAWAII-LABEL: local_store_i55:
	; HAWAII: ; %bb.0:			; HAWAII: ; %bb.0:
	; HAWAII-NEXT: s_add_u32 s0, s4, 14			; HAWAII-NEXT: s_or_b32 s0, s4, 14
	; HAWAII-NEXT: s_addc_u32 s1, s5, 0
	; HAWAII-NEXT: v_mov_b32_e32 v0, s0			; HAWAII-NEXT: v_mov_b32_e32 v0, s0
	; HAWAII-NEXT: v_mov_b32_e32 v1, s1			; HAWAII-NEXT: v_mov_b32_e32 v1, s5
	; HAWAII-NEXT: flat_load_ubyte v0, v[0:1]			; HAWAII-NEXT: flat_load_ubyte v0, v[0:1]
	; HAWAII-NEXT: s_load_dword s0, s[4:5], 0x0			; HAWAII-NEXT: s_load_dword s0, s[4:5], 0x0
	; HAWAII-NEXT: s_load_dword s1, s[4:5], 0x2			; HAWAII-NEXT: s_load_dword s1, s[4:5], 0x2
	; HAWAII-NEXT: s_load_dword s2, s[4:5], 0x3			; HAWAII-NEXT: s_load_dword s2, s[4:5], 0x3
	; HAWAII-NEXT: s_mov_b32 m0, -1			; HAWAII-NEXT: s_mov_b32 m0, -1
	; HAWAII-NEXT: s_waitcnt lgkmcnt(0)			; HAWAII-NEXT: s_waitcnt lgkmcnt(0)
	; HAWAII-NEXT: v_mov_b32_e32 v1, s0			; HAWAII-NEXT: v_mov_b32_e32 v1, s0
	; HAWAII-NEXT: v_mov_b32_e32 v3, s1			; HAWAII-NEXT: v_mov_b32_e32 v3, s1
	; HAWAII-NEXT: v_mov_b32_e32 v2, s2			; HAWAII-NEXT: v_mov_b32_e32 v2, s2
	; HAWAII-NEXT: ds_write_b16 v1, v2 offset:4			; HAWAII-NEXT: ds_write_b16 v1, v2 offset:4
	; HAWAII-NEXT: s_waitcnt vmcnt(0)			; HAWAII-NEXT: s_waitcnt vmcnt(0)
	; HAWAII-NEXT: v_and_b32_e32 v0, 0x7f, v0			; HAWAII-NEXT: v_and_b32_e32 v0, 0x7f, v0
	; HAWAII-NEXT: ds_write_b8 v1, v0 offset:6			; HAWAII-NEXT: ds_write_b8 v1, v0 offset:6
	; HAWAII-NEXT: ds_write_b32 v1, v3			; HAWAII-NEXT: ds_write_b32 v1, v3
	; HAWAII-NEXT: s_endpgm			; HAWAII-NEXT: s_endpgm
	;			;
	; FIJI-LABEL: local_store_i55:			; FIJI-LABEL: local_store_i55:
	; FIJI: ; %bb.0:			; FIJI: ; %bb.0:
				; FIJI-NEXT: s_or_b32 s0, s4, 14
				; FIJI-NEXT: v_mov_b32_e32 v0, s0
				; FIJI-NEXT: v_mov_b32_e32 v1, s5
				; FIJI-NEXT: flat_load_ubyte v0, v[0:1]
	; FIJI-NEXT: s_load_dword s0, s[4:5], 0x0			; FIJI-NEXT: s_load_dword s0, s[4:5], 0x0
	; FIJI-NEXT: s_load_dword s2, s[4:5], 0x8			; FIJI-NEXT: s_load_dword s1, s[4:5], 0x8
	; FIJI-NEXT: s_load_dword s1, s[4:5], 0xc			; FIJI-NEXT: s_load_dword s2, s[4:5], 0xc
	; FIJI-NEXT: s_mov_b32 m0, -1			; FIJI-NEXT: s_mov_b32 m0, -1
	; FIJI-NEXT: s_waitcnt lgkmcnt(0)			; FIJI-NEXT: s_waitcnt lgkmcnt(0)
	; FIJI-NEXT: v_mov_b32_e32 v2, s0			; FIJI-NEXT: v_mov_b32_e32 v1, s0
	; FIJI-NEXT: s_and_b32 s3, s1, 0xffff
	; FIJI-NEXT: s_add_u32 s0, s4, 14
	; FIJI-NEXT: v_mov_b32_e32 v3, s1			; FIJI-NEXT: v_mov_b32_e32 v3, s1
	; FIJI-NEXT: s_addc_u32 s1, s5, 0			; FIJI-NEXT: s_and_b32 s3, s2, 0xffff
	; FIJI-NEXT: v_mov_b32_e32 v0, s0			; FIJI-NEXT: v_mov_b32_e32 v2, s2
	; FIJI-NEXT: v_mov_b32_e32 v1, s1			; FIJI-NEXT: ds_write_b16 v1, v2 offset:4
	; FIJI-NEXT: flat_load_ubyte v0, v[0:1]			; FIJI-NEXT: s_waitcnt vmcnt(0)
	; FIJI-NEXT: ds_write_b16 v2, v3 offset:4
	; FIJI-NEXT: v_mov_b32_e32 v3, s2
	; FIJI-NEXT: s_waitcnt vmcnt(0) lgkmcnt(1)
	; FIJI-NEXT: v_lshlrev_b32_e32 v0, 16, v0			; FIJI-NEXT: v_lshlrev_b32_e32 v0, 16, v0
	; FIJI-NEXT: v_or_b32_e32 v0, s3, v0			; FIJI-NEXT: v_or_b32_e32 v0, s3, v0
	; FIJI-NEXT: v_bfe_u32 v0, v0, 16, 7			; FIJI-NEXT: v_bfe_u32 v0, v0, 16, 7
	; FIJI-NEXT: ds_write_b8 v2, v0 offset:6			; FIJI-NEXT: ds_write_b8 v1, v0 offset:6
	; FIJI-NEXT: ds_write_b32 v2, v3			; FIJI-NEXT: ds_write_b32 v1, v3
	; FIJI-NEXT: s_endpgm			; FIJI-NEXT: s_endpgm
	;			;
	; GFX9-LABEL: local_store_i55:			; GFX9-LABEL: local_store_i55:
	; GFX9: ; %bb.0:			; GFX9: ; %bb.0:
	; GFX9-NEXT: v_mov_b32_e32 v0, s4			; GFX9-NEXT: v_mov_b32_e32 v0, s4
	; GFX9-NEXT: v_mov_b32_e32 v1, s5			; GFX9-NEXT: v_mov_b32_e32 v1, s5
	; GFX9-NEXT: v_mov_b32_e32 v2, 0			; GFX9-NEXT: v_mov_b32_e32 v2, 0
	; GFX9-NEXT: global_load_ubyte_d16_hi v2, v[0:1], off offset:14			; GFX9-NEXT: global_load_ubyte_d16_hi v2, v[0:1], off offset:14
	▲ Show 20 Lines • Show All 160 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[SDAG] Add new AssertAlign ISD node.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 272603

llvm/include/llvm/CodeGen/ISDOpcodes.h

llvm/include/llvm/CodeGen/SelectionDAG.h

llvm/include/llvm/CodeGen/SelectionDAGNodes.h

llvm/include/llvm/Target/TargetSelectionDAG.td

llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp

llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp

llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp

llvm/lib/Target/AMDGPU/AMDGPUISelDAGToDAG.cpp

llvm/test/CodeGen/AMDGPU/store-weird-sizes.ll

[SDAG] Add new AssertAlign ISD node.
ClosedPublic