This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/CodeGen/
-
llvm/
-
CodeGen/
-
TargetLowering.h
-
lib/CodeGen/SelectionDAG/
-
CodeGen/
-
SelectionDAG/
-
LegalizeDAG.cpp

Differential D91050

[NFC] Add the EmitTargetCodeForConstantPool hook for target to customize it with MachineConstantPoolValue
AbandonedPublic

Authored by lkail on Nov 9 2020, 12:14 AM.

Download Raw Diff

Details

Reviewers

MaskRay
RKSimon
echristo
spatel
efriedma
craig.topper
steven.zhang

Group Reviewers

Restricted Project

Summary

We have the MachineConstantPoolValue to allow the target to customize the lowering of the constant. But when we are trying to create the Constant Pool from SelectionDAG, target has no way to customize it with MachineConstantPoolValue. So, propose a new hook inside SelectionDAGTargetInfo so that, target could customize the lower of constant pool. Also, add a new interface so that, we won't assume that, the result value of DAG.getConstantPool() must be ConstantPoolSDNode which is not always true if target customizes the lower of constant pool.

Diff Detail

Repository: rG LLVM Github Monorepo

Unit TestsFailed

	Time	Test
	370 ms	linux > HWAddressSanitizer-x86_64.TestCases::sizes.cpp

Event Timeline

steven.zhang created this revision.Nov 9 2020, 12:14 AM

Herald added a project: Restricted Project. · View Herald TranscriptNov 9 2020, 12:14 AM

Herald added a subscriber: hiraditya. · View Herald Transcript

steven.zhang requested review of this revision.Nov 9 2020, 12:14 AM

Harbormaster completed remote builds in B78065: Diff 303760.Nov 9 2020, 12:52 AM

steven.zhang added a child revision: D91053: [PowerPC] Lump the constants to save one addis for each constant access.Nov 9 2020, 1:01 AM

Gentle ping.

steven.zhang added a reviewer: craig.topper.Nov 30 2020, 3:02 AM

steven.zhang mentioned this in D91053: [PowerPC] Lump the constants to save one addis for each constant access.Dec 22 2020, 3:38 PM

What should we do with SelectionDAG::getConstantPool? At the very least I'd expect better comments explaining when each should be used, but I'm wondering whether we can get away with just the TLI version?

yubing added a subscriber: yubing.Dec 23 2020, 6:29 AM

In D91050#2469885, @RKSimon wrote:

What should we do with SelectionDAG::getConstantPool? At the very least I'd expect better comments explaining when each should be used, but I'm wondering whether we can get away with just the TLI version?

Removing the getConstantPool from SelectionDAG doesn't make sense as we are creating SDNode inside DAG. And yes, it will confuse us if there are two version, no matter how detail the comments have. Maybe, we can use the SelectionDAGTargetInfo and call the hook inside DAG.getConstantPool as what we did for getMcmcpy(). So that, we only have one interface. The challenge is that, there are many places that assume the return value of DAG.getConstantPool is ConstantPoolSDNode which will not be true anymore if there is hook inside it. I will try with this direction to see if there is a better way to handle this.

Re-implement with SelectionDAGTargetInfo.

Herald added a subscriber: pengfei. · View Herald TranscriptDec 27 2020, 9:29 PM

Harbormaster completed remote builds in B83565: Diff 313821.Dec 27 2020, 10:03 PM

Hmm... I think it is very surprising if DAG.getConstantPool() does not return ConstantPoolSDNode. I am not in favour of any design that violates such seemingly reasonable assumptions.

In D91050#2472648, @nemanjai wrote:

Hmm... I think it is very surprising if DAG.getConstantPool() does not return ConstantPoolSDNode. I am not in favour of any design that violates such seemingly reasonable assumptions.

But it has no harm or even better to remove such assumptions. Is it right ?

The motivation for this patch is for D91053 (I should have documented it more clear). We indeed have the case that DAG.getConstantPool() doesn't return 'ConstantPoolSDNode'. This is what we want to do in PowerPC:

                  ConstantPoolSDNode
                  +----------------+
ConstantFP        |                |
+-------+         |                |
|  1.5  +--------->xxxxxxxxxxxxxxxx|
+-------+         |                |
                  |                |
                  |                |
ConstantFP        |                |
+-------+         |                |
|  3.0  +--------->xxxxxxxxxxxxxxxx|
+-------+         |                |
                  +----------------+

When we are creating the constant pool from DAG, we want to return the same ConstantPoolSDNode but with different offset. This is done by ConstantPoolSDNode + ADD. And as we have the MachineConstantPoolValue to allow target to customize its constant pool, we'd better remove such kind of assumption. And it also make the code a bit clean and extensible, IMO. What do you think ?

Ping...

In D91050#2480906, @steven.zhang wrote:

Ping...

I really don't think this is the right interface for this, similar to what Nemanja said. In addition, the uses and needs of Alignment here are a little odd and I think need some more explanation. Can you elaborate on what you're trying to do here a bit more?

Thanks!

-eric

Reason why we need to change the interface

I want to put the constants with the same type and alignment into the same constant pool(ConstantPoolSDNode) to reduce the TOC and improve the access performance. See the description in D91053 if you're interested in the detail. So, when we are trying to create the constant pool in DAG(DAG.getConstantPool), what it returns is the ADD ConstantPoolSDNode, Offset. And when emitting the ConstantPoolSDNode, we will have something like this(One TOC entry pointed to 4 constants in this example):

.LCPI0_0:
        .quad   0x402cc28f5c28f5c3              # double 14.380000000000001
        .quad   0x4002b851eb851eb8              # double 2.3399999999999999
        .quad   0x40120c49ba5e353f              # double 4.5119999999999996
        .quad   0x3ff3ae147ae147ae              # double 1.23

.LC0:
        .tc .LCPI0_0[TC],.LCPI0_0

As the return of DAG.getConstantPool() is NOT ConstantPoolSDNode anymore(It is shared ConstantPoolSDNode + Offset), we cannot get the alignment by casting the return value of DAG.getConstantPool() to ConstantPoolSDNode.

Reason why we change the interface that way

The semantic of the interface of getConstantPool is that, if we specify the alignment, use it, otherwise, it will calculate the alignment for you. You have to get the alignment from the result of DAG.getConstantPool() if it is calculated internal which shows as follows:

SDValue SelectionDAG::getConstantPool(const Constant *C, EVT VT,
                                      MaybeAlign Alignment, int Offset,
                                      bool isTarget, unsigned TargetFlags) {
  assert((TargetFlags == 0 || isTarget) &&
         "Cannot set target flags on target-independent globals");
  if (!Alignment)
    Alignment = shouldOptForSize()
                    ? getDataLayout().getABITypeAlign(C->getType())
                    : getDataLayout().getPrefTypeAlign(C->getType());

So, I added a new parameter NewAlign to tell us what the final alignment constant pool has instead of casting from the return value that assume that, the return value must be ConstantPoolSDNode:

SDValue getConstantPool(const Constant *C, EVT VT, MaybeAlign Alignment, Align &NewAlign, ...)

DAG.getConstantPool(C, VT, None, NewAlign,...) means you don't have preference on the alignment and we will calculate it for you.
DAG.getConstantPool(C, VT, Align, NewAlign,...) means you have the alignment preference, but the final alignment is still set in NewAlign and they can be difference technical speaking. (i.e. you want the 4 byte aligned constant pool, we can still return the 8 byte aligned constant pool to share with it on PowerPC)

This is the way I am proposing and I am open and welcome for any suggestion. Thank you for all the comments and happy new year :)

Passing an alignment in and returning an alignment out at the same time seems unlikely to be useful. DAGCombine can refine the alignment later anyway, if it turns out to be relevant. Maybe rename the version that returns the alignment out?

Also, if the point is literally just "glue a bunch of ConstantPools together", can we teach target-independent code to do that? It doesn't actually require anything target-specific, I think, and it should be useful on any target where computing the address of a constant pool isn't cheap.

(Just realized this patch isn't being actively developed; feel free to ignore if you're not planning to continue work on this.)

lkail commandeered this revision.Jul 28 2021, 7:58 PM

lkail added a reviewer: steven.zhang.

In D91050#2905388, @efriedma wrote:

(Just realized this patch isn't being actively developed; feel free to ignore if you're not planning to continue work on this.)

This patch is in preparation for https://reviews.llvm.org/D91053 which aims at optimizing layout of PPC's TOC. Current implementation still has limitations. To me, at first galance, we should just collect what constants we have at each basic block at isel phase and perform TOC layout optimization after isel.

Sorry, I just missed this.

Also, if the point is literally just "glue a bunch of ConstantPools together", can we teach target-independent code to do that? It doesn't actually require anything target-specific, I think, and it should be useful on any target where computing the address of a constant pool isn't cheap.

"glue a bunch of ConstantPools together", exactly.

can we teach target-independent code to do that? It doesn't actually require anything target-specific, I think, and it should be useful on any target where computing the address of a constant pool isn't cheap.

Good one, maybe we would post a new patch for this.

In D91050#2905380, @efriedma wrote:

Passing an alignment in and returning an alignment out at the same time seems unlikely to be useful. DAGCombine can refine the alignment later anyway, if it turns out to be relevant. Maybe rename the version that returns the alignment out?

Also, if the point is literally just "glue a bunch of ConstantPools together", can we teach target-independent code to do that? It doesn't actually require anything target-specific, I think, and it should be useful on any target where computing the address of a constant pool isn't cheap.

Passing a constant in，which has the alignment of the constant，return a constantpool，which need to specify the alignment of constantpool. We cannot query the alignment of constantpool now as the result could be cp + offset.

In D91050#2915306, @steven.zhang wrote:

In D91050#2905380, @efriedma wrote:

Passing an alignment in and returning an alignment out at the same time seems unlikely to be useful. DAGCombine can refine the alignment later anyway, if it turns out to be relevant. Maybe rename the version that returns the alignment out?

Passing a constant in，which has the alignment of the constant，return a constantpool，which need to specify the alignment of constantpool. We cannot query the alignment of constantpool now as the result could be cp + offset.

I think you're missing my point. If the caller has a particular idea of what alignment it needs, it will request it. The result will have at least that alignment. If the actual alignment comes out higher, it doesn't really matter.

SelectionDAG::computeKnownBits can be used to query the alignment of an arbitrary pointer. It looks like it doesn't currently try to use the alignment of a constant pool, but it could, if it mattered.

In D91050#2917401, @efriedma wrote:

In D91050#2915306, @steven.zhang wrote:

In D91050#2905380, @efriedma wrote:

Passing an alignment in and returning an alignment out at the same time seems unlikely to be useful. DAGCombine can refine the alignment later anyway, if it turns out to be relevant. Maybe rename the version that returns the alignment out?

Passing a constant in，which has the alignment of the constant，return a constantpool，which need to specify the alignment of constantpool. We cannot query the alignment of constantpool now as the result could be cp + offset.

I think you're missing my point. If the caller has a particular idea of what alignment it needs, it will request it. The result will have at least that alignment. If the actual alignment comes out higher, it doesn't really matter.

SelectionDAG::computeKnownBits can be used to query the alignment of an arbitrary pointer. It looks like it doesn't currently try to use the alignment of a constant pool, but it could, if it mattered.

Great to know that we have some way to query the alignment. If that works, we could have one patch to avoid querying the constant pool alignment again after it is created. And get the underlying alignment with computeKnowBits likewise interface in the place that need the exactly alignment.

Revision Contents

Path

Size

llvm/

include/

llvm/

CodeGen/

TargetLowering.h

13 lines

lib/

CodeGen/

SelectionDAG/

LegalizeDAG.cpp

22 lines

Diff 303760

llvm/include/llvm/CodeGen/TargetLowering.h

Show First 20 Lines • Show All 3,595 Lines • ▼ Show 20 Lines	public:
/// physical registers in the exit blocks. This function is called at the end		/// physical registers in the exit blocks. This function is called at the end
/// of instruction selection.		/// of instruction selection.
virtual void insertCopiesSplitCSR(		virtual void insertCopiesSplitCSR(
MachineBasicBlock *Entry,		MachineBasicBlock *Entry,
const SmallVectorImpl<MachineBasicBlock *> &Exits) const {		const SmallVectorImpl<MachineBasicBlock *> &Exits) const {
llvm_unreachable("Not Implemented");		llvm_unreachable("Not Implemented");
}		}

		/// Create target specific constant pool and set the new alignment in
		/// \p NewAlign
		virtual SDValue getConstantPool(const Constant *C, SelectionDAG &DAG,
		Align &NewAlign, EVT VT,
		MaybeAlign Align = None, int Offset = 0,
		bool isTarget = false,
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for parameter 'isTarget' [readability-identifier-naming] not useful Lint: Pre-merge checks: clang-tidy: warning: invalid case style for parameter 'isTarget' [readability-identifier…
		unsigned TargetFlags = 0) const {
		SDValue CPIdx =
		DAG.getConstantPool(C, VT, Align, Offset, isTarget, TargetFlags);
		NewAlign = cast<ConstantPoolSDNode>(CPIdx)->getAlign();
		return CPIdx;
		}

/// Return the newly negated expression if the cost is not expensive and		/// Return the newly negated expression if the cost is not expensive and
/// set the cost in \p Cost to indicate that if it is cheaper or neutral to		/// set the cost in \p Cost to indicate that if it is cheaper or neutral to
/// do the negation.		/// do the negation.
virtual SDValue getNegatedExpression(SDValue Op, SelectionDAG &DAG,		virtual SDValue getNegatedExpression(SDValue Op, SelectionDAG &DAG,
bool LegalOps, bool OptForSize,		bool LegalOps, bool OptForSize,
NegatibleCost &Cost,		NegatibleCost &Cost,
unsigned Depth = 0) const;		unsigned Depth = 0) const;

▲ Show 20 Lines • Show All 938 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp

Show First 20 Lines • Show All 322 Lines • ▼ Show 20 Lines	while (SVT != MVT::f32 && SVT != MVT::f16) {
Type SType = SVT.getTypeForEVT(DAG.getContext());		Type SType = SVT.getTypeForEVT(DAG.getContext());
LLVMC = cast<ConstantFP>(ConstantExpr::getFPTrunc(LLVMC, SType));		LLVMC = cast<ConstantFP>(ConstantExpr::getFPTrunc(LLVMC, SType));
VT = SVT;		VT = SVT;
Extend = true;		Extend = true;
}		}
}		}
}		}

SDValue CPIdx =		Align Alignment;
DAG.getConstantPool(LLVMC, TLI.getPointerTy(DAG.getDataLayout()));		SDValue CPIdx = TLI.getConstantPool(LLVMC, DAG, Alignment,
Align Alignment = cast<ConstantPoolSDNode>(CPIdx)->getAlign();		TLI.getPointerTy(DAG.getDataLayout()));
if (Extend) {		if (Extend) {
SDValue Result = DAG.getExtLoad(		SDValue Result = DAG.getExtLoad(
ISD::EXTLOAD, dl, OrigVT, DAG.getEntryNode(), CPIdx,		ISD::EXTLOAD, dl, OrigVT, DAG.getEntryNode(), CPIdx,
MachinePointerInfo::getConstantPool(DAG.getMachineFunction()), VT,		MachinePointerInfo::getConstantPool(DAG.getMachineFunction()), VT,
Alignment);		Alignment);
return Result;		return Result;
}		}
SDValue Result = DAG.getLoad(		SDValue Result = DAG.getLoad(
OrigVT, dl, DAG.getEntryNode(), CPIdx,		OrigVT, dl, DAG.getEntryNode(), CPIdx,
MachinePointerInfo::getConstantPool(DAG.getMachineFunction()), Alignment);		MachinePointerInfo::getConstantPool(DAG.getMachineFunction()), Alignment);
return Result;		return Result;
}		}

/// Expands the Constant node to a load from the constant pool.		/// Expands the Constant node to a load from the constant pool.
SDValue SelectionDAGLegalize::ExpandConstant(ConstantSDNode *CP) {		SDValue SelectionDAGLegalize::ExpandConstant(ConstantSDNode *CP) {
SDLoc dl(CP);		SDLoc dl(CP);
		Align Alignment;
EVT VT = CP->getValueType(0);		EVT VT = CP->getValueType(0);
SDValue CPIdx = DAG.getConstantPool(CP->getConstantIntValue(),		SDValue CPIdx = TLI.getConstantPool(CP->getConstantIntValue(), DAG, Alignment,
TLI.getPointerTy(DAG.getDataLayout()));		TLI.getPointerTy(DAG.getDataLayout()));
Align Alignment = cast<ConstantPoolSDNode>(CPIdx)->getAlign();
SDValue Result = DAG.getLoad(		SDValue Result = DAG.getLoad(
VT, dl, DAG.getEntryNode(), CPIdx,		VT, dl, DAG.getEntryNode(), CPIdx,
MachinePointerInfo::getConstantPool(DAG.getMachineFunction()), Alignment);		MachinePointerInfo::getConstantPool(DAG.getMachineFunction()), Alignment);
return Result;		return Result;
}		}

/// Some target cannot handle a variable insertion index for the		/// Some target cannot handle a variable insertion index for the
/// INSERT_VECTOR_ELT instruction. In this case, it		/// INSERT_VECTOR_ELT instruction. In this case, it
▲ Show 20 Lines • Show All 1,643 Lines • ▼ Show 20 Lines	for (unsigned i = 0, e = NumElems; i != e; ++i) {
CI->getZExtValue()));		CI->getZExtValue()));
}		}
} else {		} else {
assert(Node->getOperand(i).isUndef());		assert(Node->getOperand(i).isUndef());
Type OpNTy = EltVT.getTypeForEVT(DAG.getContext());		Type OpNTy = EltVT.getTypeForEVT(DAG.getContext());
CV.push_back(UndefValue::get(OpNTy));		CV.push_back(UndefValue::get(OpNTy));
}		}
}		}
		Align Alignment;
Constant *CP = ConstantVector::get(CV);		Constant *CP = ConstantVector::get(CV);
SDValue CPIdx =		SDValue CPIdx = TLI.getConstantPool(CP, DAG, Alignment,
DAG.getConstantPool(CP, TLI.getPointerTy(DAG.getDataLayout()));		TLI.getPointerTy(DAG.getDataLayout()));
Align Alignment = cast<ConstantPoolSDNode>(CPIdx)->getAlign();
return DAG.getLoad(		return DAG.getLoad(
VT, dl, DAG.getEntryNode(), CPIdx,		VT, dl, DAG.getEntryNode(), CPIdx,
MachinePointerInfo::getConstantPool(DAG.getMachineFunction()),		MachinePointerInfo::getConstantPool(DAG.getMachineFunction()),
Alignment);		Alignment);
}		}

SmallSet<SDValue, 16> DefinedValues;		SmallSet<SDValue, 16> DefinedValues;
for (unsigned i = 0; i < NumElems; ++i) {		for (unsigned i = 0; i < NumElems; ++i) {
▲ Show 20 Lines • Show All 527 Lines • ▼ Show 20 Lines	SDValue SelectionDAGLegalize::ExpandLegalINT_TO_FP(SDNode *Node,
case MVT::i32: FF = 0x4F800000ULL; break; // 2^32 (as a float)		case MVT::i32: FF = 0x4F800000ULL; break; // 2^32 (as a float)
case MVT::i64: FF = 0x5F800000ULL; break; // 2^64 (as a float)		case MVT::i64: FF = 0x5F800000ULL; break; // 2^64 (as a float)
}		}
if (DAG.getDataLayout().isLittleEndian())		if (DAG.getDataLayout().isLittleEndian())
FF <<= 32;		FF <<= 32;
Constant *FudgeFactor = ConstantInt::get(		Constant *FudgeFactor = ConstantInt::get(
Type::getInt64Ty(*DAG.getContext()), FF);		Type::getInt64Ty(*DAG.getContext()), FF);

SDValue CPIdx =		Align Alignment;
DAG.getConstantPool(FudgeFactor, TLI.getPointerTy(DAG.getDataLayout()));		SDValue CPIdx = TLI.getConstantPool(FudgeFactor, DAG, Alignment,
Align Alignment = cast<ConstantPoolSDNode>(CPIdx)->getAlign();		TLI.getPointerTy(DAG.getDataLayout()));
CPIdx = DAG.getNode(ISD::ADD, dl, CPIdx.getValueType(), CPIdx, CstOffset);		CPIdx = DAG.getNode(ISD::ADD, dl, CPIdx.getValueType(), CPIdx, CstOffset);
Alignment = commonAlignment(Alignment, 4);		Alignment = commonAlignment(Alignment, 4);
SDValue FudgeInReg;		SDValue FudgeInReg;
if (DestVT == MVT::f32)		if (DestVT == MVT::f32)
FudgeInReg = DAG.getLoad(		FudgeInReg = DAG.getLoad(
MVT::f32, dl, DAG.getEntryNode(), CPIdx,		MVT::f32, dl, DAG.getEntryNode(), CPIdx,
MachinePointerInfo::getConstantPool(DAG.getMachineFunction()),		MachinePointerInfo::getConstantPool(DAG.getMachineFunction()),
Alignment);		Alignment);
▲ Show 20 Lines • Show All 2,394 Lines • Show Last 20 Lines