This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
lib/CodeGen/GlobalISel/
-
CodeGen/
-
GlobalISel/
1/8
IRTranslator.cpp
-
test/CodeGen/AArch64/GlobalISel/
-
CodeGen/
-
AArch64/
-
GlobalISel/
2/6
arm64-irtranslator.ll

Differential D30216

GlobalISel: Translate ConstantDataVector
ClosedPublic

Authored by volkan on Feb 21 2017, 10:38 AM.

Download Raw Diff

Details

Reviewers

qcolombet
dsanders
t.p.northover
ab
javed.absar
aditya_nandakumar

Commits

rG38a91a0de661: GlobalISel: Translate ConstantDataVector
rL297670: GlobalISel: Translate ConstantDataVector

Diff Detail

Event Timeline

volkan created this revision.Feb 21 2017, 10:38 AM

Herald added a reviewer: javed.absar. · View Herald TranscriptFeb 21 2017, 10:38 AM

Herald added subscribers: kristof.beyls, rovka, dberris. · View Herald Transcript

This will LGTM for the architectures supported by GlobalISel at the moment with a couple more tests but I think there's targets in LLVM that need different behaviour (PowerPC?). If that's the case, should we account for this now or later?

lib/CodeGen/GlobalISel/IRTranslator.cpp
988	This looks correct for ARM/AArch64/Mips, but I'm not sure this is correct for all targets. I'm thinking of big-endian targets where the highest-numbered element is stored at bit 0 (PowerPC?).
test/CodeGen/AArch64/GlobalISel/arm64-irtranslator.ll
1193–1201	I'd add a couple more tests, one for another vector size (preferably not a power of 2) and one for a different type and size like 'double'.

This revision is now accepted and ready to land.Feb 28 2017, 7:01 AM

kristof.beyls added inline comments.Feb 28 2017, 11:32 AM

lib/CodeGen/GlobalISel/IRTranslator.cpp
988	If this is not correct for some targets, there should at list be a FIXME here explaining in what circumstances the code isn't correct?
test/CodeGen/AArch64/GlobalISel/arm64-irtranslator.ll
1193–1201	My guess is that non-power-of-2 vector sizes are too ill-supported to be able to write tests using them at the moment? I'm currently starting to look into supporting non-power-of-2-sized types - it's looking like it may be quite a bit of work.

I'm not sure we want to use G_SEQUENCE for this. It seems to me that we should figure out a representation for vectors in general before diving into constants.

lib/CodeGen/GlobalISel/IRTranslator.cpp
988	This isn't always a valid assumption; we can have things like v16i1. Make this size in bits instead?

This revision now requires changes to proceed.Feb 28 2017, 4:31 PM

dsanders added inline comments.Mar 1 2017, 1:45 AM

lib/CodeGen/GlobalISel/IRTranslator.cpp
988	This isn't always a valid assumption; we can have things like v16i1. v16i1 can't occur here. ConstantDataVector handles the common case vectors where the elements are i8/i16/i32/i64/half/float/double. The v16i1 case is handled by ConstantVector instead.

qcolombet added inline comments.Mar 7 2017, 12:03 PM

lib/CodeGen/GlobalISel/IRTranslator.cpp
988	A genuine question, does SD care about the endian distinction for building vectors? The reason I am asking is because I could see this being the target dealing with that at selection time, so I want to make sure we are consistent with whatever SD was doing.
990	For this case we should use G_MERGE.
test/CodeGen/AArch64/GlobalISel/arm64-irtranslator.ll
1193–1201	For the IRTranslator non-power-of-2 vectors should be fine. At least I don't see any difficulty for this specific pass.

dsanders added inline comments.Mar 7 2017, 3:40 PM

lib/CodeGen/GlobalISel/IRTranslator.cpp
988	I don't think it matters to the SelectionDAG representation but I believe it's important to the transition between SelectionDAG nodes to MachineInstrs. Mips was handling the bitconvert differences at instruction selection time. I'm not familiar with the PowerPC instruction set but it looks like PowerPC is doing some endian aware lowering of BUILD_VECTOR in PPCTargetLowering::LowerBUILD_VECTOR(). I suppose the key question to answer is: Do gMIR's virtual registers hold SelectionDAG values or physical register values?

volkan added inline comments.Mar 7 2017, 3:42 PM

lib/CodeGen/GlobalISel/IRTranslator.cpp
988	SelectionDAG uses the same order to build vector.

Use G_MERGE_VALUES to represent vectors.

volkan marked 2 inline comments as done.Mar 7 2017, 4:25 PM

LGTM + test case

test/CodeGen/AArch64/GlobalISel/arm64-irtranslator.ll
1193–1201	Could you add a non-power-of-2 test case?

Added a non-power-of-2 test.

volkan marked an inline comment as done.Mar 13 2017, 10:24 AM

This approach makes sense to me too, thanks folks! If no one else has a paragraph handy, I'll try to explain the whole post-G_SEQUENCE scheme in the docs page.

test/CodeGen/AArch64/GlobalISel/arm64-irtranslator.ll
1202–1210	Any reason for this test to involve an extractelement?

This revision is now accepted and ready to land.Mar 13 2017, 2:37 PM

volkan added inline comments.Mar 13 2017, 2:44 PM

test/CodeGen/AArch64/GlobalISel/arm64-irtranslator.ll
1202–1210	We are unable to translate `ret <3 x i32> <...>`, so I tried something else.

volkan closed this revision.Mar 13 2017, 2:48 PM

Revision Contents

Path

Size

lib/

CodeGen/

GlobalISel/

IRTranslator.cpp

11 lines

test/

CodeGen/

AArch64/

GlobalISel/

arm64-irtranslator.ll

10 lines

Diff 89242

lib/CodeGen/GlobalISel/IRTranslator.cpp

Show First 20 Lines • Show All 973 Lines • ▼ Show 20 Lines	bool IRTranslator::translate(const Constant &C, unsigned Reg) {
else if (auto CF = dyn_cast<ConstantFP>(&C))		else if (auto CF = dyn_cast<ConstantFP>(&C))
EntryBuilder.buildFConstant(Reg, *CF);		EntryBuilder.buildFConstant(Reg, *CF);
else if (isa<UndefValue>(C))		else if (isa<UndefValue>(C))
EntryBuilder.buildInstr(TargetOpcode::IMPLICIT_DEF).addDef(Reg);		EntryBuilder.buildInstr(TargetOpcode::IMPLICIT_DEF).addDef(Reg);
else if (isa<ConstantPointerNull>(C))		else if (isa<ConstantPointerNull>(C))
EntryBuilder.buildConstant(Reg, 0);		EntryBuilder.buildConstant(Reg, 0);
else if (auto GV = dyn_cast<GlobalValue>(&C))		else if (auto GV = dyn_cast<GlobalValue>(&C))
EntryBuilder.buildGlobalValue(Reg, GV);		EntryBuilder.buildGlobalValue(Reg, GV);
else if (auto CE = dyn_cast<ConstantExpr>(&C)) {		else if (auto CV = dyn_cast<ConstantDataVector>(&C)) {
		std::vector<unsigned> Ops;
		std::vector<uint64_t> Indices;
		for (unsigned i = 0; i < CV->getNumElements(); ++i) {
		Constant &Elt = *CV->getElementAsConstant(i);
		Ops.push_back(getOrCreateVReg(Elt));
		Indices.push_back(i * (CV->getElementByteSize() * 8));
		abUnsubmitted Not Done Reply Inline Actions This isn't always a valid assumption; we can have things like v16i1. Make this size in bits instead? ab: This isn't always a valid assumption; we can have things like v16i1. Make this size in bits…
		dsandersUnsubmitted Not Done Reply Inline Actions This isn't always a valid assumption; we can have things like v16i1. v16i1 can't occur here. ConstantDataVector handles the common case vectors where the elements are i8/i16/i32/i64/half/float/double. The v16i1 case is handled by ConstantVector instead. dsanders: > This isn't always a valid assumption; we can have things like v16i1. v16i1 can't occur here.
		dsandersUnsubmitted Not Done Reply Inline Actions This looks correct for ARM/AArch64/Mips, but I'm not sure this is correct for all targets. I'm thinking of big-endian targets where the highest-numbered element is stored at bit 0 (PowerPC?). dsanders: This looks correct for ARM/AArch64/Mips, but I'm not sure this is correct for all targets. I'm…
		kristof.beylsUnsubmitted Not Done Reply Inline Actions If this is not correct for some targets, there should at list be a FIXME here explaining in what circumstances the code isn't correct? kristof.beyls: If this is not correct for some targets, there should at list be a FIXME here explaining in…
		qcolombetUnsubmitted Not Done Reply Inline Actions A genuine question, does SD care about the endian distinction for building vectors? The reason I am asking is because I could see this being the target dealing with that at selection time, so I want to make sure we are consistent with whatever SD was doing. qcolombet: A genuine question, does SD care about the endian distinction for building vectors? The reason…
		dsandersUnsubmitted Not Done Reply Inline Actions I don't think it matters to the SelectionDAG representation but I believe it's important to the transition between SelectionDAG nodes to MachineInstrs. Mips was handling the bitconvert differences at instruction selection time. I'm not familiar with the PowerPC instruction set but it looks like PowerPC is doing some endian aware lowering of BUILD_VECTOR in PPCTargetLowering::LowerBUILD_VECTOR(). I suppose the key question to answer is: Do gMIR's virtual registers hold SelectionDAG values or physical register values? dsanders: I don't think it matters to the SelectionDAG representation but I believe it's important to the…
		volkanAuthorUnsubmitted Not Done Reply Inline Actions SelectionDAG uses the same order to build vector. volkan: SelectionDAG uses the same order to build vector.
		}
		EntryBuilder.buildSequence(Reg, Ops, Indices);
		qcolombetUnsubmitted Done Reply Inline Actions For this case we should use G_MERGE. qcolombet: For this case we should use G_MERGE.
		} else if (auto CE = dyn_cast<ConstantExpr>(&C)) {
switch(CE->getOpcode()) {		switch(CE->getOpcode()) {
#define HANDLE_INST(NUM, OPCODE, CLASS) \		#define HANDLE_INST(NUM, OPCODE, CLASS) \
case Instruction::OPCODE: return translate##OPCODE(*CE, EntryBuilder);		case Instruction::OPCODE: return translate##OPCODE(*CE, EntryBuilder);
#include "llvm/IR/Instruction.def"		#include "llvm/IR/Instruction.def"
default:		default:
if (!TPC->isGlobalISelAbortEnabled())		if (!TPC->isGlobalISelAbortEnabled())
return false;		return false;
llvm_unreachable("unknown opcode");		llvm_unreachable("unknown opcode");
▲ Show 20 Lines • Show All 113 Lines • Show Last 20 Lines

test/CodeGen/AArch64/GlobalISel/arm64-irtranslator.ll

Show First 20 Lines • Show All 1,183 Lines • ▼ Show 20 Lines	; CHECK: G_STORE [[V2]](s8), [[ADDR]](p0) :: (store singlethread monotonic 1 into %ir.addr)
%v1 = load atomic i8, i8* %addr acquire, align 1		%v1 = load atomic i8, i8* %addr acquire, align 1
store atomic i8 %v1, i8* %addr release, align 1		store atomic i8 %v1, i8* %addr release, align 1

%v2 = load atomic i8, i8* %addr singlethread seq_cst, align 1		%v2 = load atomic i8, i8* %addr singlethread seq_cst, align 1
store atomic i8 %v2, i8* %addr singlethread monotonic, align 1		store atomic i8 %v2, i8* %addr singlethread monotonic, align 1

ret void		ret void
}		}

		define <2 x i32> @test_constant_datavector() {
		; CHECK-LABEL: name: test_constant_datavector
		; CHECK: [[ONE:%[0-9]+]](s32) = G_CONSTANT i32 1
		; CHECK: [[TWO:%[0-9]+]](s32) = G_CONSTANT i32 2
		; CHECK: [[RES:%[0-9]+]](<2 x s32>) = G_SEQUENCE [[ONE]](s32), 0, [[TWO]](s32), 32
		; CHECK: %d0 = COPY [[RES]](<2 x s32>)
		entry:
		ret <2 x i32> <i32 1, i32 2>
		}
		dsandersUnsubmitted Done Reply Inline Actions I'd add a couple more tests, one for another vector size (preferably not a power of 2) and one for a different type and size like 'double'. dsanders: I'd add a couple more tests, one for another vector size (preferably not a power of 2) and one…
		kristof.beylsUnsubmitted Not Done Reply Inline Actions My guess is that non-power-of-2 vector sizes are too ill-supported to be able to write tests using them at the moment? I'm currently starting to look into supporting non-power-of-2-sized types - it's looking like it may be quite a bit of work. kristof.beyls: My guess is that non-power-of-2 vector sizes are too ill-supported to be able to write tests…
		qcolombetUnsubmitted Not Done Reply Inline Actions For the IRTranslator non-power-of-2 vectors should be fine. At least I don't see any difficulty for this specific pass. qcolombet: For the IRTranslator non-power-of-2 vectors should be fine. At least I don't see any difficulty…
		qcolombetUnsubmitted Done Reply Inline Actions Could you add a non-power-of-2 test case? qcolombet: Could you add a non-power-of-2 test case?
		abUnsubmitted Not Done Reply Inline Actions Any reason for this test to involve an extractelement? ab: Any reason for this test to involve an extractelement?
		volkanAuthorUnsubmitted Not Done Reply Inline Actions We are unable to translate `ret <3 x i32> <...>`, so I tried something else. volkan: We are unable to translate `ret <3 x i32> <...>`, so I tried something else.