This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/Target/PowerPC/
-
Target/
-
PowerPC/
-
AsmParser/
2/3
PPCAsmParser.cpp
-
Disassembler/
-
PPCDisassembler.cpp
2/6
PPCInstr64Bit.td
-
PPCInstrFormats.td
-
PPCInstrInfo.h
-
PPCInstrInfo.cpp
-
PPCInstrInfo.td
-
PPCRegisterInfo.h
2/2
PPCRegisterInfo.cpp
5/6
PPCRegisterInfo.td
1/1
PPCSchedule.td
-
test/
-
CodeGen/PowerPC/
-
PowerPC/
-
ldst-16-byte-asm.mir
2/3
ldst-16-byte.mir
-
MC/
-
Disassembler/PowerPC/
-
PowerPC/
-
ppc64-encoding-bookII.txt
-
ppc64-encoding.txt
-
PowerPC/
-
ppc64-encoding-bookII.s
-
ppc64-encoding.s

Differential D103010

[PowerPC] Export 16 byte load-store instructions
ClosedPublic

Authored by lkail on May 24 2021, 4:06 AM.

Download Raw Diff

Details

Reviewers

nemanjai
jsji
lei
shchenz
xingxue
jfb

Group Reviewers

Restricted Project

Commits

rG1c450c3d7ec0: [PowerPC] Export 16 byte load-store instructions

Summary

Export lq, stq, lqarx and stqcx. in preparation for implementing 16-byte lock free atomic operations on AIX.
Add a new register class g8prc for these instructions, since these instructions require even-odd register pair.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

lkail created this revision.May 24 2021, 4:06 AM

Herald added a reviewer: jfb. · View Herald TranscriptMay 24 2021, 4:06 AM

Herald added subscribers: jfb, kbarton, hiraditya. · View Herald Transcript

lkail requested review of this revision.May 24 2021, 4:06 AM

Herald added a project: Restricted Project. · View Herald TranscriptMay 24 2021, 4:06 AM

Herald added a subscriber: llvm-commits. · View Herald Transcript

lkail edited the summary of this revision. (Show Details)May 24 2021, 4:20 AM

Harbormaster completed remote builds in B105871: Diff 347344.May 24 2021, 4:36 AM

Fixed failed pre-merge tests.

Harbormaster completed remote builds in B105874: Diff 347352.May 24 2021, 5:45 AM

Although I am obviously not opposed to adding these instructions and the necessary register classes, we do have to devise a way to ensure RA doesn't allocate the same register to the base and result of LQ. An additional complication for this check is ensuring that if RA == 0, RTp != 0.

llvm/lib/Target/PowerPC/PPCRegisterInfo.cpp
118	This is likely problematic. What happens if we need to spill one of these at an offset from the stack pointer that doesn't fit in a DQ field? I imagine something breaks. These have to be handled differently in that we need to bump the base register rather than loading an immediate into a register and using it as an index. See `PPCRegisterInfo::eliminateFrameIndex()`
llvm/lib/Target/PowerPC/PPCRegisterInfo.td
23	These names are odd, they seem to suggest that there are pairs that are subregisters, but it is actually the individual registers. Maybe: sub_gp8_x0 sub_gp8_x1

we do have to devise a way to ensure RA doesn't allocate the same register to the base and result of LQ

One way I can figure out now is to have an additional pseudo instruction after LQ which uses the base register of LQ so that liveranges of base register and result register overlap.

we do have to devise a way to ensure RA doesn't allocate the same register to the base and result of LQ

Another approach for this might be adding an additional out $ea_result operand which can be considered pseudo for LQ. But there might be redundant copy. For example,

%0, %1 = LQ 128, %2(tied-def 1)
use %2

Solve LQ register constraint RTp != RA by adding early-clobber on RTp
Use doubleword load/store to perform quadword restore/spill

LQ and STQ are still needed for quadword atomic load/store.

lkail updated this revision to Diff 348199.May 27 2021, 3:08 AM

Harbormaster completed remote builds in B106465: Diff 348199.May 27 2021, 3:25 AM

nemanjai added inline comments.May 27 2021, 4:00 AM

llvm/lib/Target/PowerPC/PPCInstr64Bit.td
1259	nit: misaligned - move right by one more space.
llvm/lib/Target/PowerPC/PPCRegisterInfo.cpp
1151	Why use an X-Form for spilling/restoring rather than a D-Form the way all the other spilling code does? The FI eliminator should have what is needed to convert the D-Form to an X-Form (rather than relying on the pre-emit peephole which is what I assume is converting the X-Form to a D-Form in the attached test case).

Use DForm to keep consistency
Format fix

lkail marked 2 inline comments as done.May 27 2021, 4:13 AM

Harbormaster completed remote builds in B106482: Diff 348220.May 27 2021, 4:50 AM

Add encoding tests.

lkail updated this revision to Diff 348477.May 28 2021, 2:36 AM

Harbormaster completed remote builds in B106672: Diff 348477.May 28 2021, 3:15 AM

lkail added a child revision: D103445: [PowerPC][AIX][RFC] Generate inlined quadword lock free atomic operations.Jun 1 2021, 3:49 AM

lkail added a child revision: D103614: [PowerPC] Generate inlined quadword lock free atomic operations via AtomicExpand.Jun 3 2021, 6:37 AM

jsji added inline comments.Jun 9 2021, 2:29 PM

llvm/lib/Target/PowerPC/AsmParser/PPCAsmParser.cpp
297	This is wrong and not needed , should be removed -- there is NO instruction using Regnum/2 as reg index.
440	`isEvenRegNumber` is better, this is not specific to G8p, just a plain even regnum check.
441	This is wrong. We can have `lq 30, 128(4)` Should be something like: `isRegNumber() && ((getImm() & 1) == 0)`
llvm/lib/Target/PowerPC/PPCInstr64Bit.td
286	`isRecordForm` ? lqarx is not a recordform, stdcx. is a record form.
llvm/lib/Target/PowerPC/PPCRegisterInfo.td
199	This looks wrong. The encoding should be using original regnum, as in getG8pReg. eg: G8p1 is actually {X2,X3}, so the HW encoding should be 2/3, not 1.
201	Shouldn't define `Index` here, should be `<[-1, -1]>` -- G8p is reg number in llvm only, there is NO regnum defined visible to debugger. Defining here will mess up mapping table.
486	Add comments about why the allocation order is defined like this? And why we need a AltOrders for ELF?
llvm/lib/Target/PowerPC/PPCSchedule.td
46	nit: these are in alphabetic order , so these two should be moved down to after `IIC_LdStLMW`
llvm/test/CodeGen/PowerPC/ldst-16-byte.mir
3	How about powerpc32?

lkail added inline comments.Jun 9 2021, 6:54 PM

llvm/test/CodeGen/PowerPC/ldst-16-byte.mir

I guess it's not supported in powerpc32 mode, though I can't find where it is documented(It's documented in AIX assembly manual that ldarx and stdcx are not supported in powerpc32 mode). I've tried following on AIX-7.2

main:
  lq 2,128(4)
  stq 2,128(4)
  lqarx 2,3,4
  lqarx 2,3,4,1
  stqcx. 2,3,4

# as above code
as -a64 -mpwr8 enc.s
objdump -D a.out

It gives correct dump result.

0000000000000000 <.text>:
   0:   e0 44 00 80     lq      r2,128(r4)
   4:   f8 44 00 82     stq     r2,128(r4)
   8:   7c 43 22 28     lqarx   r2,r3,r4
   c:   7c 43 22 29     lqarx   r2,r3,r4,1
  10:   7c 43 21 6d     stqcx.  r2,r3,r4

But for as -a32 -mpwr8 enc.s, the result is not correct.

00000000 <.text>:
   0:   e0 44 00 80     lfq     f2,128(r4)
   4:   f8 44 00 82     .long 0xf8440082
   8:   7c 43 22 28     .long 0x7c432228
   c:   7c 43 22 29     .long 0x7c432229
  10:   7c 43 21 6d     .long 0x7c43216d

jsji added inline comments.Jun 9 2021, 7:10 PM

llvm/test/CodeGen/PowerPC/ldst-16-byte.mir
3	Thanks. If so, there opcode should be guarded with isPPC64 pred.

lkail added inline comments.Jun 9 2021, 11:44 PM

llvm/lib/Target/PowerPC/PPCInstr64Bit.td
286	This should be a historical issue(IIUC, to avoid decoding conflict), I'll leave `L(B\|W\|D\|Q)ARXL` as it is now and fix them in following patches.

Address comments.

lkail marked 8 inline comments as done.Jun 9 2021, 11:49 PM

lkail updated this revision to Diff 351078.Jun 9 2021, 11:52 PM

lkail marked an inline comment as done.Jun 9 2021, 11:54 PM

Harbormaster completed remote builds in B108550: Diff 351078.Jun 10 2021, 12:24 AM

nemanjai added inline comments.Jun 10 2021, 2:46 AM

llvm/lib/Target/PowerPC/PPCInstr64Bit.td
286	The `L` variant is a hinted variant and the hint sets the same bit as record-form instructions. So we just reuse that bit. There is no danger of `PPCInstrInfo::optimizeCompareInstr()` attempting to convert one to the other since this is not part of the `RecFormRel` relation. So I don't really think there is anything here that needs fixing.

LGTM.

This revision is now accepted and ready to land.Jun 10 2021, 2:51 AM

LGTM with some nits.

llvm/lib/Target/PowerPC/PPCInstr64Bit.td
286	Thanks for the background, yes, it is functionality correct here, but I think it is misleading and causing confusion about the semantic of `isRecordForm`. We should be able to achieve same goal about the encoding/decoding here without messing up with isRecordForm. But yeah, this can be done in a follow up patch.
1258	`memrix16` ? We are restoring using two ld, do we still require DQ offset?
llvm/lib/Target/PowerPC/PPCRegisterInfo.td
125	nit: This doesn't enforce the relationship between SubReg0 and SubReg1 in Pair. How about something like? // GP8Pair - Paired GP8. class GP8Pair<string n, bits<5> EvenIndex>: PPCReg<n>{ assert !eq(EvenIndex{0},0), "Index should be even"; let HWEncoding{4-0} = EvenIndex ; let SubRegs = [!cast<GP8>("X"#EvenIndex), !cast<GP8>("X"#!add(EvenIndex, 1))]; let DwarfNumbers = [-1,-1] ; let SubRegIndices = [sub_gp8_x0, sub_gp8_x1] ; } then we can define pair safely and simply like: // 16 paired even-odd consecutive GP8s. foreach Index = { 0, 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30 } in { def G8p#!srl(Index, 1) : GP8Pair<"r"#Index, Index>; }

lkail added inline comments.Jun 10 2021, 7:45 PM

llvm/lib/Target/PowerPC/PPCRegisterInfo.td
125	Nice one.

lkail updated this revision to Diff 351338.Jun 10 2021, 7:46 PM

Harbormaster completed remote builds in B108742: Diff 351338.Jun 10 2021, 8:17 PM

This revision was landed with ongoing or failed builds.Jun 14 2021, 6:56 PM

Closed by commit rG1c450c3d7ec0: [PowerPC] Export 16 byte load-store instructions (authored by lkail). · Explain Why

This revision was automatically updated to reflect the committed changes.

lkail added a commit: rG1c450c3d7ec0: [PowerPC] Export 16 byte load-store instructions.

jsji removed a child revision: D103614: [PowerPC] Generate inlined quadword lock free atomic operations via AtomicExpand.Jun 22 2021, 2:00 PM

Revision Contents

Path

Size

llvm/

lib/

Target/

PowerPC/

AsmParser/

PPCAsmParser.cpp

13 lines

Disassembler/

6 lines

44 lines

14 lines

17 lines

18 lines

6 lines

5 lines

61 lines

33 lines

4 lines

test/

CodeGen/

PowerPC/

ldst-16-byte-asm.mir

23 lines

ldst-16-byte.mir

221 lines

MC/

Disassembler/

PowerPC/

ppc64-encoding-bookII.txt

18 lines

ppc64-encoding.txt

12 lines

PowerPC/

ppc64-encoding-bookII.s

24 lines

ppc64-encoding.s

12 lines

Diff 351076

llvm/lib/Target/PowerPC/AsmParser/PPCAsmParser.cpp

Show First 20 Lines • Show All 288 Lines • ▼ Show 20 Lines	unsigned getACCReg() const {
return (unsigned) Imm.Val;		return (unsigned) Imm.Val;
}		}

unsigned getVSRpEvenReg() const {		unsigned getVSRpEvenReg() const {
assert(isVSRpEvenRegNumber() && "Invalid access!");		assert(isVSRpEvenRegNumber() && "Invalid access!");
return (unsigned) Imm.Val >> 1;		return (unsigned) Imm.Val >> 1;
}		}

		unsigned getG8pReg() const {
		jsjiUnsubmitted Done Reply Inline Actions This is wrong and not needed , should be removed -- there is NO instruction using Regnum/2 as reg index. jsji: This is wrong and not needed , should be removed -- there is NO instruction using Regnum/2 as…
		assert(isEvenRegNumber() && "Invalid access!");
		return (unsigned)Imm.Val;
		}

unsigned getCCReg() const {		unsigned getCCReg() const {
assert(isCCRegNumber() && "Invalid access!");		assert(isCCRegNumber() && "Invalid access!");
return (unsigned) (Kind == Immediate ? Imm.Val : Expr.CRVal);		return (unsigned) (Kind == Immediate ? Imm.Val : Expr.CRVal);
}		}

unsigned getCRBit() const {		unsigned getCRBit() const {
assert(isCRBitNumber() && "Invalid access!");		assert(isCRBitNumber() && "Invalid access!");
return (unsigned) (Kind == Immediate ? Imm.Val : Expr.CRVal);		return (unsigned) (Kind == Immediate ? Imm.Val : Expr.CRVal);
▲ Show 20 Lines • Show All 122 Lines • ▼ Show 20 Lines	public:
bool isCCRegNumber() const { return (Kind == Expression		bool isCCRegNumber() const { return (Kind == Expression
&& isUInt<3>(getExprCRVal())) \|\|		&& isUInt<3>(getExprCRVal())) \|\|
(Kind == Immediate		(Kind == Immediate
&& isUInt<3>(getImm())); }		&& isUInt<3>(getImm())); }
bool isCRBitNumber() const { return (Kind == Expression		bool isCRBitNumber() const { return (Kind == Expression
&& isUInt<5>(getExprCRVal())) \|\|		&& isUInt<5>(getExprCRVal())) \|\|
(Kind == Immediate		(Kind == Immediate
&& isUInt<5>(getImm())); }		&& isUInt<5>(getImm())); }

		jsjiUnsubmitted Done Reply Inline Actions `isEvenRegNumber` is better, this is not specific to G8p, just a plain even regnum check. jsji: `isEvenRegNumber` is better, this is not specific to G8p, just a plain even regnum check.
		bool isEvenRegNumber() const { return isRegNumber() && (getImm() & 1) == 0; }
		jsjiUnsubmitted Not Done Reply Inline Actions This is wrong. We can have `lq 30, 128(4)` Should be something like: `isRegNumber() && ((getImm() & 1) == 0)` jsji: This is wrong. We can have `lq 30, 128(4)` Should be something like: `isRegNumber() &&…

bool isCRBitMask() const { return Kind == Immediate && isUInt<8>(getImm()) &&		bool isCRBitMask() const { return Kind == Immediate && isUInt<8>(getImm()) &&
isPowerOf2_32(getImm()); }		isPowerOf2_32(getImm()); }
bool isATBitsAsHint() const { return false; }		bool isATBitsAsHint() const { return false; }
bool isMem() const override { return false; }		bool isMem() const override { return false; }
bool isReg() const override { return false; }		bool isReg() const override { return false; }

void addRegOperands(MCInst &Inst, unsigned N) const {		void addRegOperands(MCInst &Inst, unsigned N) const {
llvm_unreachable("addRegOperands");		llvm_unreachable("addRegOperands");
Show All 14 Lines	void addRegG8RCOperands(MCInst &Inst, unsigned N) const {
Inst.addOperand(MCOperand::createReg(XRegs[getReg()]));		Inst.addOperand(MCOperand::createReg(XRegs[getReg()]));
}		}

void addRegG8RCNoX0Operands(MCInst &Inst, unsigned N) const {		void addRegG8RCNoX0Operands(MCInst &Inst, unsigned N) const {
assert(N == 1 && "Invalid number of operands!");		assert(N == 1 && "Invalid number of operands!");
Inst.addOperand(MCOperand::createReg(XRegsNoX0[getReg()]));		Inst.addOperand(MCOperand::createReg(XRegsNoX0[getReg()]));
}		}

		void addRegG8pRCOperands(MCInst &Inst, unsigned N) const {
		assert(N == 1 && "Invalid number of operands!");
		Inst.addOperand(MCOperand::createReg(XRegs[getG8pReg()]));
		}

void addRegGxRCOperands(MCInst &Inst, unsigned N) const {		void addRegGxRCOperands(MCInst &Inst, unsigned N) const {
if (isPPC64())		if (isPPC64())
addRegG8RCOperands(Inst, N);		addRegG8RCOperands(Inst, N);
else		else
addRegGPRCOperands(Inst, N);		addRegGPRCOperands(Inst, N);
}		}

void addRegGxRCNoR0Operands(MCInst &Inst, unsigned N) const {		void addRegGxRCNoR0Operands(MCInst &Inst, unsigned N) const {
▲ Show 20 Lines • Show All 1,295 Lines • Show Last 20 Lines

llvm/lib/Target/PowerPC/Disassembler/PPCDisassembler.cpp

	Show First 20 Lines • Show All 154 Lines • ▼ Show 20 Lines
	}			}

	static DecodeStatus DecodeG8RCRegisterClass(MCInst &Inst, uint64_t RegNo,			static DecodeStatus DecodeG8RCRegisterClass(MCInst &Inst, uint64_t RegNo,
	uint64_t Address,			uint64_t Address,
	const void *Decoder) {			const void *Decoder) {
	return decodeRegisterClass(Inst, RegNo, XRegs);			return decodeRegisterClass(Inst, RegNo, XRegs);
	}			}

				static DecodeStatus DecodeG8pRCRegisterClass(MCInst &Inst, uint64_t RegNo,
				uint64_t Address,
				const void *Decoder) {
				return decodeRegisterClass(Inst, RegNo, XRegs);
				}

	static DecodeStatus DecodeG8RC_NOX0RegisterClass(MCInst &Inst, uint64_t RegNo,			static DecodeStatus DecodeG8RC_NOX0RegisterClass(MCInst &Inst, uint64_t RegNo,
	uint64_t Address,			uint64_t Address,
	const void *Decoder) {			const void *Decoder) {
	return decodeRegisterClass(Inst, RegNo, XRegsNoX0);			return decodeRegisterClass(Inst, RegNo, XRegsNoX0);
	}			}

	#define DecodePointerLikeRegClass0 DecodeGPRCRegisterClass			#define DecodePointerLikeRegClass0 DecodeGPRCRegisterClass
	#define DecodePointerLikeRegClass1 DecodeGPRC_NOR0RegisterClass			#define DecodePointerLikeRegClass1 DecodeGPRC_NOR0RegisterClass
	▲ Show 20 Lines • Show All 276 Lines • Show Last 20 Lines

llvm/lib/Target/PowerPC/PPCInstr64Bit.td

Show First 20 Lines • Show All 265 Lines • ▼ Show 20 Lines	def ATOMIC_SWAP_I64 : PPCCustomInserterPseudo<
(outs g8rc:$dst), (ins memrr:$ptr, g8rc:$new), "#ATOMIC_SWAP_I64",		(outs g8rc:$dst), (ins memrr:$ptr, g8rc:$new), "#ATOMIC_SWAP_I64",
[(set i64:$dst, (atomic_swap_64 ForceXForm:$ptr, i64:$new))]>;		[(set i64:$dst, (atomic_swap_64 ForceXForm:$ptr, i64:$new))]>;
}		}

// Instructions to support atomic operations		// Instructions to support atomic operations
let mayLoad = 1, hasSideEffects = 0 in {		let mayLoad = 1, hasSideEffects = 0 in {
def LDARX : XForm_1_memOp<31, 84, (outs g8rc:$rD), (ins memrr:$ptr),		def LDARX : XForm_1_memOp<31, 84, (outs g8rc:$rD), (ins memrr:$ptr),
"ldarx $rD, $ptr", IIC_LdStLDARX, []>;		"ldarx $rD, $ptr", IIC_LdStLDARX, []>;
		// TODO: Add scheduling info.
		let hasNoSchedulingInfo = 1 in
		def LQARX : XForm_1_memOp<31, 276, (outs g8prc:$RTp), (ins memrr:$ptr),
		"lqarx $RTp, $ptr", IIC_LdStLQARX, []>, isPPC64;

// Instruction to support lock versions of atomics		// Instruction to support lock versions of atomics
// (EH=1 - see Power ISA 2.07 Book II 4.4.2)		// (EH=1 - see Power ISA 2.07 Book II 4.4.2)
def LDARXL : XForm_1<31, 84, (outs g8rc:$rD), (ins memrr:$ptr),		def LDARXL : XForm_1<31, 84, (outs g8rc:$rD), (ins memrr:$ptr),
"ldarx $rD, $ptr, 1", IIC_LdStLDARX, []>, isRecordForm;		"ldarx $rD, $ptr, 1", IIC_LdStLDARX, []>, isRecordForm;
		// TODO: Add scheduling info.
		let hasNoSchedulingInfo = 1 in
		def LQARXL : XForm_1<31, 276, (outs g8prc:$RTp), (ins memrr:$ptr),
		"lqarx $RTp, $ptr, 1", IIC_LdStLQARX, []>,
		jsjiUnsubmitted Not Done Reply Inline Actions `isRecordForm` ? lqarx is not a recordform, stdcx. is a record form. jsji: `isRecordForm` ? lqarx is not a recordform, stdcx. is a record form.
		lkailAuthorUnsubmitted Done Reply Inline Actions This should be a historical issue(IIUC, to avoid decoding conflict), I'll leave `L(B\|W\|D\|Q)ARXL` as it is now and fix them in following patches. lkail: This should be a historical issue(IIUC, to avoid decoding conflict), I'll leave `L…
		nemanjaiUnsubmitted Not Done Reply Inline Actions The `L` variant is a hinted variant and the hint sets the same bit as record-form instructions. So we just reuse that bit. There is no danger of `PPCInstrInfo::optimizeCompareInstr()` attempting to convert one to the other since this is not part of the `RecFormRel` relation. So I don't really think there is anything here that needs fixing. nemanjai: The `L` variant is a hinted variant and the hint sets the same bit as record-form instructions.
		jsjiUnsubmitted Not Done Reply Inline Actions Thanks for the background, yes, it is functionality correct here, but I think it is misleading and causing confusion about the semantic of `isRecordForm`. We should be able to achieve same goal about the encoding/decoding here without messing up with isRecordForm. But yeah, this can be done in a follow up patch. jsji: Thanks for the background, yes, it is functionality correct here, but I think it is misleading…
		isPPC64, isRecordForm;

let hasExtraDefRegAllocReq = 1 in		let hasExtraDefRegAllocReq = 1 in
def LDAT : X_RD5_RS5_IM5<31, 614, (outs g8rc:$rD), (ins g8rc:$rA, u5imm:$FC),		def LDAT : X_RD5_RS5_IM5<31, 614, (outs g8rc:$rD), (ins g8rc:$rA, u5imm:$FC),
"ldat $rD, $rA, $FC", IIC_LdStLoad>, isPPC64,		"ldat $rD, $rA, $FC", IIC_LdStLoad>, isPPC64,
Requires<[IsISA3_0]>;		Requires<[IsISA3_0]>;
}		}

let Defs = [CR0], mayStore = 1, mayLoad = 0, hasSideEffects = 0 in		let Defs = [CR0], mayStore = 1, mayLoad = 0, hasSideEffects = 0 in {
def STDCX : XForm_1_memOp<31, 214, (outs), (ins g8rc:$rS, memrr:$dst),		def STDCX : XForm_1_memOp<31, 214, (outs), (ins g8rc:$rS, memrr:$dst),
"stdcx. $rS, $dst", IIC_LdStSTDCX, []>, isRecordForm;		"stdcx. $rS, $dst", IIC_LdStSTDCX, []>, isRecordForm;
		// TODO: Add scheduling info.
		let hasNoSchedulingInfo = 1 in
		def STQCX : XForm_1_memOp<31, 182, (outs), (ins g8prc:$RSp, memrr:$dst),
		"stqcx. $RSp, $dst", IIC_LdStSTQCX, []>,
		isPPC64, isRecordForm;
		}

let mayStore = 1, mayLoad = 0, hasSideEffects = 0 in		let mayStore = 1, mayLoad = 0, hasSideEffects = 0 in
def STDAT : X_RD5_RS5_IM5<31, 742, (outs), (ins g8rc:$rS, g8rc:$rA, u5imm:$FC),		def STDAT : X_RD5_RS5_IM5<31, 742, (outs), (ins g8rc:$rS, g8rc:$rA, u5imm:$FC),
"stdat $rS, $rA, $FC", IIC_LdStStore>, isPPC64,		"stdat $rS, $rA, $FC", IIC_LdStStore>, isPPC64,
Requires<[IsISA3_0]>;		Requires<[IsISA3_0]>;

let Interpretation64Bit = 1, isCodeGenOnly = 1 in {		let Interpretation64Bit = 1, isCodeGenOnly = 1 in {
let isCall = 1, isTerminator = 1, isReturn = 1, isBarrier = 1, Uses = [RM] in		let isCall = 1, isTerminator = 1, isReturn = 1, isBarrier = 1, Uses = [RM] in
▲ Show 20 Lines • Show All 926 Lines • ▼ Show 20 Lines	def LDUX : XForm_1_memOp<31, 53, (outs g8rc:$rD, ptr_rc_nor0:$ea_result),
"ldux $rD, $addr", IIC_LdStLDUX,		"ldux $rD, $addr", IIC_LdStLDUX,
[]>, RegConstraint<"$addr.ptrreg = $ea_result">,		[]>, RegConstraint<"$addr.ptrreg = $ea_result">,
NoEncode<"$ea_result">, isPPC64;		NoEncode<"$ea_result">, isPPC64;

def LDMX : XForm_1<31, 309, (outs g8rc:$rD), (ins memrr:$src),		def LDMX : XForm_1<31, 309, (outs g8rc:$rD), (ins memrr:$src),
"ldmx $rD, $src", IIC_LdStLD, []>, isPPC64,		"ldmx $rD, $src", IIC_LdStLD, []>, isPPC64,
Requires<[IsISA3_0]>;		Requires<[IsISA3_0]>;
}		}

		let mayLoad = 1, hasNoSchedulingInfo = 1 in {
		// Full 16-byte load.
		// Early clobber $RTp to avoid assigned to the same register as RA.
		// TODO: Add scheduling info.
		def LQ : DQForm_RTp5_RA17_MEM<56, 0,
		(outs g8prc:$RTp),
		(ins memrix16:$src),
		"lq $RTp, $src", IIC_LdStLQ,
		[]>,
		RegConstraint<"@earlyclobber $RTp">,
		isPPC64;
		def RESTORE_QUADWORD : PPCEmitTimePseudo<(outs g8prc:$RTp), (ins memrix16:$src),
		jsjiUnsubmitted Not Done Reply Inline Actions `memrix16` ? We are restoring using two ld, do we still require DQ offset? jsji: `memrix16` ? We are restoring using two ld, do we still require DQ offset?
		"#RESTORE_QUADWORD", []>;
		nemanjaiUnsubmitted Done Reply Inline Actions nit: misaligned - move right by one more space. nemanjai: nit: misaligned - move right by one more space.
		}

}		}

// Support for medium and large code model.		// Support for medium and large code model.
let hasSideEffects = 0 in {		let hasSideEffects = 0 in {
let isReMaterializable = 1 in {		let isReMaterializable = 1 in {
def ADDIStocHA8: PPCEmitTimePseudo<(outs g8rc:$rD), (ins g8rc_nox0:$reg, tocentry:$disp),		def ADDIStocHA8: PPCEmitTimePseudo<(outs g8rc:$rD), (ins g8rc_nox0:$reg, tocentry:$disp),
"#ADDIStocHA8", []>, isPPC64;		"#ADDIStocHA8", []>, isPPC64;
def ADDItocL: PPCEmitTimePseudo<(outs g8rc:$rD), (ins g8rc_nox0:$reg, tocentry:$disp),		def ADDItocL: PPCEmitTimePseudo<(outs g8rc:$rD), (ins g8rc_nox0:$reg, tocentry:$disp),
▲ Show 20 Lines • Show All 167 Lines • ▼ Show 20 Lines
def STDX : XForm_8_memOp<31, 149, (outs), (ins g8rc:$rS, memrr:$dst),		def STDX : XForm_8_memOp<31, 149, (outs), (ins g8rc:$rS, memrr:$dst),
"stdx $rS, $dst", IIC_LdStSTD,		"stdx $rS, $dst", IIC_LdStSTD,
[(store i64:$rS, XForm:$dst)]>, isPPC64,		[(store i64:$rS, XForm:$dst)]>, isPPC64,
PPC970_DGroup_Cracked;		PPC970_DGroup_Cracked;
def STDBRX: XForm_8_memOp<31, 660, (outs), (ins g8rc:$rS, memrr:$dst),		def STDBRX: XForm_8_memOp<31, 660, (outs), (ins g8rc:$rS, memrr:$dst),
"stdbrx $rS, $dst", IIC_LdStStore,		"stdbrx $rS, $dst", IIC_LdStStore,
[(PPCstbrx i64:$rS, ForceXForm:$dst, i64)]>, isPPC64,		[(PPCstbrx i64:$rS, ForceXForm:$dst, i64)]>, isPPC64,
PPC970_DGroup_Cracked;		PPC970_DGroup_Cracked;

		let mayStore = 1, hasNoSchedulingInfo = 1 in {
		// Normal 16-byte stores.
		// TODO: Add scheduling info.
		def STQ : DSForm_1<62, 2, (outs), (ins g8prc:$RSp, memrix:$dst),
		"stq $RSp, $dst", IIC_LdStSTQ,
		[]>, isPPC64;
		def SPILL_QUADWORD : PPCEmitTimePseudo<(outs), (ins g8prc:$RSp, memrix:$dst),
		"#SPILL_QUADWORD", []>;
		}

}		}

// Stores with Update (pre-inc).		// Stores with Update (pre-inc).
let PPC970_Unit = 2, mayStore = 1, mayLoad = 0 in {		let PPC970_Unit = 2, mayStore = 1, mayLoad = 0 in {
let Interpretation64Bit = 1, isCodeGenOnly = 1 in {		let Interpretation64Bit = 1, isCodeGenOnly = 1 in {
def STBU8 : DForm_1<39, (outs ptr_rc_nor0:$ea_res), (ins g8rc:$rS, memri:$dst),		def STBU8 : DForm_1<39, (outs ptr_rc_nor0:$ea_res), (ins g8rc:$rS, memri:$dst),
"stbu $rS, $dst", IIC_LdStSTU, []>,		"stbu $rS, $dst", IIC_LdStSTU, []>,
RegConstraint<"$dst.reg = $ea_res">, NoEncode<"$ea_res">;		RegConstraint<"$dst.reg = $ea_res">, NoEncode<"$ea_res">;
▲ Show 20 Lines • Show All 257 Lines • Show Last 20 Lines

llvm/lib/Target/PowerPC/PPCInstrFormats.td

Show First 20 Lines • Show All 416 Lines • ▼ Show 20 Lines	class DQ_RD6_RS5_DQ12<bits<6> opcode, bits<3> xo, dag OOL, dag IOL,

let Inst{6-10} = XT{4-0};		let Inst{6-10} = XT{4-0};
let Inst{11-15} = DS_RA{16-12}; // Register #		let Inst{11-15} = DS_RA{16-12}; // Register #
let Inst{16-27} = DS_RA{11-0}; // Displacement.		let Inst{16-27} = DS_RA{11-0}; // Displacement.
let Inst{28} = XT{5};		let Inst{28} = XT{5};
let Inst{29-31} = xo;		let Inst{29-31} = xo;
}		}

		class DQForm_RTp5_RA17_MEM<bits<6> opcode, bits<4> xo, dag OOL, dag IOL,
		string asmstr, InstrItinClass itin,
		list<dag> pattern>
		: I<opcode, OOL, IOL, asmstr, itin> {
		bits<5> RTp;
		bits<17> DQ_RA;
		let Pattern = pattern;

		let Inst{6-10} = RTp{4-0};
		let Inst{11-15} = DQ_RA{16-12}; // Register #
		let Inst{16-27} = DQ_RA{11-0}; // Displacement.
		let Inst{28-31} = xo;
		}

// 1.7.6 X-Form		// 1.7.6 X-Form
class XForm_base_r3xo<bits<6> opcode, bits<10> xo, dag OOL, dag IOL, string asmstr,		class XForm_base_r3xo<bits<6> opcode, bits<10> xo, dag OOL, dag IOL, string asmstr,
InstrItinClass itin, list<dag> pattern>		InstrItinClass itin, list<dag> pattern>
: I<opcode, OOL, IOL, asmstr, itin> {		: I<opcode, OOL, IOL, asmstr, itin> {
bits<5> RST;		bits<5> RST;
bits<5> A;		bits<5> A;
bits<5> B;		bits<5> B;

▲ Show 20 Lines • Show All 1,727 Lines • Show Last 20 Lines

llvm/lib/Target/PowerPC/PPCInstrInfo.h

Show First 20 Lines • Show All 121 Lines • ▼ Show 20 Lines	enum SpillOpcodeKey {
SOK_VSXVectorSpill,		SOK_VSXVectorSpill,
SOK_VectorFloat8Spill,		SOK_VectorFloat8Spill,
SOK_VectorFloat4Spill,		SOK_VectorFloat4Spill,
SOK_SpillToVSR,		SOK_SpillToVSR,
SOK_PairedVecSpill,		SOK_PairedVecSpill,
SOK_AccumulatorSpill,		SOK_AccumulatorSpill,
SOK_UAccumulatorSpill,		SOK_UAccumulatorSpill,
SOK_SPESpill,		SOK_SPESpill,
		SOK_PairedG8Spill,
SOK_LastOpcodeSpill // This must be last on the enum.		SOK_LastOpcodeSpill // This must be last on the enum.
};		};

// Define list of load and store spill opcodes.		// Define list of load and store spill opcodes.
#define NoInstr PPC::INSTRUCTION_LIST_END		#define NoInstr PPC::INSTRUCTION_LIST_END
#define Pwr8LoadOpcodes \		#define Pwr8LoadOpcodes \
{ \		{ \
PPC::LWZ, PPC::LD, PPC::LFD, PPC::LFS, PPC::RESTORE_CR, \		PPC::LWZ, PPC::LD, PPC::LFD, PPC::LFS, PPC::RESTORE_CR, \
PPC::RESTORE_CRBIT, PPC::LVX, PPC::LXVD2X, PPC::LXSDX, PPC::LXSSPX, \		PPC::RESTORE_CRBIT, PPC::LVX, PPC::LXVD2X, PPC::LXSDX, PPC::LXSSPX, \
PPC::SPILLTOVSR_LD, NoInstr, NoInstr, NoInstr, PPC::EVLDD \		PPC::SPILLTOVSR_LD, NoInstr, NoInstr, NoInstr, PPC::EVLDD, \
		PPC::RESTORE_QUADWORD \
}		}

#define Pwr9LoadOpcodes \		#define Pwr9LoadOpcodes \
{ \		{ \
PPC::LWZ, PPC::LD, PPC::LFD, PPC::LFS, PPC::RESTORE_CR, \		PPC::LWZ, PPC::LD, PPC::LFD, PPC::LFS, PPC::RESTORE_CR, \
PPC::RESTORE_CRBIT, PPC::LVX, PPC::LXV, PPC::DFLOADf64, \		PPC::RESTORE_CRBIT, PPC::LVX, PPC::LXV, PPC::DFLOADf64, \
PPC::DFLOADf32, PPC::SPILLTOVSR_LD, NoInstr, NoInstr, NoInstr, NoInstr \		PPC::DFLOADf32, PPC::SPILLTOVSR_LD, NoInstr, NoInstr, NoInstr, \
		NoInstr, PPC::RESTORE_QUADWORD \
}		}

#define Pwr10LoadOpcodes \		#define Pwr10LoadOpcodes \
{ \		{ \
PPC::LWZ, PPC::LD, PPC::LFD, PPC::LFS, PPC::RESTORE_CR, \		PPC::LWZ, PPC::LD, PPC::LFD, PPC::LFS, PPC::RESTORE_CR, \
PPC::RESTORE_CRBIT, PPC::LVX, PPC::LXV, PPC::DFLOADf64, \		PPC::RESTORE_CRBIT, PPC::LVX, PPC::LXV, PPC::DFLOADf64, \
PPC::DFLOADf32, PPC::SPILLTOVSR_LD, PPC::LXVP, PPC::RESTORE_ACC, \		PPC::DFLOADf32, PPC::SPILLTOVSR_LD, PPC::LXVP, PPC::RESTORE_ACC, \
PPC::RESTORE_UACC, NoInstr \		PPC::RESTORE_UACC, NoInstr, PPC::RESTORE_QUADWORD \
}		}

#define Pwr8StoreOpcodes \		#define Pwr8StoreOpcodes \
{ \		{ \
PPC::STW, PPC::STD, PPC::STFD, PPC::STFS, PPC::SPILL_CR, PPC::SPILL_CRBIT, \		PPC::STW, PPC::STD, PPC::STFD, PPC::STFS, PPC::SPILL_CR, PPC::SPILL_CRBIT, \
PPC::STVX, PPC::STXVD2X, PPC::STXSDX, PPC::STXSSPX, \		PPC::STVX, PPC::STXVD2X, PPC::STXSDX, PPC::STXSSPX, \
PPC::SPILLTOVSR_ST, NoInstr, NoInstr, NoInstr, PPC::EVSTDD \		PPC::SPILLTOVSR_ST, NoInstr, NoInstr, NoInstr, PPC::EVSTDD, \
		PPC::SPILL_QUADWORD \
}		}

#define Pwr9StoreOpcodes \		#define Pwr9StoreOpcodes \
{ \		{ \
PPC::STW, PPC::STD, PPC::STFD, PPC::STFS, PPC::SPILL_CR, PPC::SPILL_CRBIT, \		PPC::STW, PPC::STD, PPC::STFD, PPC::STFS, PPC::SPILL_CR, PPC::SPILL_CRBIT, \
PPC::STVX, PPC::STXV, PPC::DFSTOREf64, PPC::DFSTOREf32, \		PPC::STVX, PPC::STXV, PPC::DFSTOREf64, PPC::DFSTOREf32, \
PPC::SPILLTOVSR_ST, NoInstr, NoInstr, NoInstr, NoInstr \		PPC::SPILLTOVSR_ST, NoInstr, NoInstr, NoInstr, NoInstr, \
		PPC::SPILL_QUADWORD \
}		}

#define Pwr10StoreOpcodes \		#define Pwr10StoreOpcodes \
{ \		{ \
PPC::STW, PPC::STD, PPC::STFD, PPC::STFS, PPC::SPILL_CR, PPC::SPILL_CRBIT, \		PPC::STW, PPC::STD, PPC::STFD, PPC::STFS, PPC::SPILL_CR, PPC::SPILL_CRBIT, \
PPC::STVX, PPC::STXV, PPC::DFSTOREf64, PPC::DFSTOREf32, \		PPC::STVX, PPC::STXV, PPC::DFSTOREf64, PPC::DFSTOREf32, \
PPC::SPILLTOVSR_ST, PPC::STXVP, PPC::SPILL_ACC, PPC::SPILL_UACC, \		PPC::SPILLTOVSR_ST, PPC::STXVP, PPC::SPILL_ACC, PPC::SPILL_UACC, \
NoInstr \		NoInstr, PPC::SPILL_QUADWORD \
}		}

// Initialize arrays for load and store spill opcodes on supported subtargets.		// Initialize arrays for load and store spill opcodes on supported subtargets.
#define StoreOpcodesForSpill \		#define StoreOpcodesForSpill \
{ Pwr8StoreOpcodes, Pwr9StoreOpcodes, Pwr10StoreOpcodes }		{ Pwr8StoreOpcodes, Pwr9StoreOpcodes, Pwr10StoreOpcodes }
#define LoadOpcodesForSpill \		#define LoadOpcodesForSpill \
{ Pwr8LoadOpcodes, Pwr9LoadOpcodes, Pwr10LoadOpcodes }		{ Pwr8LoadOpcodes, Pwr9LoadOpcodes, Pwr10LoadOpcodes }

▲ Show 20 Lines • Show All 515 Lines • Show Last 20 Lines

llvm/lib/Target/PowerPC/PPCInstrInfo.cpp

Show First 20 Lines • Show All 1,828 Lines • ▼ Show 20 Lines	for (unsigned Idx = 0; Idx < 4; Idx++)
BuildMI(MBB, I, DL, get(PPC::XXLOR), VSLDestReg + Idx)		BuildMI(MBB, I, DL, get(PPC::XXLOR), VSLDestReg + Idx)
.addReg(VSLSrcReg + Idx)		.addReg(VSLSrcReg + Idx)
.addReg(VSLSrcReg + Idx, getKillRegState(KillSrc));		.addReg(VSLSrcReg + Idx, getKillRegState(KillSrc));
if (DestPrimed)		if (DestPrimed)
BuildMI(MBB, I, DL, get(PPC::XXMTACC), DestReg).addReg(DestReg);		BuildMI(MBB, I, DL, get(PPC::XXMTACC), DestReg).addReg(DestReg);
if (SrcPrimed && !KillSrc)		if (SrcPrimed && !KillSrc)
BuildMI(MBB, I, DL, get(PPC::XXMTACC), SrcReg).addReg(SrcReg);		BuildMI(MBB, I, DL, get(PPC::XXMTACC), SrcReg).addReg(SrcReg);
return;		return;
		} else if (PPC::G8pRCRegClass.contains(DestReg) &&
		PPC::G8pRCRegClass.contains(SrcReg)) {
		// TODO: Handle G8RC to G8pRC (and vice versa) copy.
		unsigned DestRegIdx = DestReg - PPC::G8p0;
		MCRegister DestRegSub0 = PPC::X0 + 2 * DestRegIdx;
		MCRegister DestRegSub1 = PPC::X0 + 2 * DestRegIdx + 1;
		unsigned SrcRegIdx = SrcReg - PPC::G8p0;
		MCRegister SrcRegSub0 = PPC::X0 + 2 * SrcRegIdx;
		MCRegister SrcRegSub1 = PPC::X0 + 2 * SrcRegIdx + 1;
		BuildMI(MBB, I, DL, get(PPC::OR8), DestRegSub0)
		.addReg(SrcRegSub0)
		.addReg(SrcRegSub0, getKillRegState(KillSrc));
		BuildMI(MBB, I, DL, get(PPC::OR8), DestRegSub1)
		.addReg(SrcRegSub1)
		.addReg(SrcRegSub1, getKillRegState(KillSrc));
		return;
} else		} else
llvm_unreachable("Impossible reg-to-reg copy");		llvm_unreachable("Impossible reg-to-reg copy");

const MCInstrDesc &MCID = get(Opc);		const MCInstrDesc &MCID = get(Opc);
if (MCID.getNumOperands() == 3)		if (MCID.getNumOperands() == 3)
BuildMI(MBB, I, DL, MCID, DestReg)		BuildMI(MBB, I, DL, MCID, DestReg)
.addReg(SrcReg).addReg(SrcReg, getKillRegState(KillSrc));		.addReg(SrcReg).addReg(SrcReg, getKillRegState(KillSrc));
else		else
Show All 36 Lines	unsigned PPCInstrInfo::getSpillIndex(const TargetRegisterClass *RC) const {
} else if (PPC::UACCRCRegClass.hasSubClassEq(RC)) {		} else if (PPC::UACCRCRegClass.hasSubClassEq(RC)) {
assert(Subtarget.pairedVectorMemops() &&		assert(Subtarget.pairedVectorMemops() &&
"Register unexpected when paired memops are disabled.");		"Register unexpected when paired memops are disabled.");
OpcodeIndex = SOK_UAccumulatorSpill;		OpcodeIndex = SOK_UAccumulatorSpill;
} else if (PPC::VSRpRCRegClass.hasSubClassEq(RC)) {		} else if (PPC::VSRpRCRegClass.hasSubClassEq(RC)) {
assert(Subtarget.pairedVectorMemops() &&		assert(Subtarget.pairedVectorMemops() &&
"Register unexpected when paired memops are disabled.");		"Register unexpected when paired memops are disabled.");
OpcodeIndex = SOK_PairedVecSpill;		OpcodeIndex = SOK_PairedVecSpill;
		} else if (PPC::G8pRCRegClass.hasSubClassEq(RC)) {
		OpcodeIndex = SOK_PairedG8Spill;
} else {		} else {
llvm_unreachable("Unknown regclass!");		llvm_unreachable("Unknown regclass!");
}		}
return OpcodeIndex;		return OpcodeIndex;
}		}

unsigned		unsigned
PPCInstrInfo::getStoreOpcodeForSpill(const TargetRegisterClass *RC) const {		PPCInstrInfo::getStoreOpcodeForSpill(const TargetRegisterClass *RC) const {
▲ Show 20 Lines • Show All 3,603 Lines • Show Last 20 Lines

llvm/lib/Target/PowerPC/PPCInstrInfo.td

Show First 20 Lines • Show All 629 Lines • ▼ Show 20 Lines	def gprc : RegisterOperand<GPRC> {
let ParserMatchClass = PPCRegGPRCAsmOperand;		let ParserMatchClass = PPCRegGPRCAsmOperand;
}		}
def PPCRegG8RCAsmOperand : AsmOperandClass {		def PPCRegG8RCAsmOperand : AsmOperandClass {
let Name = "RegG8RC"; let PredicateMethod = "isRegNumber";		let Name = "RegG8RC"; let PredicateMethod = "isRegNumber";
}		}
def g8rc : RegisterOperand<G8RC> {		def g8rc : RegisterOperand<G8RC> {
let ParserMatchClass = PPCRegG8RCAsmOperand;		let ParserMatchClass = PPCRegG8RCAsmOperand;
}		}
		def PPCRegG8pRCAsmOperand : AsmOperandClass {
		let Name = "RegG8pRC"; let PredicateMethod = "isEvenRegNumber";
		}
		def g8prc : RegisterOperand<G8pRC> {
		let ParserMatchClass = PPCRegG8pRCAsmOperand;
		}
def PPCRegGPRCNoR0AsmOperand : AsmOperandClass {		def PPCRegGPRCNoR0AsmOperand : AsmOperandClass {
let Name = "RegGPRCNoR0"; let PredicateMethod = "isRegNumber";		let Name = "RegGPRCNoR0"; let PredicateMethod = "isRegNumber";
}		}
def gprc_nor0 : RegisterOperand<GPRC_NOR0> {		def gprc_nor0 : RegisterOperand<GPRC_NOR0> {
let ParserMatchClass = PPCRegGPRCNoR0AsmOperand;		let ParserMatchClass = PPCRegGPRCNoR0AsmOperand;
}		}
def PPCRegG8RCNoX0AsmOperand : AsmOperandClass {		def PPCRegG8RCNoX0AsmOperand : AsmOperandClass {
let Name = "RegG8RCNoX0"; let PredicateMethod = "isRegNumber";		let Name = "RegG8RCNoX0"; let PredicateMethod = "isRegNumber";
▲ Show 20 Lines • Show All 4,762 Lines • Show Last 20 Lines

llvm/lib/Target/PowerPC/PPCRegisterInfo.h

Show First 20 Lines • Show All 117 Lines • ▼ Show 20 Lines	public:
void lowerCRBitRestore(MachineBasicBlock::iterator II,		void lowerCRBitRestore(MachineBasicBlock::iterator II,
unsigned FrameIndex) const;		unsigned FrameIndex) const;

void lowerACCSpilling(MachineBasicBlock::iterator II,		void lowerACCSpilling(MachineBasicBlock::iterator II,
unsigned FrameIndex) const;		unsigned FrameIndex) const;
void lowerACCRestore(MachineBasicBlock::iterator II,		void lowerACCRestore(MachineBasicBlock::iterator II,
unsigned FrameIndex) const;		unsigned FrameIndex) const;

		void lowerQuadwordSpilling(MachineBasicBlock::iterator II,
		unsigned FrameIndex) const;
		void lowerQuadwordRestore(MachineBasicBlock::iterator II,
		unsigned FrameIndex) const;

static void emitAccCopyInfo(MachineBasicBlock &MBB, MCRegister DestReg,		static void emitAccCopyInfo(MachineBasicBlock &MBB, MCRegister DestReg,
MCRegister SrcReg);		MCRegister SrcReg);

bool hasReservedSpillSlot(const MachineFunction &MF, Register Reg,		bool hasReservedSpillSlot(const MachineFunction &MF, Register Reg,
int &FrameIdx) const override;		int &FrameIdx) const override;
void eliminateFrameIndex(MachineBasicBlock::iterator II, int SPAdj,		void eliminateFrameIndex(MachineBasicBlock::iterator II, int SPAdj,
unsigned FIOperandNum,		unsigned FIOperandNum,
RegScavenger *RS = nullptr) const override;		RegScavenger *RS = nullptr) const override;
▲ Show 20 Lines • Show All 44 Lines • Show Last 20 Lines

llvm/lib/Target/PowerPC/PPCRegisterInfo.cpp

Show First 20 Lines • Show All 109 Lines • ▼ Show 20 Lines	PPCRegisterInfo::PPCRegisterInfo(const PPCTargetMachine &TM)
// 64-bit		// 64-bit
ImmToIdxMap[PPC::LHA8] = PPC::LHAX8; ImmToIdxMap[PPC::LBZ8] = PPC::LBZX8;		ImmToIdxMap[PPC::LHA8] = PPC::LHAX8; ImmToIdxMap[PPC::LBZ8] = PPC::LBZX8;
ImmToIdxMap[PPC::LHZ8] = PPC::LHZX8; ImmToIdxMap[PPC::LWZ8] = PPC::LWZX8;		ImmToIdxMap[PPC::LHZ8] = PPC::LHZX8; ImmToIdxMap[PPC::LWZ8] = PPC::LWZX8;
ImmToIdxMap[PPC::STB8] = PPC::STBX8; ImmToIdxMap[PPC::STH8] = PPC::STHX8;		ImmToIdxMap[PPC::STB8] = PPC::STBX8; ImmToIdxMap[PPC::STH8] = PPC::STHX8;
ImmToIdxMap[PPC::STW8] = PPC::STWX8; ImmToIdxMap[PPC::STDU] = PPC::STDUX;		ImmToIdxMap[PPC::STW8] = PPC::STWX8; ImmToIdxMap[PPC::STDU] = PPC::STDUX;
ImmToIdxMap[PPC::ADDI8] = PPC::ADD8;		ImmToIdxMap[PPC::ADDI8] = PPC::ADD8;

// VSX		// VSX
ImmToIdxMap[PPC::DFLOADf32] = PPC::LXSSPX;		ImmToIdxMap[PPC::DFLOADf32] = PPC::LXSSPX;
		nemanjaiUnsubmitted Done Reply Inline Actions This is likely problematic. What happens if we need to spill one of these at an offset from the stack pointer that doesn't fit in a DQ field? I imagine something breaks. These have to be handled differently in that we need to bump the base register rather than loading an immediate into a register and using it as an index. See `PPCRegisterInfo::eliminateFrameIndex()` nemanjai: This is likely problematic. What happens if we need to spill one of these at an offset from the…
ImmToIdxMap[PPC::DFLOADf64] = PPC::LXSDX;		ImmToIdxMap[PPC::DFLOADf64] = PPC::LXSDX;
ImmToIdxMap[PPC::SPILLTOVSR_LD] = PPC::SPILLTOVSR_LDX;		ImmToIdxMap[PPC::SPILLTOVSR_LD] = PPC::SPILLTOVSR_LDX;
ImmToIdxMap[PPC::SPILLTOVSR_ST] = PPC::SPILLTOVSR_STX;		ImmToIdxMap[PPC::SPILLTOVSR_ST] = PPC::SPILLTOVSR_STX;
ImmToIdxMap[PPC::DFSTOREf32] = PPC::STXSSPX;		ImmToIdxMap[PPC::DFSTOREf32] = PPC::STXSSPX;
ImmToIdxMap[PPC::DFSTOREf64] = PPC::STXSDX;		ImmToIdxMap[PPC::DFSTOREf64] = PPC::STXSDX;
ImmToIdxMap[PPC::LXV] = PPC::LXVX;		ImmToIdxMap[PPC::LXV] = PPC::LXVX;
ImmToIdxMap[PPC::LXSD] = PPC::LXSDX;		ImmToIdxMap[PPC::LXSD] = PPC::LXSDX;
ImmToIdxMap[PPC::LXSSP] = PPC::LXSSPX;		ImmToIdxMap[PPC::LXSSP] = PPC::LXSSPX;
▲ Show 20 Lines • Show All 1,000 Lines • ▼ Show 20 Lines	addFrameReference(BuildMI(MBB, II, DL, TII.get(PPC::LXVP), Reg + 1),
FrameIndex, IsLittleEndian ? 0 : 32);		FrameIndex, IsLittleEndian ? 0 : 32);
if (IsPrimed)		if (IsPrimed)
BuildMI(MBB, II, DL, TII.get(PPC::XXMTACC), DestReg).addReg(DestReg);		BuildMI(MBB, II, DL, TII.get(PPC::XXMTACC), DestReg).addReg(DestReg);

// Discard the pseudo instruction.		// Discard the pseudo instruction.
MBB.erase(II);		MBB.erase(II);
}		}

		/// lowerQuadwordSpilling - Generate code to spill paired general register.
		void PPCRegisterInfo::lowerQuadwordSpilling(MachineBasicBlock::iterator II,
		unsigned FrameIndex) const {
		MachineInstr &MI = *II;
		MachineBasicBlock &MBB = *MI.getParent();
		MachineFunction &MF = *MBB.getParent();
		const PPCSubtarget &Subtarget = MF.getSubtarget<PPCSubtarget>();
		const TargetInstrInfo &TII = *Subtarget.getInstrInfo();
		DebugLoc DL = MI.getDebugLoc();

		Register SrcReg = MI.getOperand(0).getReg();
		bool IsKilled = MI.getOperand(0).isKill();

		Register Reg = PPC::X0 + (SrcReg - PPC::G8p0) * 2;
		bool IsLittleEndian = Subtarget.isLittleEndian();

		addFrameReference(BuildMI(MBB, II, DL, TII.get(PPC::STD))
		nemanjaiUnsubmitted Done Reply Inline Actions Why use an X-Form for spilling/restoring rather than a D-Form the way all the other spilling code does? The FI eliminator should have what is needed to convert the D-Form to an X-Form (rather than relying on the pre-emit peephole which is what I assume is converting the X-Form to a D-Form in the attached test case). nemanjai: Why use an X-Form for spilling/restoring rather than a D-Form the way all the other spilling…
		.addReg(Reg, getKillRegState(IsKilled)),
		FrameIndex, IsLittleEndian ? 8 : 0);
		addFrameReference(BuildMI(MBB, II, DL, TII.get(PPC::STD))
		.addReg(Reg + 1, getKillRegState(IsKilled)),
		FrameIndex, IsLittleEndian ? 0 : 8);

		// Discard the pseudo instruction.
		MBB.erase(II);
		}

		/// lowerQuadwordRestore - Generate code to restore paired general register.
		void PPCRegisterInfo::lowerQuadwordRestore(MachineBasicBlock::iterator II,
		unsigned FrameIndex) const {
		MachineInstr &MI = *II;
		MachineBasicBlock &MBB = *MI.getParent();
		MachineFunction &MF = *MBB.getParent();
		const PPCSubtarget &Subtarget = MF.getSubtarget<PPCSubtarget>();
		const TargetInstrInfo &TII = *Subtarget.getInstrInfo();
		DebugLoc DL = MI.getDebugLoc();

		Register DestReg = MI.getOperand(0).getReg();
		assert(MI.definesRegister(DestReg) &&
		"RESTORE_QUADWORD does not define its destination");

		Register Reg = PPC::X0 + (DestReg - PPC::G8p0) * 2;
		bool IsLittleEndian = Subtarget.isLittleEndian();

		addFrameReference(BuildMI(MBB, II, DL, TII.get(PPC::LD), Reg), FrameIndex,
		IsLittleEndian ? 8 : 0);
		addFrameReference(BuildMI(MBB, II, DL, TII.get(PPC::LD), Reg + 1), FrameIndex,
		IsLittleEndian ? 0 : 8);

		// Discard the pseudo instruction.
		MBB.erase(II);
		}

bool PPCRegisterInfo::hasReservedSpillSlot(const MachineFunction &MF,		bool PPCRegisterInfo::hasReservedSpillSlot(const MachineFunction &MF,
Register Reg, int &FrameIdx) const {		Register Reg, int &FrameIdx) const {
// For the nonvolatile condition registers (CR2, CR3, CR4) return true to		// For the nonvolatile condition registers (CR2, CR3, CR4) return true to
// prevent allocating an additional frame slot.		// prevent allocating an additional frame slot.
// For 64-bit ELF and AIX, the CR save area is in the linkage area at SP+8,		// For 64-bit ELF and AIX, the CR save area is in the linkage area at SP+8,
// for 32-bit AIX the CR save area is in the linkage area at SP+4.		// for 32-bit AIX the CR save area is in the linkage area at SP+4.
// We have created a FrameIndex to that spill slot to keep the CalleSaveInfos		// We have created a FrameIndex to that spill slot to keep the CalleSaveInfos
// valid.		// valid.
Show All 20 Lines	static unsigned offsetMinAlignForOpcode(unsigned OpC) {
case PPC::DFLOADf32:		case PPC::DFLOADf32:
case PPC::DFLOADf64:		case PPC::DFLOADf64:
case PPC::DFSTOREf32:		case PPC::DFSTOREf32:
case PPC::DFSTOREf64:		case PPC::DFSTOREf64:
case PPC::LXSD:		case PPC::LXSD:
case PPC::LXSSP:		case PPC::LXSSP:
case PPC::STXSD:		case PPC::STXSD:
case PPC::STXSSP:		case PPC::STXSSP:
		case PPC::STQ:
return 4;		return 4;
case PPC::EVLDD:		case PPC::EVLDD:
case PPC::EVSTDD:		case PPC::EVSTDD:
return 8;		return 8;
case PPC::LXV:		case PPC::LXV:
case PPC::STXV:		case PPC::STXV:
		case PPC::LQ:
return 16;		return 16;
}		}
}		}

// If the offset must be a multiple of some value, return what that value is.		// If the offset must be a multiple of some value, return what that value is.
static unsigned offsetMinAlign(const MachineInstr &MI) {		static unsigned offsetMinAlign(const MachineInstr &MI) {
unsigned OpC = MI.getOpcode();		unsigned OpC = MI.getOpcode();
return offsetMinAlignForOpcode(OpC);		return offsetMinAlignForOpcode(OpC);
▲ Show 20 Lines • Show All 79 Lines • ▼ Show 20 Lines	if (OpC == PPC::SPILL_CR) {
lowerCRBitRestore(II, FrameIndex);		lowerCRBitRestore(II, FrameIndex);
return;		return;
} else if (OpC == PPC::SPILL_ACC \|\| OpC == PPC::SPILL_UACC) {		} else if (OpC == PPC::SPILL_ACC \|\| OpC == PPC::SPILL_UACC) {
lowerACCSpilling(II, FrameIndex);		lowerACCSpilling(II, FrameIndex);
return;		return;
} else if (OpC == PPC::RESTORE_ACC \|\| OpC == PPC::RESTORE_UACC) {		} else if (OpC == PPC::RESTORE_ACC \|\| OpC == PPC::RESTORE_UACC) {
lowerACCRestore(II, FrameIndex);		lowerACCRestore(II, FrameIndex);
return;		return;
		} else if (OpC == PPC::SPILL_QUADWORD) {
		lowerQuadwordSpilling(II, FrameIndex);
		return;
		} else if (OpC == PPC::RESTORE_QUADWORD) {
		lowerQuadwordRestore(II, FrameIndex);
		return;
}		}

// Replace the FrameIndex with base register with GPR1 (SP) or GPR31 (FP).		// Replace the FrameIndex with base register with GPR1 (SP) or GPR31 (FP).
MI.getOperand(FIOperandNum).ChangeToRegister(		MI.getOperand(FIOperandNum).ChangeToRegister(
FrameIndex < 0 ? getBaseRegister(MF) : getFrameRegister(MF), false);		FrameIndex < 0 ? getBaseRegister(MF) : getFrameRegister(MF), false);

// If the instruction is not present in ImmToIdxMap, then it has no immediate		// If the instruction is not present in ImmToIdxMap, then it has no immediate
// form (and must be r+r).		// form (and must be r+r).
▲ Show 20 Lines • Show All 234 Lines • Show Last 20 Lines

llvm/lib/Target/PowerPC/PPCRegisterInfo.td

Show All 14 Lines
def sub_eq : SubRegIndex<1, 2>;		def sub_eq : SubRegIndex<1, 2>;
def sub_un : SubRegIndex<1, 3>;		def sub_un : SubRegIndex<1, 3>;
def sub_32 : SubRegIndex<32>;		def sub_32 : SubRegIndex<32>;
def sub_64 : SubRegIndex<64>;		def sub_64 : SubRegIndex<64>;
def sub_vsx0 : SubRegIndex<128>;		def sub_vsx0 : SubRegIndex<128>;
def sub_vsx1 : SubRegIndex<128, 128>;		def sub_vsx1 : SubRegIndex<128, 128>;
def sub_pair0 : SubRegIndex<256>;		def sub_pair0 : SubRegIndex<256>;
def sub_pair1 : SubRegIndex<256, 256>;		def sub_pair1 : SubRegIndex<256, 256>;
		def sub_gp8_x0 : SubRegIndex<64>;
		nemanjaiUnsubmitted Done Reply Inline Actions These names are odd, they seem to suggest that there are pairs that are subregisters, but it is actually the individual registers. Maybe: sub_gp8_x0 sub_gp8_x1 nemanjai: These names are odd, they seem to suggest that there are pairs that are subregisters, but it is…
		def sub_gp8_x1 : SubRegIndex<64, 64>;
}		}


class PPCReg<string n> : Register<n> {		class PPCReg<string n> : Register<n> {
let Namespace = "PPC";		let Namespace = "PPC";
}		}

// We identify all our registers with a 5-bit ID, for consistency's sake.		// We identify all our registers with a 5-bit ID, for consistency's sake.
▲ Show 20 Lines • Show All 83 Lines • ▼ Show 20 Lines
}		}

// VSR Pairs - One of the 32 paired even-odd consecutive VSRs.		// VSR Pairs - One of the 32 paired even-odd consecutive VSRs.
class VSRPair<bits<5> num, string n, list<Register> subregs> : PPCReg<n> {		class VSRPair<bits<5> num, string n, list<Register> subregs> : PPCReg<n> {
let HWEncoding{4-0} = num;		let HWEncoding{4-0} = num;
let SubRegs = subregs;		let SubRegs = subregs;
}		}

		// GP8Pair - Paired GP8.
		class GP8Pair<string n, GP8 SubReg0, GP8 SubReg1> : PPCReg<n> {
		jsjiUnsubmitted Not Done Reply Inline Actions nit: This doesn't enforce the relationship between SubReg0 and SubReg1 in Pair. How about something like? // GP8Pair - Paired GP8. class GP8Pair<string n, bits<5> EvenIndex>: PPCReg<n>{ assert !eq(EvenIndex{0},0), "Index should be even"; let HWEncoding{4-0} = EvenIndex ; let SubRegs = [!cast<GP8>("X"#EvenIndex), !cast<GP8>("X"#!add(EvenIndex, 1))]; let DwarfNumbers = [-1,-1] ; let SubRegIndices = [sub_gp8_x0, sub_gp8_x1] ; } then we can define pair safely and simply like: // 16 paired even-odd consecutive GP8s. foreach Index = { 0, 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30 } in { def G8p#!srl(Index, 1) : GP8Pair<"r"#Index, Index>; } jsji: nit: This doesn't enforce the relationship between SubReg0 and SubReg1 in Pair. How about…
		lkailAuthorUnsubmitted Done Reply Inline Actions Nice one. lkail: Nice one.
		let HWEncoding = SubReg0.HWEncoding;
		let SubRegs = [SubReg0, SubReg1];
		}

// General-purpose registers		// General-purpose registers
foreach Index = 0-31 in {		foreach Index = 0-31 in {
def R#Index : GPR<Index, "r"#Index>, DwarfRegNum<[-2, Index]>;		def R#Index : GPR<Index, "r"#Index>, DwarfRegNum<[-2, Index]>;
}		}

// 64-bit General-purpose registers		// 64-bit General-purpose registers
foreach Index = 0-31 in {		foreach Index = 0-31 in {
def X#Index : GP8<!cast<GPR>("R"#Index), "r"#Index>,		def X#Index : GP8<!cast<GPR>("R"#Index), "r"#Index>,
▲ Show 20 Lines • Show All 50 Lines • ▼ Show 20 Lines	let SubRegIndices = [sub_vsx0, sub_vsx1] in {
foreach Index = { 0, 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30 } in {		foreach Index = { 0, 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30 } in {
def VSRp#!add(!srl(Index, 1), 16) :		def VSRp#!add(!srl(Index, 1), 16) :
VSRPair<!add(!srl(Index, 1), 16), "vsp"#!add(Index, 32),		VSRPair<!add(!srl(Index, 1), 16), "vsp"#!add(Index, 32),
[!cast<VR>("V"#Index), !cast<VR>("V"#!add(Index, 1))]>,		[!cast<VR>("V"#Index), !cast<VR>("V"#!add(Index, 1))]>,
DwarfRegNum<[-1, -1]>;		DwarfRegNum<[-1, -1]>;
}		}
}		}

		// 16 paired even-odd consecutive GP8s.
		let SubRegIndices = [sub_gp8_x0, sub_gp8_x1] in {
		foreach Index = { 0, 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30 } in {
		def G8p#!srl(Index, 1) :
		jsjiUnsubmitted Done Reply Inline Actions This looks wrong. The encoding should be using original regnum, as in getG8pReg. eg: G8p1 is actually {X2,X3}, so the HW encoding should be 2/3, not 1. jsji: This looks wrong. The encoding should be using original regnum, as in getG8pReg. eg: G8p1 is…
		GP8Pair<"r"#Index,
		!cast<GP8>("X"#Index), !cast<GP8>("X"#!add(Index, 1))>,
		jsjiUnsubmitted Done Reply Inline Actions Shouldn't define `Index` here, should be `<[-1, -1]>` -- G8p is reg number in llvm only, there is NO regnum defined visible to debugger. Defining here will mess up mapping table. jsji: Shouldn't define `Index` here, should be `<[-1, -1]>` -- G8p is reg number in llvm only, there…
		DwarfRegNum<[-1, -1]>;
		}
		}

// The representation of r0 when treated as the constant 0.		// The representation of r0 when treated as the constant 0.
def ZERO : GPR<0, "0">, DwarfRegAlias<R0>;		def ZERO : GPR<0, "0">, DwarfRegAlias<R0>;
def ZERO8 : GP8<ZERO, "0">, DwarfRegAlias<X0>;		def ZERO8 : GP8<ZERO, "0">, DwarfRegAlias<X0>;

// Representations of the frame pointer used by ISD::FRAMEADDR.		// Representations of the frame pointer used by ISD::FRAMEADDR.
def FP : GPR<0 /* arbitrary /, "FRAME POINTER*">;		def FP : GPR<0 /* arbitrary /, "FRAME POINTER*">;
def FP8 : GP8<FP, "FRAME POINTER">;		def FP8 : GP8<FP, "FRAME POINTER">;

▲ Show 20 Lines • Show All 261 Lines • ▼ Show 20 Lines	def VSRpRC :
RegisterClass<"PPC", [v256i1], 128,		RegisterClass<"PPC", [v256i1], 128,
(add (sequence "VSRp%u", 0, 6),		(add (sequence "VSRp%u", 0, 6),
(sequence "VSRp%u", 15, 7), VSRp17, VSRp18,		(sequence "VSRp%u", 15, 7), VSRp17, VSRp18,
VSRp16, VSRp19, VSRp20, VSRp21, VSRp22, VSRp23,		VSRp16, VSRp19, VSRp20, VSRp21, VSRp22, VSRp23,
VSRp24, VSRp25, VSRp31, VSRp30, VSRp29, VSRp28,		VSRp24, VSRp25, VSRp31, VSRp30, VSRp29, VSRp28,
VSRp27, VSRp26)> {		VSRp27, VSRp26)> {
let Size = 256;		let Size = 256;
}		}

		// Make AllocationOrder as similar as G8RC's to avoid potential spilling.
		// Similarly, we have an AltOrder for 64-bit ELF ABI which r2 is allocated
		// at last.
		jsjiUnsubmitted Done Reply Inline Actions Add comments about why the allocation order is defined like this? And why we need a AltOrders for ELF? jsji: Add comments about why the allocation order is defined like this? And why we need a AltOrders…
		def G8pRC :
		RegisterClass<"PPC", [i128], 128,
		(add (sequence "G8p%u", 1, 5),
		(sequence "G8p%u", 14, 7),
		G8p15, G8p6, G8p0)> {
		let AltOrders = [(add (sub G8pRC, G8p1), G8p1)];
		let AltOrderSelect = [{
		return MF.getSubtarget<PPCSubtarget>().is64BitELFABI();
		}];
		let Size = 128;
		}

llvm/lib/Target/PowerPC/PPCSchedule.td

	Show All 37 Lines
	def IIC_LdStDCBF : InstrItinClass;			def IIC_LdStDCBF : InstrItinClass;
	def IIC_LdStDCBI : InstrItinClass;			def IIC_LdStDCBI : InstrItinClass;
	def IIC_LdStLoad : InstrItinClass;			def IIC_LdStLoad : InstrItinClass;
	def IIC_LdStLoadUpd : InstrItinClass;			def IIC_LdStLoadUpd : InstrItinClass;
	def IIC_LdStLoadUpdX : InstrItinClass;			def IIC_LdStLoadUpdX : InstrItinClass;
	def IIC_LdStStore : InstrItinClass;			def IIC_LdStStore : InstrItinClass;
	def IIC_LdStDSS : InstrItinClass;			def IIC_LdStDSS : InstrItinClass;
	def IIC_LdStICBI : InstrItinClass;			def IIC_LdStICBI : InstrItinClass;
	def IIC_LdStLD : InstrItinClass;			def IIC_LdStLD : InstrItinClass;
				jsjiUnsubmitted Done Reply Inline Actions nit: these are in alphabetic order , so these two should be moved down to after `IIC_LdStLMW` jsji: nit: these are in alphabetic order , so these two should be moved down to after `IIC_LdStLMW`
	def IIC_LdStLDU : InstrItinClass;			def IIC_LdStLDU : InstrItinClass;
	def IIC_LdStLDUX : InstrItinClass;			def IIC_LdStLDUX : InstrItinClass;
	def IIC_LdStLDARX : InstrItinClass;			def IIC_LdStLDARX : InstrItinClass;
	def IIC_LdStLFD : InstrItinClass;			def IIC_LdStLFD : InstrItinClass;
	def IIC_LdStLFDU : InstrItinClass;			def IIC_LdStLFDU : InstrItinClass;
	def IIC_LdStLFDUX : InstrItinClass;			def IIC_LdStLFDUX : InstrItinClass;
	def IIC_LdStLHA : InstrItinClass;			def IIC_LdStLHA : InstrItinClass;
	def IIC_LdStLHAU : InstrItinClass;			def IIC_LdStLHAU : InstrItinClass;
	def IIC_LdStLHAUX : InstrItinClass;			def IIC_LdStLHAUX : InstrItinClass;
	def IIC_LdStLMW : InstrItinClass;			def IIC_LdStLMW : InstrItinClass;
				def IIC_LdStLQ : InstrItinClass;
				def IIC_LdStLQARX : InstrItinClass;
	def IIC_LdStLVecX : InstrItinClass;			def IIC_LdStLVecX : InstrItinClass;
	def IIC_LdStLWA : InstrItinClass;			def IIC_LdStLWA : InstrItinClass;
	def IIC_LdStLWARX : InstrItinClass;			def IIC_LdStLWARX : InstrItinClass;
	def IIC_LdStSLBIA : InstrItinClass;			def IIC_LdStSLBIA : InstrItinClass;
	def IIC_LdStSLBIE : InstrItinClass;			def IIC_LdStSLBIE : InstrItinClass;
	def IIC_LdStSTD : InstrItinClass;			def IIC_LdStSTD : InstrItinClass;
	def IIC_LdStSTDCX : InstrItinClass;			def IIC_LdStSTDCX : InstrItinClass;
				def IIC_LdStSTQ : InstrItinClass;
				def IIC_LdStSTQCX : InstrItinClass;
	def IIC_LdStSTU : InstrItinClass;			def IIC_LdStSTU : InstrItinClass;
	def IIC_LdStSTUX : InstrItinClass;			def IIC_LdStSTUX : InstrItinClass;
	def IIC_LdStSTFD : InstrItinClass;			def IIC_LdStSTFD : InstrItinClass;
	def IIC_LdStSTFDU : InstrItinClass;			def IIC_LdStSTFDU : InstrItinClass;
	def IIC_LdStSTVEBX : InstrItinClass;			def IIC_LdStSTVEBX : InstrItinClass;
	def IIC_LdStSTWCX : InstrItinClass;			def IIC_LdStSTWCX : InstrItinClass;
	def IIC_LdStSync : InstrItinClass;			def IIC_LdStSync : InstrItinClass;
	def IIC_LdStCOPY : InstrItinClass;			def IIC_LdStCOPY : InstrItinClass;
	▲ Show 20 Lines • Show All 68 Lines • Show Last 20 Lines

llvm/test/CodeGen/PowerPC/ldst-16-byte-asm.mir

This file was added.

				# RUN: llc -simplify-mir -verify-machineinstrs -mtriple=powerpc64-ibm-aix-xcoff \
				# RUN: %s -o - \| FileCheck %s

				---
				name: foo
				alignment: 8
				tracksRegLiveness: true
				body: \|
				bb.0.entry:
				liveins: $x3, $x4, $x30, $x31
				; CHECK-LABEL: .foo
				; CHECK: lq 2, 128(4)
				; CHECK: lqarx 28, 30, 31
				; CHECK: stqcx. 28, 30, 31
				; CHECK: stq 2, 128(4)
				$g8p1 = LQ 128, $x4
				$g8p14 = LQARX $x30, $x31
				STQCX $g8p14, $x30, $x31, implicit-def $cr0
				STQ $g8p1, 128, $x4
				$x3 = COPY $x31
				BLR8 implicit $lr8, implicit undef $rm, implicit $x3
				...

llvm/test/CodeGen/PowerPC/ldst-16-byte.mir

This file was added.

				# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py
				# RUN: llc -simplify-mir -verify-machineinstrs -mtriple=powerpc64-ibm-aix-xcoff \
				# RUN: -stop-after=postrapseudos %s -o - \| FileCheck %s
				jsjiUnsubmitted Not Done Reply Inline Actions How about powerpc32? jsji: How about powerpc32?
				lkailAuthorUnsubmitted Done Reply Inline Actions I guess it's not supported in powerpc32 mode, though I can't find where it is documented(It's documented in AIX assembly manual that `ldarx` and `stdcx` are not supported in powerpc32 mode). I've tried following on AIX-7.2 main: lq 2,128(4) stq 2,128(4) lqarx 2,3,4 lqarx 2,3,4,1 stqcx. 2,3,4 # as above code as -a64 -mpwr8 enc.s objdump -D a.out It gives correct dump result. 0000000000000000 <.text>: 0: e0 44 00 80 lq r2,128(r4) 4: f8 44 00 82 stq r2,128(r4) 8: 7c 43 22 28 lqarx r2,r3,r4 c: 7c 43 22 29 lqarx r2,r3,r4,1 10: 7c 43 21 6d stqcx. r2,r3,r4 But for `as -a32 -mpwr8 enc.s`, the result is not correct. 00000000 <.text>: 0: e0 44 00 80 lfq f2,128(r4) 4: f8 44 00 82 .long 0xf8440082 8: 7c 43 22 28 .long 0x7c432228 c: 7c 43 22 29 .long 0x7c432229 10: 7c 43 21 6d .long 0x7c43216d lkail: I guess it's not supported in powerpc32 mode, though I can't find where it is documented(It's…
				jsjiUnsubmitted Done Reply Inline Actions Thanks. If so, there opcode should be guarded with isPPC64 pred. jsji: Thanks. If so, there opcode should be guarded with isPPC64 pred.

				---
				name: foo
				alignment: 8
				tracksRegLiveness: true
				body: \|
				bb.0.entry:
				liveins: $x3, $x4
				; CHECK-LABEL: name: foo
				; CHECK: liveins: $x3, $x4
				; CHECK: early-clobber renamable $g8p3 = LQ 128, $x4
				; CHECK: $x3 = OR8 $x7, $x7
				; CHECK: STQ killed renamable $g8p3, 160, $x3
				; CHECK: BLR8 implicit $lr8, implicit undef $rm, implicit $x3
				%0:g8prc = LQ 128, $x4
				$x3 = COPY %0.sub_gp8_x1:g8prc
				STQ %0, 160, $x3
				BLR8 implicit $lr8, implicit undef $rm, implicit $x3
				...

				---
				name: foobar
				alignment: 8
				tracksRegLiveness: true
				body: \|
				bb.0.entry:
				liveins: $x3, $x4
				; CHECK-LABEL: name: foobar
				; CHECK: liveins: $x3, $x4
				; CHECK: renamable $g8p3 = LQARX $x3, $x4
				; CHECK: STQCX renamable $g8p3, $x3, $x4, implicit-def dead $cr0
				; CHECK: $x3 = OR8 $x7, $x7, implicit killed $g8p3
				; CHECK: BLR8 implicit $lr8, implicit undef $rm, implicit $x3
				%0:g8prc = LQARX $x3, $x4
				STQCX %0:g8prc, $x3, $x4, implicit-def $cr0
				$x3 = COPY %0.sub_gp8_x1:g8prc
				BLR8 implicit $lr8, implicit undef $rm, implicit $x3
				...

				---
				name: bar
				alignment: 8
				tracksRegLiveness: true
				body: \|
				bb.0.entry:
				liveins: $x3, $x4
				; CHECK-LABEL: name: bar
				; CHECK: liveins: $x3, $x4
				; CHECK: early-clobber renamable $g8p2 = LQ 128, renamable $x3
				; CHECK: STQ renamable $g8p2, 160, $x3
				; CHECK: $x3 = OR8 $x4, $x4, implicit killed $g8p2
				; CHECK: BLR8 implicit $lr8, implicit undef $rm, implicit $x3
				%0:g8rc_nox0 = COPY $x3
				%1:g8prc = LQ 128, %0
				STQ %1, 160, $x3
				$x3 = COPY %1.sub_gp8_x0:g8prc
				BLR8 implicit $lr8, implicit undef $rm, implicit $x3
				...

				# `spill_g8prc`'s code contains undefined behaviors. This code is only to
				# demonstrate correctness of spiller.
				---
				name: spill_g8prc
				alignment: 8
				tracksRegLiveness: true
				body: \|
				bb.0.entry:
				liveins: $x3, $x4, $x5, $x6, $x7, $x8, $x9, $x10, $x11, $x12
				; CHECK-LABEL: name: spill_g8prc
				; CHECK: liveins: $x3, $x4, $x5, $x6, $x7, $x8, $x9, $x10, $x11, $x12, $x14, $x15, $x16, $x17, $x18, $x19, $x20, $x21, $x22, $x23, $x24, $x25, $x26, $x27, $x28, $x29, $x30, $x31
				; CHECK: STD killed $x14, -144, $x1 :: (store 8 into %fixed-stack.17, align 16)
				; CHECK: STD killed $x15, -136, $x1 :: (store 8 into %fixed-stack.16)
				; CHECK: STD killed $x16, -128, $x1 :: (store 8 into %fixed-stack.15, align 16)
				; CHECK: STD killed $x17, -120, $x1 :: (store 8 into %fixed-stack.14)
				; CHECK: STD killed $x18, -112, $x1 :: (store 8 into %fixed-stack.13, align 16)
				; CHECK: STD killed $x19, -104, $x1 :: (store 8 into %fixed-stack.12)
				; CHECK: STD killed $x20, -96, $x1 :: (store 8 into %fixed-stack.11, align 16)
				; CHECK: STD killed $x21, -88, $x1 :: (store 8 into %fixed-stack.10)
				; CHECK: STD killed $x22, -80, $x1 :: (store 8 into %fixed-stack.9, align 16)
				; CHECK: STD killed $x23, -72, $x1 :: (store 8 into %fixed-stack.8)
				; CHECK: STD killed $x24, -64, $x1 :: (store 8 into %fixed-stack.7, align 16)
				; CHECK: STD killed $x25, -56, $x1 :: (store 8 into %fixed-stack.6)
				; CHECK: STD killed $x26, -48, $x1 :: (store 8 into %fixed-stack.5, align 16)
				; CHECK: STD killed $x27, -40, $x1 :: (store 8 into %fixed-stack.4)
				; CHECK: STD killed $x28, -32, $x1 :: (store 8 into %fixed-stack.3, align 16)
				; CHECK: STD killed $x29, -24, $x1 :: (store 8 into %fixed-stack.2)
				; CHECK: STD killed $x30, -16, $x1 :: (store 8 into %fixed-stack.1, align 16)
				; CHECK: STD killed $x31, -8, $x1 :: (store 8 into %fixed-stack.0)
				; CHECK: $x7 = OR8 $x3, $x3
				; CHECK: renamable $g8p4 = LQARX $x5, $x6
				; CHECK: STD killed $x8, -160, $x1
				; CHECK: STD killed $x9, -152, $x1
				; CHECK: renamable $g8p5 = LQARX $x3, renamable $x4
				; CHECK: renamable $g8p4 = LQARX $x3, renamable $x4
				; CHECK: STD killed $x8, -176, $x1
				; CHECK: STD killed $x9, -168, $x1
				; CHECK: renamable $g8p4 = LQARX $x3, renamable $x4
				; CHECK: STD killed $x8, -192, $x1
				; CHECK: STD killed $x9, -184, $x1
				; CHECK: renamable $g8p4 = LQARX $x3, renamable $x4
				; CHECK: STD killed $x8, -208, $x1
				; CHECK: STD killed $x9, -200, $x1
				; CHECK: renamable $g8p4 = LQARX $x3, renamable $x4
				; CHECK: STD killed $x8, -224, $x1
				; CHECK: STD killed $x9, -216, $x1
				; CHECK: renamable $g8p10 = LQARX $x3, renamable $x4
				; CHECK: renamable $g8p9 = LQARX $x3, renamable $x4
				; CHECK: renamable $g8p8 = LQARX $x3, renamable $x4
				; CHECK: renamable $g8p7 = LQARX $x3, renamable $x4
				; CHECK: renamable $g8p15 = LQARX $x3, renamable $x4
				; CHECK: renamable $g8p11 = LQARX $x3, renamable $x4
				; CHECK: renamable $g8p12 = LQARX $x3, renamable $x4
				; CHECK: renamable $g8p13 = LQARX $x3, renamable $x4
				; CHECK: renamable $g8p14 = LQARX $x3, renamable $x4
				; CHECK: renamable $g8p4 = LQARX $x3, renamable $x4
				; CHECK: $x3 = OR8 $x11, $x11
				; CHECK: STQCX killed renamable $g8p4, renamable $x7, renamable $x4, implicit-def dead $cr0
				; CHECK: STQCX killed renamable $g8p14, renamable $x7, renamable $x4, implicit-def dead $cr0
				; CHECK: STQCX killed renamable $g8p13, renamable $x7, renamable $x4, implicit-def dead $cr0
				; CHECK: STQCX killed renamable $g8p12, renamable $x7, renamable $x4, implicit-def dead $cr0
				; CHECK: STQCX killed renamable $g8p11, renamable $x7, renamable $x4, implicit-def dead $cr0
				; CHECK: STQCX killed renamable $g8p15, renamable $x7, renamable $x4, implicit-def dead $cr0
				; CHECK: STQCX killed renamable $g8p7, renamable $x7, renamable $x4, implicit-def dead $cr0
				; CHECK: STQCX killed renamable $g8p8, renamable $x7, renamable $x4, implicit-def dead $cr0
				; CHECK: STQCX killed renamable $g8p9, renamable $x7, renamable $x4, implicit-def dead $cr0
				; CHECK: STQCX killed renamable $g8p10, renamable $x7, renamable $x4, implicit-def dead $cr0
				; CHECK: $x8 = LD -224, $x1
				; CHECK: $x9 = LD -216, $x1
				; CHECK: STQCX killed renamable $g8p4, renamable $x7, renamable $x4, implicit-def dead $cr0
				; CHECK: $x8 = LD -208, $x1
				; CHECK: $x9 = LD -200, $x1
				; CHECK: STQCX killed renamable $g8p4, renamable $x7, renamable $x4, implicit-def dead $cr0
				; CHECK: $x8 = LD -192, $x1
				; CHECK: $x9 = LD -184, $x1
				; CHECK: STQCX killed renamable $g8p4, renamable $x7, renamable $x4, implicit-def dead $cr0
				; CHECK: $x8 = LD -176, $x1
				; CHECK: $x9 = LD -168, $x1
				; CHECK: STQCX killed renamable $g8p4, renamable $x7, renamable $x4, implicit-def dead $cr0
				; CHECK: STQCX killed renamable $g8p5, killed renamable $x7, killed renamable $x4, implicit-def dead $cr0
				; CHECK: $x8 = LD -160, $x1
				; CHECK: $x9 = LD -152, $x1
				; CHECK: STQCX killed renamable $g8p4, $x5, $x6, implicit-def dead $cr0
				; CHECK: $x31 = LD -8, $x1 :: (load 8 from %fixed-stack.0)
				; CHECK: $x30 = LD -16, $x1 :: (load 8 from %fixed-stack.1, align 16)
				; CHECK: $x29 = LD -24, $x1 :: (load 8 from %fixed-stack.2)
				; CHECK: $x28 = LD -32, $x1 :: (load 8 from %fixed-stack.3, align 16)
				; CHECK: $x27 = LD -40, $x1 :: (load 8 from %fixed-stack.4)
				; CHECK: $x26 = LD -48, $x1 :: (load 8 from %fixed-stack.5, align 16)
				; CHECK: $x25 = LD -56, $x1 :: (load 8 from %fixed-stack.6)
				; CHECK: $x24 = LD -64, $x1 :: (load 8 from %fixed-stack.7, align 16)
				; CHECK: $x23 = LD -72, $x1 :: (load 8 from %fixed-stack.8)
				; CHECK: $x22 = LD -80, $x1 :: (load 8 from %fixed-stack.9, align 16)
				; CHECK: $x21 = LD -88, $x1 :: (load 8 from %fixed-stack.10)
				; CHECK: $x20 = LD -96, $x1 :: (load 8 from %fixed-stack.11, align 16)
				; CHECK: $x19 = LD -104, $x1 :: (load 8 from %fixed-stack.12)
				; CHECK: $x18 = LD -112, $x1 :: (load 8 from %fixed-stack.13, align 16)
				; CHECK: $x17 = LD -120, $x1 :: (load 8 from %fixed-stack.14)
				; CHECK: $x16 = LD -128, $x1 :: (load 8 from %fixed-stack.15, align 16)
				; CHECK: $x15 = LD -136, $x1 :: (load 8 from %fixed-stack.16)
				; CHECK: $x14 = LD -144, $x1 :: (load 8 from %fixed-stack.17, align 16)
				; CHECK: BLR8 implicit $lr8, implicit undef $rm, implicit $x3
				%addr0:g8rc_nox0 = COPY $x3
				%addr1:g8rc = COPY $x4
				%0:g8prc = LQARX $x5, $x6
				%1:g8prc = LQARX %addr0, %addr1
				%2:g8prc = LQARX %addr0, %addr1
				%3:g8prc = LQARX %addr0, %addr1
				%4:g8prc = LQARX %addr0, %addr1
				%5:g8prc = LQARX %addr0, %addr1
				%6:g8prc = LQARX %addr0, %addr1
				%7:g8prc = LQARX %addr0, %addr1
				%8:g8prc = LQARX %addr0, %addr1
				%9:g8prc = LQARX %addr0, %addr1
				%10:g8prc = LQARX %addr0, %addr1
				%11:g8prc = LQARX %addr0, %addr1
				%12:g8prc = LQARX %addr0, %addr1
				%13:g8prc = LQARX %addr0, %addr1
				%14:g8prc = LQARX %addr0, %addr1
				%15:g8prc = LQARX %addr0, %addr1
				STQCX %15:g8prc, %addr0, %addr1, implicit-def $cr0
				STQCX %14:g8prc, %addr0, %addr1, implicit-def $cr0
				STQCX %13:g8prc, %addr0, %addr1, implicit-def $cr0
				STQCX %12:g8prc, %addr0, %addr1, implicit-def $cr0
				STQCX %11:g8prc, %addr0, %addr1, implicit-def $cr0
				STQCX %10:g8prc, %addr0, %addr1, implicit-def $cr0
				STQCX %9:g8prc, %addr0, %addr1, implicit-def $cr0
				STQCX %8:g8prc, %addr0, %addr1, implicit-def $cr0
				STQCX %7:g8prc, %addr0, %addr1, implicit-def $cr0
				STQCX %6:g8prc, %addr0, %addr1, implicit-def $cr0
				STQCX %5:g8prc, %addr0, %addr1, implicit-def $cr0
				STQCX %4:g8prc, %addr0, %addr1, implicit-def $cr0
				STQCX %3:g8prc, %addr0, %addr1, implicit-def $cr0
				STQCX %2:g8prc, %addr0, %addr1, implicit-def $cr0
				STQCX %1:g8prc, %addr0, %addr1, implicit-def $cr0
				STQCX %0:g8prc, $x5, $x6, implicit-def $cr0
				$x3 = COPY %1.sub_gp8_x1:g8prc
				BLR8 implicit $lr8, implicit undef $rm, implicit $x3
				...

				---
				name: copy_g8prc
				alignment: 8
				tracksRegLiveness: true
				body: \|
				bb.0.entry:
				liveins: $g8p8
				; CHECK-LABEL: name: copy_g8prc
				; CHECK: liveins: $g8p8
				; CHECK: $x4 = OR8 $x16, $x16
				; CHECK: $x5 = OR8 $x17, $x17
				; CHECK: $x3 = OR8 $x5, $x5
				; CHECK: $x4 = KILL renamable $x4, implicit killed $g8p2
				; CHECK: BLR8 implicit $lr8, implicit undef $rm, implicit killed $x3, implicit $x4
				%0:g8prc = COPY $g8p8
				$x3 = COPY %0.sub_gp8_x1:g8prc
				$x4 = COPY %0.sub_gp8_x0:g8prc
				BLR8 implicit $lr8, implicit undef $rm, implicit $x3, implicit $x4
				...

llvm/test/MC/Disassembler/PowerPC/ppc64-encoding-bookII.txt

	Show All 27 Lines
	0x4c 0x00 0x01 0x2c			0x4c 0x00 0x01 0x2c

	# CHECK: stwcx. 2, 3, 4			# CHECK: stwcx. 2, 3, 4
	0x7c 0x43 0x21 0x2d			0x7c 0x43 0x21 0x2d

	# CHECK: stdcx. 2, 3, 4			# CHECK: stdcx. 2, 3, 4
	0x7c 0x43 0x21 0xad			0x7c 0x43 0x21 0xad

				# CHECK: stqcx. 2, 3, 4
				0x7c 0x43 0x21 0x6d

				# CHECK: stqcx. 28, 30, 31
				0x7f 0x9e 0xf9 0x6d

	# CHECK: stwat 2, 3, 28			# CHECK: stwat 2, 3, 28
	0x7c 0x43 0xe5 0x8c			0x7c 0x43 0xe5 0x8c

	# CHECK: stdat 2, 3, 28			# CHECK: stdat 2, 3, 28
	0x7c 0x43 0xe5 0xcc			0x7c 0x43 0xe5 0xcc

	# CHECK: ptesync			# CHECK: ptesync
	0x7c 0x40 0x04 0xac			0x7c 0x40 0x04 0xac
	Show All 23 Lines
	0x7c 0x43 0x20 0xe8			0x7c 0x43 0x20 0xe8

	# CHECK: lwarx 2, 3, 4			# CHECK: lwarx 2, 3, 4
	0x7c 0x43 0x20 0x28			0x7c 0x43 0x20 0x28

	# CHECK: ldarx 2, 3, 4			# CHECK: ldarx 2, 3, 4
	0x7c 0x43 0x20 0xa8			0x7c 0x43 0x20 0xa8

				# CHECK: lqarx 2, 3, 4
				0x7c 0x43 0x22 0x28

				# CHECK: lqarx 28, 30, 31
				0x7f 0x9e 0xfa 0x28

	# CHECK: lbarx 2, 3, 4, 1			# CHECK: lbarx 2, 3, 4, 1
	0x7c 0x43 0x20 0x69			0x7c 0x43 0x20 0x69

	# CHECK: lharx 2, 3, 4, 1			# CHECK: lharx 2, 3, 4, 1
	0x7c 0x43 0x20 0xe9			0x7c 0x43 0x20 0xe9

	# CHECK: lwarx 2, 3, 4, 1			# CHECK: lwarx 2, 3, 4, 1
	0x7c 0x43 0x20 0x29			0x7c 0x43 0x20 0x29

	# CHECK: ldarx 2, 3, 4, 1			# CHECK: ldarx 2, 3, 4, 1
	0x7c 0x43 0x20 0xa9			0x7c 0x43 0x20 0xa9

				# CHECK: lqarx 2, 3, 4, 1
				0x7c 0x43 0x22 0x29

				# CHECK: lqarx 28, 30, 31, 1
				0x7f 0x9e 0xfa 0x29

	# CHECK: lwat 2, 3, 28			# CHECK: lwat 2, 3, 28
	0x7c 0x43 0xe4 0x8c			0x7c 0x43 0xe4 0x8c

	# CHECK: ldat 2, 3, 28			# CHECK: ldat 2, 3, 28
	0x7c 0x43 0xe4 0xcc			0x7c 0x43 0xe4 0xcc

	# CHECK: sync			# CHECK: sync
	0x7c 0x00 0x04 0xac			0x7c 0x00 0x04 0xac
	Show All 28 Lines

llvm/test/MC/Disassembler/PowerPC/ppc64-encoding.txt

	Show First 20 Lines • Show All 148 Lines • ▼ Show 20 Lines
	0xe8 0x44 0x00 0x81			0xe8 0x44 0x00 0x81

	# CHECK: ldux 2, 3, 4			# CHECK: ldux 2, 3, 4
	0x7c 0x43 0x20 0x6a			0x7c 0x43 0x20 0x6a

	# CHECK: ldmx 2, 3, 4			# CHECK: ldmx 2, 3, 4
	0x7c 0x43 0x22 0x6a			0x7c 0x43 0x22 0x6a

				# CHECK: lq 2, 128(4)
				0xe0 0x44 0x00 0x80

				# CHECK: lq 28, 128(30)
				0xe3 0x9e 0x00 0x80

	# CHECK: stb 2, 128(4)			# CHECK: stb 2, 128(4)
	0x98 0x44 0x00 0x80			0x98 0x44 0x00 0x80

	# CHECK: stbx 2, 3, 4			# CHECK: stbx 2, 3, 4
	0x7c 0x43 0x21 0xae			0x7c 0x43 0x21 0xae

	# CHECK: stbu 2, 128(4)			# CHECK: stbu 2, 128(4)
	0x9c 0x44 0x00 0x80			0x9c 0x44 0x00 0x80
	Show All 32 Lines
	0x7c 0x43 0x21 0x2a			0x7c 0x43 0x21 0x2a

	# CHECK: stdu 2, 128(4)			# CHECK: stdu 2, 128(4)
	0xf8 0x44 0x00 0x81			0xf8 0x44 0x00 0x81

	# CHECK: stdux 2, 3, 4			# CHECK: stdux 2, 3, 4
	0x7c 0x43 0x21 0x6a			0x7c 0x43 0x21 0x6a

				# CHECK: stq 2, 128(4)
				0xf8 0x44 0x00 0x82

				# CHECK: stq 28, 128(30)
				0xfb 0x9e 0x00 0x82

	# CHECK: lhbrx 2, 3, 4			# CHECK: lhbrx 2, 3, 4
	0x7c 0x43 0x26 0x2c			0x7c 0x43 0x26 0x2c

	# CHECK: sthbrx 2, 3, 4			# CHECK: sthbrx 2, 3, 4
	0x7c 0x43 0x27 0x2c			0x7c 0x43 0x27 0x2c

	# CHECK: lwbrx 2, 3, 4			# CHECK: lwbrx 2, 3, 4
	0x7c 0x43 0x24 0x2c			0x7c 0x43 0x24 0x2c
	▲ Show 20 Lines • Show All 669 Lines • Show Last 20 Lines

llvm/test/MC/PowerPC/ppc64-encoding-bookII.s

	Show First 20 Lines • Show All 77 Lines • ▼ Show 20 Lines
	# CHECK-BE: stwcx. 2, 3, 4 # encoding: [0x7c,0x43,0x21,0x2d]			# CHECK-BE: stwcx. 2, 3, 4 # encoding: [0x7c,0x43,0x21,0x2d]
	# CHECK-LE: stwcx. 2, 3, 4 # encoding: [0x2d,0x21,0x43,0x7c]			# CHECK-LE: stwcx. 2, 3, 4 # encoding: [0x2d,0x21,0x43,0x7c]
	stwcx. 2, 3, 4			stwcx. 2, 3, 4

	# CHECK-BE: stdcx. 2, 3, 4 # encoding: [0x7c,0x43,0x21,0xad]			# CHECK-BE: stdcx. 2, 3, 4 # encoding: [0x7c,0x43,0x21,0xad]
	# CHECK-LE: stdcx. 2, 3, 4 # encoding: [0xad,0x21,0x43,0x7c]			# CHECK-LE: stdcx. 2, 3, 4 # encoding: [0xad,0x21,0x43,0x7c]
	stdcx. 2, 3, 4			stdcx. 2, 3, 4

				# CHECK-BE: stqcx. 2, 3, 4 # encoding: [0x7c,0x43,0x21,0x6d]
				# CHECK-LE: stqcx. 2, 3, 4 # encoding: [0x6d,0x21,0x43,0x7c]
				stqcx. 2, 3, 4

				# CHECK-BE: stqcx. 28, 30, 31 # encoding: [0x7f,0x9e,0xf9,0x6d]
				# CHECK-LE: stqcx. 28, 30, 31 # encoding: [0x6d,0xf9,0x9e,0x7f]
				stqcx. 28, 30, 31

	# CHECK-BE: stwat 2, 3, 28 # encoding: [0x7c,0x43,0xe5,0x8c]			# CHECK-BE: stwat 2, 3, 28 # encoding: [0x7c,0x43,0xe5,0x8c]
	# CHECK-LE: stwat 2, 3, 28 # encoding: [0x8c,0xe5,0x43,0x7c]			# CHECK-LE: stwat 2, 3, 28 # encoding: [0x8c,0xe5,0x43,0x7c]
	stwat 2, 3, 28			stwat 2, 3, 28

	# CHECK-BE: stdat 2, 3, 28 # encoding: [0x7c,0x43,0xe5,0xcc]			# CHECK-BE: stdat 2, 3, 28 # encoding: [0x7c,0x43,0xe5,0xcc]
	# CHECK-LE: stdat 2, 3, 28 # encoding: [0xcc,0xe5,0x43,0x7c]			# CHECK-LE: stdat 2, 3, 28 # encoding: [0xcc,0xe5,0x43,0x7c]
	stdat 2, 3, 28			stdat 2, 3, 28

	Show All 35 Lines
	# CHECK-BE: lwarx 2, 3, 4 # encoding: [0x7c,0x43,0x20,0x28]			# CHECK-BE: lwarx 2, 3, 4 # encoding: [0x7c,0x43,0x20,0x28]
	# CHECK-LE: lwarx 2, 3, 4 # encoding: [0x28,0x20,0x43,0x7c]			# CHECK-LE: lwarx 2, 3, 4 # encoding: [0x28,0x20,0x43,0x7c]
	lwarx 2, 3, 4			lwarx 2, 3, 4

	# CHECK-BE: ldarx 2, 3, 4 # encoding: [0x7c,0x43,0x20,0xa8]			# CHECK-BE: ldarx 2, 3, 4 # encoding: [0x7c,0x43,0x20,0xa8]
	# CHECK-LE: ldarx 2, 3, 4 # encoding: [0xa8,0x20,0x43,0x7c]			# CHECK-LE: ldarx 2, 3, 4 # encoding: [0xa8,0x20,0x43,0x7c]
	ldarx 2, 3, 4			ldarx 2, 3, 4

				# CHECK-BE: lqarx 2, 3, 4 # encoding: [0x7c,0x43,0x22,0x28]
				# CHECK-LE: lqarx 2, 3, 4 # encoding: [0x28,0x22,0x43,0x7c]
				lqarx 2, 3, 4

				# CHECK-BE: lqarx 28, 30, 31 # encoding: [0x7f,0x9e,0xfa,0x28]
				# CHECK-LE: lqarx 28, 30, 31 # encoding: [0x28,0xfa,0x9e,0x7f]
				lqarx 28, 30, 31

	# CHECK-BE: lbarx 2, 3, 4, 1 # encoding: [0x7c,0x43,0x20,0x69]			# CHECK-BE: lbarx 2, 3, 4, 1 # encoding: [0x7c,0x43,0x20,0x69]
	# CHECK-LE: lbarx 2, 3, 4, 1 # encoding: [0x69,0x20,0x43,0x7c]			# CHECK-LE: lbarx 2, 3, 4, 1 # encoding: [0x69,0x20,0x43,0x7c]
	lbarx 2, 3, 4, 1			lbarx 2, 3, 4, 1

	# CHECK-BE: lharx 2, 3, 4, 1 # encoding: [0x7c,0x43,0x20,0xe9]			# CHECK-BE: lharx 2, 3, 4, 1 # encoding: [0x7c,0x43,0x20,0xe9]
	# CHECK-LE: lharx 2, 3, 4, 1 # encoding: [0xe9,0x20,0x43,0x7c]			# CHECK-LE: lharx 2, 3, 4, 1 # encoding: [0xe9,0x20,0x43,0x7c]
	lharx 2, 3, 4, 1			lharx 2, 3, 4, 1

	# CHECK-BE: lwarx 2, 3, 4, 1 # encoding: [0x7c,0x43,0x20,0x29]			# CHECK-BE: lwarx 2, 3, 4, 1 # encoding: [0x7c,0x43,0x20,0x29]
	# CHECK-LE: lwarx 2, 3, 4, 1 # encoding: [0x29,0x20,0x43,0x7c]			# CHECK-LE: lwarx 2, 3, 4, 1 # encoding: [0x29,0x20,0x43,0x7c]
	lwarx 2, 3, 4, 1			lwarx 2, 3, 4, 1

	# CHECK-BE: ldarx 2, 3, 4, 1 # encoding: [0x7c,0x43,0x20,0xa9]			# CHECK-BE: ldarx 2, 3, 4, 1 # encoding: [0x7c,0x43,0x20,0xa9]
	# CHECK-LE: ldarx 2, 3, 4, 1 # encoding: [0xa9,0x20,0x43,0x7c]			# CHECK-LE: ldarx 2, 3, 4, 1 # encoding: [0xa9,0x20,0x43,0x7c]
	ldarx 2, 3, 4, 1			ldarx 2, 3, 4, 1

				# CHECK-BE: lqarx 2, 3, 4, 1 # encoding: [0x7c,0x43,0x22,0x29]
				# CHECK-LE: lqarx 2, 3, 4, 1 # encoding: [0x29,0x22,0x43,0x7c]
				lqarx 2, 3, 4, 1

				# CHECK-BE: lqarx 28, 30, 31, 1 # encoding: [0x7f,0x9e,0xfa,0x29]
				# CHECK-LE: lqarx 28, 30, 31, 1 # encoding: [0x29,0xfa,0x9e,0x7f]
				lqarx 28, 30, 31, 1

	# CHECK-BE: lwat 2, 3, 28 # encoding: [0x7c,0x43,0xe4,0x8c]			# CHECK-BE: lwat 2, 3, 28 # encoding: [0x7c,0x43,0xe4,0x8c]
	# CHECK-LE: lwat 2, 3, 28 # encoding: [0x8c,0xe4,0x43,0x7c]			# CHECK-LE: lwat 2, 3, 28 # encoding: [0x8c,0xe4,0x43,0x7c]
	lwat 2, 3, 28			lwat 2, 3, 28

	# CHECK-BE: ldat 2, 3, 28 # encoding: [0x7c,0x43,0xe4,0xcc]			# CHECK-BE: ldat 2, 3, 28 # encoding: [0x7c,0x43,0xe4,0xcc]
	# CHECK-LE: ldat 2, 3, 28 # encoding: [0xcc,0xe4,0x43,0x7c]			# CHECK-LE: ldat 2, 3, 28 # encoding: [0xcc,0xe4,0x43,0x7c]
	ldat 2, 3, 28			ldat 2, 3, 28

	▲ Show 20 Lines • Show All 44 Lines • Show Last 20 Lines

llvm/test/MC/PowerPC/ppc64-encoding.s

	Show First 20 Lines • Show All 235 Lines • ▼ Show 20 Lines
	# CHECK-LE: ldu 2, 128(4) # encoding: [0x81,0x00,0x44,0xe8]			# CHECK-LE: ldu 2, 128(4) # encoding: [0x81,0x00,0x44,0xe8]
	ldu 2, 128(4)			ldu 2, 128(4)
	# CHECK-BE: ldux 2, 3, 4 # encoding: [0x7c,0x43,0x20,0x6a]			# CHECK-BE: ldux 2, 3, 4 # encoding: [0x7c,0x43,0x20,0x6a]
	# CHECK-LE: ldux 2, 3, 4 # encoding: [0x6a,0x20,0x43,0x7c]			# CHECK-LE: ldux 2, 3, 4 # encoding: [0x6a,0x20,0x43,0x7c]
	ldux 2, 3, 4			ldux 2, 3, 4
	# CHECK-BE: ldmx 2, 3, 4 # encoding: [0x7c,0x43,0x22,0x6a]			# CHECK-BE: ldmx 2, 3, 4 # encoding: [0x7c,0x43,0x22,0x6a]
	# CHECK-LE: ldmx 2, 3, 4 # encoding: [0x6a,0x22,0x43,0x7c]			# CHECK-LE: ldmx 2, 3, 4 # encoding: [0x6a,0x22,0x43,0x7c]
	ldmx 2, 3, 4			ldmx 2, 3, 4
				# CHECK-BE: lq 2, 128(4) # encoding: [0xe0,0x44,0x00,0x80]
				# CHECK-LE: lq 2, 128(4) # encoding: [0x80,0x00,0x44,0xe0]
				lq 2, 128(4)
				# CHECK-BE: lq 28, 128(30) # encoding: [0xe3,0x9e,0x00,0x80]
				# CHECK-LE: lq 28, 128(30) # encoding: [0x80,0x00,0x9e,0xe3]
				lq 28, 128(30)

	# Fixed-point store instructions			# Fixed-point store instructions

	# CHECK-BE: stb 2, 128(4) # encoding: [0x98,0x44,0x00,0x80]			# CHECK-BE: stb 2, 128(4) # encoding: [0x98,0x44,0x00,0x80]
	# CHECK-LE: stb 2, 128(4) # encoding: [0x80,0x00,0x44,0x98]			# CHECK-LE: stb 2, 128(4) # encoding: [0x80,0x00,0x44,0x98]
	stb 2, 128(4)			stb 2, 128(4)
	# CHECK-BE: stbx 2, 3, 4 # encoding: [0x7c,0x43,0x21,0xae]			# CHECK-BE: stbx 2, 3, 4 # encoding: [0x7c,0x43,0x21,0xae]
	# CHECK-LE: stbx 2, 3, 4 # encoding: [0xae,0x21,0x43,0x7c]			# CHECK-LE: stbx 2, 3, 4 # encoding: [0xae,0x21,0x43,0x7c]
	Show All 35 Lines
	# CHECK-LE: stdx 2, 3, 4 # encoding: [0x2a,0x21,0x43,0x7c]			# CHECK-LE: stdx 2, 3, 4 # encoding: [0x2a,0x21,0x43,0x7c]
	stdx 2, 3, 4			stdx 2, 3, 4
	# CHECK-BE: stdu 2, 128(4) # encoding: [0xf8,0x44,0x00,0x81]			# CHECK-BE: stdu 2, 128(4) # encoding: [0xf8,0x44,0x00,0x81]
	# CHECK-LE: stdu 2, 128(4) # encoding: [0x81,0x00,0x44,0xf8]			# CHECK-LE: stdu 2, 128(4) # encoding: [0x81,0x00,0x44,0xf8]
	stdu 2, 128(4)			stdu 2, 128(4)
	# CHECK-BE: stdux 2, 3, 4 # encoding: [0x7c,0x43,0x21,0x6a]			# CHECK-BE: stdux 2, 3, 4 # encoding: [0x7c,0x43,0x21,0x6a]
	# CHECK-LE: stdux 2, 3, 4 # encoding: [0x6a,0x21,0x43,0x7c]			# CHECK-LE: stdux 2, 3, 4 # encoding: [0x6a,0x21,0x43,0x7c]
	stdux 2, 3, 4			stdux 2, 3, 4
				# CHECK-BE: stq 2, 128(4) # encoding: [0xf8,0x44,0x00,0x82]
				# CHECK-LE: stq 2, 128(4) # encoding: [0x82,0x00,0x44,0xf8]
				stq 2, 128(4)
				# CHECK-BE: stq 28, 128(30) # encoding: [0xfb,0x9e,0x00,0x82]
				# CHECK-LE: stq 28, 128(30) # encoding: [0x82,0x00,0x9e,0xfb]
				stq 28, 128(30)

	# Fixed-point load and store with byte reversal instructions			# Fixed-point load and store with byte reversal instructions

	# CHECK-BE: lhbrx 2, 3, 4 # encoding: [0x7c,0x43,0x26,0x2c]			# CHECK-BE: lhbrx 2, 3, 4 # encoding: [0x7c,0x43,0x26,0x2c]
	# CHECK-LE: lhbrx 2, 3, 4 # encoding: [0x2c,0x26,0x43,0x7c]			# CHECK-LE: lhbrx 2, 3, 4 # encoding: [0x2c,0x26,0x43,0x7c]
	lhbrx 2, 3, 4			lhbrx 2, 3, 4
	# CHECK-BE: sthbrx 2, 3, 4 # encoding: [0x7c,0x43,0x27,0x2c]			# CHECK-BE: sthbrx 2, 3, 4 # encoding: [0x7c,0x43,0x27,0x2c]
	# CHECK-LE: sthbrx 2, 3, 4 # encoding: [0x2c,0x27,0x43,0x7c]			# CHECK-LE: sthbrx 2, 3, 4 # encoding: [0x2c,0x27,0x43,0x7c]
	▲ Show 20 Lines • Show All 783 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[PowerPC] Export 16 byte load-store instructionsClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 351076

llvm/lib/Target/PowerPC/AsmParser/PPCAsmParser.cpp

llvm/lib/Target/PowerPC/Disassembler/PPCDisassembler.cpp

llvm/lib/Target/PowerPC/PPCInstr64Bit.td

llvm/lib/Target/PowerPC/PPCInstrFormats.td

llvm/lib/Target/PowerPC/PPCInstrInfo.h

llvm/lib/Target/PowerPC/PPCInstrInfo.cpp

llvm/lib/Target/PowerPC/PPCInstrInfo.td

llvm/lib/Target/PowerPC/PPCRegisterInfo.h

llvm/lib/Target/PowerPC/PPCRegisterInfo.cpp

llvm/lib/Target/PowerPC/PPCRegisterInfo.td

llvm/lib/Target/PowerPC/PPCSchedule.td

llvm/test/CodeGen/PowerPC/ldst-16-byte-asm.mir

llvm/test/CodeGen/PowerPC/ldst-16-byte.mir

llvm/test/MC/Disassembler/PowerPC/ppc64-encoding-bookII.txt

llvm/test/MC/Disassembler/PowerPC/ppc64-encoding.txt

llvm/test/MC/PowerPC/ppc64-encoding-bookII.s

llvm/test/MC/PowerPC/ppc64-encoding.s

[PowerPC] Export 16 byte load-store instructions
ClosedPublic