This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/trunk/
-
trunk/
-
lib/Target/Mips/
-
Target/
-
Mips/
-
Mips.td
-
Mips64InstrInfo.td
-
MipsInstrInfo.td
-
test/CodeGen/Mips/
-
CodeGen/
-
Mips/
-
indirect-jump-hazard/
-
long-calls.ll
-
long-calls.ll
-
pr42736.ll

Differential D66228

[mips] Fix 64-bit address loading in case of applying 32-bit mask to the result
ClosedPublic

Authored by atanasyan on Aug 14 2019, 9:35 AM.

Download Raw Diff

Details

Reviewers

Petar.Avramovic
sdardis

Commits

rG59bb3609fa5f: [mips] Fix 64-bit address loading in case of applying 32-bit mask to the result
rL370268: [mips] Fix 64-bit address loading in case of applying 32-bit mask to the result

Summary

If result of 64-bit address loading combines with 32-bit mask, LLVM tries to optimize the code and remove "redundant" loading of upper 32-bits of the address. It leads to incorrect code on MIPS64 targets.

MIPS backend creates the following chain of commands to load 64-bit address in the MipsTargetLowering::getAddrNonPICSym64 method:

(add (shl (add (shl (add %highest(sym), %higher(sym)),
                    16),
               %hi(sym)),
          16),
     %lo(%sym))

If the mask presents, LLVM decides to optimize the chain of commands. It really does not make sense to load upper 32-bits because the 0x0fffffff mask anyway clears them. After removing redundant commands we get this chain:

(add (shl (%hi(sym), 16), %lo(%sym))

There is no patterns matched (MipsHi (i64 symbol)). Due a bug in SYM_32 predicate definition, backend incorrectly selects a pattern for a 32-bit symbols and uses the lui instruction for loading %hi(sym).

As a result we get incorrect set of instructions with unnecessary 16-bit left shifting:

lui     at,0x0
    R_MIPS_HI16     foo
dsll    at,at,0x10
daddiu  at,at,0
    R_MIPS_LO16     foo

This patch resolves two problems:

Fix SYM_32/SYM_64 predicates to prevent selection of patterns dedicated to 32-bit symbols in case of using N64 ABI.
Add missed patterns for 64-bit symbols for %hi/%lo.

Fix PR42736.

Diff Detail

Repository: rL LLVM

Event Timeline

atanasyan created this revision.Aug 14 2019, 9:35 AM

Herald added a project: Restricted Project. · View Herald TranscriptAug 14 2019, 9:35 AM

Herald added subscribers: jrtc27, hiraditya, arichardson. · View Herald Transcript

This looks OK to me, I'd like to take a hard look at what DAGCombiner is doing here.

Can you also supply a test case where the mask has the topmost bit in an i32 set so that we can catch the sign extension cases?

llvm/lib/Target/Mips/Mips64InstrInfo.td
687 ↗	(On Diff #215143)	This addition and below needs SYM_64 I believe.

I am not really sure if this is correct way to fix PR42736. I looked in td files and found a few inconsistencies, again I might have misinterpreted something.

MipsHi(the sd node) seems to have multiple interpretations if we add new patterns

There is MipsHiLoRelocs multiclass in MipsInstrInfo.td

multiclass MipsHiLoRelocs<Instruction Lui, Instruction Addiu,
                          Register ZeroReg, RegisterOperand GPROpnd> {
  def : MipsPat<(MipsHi tglobaladdr:$in), (Lui tglobaladdr:$in)>;
...

and in Mips64InstrInfo.td

defm : MipsHiLoRelocs<LUi64, DADDiu, ZERO_64, GPR64Opnd>, ISA_MIPS3, GPR_64,
       SYM_32;

equivalent to

def : MipsPat<(MipsHi tglobaladdr:$in), (LUi64 tglobaladdr:$in)>;

disagrees with the new

def : MipsPat<(shl (MipsHi (i64 tglobaladdr:$in)), (i32 16)),
              (LUi64 tglobaladdr:$in)>, ISA_MIPS3, GPR_64;

In td file MipsHi is interpreted like: "16 bit imm(bits 31-16 of global addr in our case) is placed into bits 31-16"
while in getAddrNonPICSym64 it is: "16 bit imm(bits 31-16 of global addr in our case) is placed into low 16 bits of result and shift for 16 is required?"

If we use existing MipsISD::Hi interpretation from td file getAddrNonPICSym64 could be modified like this:

       SDValue HigherPart =
           DAG.getNode(ISD::ADD, DL, Ty, Highest,
                       DAG.getNode(MipsISD::Higher, DL, Ty, Higher));
-      SDValue Cst = DAG.getConstant(16, DL, MVT::i32);
+      SDValue Cst = DAG.getConstant(32, DL, MVT::i32);
       SDValue Shift = DAG.getNode(ISD::SHL, DL, Ty, HigherPart, Cst);
       SDValue Add = DAG.getNode(ISD::ADD, DL, Ty, Shift,
                                 DAG.getNode(MipsISD::Hi, DL, Ty, Hi));
-      SDValue Shift2 = DAG.getNode(ISD::SHL, DL, Ty, Add, Cst);
 
-      return DAG.getNode(ISD::ADD, DL, Ty, Shift2,
+      return DAG.getNode(ISD::ADD, DL, Ty, Add,
                          DAG.getNode(MipsISD::Lo, DL, Ty, Lo));

Also MipsISD::Highest and MipsISD::Higher do not seem consistent with their names,
MipsISD::Highest places 16 bit imm(bits 48-63 of global addr in our case) into bits 31-16

MipsISD::Highest is selected like MipsISD::Hi (lui), while
MipsISD::Higher is selected like MipsISD::Lo (addiu)

Replacing MipsISD::Highest and MipsISD::Higher with MipsISD::Hi and MipsISD::Lo in getAddrNonPICSym64, results in same instructions being selected.
Now, I might be missing something here since I am not that familiar with llvm options and ways that global address and similar should be transformed into instructions,
but SDNodes

def MipsHigher : SDNode<"MipsISD::Higher", SDTIntUnaryOp>;
def MipsHighest : SDNode<"MipsISD::Highest", SDTIntUnaryOp>;

in MipsInstrInfo.td, and patterns they define seem to be duplicates of

def MipsHi    : SDNode<"MipsISD::Hi", SDTIntUnaryOp>;
def MipsLo    : SDNode<"MipsISD::Lo", SDTIntUnaryOp>;

and its patterns. Thoughts?

I took a second look, and I believe this patch is the incorrect solution. The bug actually lies in the implementation of PredicateControl and SYM_32/SYM_64, @Petar.Avramovic nearly spotted it.

Those two adjectives add a predicate which checks if the sym32 feature is enabled or not[1]. However, as they are not appended to the list of predicates for a pattern--as SYMPredicates is not in the definition of PredicateControl.

Without that adjective taking effect there are two sets of patterns for a MipsHi node depending on it's parent. If the parent node is a register and an add--as expected--the instruction selection machinery produces an DADDiu with the relevant operand and relocation. If it is not an add, i.e. the case where the known values of the upper 32 bits are zero which would allow DAGCombiner to elide the addition of the upper components and the addition of the upper components to the MipsHi node as they are all zero, then the instruction selection machinery can't maximally munch to the graph like in the (add (MipsHi ...) ..) case and picks the transformation of MipsHi -> LUi64, when that node is the child of a shift because the SYM_32 patterns provide for that result at a lower complexity.

Those patterns should not be selectable, leading to an instruction selection failure. Since they are selectable, the result of an LUi64 gets shifted into MipsHigher's space then has MipsLo added to it.

The correct approach I believe is to fix the SYM_32 bug, then provide the patterns for cases of where intermediate/end nodes such as MipsHi / MipsLo can appear without an add appearing as their parent node.

[1] There's a bug there too! it should be hasSym32() rather than HasSym32().

llvm/lib/Target/Mips/Mips64InstrInfo.td
687 ↗	(On Diff #215143)	I don't believe this comment is relevant now.
llvm/test/CodeGen/Mips/global-address-with-mask.ll
1 ↗	(On Diff #215143)	Call this file pr42736.ll

This revision now requires changes to proceed.Aug 15 2019, 12:55 PM

draganm added a subscriber: draganm.Aug 15 2019, 11:14 PM

@sdardis What is the difference between MipsHi and MipsLo SD nodes? Some of their patterns are the same, e.g. :

def : MipsPat<(add GPR64:$hi, (MipsHi (i64 tglobaladdr:$lo))),
              (DADDiu GPR64:$hi, tglobaladdr:$lo)>, ISA_MIPS3, GPR_64, SYM_64;

def : MipsPat<(add GPR64:$hi, (MipsLo (i64 tglobaladdr:$lo))),
              (DADDiu GPR64:$hi, tglobaladdr:$lo)>, ISA_MIPS3, GPR_64, SYM_64;

Both act like "glue" between add/shift/nothing node and node with part of the address (controlled by target flag) and are used for pattern match into lui or addiu.
They are used interchangeably but I didnt find any comments about what these nodes actually do.
MipsHi can be alone (lui) or used in combine with add (into addiu) according to patterns. By the name I would use MipsLo if I wanted it to combine with add into addiu, not MipsHi.
Aren't all of the MipsHighest, MipsHigher, MipsHi and MipsLo equivalent to some Mips16bitImm node
that would have patterns
Mips16bitImm + shl 16 -> lui
Mips16bitImm + add -> addiu
Mips16bitImm alone -> addiu with zero
but then instead of using MipsHi and MipsHighest alone and expecting lui to be selected we need to make two nodes: Mips16bitImm and shl 16.
MipsHigher and MipsLo are fine if we just replace them with Mips16bitImm.
What difference does SYM_32/SYM_64 make, all selected instructions use only 16 bits of something that is resolved later?

In D66228#1632790, @Petar.Avramovic wrote:

@sdardis What is the difference between MipsHi and MipsLo SD nodes? Some of their patterns are the same, e.g. :

Maybe the difference is in the relocations related to each nodes.

In getAddrNonPICSym64 Relocation looks to come from another SDValue (Hi bellow)
MipsISD::Hi doesn't seem to be connected with any of the target flags (like MipsII::MO_ABS_HI)

SDValue Shift = DAG.getNode(ISD::SHL ...

SDValue Hi = getTargetNode(N, Ty, DAG, MipsII::MO_ABS_HI);	<-Relocation
SDValue Add = DAG.getNode(ISD::ADD, DL, Ty, Shift,		<-Add should become Daddiu when combined with MipsISD::Hi node(below) and  SDValue Hi (above)
                          DAG.getNode(MipsISD::Hi, DL, Ty, Hi));  <-Glue Node, pattern matches immediate daddiu instead of ordinary daddu. This can be whatever as long as these three nodes give Daddiu with imm that is defined in SDValue Hi(MipsISD::Lo also works)

Thanks for review. Could you clarify some points in your comments?

In D66228#1632060, @sdardis wrote:

The correct approach I believe is to fix the SYM_32 bug, then provide the patterns for cases of where intermediate/end nodes such as MipsHi / MipsLo can appear without an add appearing as their parent node.

If we do not change the getAddrNonPICSym64 function, at some point (before lowering) we anyway(?) get the following chain of commands (add (shl (%hi(sym), 16), %lo(%sym)). How can we lower it to a correct set of instructions, if we do not have a pattern with the shl? Please correct we if I'm wrong.
What do you mean by "patterns for cases of where intermediate/end nodes such as MipsHi / MipsLo can appear without an add appearing as their parent node"? Do you mean a pattern like add zero, MipsHi(tglobaladdr), or just MipsHi(tglobaladdr) or something else?

In D66228#1635859, @atanasyan wrote:

Thanks for review. Could you clarify some points in your comments?

In D66228#1632060, @sdardis wrote:

The correct approach I believe is to fix the SYM_32 bug, then provide the patterns for cases of where intermediate/end nodes such as MipsHi / MipsLo can appear without an add appearing as their parent node.

If we do not change the getAddrNonPICSym64 function, at some point (before lowering) we anyway(?) get the following chain of commands (add (shl (%hi(sym), 16), %lo(%sym)). How can we lower it to a correct set of instructions, if we do not have a pattern with the shl? Please correct we if I'm wrong.

Match the MipsHi to daddiu zero, MipsHi. The shift will move the Hi value to the correct place, this is correct behaviour. If you look at the stanza of patterns for Higher, you'll notice they're duplicated, both (MipsHigher (i64 symboltype:%sum)) and (add $highest (MipsHigher (i64 symboltype:%sum))). Providing the first set of patterns for (MipsHi (i64 symbol)) will allow DAGISel to pick the correct instructions. We can''t use LUi64 in this case as it would sign extend the symbol value into the upper 32 bits.

What do you mean by "patterns for cases of where intermediate/end nodes such as MipsHi / MipsLo can appear without an add appearing as their parent node"? Do you mean a pattern like add zero, MipsHi(tglobaladdr), or just MipsHi(tglobaladdr) or something else?

We need to ensure that (add Hi$, MipsHi/Higher(tglobaladdr)) can be selected for the chain of nodes created by getAddrNonPICSym64. If that series of nodes is modified we could have a case where the .td patterns don't match what getAddrNonPICSym64 produce, and DAGISel sees (shl (MipsHi (tglobaladdr), 16). We need two sets of patterns, one for (add (MipsHi ...)) and (MipsHi ..).

Also, without fixing the sym32 bug, this patch won't produce the correct result.

In D66228#1632790, @Petar.Avramovic wrote:

...
What difference does SYM_32/SYM_64 make, all selected instructions use only 16 bits of something that is resolved later?

SYM_32 is a special submode of the N64 ABI where symbols are 32bits wide. This means embedded/kernel developers who precisely know how their memory map is laid out can save on code-size.

atanasyan added a child revision: D66553: [mips] Reduce number of instructions used for loading a global symbol's value.Aug 21 2019, 1:40 PM

atanasyan removed a child revision: D66553: [mips] Reduce number of instructions used for loading a global symbol's value.Aug 21 2019, 1:44 PM

atanasyan added a parent revision: D66771: [mips] Use less registers to load address of TargetExternalSymbol.Aug 26 2019, 3:46 PM

Fix SYM_32/SYM64 predicates definitions
Add missed patterns to load 64-bit symbol's address

LGTM. Let's wait for @sdardis.

LGTM.

llvm/lib/Target/Mips/Mips.td
28 ↗	(On Diff #217256)	Nit: Predicates for a symbol's sizee such as hasSym32.
llvm/lib/Target/Mips/Mips64InstrInfo.td
654 ↗	(On Diff #217256)	Touch up the formatting in a separate NFC commit.

This revision is now accepted and ready to land.Aug 28 2019, 12:46 PM

Thanks for review.

Closed by commit rL370268: [mips] Fix 64-bit address loading in case of applying 32-bit mask to the result (authored by atanasyan). · Explain WhyAug 28 2019, 3:36 PM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

llvm/

trunk/

lib/

Target/

Mips/

Mips.td

3 lines

Mips64InstrInfo.td

34 lines

MipsInstrInfo.td

4 lines

test/

CodeGen/

Mips/

indirect-jump-hazard/

long-calls.ll

14 lines

long-calls.ll

6 lines

pr42736.ll

28 lines

Diff 217732

llvm/trunk/lib/Target/Mips/Mips.td

	Show All 19 Lines
	// having to re-add all the existing predicates.			// having to re-add all the existing predicates.
	class PredicateControl {			class PredicateControl {
	// Predicates for the encoding scheme in use such as HasStdEnc			// Predicates for the encoding scheme in use such as HasStdEnc
	list<Predicate> EncodingPredicates = [];			list<Predicate> EncodingPredicates = [];
	// Predicates for the GPR size such as IsGP64bit			// Predicates for the GPR size such as IsGP64bit
	list<Predicate> GPRPredicates = [];			list<Predicate> GPRPredicates = [];
	// Predicates for the PTR size such as IsPTR64bit			// Predicates for the PTR size such as IsPTR64bit
	list<Predicate> PTRPredicates = [];			list<Predicate> PTRPredicates = [];
				// Predicates for a symbol's size such as hasSym32.
				list<Predicate> SYMPredicates = [];
	// Predicates for the FGR size and layout such as IsFP64bit			// Predicates for the FGR size and layout such as IsFP64bit
	list<Predicate> FGRPredicates = [];			list<Predicate> FGRPredicates = [];
	// Predicates for the instruction group membership such as ISA's.			// Predicates for the instruction group membership such as ISA's.
	list<Predicate> InsnPredicates = [];			list<Predicate> InsnPredicates = [];
	// Predicate for the ASE that an instruction belongs to.			// Predicate for the ASE that an instruction belongs to.
	list<Predicate> ASEPredicate = [];			list<Predicate> ASEPredicate = [];
	// Predicate for marking the instruction as usable in hard-float mode only.			// Predicate for marking the instruction as usable in hard-float mode only.
	list<Predicate> HardFloatPredicate = [];			list<Predicate> HardFloatPredicate = [];
	// Predicates for anything else			// Predicates for anything else
	list<Predicate> AdditionalPredicates = [];			list<Predicate> AdditionalPredicates = [];
	list<Predicate> Predicates = !listconcat(EncodingPredicates,			list<Predicate> Predicates = !listconcat(EncodingPredicates,
	GPRPredicates,			GPRPredicates,
	PTRPredicates,			PTRPredicates,
				SYMPredicates,
	FGRPredicates,			FGRPredicates,
	InsnPredicates,			InsnPredicates,
	HardFloatPredicate,			HardFloatPredicate,
	ASEPredicate,			ASEPredicate,
	AdditionalPredicates);			AdditionalPredicates);
	}			}

	// Like Requires<> but for the AdditionalPredicates list			// Like Requires<> but for the AdditionalPredicates list
	▲ Show 20 Lines • Show All 211 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/Mips/Mips64InstrInfo.td

Show First 20 Lines • Show All 676 Lines • ▼ Show 20 Lines	def : MipsPat<(add GPR64:$hi, (MipsHigher (i64 tglobaladdr:$lo))),
(DADDiu GPR64:$hi, tglobaladdr:$lo)>, ISA_MIPS3, GPR_64, SYM_64;		(DADDiu GPR64:$hi, tglobaladdr:$lo)>, ISA_MIPS3, GPR_64, SYM_64;
def : MipsPat<(add GPR64:$hi, (MipsHigher (i64 tblockaddress:$lo))),		def : MipsPat<(add GPR64:$hi, (MipsHigher (i64 tblockaddress:$lo))),
(DADDiu GPR64:$hi, tblockaddress:$lo)>, ISA_MIPS3, GPR_64,		(DADDiu GPR64:$hi, tblockaddress:$lo)>, ISA_MIPS3, GPR_64,
SYM_64;		SYM_64;
def : MipsPat<(add GPR64:$hi, (MipsHigher (i64 tjumptable:$lo))),		def : MipsPat<(add GPR64:$hi, (MipsHigher (i64 tjumptable:$lo))),
(DADDiu GPR64:$hi, tjumptable:$lo)>, ISA_MIPS3, GPR_64, SYM_64;		(DADDiu GPR64:$hi, tjumptable:$lo)>, ISA_MIPS3, GPR_64, SYM_64;
def : MipsPat<(add GPR64:$hi, (MipsHigher (i64 tconstpool:$lo))),		def : MipsPat<(add GPR64:$hi, (MipsHigher (i64 tconstpool:$lo))),
(DADDiu GPR64:$hi, tconstpool:$lo)>, ISA_MIPS3, GPR_64, SYM_64;		(DADDiu GPR64:$hi, tconstpool:$lo)>, ISA_MIPS3, GPR_64, SYM_64;
		def : MipsPat<(add GPR64:$hi, (MipsHigher (i64 texternalsym:$lo))),
		(DADDiu GPR64:$hi, texternalsym:$lo)>,
		ISA_MIPS3, GPR_64, SYM_64;

		def : MipsPat<(MipsHi (i64 tglobaladdr:$in)),
		(DADDiu ZERO_64, tglobaladdr:$in)>, ISA_MIPS3, GPR_64, SYM_64;
		def : MipsPat<(MipsHi (i64 tblockaddress:$in)),
		(DADDiu ZERO_64, tblockaddress:$in)>, ISA_MIPS3, GPR_64, SYM_64;
		def : MipsPat<(MipsHi (i64 tjumptable:$in)),
		(DADDiu ZERO_64, tjumptable:$in)>, ISA_MIPS3, GPR_64, SYM_64;
		def : MipsPat<(MipsHi (i64 tconstpool:$in)),
		(DADDiu ZERO_64, tconstpool:$in)>, ISA_MIPS3, GPR_64, SYM_64;
		def : MipsPat<(MipsHi (i64 texternalsym:$in)),
		(DADDiu ZERO_64, texternalsym:$in)>, ISA_MIPS3, GPR_64, SYM_64;

def : MipsPat<(add GPR64:$hi, (MipsHi (i64 tglobaladdr:$lo))),		def : MipsPat<(add GPR64:$hi, (MipsHi (i64 tglobaladdr:$lo))),
(DADDiu GPR64:$hi, tglobaladdr:$lo)>, ISA_MIPS3, GPR_64, SYM_64;		(DADDiu GPR64:$hi, tglobaladdr:$lo)>, ISA_MIPS3, GPR_64, SYM_64;
def : MipsPat<(add GPR64:$hi, (MipsHi (i64 tblockaddress:$lo))),		def : MipsPat<(add GPR64:$hi, (MipsHi (i64 tblockaddress:$lo))),
(DADDiu GPR64:$hi, tblockaddress:$lo)>, ISA_MIPS3, GPR_64,		(DADDiu GPR64:$hi, tblockaddress:$lo)>, ISA_MIPS3, GPR_64,
SYM_64;		SYM_64;
def : MipsPat<(add GPR64:$hi, (MipsHi (i64 tjumptable:$lo))),		def : MipsPat<(add GPR64:$hi, (MipsHi (i64 tjumptable:$lo))),
(DADDiu GPR64:$hi, tjumptable:$lo)>, ISA_MIPS3, GPR_64, SYM_64;		(DADDiu GPR64:$hi, tjumptable:$lo)>, ISA_MIPS3, GPR_64, SYM_64;
def : MipsPat<(add GPR64:$hi, (MipsHi (i64 tconstpool:$lo))),		def : MipsPat<(add GPR64:$hi, (MipsHi (i64 tconstpool:$lo))),
(DADDiu GPR64:$hi, tconstpool:$lo)>, ISA_MIPS3, GPR_64, SYM_64;		(DADDiu GPR64:$hi, tconstpool:$lo)>, ISA_MIPS3, GPR_64, SYM_64;
		def : MipsPat<(add GPR64:$hi, (MipsHi (i64 texternalsym:$lo))),
		(DADDiu GPR64:$hi, texternalsym:$lo)>,
		ISA_MIPS3, GPR_64, SYM_64;

		def : MipsPat<(MipsLo (i64 tglobaladdr:$in)),
		(DADDiu ZERO_64, tglobaladdr:$in)>, ISA_MIPS3, GPR_64, SYM_64;
		def : MipsPat<(MipsLo (i64 tblockaddress:$in)),
		(DADDiu ZERO_64, tblockaddress:$in)>, ISA_MIPS3, GPR_64, SYM_64;
		def : MipsPat<(MipsLo (i64 tjumptable:$in)),
		(DADDiu ZERO_64, tjumptable:$in)>, ISA_MIPS3, GPR_64, SYM_64;
		def : MipsPat<(MipsLo (i64 tconstpool:$in)),
		(DADDiu ZERO_64, tconstpool:$in)>, ISA_MIPS3, GPR_64, SYM_64;
		def : MipsPat<(MipsLo (i64 tglobaltlsaddr:$in)),
		(DADDiu ZERO_64, tglobaltlsaddr:$in)>,
		ISA_MIPS3, GPR_64, SYM_64;
		def : MipsPat<(MipsLo (i64 texternalsym:$in)),
		(DADDiu ZERO_64, texternalsym:$in)>, ISA_MIPS3, GPR_64, SYM_64;

def : MipsPat<(add GPR64:$hi, (MipsLo (i64 tglobaladdr:$lo))),		def : MipsPat<(add GPR64:$hi, (MipsLo (i64 tglobaladdr:$lo))),
(DADDiu GPR64:$hi, tglobaladdr:$lo)>, ISA_MIPS3, GPR_64, SYM_64;		(DADDiu GPR64:$hi, tglobaladdr:$lo)>, ISA_MIPS3, GPR_64, SYM_64;
def : MipsPat<(add GPR64:$hi, (MipsLo (i64 tblockaddress:$lo))),		def : MipsPat<(add GPR64:$hi, (MipsLo (i64 tblockaddress:$lo))),
(DADDiu GPR64:$hi, tblockaddress:$lo)>, ISA_MIPS3, GPR_64,		(DADDiu GPR64:$hi, tblockaddress:$lo)>, ISA_MIPS3, GPR_64,
SYM_64;		SYM_64;
def : MipsPat<(add GPR64:$hi, (MipsLo (i64 tjumptable:$lo))),		def : MipsPat<(add GPR64:$hi, (MipsLo (i64 tjumptable:$lo))),
(DADDiu GPR64:$hi, tjumptable:$lo)>, ISA_MIPS3, GPR_64, SYM_64;		(DADDiu GPR64:$hi, tjumptable:$lo)>, ISA_MIPS3, GPR_64, SYM_64;
def : MipsPat<(add GPR64:$hi, (MipsLo (i64 tconstpool:$lo))),		def : MipsPat<(add GPR64:$hi, (MipsLo (i64 tconstpool:$lo))),
(DADDiu GPR64:$hi, tconstpool:$lo)>, ISA_MIPS3, GPR_64, SYM_64;		(DADDiu GPR64:$hi, tconstpool:$lo)>, ISA_MIPS3, GPR_64, SYM_64;
def : MipsPat<(add GPR64:$hi, (MipsLo (i64 tglobaltlsaddr:$lo))),		def : MipsPat<(add GPR64:$hi, (MipsLo (i64 tglobaltlsaddr:$lo))),
(DADDiu GPR64:$hi, tglobaltlsaddr:$lo)>, ISA_MIPS3, GPR_64,		(DADDiu GPR64:$hi, tglobaltlsaddr:$lo)>, ISA_MIPS3, GPR_64,
SYM_64;		SYM_64;
		def : MipsPat<(add GPR64:$hi, (MipsLo (i64 texternalsym:$lo))),
		(DADDiu GPR64:$hi, texternalsym:$lo)>,
		ISA_MIPS3, GPR_64, SYM_64;
}		}

// gp_rel relocs		// gp_rel relocs
def : MipsPat<(add GPR64:$gp, (MipsGPRel tglobaladdr:$in)),		def : MipsPat<(add GPR64:$gp, (MipsGPRel tglobaladdr:$in)),
(DADDiu GPR64:$gp, tglobaladdr:$in)>, ISA_MIPS3, ABI_N64;		(DADDiu GPR64:$gp, tglobaladdr:$in)>, ISA_MIPS3, ABI_N64;
def : MipsPat<(add GPR64:$gp, (MipsGPRel tconstpool:$in)),		def : MipsPat<(add GPR64:$gp, (MipsGPRel tconstpool:$in)),
(DADDiu GPR64:$gp, tconstpool:$in)>, ISA_MIPS3, ABI_N64;		(DADDiu GPR64:$gp, tconstpool:$in)>, ISA_MIPS3, ABI_N64;

▲ Show 20 Lines • Show All 472 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/Mips/MipsInstrInfo.td

	Show First 20 Lines • Show All 205 Lines • ▼ Show 20 Lines
	def InMips16Mode : Predicate<"Subtarget->inMips16Mode()">,			def InMips16Mode : Predicate<"Subtarget->inMips16Mode()">,
	AssemblerPredicate<"FeatureMips16">;			AssemblerPredicate<"FeatureMips16">;
	def NotInMips16Mode : Predicate<"!Subtarget->inMips16Mode()">,			def NotInMips16Mode : Predicate<"!Subtarget->inMips16Mode()">,
	AssemblerPredicate<"!FeatureMips16">;			AssemblerPredicate<"!FeatureMips16">;
	def HasCnMips : Predicate<"Subtarget->hasCnMips()">,			def HasCnMips : Predicate<"Subtarget->hasCnMips()">,
	AssemblerPredicate<"FeatureCnMips">;			AssemblerPredicate<"FeatureCnMips">;
	def NotCnMips : Predicate<"!Subtarget->hasCnMips()">,			def NotCnMips : Predicate<"!Subtarget->hasCnMips()">,
	AssemblerPredicate<"!FeatureCnMips">;			AssemblerPredicate<"!FeatureCnMips">;
	def IsSym32 : Predicate<"Subtarget->HasSym32()">,			def IsSym32 : Predicate<"Subtarget->hasSym32()">,
	AssemblerPredicate<"FeatureSym32">;			AssemblerPredicate<"FeatureSym32">;
	def IsSym64 : Predicate<"!Subtarget->HasSym32()">,			def IsSym64 : Predicate<"!Subtarget->hasSym32()">,
	AssemblerPredicate<"!FeatureSym32">;			AssemblerPredicate<"!FeatureSym32">;
	def IsN64 : Predicate<"Subtarget->isABI_N64()">;			def IsN64 : Predicate<"Subtarget->isABI_N64()">;
	def IsNotN64 : Predicate<"!Subtarget->isABI_N64()">;			def IsNotN64 : Predicate<"!Subtarget->isABI_N64()">;
	def RelocNotPIC : Predicate<"!TM.isPositionIndependent()">;			def RelocNotPIC : Predicate<"!TM.isPositionIndependent()">;
	def RelocPIC : Predicate<"TM.isPositionIndependent()">;			def RelocPIC : Predicate<"TM.isPositionIndependent()">;
	def NoNaNsFPMath : Predicate<"TM.Options.NoNaNsFPMath">;			def NoNaNsFPMath : Predicate<"TM.Options.NoNaNsFPMath">;
	def UseAbs : Predicate<"Subtarget->inAbs2008Mode() \|\|"			def UseAbs : Predicate<"Subtarget->inAbs2008Mode() \|\|"
	"TM.Options.NoNaNsFPMath">;			"TM.Options.NoNaNsFPMath">;
	▲ Show 20 Lines • Show All 3,153 Lines • Show Last 20 Lines

llvm/trunk/test/CodeGen/Mips/indirect-jump-hazard/long-calls.ll

	Show First 20 Lines • Show All 68 Lines • ▼ Show 20 Lines
	; N64-NEXT: lui $1, %highest(callee)			; N64-NEXT: lui $1, %highest(callee)
	; N64-NEXT: daddiu $1, $1, %higher(callee)			; N64-NEXT: daddiu $1, $1, %higher(callee)
	; N64-NEXT: dsll $1, $1, 16			; N64-NEXT: dsll $1, $1, 16
	; N64-NEXT: daddiu $1, $1, %hi(callee)			; N64-NEXT: daddiu $1, $1, %hi(callee)
	; N64-NEXT: dsll $1, $1, 16			; N64-NEXT: dsll $1, $1, 16
	; N64-NEXT: daddiu $25, $1, %lo(callee)			; N64-NEXT: daddiu $25, $1, %lo(callee)
	; N64-NEXT: jalr.hb $25			; N64-NEXT: jalr.hb $25
	; N64-NEXT: nop			; N64-NEXT: nop
	; N64-NEXT: daddiu $1, $zero, %higher(memset)
	; N64-NEXT: lui $2, %highest(memset)
	; N64-NEXT: daddu $1, $2, $1
	; N64-NEXT: dsll $1, $1, 16
	; N64-NEXT: lui $2, %hi(memset)
	; N64-NEXT: daddu $1, $1, $2
	; N64-NEXT: dsll $1, $1, 16
	; N64-NEXT: daddiu $25, $1, %lo(memset)
	; N64-NEXT: lui $1, %highest(val)			; N64-NEXT: lui $1, %highest(val)
	; N64-NEXT: daddiu $1, $1, %higher(val)			; N64-NEXT: daddiu $1, $1, %higher(val)
	; N64-NEXT: dsll $1, $1, 16			; N64-NEXT: dsll $1, $1, 16
	; N64-NEXT: daddiu $1, $1, %hi(val)			; N64-NEXT: daddiu $1, $1, %hi(val)
	; N64-NEXT: dsll $1, $1, 16			; N64-NEXT: dsll $1, $1, 16
				; N64-NEXT: lui $2, %highest(memset)
	; N64-NEXT: daddiu $4, $1, %lo(val)			; N64-NEXT: daddiu $4, $1, %lo(val)
				; N64-NEXT: daddiu $1, $2, %higher(memset)
				; N64-NEXT: dsll $1, $1, 16
				; N64-NEXT: daddiu $1, $1, %hi(memset)
				; N64-NEXT: dsll $1, $1, 16
				; N64-NEXT: daddiu $25, $1, %lo(memset)
	; N64-NEXT: daddiu $5, $zero, 0			; N64-NEXT: daddiu $5, $zero, 0
	; N64-NEXT: jalr.hb $25			; N64-NEXT: jalr.hb $25
	; N64-NEXT: daddiu $6, $zero, 80			; N64-NEXT: daddiu $6, $zero, 80
	; N64-NEXT: ld $ra, 8($sp) # 8-byte Folded Reload			; N64-NEXT: ld $ra, 8($sp) # 8-byte Folded Reload
	; N64-NEXT: jr $ra			; N64-NEXT: jr $ra
	; N64-NEXT: daddiu $sp, $sp, 16			; N64-NEXT: daddiu $sp, $sp, 16
	call void @callee()			call void @callee()
	call void @llvm.memset.p0i8.i32(i8* align 4 bitcast ([20 x i32]* @val to i8*), i8 0, i32 80, i1 false)			call void @llvm.memset.p0i8.i32(i8* align 4 bitcast ([20 x i32]* @val to i8*), i8 0, i32 80, i1 false)
	ret void			ret void
	}			}

llvm/trunk/test/CodeGen/Mips/long-calls.ll

	Show All 37 Lines
	; ON32: jalr $25			; ON32: jalr $25

	; ON64: lui $1, %highest(callee)			; ON64: lui $1, %highest(callee)
	; ON64: daddiu $1, $1, %higher(callee)			; ON64: daddiu $1, $1, %higher(callee)
	; ON64: daddiu $1, $1, %hi(callee)			; ON64: daddiu $1, $1, %hi(callee)
	; ON64: daddiu $25, $1, %lo(callee)			; ON64: daddiu $25, $1, %lo(callee)
	; ON64: jalr $25			; ON64: jalr $25

	; ON64: daddiu $1, $zero, %higher(memset)
	; ON64: lui $2, %highest(memset)			; ON64: lui $2, %highest(memset)
	; ON64: lui $2, %hi(memset)			; ON64: daddiu $1, $2, %higher(memset)
				; ON64: dsll $1, $1, 16
				; ON64: daddiu $1, $1, %hi(memset)
				; ON64: dsll $1, $1, 16
	; ON64: daddiu $25, $1, %lo(memset)			; ON64: daddiu $25, $1, %lo(memset)
	; ON64: jalr $25			; ON64: jalr $25

	call void @callee()			call void @callee()
	call void @llvm.memset.p0i8.i32(i8* align 4 bitcast ([20 x i32]* @val to i8*), i8 0, i32 80, i1 false)			call void @llvm.memset.p0i8.i32(i8* align 4 bitcast ([20 x i32]* @val to i8*), i8 0, i32 80, i1 false)
	ret void			ret void
	}			}

llvm/trunk/test/CodeGen/Mips/pr42736.ll

				; RUN: llc -mtriple=mips64-linux-gnuabi64 \
				; RUN: -relocation-model=pic < %s \| FileCheck %s -check-prefix=PIC
				; RUN: llc -mtriple=mips64-linux-gnuabi64 \
				; RUN: -relocation-model=static < %s \| FileCheck %s -check-prefix=STATIC

				define void @bar1() nounwind {
				entry:
				; PIC: lui $[[R0:[0-9]+]], 4095
				; PIC-NEXT: ori $[[R0]], $[[R0]], 65535
				; PIC-NEXT: ld $[[R1:[0-9]+]], %got_disp(foo)(${{[0-9]+}})
				; PIC-NEXT: and $[[R1]], $[[R1]], $[[R0]]
				; PIC-NEXT: sd $[[R1]]

				; STATIC: lui $[[R0:[0-9]+]], 4095
				; STATIC-NEXT: ori $[[R0]], $[[R0]], 65535
				; STATIC-NEXT: daddiu $[[R1:[0-9]+]], $zero, %hi(foo)
				; STATIC-NEXT: dsll $[[R1]], $[[R1]], 16
				; STATIC-NEXT: daddiu $[[R1]], $[[R1]], %lo(foo)
				; STATIC-NEXT: and $[[R0]], $[[R1]], $[[R0]]
				; STATIC-NEXT: sd $[[R0]]

				%val = alloca i64, align 8
				store i64 and (i64 ptrtoint (void ()* @foo to i64), i64 268435455), i64* %val, align 8
				%0 = load i64, i64* %val, align 8
				ret void
				}

				declare void @foo()