This is an archive of the discontinued LLVM Phabricator instance.

test/CodeGen/Mips/GlobalISel/legalizer/add.mir
262–266	Note: Machine function generated from input mir has different `fixedStack` than input: fixedStack: - { id: 0, offset: 16, size: 4, alignment: 8, stack-id: 0, isImmutable: true } - { id: 1, offset: 20, size: 4, alignment: 4, stack-id: 0, isImmutable: true } - { id: 2, offset: 24, size: 4, alignment: 8, stack-id: 0, isImmutable: true } - { id: 3, offset: 28, size: 4, alignment: 4, stack-id: 0, isImmutable: true } Let's look at these two instructions in output: %5:_(p0) = G_FRAME_INDEX %fixed-stack.0 %6:_(s32) = G_LOAD %5(p0) :: (load 4 from %fixed-stack.0, align 0) They correspond to these two in original input %10:_(p0) = G_FRAME_INDEX %fixed-stack.3 %6:_(s32) = G_LOAD %10(p0) :: (load 4 from %fixed-stack.3, align 0) We see that numbers of Vregs are different but that is fine as they are connected with instructions. On the other hand, `id`s in `fixedStack` are not connected with content (content with `id:3` in input corresponds to content with `id:0` in output). We cannot see the connection between e.g. Vreg of `G_LOAD` instruction and its offset in test. Looking at `fixedStack` at input may lead to misinterpretation of this test. Any comments on this?

Ping.

Sorry for the belated reply.

In general, the patch is looking good to me. Unfortunately I cannot comment the note about fixedStack because do not know this part of code good enough. Probably you need to extend the list of reviewers or ask a separate question on the llvm-dev mail list.

In D54580#1309484, @atanasyan wrote:

Sorry for the belated reply.

In general, the patch is looking good to me. Unfortunately I cannot comment the note about fixedStack because do not know this part of code good enough. Probably you need to extend the list of reviewers or ask a separate question on the llvm-dev mail list.

I'm not sure either but fwiw the id's being renumbered between input and output sounds like a bug to me. It's going to be difficult to write tests if they can re-number like that.

Hi, what happens here (when we use -run-pass) is that mir input given in test is not same as MachineFunction given to legalizer. The reason looks to be that constructors for objects are not called in same order.

This is part of MIR input.

name:            add_i128
alignment:       2
tracksRegLiveness: true
fixedStack:
  - { id: 0, offset: 28, size: 4, alignment: 4, stack-id: 0, isImmutable: true }
  - { id: 1, offset: 24, size: 4, alignment: 8, stack-id: 0, isImmutable: true }
  - { id: 2, offset: 20, size: 4, alignment: 4, stack-id: 0, isImmutable: true }
  - { id: 3, offset: 16, size: 4, alignment: 8, stack-id: 0, isImmutable: true }
body:             |
  bb.1.entry:
    liveins: $a0, $a1, $a2, $a3

    %2:_(s32) = COPY $a0
    ...
    %0:_(s128) = G_MERGE_VALUES %2(s32), %3(s32), %4(s32), %5(s32)
    %10:_(p0) = G_FRAME_INDEX %fixed-stack.3
    %6:_(s32) = G_LOAD %10(p0) :: (load 4 from %fixed-stack.3, align 0)
    ...
    %1:_(s128) = G_MERGE_VALUES %6(s32), %7(s32), %8(s32), %9(s32)
    %14:_(s128) = G_ADD %1, %0
    ...

It looks that we parse line by line and call createGenericVirtualRegister (defs in input go %2, %3, %4, %5, %0, while in reconstructed MF they go %0, %1, %2,...).
This is part of reconstructed MF from start of legalizer pass.

# Machine code for function add_i128: IsSSA, NoPHIs, TracksLiveness
Frame Objects:
  fi#-4: size=4, align=8, fixed, at location [SP+16]
  fi#-3: size=4, align=4, fixed, at location [SP+20]
  fi#-2: size=4, align=8, fixed, at location [SP+24]
  fi#-1: size=4, align=4, fixed, at location [SP+28]

bb.0.entry:
  liveins: $a0, $a1, $a2, $a3
  %0:_(s32) = COPY $a0
  ...
  %4:_(s128) = G_MERGE_VALUES %0:_(s32), %1:_(s32), %2:_(s32), %3:_(s32)
  %5:_(p0) = G_FRAME_INDEX %fixed-stack.0
  %6:_(s32) = G_LOAD %5:_(p0) :: (load 4 from %fixed-stack.0, align 0)
  ...
  %13:_(s128) = G_MERGE_VALUES %6:_(s32), %8:_(s32), %10:_(s32), %12:_(s32)
  %14:_(s128) = G_ADD %13:_, %4:_
  ...

Our scripts for test generation deal with re-numbering of VRegs with names and regular expressions.
When we look at check lines we can figure out that regular expression [0-9]+ in [[COPY:%[0-9]+]] corresponds to %2 as they have same right part of instruction i.e. " = COPY $a0".
Next, if we look at %0:_(s128) = G_MERGE_VALUES %2(s32), %3(s32), %4(s32), %5(s32) we realise that COPY are lowest bits while COPY3 are highest bits of an i128.
Lowest bits of next i128 are in stack with offset 16. If we could see second block of code it would be clear that VReg with name LOAD holds bits from SP+16.
But since we don't, we and could easily associate LOAD with bits form SP+28 as we cannot see how fixedStack looks in reconstructed MF.

I am not sure if this is classified as a bug since check lines are fine as they just save output that current legalizer version produces for given mir input.
If we changed something later and this test failed we could have hard time interpreting why since we lack some information (that fixedStack looks different in reconstructed MF).

One solution could be to have content in %fixed-stack.0 next to an instruction

; MIPS32: [[FRAME_INDEX:%[0-9]+]]:_(p0) = G_FRAME_INDEX %fixed-stack.0    <fixedStack: { id: 0, offset: 16, size: 4, alignment: 8, stack-id: 0, isImmutable: true }>
; MIPS32: [[LOAD:%[0-9]+]]:_(s32) = G_LOAD [[FRAME_INDEX]](p0) :: (load 4 from %fixed-stack.0, align 0)   <fixedStack: { id: 0, offset: 16, size: 4, alignment: 8, stack-id: 0, isImmutable: true }>

another idea is to have content of fixed stack from output under comments.

---
name:            add_i128
alignment:       2
tracksRegLiveness: true
fixedStack:
  - { id: 0, offset: 28, size: 4, alignment: 4, stack-id: 0, isImmutable: true }
  - { id: 1, offset: 24, size: 4, alignment: 8, stack-id: 0, isImmutable: true }
  - { id: 2, offset: 20, size: 4, alignment: 4, stack-id: 0, isImmutable: true }
  - { id: 3, offset: 16, size: 4, alignment: 8, stack-id: 0, isImmutable: true }

;fixedStack:      
;  - { id: 0, offset: 16, size: 4, alignment: 8, stack-id: 0, isImmutable: true }
;  - { id: 1, offset: 20, size: 4, alignment: 4, stack-id: 0, isImmutable: true }
;  - { id: 2, offset: 24, size: 4, alignment: 8, stack-id: 0, isImmutable: true }
;  - { id: 3, offset: 28, size: 4, alignment: 4, stack-id: 0, isImmutable: true }
body:             |
  ...

In D54580#1310009, @Petar.Avramovic wrote:
Hi, what happens here (when we use -run-pass) is that mir input given in test is not same as MachineFunction given to legalizer. The reason looks to be that constructors for objects are not called in same order.

This is part of MIR input.
name:            add_i128
alignment:       2
tracksRegLiveness: true
fixedStack:
  - { id: 0, offset: 28, size: 4, alignment: 4, stack-id: 0, isImmutable: true }
  - { id: 1, offset: 24, size: 4, alignment: 8, stack-id: 0, isImmutable: true }
  - { id: 2, offset: 20, size: 4, alignment: 4, stack-id: 0, isImmutable: true }
  - { id: 3, offset: 16, size: 4, alignment: 8, stack-id: 0, isImmutable: true }
body:             |
  bb.1.entry:
    liveins: $a0, $a1, $a2, $a3

    %2:_(s32) = COPY $a0
    ...
    %0:_(s128) = G_MERGE_VALUES %2(s32), %3(s32), %4(s32), %5(s32)
    %10:_(p0) = G_FRAME_INDEX %fixed-stack.3
    %6:_(s32) = G_LOAD %10(p0) :: (load 4 from %fixed-stack.3, align 0)
    ...
    %1:_(s128) = G_MERGE_VALUES %6(s32), %7(s32), %8(s32), %9(s32)
    %14:_(s128) = G_ADD %1, %0
    ...
It looks that we parse line by line and call createGenericVirtualRegister (defs in input go %2, %3, %4, %5, %0, while in reconstructed MF they go %0, %1, %2,...).

That's a different issue to the %fixedstack.<id> renumbering, vreg renumbering is expected. MIR is more lenient than LLVM-IR about what it will parse but it still numbers from 0 while printing. I'm surprised by the renumbering of the fixedStack ids because they are identified within the MachineFunction by their id and some of those values have special meanings (e.g. negative ids).

I see. Then we can agree that renumbering of the fixedStack ids is a bug not connected with this patch.
Are we ready to commit?

Similar to renumbering of vregs, fixedStack objects are created in different order then in original MF and that causes id renumbering.
In order to avoid id renumbering we could change lib/CodeGen/MIRParser/MIRParser.cpp accordingly.

We can create fixedStack objects bottom-up like this:

for (auto RIt = YamlMF.FixedStackObjects.rbegin();
     RIt != YamlMF.FixedStackObjects.rend(); ++RIt) {
  const auto &Object = *RIt;
...

This way fixedStack objects are created in the same order as in the original MF and there is no id renumbering.

fixedStack objects are currently created in top-down order.

for (const auto &Object : YamlMF.FixedStackObjects) {

I can prepare another patch to fix renumbering of the fixedStack ids.

Ping.

In D54580#1311203, @Petar.Avramovic wrote:

I see. Then we can agree that renumbering of the fixedStack ids is a bug not connected with this patch.
Are we ready to commit?

...

I can prepare another patch to fix renumbering of the fixedStack ids.

As to me, I like this way.

This revision is now accepted and ready to land.Dec 16 2018, 12:56 PM

Closed by commit rL349346: [MIPS GlobalISel] Lower G_UADDE and narrowScalar G_ADD (authored by Petar.Avramovic). · Explain WhyDec 17 2018, 4:34 AM

This revision was automatically updated to reflect the committed changes.

Herald added subscribers: llvm-commits, jrtc27. · View Herald TranscriptDec 17 2018, 4:34 AM

Revision Contents

Path

Size

lib/

CodeGen/

GlobalISel/

LegalizerHelper.cpp

18 lines

Target/

Mips/

MipsLegalizerInfo.cpp

35 lines

test/

CodeGen/

Mips/

GlobalISel/

legalizer/

add.mir

96 lines

llvm-ir/

add.ll

69 lines

Diff 174214

lib/CodeGen/GlobalISel/LegalizerHelper.cpp

Show First 20 Lines • Show All 1,057 Lines • ▼ Show 20 Lines	case TargetOpcode::G_ZEXTLOAD: {
return UnableToLegalize;		return UnableToLegalize;
}		}
case TargetOpcode::G_CTLZ_ZERO_UNDEF:		case TargetOpcode::G_CTLZ_ZERO_UNDEF:
case TargetOpcode::G_CTTZ_ZERO_UNDEF:		case TargetOpcode::G_CTTZ_ZERO_UNDEF:
case TargetOpcode::G_CTLZ:		case TargetOpcode::G_CTLZ:
case TargetOpcode::G_CTTZ:		case TargetOpcode::G_CTTZ:
case TargetOpcode::G_CTPOP:		case TargetOpcode::G_CTPOP:
return lowerBitCount(MI, TypeIdx, Ty);		return lowerBitCount(MI, TypeIdx, Ty);
		case G_UADDE: {
		unsigned Res = MI.getOperand(0).getReg();
		unsigned CarryOut = MI.getOperand(1).getReg();
		unsigned LHS = MI.getOperand(2).getReg();
		unsigned RHS = MI.getOperand(3).getReg();
		unsigned CarryIn = MI.getOperand(4).getReg();

		unsigned TmpRes = MRI.createGenericVirtualRegister(Ty);
		unsigned ZExtCarryIn = MRI.createGenericVirtualRegister(Ty);

		MIRBuilder.buildAdd(TmpRes, LHS, RHS);
		MIRBuilder.buildZExt(ZExtCarryIn, CarryIn);
		MIRBuilder.buildAdd(Res, TmpRes, ZExtCarryIn);
		MIRBuilder.buildICmp(CmpInst::ICMP_ULT, CarryOut, Res, LHS);

		MI.eraseFromParent();
		return Legalized;
		}
}		}
}		}

LegalizerHelper::LegalizeResult		LegalizerHelper::LegalizeResult
LegalizerHelper::fewerElementsVector(MachineInstr &MI, unsigned TypeIdx,		LegalizerHelper::fewerElementsVector(MachineInstr &MI, unsigned TypeIdx,
LLT NarrowTy) {		LLT NarrowTy) {
// FIXME: Don't know how to handle secondary types yet.		// FIXME: Don't know how to handle secondary types yet.
if (TypeIdx != 0)		if (TypeIdx != 0)
▲ Show 20 Lines • Show All 142 Lines • Show Last 20 Lines

lib/Target/Mips/MipsLegalizerInfo.cpp

Show All 14 Lines
#include "MipsTargetMachine.h"		#include "MipsTargetMachine.h"
#include "llvm/CodeGen/GlobalISel/LegalizerHelper.h"		#include "llvm/CodeGen/GlobalISel/LegalizerHelper.h"

using namespace llvm;		using namespace llvm;

MipsLegalizerInfo::MipsLegalizerInfo(const MipsSubtarget &ST) {		MipsLegalizerInfo::MipsLegalizerInfo(const MipsSubtarget &ST) {
using namespace TargetOpcode;		using namespace TargetOpcode;

		const LLT s1 = LLT::scalar(1);
const LLT s32 = LLT::scalar(32);		const LLT s32 = LLT::scalar(32);
const LLT s64 = LLT::scalar(64);
const LLT p0 = LLT::pointer(0, 32);		const LLT p0 = LLT::pointer(0, 32);

getActionDefinitionsBuilder(G_ADD)		getActionDefinitionsBuilder(G_ADD)
.legalFor({s32})		.legalFor({s32})
.minScalar(0, s32)		.clampScalar(0, s32, s32);
.customFor({s64});
		getActionDefinitionsBuilder(G_UADDE)
		.lowerFor({{s32, s1}});

getActionDefinitionsBuilder({G_LOAD, G_STORE})		getActionDefinitionsBuilder({G_LOAD, G_STORE})
.legalForCartesianProduct({p0, s32}, {p0});		.legalForCartesianProduct({p0, s32}, {p0});

getActionDefinitionsBuilder({G_AND, G_OR, G_XOR, G_SHL, G_ASHR, G_LSHR})		getActionDefinitionsBuilder({G_AND, G_OR, G_XOR, G_SHL, G_ASHR, G_LSHR})
.legalFor({s32});		.legalFor({s32});

getActionDefinitionsBuilder(G_ICMP)		getActionDefinitionsBuilder(G_ICMP)
Show All 21 Lines	bool MipsLegalizerInfo::legalizeCustom(MachineInstr &MI,
MachineRegisterInfo &MRI,		MachineRegisterInfo &MRI,
MachineIRBuilder &MIRBuilder) const {		MachineIRBuilder &MIRBuilder) const {

using namespace TargetOpcode;		using namespace TargetOpcode;

MIRBuilder.setInstr(MI);		MIRBuilder.setInstr(MI);

switch (MI.getOpcode()) {		switch (MI.getOpcode()) {
case G_ADD: {
unsigned Size = MRI.getType(MI.getOperand(0).getReg()).getSizeInBits();

const LLT sHalf = LLT::scalar(Size / 2);

unsigned RHSLow = MRI.createGenericVirtualRegister(sHalf);
unsigned RHSHigh = MRI.createGenericVirtualRegister(sHalf);
unsigned LHSLow = MRI.createGenericVirtualRegister(sHalf);
unsigned LHSHigh = MRI.createGenericVirtualRegister(sHalf);
unsigned ResLow = MRI.createGenericVirtualRegister(sHalf);
unsigned ResHigh = MRI.createGenericVirtualRegister(sHalf);
unsigned Carry = MRI.createGenericVirtualRegister(sHalf);
unsigned TmpResHigh = MRI.createGenericVirtualRegister(sHalf);

MIRBuilder.buildUnmerge({RHSLow, RHSHigh}, MI.getOperand(2).getReg());
MIRBuilder.buildUnmerge({LHSLow, LHSHigh}, MI.getOperand(1).getReg());

MIRBuilder.buildAdd(TmpResHigh, LHSHigh, RHSHigh);
MIRBuilder.buildAdd(ResLow, LHSLow, RHSLow);
MIRBuilder.buildICmp(CmpInst::ICMP_ULT, Carry, ResLow, LHSLow);
MIRBuilder.buildAdd(ResHigh, TmpResHigh, Carry);

MIRBuilder.buildMerge(MI.getOperand(0).getReg(), {ResLow, ResHigh});

MI.eraseFromParent();
break;
}
default:		default:
return false;		return false;
}		}

return true;		return true;
}		}

test/CodeGen/Mips/GlobalISel/legalizer/add.mir

# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py		# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py
# RUN: llc -O0 -mtriple=mipsel-linux-gnu -run-pass=legalizer -verify-machineinstrs %s -o - \| FileCheck %s -check-prefixes=MIPS32		# RUN: llc -O0 -mtriple=mipsel-linux-gnu -run-pass=legalizer -verify-machineinstrs %s -o - \| FileCheck %s -check-prefixes=MIPS32
--- \|		--- \|

define void @add_i32() {entry: ret void}		define void @add_i32() {entry: ret void}
define void @add_i8_sext() {entry: ret void}		define void @add_i8_sext() {entry: ret void}
define void @add_i8_zext() {entry: ret void}		define void @add_i8_zext() {entry: ret void}
define void @add_i8_aext() {entry: ret void}		define void @add_i8_aext() {entry: ret void}
define void @add_i16_sext() {entry: ret void}		define void @add_i16_sext() {entry: ret void}
define void @add_i16_zext() {entry: ret void}		define void @add_i16_zext() {entry: ret void}
define void @add_i16_aext() {entry: ret void}		define void @add_i16_aext() {entry: ret void}
define void @add_i64() {entry: ret void}		define void @add_i64() {entry: ret void}
		define void @add_i128() {entry: ret void}

...		...
---		---
name: add_i32		name: add_i32
alignment: 2		alignment: 2
tracksRegLiveness: true		tracksRegLiveness: true
body: \|		body: \|
bb.0.entry:		bb.0.entry:
▲ Show 20 Lines • Show All 200 Lines • ▼ Show 20 Lines	bb.1.entry:
liveins: $a0, $a1, $a2, $a3		liveins: $a0, $a1, $a2, $a3

; MIPS32-LABEL: name: add_i64		; MIPS32-LABEL: name: add_i64
; MIPS32: liveins: $a0, $a1, $a2, $a3		; MIPS32: liveins: $a0, $a1, $a2, $a3
; MIPS32: [[COPY:%[0-9]+]]:_(s32) = COPY $a0		; MIPS32: [[COPY:%[0-9]+]]:_(s32) = COPY $a0
; MIPS32: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1		; MIPS32: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1
; MIPS32: [[COPY2:%[0-9]+]]:_(s32) = COPY $a2		; MIPS32: [[COPY2:%[0-9]+]]:_(s32) = COPY $a2
; MIPS32: [[COPY3:%[0-9]+]]:_(s32) = COPY $a3		; MIPS32: [[COPY3:%[0-9]+]]:_(s32) = COPY $a3
; MIPS32: [[ADD:%[0-9]+]]:_(s32) = G_ADD [[COPY2]], [[COPY]]		; MIPS32: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 0
; MIPS32: [[ADD1:%[0-9]+]]:_(s32) = G_ADD [[COPY3]], [[COPY1]]		; MIPS32: [[ADD:%[0-9]+]]:_(s32) = G_ADD [[COPY3]], [[COPY1]]
		; MIPS32: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 1
		; MIPS32: [[COPY4:%[0-9]+]]:_(s32) = COPY [[C]](s32)
		; MIPS32: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY4]], [[C1]]
		; MIPS32: [[ADD1:%[0-9]+]]:_(s32) = G_ADD [[ADD]], [[AND]]
; MIPS32: [[ICMP:%[0-9]+]]:_(s32) = G_ICMP intpred(ult), [[ADD1]](s32), [[COPY3]]		; MIPS32: [[ICMP:%[0-9]+]]:_(s32) = G_ICMP intpred(ult), [[ADD1]](s32), [[COPY3]]
; MIPS32: [[ADD2:%[0-9]+]]:_(s32) = G_ADD [[ADD]], [[ICMP]]		; MIPS32: [[ADD2:%[0-9]+]]:_(s32) = G_ADD [[COPY2]], [[COPY]]
; MIPS32: $v0 = COPY [[ADD2]](s32)		; MIPS32: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 1
		; MIPS32: [[COPY5:%[0-9]+]]:_(s32) = COPY [[ICMP]](s32)
		; MIPS32: [[AND1:%[0-9]+]]:_(s32) = G_AND [[COPY5]], [[C2]]
		; MIPS32: [[ADD3:%[0-9]+]]:_(s32) = G_ADD [[ADD2]], [[AND1]]
		; MIPS32: $v0 = COPY [[ADD3]](s32)
; MIPS32: $v1 = COPY [[ADD1]](s32)		; MIPS32: $v1 = COPY [[ADD1]](s32)
; MIPS32: RetRA implicit $v0, implicit $v1		; MIPS32: RetRA implicit $v0, implicit $v1
%2:_(s32) = COPY $a0		%2:_(s32) = COPY $a0
%3:_(s32) = COPY $a1		%3:_(s32) = COPY $a1
%0:_(s64) = G_MERGE_VALUES %3(s32), %2(s32)		%0:_(s64) = G_MERGE_VALUES %3(s32), %2(s32)
%4:_(s32) = COPY $a2		%4:_(s32) = COPY $a2
%5:_(s32) = COPY $a3		%5:_(s32) = COPY $a3
%1:_(s64) = G_MERGE_VALUES %5(s32), %4(s32)		%1:_(s64) = G_MERGE_VALUES %5(s32), %4(s32)
%6:_(s64) = G_ADD %1, %0		%6:_(s64) = G_ADD %1, %0
%7:_(s32), %8:_(s32) = G_UNMERGE_VALUES %6(s64)		%7:_(s32), %8:_(s32) = G_UNMERGE_VALUES %6(s64)
$v0 = COPY %8(s32)		$v0 = COPY %8(s32)
$v1 = COPY %7(s32)		$v1 = COPY %7(s32)
RetRA implicit $v0, implicit $v1		RetRA implicit $v0, implicit $v1

...		...
		---
		name: add_i128
		alignment: 2
		tracksRegLiveness: true
		fixedStack:
		- { id: 0, offset: 28, size: 4, alignment: 4, stack-id: 0, isImmutable: true }
		- { id: 1, offset: 24, size: 4, alignment: 8, stack-id: 0, isImmutable: true }
		- { id: 2, offset: 20, size: 4, alignment: 4, stack-id: 0, isImmutable: true }
		- { id: 3, offset: 16, size: 4, alignment: 8, stack-id: 0, isImmutable: true }
		Petar.AvramovicAuthorUnsubmitted Not Done Reply Inline Actions Note: Machine function generated from input mir has different `fixedStack` than input: fixedStack: - { id: 0, offset: 16, size: 4, alignment: 8, stack-id: 0, isImmutable: true } - { id: 1, offset: 20, size: 4, alignment: 4, stack-id: 0, isImmutable: true } - { id: 2, offset: 24, size: 4, alignment: 8, stack-id: 0, isImmutable: true } - { id: 3, offset: 28, size: 4, alignment: 4, stack-id: 0, isImmutable: true } Let's look at these two instructions in output: %5:_(p0) = G_FRAME_INDEX %fixed-stack.0 %6:_(s32) = G_LOAD %5(p0) :: (load 4 from %fixed-stack.0, align 0) They correspond to these two in original input %10:_(p0) = G_FRAME_INDEX %fixed-stack.3 %6:_(s32) = G_LOAD %10(p0) :: (load 4 from %fixed-stack.3, align 0) We see that numbers of Vregs are different but that is fine as they are connected with instructions. On the other hand, `id`s in `fixedStack` are not connected with content (content with `id:3` in input corresponds to content with `id:0` in output). We cannot see the connection between e.g. Vreg of `G_LOAD` instruction and its offset in test. Looking at `fixedStack` at input may lead to misinterpretation of this test. Any comments on this? Petar.Avramovic: Note: Machine function generated from input mir has different `fixedStack` than input: ```…
		body: \|
		bb.1.entry:
		liveins: $a0, $a1, $a2, $a3

		; MIPS32-LABEL: name: add_i128
		; MIPS32: liveins: $a0, $a1, $a2, $a3
		; MIPS32: [[COPY:%[0-9]+]]:_(s32) = COPY $a0
		; MIPS32: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1
		; MIPS32: [[COPY2:%[0-9]+]]:_(s32) = COPY $a2
		; MIPS32: [[COPY3:%[0-9]+]]:_(s32) = COPY $a3
		; MIPS32: [[FRAME_INDEX:%[0-9]+]]:_(p0) = G_FRAME_INDEX %fixed-stack.0
		; MIPS32: [[LOAD:%[0-9]+]]:_(s32) = G_LOAD [[FRAME_INDEX]](p0) :: (load 4 from %fixed-stack.0, align 0)
		; MIPS32: [[FRAME_INDEX1:%[0-9]+]]:_(p0) = G_FRAME_INDEX %fixed-stack.1
		; MIPS32: [[LOAD1:%[0-9]+]]:_(s32) = G_LOAD [[FRAME_INDEX1]](p0) :: (load 4 from %fixed-stack.1, align 0)
		; MIPS32: [[FRAME_INDEX2:%[0-9]+]]:_(p0) = G_FRAME_INDEX %fixed-stack.2
		; MIPS32: [[LOAD2:%[0-9]+]]:_(s32) = G_LOAD [[FRAME_INDEX2]](p0) :: (load 4 from %fixed-stack.2, align 0)
		; MIPS32: [[FRAME_INDEX3:%[0-9]+]]:_(p0) = G_FRAME_INDEX %fixed-stack.3
		; MIPS32: [[LOAD3:%[0-9]+]]:_(s32) = G_LOAD [[FRAME_INDEX3]](p0) :: (load 4 from %fixed-stack.3, align 0)
		; MIPS32: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 0
		; MIPS32: [[ADD:%[0-9]+]]:_(s32) = G_ADD [[LOAD]], [[COPY]]
		; MIPS32: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 1
		; MIPS32: [[COPY4:%[0-9]+]]:_(s32) = COPY [[C]](s32)
		; MIPS32: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY4]], [[C1]]
		; MIPS32: [[ADD1:%[0-9]+]]:_(s32) = G_ADD [[ADD]], [[AND]]
		; MIPS32: [[ICMP:%[0-9]+]]:_(s32) = G_ICMP intpred(ult), [[ADD1]](s32), [[LOAD]]
		; MIPS32: [[ADD2:%[0-9]+]]:_(s32) = G_ADD [[LOAD1]], [[COPY1]]
		; MIPS32: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 1
		; MIPS32: [[COPY5:%[0-9]+]]:_(s32) = COPY [[ICMP]](s32)
		; MIPS32: [[AND1:%[0-9]+]]:_(s32) = G_AND [[COPY5]], [[C2]]
		; MIPS32: [[ADD3:%[0-9]+]]:_(s32) = G_ADD [[ADD2]], [[AND1]]
		; MIPS32: [[ICMP1:%[0-9]+]]:_(s32) = G_ICMP intpred(ult), [[ADD3]](s32), [[LOAD1]]
		; MIPS32: [[ADD4:%[0-9]+]]:_(s32) = G_ADD [[LOAD2]], [[COPY2]]
		; MIPS32: [[C3:%[0-9]+]]:_(s32) = G_CONSTANT i32 1
		; MIPS32: [[COPY6:%[0-9]+]]:_(s32) = COPY [[ICMP1]](s32)
		; MIPS32: [[AND2:%[0-9]+]]:_(s32) = G_AND [[COPY6]], [[C3]]
		; MIPS32: [[ADD5:%[0-9]+]]:_(s32) = G_ADD [[ADD4]], [[AND2]]
		; MIPS32: [[ICMP2:%[0-9]+]]:_(s32) = G_ICMP intpred(ult), [[ADD5]](s32), [[LOAD2]]
		; MIPS32: [[ADD6:%[0-9]+]]:_(s32) = G_ADD [[LOAD3]], [[COPY3]]
		; MIPS32: [[C4:%[0-9]+]]:_(s32) = G_CONSTANT i32 1
		; MIPS32: [[COPY7:%[0-9]+]]:_(s32) = COPY [[ICMP2]](s32)
		; MIPS32: [[AND3:%[0-9]+]]:_(s32) = G_AND [[COPY7]], [[C4]]
		; MIPS32: [[ADD7:%[0-9]+]]:_(s32) = G_ADD [[ADD6]], [[AND3]]
		; MIPS32: $v0 = COPY [[ADD1]](s32)
		; MIPS32: $v1 = COPY [[ADD3]](s32)
		; MIPS32: $a0 = COPY [[ADD5]](s32)
		; MIPS32: $a1 = COPY [[ADD7]](s32)
		; MIPS32: RetRA implicit $v0, implicit $v1, implicit $a0, implicit $a1
		%2:_(s32) = COPY $a0
		%3:_(s32) = COPY $a1
		%4:_(s32) = COPY $a2
		%5:_(s32) = COPY $a3
		%0:_(s128) = G_MERGE_VALUES %2(s32), %3(s32), %4(s32), %5(s32)
		%10:_(p0) = G_FRAME_INDEX %fixed-stack.3
		%6:_(s32) = G_LOAD %10(p0) :: (load 4 from %fixed-stack.3, align 0)
		%11:_(p0) = G_FRAME_INDEX %fixed-stack.2
		%7:_(s32) = G_LOAD %11(p0) :: (load 4 from %fixed-stack.2, align 0)
		%12:_(p0) = G_FRAME_INDEX %fixed-stack.1
		%8:_(s32) = G_LOAD %12(p0) :: (load 4 from %fixed-stack.1, align 0)
		%13:_(p0) = G_FRAME_INDEX %fixed-stack.0
		%9:_(s32) = G_LOAD %13(p0) :: (load 4 from %fixed-stack.0, align 0)
		%1:_(s128) = G_MERGE_VALUES %6(s32), %7(s32), %8(s32), %9(s32)
		%14:_(s128) = G_ADD %1, %0
		%15:_(s32), %16:_(s32), %17:_(s32), %18:_(s32) = G_UNMERGE_VALUES %14(s128)
		$v0 = COPY %15(s32)
		$v1 = COPY %16(s32)
		$a0 = COPY %17(s32)
		$a1 = COPY %18(s32)
		RetRA implicit $v0, implicit $v1, implicit $a0, implicit $a1

		...

test/CodeGen/Mips/GlobalISel/llvm-ir/add.ll

	Show First 20 Lines • Show All 84 Lines • ▼ Show 20 Lines
	entry:			entry:
	%add = add i16 %b, %a			%add = add i16 %b, %a
	ret i16 %add			ret i16 %add
	}			}

	define i64 @add_i64(i64 %a, i64 %b) {			define i64 @add_i64(i64 %a, i64 %b) {
	; MIPS32-LABEL: add_i64:			; MIPS32-LABEL: add_i64:
	; MIPS32: # %bb.0: # %entry			; MIPS32: # %bb.0: # %entry
	; MIPS32-NEXT: addu $5, $7, $5			; MIPS32-NEXT: lui $1, 0
				; MIPS32-NEXT: ori $1, $1, 0
	; MIPS32-NEXT: addu $4, $6, $4			; MIPS32-NEXT: addu $4, $6, $4
	; MIPS32-NEXT: sltu $6, $4, $6			; MIPS32-NEXT: lui $2, 0
	; MIPS32-NEXT: addu $3, $5, $6			; MIPS32-NEXT: ori $2, $2, 1
	; MIPS32-NEXT: move $2, $4			; MIPS32-NEXT: and $1, $1, $2
				; MIPS32-NEXT: addu $1, $4, $1
				; MIPS32-NEXT: sltu $2, $1, $6
				; MIPS32-NEXT: addu $4, $7, $5
				; MIPS32-NEXT: lui $5, 0
				; MIPS32-NEXT: ori $5, $5, 1
				; MIPS32-NEXT: and $2, $2, $5
				; MIPS32-NEXT: addu $3, $4, $2
				; MIPS32-NEXT: move $2, $1
	; MIPS32-NEXT: jr $ra			; MIPS32-NEXT: jr $ra
	; MIPS32-NEXT: nop			; MIPS32-NEXT: nop
	entry:			entry:
	%add = add i64 %b, %a			%add = add i64 %b, %a
	ret i64 %add			ret i64 %add
	}			}
	No newline at end of file
				define i128 @add_i128(i128 %a, i128 %b) {
				; MIPS32-LABEL: add_i128:
				; MIPS32: # %bb.0: # %entry
				; MIPS32-NEXT: addiu $sp, $sp, -8
				; MIPS32-NEXT: .cfi_def_cfa_offset 8
				; MIPS32-NEXT: addiu $1, $sp, 24
				; MIPS32-NEXT: lw $1, 0($1)
				; MIPS32-NEXT: addiu $2, $sp, 28
				; MIPS32-NEXT: lw $2, 0($2)
				; MIPS32-NEXT: addiu $3, $sp, 32
				; MIPS32-NEXT: lw $3, 0($3)
				; MIPS32-NEXT: addiu $8, $sp, 36
				; MIPS32-NEXT: lw $8, 0($8)
				; MIPS32-NEXT: lui $9, 0
				; MIPS32-NEXT: ori $9, $9, 0
				; MIPS32-NEXT: addu $4, $1, $4
				; MIPS32-NEXT: lui $10, 0
				; MIPS32-NEXT: ori $10, $10, 1
				; MIPS32-NEXT: and $9, $9, $10
				; MIPS32-NEXT: addu $4, $4, $9
				; MIPS32-NEXT: sltu $1, $4, $1
				; MIPS32-NEXT: addu $5, $2, $5
				; MIPS32-NEXT: lui $9, 0
				; MIPS32-NEXT: ori $9, $9, 1
				; MIPS32-NEXT: and $1, $1, $9
				; MIPS32-NEXT: addu $1, $5, $1
				; MIPS32-NEXT: sltu $2, $1, $2
				; MIPS32-NEXT: addu $5, $3, $6
				; MIPS32-NEXT: lui $6, 0
				; MIPS32-NEXT: ori $6, $6, 1
				; MIPS32-NEXT: and $2, $2, $6
				; MIPS32-NEXT: addu $2, $5, $2
				; MIPS32-NEXT: sltu $3, $2, $3
				; MIPS32-NEXT: addu $5, $8, $7
				; MIPS32-NEXT: lui $6, 0
				; MIPS32-NEXT: ori $6, $6, 1
				; MIPS32-NEXT: and $3, $3, $6
				; MIPS32-NEXT: addu $5, $5, $3
				; MIPS32-NEXT: sw $2, 4($sp) # 4-byte Folded Spill
				; MIPS32-NEXT: move $2, $4
				; MIPS32-NEXT: move $3, $1
				; MIPS32-NEXT: lw $4, 4($sp) # 4-byte Folded Reload
				; MIPS32-NEXT: addiu $sp, $sp, 8
				; MIPS32-NEXT: jr $ra
				; MIPS32-NEXT: nop
				entry:
				%add = add i128 %b, %a
				ret i128 %add
				}

This is an archive of the discontinued LLVM Phabricator instance.

[MIPS GlobalISel] Lower G_UADDE and narrowScalar G_ADDClosedPublic

Details

Diff Detail