This is an archive of the discontinued LLVM Phabricator instance.

Paths

llvm/lib/Target/RISCV/CMakeLists.txt
llvm/lib/Target/RISCV/RISCV.h
llvm/lib/Target/RISCV/RISCVAsmPrinter.cpp
llvm/lib/Target/RISCV/RISCVInstrInfo.td
llvm/lib/Target/RISCV/RISCVRVVInitUndef.cpp
llvm/lib/Target/RISCV/RISCVTargetMachine.cpp
llvm/test/CodeGen/RISCV/O3-pipeline.ll
llvm/test/CodeGen/RISCV/regalloc-last-chance-recoloring-failure.ll
llvm/test/CodeGen/RISCV/rvv/subregister-undef-early-clobber.mir
llvm/test/CodeGen/RISCV/rvv/undef-earlyclobber-chain.ll
llvm/test/CodeGen/RISCV/rvv/vrgatherei16-subreg-liveness.ll

Differential D129735

[RISCV] Add new pass to transform undef to pseudo for vector values.
ClosedPublic

Authored by BeMg on Jul 14 2022, 12:38 AM.

Details

Reviewers

craig.topper
rogfer01
frasercrmck
reames
kito-cheng
arsenm

Commits

rG3b8c0b342e16: [RISCV] Add new pass to transform undef to pseudo for vector values.
rGf1c4241fb6e5: [RISCV] Add new pass to transform undef to pseudo for vector values.

Summary

RISC-V vector instructions have a register-overlap constraint for certain
instructions, and violating it causes an illegal-instruction trap. We use
early-clobber to model this constraint, but it cannot prevent the register
allocator from assigning the same or an overlapping register when the input
register is an undef value. Converting IMPLICIT_DEF to a temporary pseudo
prevents that. This is not the best way to resolve the problem. Ideally we
should model the constraint correctly, but until we do, this approach prevents
the overlap.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

There are a very large number of changes, so older changes are hidden.

In D129735#3873448, @craig.topper wrote:

In D129735#3873436, @BeMg wrote:

In D129735#3871632, @craig.topper wrote:

Does this patch work for this test case

define internal void @foo() {
loopIR.preheader.i.i:
  %v15 = tail call <vscale x 1 x i16> @llvm.experimental.stepvector.nxv1i16()
  %v17 = tail call <vscale x 8 x i16> @llvm.vector.insert.nxv8i16.nxv1i16(<vscale x 8 x i16> poison, <vscale x 1 x i16> %v15, i64 0)
  %vs12.i.i.i = add <vscale x 1 x i16> %v15, shufflevector (<vscale x 1 x i16> insertelement (<vscale x 1 x i16> poison, i16 1, i32 0), <vscale x 1 x i16> poison, <vscale x 1 x i32> zeroinitializer)
  %v18 = tail call <vscale x 8 x i16> @llvm.vector.insert.nxv8i16.nxv1i16(<vscale x 8 x i16> poison, <vscale x 1 x i16> %vs12.i.i.i, i64 0)
  %vs16.i.i.i = add <vscale x 1 x i16> %v15, shufflevector (<vscale x 1 x i16> insertelement (<vscale x 1 x i16> poison, i16 3, i32 0), <vscale x 1 x i16> poison, <vscale x 1 x i32> zeroinitializer)
  %v20 = tail call <vscale x 8 x i16> @llvm.vector.insert.nxv8i16.nxv1i16(<vscale x 8 x i16> poison, <vscale x 1 x i16> %vs16.i.i.i, i64 0)
  br label %loopIR3.i.i

loopIR3.i.i:                                      ; preds = %loopIR3.i.i, %loopIR.preheader.i.i
  %v37 = load <vscale x 8 x i8>, ptr addrspace(1) null, align 8
  %v38 = tail call <vscale x 8 x i8> @llvm.riscv.vrgatherei16.vv.nxv8i8.i64(<vscale x 8 x i8> undef, <vscale x 8 x i8> %v37, <vscale x 8 x i16> %v17, i64 4)
  %v40 = tail call <vscale x 8 x i8> @llvm.riscv.vrgatherei16.vv.nxv8i8.i64(<vscale x 8 x i8> undef, <vscale x 8 x i8> %v37, <vscale x 8 x i16> %v18, i64 4)
  %v42 = and <vscale x 8 x i8> %v38, %v40
  %v46 = tail call <vscale x 8 x i8> @llvm.riscv.vrgatherei16.vv.nxv8i8.i64(<vscale x 8 x i8> undef, <vscale x 8 x i8> %v37, <vscale x 8 x i16> %v20, i64 4)
  %v60 = and <vscale x 8 x i8> %v42, %v46
  store <vscale x 8 x i8> %v60, ptr addrspace(1) null, align 4
  br label %loopIR3.i.i
}

declare <vscale x 1 x i16> @llvm.experimental.stepvector.nxv1i16()

declare <vscale x 8 x i16> @llvm.vector.insert.nxv8i16.nxv1i16(<vscale x 8 x i16>, <vscale x 1 x i16>, i64 immarg) 

declare <vscale x 8 x i8> @llvm.riscv.vrgatherei16.vv.nxv8i8.i64(<vscale x 8 x i8>, <vscale x 8 x i8>, <vscale x 8 x i16>, i64)

This IR doesn't generate the undef+early-clobber situation, so this pass has no effect on it.

The RA result does not break the early-clobber constraint with the current compiler.

The corresponding MachineInstrs before RA are shown below:

********** MACHINEINSTRS **********
# Machine code for function foo: NoPHIs, TracksLiveness, TiedOpsRewritten, TracksDebugUserValues

0B      bb.0.loopIR.preheader.i.i:
          successors: %bb.1(0x80000000); %bb.1(100.00%)

16B       dead %16:gpr = PseudoVSETVLIX0 $x0, 206, implicit-def $vl, implicit-def $vtype
32B       undef %0.sub_vrm1_0:vrm2 = PseudoVID_V_MF4 -1, 4, implicit $vl, implicit $vtype
64B       undef %1.sub_vrm1_0:vrm2 = PseudoVADD_VI_MF4 %0.sub_vrm1_0:vrm2, 1, -1, 4, implicit $vl, implicit $vtype
96B       undef %2.sub_vrm1_0:vrm2 = PseudoVADD_VI_MF4 %0.sub_vrm1_0:vrm2, 3, -1, 4, implicit $vl, implicit $vtype

128B    bb.1.loopIR3.i.i:
        ; predecessors: %bb.0, %bb.1
          successors: %bb.1(0x80000000); %bb.1(100.00%)

160B      %10:vr = VL1RE8_V $x0 :: (load unknown-size from `ptr addrspace(1) null`, align 8, addrspace 1)
176B      dead $x0 = PseudoVSETIVLI 4, 192, implicit-def $vl, implicit-def $vtype
192B      early-clobber %11:vr = PseudoVRGATHEREI16_VV_M1_M2 %10:vr, %0:vrm2, 4, 3, implicit $vl, implicit $vtype
208B      early-clobber %12:vr = PseudoVRGATHEREI16_VV_M1_M2 %10:vr, %1:vrm2, 4, 3, implicit $vl, implicit $vtype
224B      dead %17:gpr = PseudoVSETVLIX0 $x0, 192, implicit-def $vl, implicit-def $vtype
240B      %13:vr = PseudoVAND_VV_M1 %11:vr, %12:vr, -1, 3, implicit $vl, implicit $vtype
256B      dead $x0 = PseudoVSETIVLI 4, 192, implicit-def $vl, implicit-def $vtype
272B      early-clobber %14:vr = PseudoVRGATHEREI16_VV_M1_M2 %10:vr, %2:vrm2, 4, 3, implicit $vl, implicit $vtype
288B      dead %18:gpr = PseudoVSETVLIX0 $x0, 192, implicit-def $vl, implicit-def $vtype
304B      %15:vr = PseudoVAND_VV_M1 %13:vr, %14:vr, -1, 3, implicit $vl, implicit $vtype
320B      VS1R_V %15:vr, $x0 :: (store unknown-size into `ptr addrspace(1) null`, align 4, addrspace 1)
336B      PseudoBR %bb.1

Generated assembly; see the note inline.

foo:                                    # @foo
	.cfi_startproc
# %bb.0:                                # %loopIR.preheader.i.i
	vsetvli	a0, zero, e16, mf4, ta, ma
	vid.v	v8
	vadd.vi	v10, v8, 1
	vadd.vi	v12, v8, 3
.LBB0_1:                                # %loopIR3.i.i
                                        # =>This Inner Loop Header: Depth=1
	vl1r.v	v14, (zero)
	vsetivli	zero, 4, e8, m1, ta, ma
	vrgatherei16.vv	v15, v14, v8  <- The v14 here is LMUL=2 so it's v14 and v15. This means writing v15 violated the early clobber constraint.
	vrgatherei16.vv	v16, v14, v10
	vsetvli	a0, zero, e8, m1, ta, ma
	vand.vv	v15, v15, v16
	vsetivli	zero, 4, e8, m1, ta, ma
	vrgatherei16.vv	v16, v14, v12
	vsetvli	a0, zero, e8, m1, ta, ma
	vand.vv	v14, v15, v16
	vs1r.v	v14, (zero)
	j	.LBB0_1
.Lfunc_end0:
	.size	foo, .Lfunc_end0-foo
	.cfi_endproc
                                        # -- End function
	.section	".note.GNU-stack","",@progbits

Is this MIR wrong here? Should %10 be marked as vrm2?
The compiler treats %11 and %10 as M1 and %0 as M2, so I think RA doesn't violate the early-clobber constraint.

192B      early-clobber %11:vr = PseudoVRGATHEREI16_VV_M1_M2 %10:vr, %0:vrm2, 4, 3, implicit $vl, implicit $vtype
vrgatherei16.vv	v15, v14, v8 
// %11 -> v15
// %10 -> v14
// %0 -> v8

Compiler Register allocation result:

********** REWRITE VIRTUAL REGISTERS **********
********** Function: foo
********** REGISTER MAP **********
[%0 -> $v8m2] VRM2
[%1 -> $v10m2] VRM2
[%2 -> $v12m2] VRM2
[%10 -> $v14] VR
[%11 -> $v15] VR
[%12 -> $v16] VR
[%13 -> $v15] VR
[%14 -> $v16] VR
[%15 -> $v14] VR
[%16 -> $x10] GPR
[%17 -> $x10] GPR
[%18 -> $x10] GPR

Update the approach for finding the VR super class.

Harbormaster completed remote builds in B193451: Diff 469502. Oct 21 2022, 2:27 AM

In D129735#3873595, @BeMg wrote:

Is this MIR wrong here? Should %10 be marked as vrm2?
The compiler treats %11 and %10 as M1 and %0 as M2, so I think RA doesn't violate the early-clobber constraint.

You're right. I mixed up the operand order. I need to go look at this test again. It used to fail. Maybe we fixed it some other way.

I remember now. It only miscompiles with -riscv-enable-subreg-liveness

That produces

foo:                                    # @foo
        .cfi_startproc
# %bb.0:                                # %loopIR.preheader.i.i
        vsetvli a0, zero, e16, mf4, ta, ma
        vid.v   v8
        vadd.vi v10, v8, 1
        vadd.vi v12, v8, 3
.LBB0_1:                                # %loopIR3.i.i
                                        # =>This Inner Loop Header: Depth=1
        vl1r.v  v9, (zero)
        vsetivli        zero, 4, e8, m1, ta, ma
        vrgatherei16.vv v11, v9, v8
        vrgatherei16.vv v13, v9, v10
        vsetvli a0, zero, e8, m1, ta, ma
        vand.vv v11, v11, v13
        vsetivli        zero, 4, e8, m1, ta, ma
        vrgatherei16.vv v13, v9, v12 <- this instruction violates the early clobber constraint
        vsetvli a0, zero, e8, m1, ta, ma
        vand.vv v9, v11, v13
        vs1r.v  v9, (zero)
        j       .LBB0_1
.Lfunc_end0:
        .size   foo, .Lfunc_end0-foo
        .cfi_endproc
                                        # -- End function                        
        .section        ".note.GNU-stack","",@progbits

Replace INSERT_SUBREG with PseudoInit when subreg liveness is enabled.

In D129735#3875529, @craig.topper wrote:

I remember now. It only miscompiles with -riscv-enable-subreg-liveness

Add one more condition for subreg-liveness.

The assembly is shown below:

        .p2align        2                               # -- Begin function foo
        .type   foo,@function
foo:                                    # @foo
        .cfi_startproc
# %bb.0:                                # %loopIR.preheader.i.i
        vsetvli a0, zero, e16, mf4, ta, ma
        vid.v   v14
        vadd.vi v15, v14, 1
        vadd.vi v16, v14, 3
        vmv1r.v v8, v14
        vmv1r.v v10, v15
        vmv1r.v v12, v16
.LBB0_1:                                # %loopIR3.i.i
                                        # =>This Inner Loop Header: Depth=1
        vl1r.v  v14, (zero)
        vsetivli        zero, 4, e8, m1, ta, ma
        vrgatherei16.vv v15, v14, v8
        vrgatherei16.vv v16, v14, v10
        vsetvli a0, zero, e8, m1, ta, ma
        vand.vv v15, v15, v16
        vsetivli        zero, 4, e8, m1, ta, ma
        vrgatherei16.vv v16, v14, v12
        vsetvli a0, zero, e8, m1, ta, ma
        vand.vv v14, v15, v16
        vs1r.v  v14, (zero)
        j       .LBB0_1
.Lfunc_end0:
        .size   foo, .Lfunc_end0-foo
        .cfi_endproc
                                        # -- End function
        .section        ".note.GNU-stack","",@progbits

Harbormaster completed remote builds in B194144: Diff 470444. Oct 25 2022, 6:02 AM

Handle subregister liveness situation

Harbormaster completed remote builds in B196822: Diff 474153. Nov 8 2022, 11:06 PM

Avoid infinite loop by using DenseMap

Harbormaster completed remote builds in B196832: Diff 474165. Nov 9 2022, 12:20 AM

BeMg retitled this revision from [WIP][RISCV] Add new pass to transform undef to pesudo for vector values. to [RISCV] Add new pass to transform undef to pesudo for vector values.. Nov 9 2022, 5:13 PM

kito-cheng added inline comments. Nov 9 2022, 5:26 PM

llvm/test/CodeGen/RISCV/rvv/undef-earlyclobber-chain.ll
1	Could you add a pre-commit patch for this testcase so that it is easier to demonstrate what gets fixed?

kito-cheng added inline comments. Nov 9 2022, 5:39 PM

llvm/lib/Target/RISCV/RISCVRVVInitUndef.cpp
51	`SmallPtrSet` for `Seen`

BeMg added a child revision: D137763: [RISCV] precommit test for D129735. Nov 9 2022, 9:06 PM

Change DenseMap into SmallPtrSet

BeMg marked an inline comment as done. Nov 9 2022, 9:27 PM

BeMg added inline comments.

llvm/test/CodeGen/RISCV/rvv/undef-earlyclobber-chain.ll
1	New patch https://reviews.llvm.org/D137763 is for the test only and sets up the dependency. Should I remove the testcase in this patch?

Update testcase

Harbormaster completed remote builds in B197021: Diff 474443. Nov 9 2022, 10:26 PM

BeMg removed a child revision: D137763: [RISCV] precommit test for D129735. Nov 15 2022, 9:09 PM

BeMg added a parent revision: D137763: [RISCV] precommit test for D129735.

Record PHI node subregister change
Insert new INSERT_SUBREG before early-clobber instruction
Subreg index computation includes PHI node

Handle Sub-register undef+early-clobber

For sub-registers, there is the same issue. The register allocator can also generate a program that breaks the early-clobber constraint. This happens when part of a register used by an instruction with the early-clobber flag is undef. For example:

early-clobber %12:vr = PseudoVRGATHEREI16_VV_M1_M2 %10:vr, %1:vrm2, 4, 3, implicit $vl, implicit $vtype

vrgatherei16.vv v13, v9, v12

%1 is a VRM2 register assigned to v12, so it occupies v12~v13. The register allocator still allocates v13 for %12:vr because v13 is undef for %1:vrm2 at the register allocation stage. This is an example of how an undef sub-register breaks the early-clobber constraint in the register allocation stage.
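
The overlap in this example can be checked mechanically. The following is a minimal sketch and not code from the patch: `RegGroup` and `overlaps` are hypothetical names, and a register group is modelled simply as a base vector register number plus the number of consecutive registers it spans (1 for VR, 2 for VRM2).

```cpp
#include <cassert>

// Hypothetical model of an RVV register group: `base` is the first vector
// register number and `count` is how many consecutive registers it spans
// (1 for a VR operand, 2 for a VRM2 operand, and so on).
struct RegGroup {
  unsigned base;
  unsigned count;
};

// Two groups overlap when their half-open ranges [base, base + count)
// intersect.
inline bool overlaps(RegGroup a, RegGroup b) {
  assert(a.count > 0 && b.count > 0 && "a group spans at least one register");
  return a.base < b.base + b.count && b.base < a.base + a.count;
}
```

With this model, the destination v13 against the VRM2 operand starting at v12 gives `overlaps({13, 1}, {12, 2}) == true`, which is the violation described above. The destination v15 against the VRM2 operand starting at v8, as in the allocation without subreg liveness, does not overlap.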

Here we propose an approach to fix this problem. The concept is the same as for the normal undef register situation. We define the sub-register with a pseudo instruction and remove it in a later pass (after RA).

There are three steps for this approach:

Select the def-use chain from IMPLICIT_DEF to the first user with an early-clobber constraint
Compute the undefined sub-register index by collecting information from INSERT_SUBREG and PHI nodes
Insert the PseudoInit and INSERT_SUBREG for the undefined sub-register after the last INSERT_SUBREG that updates the sub-register

Here we show an example with the pattern that triggers the undef+early-clobber issue.

Step 1

There are three def-use chains we need to care about in this program.

The pattern will look like

v0 = Implicit_def
…
INSERT_SUBREG | COPY | PHI
…
early-clobber rd = Op vN

Step 2

The third operand of an INSERT_SUBREG node is the sub-register index. It shows which sub-register of the whole register this node defines. We can use this information to work out which sub-registers are undefined.

We use LaneBitmask for this purpose.

LaneBitmask == 0xC for whole VRM2 register
LaneBitmask == 0x4 for %subreg.sub_vrm1_0 
LaneBitmask == 0x8 for %subreg.sub_vrm1_1

If we get the following def-use chain in Step 1

%4:vrm2 = Implicit_def
%0:vrm2 = INSERT_SUBREG %4, %subreg.sub_vrm1_0
early-clobber %11:vr = Op %0

0xC is VRM2’s LaneBitmask and 0x4 is already defined by INSERT_SUBREG in the program.

0xC & ~0x4 = 0x8 -> subreg.sub_vrm1_1

In this case, subreg.sub_vrm1_1 is the sub-register that is still undefined before it is used by the early-clobber instruction.
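
The mask arithmetic in this step can be written out as a small stand-alone sketch. It assumes lane masks are modelled as plain integers (LLVM itself uses the LaneBitmask class), and `undefinedLanes` and the constant names are hypothetical; only the values 0xC, 0x4 and 0x8 come from the description above.

```cpp
#include <cassert>
#include <cstdint>

// Lane masks modelled as plain integers for illustration only.
// The constants are the values quoted in the description.
constexpr std::uint64_t WholeVRM2 = 0xC; // whole VRM2 register
constexpr std::uint64_t SubVRM1_0 = 0x4; // %subreg.sub_vrm1_0
constexpr std::uint64_t SubVRM1_1 = 0x8; // %subreg.sub_vrm1_1

// Lanes of `whole` that no INSERT_SUBREG in the chain has defined yet.
constexpr std::uint64_t undefinedLanes(std::uint64_t whole,
                                       std::uint64_t defined) {
  return whole & ~defined;
}

// The worked example: only sub_vrm1_0 is defined, so sub_vrm1_1 remains.
static_assert(undefinedLanes(WholeVRM2, SubVRM1_0) == SubVRM1_1,
              "only sub_vrm1_1 is still undefined");
```

If every lane has been defined, the result is 0 and no extra initialization is needed for that chain.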

Step 3

We can define a sub-register with INSERT_SUBREG between the last INSERT_SUBREG and the user with early-clobber. Our goal is to make sure all sub-registers are defined before they are used by the early-clobber instruction.

Before:

%4:vrm2 = Implicit_def
%0:vrm2 = INSERT_SUBREG %4, %subreg.sub_vrm1_0
early-clobber %11:vr = Op %0

After:

%4:vrm2 = Implicit_def
%0:vrm2 = INSERT_SUBREG %4, %subreg.sub_vrm1_0
%21:vr = PseudoRVVInitUndefM1
%22:vrm2 = INSERT_SUBREG %0:vrm2, %21:vr, %subreg.sub_vrm1_1
early-clobber %11:vr = Op %22

PHI in def-use chain

In Step 2, a PHI is treated as another instruction that changes which sub-registers are defined. The PHINodeLaneBitRecord records the LaneBitmask from both predecessors, and the INSERT_SUBREG is inserted using this information.

Harbormaster completed remote builds in B198739: Diff 476831. Nov 21 2022, 2:22 PM

Update testcase

Harbormaster completed remote builds in B199916: Diff 478437. Nov 28 2022, 8:22 PM

pseudo is misspelled in the title

craig.topper added inline comments. Nov 29 2022, 4:41 PM

llvm/lib/Target/RISCV/RISCVRVVInitUndef.cpp
2	pesudo -> pseudo
11	"pesudo" -> "pseudo" in two places
19	pesudo -> pseudo
19	latter -> later
246	Pesudo -> Pseudo
247	regitser -> register
289	Candidata -> Candidate?
llvm/test/CodeGen/RISCV/regalloc-last-chance-recoloring-failure.ll
29	Are we treating insert_subreg for segment load tuples the same as inserting a small LMUL into a wider LMUL?

craig.topper added inline comments. Nov 29 2022, 4:44 PM

llvm/test/CodeGen/RISCV/rvv/subregister-undef-early-clobber-vrm4.mir
118 ↗	(On Diff #478437)	Why is there a PseudoRVVInitUndef pseudo in the IR before the pass runs?

craig.topper added inline comments. Nov 29 2022, 4:54 PM

llvm/lib/Target/RISCV/RISCVRVVInitUndef.cpp
74	You can pass Register by value
106	origin -> original
111	Can't we do something like VRRegClass.hasSubClassEq(MRI->getRegClass(R)) || VRM2RegClass.hasSubClassEq(MRI->getRegClass(R)) || VRM4RegClass.hasSubClassEq(MRI->getRegClass(R)) || VRM8RegClass.hasSubClassEq(MRI->getRegClass(R))? Why do we need to use getVRLargestSuperClass?

BeMg retitled this revision from [RISCV] Add new pass to transform undef to pesudo for vector values. to [RISCV] Add new pass to transform undef to pseudo for vector values.. Nov 30 2022, 12:40 AM

Fix spelling
Update isVectorRegClass

Harbormaster completed remote builds in B200224: Diff 478877. Nov 30 2022, 5:00 AM

Replace PseudoRVVInitUndef with VLE in MIR test
Move MIR test into precommit

Harbormaster completed remote builds in B200428: Diff 479164. Nov 30 2022, 11:38 PM

BeMg added inline comments. Dec 13 2022, 6:18 AM

llvm/test/CodeGen/RISCV/regalloc-last-chance-recoloring-failure.ll
29	This pass doesn't consider segment load as an instruction that assigns a sub-register. The following INSERT_SUBREG works like putting %5:vrm2 into %6:vrm4:
%1:vrm4 = IMPLICIT_DEF
%5:vrm2 = PseudoVLE32_V_M2 killed %4, 0, 5 /* e32 */
%6:vrm4 = INSERT_SUBREG %1, %5, %subreg.sub_vrm2_0
Should we treat vloxseg2ei32 as INSERT_SUBREG in this patch?

Ping

craig.topper added inline comments. Dec 13 2022, 2:40 PM

llvm/lib/Target/RISCV/RISCVRVVInitUndef.cpp
172	Why passing `Insts` by value? That will make a copy but it doesn't look like we need a copy.
182	Why passing `Insts` by value? That will make a copy but it doesn't look like we need a copy.
183	unsigned -> Register
197	unsigned -> Register
197	Why passing `Insts` by value? That will make a copy but it doesn't look like we need a copy.
200	unsigned -> Register
299	unsigned -> Register
300	unsigned -> Register
313	createVirtualRegister returns `Register` not `unsigned`
317	createVirtualRegister returns `Register` not `unsigned`
llvm/test/CodeGen/RISCV/regalloc-last-chance-recoloring-failure.ll
29	Nevermind, I didn't realize this test had so many `undef` and `poison` operands. I suspect llvm-reduce or bugpoint. I dislike tests with undef/poison operands. It makes things very fragile. It would be legal for DAG combine to delete a large portion of this test.

craig.topper added inline comments. Dec 13 2022, 2:52 PM

llvm/lib/Target/RISCV/RISCVRVVInitUndef.cpp
279	"the " is unnecessary in this sentence

craig.topper added inline comments. Dec 13 2022, 2:54 PM

llvm/lib/Target/RISCV/RISCVRVVInitUndef.cpp
28	"is be occupied" -> "is occupied"

Is it possible to insert the PseudoRVVInitUndef instructions after we've left SSA and the LiveIntervals have been built? Would that make it easier to find the undef lanes?

unsigned -> Register
use const std::vector<MachineInstr *> & instead of call by value
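
The cost behind the by-value comments can be seen in a small stand-alone sketch. It is not code from the patch: `Tracked`, `sizeByValue`, and `sizeByConstRef` are hypothetical names, and `Tracked` stands in for the vector's element type only so that copies can be counted.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Stand-in element type that counts how often it is copy-constructed.
struct Tracked {
  static int Copies;
  Tracked() = default;
  Tracked(const Tracked &) { ++Copies; }
};
int Tracked::Copies = 0;

// Taking the vector by value copies every element on each call with an
// lvalue argument.
std::size_t sizeByValue(std::vector<Tracked> Insts) { return Insts.size(); }

// Taking it by const reference only reads the caller's vector.
std::size_t sizeByConstRef(const std::vector<Tracked> &Insts) {
  return Insts.size();
}
```

Calling `sizeByValue` on a three-element vector increments `Tracked::Copies` by three, while `sizeByConstRef` leaves it unchanged. For a vector of pointers the per-element copy is cheap, but the by-value form still allocates a new buffer on every call.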

rebase

Harbormaster completed remote builds in B203007: Diff 482703. Dec 13 2022, 8:53 PM

Reuse DetectDeadLanes pass info for undefined subregister

Harbormaster completed remote builds in B203754: Diff 483741. Dec 17 2022, 4:25 AM

Move DetectDeadLanes pass change into another patch

BeMg added a parent revision: D140382: [CodeGen] Add user interface for DetectDeadLanes. Dec 20 2022, 2:09 AM

Harbormaster completed remote builds in B204108: Diff 484196. Dec 20 2022, 2:09 AM

craig.topper added inline comments. Dec 21 2022, 10:35 AM

llvm/lib/Target/RISCV/RISCVRVVInitUndef.cpp
202	Isn't this calculating for the entire function? But handleSubReg is called for individual instructions. So we'll be recomputing information right?

craig.topper added a reviewer: arsenm. Dec 21 2022, 10:37 AM

Herald added a subscriber: wdng. Dec 21 2022, 10:37 AM

This could certainly use some new MIR tests. I didn't look super closely but I'm not sure you're correctly handling undef vs. not-undef subreg defs

llvm/lib/Target/RISCV/RISCVRVVInitUndef.cpp
54	DenseMap
91–101	I don't understand the point of this function, getSuperClasses is already sorted by largest. You can just take the first?
106	Could this use the new getPhysRegBaseClass?
108	Don't call getRegClass for each use
142	Reg.isVirtual()
201	unique_ptr

Use unique_ptr for VRegInfo
Run DetectDeadLanes only once for each function
Remove Seen and PHINodeLaneBitRecord
Only call getRegClass once

Harbormaster completed remote builds in B204548: Diff 484791. Dec 22 2022, 3:37 AM

BeMg marked 4 inline comments as done. Dec 22 2022, 3:39 AM

BeMg added inline comments.

llvm/lib/Target/RISCV/RISCVRVVInitUndef.cpp
91–101	When this function takes VRNoV0RegClassID as input, getSuperClasses returns the classes in the following order:
AnyRegRegClassID 1
VMRegClassID 20
VRRegClassID 21 <- stop here
...
This patch only wants those four register classes as the result.
106	Can this hook be used for virtual registers? It seems to be only for physical registers, but this pass runs before register allocation.
202	Yes, you're right. We don't need to recompute this info for each instruction. It is called only once now.

Add more subregister testcases

Harbormaster completed remote builds in B205431: Diff 485949. Jan 3 2023, 4:04 AM

craig.topper added inline comments. Jan 3 2023, 9:31 AM

llvm/lib/Target/RISCV/RISCVRVVInitUndef.cpp
40	Are there any maps in the code anymore?
llvm/test/CodeGen/RISCV/O3-pipeline.ll
111	If I understand correctly, we're effectively running DetectDeadLanes inside of RISCV init undef pass and then running the real DetectDeadLanes pass which won't do anything because we already did it?

craig.topper added inline comments. Jan 3 2023, 10:35 AM

llvm/test/CodeGen/RISCV/O3-pipeline.ll
111	Can we run DetectDeadLanes, then run our pass and just use the portion of the DetectDeadLanes that computes the Lane Masks in our pass?

Move init undef pass after DetectDeadLanes
Remove <map>

craig.topper added inline comments. Jan 4 2023, 9:09 PM

llvm/lib/Target/RISCV/RISCVRVVInitUndef.cpp
282	Should this be `std::unique_ptr<VRegInfo[]>` since it points to an array?

Replace std::unique_ptr<VRegInfo> with std::unique_ptr<VRegInfo[]>
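
The reason for the array form can be shown with a short stand-alone sketch. It is an illustration under assumptions, not the patch's code: `VRegInfo` here is a hypothetical stand-in whose two field names follow the `Info.UsedLanes` / `Info.DefinedLanes` expression that appears elsewhere in this review, with lane masks modelled as plain integers, and `makeVRegInfos` is a made-up helper.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <memory>

// Hypothetical stand-in for the per-register record.
struct VRegInfo {
  std::uint64_t UsedLanes = 0;
  std::uint64_t DefinedLanes = 0;
};

// The array form std::unique_ptr<T[]> releases its storage with delete[]
// and provides operator[]. std::unique_ptr<T> would use plain delete, which
// is undefined behaviour for memory obtained from new T[n].
std::unique_ptr<VRegInfo[]> makeVRegInfos(std::size_t NumVirtRegs) {
  // std::make_unique<T[]>(n) (C++14) value-initializes all n elements.
  return std::make_unique<VRegInfo[]>(NumVirtRegs);
}
```

Elements are then indexed directly, for example `Infos[2].DefinedLanes`, and the whole array is freed when the owner goes out of scope.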

Harbormaster completed remote builds in B205835: Diff 486467. Jan 4 2023, 10:09 PM

rebase

Harbormaster completed remote builds in B205846: Diff 486478. Jan 5 2023, 12:39 AM

rebase

Harbormaster completed remote builds in B206492: Diff 487364. Jan 9 2023, 5:03 AM

rebase

Harbormaster completed remote builds in B206694: Diff 487663. Jan 9 2023, 9:43 PM

craig.topper mentioned this in D141993: [CodeGen] Split some functionality from DetectDeadLanes into its own class to be reused. NFCi. Jan 18 2023, 12:01 AM

luke957 removed a subscriber: luke957. Jan 21 2023, 7:51 AM

Herald added a subscriber: luke. Jan 21 2023, 7:51 AM

craig.topper mentioned this in rG23d576bb838e: [CodeGen] Split some functionality from DetectDeadLanes into its own class to…. Jan 25 2023, 1:31 PM

Use D141993 as DetectDeadLanes user interface

BeMg removed a parent revision: D140382: [CodeGen] Add user interface for DetectDeadLanes. Feb 1 2023, 10:03 PM

Harbormaster completed remote builds in B211385: Diff 494165. Feb 1 2023, 10:22 PM

craig.topper added inline comments. Feb 13 2023, 3:13 PM

llvm/lib/Target/RISCV/RISCVExpandPseudoInsts.cpp
149 ↗	(On Diff #494165)	I kind of think we should keep these instructions all the way to `RISCVAsmPrinter::emitInstruction`. Setting the operands to "undef" says it is ok to change the operands after this pass runs, but it's not. RISCVExpandPseudo runs late enough there is probably no pass that will change them. Keeping the pseudo all the way to `RISCVAsmPrinter::emitInstruction` removes any possibility.
llvm/lib/Target/RISCV/RISCVRVVInitUndef.cpp
88	Can this be
if (RISCV::VRM8RegClass.hasSubClassEq(RC)) return &RISCV::VRM8RegClass;
if (RISCV::VRM4RegClass.hasSubClassEq(RC)) return &RISCV::VRM4RegClass;
if (RISCV::VRM2RegClass.hasSubClassEq(RC)) return &RISCV::VRM2RegClass;
if (RISCV::VRRegClass.hasSubClassEq(RC)) return &RISCV::VRRegClass;
return RC;
117	`return RISCV::PseudoRVVInitUndefM1;`
152	Do we need to scan all operands? Can we check if MO is a tied use and find the operand it is tied to check the early clobber?
188	Use `defs()`?
188	Replace this with llvm::any_of?
201	Can we scan `uses()` instead of `operands()`?
206	I think we wouldn't need this if we only checked `uses`?
229	Can we put `Info.UsedLanes & ~Info.DefinedLanes` into a variable? We use that expression twice
232	`Lastest` isn't a word

Skip undef-init pseudo in RISCVAsmPrinter instead of removing it in the RISCVExpandPseudo pass
Update operands with defs() or uses()
Fix some typos
Use a variable to represent Info.UsedLanes & ~Info.DefinedLanes

Harbormaster completed remote builds in B213633: Diff 497281. Feb 14 2023, 5:10 AM

Remove removeTempRVVInitUndef declaration

Harbormaster completed remote builds in B213636: Diff 497285. Feb 14 2023, 5:27 AM

BeMg marked 8 inline comments as done. Feb 14 2023, 6:15 AM

BeMg added inline comments.

llvm/lib/Target/RISCV/RISCVRVVInitUndef.cpp

152

0B      bb.0.entry:
16B       LIFETIME_START %stack.0.dst
32B       %1:vr = IMPLICIT_DEF
48B       early-clobber %0:vr = PseudoVRGATHER_VI_M1 killed %1:vr, 0, 0, 5
64B       %2:gpr = ADDI %stack.0.dst, 0
80B       PseudoVSE32_V_M1 killed %0:vr, killed %2:gpr, 0, 5
96B       LIFETIME_END %stack.0.dst
112B      %3:gpr = COPY $x0
128B      $x10 = COPY %3:gpr
144B      PseudoRET implicit $x10

Here we may need to scan both the def and the use operands, because an early-clobber operand does not always have a tied-to operand.

BeMg added inline comments.Feb 14 2023, 6:25 AM

llvm/lib/Target/RISCV/RISCVExpandPseudoInsts.cpp
149 ↗	(On Diff #494165)	Skip `PseudoRVVInitUndefM{1,2,4,8}` in `RISCVAsmPrinter::emitInstruction` and remove the undef-init related functions from `RISCVExpandPseudo`.

LGTM

llvm/lib/Target/RISCV/RISCVRVVInitUndef.cpp
152	Right. I wasn't thinking about that right.

This revision is now accepted and ready to land.Feb 14 2023, 2:32 PM

BeMg mentioned this in rGa17bfbad6387: [RISCV] precommit test for D129735.Feb 14 2023, 7:42 PM

This revision was landed with ongoing or failed builds.Feb 14 2023, 7:51 PM

Closed by commit rGf1c4241fb6e5: [RISCV] Add new pass to transform undef to pseudo for vector values. (authored by BeMg).

This revision was automatically updated to reflect the committed changes.

BeMg added a commit: rGf1c4241fb6e5: [RISCV] Add new pass to transform undef to pseudo for vector values..

It seems like this breaks under ASan: https://lab.llvm.org/buildbot/#/builders/5/builds/31529/steps/13/logs/stdio

FAIL: LLVM :: CodeGen/RISCV/regalloc-last-chance-recoloring-failure.ll (35389 of 72944)
******************** TEST 'LLVM :: CodeGen/RISCV/regalloc-last-chance-recoloring-failure.ll' FAILED ********************
Script:
--
: 'RUN: at line 2';   /b/sanitizer-x86_64-linux-fast/build/llvm_build_asan/bin/llc -mtriple=riscv64 -mattr=+f,+m,+zfh,+experimental-zvfh    -riscv-enable-subreg-liveness=false < /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/test/CodeGen/RISCV/regalloc-last-chance-recoloring-failure.ll | /b/sanitizer-x86_64-linux-fast/build/llvm_build_asan/bin/FileCheck /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/test/CodeGen/RISCV/regalloc-last-chance-recoloring-failure.ll
: 'RUN: at line 4';   /b/sanitizer-x86_64-linux-fast/build/llvm_build_asan/bin/llc -mtriple=riscv64 -mattr=+f,+m,+zfh,+experimental-zvfh < /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/test/CodeGen/RISCV/regalloc-last-chance-recoloring-failure.ll    -riscv-enable-subreg-liveness=true| /b/sanitizer-x86_64-linux-fast/build/llvm_build_asan/bin/FileCheck /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/test/CodeGen/RISCV/regalloc-last-chance-recoloring-failure.ll --check-prefix=SUBREGLIVENESS
--
Exit Code: 2
Command Output (stderr):
--
=================================================================
==924420==ERROR: AddressSanitizer: use-after-poison on address 0x62100002da58 at pc 0x564ad06d209e bp 0x7ffe116375f0 sp 0x7ffe116375e8
READ of size 8 at 0x62100002da58 thread T0
    #0 0x564ad06d209d in operands_begin /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/include/llvm/CodeGen/MachineInstr.h:635:42
    #1 0x564ad06d209d in defs /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/include/llvm/CodeGen/MachineInstr.h:679:23
    #2 0x564ad06d209d in isEarlyClobberMI /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/lib/Target/RISCV/RISCVRVVInitUndef.cpp:175:26
    #3 0x564ad06d209d in processBasicBlock /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/lib/Target/RISCV/RISCVRVVInitUndef.cpp:250:39
    #4 0x564ad06d209d in (anonymous namespace)::RISCVInitUndef::runOnMachineFunction(llvm::MachineFunction&) /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/lib/Target/RISCV/RISCVRVVInitUndef.cpp:270:16
    #5 0x564ad2aa92d2 in llvm::MachineFunctionPass::runOnFunction(llvm::Function&) /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/lib/CodeGen/MachineFunctionPass.cpp:91:13
    #6 0x564ad39ba4a3 in llvm::FPPassManager::runOnFunction(llvm::Function&) /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/lib/IR/LegacyPassManager.cpp:1430:27
    #7 0x564ad39d4140 in llvm::FPPassManager::runOnModule(llvm::Module&) /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/lib/IR/LegacyPassManager.cpp:1476:16
    #8 0x564ad39bc312 in runOnModule /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/lib/IR/LegacyPassManager.cpp:1545:27
    #9 0x564ad39bc312 in llvm::legacy::PassManagerImpl::run(llvm::Module&) /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/lib/IR/LegacyPassManager.cpp:535:44
    #10 0x564acd741a10 in compileModule /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/tools/llc/llc.cpp:733:8
    #11 0x564acd741a10 in main /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/tools/llc/llc.cpp:420:22
    #12 0x7f2985940d8f  (/lib/x86_64-linux-gnu/libc.so.6+0x29d8f) (BuildId: 69389d485a9793dbe873f0ea2c93e02efaa9aa3d)
    #13 0x7f2985940e3f in __libc_start_main (/lib/x86_64-linux-gnu/libc.so.6+0x29e3f) (BuildId: 69389d485a9793dbe873f0ea2c93e02efaa9aa3d)
    #14 0x564acd66dba4 in _start (/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan/bin/llc+0x7670ba4)
0x62100002da58 is located 2392 bytes inside of 4096-byte region [0x62100002d100,0x62100002e100)
allocated by thread T0 here:
    #0 0x564acd72db32 in operator new(unsigned long, std::align_val_t) /b/sanitizer-x86_64-linux-fast/build/llvm-project/compiler-rt/lib/asan/asan_new_delete.cpp:107:3
    #1 0x564acdc2908d in Allocate /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/include/llvm/Support/AllocatorBase.h:86:12
    #2 0x564acdc2908d in StartNewSlab /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/include/llvm/Support/Allocator.h:339:42
    #3 0x564acdc2908d in llvm::BumpPtrAllocatorImpl<llvm::MallocAllocator, 4096ul, 4096ul, 128ul>::Allocate(unsigned long, llvm::Align) /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/include/llvm/Support/Allocator.h:195:5
    #4 0x564ad2a81869 in Allocate /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/include/llvm/Support/Allocator.h:209:12
    #5 0x564ad2a81869 in operator new<llvm::MallocAllocator, 4096UL, 4096UL, 128UL> /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/include/llvm/Support/Allocator.h:443:20
    #6 0x564ad2a81869 in llvm::MachineFunction::init() /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/lib/CodeGen/MachineFunction.cpp:185:15
    #7 0x564ad2b43c17 in llvm::MachineModuleInfo::getOrCreateMachineFunction(llvm::Function&) /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/lib/CodeGen/MachineModuleInfo.cpp:108:14
    #8 0x564ad2aa8def in llvm::MachineFunctionPass::runOnFunction(llvm::Function&) /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/lib/CodeGen/MachineFunctionPass.cpp:46:29
    #9 0x564ad39ba4a3 in llvm::FPPassManager::runOnFunction(llvm::Function&) /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/lib/IR/LegacyPassManager.cpp:1430:27
    #10 0x564ad39d4140 in llvm::FPPassManager::runOnModule(llvm::Module&) /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/lib/IR/LegacyPassManager.cpp:1476:16
    #11 0x564ad39bc312 in runOnModule /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/lib/IR/LegacyPassManager.cpp:1545:27
    #12 0x564ad39bc312 in llvm::legacy::PassManagerImpl::run(llvm::Module&) /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/lib/IR/LegacyPassManager.cpp:535:44
    #13 0x564acd741a10 in compileModule /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/tools/llc/llc.cpp:733:8
    #14 0x564acd741a10 in main /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/tools/llc/llc.cpp:420:22
    #15 0x7f2985940d8f  (/lib/x86_64-linux-gnu/libc.so.6+0x29d8f) (BuildId: 69389d485a9793dbe873f0ea2c93e02efaa9aa3d)
SUMMARY: AddressSanitizer: use-after-poison /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/include/llvm/CodeGen/MachineInstr.h:635:42 in operands_begin
Shadow bytes around the buggy address:
  0x0c427fffdaf0: 00 f7 00 00 00 00 00 00 00 00 00 f7 00 00 00 00
  0x0c427fffdb00: 00 00 00 00 00 00 00 00 00 00 00 00 f7 00 00 00
  0x0c427fffdb10: 00 00 00 00 00 00 f7 00 00 00 00 00 00 00 00 00
  0x0c427fffdb20: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  0x0c427fffdb30: 00 00 00 00 00 00 00 f7 00 00 00 00 00 00 00 00
=>0x0c427fffdb40: 00 f7 00 00 00 00 f7 f7 f7 f7 f7[f7]f7 f7 f7 f7
  0x0c427fffdb50: f7 00 00 00 00 f7 00 00 00 00 00 00 00 00 00 f7
  0x0c427fffdb60: 00 00 00 00 00 00 00 00 f7 00 00 00 00 00 00 00
  0x0c427fffdb70: 00 00 f7 00 00 00 00 00 00 00 00 00 00 00 00 00
  0x0c427fffdb80: 00 00 00 f7 00 00 00 00 00 00 00 00 00 f7 00 00
  0x0c427fffdb90: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 f7 00
Shadow byte legend (one shadow byte represents 8 application bytes):
  Addressable:           00
  Partially addressable: 01 02 03 04 05 06 07 
  Heap left redzone:       fa
  Freed heap region:       fd
  Stack left redzone:      f1
  Stack mid redzone:       f2
  Stack right redzone:     f3
  Stack after return:      f5
  Stack use after scope:   f8
  Global redzone:          f9
  Global init order:       f6
  Poisoned by user:        f7
  Container overflow:      fc
  Array cookie:            ac
  Intra object redzone:    bb
  ASan internal:           fe
  Left alloca redzone:     ca
  Right alloca redzone:    cb
==924420==ABORTING
FileCheck error: '<stdin>' is empty.
FileCheck command line:  /b/sanitizer-x86_64-linux-fast/build/llvm_build_asan/bin/FileCheck /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/test/CodeGen/RISCV/regalloc-last-chance-recoloring-failure.ll --check-prefix=SUBREGLIVENESS
--
********************
Testing:  0.. 10.. 20.. 30.. 40..
FAIL: LLVM :: CodeGen/RISCV/rvv/undef-earlyclobber-chain.ll (35659 of 72944)
******************** TEST 'LLVM :: CodeGen/RISCV/rvv/undef-earlyclobber-chain.ll' FAILED ********************
Script:
--
: 'RUN: at line 2';   /b/sanitizer-x86_64-linux-fast/build/llvm_build_asan/bin/llc -mtriple riscv64 -mattr=+v -riscv-enable-subreg-liveness < /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/test/CodeGen/RISCV/rvv/undef-earlyclobber-chain.ll  | /b/sanitizer-x86_64-linux-fast/build/llvm_build_asan/bin/FileCheck /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/test/CodeGen/RISCV/rvv/undef-earlyclobber-chain.ll
--
Exit Code: 2
Command Output (stderr):
--
=================================================================
==928893==ERROR: AddressSanitizer: use-after-poison on address 0x62100005a980 at pc 0x55a20925a09e bp 0x7ffebd9ed930 sp 0x7ffebd9ed928
READ of size 8 at 0x62100005a980 thread T0
    #0 0x55a20925a09d in operands_begin /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/include/llvm/CodeGen/MachineInstr.h:635:42
    #1 0x55a20925a09d in defs /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/include/llvm/CodeGen/MachineInstr.h:679:23
    #2 0x55a20925a09d in isEarlyClobberMI /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/lib/Target/RISCV/RISCVRVVInitUndef.cpp:175:26
    #3 0x55a20925a09d in processBasicBlock /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/lib/Target/RISCV/RISCVRVVInitUndef.cpp:250:39
    #4 0x55a20925a09d in (anonymous namespace)::RISCVInitUndef::runOnMachineFunction(llvm::MachineFunction&) /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/lib/Target/RISCV/RISCVRVVInitUndef.cpp:270:16
    #5 0x55a20b6312d2 in llvm::MachineFunctionPass::runOnFunction(llvm::Function&) /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/lib/CodeGen/MachineFunctionPass.cpp:91:13
    #6 0x55a20c5424a3 in llvm::FPPassManager::runOnFunction(llvm::Function&) /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/lib/IR/LegacyPassManager.cpp:1430:27
    #7 0x55a20c55c140 in llvm::FPPassManager::runOnModule(llvm::Module&) /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/lib/IR/LegacyPassManager.cpp:1476:16
    #8 0x55a20c544312 in runOnModule /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/lib/IR/LegacyPassManager.cpp:1545:27
    #9 0x55a20c544312 in llvm::legacy::PassManagerImpl::run(llvm::Module&) /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/lib/IR/LegacyPassManager.cpp:535:44
    #10 0x55a2062c9a10 in compileModule /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/tools/llc/llc.cpp:733:8
    #11 0x55a2062c9a10 in main /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/tools/llc/llc.cpp:420:22
    #12 0x7f2447396d8f  (/lib/x86_64-linux-gnu/libc.so.6+0x29d8f) (BuildId: 69389d485a9793dbe873f0ea2c93e02efaa9aa3d)
    #13 0x7f2447396e3f in __libc_start_main (/lib/x86_64-linux-gnu/libc.so.6+0x29e3f) (BuildId: 69389d485a9793dbe873f0ea2c93e02efaa9aa3d)
    #14 0x55a2061f5ba4 in _start (/b/sanitizer-x86_64-linux-fast/build/llvm_build_asan/bin/llc+0x7670ba4)
0x62100005a980 is located 2176 bytes inside of 4096-byte region [0x62100005a100,0x62100005b100)
allocated by thread T0 here:
    #0 0x55a2062b5b32 in operator new(unsigned long, std::align_val_t) /b/sanitizer-x86_64-linux-fast/build/llvm-project/compiler-rt/lib/asan/asan_new_delete.cpp:107:3
    #1 0x55a2067b108d in Allocate /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/include/llvm/Support/AllocatorBase.h:86:12
    #2 0x55a2067b108d in StartNewSlab /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/include/llvm/Support/Allocator.h:339:42
    #3 0x55a2067b108d in llvm::BumpPtrAllocatorImpl<llvm::MallocAllocator, 4096ul, 4096ul, 128ul>::Allocate(unsigned long, llvm::Align) /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/include/llvm/Support/Allocator.h:195:5
    #4 0x55a20b609869 in Allocate /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/include/llvm/Support/Allocator.h:209:12
    #5 0x55a20b609869 in operator new<llvm::MallocAllocator, 4096UL, 4096UL, 128UL> /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/include/llvm/Support/Allocator.h:443:20
    #6 0x55a20b609869 in llvm::MachineFunction::init() /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/lib/CodeGen/MachineFunction.cpp:185:15
    #7 0x55a20b6cbc17 in llvm::MachineModuleInfo::getOrCreateMachineFunction(llvm::Function&) /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/lib/CodeGen/MachineModuleInfo.cpp:108:14
    #8 0x55a20b630def in llvm::MachineFunctionPass::runOnFunction(llvm::Function&) /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/lib/CodeGen/MachineFunctionPass.cpp:46:29
    #9 0x55a20c5424a3 in llvm::FPPassManager::runOnFunction(llvm::Function&) /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/lib/IR/LegacyPassManager.cpp:1430:27
    #10 0x55a20c55c140 in llvm::FPPassManager::runOnModule(llvm::Module&) /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/lib/IR/LegacyPassManager.cpp:1476:16
    #11 0x55a20c544312 in runOnModule /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/lib/IR/LegacyPassManager.cpp:1545:27
    #12 0x55a20c544312 in llvm::legacy::PassManagerImpl::run(llvm::Module&) /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/lib/IR/LegacyPassManager.cpp:535:44
    #13 0x55a2062c9a10 in compileModule /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/tools/llc/llc.cpp:733:8
    #14 0x55a2062c9a10 in main /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/tools/llc/llc.cpp:420:22
    #15 0x7f2447396d8f  (/lib/x86_64-linux-gnu/libc.so.6+0x29d8f) (BuildId: 69389d485a9793dbe873f0ea2c93e02efaa9aa3d)
SUMMARY: AddressSanitizer: use-after-poison /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/include/llvm/CodeGen/MachineInstr.h:635:42 in operands_begin
Shadow bytes around the buggy address:
  0x0c42800034e0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  0x0c42800034f0: 00 f7 00 00 00 00 00 00 00 00 00 f7 00 00 00 00
  0x0c4280003500: f7 00 00 00 00 00 00 00 00 00 f7 00 00 00 00 00
  0x0c4280003510: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  0x0c4280003520: 00 00 00 00 00 00 00 00 00 00 00 f7 f7 f7 f7 f7
=>0x0c4280003530:[f7]f7 f7 f7 f7 f7 00 00 00 00 f7 00 00 00 00 00
  0x0c4280003540: 00 00 00 00 f7 00 00 00 00 00 00 00 00 00 00 00
  0x0c4280003550: 00 00 00 00 00 f7 00 00 00 00 00 00 00 00 00 f7
  0x0c4280003560: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  0x0c4280003570: f7 00 00 00 00 00 00 00 00 00 f7 00 00 00 00 f7
  0x0c4280003580: 00 00 00 00 00 00 00 00 00 f7 00 00 00 00 00 00
Shadow byte legend (one shadow byte represents 8 application bytes):
  Addressable:           00
  Partially addressable: 01 02 03 04 05 06 07 
  Heap left redzone:       fa
  Freed heap region:       fd
  Stack left redzone:      f1
  Stack mid redzone:       f2
  Stack right redzone:     f3
  Stack after return:      f5
  Stack use after scope:   f8
  Global redzone:          f9
  Global init order:       f6
  Poisoned by user:        f7
  Container overflow:      fc
  Array cookie:            ac
  Intra object redzone:    bb
  ASan internal:           fe
  Left alloca redzone:     ca
  Right alloca redzone:    cb
==928893==ABORTING
FileCheck error: '<stdin>' is empty.
FileCheck command line:  /b/sanitizer-x86_64-linux-fast/build/llvm_build_asan/bin/FileCheck /b/sanitizer-x86_64-linux-fast/build/llvm-project/llvm/test/CodeGen/RISCV/rvv/undef-earlyclobber-chain.ll
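
The bracketed shadow byte in each report above can be cross-checked by hand. This is a minimal sketch, assuming the default x86-64 Linux ASan mapping `shadow = (addr >> 3) + 0x7fff8000`; the helper names are hypothetical.

```cpp
#include <cstdint>

// Assumption: default x86-64 Linux ASan layout, where one shadow byte
// covers 8 application bytes and the shadow region starts at 0x7fff8000.
constexpr std::uint64_t kShadowScale = 3;
constexpr std::uint64_t kShadowOffset = 0x7fff8000ULL;

// Shadow byte that describes the given application address.
constexpr std::uint64_t shadowAddress(std::uint64_t AppAddr) {
  return (AppAddr >> kShadowScale) + kShadowOffset;
}

// Start address of the 16-byte row in which the report prints that byte.
constexpr std::uint64_t shadowRow(std::uint64_t ShadowAddr) {
  return ShadowAddr & ~0xfULL;
}

// Zero-based column of the byte within its row.
constexpr unsigned shadowColumn(std::uint64_t ShadowAddr) {
  return static_cast<unsigned>(ShadowAddr & 0xfULL);
}
```

For the first report, `0x62100002da58` maps to shadow byte `0x0c427fffdb4b`, which is column 11 of row `0x0c427fffdb40`: the bracketed `f7` ("Poisoned by user"). For the second report, `0x62100005a980` maps to `0x0c4280003530`, column 0 of its row.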

MaskRay added inline comments.Feb 15 2023, 11:50 AM

llvm/lib/Target/RISCV/RISCVRVVInitUndef.cpp
2	RISCVRVVInitUndef.cpp
216	delete blank line

MaskRay added a reverting change: rG6f3e6a765a9e: Revert D129735 "[RISCV] Add new pass to transform undef to pseudo for vector….Feb 15 2023, 11:51 AM

craig.topper reopened this revision.Feb 15 2023, 12:49 PM

This revision is now accepted and ready to land.Feb 15 2023, 12:49 PM

Removing my approval until the asan failure is fixed

This revision now requires changes to proceed.Feb 15 2023, 12:50 PM

Update comment
Remove blank line
Use the old version of isEarlyClobberMI

craig.topper added inline comments.Feb 15 2023, 7:36 PM

llvm/lib/Target/RISCV/RISCVRVVInitUndef.cpp
176	Any idea why the llvm::any_of didn't work?

Harbormaster completed remote builds in B214044: Diff 497869.Feb 15 2023, 8:00 PM

jrtc27 added inline comments.Feb 15 2023, 8:51 PM

llvm/lib/Target/RISCV/RISCVRVVInitUndef.cpp
176	Because the thing that calls this is erasing from the MBB whilst it iterates over it... so this is the wrong fix and probably doesn't even work either

Restore the any_of version of isEarlyClobberMI
handleImplicitDef removes the IMPLICIT_DEF MI, which is then still used in the rest of the for-loop body. I think this is why the ASan build fails.
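
The failure mode discussed here, erasing an instruction and then touching it later in the same loop iteration, can be reproduced with a standard container. This is a minimal sketch, assuming a `std::list` stands in for the basic block; `Instr`, `Block`, `Stats`, `replaceImplicitDef`, and `processBlock` are hypothetical names and do not mirror the pass's actual code.

```cpp
#include <cassert>
#include <list>
#include <string>

// Hypothetical stand-in for a machine instruction.
struct Instr {
  std::string Opcode;
  bool EarlyClobber;
};

using Block = std::list<Instr>;

struct Stats {
  unsigned Replaced;
  unsigned EarlyClobber;
};

// Replaces an IMPLICIT_DEF at It with an init pseudo. The old element is
// erased, so It is updated to point at the replacement before returning.
bool replaceImplicitDef(Block &BB, Block::iterator &It) {
  assert(It != BB.end() && "iterator must point at an instruction");
  if (It->Opcode != "IMPLICIT_DEF")
    return false;
  Block::iterator New = BB.insert(It, Instr{"INIT_UNDEF", false});
  BB.erase(It); // Any reference taken to the old element now dangles.
  It = New;
  return true;
}

Stats processBlock(Block &BB) {
  Stats S = {0, 0};
  for (Block::iterator It = BB.begin(); It != BB.end(); ++It) {
    if (replaceImplicitDef(BB, It))
      ++S.Replaced;
    // Bind the reference only after the possible erase, through the
    // updated iterator, so it never refers to a removed element.
    const Instr &I = *It;
    if (I.EarlyClobber)
      ++S.EarlyClobber;
  }
  return S;
}
```

Binding `I` before calling `replaceImplicitDef` would leave it referring to the erased element, which is the kind of access the sanitizer flags.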

Harbormaster completed remote builds in B214051: Diff 497878.Feb 15 2023, 10:57 PM

craig.topper added inline comments.Feb 16 2023, 12:37 AM

llvm/lib/Target/RISCV/RISCVRVVInitUndef.cpp
176	Do we need `[&]` here or would `[]` work?

Rename some variables (UserMOs -> UseMOs, UseMO -> UserMO)
Reorder the if statements in processBasicBlock
Use [] instead of [&] because the lambda body doesn't use any variables other than DefMO.
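
The captureless-lambda form can be shown with `std::any_of`, which `llvm::any_of` wraps for ranges. This is a minimal sketch, assuming a simplified operand record; `Operand` and `hasEarlyClobberDef` are hypothetical names.

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

// Hypothetical stand-in for a machine operand.
struct Operand {
  bool IsDef;
  bool IsEarlyClobber;
};

// True if any def operand is marked early-clobber. The lambda reads only
// its parameter, so an empty capture list [] is enough.
bool hasEarlyClobberDef(const std::vector<Operand> &Ops) {
  return std::any_of(Ops.begin(), Ops.end(), [](const Operand &Op) {
    return Op.IsDef && Op.IsEarlyClobber;
  });
}
```

An empty capture list also documents that the predicate depends on nothing but the operand it is given.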

Harbormaster completed remote builds in B214089: Diff 497924.Feb 16 2023, 3:09 AM

LGTM

This revision is now accepted and ready to land.Feb 20 2023, 5:00 PM

Closed by commit rG3b8c0b342e16: [RISCV] Add new pass to transform undef to pseudo for vector values. (authored by BeMg). Feb 22 2023, 4:33 AM

This revision was automatically updated to reflect the committed changes.

BeMg added a commit: rG3b8c0b342e16: [RISCV] Add new pass to transform undef to pseudo for vector values..

craig.topper mentioned this in D145546: [RISCV] Enable subregister liveness by default.Mar 7 2023, 10:51 PM

BeMg mentioned this in rG365f84039878: [RISCV] Enable subregister liveness by default.Mar 8 2023, 11:15 PM

BeMg mentioned this in D155041: [RISCV] Remove unnecessary move of undefined with subregister liveness enabled.Jul 11 2023, 10:15 PM

Attachment: 1.ll (24 KB)

Illegal instructions "vslideup.vi v8, v8, 5" and "vslideup.vi v8, v8, 2" are generated.

Herald added subscribers: wangpc, jobnoorman. Sep 7 2023, 1:05 AM

In D129735#4640476, @garthlei wrote:

Attachment: 1.ll (24 KB)

Illegal instructions "vslideup.vi v8, v8, 5" and "vslideup.vi v8, v8, 2" are generated.

Looks like the undef operand is not the passthru operand in this case. It's the value to slide. This pass only fixed the case where the passthru was undef. Can you file a new issue?
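
The reported instructions are illegal because the RVV specification reserves `vslideup` encodings whose destination register group overlaps the source register group. This is a minimal sketch of that check, assuming integer LMUL values (1, 2, 4, 8) and register numbers 0 to 31; `groupsOverlap` and `isLegalSlideUp` are hypothetical helpers, not LLVM functions.

```cpp
// True if the register groups [RegA, RegA + LMulA) and [RegB, RegB + LMulB)
// share at least one vector register.
constexpr bool groupsOverlap(unsigned RegA, unsigned LMulA, unsigned RegB,
                             unsigned LMulB) {
  return RegA < RegB + LMulB && RegB < RegA + LMulA;
}

// vslideup requires the destination group not to overlap the source group.
constexpr bool isLegalSlideUp(unsigned Vd, unsigned Vs2, unsigned LMul) {
  return !groupsOverlap(Vd, LMul, Vs2, LMul);
}
```

Under these assumptions `vslideup.vi v8, v8, 5` fails the check, while `vslideup.vi v8, v9, 5` passes at LMUL=1.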

evandro removed a subscriber: evandro.Sep 12 2023, 5:49 PM

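Revision Contents

Path	Size
llvm/lib/Target/RISCV/CMakeLists.txt	1 line
llvm/lib/Target/RISCV/RISCV.h	4 lines
llvm/lib/Target/RISCV/RISCVAsmPrinter.cpp	5 lines
llvm/lib/Target/RISCV/RISCVInstrInfo.td	8 lines
llvm/lib/Target/RISCV/RISCVRVVInitUndef.cpp	274 lines
llvm/lib/Target/RISCV/RISCVTargetMachine.cpp	9 lines
llvm/test/CodeGen/RISCV/O3-pipeline.ll	1 line
llvm/test/CodeGen/RISCV/regalloc-last-chance-recoloring-failure.ll	20 lines
llvm/test/CodeGen/RISCV/rvv/subregister-undef-early-clobber.mir	130 lines
llvm/test/CodeGen/RISCV/rvv/undef-earlyclobber-chain.ll	20 lines
llvm/test/CodeGen/RISCV/rvv/vrgatherei16-subreg-liveness.ll	6 lines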

Diff 499452

llvm/lib/Target/RISCV/CMakeLists.txt

(30 lines hidden)
 add_llvm_target(RISCVCodeGen
   RISCVISelDAGToDAG.cpp
   RISCVISelLowering.cpp
   RISCVMachineFunctionInfo.cpp
   RISCVMacroFusion.cpp
   RISCVMCInstLower.cpp
   RISCVMergeBaseOffset.cpp
   RISCVRedundantCopyElimination.cpp
   RISCVRegisterInfo.cpp
+  RISCVRVVInitUndef.cpp
   RISCVSExtWRemoval.cpp
   RISCVStripWSuffix.cpp
   RISCVSubtarget.cpp
   RISCVTargetMachine.cpp
   RISCVTargetObjectFile.cpp
   RISCVTargetTransformInfo.cpp
   GISel/RISCVCallLowering.cpp
   GISel/RISCVInstructionSelector.cpp
(28 lines hidden)

llvm/lib/Target/RISCV/RISCV.h

(65 lines hidden)
 void initializeRISCVExpandAtomicPseudoPass(PassRegistry &);

 FunctionPass *createRISCVInsertVSETVLIPass();
 void initializeRISCVInsertVSETVLIPass(PassRegistry &);

 FunctionPass *createRISCVRedundantCopyEliminationPass();
 void initializeRISCVRedundantCopyEliminationPass(PassRegistry &);

+FunctionPass *createRISCVInitUndefPass();
+void initializeRISCVInitUndefPass(PassRegistry &);
+extern char &RISCVInitUndefID;
+
 InstructionSelector *createRISCVInstructionSelector(const RISCVTargetMachine &,
                                                     RISCVSubtarget &,
                                                     RISCVRegisterBankInfo &);
 void initializeRISCVDAGToDAGISelPass(PassRegistry &);
 } // namespace llvm

 #endif

llvm/lib/Target/RISCV/RISCVAsmPrinter.cpp

(107 lines hidden)
 void RISCVAsmPrinter::emitInstruction(const MachineInstr *MI) {
   if (emitPseudoExpansionLowering(*OutStreamer, MI))
     return;

   switch (MI->getOpcode()) {
   case RISCV::HWASAN_CHECK_MEMACCESS_SHORTGRANULES:
     LowerHWASAN_CHECK_MEMACCESS(*MI);
     return;
+  case RISCV::PseudoRVVInitUndefM1:
+  case RISCV::PseudoRVVInitUndefM2:
+  case RISCV::PseudoRVVInitUndefM4:
+  case RISCV::PseudoRVVInitUndefM8:
+    return;
   }

   MCInst TmpInst;
   if (!lowerRISCVMachineInstrToMCInst(MI, TmpInst, *this))
     EmitToStreamer(*OutStreamer, TmpInst);
 }

 bool RISCVAsmPrinter::PrintAsmOperand(const MachineInstr *MI, unsigned OpNo,
(344 lines hidden)

llvm/lib/Target/RISCV/RISCVInstrInfo.td

(1,864 lines hidden)

 let Predicates = [IsRV64] in {
 // Select W instructions if only the lower 32-bits of the result are used.
 def : Pat<(binop_allwusers<add> GPR:$rs1, (AddiPair:$rs2)),
           (ADDIW (ADDIW GPR:$rs1, (AddiPairImmLarge AddiPair:$rs2)),
                  (AddiPairImmSmall AddiPair:$rs2))>;
 }

+/// Empty pseudo for RISCVInitUndefPass
+let hasSideEffects = 0, mayLoad = 0, mayStore = 0, Size = 0, isCodeGenOnly = 1 in {
+  def PseudoRVVInitUndefM1 : Pseudo<(outs VR:$vd), (ins), [], "">;
+  def PseudoRVVInitUndefM2 : Pseudo<(outs VRM2:$vd), (ins), [], "">;
+  def PseudoRVVInitUndefM4 : Pseudo<(outs VRM4:$vd), (ins), [], "">;
+  def PseudoRVVInitUndefM8 : Pseudo<(outs VRM8:$vd), (ins), [], "">;
+}
+
 //===----------------------------------------------------------------------===//
 // Standard extensions
 //===----------------------------------------------------------------------===//

 include "RISCVInstrInfoM.td"
 include "RISCVInstrInfoA.td"
 include "RISCVInstrInfoF.td"
 include "RISCVInstrInfoD.td"
(15 lines hidden)

llvm/lib/Target/RISCV/RISCVRVVInitUndef.cpp

This file was added.

				//===- RISCVRVVInitUndef.cpp - Initialize undef vector value to pseudo ----===//
				//
				[Inline comment, craig.topper, not done] pesudo -> pseudo
				[Inline comment, MaskRay, not done] RISCVRVVInitUndef.cpp
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// This file implements a function pass that initializes undef vector value to
				// temporary pseudo instruction and remove it in expandpseudo pass to prevent
				// register allocation resulting in a constraint violated result for vector
				[Inline comment, craig.topper, not done] "pesudo" -> "pseudo" in two places
				// instruction.
				//
				// RISC-V vector instruction has register overlapping constraint for certain
				// instructions, and will cause illegal instruction trap if violated, we use
				// early clobber to model this constraint, but it can't prevent register
				// allocator allocated same or overlapped if the input register is undef value,
				// so convert IMPLICIT_DEF to temporary pseudo instruction and remove it later
				// could prevent that happen, it's not best way to resolve this, and it might
				[Inline comment, craig.topper, not done] pesudo -> pseudo
				[Inline comment, craig.topper, not done] latter -> later
				// change the order of program or increase the register pressure, so ideally we
				// should model the constraint right, but before we model the constraint right,
				// it's the only way to prevent that happen.
				//
				// When we enable the subregister liveness option, it will also trigger same
				// issue due to the partial of register is undef. If we pseudoinit the whole
				// register, then it will generate redundant COPY instruction. Currently, it
				// will generate INSERT_SUBREG to make sure the whole register is occupied
				// when program encounter operation that has early-clobber constraint.
				[Inline comment, craig.topper, not done] "is be occupied" -> "is occupied"
				//
				//
				// See also: https://github.com/llvm/llvm-project/issues/50157
				//
				//===----------------------------------------------------------------------===//

				#include "RISCV.h"
				#include "RISCVSubtarget.h"
				#include "llvm/CodeGen/DetectDeadLanes.h"
				#include "llvm/CodeGen/MachineFunctionPass.h"
				using namespace llvm;

				[Inline comment, craig.topper, not done] Are there any maps in the code anymore?
				#define DEBUG_TYPE "riscv-init-undef"
				#define RISCV_INIT_UNDEF_NAME "RISCV init undef pass"

				namespace {

				class RISCVInitUndef : public MachineFunctionPass {
				const TargetInstrInfo *TII;
				MachineRegisterInfo *MRI;
				const RISCVSubtarget *ST;
				const TargetRegisterInfo *TRI;

				[Inline comment, kito-cheng, done] `SmallPtrSet` for `Seen`
				public:
				static char ID;

				[Inline comment, arsenm, done] DenseMap
				RISCVInitUndef() : MachineFunctionPass(ID) {
				initializeRISCVInitUndefPass(*PassRegistry::getPassRegistry());
				}
				bool runOnMachineFunction(MachineFunction &MF) override;

				void getAnalysisUsage(AnalysisUsage &AU) const override {
				AU.setPreservesCFG();
				MachineFunctionPass::getAnalysisUsage(AU);
				}

				StringRef getPassName() const override { return RISCV_INIT_UNDEF_NAME; }

				private:
				bool processBasicBlock(MachineFunction &MF, MachineBasicBlock &MBB,
				const DeadLaneDetector &DLD);
				bool handleImplicitDef(MachineBasicBlock &MBB,
				MachineBasicBlock::iterator &Inst);
				bool isVectorRegClass(const Register R);
				const TargetRegisterClass *
				getVRLargestSuperClass(const TargetRegisterClass *RC) const;
				[Inline comment, craig.topper, not done] You can pass Register by value
				bool handleSubReg(MachineFunction &MF, MachineInstr &MI,
				const DeadLaneDetector &DLD);
				};

				[Inline comment, craig.topper, not done] There are a bunch of synthesized register classes. You need to get the ID and then ask if that register class is a subclass of the 4 main classes.
				} // end anonymous namespace

				char RISCVInitUndef::ID = 0;
				INITIALIZE_PASS(RISCVInitUndef, DEBUG_TYPE, RISCV_INIT_UNDEF_NAME, false, false)
				char &llvm::RISCVInitUndefID = RISCVInitUndef::ID;

				const TargetRegisterClass *
				RISCVInitUndef::getVRLargestSuperClass(const TargetRegisterClass *RC) const {
				if (RISCV::VRM8RegClass.hasSubClassEq(RC))
				return &RISCV::VRM8RegClass;
				[Inline comment, craig.topper, done] Can this be `if (RISCV::VRM8RegClass.hasSubClassEq(RC)) return &RISCV::VRM8RegClass; if (RISCV::VRM4RegClass.hasSubClassEq(RC)) return &RISCV::VRM4RegClass; if (RISCV::VRM2RegClass.hasSubClassEq(RC)) return &RISCV::VRM2RegClass; if (RISCV::VRRegClass.hasSubClassEq(RC)) return &RISCV::VRRegClass; return RC;`
				if (RISCV::VRM4RegClass.hasSubClassEq(RC))
				return &RISCV::VRM4RegClass;
				if (RISCV::VRM2RegClass.hasSubClassEq(RC))
				return &RISCV::VRM2RegClass;
				if (RISCV::VRRegClass.hasSubClassEq(RC))
				return &RISCV::VRRegClass;
				return RC;
				}

				bool RISCVInitUndef::isVectorRegClass(const Register R) {
				const TargetRegisterClass *RC = MRI->getRegClass(R);
				craig.topper (Done): This variable is not named correctly now.
				return RISCV::VRRegClass.hasSubClassEq(RC) ||
				RISCV::VRM2RegClass.hasSubClassEq(RC) ||
				arsenm (Not Done): I don't understand the point of this function, getSuperClasses is already sorted by largest. You can just take the first?
				BeMg (Author, Done): When this function takes VRNoV0RegClassID as input, getSuperClasses returns the classes in the following order:
				    AnyRegRegClassID 1
				    VMRegClassID 20
				    VRRegClassID 21 <- stop here
				    ...
				This patch only wants those four register classes as results.
				RISCV::VRM4RegClass.hasSubClassEq(RC) ||
				RISCV::VRM8RegClass.hasSubClassEq(RC);
				}

				static unsigned getUndefInitOpcode(unsigned RegClassID) {
				craig.topper (Not Done): origin -> original
				arsenm (Not Done): Could this use the new getPhysRegBaseClass?
				BeMg (Author, Done): Can this hook be used for virtual registers? It seems to be only for physical registers, but this pass runs before register allocation.
				switch (RegClassID) {
				case RISCV::VRRegClassID:
				arsenm (Done): Don't call getRegClass for each use
				return RISCV::PseudoRVVInitUndefM1;
				case RISCV::VRM2RegClassID:
				return RISCV::PseudoRVVInitUndefM2;
				craig.topper (Not Done): Can't we do something like
				    VRRegClass.hasSubClassEq(MRI->getRegClass(R)) ||
				    VRM2RegClass.hasSubClassEq(MRI->getRegClass(R)) ||
				    VRM4RegClass.hasSubClassEq(MRI->getRegClass(R)) ||
				    VRM8RegClass.hasSubClassEq(MRI->getRegClass(R))
				why do we need to use getVRLargestSuperClass?
				case RISCV::VRM4RegClassID:
				return RISCV::PseudoRVVInitUndefM4;
				case RISCV::VRM8RegClassID:
				return RISCV::PseudoRVVInitUndefM8;
				default:
				llvm_unreachable("Unexpected register class.");
				craig.topper (Done): `return RISCV::PseudoRVVInitUndefM1;`
				}
				kito-cheng (Not Done): The field is SEW, but it seems I don't have a good way to figure out the right SEW setting without tracing the UD chain, and sometimes we don't even have that info. Fixing it to 16 or 32 works correctly, but is sub-optimal.
				}

				bool RISCVInitUndef::handleImplicitDef(MachineBasicBlock &MBB,
				MachineBasicBlock::iterator &Inst) {
				const TargetRegisterInfo &TRI =
				*MBB.getParent()->getSubtarget().getRegisterInfo();

				assert(Inst->getOpcode() == TargetOpcode::IMPLICIT_DEF);

				Register Reg = Inst->getOperand(0).getReg();
				if (!Reg.isVirtual())
				return false;

				bool NeedPseudoInit = false;
				SmallVector<MachineOperand *, 1> UseMOs;
				for (MachineOperand &MO : MRI->use_nodbg_operands(Reg)) {
				MachineInstr *UserMI = MO.getParent();

				craig.topper (Not Done): We don't need a pseudo for the NoV0 classes. We need to figure out which main class the register class is a subclass of and use that to pick the pseudo.
				bool HasEarlyClobber = false;
				bool TiedToDef = false;
				for (MachineOperand &UserMO : UserMI->operands()) {
				if (!UserMO.isReg())
				continue;
				if (UserMO.isEarlyClobber())
				arsenm (Done): Reg.isVirtual()
				HasEarlyClobber = true;
				if (UserMO.isUse() && UserMO.isTied() &&
				TRI.regsOverlap(UserMO.getReg(), Reg))
				TiedToDef = true;
				}
				if (HasEarlyClobber && !TiedToDef) {
				NeedPseudoInit = true;
				UseMOs.push_back(&MO);
				}
				}
				craig.topper (Not Done): Do we need to scan all operands? Can we check if MO is a tied use and find the operand it is tied to, to check the early clobber?
				BeMg (Author, Done):
				    0B   bb.0.entry:
				    16B    LIFETIME_START %stack.0.dst
				    32B    %1:vr = IMPLICIT_DEF
				    48B    early-clobber %0:vr = PseudoVRGATHER_VI_M1 killed %1:vr, 0, 0, 5
				    64B    %2:gpr = ADDI %stack.0.dst, 0
				    80B    PseudoVSE32_V_M1 killed %0:vr, killed %2:gpr, 0, 5
				    96B    LIFETIME_END %stack.0.dst
				    112B   %3:gpr = COPY $x0
				    128B   $x10 = COPY %3:gpr
				    144B   PseudoRET implicit $x10
				Here we may need to scan both def and use operands, because an early-clobber operand does not always have a tied operand.
				craig.topper (Not Done): Right. I wasn't thinking about that right.

				if (!NeedPseudoInit)
				return false;

				LLVM_DEBUG(
				dbgs() << "Emitting PseudoRVVInitUndef for implicit vector register "
				<< Reg << '\n');

				unsigned RegClassID = getVRLargestSuperClass(MRI->getRegClass(Reg))->getID();
				unsigned Opcode = getUndefInitOpcode(RegClassID);

				BuildMI(MBB, Inst, Inst->getDebugLoc(), TII->get(Opcode), Reg);

				Inst = MBB.erase(Inst);

				for (auto MO : UseMOs)
				craig.topper (Not Done): This is not maintainable. I meant something like
				    const TargetRegisterClass *RC = MRI->getRegClass(Reg);
				    if (RC->hasSuperClassEq(&RISCV::VRRegClass)) {
				      Opcode = RISCV::PseudoRVVInitUndefM1;
				    } else if (RC->hasSuperClassEq(&RISCV::VRM2RegClass)) {
				      Opcode = RISCV::PseudoRVVInitUndefM2;
				    } ...
				MO->setIsUndef(false);

				return true;
				}
				craig.topper (Not Done): Why passing `Insts` by value? That will make a copy but it doesn't look like we need a copy.

				static bool isEarlyClobberMI(MachineInstr &MI) {
				return llvm::any_of(MI.defs(), [](const MachineOperand &DefMO) {
				return DefMO.isReg() && DefMO.isEarlyClobber();
				craig.topper (Not Done): Any idea why the llvm::any_of didn't work?
				jrtc27 (Not Done): Because the thing that calls this is erasing from the MBB whilst it iterates over it... so this is the wrong fix and probably doesn't even work either
				craig.topper (Not Done): Do we need `[&]` here or would `[]` work?
				});
				}

				bool RISCVInitUndef::handleSubReg(MachineFunction &MF, MachineInstr &MI,
				const DeadLaneDetector &DLD) {
				bool Changed = false;
				craig.topper (Not Done): Why passing `Insts` by value? That will make a copy but it doesn't look like we need a copy.

				craig.topper (Not Done): unsigned -> Register
				for (MachineOperand &UseMO : MI.uses()) {
				if (!UseMO.isReg())
				continue;
				if (!UseMO.getReg().isVirtual())
				continue;
				craig.topper (Done): Use `defs()`?
				craig.topper (Done): Replace the place with llvm::any_of?

				Register Reg = UseMO.getReg();
				DeadLaneDetector::VRegInfo Info =
				DLD.getVRegInfo(Register::virtReg2Index(Reg));

				if (Info.UsedLanes == Info.DefinedLanes)
				continue;

				const TargetRegisterClass *TargetRegClass =
				craig.topper (Not Done): unsigned -> Register
				craig.topper (Not Done): Why passing `Insts` by value? That will make a copy but it doesn't look like we need a copy.
				getVRLargestSuperClass(MRI->getRegClass(Reg));

				LaneBitmask NeedDef = Info.UsedLanes & ~Info.DefinedLanes;
				craig.topper (Not Done): unsigned -> Register

				arsenm (Done): unique_ptr
				craig.topper (Done): Can we scan `uses()` instead of `operands()`?
				LLVM_DEBUG({
				craig.topper (Not Done): Isn't this calculating for the entire function? But handleSubReg is called for individual instructions. So we'll be recomputing information, right?
				BeMg (Author, Done): Yes, you're right. We don't need to recompute this info for each instruction. It is called only once now.
				dbgs() << "Instruction has undef subregister.\n";
				dbgs() << printReg(Reg, nullptr)
				<< " Used: " << PrintLaneMask(Info.UsedLanes)
				<< " Def: " << PrintLaneMask(Info.DefinedLanes)
				craig.topper (Done): I think we wouldn't need this if we only checked `uses`?
				<< " Need Def: " << PrintLaneMask(NeedDef) << "\n";
				});

				SmallVector<unsigned> SubRegIndexNeedInsert;
				TRI->getCoveringSubRegIndexes(*MRI, TargetRegClass, NeedDef,
				SubRegIndexNeedInsert);

				Register LatestReg = Reg;
				for (auto ind : SubRegIndexNeedInsert) {
				Changed = true;
				MaskRay (Not Done): delete blank line
				const TargetRegisterClass *SubRegClass =
				getVRLargestSuperClass(TRI->getSubRegisterClass(TargetRegClass, ind));
				Register TmpInitSubReg = MRI->createVirtualRegister(SubRegClass);
				BuildMI(*MI.getParent(), &MI, MI.getDebugLoc(),
				TII->get(getUndefInitOpcode(SubRegClass->getID())),
				TmpInitSubReg);
				Register NewReg = MRI->createVirtualRegister(TargetRegClass);
				BuildMI(*MI.getParent(), &MI, MI.getDebugLoc(),
				TII->get(TargetOpcode::INSERT_SUBREG), NewReg)
				.addReg(LatestReg)
				.addReg(TmpInitSubReg)
				.addImm(ind);
				LatestReg = NewReg;
				craig.topper (Done): Can we put `Info.UsedLanes & ~Info.DefinedLanes` into a variable? We use that expression twice
				}

				UseMO.setReg(LatestReg);
				craig.topper (Done): `Lastest` isn't a word
				}

				return Changed;
				}
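For readers following `handleSubReg`, the `NeedDef` computation can be illustrated outside LLVM. This is only a sketch under assumptions: `LaneBits` is a hypothetical stand-in for `LaneBitmask`, `lanesNeedingDef` is an illustrative name, and the layout of one bit per M1-sized lane of a `vrm4` register is invented for the example (real lane masks are target-defined).

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for llvm::LaneBitmask: one bit per lane.
using LaneBits = std::uint64_t;

// Lanes that are read by some user but never written by any def.
// Mirrors the expression `Info.UsedLanes & ~Info.DefinedLanes` in the patch.
constexpr LaneBits lanesNeedingDef(LaneBits Used, LaneBits Defined) {
  return Used & ~Defined;
}

// Assumed layout for illustration: bit i is the i-th M1 lane of a vrm4 value.
// Only lane 0 was written (for example via INSERT_SUBREG), all four are read.
static_assert(lanesNeedingDef(0xF, 0x1) == 0xE,
              "lanes 1, 2 and 3 still need a definition");

// When every used lane is also defined, nothing needs to be inserted.
static_assert(lanesNeedingDef(0x3, 0x3) == 0x0, "no undefined lanes are read");
```

Under that assumed layout, lanes 1 to 3 can be covered by one M1 sub-register and one M2 sub-register, which appears consistent with the `PseudoRVVInitUndefM1` and `PseudoRVVInitUndefM2` pair expected in `test_M4_sub_vrm1_0` further down in this diff.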

				bool RISCVInitUndef::processBasicBlock(MachineFunction &MF,
				MachineBasicBlock &MBB,
				const DeadLaneDetector &DLD) {
				bool Changed = false;
				for (MachineBasicBlock::iterator I = MBB.begin(); I != MBB.end(); ++I) {
				MachineInstr &MI = *I;
				if (ST->enableSubRegLiveness() && isEarlyClobberMI(MI))
				Changed |= handleSubReg(MF, MI, DLD);
				if (MI.isImplicitDef()) {
				craig.topper (Not Done): Pesudo -> Pseudo
				auto DstReg = MI.getOperand(0).getReg();
				craig.topper (Not Done): regitser -> register
				if (isVectorRegClass(DstReg))
				Changed |= handleImplicitDef(MBB, I);
				}
				}
				return Changed;
				}

				bool RISCVInitUndef::runOnMachineFunction(MachineFunction &MF) {
				ST = &MF.getSubtarget<RISCVSubtarget>();
				if (!ST->hasVInstructions())
				return false;

				MRI = &MF.getRegInfo();
				TII = ST->getInstrInfo();
				TRI = MRI->getTargetRegisterInfo();

				bool Changed = false;
				DeadLaneDetector DLD(MRI, TRI);
				DLD.computeSubRegisterLaneBitInfo();

				for (MachineBasicBlock &BB : MF)
				Changed |= processBasicBlock(MF, BB, DLD);

				return Changed;
				}

				FunctionPass *llvm::createRISCVInitUndefPass() { return new RISCVInitUndef(); }
				craig.topper (Not Done): Candidata -> Candidate?
				craig.topper (Not Done): createVirtualRegister returns `Register` not `unsigned`
				craig.topper (Not Done): createVirtualRegister returns `Register` not `unsigned`
				craig.topper (Not Done): unsigned -> Register
				craig.topper (Not Done): unsigned -> Register
				craig.topper (Not Done): "the " is unnecessary in this sentence
				craig.topper (Not Done): Should this be `std::unique_ptr<VRegInfo[]>` since it points to an array?
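
jrtc27's remark above concerns erasing from a block while iterating over it. A minimal standalone sketch of that general hazard follows, using `std::list<int>` as a hypothetical stand-in for `MachineBasicBlock`. The function names and the convention that negative values are the elements to erase are illustrative assumptions, not part of the patch.

```cpp
#include <cassert>
#include <list>
#include <vector>

// Loop shape where the body erases and the loop header still increments.
// Returns the elements the loop body actually looked at.
std::vector<int> visitWithHeaderIncrement(std::list<int> Block) {
  std::vector<int> Visited;
  for (auto I = Block.begin(); I != Block.end(); ++I) {
    Visited.push_back(*I);
    if (*I < 0) {
      // erase() returns an iterator to the element after the erased one,
      // so the ++I in the loop header then steps past that element.
      I = Block.erase(I);
      if (I == Block.end())
        break; // Incrementing end() would be undefined behaviour.
    }
  }
  return Visited;
}

// Loop shape that advances only when nothing was erased.
std::vector<int> visitWithManualAdvance(std::list<int> Block) {
  std::vector<int> Visited;
  for (auto I = Block.begin(); I != Block.end();) {
    Visited.push_back(*I);
    if (*I < 0)
      I = Block.erase(I); // Already points at the next element.
    else
      ++I;
  }
  return Visited;
}
```

With the input `{1, -2, 3, 4}`, the first shape never looks at `3`, while the second visits every element. Whether the same skip matters for this pass depends on which instruction follows the erased `IMPLICIT_DEF`.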

llvm/lib/Target/RISCV/RISCVTargetMachine.cpp

[75 lines not shown]	extern "C" LLVM_EXTERNAL_VISIBILITY void LLVMInitializeRISCVTarget() {
initializeRISCVCodeGenPreparePass(*PR);		initializeRISCVCodeGenPreparePass(*PR);
initializeRISCVMergeBaseOffsetOptPass(*PR);		initializeRISCVMergeBaseOffsetOptPass(*PR);
initializeRISCVSExtWRemovalPass(*PR);		initializeRISCVSExtWRemovalPass(*PR);
initializeRISCVStripWSuffixPass(*PR);		initializeRISCVStripWSuffixPass(*PR);
initializeRISCVPreRAExpandPseudoPass(*PR);		initializeRISCVPreRAExpandPseudoPass(*PR);
initializeRISCVExpandPseudoPass(*PR);		initializeRISCVExpandPseudoPass(*PR);
initializeRISCVInsertVSETVLIPass(*PR);		initializeRISCVInsertVSETVLIPass(*PR);
initializeRISCVDAGToDAGISelPass(*PR);		initializeRISCVDAGToDAGISelPass(*PR);
		initializeRISCVInitUndefPass(*PR);
}		}

static StringRef computeDataLayout(const Triple &TT) {		static StringRef computeDataLayout(const Triple &TT) {
if (TT.isArch64Bit())		if (TT.isArch64Bit())
return "e-m:e-p:64:64-i64:64-i128:128-n32:64-S128";		return "e-m:e-p:64:64-i64:64-i128:128-n32:64-S128";
assert(TT.isArch32Bit() && "only RV32 and RV64 are currently supported");		assert(TT.isArch32Bit() && "only RV32 and RV64 are currently supported");
return "e-m:e-p:32:32-i64:64-n32-S128";		return "e-m:e-p:32:32-i64:64-n32-S128";
}		}
[163 lines not shown]	public:
bool addRegBankSelect() override;		bool addRegBankSelect() override;
bool addGlobalInstructionSelect() override;		bool addGlobalInstructionSelect() override;
void addPreEmitPass() override;		void addPreEmitPass() override;
void addPreEmitPass2() override;		void addPreEmitPass2() override;
void addPreSched2() override;		void addPreSched2() override;
void addMachineSSAOptimization() override;		void addMachineSSAOptimization() override;
void addPreRegAlloc() override;		void addPreRegAlloc() override;
void addPostRegAlloc() override;		void addPostRegAlloc() override;
		void addOptimizedRegAlloc() override;
};		};
} // namespace		} // namespace

TargetPassConfig *RISCVTargetMachine::createPassConfig(PassManagerBase &PM) {		TargetPassConfig *RISCVTargetMachine::createPassConfig(PassManagerBase &PM) {
return new RISCVPassConfig(*this, PM);		return new RISCVPassConfig(*this, PM);
}		}

void RISCVPassConfig::addIRPasses() {		void RISCVPassConfig::addIRPasses() {
[79 lines not shown]

void RISCVPassConfig::addPreRegAlloc() {		void RISCVPassConfig::addPreRegAlloc() {
addPass(createRISCVPreRAExpandPseudoPass());		addPass(createRISCVPreRAExpandPseudoPass());
if (TM->getOptLevel() != CodeGenOpt::None)		if (TM->getOptLevel() != CodeGenOpt::None)
addPass(createRISCVMergeBaseOffsetOptPass());		addPass(createRISCVMergeBaseOffsetOptPass());
addPass(createRISCVInsertVSETVLIPass());		addPass(createRISCVInsertVSETVLIPass());
}		}

		void RISCVPassConfig::addOptimizedRegAlloc() {
		if (getOptimizeRegAlloc())
		insertPass(&DetectDeadLanesID, &RISCVInitUndefID);

		TargetPassConfig::addOptimizedRegAlloc();
		}

void RISCVPassConfig::addPostRegAlloc() {		void RISCVPassConfig::addPostRegAlloc() {
if (TM->getOptLevel() != CodeGenOpt::None && EnableRedundantCopyElimination)		if (TM->getOptLevel() != CodeGenOpt::None && EnableRedundantCopyElimination)
addPass(createRISCVRedundantCopyEliminationPass());		addPass(createRISCVRedundantCopyEliminationPass());
}		}

yaml::MachineFunctionInfo *		yaml::MachineFunctionInfo *
RISCVTargetMachine::createDefaultFuncInfoYAML() const {		RISCVTargetMachine::createDefaultFuncInfoYAML() const {
return new yaml::RISCVMachineFunctionInfo();		return new yaml::RISCVMachineFunctionInfo();
[16 lines not shown]

llvm/test/CodeGen/RISCV/O3-pipeline.ll

	[102 lines not shown]
	; CHECK-NEXT: Machine Trace Metrics			; CHECK-NEXT: Machine Trace Metrics
	; CHECK-NEXT: Lazy Machine Block Frequency Analysis			; CHECK-NEXT: Lazy Machine Block Frequency Analysis
	; CHECK-NEXT: Machine InstCombiner			; CHECK-NEXT: Machine InstCombiner
	; RV64-NEXT: RISCV sext.w Removal			; RV64-NEXT: RISCV sext.w Removal
	; RV64-NEXT: RISCV Strip W Suffix			; RV64-NEXT: RISCV Strip W Suffix
	; CHECK-NEXT: RISCV Pre-RA pseudo instruction expansion pass			; CHECK-NEXT: RISCV Pre-RA pseudo instruction expansion pass
	; CHECK-NEXT: RISCV Merge Base Offset			; CHECK-NEXT: RISCV Merge Base Offset
	; CHECK-NEXT: RISCV Insert VSETVLI pass			; CHECK-NEXT: RISCV Insert VSETVLI pass
	; CHECK-NEXT: Detect Dead Lanes			; CHECK-NEXT: Detect Dead Lanes
				craig.topper (Not Done): If I understand correctly, we're effectively running DetectDeadLanes inside of RISCV init undef pass and then running the real DetectDeadLanes pass which won't do anything because we already did it?
				craig.topper (Not Done): Can we run DetectDeadLanes, then run our pass and just use the portion of the DetectDeadLanes that computes the Lane Masks in our pass?
				; CHECK-NEXT: RISCV init undef pass
	; CHECK-NEXT: Process Implicit Definitions			; CHECK-NEXT: Process Implicit Definitions
	; CHECK-NEXT: Remove unreachable machine basic blocks			; CHECK-NEXT: Remove unreachable machine basic blocks
	; CHECK-NEXT: Live Variable Analysis			; CHECK-NEXT: Live Variable Analysis
	; CHECK-NEXT: Eliminate PHI nodes for register allocation			; CHECK-NEXT: Eliminate PHI nodes for register allocation
	; CHECK-NEXT: Two-Address instruction pass			; CHECK-NEXT: Two-Address instruction pass
	; CHECK-NEXT: MachineDominator Tree Construction			; CHECK-NEXT: MachineDominator Tree Construction
	; CHECK-NEXT: Slot index numbering			; CHECK-NEXT: Slot index numbering
	; CHECK-NEXT: Live Interval Analysis			; CHECK-NEXT: Live Interval Analysis
	[63 lines not shown]

llvm/test/CodeGen/RISCV/regalloc-last-chance-recoloring-failure.ll

	[20 lines not shown]
	; CHECK-NEXT: .cfi_offset s0, -16			; CHECK-NEXT: .cfi_offset s0, -16
	; CHECK-NEXT: csrr a0, vlenb			; CHECK-NEXT: csrr a0, vlenb
	; CHECK-NEXT: li a1, 24			; CHECK-NEXT: li a1, 24
	; CHECK-NEXT: mul a0, a0, a1			; CHECK-NEXT: mul a0, a0, a1
	; CHECK-NEXT: sub sp, sp, a0			; CHECK-NEXT: sub sp, sp, a0
	; CHECK-NEXT: .cfi_escape 0x0f, 0x0d, 0x72, 0x00, 0x11, 0x20, 0x22, 0x11, 0x18, 0x92, 0xa2, 0x38, 0x00, 0x1e, 0x22 # sp + 32 + 24 * vlenb			; CHECK-NEXT: .cfi_escape 0x0f, 0x0d, 0x72, 0x00, 0x11, 0x20, 0x22, 0x11, 0x18, 0x92, 0xa2, 0x38, 0x00, 0x1e, 0x22 # sp + 32 + 24 * vlenb
	; CHECK-NEXT: li a0, 55			; CHECK-NEXT: li a0, 55
	; CHECK-NEXT: vsetvli zero, a0, e16, m4, ta, ma			; CHECK-NEXT: vsetvli zero, a0, e16, m4, ta, ma
	; CHECK-NEXT: vloxseg2ei32.v v8, (a0), v8			; CHECK-NEXT: vloxseg2ei32.v v16, (a0), v8
				craig.topper (Not Done): Are we treating insert_subreg for segment load tuples the same as inserting a small LMUL into a wider LMUL?
				BeMg (Author, Done): This pass doesn't consider a segment load to be an instruction that assigns a sub-register. The following INSERT_SUBREG works like putting %5:vrm2 into %6:vrm4:
				    %1:vrm4 = IMPLICIT_DEF
				    %5:vrm2 = PseudoVLE32_V_M2 killed %4, 0, 5 /* e32 */
				    %6:vrm4 = INSERT_SUBREG %1, %5, %subreg.sub_vrm2_0
				Should we treat vloxseg2ei32 as INSERT_SUBREG in this patch?
				craig.topper (Not Done): Nevermind, I didn't realize this test had so many `undef` and `poison` operands. I suspect llvm-reduce or bugpoint. I dislike tests with undef/poison operands. It makes things very fragile. It would be legal for DAG combine to delete a large portion of this test.
	; CHECK-NEXT: csrr a0, vlenb			; CHECK-NEXT: csrr a0, vlenb
	; CHECK-NEXT: slli a0, a0, 3			; CHECK-NEXT: slli a0, a0, 3
	; CHECK-NEXT: add a0, sp, a0			; CHECK-NEXT: add a0, sp, a0
	; CHECK-NEXT: addi a0, a0, 16			; CHECK-NEXT: addi a0, a0, 16
	; CHECK-NEXT: csrr a1, vlenb			; CHECK-NEXT: csrr a1, vlenb
	; CHECK-NEXT: slli a1, a1, 2			; CHECK-NEXT: slli a1, a1, 2
	; CHECK-NEXT: vs4r.v v8, (a0) # Unknown-size Folded Spill			; CHECK-NEXT: vs4r.v v16, (a0) # Unknown-size Folded Spill
	; CHECK-NEXT: add a0, a0, a1			; CHECK-NEXT: add a0, a0, a1
	; CHECK-NEXT: vs4r.v v12, (a0) # Unknown-size Folded Spill			; CHECK-NEXT: vs4r.v v20, (a0) # Unknown-size Folded Spill
	; CHECK-NEXT: vsetvli a0, zero, e8, m2, ta, ma			; CHECK-NEXT: vsetvli a0, zero, e8, m2, ta, ma
	; CHECK-NEXT: vmclr.m v0			; CHECK-NEXT: vmclr.m v0
	; CHECK-NEXT: li s0, 36			; CHECK-NEXT: li s0, 36
	; CHECK-NEXT: vsetvli zero, s0, e16, m4, ta, ma			; CHECK-NEXT: vsetvli zero, s0, e16, m4, ta, ma
	; CHECK-NEXT: vfwadd.vv v8, v8, v8, v0.t			; CHECK-NEXT: vfwadd.vv v16, v8, v8, v0.t
	; CHECK-NEXT: csrr a0, vlenb			; CHECK-NEXT: csrr a0, vlenb
	; CHECK-NEXT: slli a0, a0, 4			; CHECK-NEXT: slli a0, a0, 4
	; CHECK-NEXT: add a0, sp, a0			; CHECK-NEXT: add a0, sp, a0
	; CHECK-NEXT: addi a0, a0, 16			; CHECK-NEXT: addi a0, a0, 16
	; CHECK-NEXT: vs8r.v v8, (a0) # Unknown-size Folded Spill			; CHECK-NEXT: vs8r.v v16, (a0) # Unknown-size Folded Spill
	; CHECK-NEXT: call func@plt			; CHECK-NEXT: call func@plt
	; CHECK-NEXT: li a0, 32			; CHECK-NEXT: li a0, 32
	; CHECK-NEXT: vsetvli zero, a0, e16, m4, ta, ma			; CHECK-NEXT: vsetvli zero, a0, e16, m4, ta, ma
	; CHECK-NEXT: vrgather.vv v4, v8, v8, v0.t			; CHECK-NEXT: vrgather.vv v4, v8, v8, v0.t
	; CHECK-NEXT: vsetvli zero, s0, e16, m4, ta, ma			; CHECK-NEXT: vsetvli zero, s0, e16, m4, ta, ma
	; CHECK-NEXT: csrr a1, vlenb			; CHECK-NEXT: csrr a1, vlenb
	; CHECK-NEXT: slli a1, a1, 3			; CHECK-NEXT: slli a1, a1, 3
	; CHECK-NEXT: add a1, sp, a1			; CHECK-NEXT: add a1, sp, a1
	[41 lines not shown]
	; SUBREGLIVENESS-NEXT: .cfi_offset ra, -8			; SUBREGLIVENESS-NEXT: .cfi_offset ra, -8
	; SUBREGLIVENESS-NEXT: .cfi_offset s0, -16			; SUBREGLIVENESS-NEXT: .cfi_offset s0, -16
	; SUBREGLIVENESS-NEXT: csrr a0, vlenb			; SUBREGLIVENESS-NEXT: csrr a0, vlenb
	; SUBREGLIVENESS-NEXT: slli a0, a0, 4			; SUBREGLIVENESS-NEXT: slli a0, a0, 4
	; SUBREGLIVENESS-NEXT: sub sp, sp, a0			; SUBREGLIVENESS-NEXT: sub sp, sp, a0
	; SUBREGLIVENESS-NEXT: .cfi_escape 0x0f, 0x0d, 0x72, 0x00, 0x11, 0x20, 0x22, 0x11, 0x10, 0x92, 0xa2, 0x38, 0x00, 0x1e, 0x22 # sp + 32 + 16 * vlenb			; SUBREGLIVENESS-NEXT: .cfi_escape 0x0f, 0x0d, 0x72, 0x00, 0x11, 0x20, 0x22, 0x11, 0x10, 0x92, 0xa2, 0x38, 0x00, 0x1e, 0x22 # sp + 32 + 16 * vlenb
	; SUBREGLIVENESS-NEXT: li a0, 55			; SUBREGLIVENESS-NEXT: li a0, 55
	; SUBREGLIVENESS-NEXT: vsetvli zero, a0, e16, m4, ta, ma			; SUBREGLIVENESS-NEXT: vsetvli zero, a0, e16, m4, ta, ma
	; SUBREGLIVENESS-NEXT: vloxseg2ei32.v v8, (a0), v8			; SUBREGLIVENESS-NEXT: vloxseg2ei32.v v16, (a0), v8
	; SUBREGLIVENESS-NEXT: csrr a0, vlenb			; SUBREGLIVENESS-NEXT: csrr a0, vlenb
	; SUBREGLIVENESS-NEXT: slli a0, a0, 3			; SUBREGLIVENESS-NEXT: slli a0, a0, 3
	; SUBREGLIVENESS-NEXT: add a0, sp, a0			; SUBREGLIVENESS-NEXT: add a0, sp, a0
	; SUBREGLIVENESS-NEXT: addi a0, a0, 16			; SUBREGLIVENESS-NEXT: addi a0, a0, 16
	; SUBREGLIVENESS-NEXT: csrr a1, vlenb			; SUBREGLIVENESS-NEXT: csrr a1, vlenb
	; SUBREGLIVENESS-NEXT: slli a1, a1, 2			; SUBREGLIVENESS-NEXT: slli a1, a1, 2
	; SUBREGLIVENESS-NEXT: vs4r.v v8, (a0) # Unknown-size Folded Spill			; SUBREGLIVENESS-NEXT: vs4r.v v16, (a0) # Unknown-size Folded Spill
	; SUBREGLIVENESS-NEXT: add a0, a0, a1			; SUBREGLIVENESS-NEXT: add a0, a0, a1
	; SUBREGLIVENESS-NEXT: vs4r.v v12, (a0) # Unknown-size Folded Spill			; SUBREGLIVENESS-NEXT: vs4r.v v20, (a0) # Unknown-size Folded Spill
	; SUBREGLIVENESS-NEXT: vsetvli a0, zero, e8, m2, ta, ma			; SUBREGLIVENESS-NEXT: vsetvli a0, zero, e8, m2, ta, ma
	; SUBREGLIVENESS-NEXT: vmclr.m v0			; SUBREGLIVENESS-NEXT: vmclr.m v0
	; SUBREGLIVENESS-NEXT: li s0, 36			; SUBREGLIVENESS-NEXT: li s0, 36
	; SUBREGLIVENESS-NEXT: vsetvli zero, s0, e16, m4, ta, ma			; SUBREGLIVENESS-NEXT: vsetvli zero, s0, e16, m4, ta, ma
	; SUBREGLIVENESS-NEXT: vfwadd.vv v8, v8, v8, v0.t			; SUBREGLIVENESS-NEXT: vfwadd.vv v16, v8, v8, v0.t
	; SUBREGLIVENESS-NEXT: addi a0, sp, 16			; SUBREGLIVENESS-NEXT: addi a0, sp, 16
	; SUBREGLIVENESS-NEXT: vs8r.v v8, (a0) # Unknown-size Folded Spill			; SUBREGLIVENESS-NEXT: vs8r.v v16, (a0) # Unknown-size Folded Spill
	; SUBREGLIVENESS-NEXT: call func@plt			; SUBREGLIVENESS-NEXT: call func@plt
	; SUBREGLIVENESS-NEXT: li a0, 32			; SUBREGLIVENESS-NEXT: li a0, 32
	; SUBREGLIVENESS-NEXT: vsetvli zero, a0, e16, m4, ta, ma			; SUBREGLIVENESS-NEXT: vsetvli zero, a0, e16, m4, ta, ma
	; SUBREGLIVENESS-NEXT: vrgather.vv v16, v8, v8, v0.t			; SUBREGLIVENESS-NEXT: vrgather.vv v16, v8, v8, v0.t
	; SUBREGLIVENESS-NEXT: vsetvli zero, s0, e16, m4, ta, ma			; SUBREGLIVENESS-NEXT: vsetvli zero, s0, e16, m4, ta, ma
	; SUBREGLIVENESS-NEXT: csrr a1, vlenb			; SUBREGLIVENESS-NEXT: csrr a1, vlenb
	; SUBREGLIVENESS-NEXT: slli a1, a1, 3			; SUBREGLIVENESS-NEXT: slli a1, a1, 3
	; SUBREGLIVENESS-NEXT: add a1, sp, a1			; SUBREGLIVENESS-NEXT: add a1, sp, a1
	[42 lines not shown]

llvm/test/CodeGen/RISCV/rvv/subregister-undef-early-clobber.mir

	# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py			# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py
	# RUN: llc %s -mtriple=riscv64 -mattr=+v -riscv-enable-subreg-liveness -run-pass=none -o - | FileCheck %s			# RUN: llc %s -mtriple=riscv64 -mattr=+v -riscv-enable-subreg-liveness -run-pass=riscv-init-undef -o - | FileCheck %s

	...			...
	---			---
	name: test_M4_sub_vrm1_0			name: test_M4_sub_vrm1_0
	body: |			body: |
	bb.0.entry:			bb.0.entry:
	; CHECK-LABEL: name: test_M4_sub_vrm1_0			; CHECK-LABEL: name: test_M4_sub_vrm1_0
	; CHECK: [[DEF:%[0-9]+]]:vrm4 = IMPLICIT_DEF			; CHECK: [[DEF:%[0-9]+]]:vrm4 = IMPLICIT_DEF
	; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8			; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8
	; CHECK-NEXT: [[PseudoVLE32_V_M1_:%[0-9]+]]:vr = PseudoVLE32_V_M1 killed [[ADDI]], 0, 5 /* e32 */			; CHECK-NEXT: [[PseudoVLE32_V_M1_:%[0-9]+]]:vr = PseudoVLE32_V_M1 killed [[ADDI]], 0, 5 /* e32 */
	; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm4 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M1_]], %subreg.sub_vrm1_0			; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm4 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M1_]], %subreg.sub_vrm1_0
	; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype			; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype
	; CHECK-NEXT: early-clobber %4:vrm4 = PseudoVRGATHER_VI_M4 killed [[INSERT_SUBREG]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: [[PseudoRVVInitUndefM2_:%[0-9]+]]:vrm2 = PseudoRVVInitUndefM2
				; CHECK-NEXT: [[INSERT_SUBREG1:%[0-9]+]]:vrm4 = INSERT_SUBREG [[INSERT_SUBREG]], [[PseudoRVVInitUndefM2_]], %subreg.sub_vrm2_1
				; CHECK-NEXT: [[PseudoRVVInitUndefM1_:%[0-9]+]]:vr = PseudoRVVInitUndefM1
				; CHECK-NEXT: [[INSERT_SUBREG2:%[0-9]+]]:vrm4 = INSERT_SUBREG [[INSERT_SUBREG1]], [[PseudoRVVInitUndefM1_]], %subreg.sub_vrm1_1
				; CHECK-NEXT: early-clobber %4:vrm4 = PseudoVRGATHER_VI_M4 killed [[INSERT_SUBREG2]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0			; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0
	; CHECK-NEXT: PseudoVSE32_V_M4 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: PseudoVSE32_V_M4 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0			; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0
	; CHECK-NEXT: $x10 = COPY [[COPY]]			; CHECK-NEXT: $x10 = COPY [[COPY]]
	; CHECK-NEXT: PseudoRET implicit $x10			; CHECK-NEXT: PseudoRET implicit $x10
	%1:vrm4 = IMPLICIT_DEF			%1:vrm4 = IMPLICIT_DEF
	%7:gpr = ADDI $x0, 8			%7:gpr = ADDI $x0, 8
	%5:vr = PseudoVLE32_V_M1 killed %7:gpr, 0, 5			%5:vr = PseudoVLE32_V_M1 killed %7:gpr, 0, 5
	[13 lines not shown]
	body: |			body: |
	bb.0.entry:			bb.0.entry:
	; CHECK-LABEL: name: test_M4_sub_vrm1_1			; CHECK-LABEL: name: test_M4_sub_vrm1_1
	; CHECK: [[DEF:%[0-9]+]]:vrm4 = IMPLICIT_DEF			; CHECK: [[DEF:%[0-9]+]]:vrm4 = IMPLICIT_DEF
	; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8			; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8
	; CHECK-NEXT: [[PseudoVLE32_V_M1_:%[0-9]+]]:vr = PseudoVLE32_V_M1 killed [[ADDI]], 0, 5 /* e32 */			; CHECK-NEXT: [[PseudoVLE32_V_M1_:%[0-9]+]]:vr = PseudoVLE32_V_M1 killed [[ADDI]], 0, 5 /* e32 */
	; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm4 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M1_]], %subreg.sub_vrm1_1			; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm4 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M1_]], %subreg.sub_vrm1_1
	; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype			; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype
	; CHECK-NEXT: early-clobber %4:vrm4 = PseudoVRGATHER_VI_M4 killed [[INSERT_SUBREG]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: [[PseudoRVVInitUndefM2_:%[0-9]+]]:vrm2 = PseudoRVVInitUndefM2
				; CHECK-NEXT: [[INSERT_SUBREG1:%[0-9]+]]:vrm4 = INSERT_SUBREG [[INSERT_SUBREG]], [[PseudoRVVInitUndefM2_]], %subreg.sub_vrm2_1
				; CHECK-NEXT: [[PseudoRVVInitUndefM1_:%[0-9]+]]:vr = PseudoRVVInitUndefM1
				; CHECK-NEXT: [[INSERT_SUBREG2:%[0-9]+]]:vrm4 = INSERT_SUBREG [[INSERT_SUBREG1]], [[PseudoRVVInitUndefM1_]], %subreg.sub_vrm1_0
				; CHECK-NEXT: early-clobber %4:vrm4 = PseudoVRGATHER_VI_M4 killed [[INSERT_SUBREG2]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0			; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0
	; CHECK-NEXT: PseudoVSE32_V_M4 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: PseudoVSE32_V_M4 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0			; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0
	; CHECK-NEXT: $x10 = COPY [[COPY]]			; CHECK-NEXT: $x10 = COPY [[COPY]]
	; CHECK-NEXT: PseudoRET implicit $x10			; CHECK-NEXT: PseudoRET implicit $x10
	%1:vrm4 = IMPLICIT_DEF			%1:vrm4 = IMPLICIT_DEF
	%7:gpr = ADDI $x0, 8			%7:gpr = ADDI $x0, 8
	%5:vr = PseudoVLE32_V_M1 killed %7:gpr, 0, 5			%5:vr = PseudoVLE32_V_M1 killed %7:gpr, 0, 5
	Show All 13 Lines
	body: |			body: |
	bb.0.entry:			bb.0.entry:
	; CHECK-LABEL: name: test_M4_sub_vrm1_2			; CHECK-LABEL: name: test_M4_sub_vrm1_2
	; CHECK: [[DEF:%[0-9]+]]:vrm4 = IMPLICIT_DEF			; CHECK: [[DEF:%[0-9]+]]:vrm4 = IMPLICIT_DEF
	; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8			; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8
	; CHECK-NEXT: [[PseudoVLE32_V_M1_:%[0-9]+]]:vr = PseudoVLE32_V_M1 killed [[ADDI]], 0, 5 /* e32 */			; CHECK-NEXT: [[PseudoVLE32_V_M1_:%[0-9]+]]:vr = PseudoVLE32_V_M1 killed [[ADDI]], 0, 5 /* e32 */
	; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm4 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M1_]], %subreg.sub_vrm1_2			; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm4 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M1_]], %subreg.sub_vrm1_2
	; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype			; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype
	; CHECK-NEXT: early-clobber %4:vrm4 = PseudoVRGATHER_VI_M4 killed [[INSERT_SUBREG]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: [[PseudoRVVInitUndefM2_:%[0-9]+]]:vrm2 = PseudoRVVInitUndefM2
				; CHECK-NEXT: [[INSERT_SUBREG1:%[0-9]+]]:vrm4 = INSERT_SUBREG [[INSERT_SUBREG]], [[PseudoRVVInitUndefM2_]], %subreg.sub_vrm2_0
				; CHECK-NEXT: [[PseudoRVVInitUndefM1_:%[0-9]+]]:vr = PseudoRVVInitUndefM1
				; CHECK-NEXT: [[INSERT_SUBREG2:%[0-9]+]]:vrm4 = INSERT_SUBREG [[INSERT_SUBREG1]], [[PseudoRVVInitUndefM1_]], %subreg.sub_vrm1_3
				; CHECK-NEXT: early-clobber %4:vrm4 = PseudoVRGATHER_VI_M4 killed [[INSERT_SUBREG2]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0			; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0
	; CHECK-NEXT: PseudoVSE32_V_M4 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: PseudoVSE32_V_M4 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0			; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0
	; CHECK-NEXT: $x10 = COPY [[COPY]]			; CHECK-NEXT: $x10 = COPY [[COPY]]
	; CHECK-NEXT: PseudoRET implicit $x10			; CHECK-NEXT: PseudoRET implicit $x10
	%1:vrm4 = IMPLICIT_DEF			%1:vrm4 = IMPLICIT_DEF
	%7:gpr = ADDI $x0, 8			%7:gpr = ADDI $x0, 8
	%5:vr = PseudoVLE32_V_M1 killed %7:gpr, 0, 5			%5:vr = PseudoVLE32_V_M1 killed %7:gpr, 0, 5
	Show All 13 Lines
	body: |			body: |
	bb.0.entry:			bb.0.entry:
	; CHECK-LABEL: name: test_M4_sub_vrm1_3			; CHECK-LABEL: name: test_M4_sub_vrm1_3
	; CHECK: [[DEF:%[0-9]+]]:vrm4 = IMPLICIT_DEF			; CHECK: [[DEF:%[0-9]+]]:vrm4 = IMPLICIT_DEF
	; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8			; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8
	; CHECK-NEXT: [[PseudoVLE32_V_M1_:%[0-9]+]]:vr = PseudoVLE32_V_M1 killed [[ADDI]], 0, 5 /* e32 */			; CHECK-NEXT: [[PseudoVLE32_V_M1_:%[0-9]+]]:vr = PseudoVLE32_V_M1 killed [[ADDI]], 0, 5 /* e32 */
	; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm4 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M1_]], %subreg.sub_vrm1_3			; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm4 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M1_]], %subreg.sub_vrm1_3
	; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype			; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype
	; CHECK-NEXT: early-clobber %4:vrm4 = PseudoVRGATHER_VI_M4 killed [[INSERT_SUBREG]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: [[PseudoRVVInitUndefM2_:%[0-9]+]]:vrm2 = PseudoRVVInitUndefM2
				; CHECK-NEXT: [[INSERT_SUBREG1:%[0-9]+]]:vrm4 = INSERT_SUBREG [[INSERT_SUBREG]], [[PseudoRVVInitUndefM2_]], %subreg.sub_vrm2_0
				; CHECK-NEXT: [[PseudoRVVInitUndefM1_:%[0-9]+]]:vr = PseudoRVVInitUndefM1
				; CHECK-NEXT: [[INSERT_SUBREG2:%[0-9]+]]:vrm4 = INSERT_SUBREG [[INSERT_SUBREG1]], [[PseudoRVVInitUndefM1_]], %subreg.sub_vrm1_2
				; CHECK-NEXT: early-clobber %4:vrm4 = PseudoVRGATHER_VI_M4 killed [[INSERT_SUBREG2]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0			; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0
	; CHECK-NEXT: PseudoVSE32_V_M4 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: PseudoVSE32_V_M4 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0			; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0
	; CHECK-NEXT: $x10 = COPY [[COPY]]			; CHECK-NEXT: $x10 = COPY [[COPY]]
	; CHECK-NEXT: PseudoRET implicit $x10			; CHECK-NEXT: PseudoRET implicit $x10
	%1:vrm4 = IMPLICIT_DEF			%1:vrm4 = IMPLICIT_DEF
	%7:gpr = ADDI $x0, 8			%7:gpr = ADDI $x0, 8
	%5:vr = PseudoVLE32_V_M1 killed %7:gpr, 0, 5			%5:vr = PseudoVLE32_V_M1 killed %7:gpr, 0, 5
	Show All 13 Lines
	body: |			body: |
	bb.0.entry:			bb.0.entry:
	; CHECK-LABEL: name: test_M4_sub_vrm2_0			; CHECK-LABEL: name: test_M4_sub_vrm2_0
	; CHECK: [[DEF:%[0-9]+]]:vrm4 = IMPLICIT_DEF			; CHECK: [[DEF:%[0-9]+]]:vrm4 = IMPLICIT_DEF
	; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8			; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8
	; CHECK-NEXT: [[PseudoVLE32_V_M2_:%[0-9]+]]:vrm2 = PseudoVLE32_V_M2 killed [[ADDI]], 0, 5 /* e32 */			; CHECK-NEXT: [[PseudoVLE32_V_M2_:%[0-9]+]]:vrm2 = PseudoVLE32_V_M2 killed [[ADDI]], 0, 5 /* e32 */
	; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm4 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M2_]], %subreg.sub_vrm2_0			; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm4 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M2_]], %subreg.sub_vrm2_0
	; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype			; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype
	; CHECK-NEXT: early-clobber %4:vrm4 = PseudoVRGATHER_VI_M4 killed [[INSERT_SUBREG]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: [[PseudoRVVInitUndefM2_:%[0-9]+]]:vrm2 = PseudoRVVInitUndefM2
				; CHECK-NEXT: [[INSERT_SUBREG1:%[0-9]+]]:vrm4 = INSERT_SUBREG [[INSERT_SUBREG]], [[PseudoRVVInitUndefM2_]], %subreg.sub_vrm2_1
				; CHECK-NEXT: early-clobber %4:vrm4 = PseudoVRGATHER_VI_M4 killed [[INSERT_SUBREG1]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0			; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0
	; CHECK-NEXT: PseudoVSE32_V_M4 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: PseudoVSE32_V_M4 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0			; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0
	; CHECK-NEXT: $x10 = COPY [[COPY]]			; CHECK-NEXT: $x10 = COPY [[COPY]]
	; CHECK-NEXT: PseudoRET implicit $x10			; CHECK-NEXT: PseudoRET implicit $x10
	%1:vrm4 = IMPLICIT_DEF			%1:vrm4 = IMPLICIT_DEF
	%7:gpr = ADDI $x0, 8			%7:gpr = ADDI $x0, 8
	%5:vrm2 = PseudoVLE32_V_M2 killed %7:gpr, 0, 5			%5:vrm2 = PseudoVLE32_V_M2 killed %7:gpr, 0, 5
	Show All 13 Lines
	body: |			body: |
	bb.0.entry:			bb.0.entry:
	; CHECK-LABEL: name: test_M4_sub_vrm2_1			; CHECK-LABEL: name: test_M4_sub_vrm2_1
	; CHECK: [[DEF:%[0-9]+]]:vrm4 = IMPLICIT_DEF			; CHECK: [[DEF:%[0-9]+]]:vrm4 = IMPLICIT_DEF
	; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8			; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8
	; CHECK-NEXT: [[PseudoVLE32_V_M2_:%[0-9]+]]:vrm2 = PseudoVLE32_V_M2 killed [[ADDI]], 0, 5 /* e32 */			; CHECK-NEXT: [[PseudoVLE32_V_M2_:%[0-9]+]]:vrm2 = PseudoVLE32_V_M2 killed [[ADDI]], 0, 5 /* e32 */
	; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm4 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M2_]], %subreg.sub_vrm2_1			; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm4 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M2_]], %subreg.sub_vrm2_1
	; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype			; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype
	; CHECK-NEXT: early-clobber %4:vrm4 = PseudoVRGATHER_VI_M4 killed [[INSERT_SUBREG]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: [[PseudoRVVInitUndefM2_:%[0-9]+]]:vrm2 = PseudoRVVInitUndefM2
				; CHECK-NEXT: [[INSERT_SUBREG1:%[0-9]+]]:vrm4 = INSERT_SUBREG [[INSERT_SUBREG]], [[PseudoRVVInitUndefM2_]], %subreg.sub_vrm2_0
				; CHECK-NEXT: early-clobber %4:vrm4 = PseudoVRGATHER_VI_M4 killed [[INSERT_SUBREG1]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0			; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0
	; CHECK-NEXT: PseudoVSE32_V_M4 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: PseudoVSE32_V_M4 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0			; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0
	; CHECK-NEXT: $x10 = COPY [[COPY]]			; CHECK-NEXT: $x10 = COPY [[COPY]]
	; CHECK-NEXT: PseudoRET implicit $x10			; CHECK-NEXT: PseudoRET implicit $x10
	%1:vrm4 = IMPLICIT_DEF			%1:vrm4 = IMPLICIT_DEF
	%7:gpr = ADDI $x0, 8			%7:gpr = ADDI $x0, 8
	%5:vrm2 = PseudoVLE32_V_M2 killed %7:gpr, 0, 5			%5:vrm2 = PseudoVLE32_V_M2 killed %7:gpr, 0, 5
	Show All 14 Lines
	body: |			body: |
	bb.0.entry:			bb.0.entry:
	; CHECK-LABEL: name: test_M8_sub_vrm1_0			; CHECK-LABEL: name: test_M8_sub_vrm1_0
	; CHECK: [[DEF:%[0-9]+]]:vrm8 = IMPLICIT_DEF			; CHECK: [[DEF:%[0-9]+]]:vrm8 = IMPLICIT_DEF
	; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8			; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8
	; CHECK-NEXT: [[PseudoVLE32_V_M1_:%[0-9]+]]:vr = PseudoVLE32_V_M1 killed [[ADDI]], 0, 5 /* e32 */			; CHECK-NEXT: [[PseudoVLE32_V_M1_:%[0-9]+]]:vr = PseudoVLE32_V_M1 killed [[ADDI]], 0, 5 /* e32 */
	; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm8 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M1_]], %subreg.sub_vrm1_0			; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm8 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M1_]], %subreg.sub_vrm1_0
	; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype			; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype
	; CHECK-NEXT: early-clobber %4:vrm8 = PseudoVRGATHER_VI_M8 killed [[INSERT_SUBREG]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: [[PseudoRVVInitUndefM4_:%[0-9]+]]:vrm4 = PseudoRVVInitUndefM4
				; CHECK-NEXT: [[INSERT_SUBREG1:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG]], [[PseudoRVVInitUndefM4_]], %subreg.sub_vrm4_1
				; CHECK-NEXT: [[PseudoRVVInitUndefM2_:%[0-9]+]]:vrm2 = PseudoRVVInitUndefM2
				; CHECK-NEXT: [[INSERT_SUBREG2:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG1]], [[PseudoRVVInitUndefM2_]], %subreg.sub_vrm2_1
				; CHECK-NEXT: [[PseudoRVVInitUndefM1_:%[0-9]+]]:vr = PseudoRVVInitUndefM1
				; CHECK-NEXT: [[INSERT_SUBREG3:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG2]], [[PseudoRVVInitUndefM1_]], %subreg.sub_vrm1_1
				; CHECK-NEXT: early-clobber %4:vrm8 = PseudoVRGATHER_VI_M8 killed [[INSERT_SUBREG3]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0			; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0
	; CHECK-NEXT: PseudoVSE32_V_M8 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: PseudoVSE32_V_M8 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0			; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0
	; CHECK-NEXT: $x10 = COPY [[COPY]]			; CHECK-NEXT: $x10 = COPY [[COPY]]
	; CHECK-NEXT: PseudoRET implicit $x10			; CHECK-NEXT: PseudoRET implicit $x10
	%1:vrm8 = IMPLICIT_DEF			%1:vrm8 = IMPLICIT_DEF
	%7:gpr = ADDI $x0, 8			%7:gpr = ADDI $x0, 8
	%5:vr = PseudoVLE32_V_M1 killed %7:gpr, 0, 5			%5:vr = PseudoVLE32_V_M1 killed %7:gpr, 0, 5
	Show All 13 Lines
	body: |			body: |
	bb.0.entry:			bb.0.entry:
	; CHECK-LABEL: name: test_M8_sub_vrm1_1			; CHECK-LABEL: name: test_M8_sub_vrm1_1
	; CHECK: [[DEF:%[0-9]+]]:vrm8 = IMPLICIT_DEF			; CHECK: [[DEF:%[0-9]+]]:vrm8 = IMPLICIT_DEF
	; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8			; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8
	; CHECK-NEXT: [[PseudoVLE32_V_M1_:%[0-9]+]]:vr = PseudoVLE32_V_M1 killed [[ADDI]], 0, 5 /* e32 */			; CHECK-NEXT: [[PseudoVLE32_V_M1_:%[0-9]+]]:vr = PseudoVLE32_V_M1 killed [[ADDI]], 0, 5 /* e32 */
	; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm8 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M1_]], %subreg.sub_vrm1_1			; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm8 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M1_]], %subreg.sub_vrm1_1
	; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype			; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype
	; CHECK-NEXT: early-clobber %4:vrm8 = PseudoVRGATHER_VI_M8 killed [[INSERT_SUBREG]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: [[PseudoRVVInitUndefM4_:%[0-9]+]]:vrm4 = PseudoRVVInitUndefM4
				; CHECK-NEXT: [[INSERT_SUBREG1:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG]], [[PseudoRVVInitUndefM4_]], %subreg.sub_vrm4_1
				; CHECK-NEXT: [[PseudoRVVInitUndefM2_:%[0-9]+]]:vrm2 = PseudoRVVInitUndefM2
				; CHECK-NEXT: [[INSERT_SUBREG2:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG1]], [[PseudoRVVInitUndefM2_]], %subreg.sub_vrm2_1
				; CHECK-NEXT: [[PseudoRVVInitUndefM1_:%[0-9]+]]:vr = PseudoRVVInitUndefM1
				; CHECK-NEXT: [[INSERT_SUBREG3:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG2]], [[PseudoRVVInitUndefM1_]], %subreg.sub_vrm1_0
				; CHECK-NEXT: early-clobber %4:vrm8 = PseudoVRGATHER_VI_M8 killed [[INSERT_SUBREG3]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0			; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0
	; CHECK-NEXT: PseudoVSE32_V_M8 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: PseudoVSE32_V_M8 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0			; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0
	; CHECK-NEXT: $x10 = COPY [[COPY]]			; CHECK-NEXT: $x10 = COPY [[COPY]]
	; CHECK-NEXT: PseudoRET implicit $x10			; CHECK-NEXT: PseudoRET implicit $x10
	%1:vrm8 = IMPLICIT_DEF			%1:vrm8 = IMPLICIT_DEF
	%7:gpr = ADDI $x0, 8			%7:gpr = ADDI $x0, 8
	%5:vr = PseudoVLE32_V_M1 killed %7:gpr, 0, 5			%5:vr = PseudoVLE32_V_M1 killed %7:gpr, 0, 5
	Show All 13 Lines
	body: |			body: |
	bb.0.entry:			bb.0.entry:
	; CHECK-LABEL: name: test_M8_sub_vrm1_2			; CHECK-LABEL: name: test_M8_sub_vrm1_2
	; CHECK: [[DEF:%[0-9]+]]:vrm8 = IMPLICIT_DEF			; CHECK: [[DEF:%[0-9]+]]:vrm8 = IMPLICIT_DEF
	; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8			; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8
	; CHECK-NEXT: [[PseudoVLE32_V_M1_:%[0-9]+]]:vr = PseudoVLE32_V_M1 killed [[ADDI]], 0, 5 /* e32 */			; CHECK-NEXT: [[PseudoVLE32_V_M1_:%[0-9]+]]:vr = PseudoVLE32_V_M1 killed [[ADDI]], 0, 5 /* e32 */
	; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm8 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M1_]], %subreg.sub_vrm1_2			; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm8 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M1_]], %subreg.sub_vrm1_2
	; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype			; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype
	; CHECK-NEXT: early-clobber %4:vrm8 = PseudoVRGATHER_VI_M8 killed [[INSERT_SUBREG]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: [[PseudoRVVInitUndefM4_:%[0-9]+]]:vrm4 = PseudoRVVInitUndefM4
				; CHECK-NEXT: [[INSERT_SUBREG1:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG]], [[PseudoRVVInitUndefM4_]], %subreg.sub_vrm4_1
				; CHECK-NEXT: [[PseudoRVVInitUndefM2_:%[0-9]+]]:vrm2 = PseudoRVVInitUndefM2
				; CHECK-NEXT: [[INSERT_SUBREG2:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG1]], [[PseudoRVVInitUndefM2_]], %subreg.sub_vrm2_0
				; CHECK-NEXT: [[PseudoRVVInitUndefM1_:%[0-9]+]]:vr = PseudoRVVInitUndefM1
				; CHECK-NEXT: [[INSERT_SUBREG3:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG2]], [[PseudoRVVInitUndefM1_]], %subreg.sub_vrm1_3
				; CHECK-NEXT: early-clobber %4:vrm8 = PseudoVRGATHER_VI_M8 killed [[INSERT_SUBREG3]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0			; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0
	; CHECK-NEXT: PseudoVSE32_V_M8 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: PseudoVSE32_V_M8 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0			; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0
	; CHECK-NEXT: $x10 = COPY [[COPY]]			; CHECK-NEXT: $x10 = COPY [[COPY]]
	; CHECK-NEXT: PseudoRET implicit $x10			; CHECK-NEXT: PseudoRET implicit $x10
	%1:vrm8 = IMPLICIT_DEF			%1:vrm8 = IMPLICIT_DEF
	%7:gpr = ADDI $x0, 8			%7:gpr = ADDI $x0, 8
	%5:vr = PseudoVLE32_V_M1 killed %7:gpr, 0, 5			%5:vr = PseudoVLE32_V_M1 killed %7:gpr, 0, 5
	Show All 13 Lines
	body: |			body: |
	bb.0.entry:			bb.0.entry:
	; CHECK-LABEL: name: test_M8_sub_vrm1_3			; CHECK-LABEL: name: test_M8_sub_vrm1_3
	; CHECK: [[DEF:%[0-9]+]]:vrm8 = IMPLICIT_DEF			; CHECK: [[DEF:%[0-9]+]]:vrm8 = IMPLICIT_DEF
	; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8			; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8
	; CHECK-NEXT: [[PseudoVLE32_V_M1_:%[0-9]+]]:vr = PseudoVLE32_V_M1 killed [[ADDI]], 0, 5 /* e32 */			; CHECK-NEXT: [[PseudoVLE32_V_M1_:%[0-9]+]]:vr = PseudoVLE32_V_M1 killed [[ADDI]], 0, 5 /* e32 */
	; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm8 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M1_]], %subreg.sub_vrm1_3			; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm8 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M1_]], %subreg.sub_vrm1_3
	; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype			; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype
	; CHECK-NEXT: early-clobber %4:vrm8 = PseudoVRGATHER_VI_M8 killed [[INSERT_SUBREG]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: [[PseudoRVVInitUndefM4_:%[0-9]+]]:vrm4 = PseudoRVVInitUndefM4
				; CHECK-NEXT: [[INSERT_SUBREG1:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG]], [[PseudoRVVInitUndefM4_]], %subreg.sub_vrm4_1
				; CHECK-NEXT: [[PseudoRVVInitUndefM2_:%[0-9]+]]:vrm2 = PseudoRVVInitUndefM2
				; CHECK-NEXT: [[INSERT_SUBREG2:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG1]], [[PseudoRVVInitUndefM2_]], %subreg.sub_vrm2_0
				; CHECK-NEXT: [[PseudoRVVInitUndefM1_:%[0-9]+]]:vr = PseudoRVVInitUndefM1
				; CHECK-NEXT: [[INSERT_SUBREG3:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG2]], [[PseudoRVVInitUndefM1_]], %subreg.sub_vrm1_2
				; CHECK-NEXT: early-clobber %4:vrm8 = PseudoVRGATHER_VI_M8 killed [[INSERT_SUBREG3]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0			; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0
	; CHECK-NEXT: PseudoVSE32_V_M8 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: PseudoVSE32_V_M8 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0			; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0
	; CHECK-NEXT: $x10 = COPY [[COPY]]			; CHECK-NEXT: $x10 = COPY [[COPY]]
	; CHECK-NEXT: PseudoRET implicit $x10			; CHECK-NEXT: PseudoRET implicit $x10
	%1:vrm8 = IMPLICIT_DEF			%1:vrm8 = IMPLICIT_DEF
	%7:gpr = ADDI $x0, 8			%7:gpr = ADDI $x0, 8
	%5:vr = PseudoVLE32_V_M1 killed %7:gpr, 0, 5			%5:vr = PseudoVLE32_V_M1 killed %7:gpr, 0, 5
	Show All 13 Lines
	body: |			body: |
	bb.0.entry:			bb.0.entry:
	; CHECK-LABEL: name: test_M8_sub_vrm1_4			; CHECK-LABEL: name: test_M8_sub_vrm1_4
	; CHECK: [[DEF:%[0-9]+]]:vrm8 = IMPLICIT_DEF			; CHECK: [[DEF:%[0-9]+]]:vrm8 = IMPLICIT_DEF
	; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8			; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8
	; CHECK-NEXT: [[PseudoVLE32_V_M1_:%[0-9]+]]:vr = PseudoVLE32_V_M1 killed [[ADDI]], 0, 5 /* e32 */			; CHECK-NEXT: [[PseudoVLE32_V_M1_:%[0-9]+]]:vr = PseudoVLE32_V_M1 killed [[ADDI]], 0, 5 /* e32 */
	; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm8 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M1_]], %subreg.sub_vrm1_4			; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm8 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M1_]], %subreg.sub_vrm1_4
	; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype			; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype
	; CHECK-NEXT: early-clobber %4:vrm8 = PseudoVRGATHER_VI_M8 killed [[INSERT_SUBREG]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: [[PseudoRVVInitUndefM4_:%[0-9]+]]:vrm4 = PseudoRVVInitUndefM4
				; CHECK-NEXT: [[INSERT_SUBREG1:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG]], [[PseudoRVVInitUndefM4_]], %subreg.sub_vrm4_0
				; CHECK-NEXT: [[PseudoRVVInitUndefM2_:%[0-9]+]]:vrm2 = PseudoRVVInitUndefM2
				; CHECK-NEXT: [[INSERT_SUBREG2:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG1]], [[PseudoRVVInitUndefM2_]], %subreg.sub_vrm2_3
				; CHECK-NEXT: [[PseudoRVVInitUndefM1_:%[0-9]+]]:vr = PseudoRVVInitUndefM1
				; CHECK-NEXT: [[INSERT_SUBREG3:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG2]], [[PseudoRVVInitUndefM1_]], %subreg.sub_vrm1_5
				; CHECK-NEXT: early-clobber %4:vrm8 = PseudoVRGATHER_VI_M8 killed [[INSERT_SUBREG3]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0			; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0
	; CHECK-NEXT: PseudoVSE32_V_M8 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: PseudoVSE32_V_M8 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0			; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0
	; CHECK-NEXT: $x10 = COPY [[COPY]]			; CHECK-NEXT: $x10 = COPY [[COPY]]
	; CHECK-NEXT: PseudoRET implicit $x10			; CHECK-NEXT: PseudoRET implicit $x10
	%1:vrm8 = IMPLICIT_DEF			%1:vrm8 = IMPLICIT_DEF
	%7:gpr = ADDI $x0, 8			%7:gpr = ADDI $x0, 8
	%5:vr = PseudoVLE32_V_M1 killed %7:gpr, 0, 5			%5:vr = PseudoVLE32_V_M1 killed %7:gpr, 0, 5
	Show All 13 Lines
	body: |			body: |
	bb.0.entry:			bb.0.entry:
	; CHECK-LABEL: name: test_M8_sub_vrm1_5			; CHECK-LABEL: name: test_M8_sub_vrm1_5
	; CHECK: [[DEF:%[0-9]+]]:vrm8 = IMPLICIT_DEF			; CHECK: [[DEF:%[0-9]+]]:vrm8 = IMPLICIT_DEF
	; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8			; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8
	; CHECK-NEXT: [[PseudoVLE32_V_M1_:%[0-9]+]]:vr = PseudoVLE32_V_M1 killed [[ADDI]], 0, 5 /* e32 */			; CHECK-NEXT: [[PseudoVLE32_V_M1_:%[0-9]+]]:vr = PseudoVLE32_V_M1 killed [[ADDI]], 0, 5 /* e32 */
	; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm8 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M1_]], %subreg.sub_vrm1_5			; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm8 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M1_]], %subreg.sub_vrm1_5
	; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype			; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype
	; CHECK-NEXT: early-clobber %4:vrm8 = PseudoVRGATHER_VI_M8 killed [[INSERT_SUBREG]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: [[PseudoRVVInitUndefM4_:%[0-9]+]]:vrm4 = PseudoRVVInitUndefM4
				; CHECK-NEXT: [[INSERT_SUBREG1:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG]], [[PseudoRVVInitUndefM4_]], %subreg.sub_vrm4_0
				; CHECK-NEXT: [[PseudoRVVInitUndefM2_:%[0-9]+]]:vrm2 = PseudoRVVInitUndefM2
				; CHECK-NEXT: [[INSERT_SUBREG2:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG1]], [[PseudoRVVInitUndefM2_]], %subreg.sub_vrm2_3
				; CHECK-NEXT: [[PseudoRVVInitUndefM1_:%[0-9]+]]:vr = PseudoRVVInitUndefM1
				; CHECK-NEXT: [[INSERT_SUBREG3:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG2]], [[PseudoRVVInitUndefM1_]], %subreg.sub_vrm1_4
				; CHECK-NEXT: early-clobber %4:vrm8 = PseudoVRGATHER_VI_M8 killed [[INSERT_SUBREG3]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0			; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0
	; CHECK-NEXT: PseudoVSE32_V_M8 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: PseudoVSE32_V_M8 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0			; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0
	; CHECK-NEXT: $x10 = COPY [[COPY]]			; CHECK-NEXT: $x10 = COPY [[COPY]]
	; CHECK-NEXT: PseudoRET implicit $x10			; CHECK-NEXT: PseudoRET implicit $x10
	%1:vrm8 = IMPLICIT_DEF			%1:vrm8 = IMPLICIT_DEF
	%7:gpr = ADDI $x0, 8			%7:gpr = ADDI $x0, 8
	%5:vr = PseudoVLE32_V_M1 killed %7:gpr, 0, 5			%5:vr = PseudoVLE32_V_M1 killed %7:gpr, 0, 5
	Show All 13 Lines
	body: |			body: |
	bb.0.entry:			bb.0.entry:
	; CHECK-LABEL: name: test_M8_sub_vrm1_6			; CHECK-LABEL: name: test_M8_sub_vrm1_6
	; CHECK: [[DEF:%[0-9]+]]:vrm8 = IMPLICIT_DEF			; CHECK: [[DEF:%[0-9]+]]:vrm8 = IMPLICIT_DEF
	; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8			; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8
	; CHECK-NEXT: [[PseudoVLE32_V_M1_:%[0-9]+]]:vr = PseudoVLE32_V_M1 killed [[ADDI]], 0, 5 /* e32 */			; CHECK-NEXT: [[PseudoVLE32_V_M1_:%[0-9]+]]:vr = PseudoVLE32_V_M1 killed [[ADDI]], 0, 5 /* e32 */
	; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm8 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M1_]], %subreg.sub_vrm1_6			; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm8 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M1_]], %subreg.sub_vrm1_6
	; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype			; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype
	; CHECK-NEXT: early-clobber %4:vrm8 = PseudoVRGATHER_VI_M8 killed [[INSERT_SUBREG]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: [[PseudoRVVInitUndefM4_:%[0-9]+]]:vrm4 = PseudoRVVInitUndefM4
				; CHECK-NEXT: [[INSERT_SUBREG1:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG]], [[PseudoRVVInitUndefM4_]], %subreg.sub_vrm4_0
				; CHECK-NEXT: [[PseudoRVVInitUndefM2_:%[0-9]+]]:vrm2 = PseudoRVVInitUndefM2
				; CHECK-NEXT: [[INSERT_SUBREG2:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG1]], [[PseudoRVVInitUndefM2_]], %subreg.sub_vrm2_2
				; CHECK-NEXT: [[PseudoRVVInitUndefM1_:%[0-9]+]]:vr = PseudoRVVInitUndefM1
				; CHECK-NEXT: [[INSERT_SUBREG3:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG2]], [[PseudoRVVInitUndefM1_]], %subreg.sub_vrm1_7
				; CHECK-NEXT: early-clobber %4:vrm8 = PseudoVRGATHER_VI_M8 killed [[INSERT_SUBREG3]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0			; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0
	; CHECK-NEXT: PseudoVSE32_V_M8 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: PseudoVSE32_V_M8 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0			; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0
	; CHECK-NEXT: $x10 = COPY [[COPY]]			; CHECK-NEXT: $x10 = COPY [[COPY]]
	; CHECK-NEXT: PseudoRET implicit $x10			; CHECK-NEXT: PseudoRET implicit $x10
	%1:vrm8 = IMPLICIT_DEF			%1:vrm8 = IMPLICIT_DEF
	%7:gpr = ADDI $x0, 8			%7:gpr = ADDI $x0, 8
	%5:vr = PseudoVLE32_V_M1 killed %7:gpr, 0, 5			%5:vr = PseudoVLE32_V_M1 killed %7:gpr, 0, 5
	Show All 13 Lines
	body: |			body: |
	bb.0.entry:			bb.0.entry:
	; CHECK-LABEL: name: test_M8_sub_vrm1_7			; CHECK-LABEL: name: test_M8_sub_vrm1_7
	; CHECK: [[DEF:%[0-9]+]]:vrm8 = IMPLICIT_DEF			; CHECK: [[DEF:%[0-9]+]]:vrm8 = IMPLICIT_DEF
	; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8			; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8
	; CHECK-NEXT: [[PseudoVLE32_V_M1_:%[0-9]+]]:vr = PseudoVLE32_V_M1 killed [[ADDI]], 0, 5 /* e32 */			; CHECK-NEXT: [[PseudoVLE32_V_M1_:%[0-9]+]]:vr = PseudoVLE32_V_M1 killed [[ADDI]], 0, 5 /* e32 */
	; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm8 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M1_]], %subreg.sub_vrm1_7			; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm8 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M1_]], %subreg.sub_vrm1_7
	; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype			; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype
	; CHECK-NEXT: early-clobber %4:vrm8 = PseudoVRGATHER_VI_M8 killed [[INSERT_SUBREG]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: [[PseudoRVVInitUndefM4_:%[0-9]+]]:vrm4 = PseudoRVVInitUndefM4
				; CHECK-NEXT: [[INSERT_SUBREG1:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG]], [[PseudoRVVInitUndefM4_]], %subreg.sub_vrm4_0
				; CHECK-NEXT: [[PseudoRVVInitUndefM2_:%[0-9]+]]:vrm2 = PseudoRVVInitUndefM2
				; CHECK-NEXT: [[INSERT_SUBREG2:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG1]], [[PseudoRVVInitUndefM2_]], %subreg.sub_vrm2_2
				; CHECK-NEXT: [[PseudoRVVInitUndefM1_:%[0-9]+]]:vr = PseudoRVVInitUndefM1
				; CHECK-NEXT: [[INSERT_SUBREG3:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG2]], [[PseudoRVVInitUndefM1_]], %subreg.sub_vrm1_6
				; CHECK-NEXT: early-clobber %4:vrm8 = PseudoVRGATHER_VI_M8 killed [[INSERT_SUBREG3]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0			; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0
	; CHECK-NEXT: PseudoVSE32_V_M8 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: PseudoVSE32_V_M8 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0			; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0
	; CHECK-NEXT: $x10 = COPY [[COPY]]			; CHECK-NEXT: $x10 = COPY [[COPY]]
	; CHECK-NEXT: PseudoRET implicit $x10			; CHECK-NEXT: PseudoRET implicit $x10
	%1:vrm8 = IMPLICIT_DEF			%1:vrm8 = IMPLICIT_DEF
	%7:gpr = ADDI $x0, 8			%7:gpr = ADDI $x0, 8
	%5:vr = PseudoVLE32_V_M1 killed %7:gpr, 0, 5			%5:vr = PseudoVLE32_V_M1 killed %7:gpr, 0, 5
	Show All 13 Lines
	body: |			body: |
	bb.0.entry:			bb.0.entry:
	; CHECK-LABEL: name: test_M8_sub_vrm2_0			; CHECK-LABEL: name: test_M8_sub_vrm2_0
	; CHECK: [[DEF:%[0-9]+]]:vrm8 = IMPLICIT_DEF			; CHECK: [[DEF:%[0-9]+]]:vrm8 = IMPLICIT_DEF
	; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8			; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8
	; CHECK-NEXT: [[PseudoVLE32_V_M2_:%[0-9]+]]:vrm2 = PseudoVLE32_V_M2 killed [[ADDI]], 0, 5 /* e32 */			; CHECK-NEXT: [[PseudoVLE32_V_M2_:%[0-9]+]]:vrm2 = PseudoVLE32_V_M2 killed [[ADDI]], 0, 5 /* e32 */
	; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm8 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M2_]], %subreg.sub_vrm2_0			; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm8 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M2_]], %subreg.sub_vrm2_0
	; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype			; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype
	; CHECK-NEXT: early-clobber %4:vrm8 = PseudoVRGATHER_VI_M8 killed [[INSERT_SUBREG]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: [[PseudoRVVInitUndefM4_:%[0-9]+]]:vrm4 = PseudoRVVInitUndefM4
				; CHECK-NEXT: [[INSERT_SUBREG1:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG]], [[PseudoRVVInitUndefM4_]], %subreg.sub_vrm4_1
				; CHECK-NEXT: [[PseudoRVVInitUndefM2_:%[0-9]+]]:vrm2 = PseudoRVVInitUndefM2
				; CHECK-NEXT: [[INSERT_SUBREG2:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG1]], [[PseudoRVVInitUndefM2_]], %subreg.sub_vrm2_1
				; CHECK-NEXT: early-clobber %4:vrm8 = PseudoVRGATHER_VI_M8 killed [[INSERT_SUBREG2]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0			; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0
	; CHECK-NEXT: PseudoVSE32_V_M8 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: PseudoVSE32_V_M8 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0			; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0
	; CHECK-NEXT: $x10 = COPY [[COPY]]			; CHECK-NEXT: $x10 = COPY [[COPY]]
	; CHECK-NEXT: PseudoRET implicit $x10			; CHECK-NEXT: PseudoRET implicit $x10
	%1:vrm8 = IMPLICIT_DEF			%1:vrm8 = IMPLICIT_DEF
	%7:gpr = ADDI $x0, 8			%7:gpr = ADDI $x0, 8
	%5:vrm2 = PseudoVLE32_V_M2 killed %7:gpr, 0, 5			%5:vrm2 = PseudoVLE32_V_M2 killed %7:gpr, 0, 5
	Show All 13 Lines
	body: |			body: |
	bb.0.entry:			bb.0.entry:
	; CHECK-LABEL: name: test_M8_sub_vrm2_1			; CHECK-LABEL: name: test_M8_sub_vrm2_1
	; CHECK: [[DEF:%[0-9]+]]:vrm8 = IMPLICIT_DEF			; CHECK: [[DEF:%[0-9]+]]:vrm8 = IMPLICIT_DEF
	; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8			; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8
	; CHECK-NEXT: [[PseudoVLE32_V_M2_:%[0-9]+]]:vrm2 = PseudoVLE32_V_M2 killed [[ADDI]], 0, 5 /* e32 */			; CHECK-NEXT: [[PseudoVLE32_V_M2_:%[0-9]+]]:vrm2 = PseudoVLE32_V_M2 killed [[ADDI]], 0, 5 /* e32 */
	; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm8 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M2_]], %subreg.sub_vrm2_1			; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm8 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M2_]], %subreg.sub_vrm2_1
	; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype			; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype
	; CHECK-NEXT: early-clobber %4:vrm8 = PseudoVRGATHER_VI_M8 killed [[INSERT_SUBREG]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: [[PseudoRVVInitUndefM4_:%[0-9]+]]:vrm4 = PseudoRVVInitUndefM4
				; CHECK-NEXT: [[INSERT_SUBREG1:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG]], [[PseudoRVVInitUndefM4_]], %subreg.sub_vrm4_1
				; CHECK-NEXT: [[PseudoRVVInitUndefM2_:%[0-9]+]]:vrm2 = PseudoRVVInitUndefM2
				; CHECK-NEXT: [[INSERT_SUBREG2:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG1]], [[PseudoRVVInitUndefM2_]], %subreg.sub_vrm2_0
				; CHECK-NEXT: early-clobber %4:vrm8 = PseudoVRGATHER_VI_M8 killed [[INSERT_SUBREG2]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0			; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0
	; CHECK-NEXT: PseudoVSE32_V_M8 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: PseudoVSE32_V_M8 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0			; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0
	; CHECK-NEXT: $x10 = COPY [[COPY]]			; CHECK-NEXT: $x10 = COPY [[COPY]]
	; CHECK-NEXT: PseudoRET implicit $x10			; CHECK-NEXT: PseudoRET implicit $x10
	%1:vrm8 = IMPLICIT_DEF			%1:vrm8 = IMPLICIT_DEF
	%7:gpr = ADDI $x0, 8			%7:gpr = ADDI $x0, 8
	%5:vrm2 = PseudoVLE32_V_M2 killed %7:gpr, 0, 5			%5:vrm2 = PseudoVLE32_V_M2 killed %7:gpr, 0, 5
	Show All 13 Lines
	body: |			body: |
	bb.0.entry:			bb.0.entry:
	; CHECK-LABEL: name: test_M8_sub_vrm2_2			; CHECK-LABEL: name: test_M8_sub_vrm2_2
	; CHECK: [[DEF:%[0-9]+]]:vrm8 = IMPLICIT_DEF			; CHECK: [[DEF:%[0-9]+]]:vrm8 = IMPLICIT_DEF
	; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8			; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8
	; CHECK-NEXT: [[PseudoVLE32_V_M2_:%[0-9]+]]:vrm2 = PseudoVLE32_V_M2 killed [[ADDI]], 0, 5 /* e32 */			; CHECK-NEXT: [[PseudoVLE32_V_M2_:%[0-9]+]]:vrm2 = PseudoVLE32_V_M2 killed [[ADDI]], 0, 5 /* e32 */
	; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm8 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M2_]], %subreg.sub_vrm2_2			; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm8 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M2_]], %subreg.sub_vrm2_2
	; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype			; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype
	; CHECK-NEXT: early-clobber %4:vrm8 = PseudoVRGATHER_VI_M8 killed [[INSERT_SUBREG]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: [[PseudoRVVInitUndefM4_:%[0-9]+]]:vrm4 = PseudoRVVInitUndefM4
				; CHECK-NEXT: [[INSERT_SUBREG1:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG]], [[PseudoRVVInitUndefM4_]], %subreg.sub_vrm4_0
				; CHECK-NEXT: [[PseudoRVVInitUndefM2_:%[0-9]+]]:vrm2 = PseudoRVVInitUndefM2
				; CHECK-NEXT: [[INSERT_SUBREG2:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG1]], [[PseudoRVVInitUndefM2_]], %subreg.sub_vrm2_3
				; CHECK-NEXT: early-clobber %4:vrm8 = PseudoVRGATHER_VI_M8 killed [[INSERT_SUBREG2]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0			; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0
	; CHECK-NEXT: PseudoVSE32_V_M8 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: PseudoVSE32_V_M8 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0			; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0
	; CHECK-NEXT: $x10 = COPY [[COPY]]			; CHECK-NEXT: $x10 = COPY [[COPY]]
	; CHECK-NEXT: PseudoRET implicit $x10			; CHECK-NEXT: PseudoRET implicit $x10
	%1:vrm8 = IMPLICIT_DEF			%1:vrm8 = IMPLICIT_DEF
	%7:gpr = ADDI $x0, 8			%7:gpr = ADDI $x0, 8
	%5:vrm2 = PseudoVLE32_V_M2 killed %7:gpr, 0, 5			%5:vrm2 = PseudoVLE32_V_M2 killed %7:gpr, 0, 5
	Show All 13 Lines
	body: |			body: |
	bb.0.entry:			bb.0.entry:
	; CHECK-LABEL: name: test_M8_sub_vrm2_3			; CHECK-LABEL: name: test_M8_sub_vrm2_3
	; CHECK: [[DEF:%[0-9]+]]:vrm8 = IMPLICIT_DEF			; CHECK: [[DEF:%[0-9]+]]:vrm8 = IMPLICIT_DEF
	; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8			; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8
	; CHECK-NEXT: [[PseudoVLE32_V_M2_:%[0-9]+]]:vrm2 = PseudoVLE32_V_M2 killed [[ADDI]], 0, 5 /* e32 */			; CHECK-NEXT: [[PseudoVLE32_V_M2_:%[0-9]+]]:vrm2 = PseudoVLE32_V_M2 killed [[ADDI]], 0, 5 /* e32 */
	; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm8 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M2_]], %subreg.sub_vrm2_3			; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm8 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M2_]], %subreg.sub_vrm2_3
	; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype			; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype
	; CHECK-NEXT: early-clobber %4:vrm8 = PseudoVRGATHER_VI_M8 killed [[INSERT_SUBREG]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: [[PseudoRVVInitUndefM4_:%[0-9]+]]:vrm4 = PseudoRVVInitUndefM4
				; CHECK-NEXT: [[INSERT_SUBREG1:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG]], [[PseudoRVVInitUndefM4_]], %subreg.sub_vrm4_0
				; CHECK-NEXT: [[PseudoRVVInitUndefM2_:%[0-9]+]]:vrm2 = PseudoRVVInitUndefM2
				; CHECK-NEXT: [[INSERT_SUBREG2:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG1]], [[PseudoRVVInitUndefM2_]], %subreg.sub_vrm2_2
				; CHECK-NEXT: early-clobber %4:vrm8 = PseudoVRGATHER_VI_M8 killed [[INSERT_SUBREG2]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0			; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0
	; CHECK-NEXT: PseudoVSE32_V_M8 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: PseudoVSE32_V_M8 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0			; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0
	; CHECK-NEXT: $x10 = COPY [[COPY]]			; CHECK-NEXT: $x10 = COPY [[COPY]]
	; CHECK-NEXT: PseudoRET implicit $x10			; CHECK-NEXT: PseudoRET implicit $x10
	%1:vrm8 = IMPLICIT_DEF			%1:vrm8 = IMPLICIT_DEF
	%7:gpr = ADDI $x0, 8			%7:gpr = ADDI $x0, 8
	%5:vrm2 = PseudoVLE32_V_M2 killed %7:gpr, 0, 5			%5:vrm2 = PseudoVLE32_V_M2 killed %7:gpr, 0, 5
	Show All 13 Lines
	body: |			body: |
	bb.0.entry:			bb.0.entry:
	; CHECK-LABEL: name: test_M8_sub_vrm4_0			; CHECK-LABEL: name: test_M8_sub_vrm4_0
	; CHECK: [[DEF:%[0-9]+]]:vrm8 = IMPLICIT_DEF			; CHECK: [[DEF:%[0-9]+]]:vrm8 = IMPLICIT_DEF
	; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8			; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8
	; CHECK-NEXT: [[PseudoVLE32_V_M4_:%[0-9]+]]:vrm4 = PseudoVLE32_V_M4 killed [[ADDI]], 0, 5 /* e32 */			; CHECK-NEXT: [[PseudoVLE32_V_M4_:%[0-9]+]]:vrm4 = PseudoVLE32_V_M4 killed [[ADDI]], 0, 5 /* e32 */
	; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm8 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M4_]], %subreg.sub_vrm4_0			; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm8 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M4_]], %subreg.sub_vrm4_0
	; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype			; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype
	; CHECK-NEXT: early-clobber %4:vrm8 = PseudoVRGATHER_VI_M8 killed [[INSERT_SUBREG]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: [[PseudoRVVInitUndefM4_:%[0-9]+]]:vrm4 = PseudoRVVInitUndefM4
				; CHECK-NEXT: [[INSERT_SUBREG1:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG]], [[PseudoRVVInitUndefM4_]], %subreg.sub_vrm4_1
				; CHECK-NEXT: early-clobber %4:vrm8 = PseudoVRGATHER_VI_M8 killed [[INSERT_SUBREG1]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0			; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0
	; CHECK-NEXT: PseudoVSE32_V_M8 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: PseudoVSE32_V_M8 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0			; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0
	; CHECK-NEXT: $x10 = COPY [[COPY]]			; CHECK-NEXT: $x10 = COPY [[COPY]]
	; CHECK-NEXT: PseudoRET implicit $x10			; CHECK-NEXT: PseudoRET implicit $x10
	%1:vrm8 = IMPLICIT_DEF			%1:vrm8 = IMPLICIT_DEF
	%7:gpr = ADDI $x0, 8			%7:gpr = ADDI $x0, 8
	%5:vrm4 = PseudoVLE32_V_M4 killed %7:gpr, 0, 5			%5:vrm4 = PseudoVLE32_V_M4 killed %7:gpr, 0, 5
	Show All 13 Lines
	body: |			body: |
	bb.0.entry:			bb.0.entry:
	; CHECK-LABEL: name: test_M8_sub_vrm4_1			; CHECK-LABEL: name: test_M8_sub_vrm4_1
	; CHECK: [[DEF:%[0-9]+]]:vrm8 = IMPLICIT_DEF			; CHECK: [[DEF:%[0-9]+]]:vrm8 = IMPLICIT_DEF
	; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8			; CHECK-NEXT: [[ADDI:%[0-9]+]]:gpr = ADDI $x0, 8
	; CHECK-NEXT: [[PseudoVLE32_V_M4_:%[0-9]+]]:vrm4 = PseudoVLE32_V_M4 killed [[ADDI]], 0, 5 /* e32 */			; CHECK-NEXT: [[PseudoVLE32_V_M4_:%[0-9]+]]:vrm4 = PseudoVLE32_V_M4 killed [[ADDI]], 0, 5 /* e32 */
	; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm8 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M4_]], %subreg.sub_vrm4_1			; CHECK-NEXT: [[INSERT_SUBREG:%[0-9]+]]:vrm8 = INSERT_SUBREG [[DEF]], [[PseudoVLE32_V_M4_]], %subreg.sub_vrm4_1
	; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype			; CHECK-NEXT: dead $x0 = PseudoVSETIVLI 0, 210 /* e32, m4, ta, ma */, implicit-def $vl, implicit-def $vtype
	; CHECK-NEXT: early-clobber %4:vrm8 = PseudoVRGATHER_VI_M8 killed [[INSERT_SUBREG]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: [[PseudoRVVInitUndefM4_:%[0-9]+]]:vrm4 = PseudoRVVInitUndefM4
				; CHECK-NEXT: [[INSERT_SUBREG1:%[0-9]+]]:vrm8 = INSERT_SUBREG [[INSERT_SUBREG]], [[PseudoRVVInitUndefM4_]], %subreg.sub_vrm4_0
				; CHECK-NEXT: early-clobber %4:vrm8 = PseudoVRGATHER_VI_M8 killed [[INSERT_SUBREG1]], 0, 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0			; CHECK-NEXT: [[ADDI1:%[0-9]+]]:gpr = ADDI $x0, 0
	; CHECK-NEXT: PseudoVSE32_V_M8 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype			; CHECK-NEXT: PseudoVSE32_V_M8 killed %4, killed [[ADDI1]], 0, 5 /* e32 */, implicit $vl, implicit $vtype
	; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0			; CHECK-NEXT: [[COPY:%[0-9]+]]:gpr = COPY $x0
	; CHECK-NEXT: $x10 = COPY [[COPY]]			; CHECK-NEXT: $x10 = COPY [[COPY]]
	; CHECK-NEXT: PseudoRET implicit $x10			; CHECK-NEXT: PseudoRET implicit $x10
	%1:vrm8 = IMPLICIT_DEF			%1:vrm8 = IMPLICIT_DEF
	%7:gpr = ADDI $x0, 8			%7:gpr = ADDI $x0, 8
	%5:vrm4 = PseudoVLE32_V_M4 killed %7:gpr, 0, 5			%5:vrm4 = PseudoVLE32_V_M4 killed %7:gpr, 0, 5
	Show All 10 Lines
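
The right-hand CHECK lines in the M8 tests above fill every still-undefined part of the `vrm8` group with `PseudoRVVInitUndefM4`/`M2`/`M1` values inserted at specific sub-register indices. The following is a minimal sketch that reproduces those index choices; it is not the code in `RISCVRVVInitUndef.cpp`, and the names `undef_cover` and `subreg_name` are hypothetical. It assumes power-of-two, naturally aligned sub-groups, as the `sub_vrm<lmul>_<index>` names suggest.

```python
def undef_cover(total_lmul, defined_lmul, defined_index):
    """List the aligned sub-groups that cover every register of the group
    except the one already defined, largest piece first.

    Each result is (lmul, index), read as sub_vrm<lmul>_<index>.
    """
    pieces = []
    size, index = defined_lmul, defined_index
    while size < total_lmul:
        # The sibling of the current block inside its parent is still undefined.
        pieces.append((size, index ^ 1))
        # Move up one level: the parent is twice as large and has half the index.
        size *= 2
        index //= 2
    return pieces[::-1]


def subreg_name(lmul, index):
    """Format a (lmul, index) pair the way the MIR sub-register indices are spelled."""
    return f"sub_vrm{lmul}_{index}"


if __name__ == "__main__":
    # test_M8_sub_vrm2_0: sub_vrm2_0 is defined by the load, the rest is undef.
    for lmul, index in undef_cover(8, 2, 0):
        print(subreg_name(lmul, index))
```

For `test_M8_sub_vrm2_0` this yields `sub_vrm4_1` followed by `sub_vrm2_1`, which matches the two `INSERT_SUBREG` CHECK lines of that test; the other M8 cases shown above follow the same pattern.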

llvm/test/CodeGen/RISCV/rvv/undef-earlyclobber-chain.ll

	; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
				kito-cheng: Could you add a pre-commit patch for this test case, so that it is easier to demonstrate what gets fixed?
				BeMg (author): New patch https://reviews.llvm.org/D137763 is test-only, and I have set up the dependency. Should I remove the test case from this patch?
	; RUN: llc -mtriple riscv64 -mattr=+v -riscv-enable-subreg-liveness < %s | FileCheck %s			; RUN: llc -mtriple riscv64 -mattr=+v -riscv-enable-subreg-liveness < %s | FileCheck %s

	define dso_local signext i32 @undef_early_clobber_chain() {			define dso_local signext i32 @undef_early_clobber_chain() {
	; CHECK-LABEL: undef_early_clobber_chain:			; CHECK-LABEL: undef_early_clobber_chain:
	; CHECK: # %bb.0: # %entry			; CHECK: # %bb.0: # %entry
	; CHECK-NEXT: addi sp, sp, -400			; CHECK-NEXT: addi sp, sp, -400
	; CHECK-NEXT: .cfi_def_cfa_offset 400			; CHECK-NEXT: .cfi_def_cfa_offset 400
	; CHECK-NEXT: vsetivli zero, 0, e32, m1, ta, ma			; CHECK-NEXT: vsetivli zero, 0, e32, m1, ta, ma
	; CHECK-NEXT: vrgather.vi v8, v8, 0			; CHECK-NEXT: vrgather.vi v9, v8, 0
	; CHECK-NEXT: mv a0, sp			; CHECK-NEXT: mv a0, sp
	; CHECK-NEXT: vse32.v v8, (a0)			; CHECK-NEXT: vse32.v v9, (a0)
	; CHECK-NEXT: li a0, 0			; CHECK-NEXT: li a0, 0
	; CHECK-NEXT: addi sp, sp, 400			; CHECK-NEXT: addi sp, sp, 400
	; CHECK-NEXT: ret			; CHECK-NEXT: ret
	entry:			entry:
	%dst = alloca [100 x float], align 8			%dst = alloca [100 x float], align 8
	call void @llvm.lifetime.start.p0(i64 400, ptr nonnull %dst) #4			call void @llvm.lifetime.start.p0(i64 400, ptr nonnull %dst) #4
	%0 = tail call <vscale x 2 x float> @llvm.riscv.vrgather.vx.nxv2f32.i64(<vscale x 2 x float> undef, <vscale x 2 x float> undef, i64 0, i64 0)			%0 = tail call <vscale x 2 x float> @llvm.riscv.vrgather.vx.nxv2f32.i64(<vscale x 2 x float> undef, <vscale x 2 x float> undef, i64 0, i64 0)
				craig.topper: The operands for this instruction make it an obvious candidate for deletion earlier. Can we use more realistic operands that still demonstrate the bug?
				craig.topper: Oh, I guess you can't. This bug only exists for this instruction because we don't delete vrgather with undef operands in DAG combine.
	call void @llvm.riscv.vse.nxv2f32.i64(<vscale x 2 x float> %0, ptr nonnull %dst, i64 0)			call void @llvm.riscv.vse.nxv2f32.i64(<vscale x 2 x float> %0, ptr nonnull %dst, i64 0)
	call void @llvm.lifetime.end.p0(i64 400, ptr nonnull %dst) #4			call void @llvm.lifetime.end.p0(i64 400, ptr nonnull %dst) #4
	ret i32 0			ret i32 0
	}			}
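
In the CHECK lines of `undef_early_clobber_chain`, the left column has `vrgather.vi v8, v8, 0` and the right column has `vrgather.vi v9, v8, 0`. Per the revision summary, a destination that overlaps its source violates the register overlap constraint. A minimal sketch of that overlap test follows; the helper names `register_group` and `overlaps` are hypothetical, and it assumes the destination and source use the same integral LMUL.

```python
def register_group(base, lmul=1):
    """Return the set of vector register numbers in the group starting at `base`."""
    return set(range(base, base + lmul))


def overlaps(dest_base, src_base, lmul=1):
    """True if the destination group shares any register with the source group."""
    return not register_group(dest_base, lmul).isdisjoint(
        register_group(src_base, lmul)
    )


if __name__ == "__main__":
    print(overlaps(8, 8))  # left column:  vrgather.vi v8, v8, 0
    print(overlaps(9, 8))  # right column: vrgather.vi v9, v8, 0
```

The left-column allocation reports an overlap and the right-column one does not, which is the difference the test is meant to pin down.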

	define internal void @SubRegLivenessUndefInPhi(i64 %cond) {			define internal void @SubRegLivenessUndefInPhi(i64 %cond) {
	; CHECK-LABEL: SubRegLivenessUndefInPhi:			; CHECK-LABEL: SubRegLivenessUndefInPhi:
	; CHECK: # %bb.0: # %start			; CHECK: # %bb.0: # %start
	Show All 18 Lines
	; CHECK-NEXT: vslideup.vx v10, v11, a0			; CHECK-NEXT: vslideup.vx v10, v11, a0
	; CHECK-NEXT: vsetvli a2, zero, e16, mf4, ta, ma			; CHECK-NEXT: vsetvli a2, zero, e16, mf4, ta, ma
	; CHECK-NEXT: vadd.vi v9, v9, 3			; CHECK-NEXT: vadd.vi v9, v9, 3
	; CHECK-NEXT: vsetvli zero, a1, e16, m1, ta, ma			; CHECK-NEXT: vsetvli zero, a1, e16, m1, ta, ma
	; CHECK-NEXT: vslideup.vx v12, v9, a0			; CHECK-NEXT: vslideup.vx v12, v9, a0
	; CHECK-NEXT: .LBB1_3: # %UseSR			; CHECK-NEXT: .LBB1_3: # %UseSR
	; CHECK-NEXT: vl1r.v v14, (zero)			; CHECK-NEXT: vl1r.v v14, (zero)
	; CHECK-NEXT: vsetivli zero, 4, e8, m1, ta, ma			; CHECK-NEXT: vsetivli zero, 4, e8, m1, ta, ma
	; CHECK-NEXT: vrgatherei16.vv v15, v14, v8			; CHECK-NEXT: vrgatherei16.vv v13, v14, v8
	; CHECK-NEXT: vrgatherei16.vv v8, v14, v10			; CHECK-NEXT: vrgatherei16.vv v8, v14, v10
	; CHECK-NEXT: vsetvli a0, zero, e8, m1, ta, ma			; CHECK-NEXT: vsetvli a0, zero, e8, m1, ta, ma
	; CHECK-NEXT: vand.vv v8, v15, v8			; CHECK-NEXT: vand.vv v8, v13, v8
	; CHECK-NEXT: vsetivli zero, 4, e8, m1, ta, ma			; CHECK-NEXT: vsetivli zero, 4, e8, m1, ta, ma
	; CHECK-NEXT: vrgatherei16.vv v9, v14, v12			; CHECK-NEXT: vrgatherei16.vv v9, v14, v12
	; CHECK-NEXT: vsetvli a0, zero, e8, m1, ta, ma			; CHECK-NEXT: vsetvli a0, zero, e8, m1, ta, ma
	; CHECK-NEXT: vand.vv v8, v8, v9			; CHECK-NEXT: vand.vv v8, v8, v9
	; CHECK-NEXT: vs1r.v v8, (zero)			; CHECK-NEXT: vs1r.v v8, (zero)
	; CHECK-NEXT: ret			; CHECK-NEXT: ret
	start:			start:
	%0 = icmp sgt i64 %cond, 0			%0 = icmp sgt i64 %cond, 0
	Show All 35 Lines
	; CHECK-LABEL: SubRegLivenessUndef:			; CHECK-LABEL: SubRegLivenessUndef:
	; CHECK: # %bb.0: # %loopIR.preheader.i.i			; CHECK: # %bb.0: # %loopIR.preheader.i.i
	; CHECK-NEXT: vsetvli a0, zero, e16, mf4, ta, ma			; CHECK-NEXT: vsetvli a0, zero, e16, mf4, ta, ma
	; CHECK-NEXT: vid.v v8			; CHECK-NEXT: vid.v v8
	; CHECK-NEXT: vadd.vi v10, v8, 1			; CHECK-NEXT: vadd.vi v10, v8, 1
	; CHECK-NEXT: vadd.vi v12, v8, 3			; CHECK-NEXT: vadd.vi v12, v8, 3
	; CHECK-NEXT: .LBB2_1: # %loopIR3.i.i			; CHECK-NEXT: .LBB2_1: # %loopIR3.i.i
	; CHECK-NEXT: # =>This Inner Loop Header: Depth=1			; CHECK-NEXT: # =>This Inner Loop Header: Depth=1
	; CHECK-NEXT: vl1r.v v9, (zero)			; CHECK-NEXT: vl1r.v v14, (zero)
	; CHECK-NEXT: vsetivli zero, 4, e8, m1, ta, ma			; CHECK-NEXT: vsetivli zero, 4, e8, m1, ta, ma
	; CHECK-NEXT: vrgatherei16.vv v11, v9, v8			; CHECK-NEXT: vrgatherei16.vv v13, v14, v8
	; CHECK-NEXT: vrgatherei16.vv v13, v9, v10			; CHECK-NEXT: vrgatherei16.vv v9, v14, v10
	; CHECK-NEXT: vsetvli a0, zero, e8, m1, ta, ma			; CHECK-NEXT: vsetvli a0, zero, e8, m1, ta, ma
	; CHECK-NEXT: vand.vv v11, v11, v13			; CHECK-NEXT: vand.vv v9, v13, v9
	; CHECK-NEXT: vsetivli zero, 4, e8, m1, ta, ma			; CHECK-NEXT: vsetivli zero, 4, e8, m1, ta, ma
	; CHECK-NEXT: vrgatherei16.vv v13, v9, v12			; CHECK-NEXT: vrgatherei16.vv v11, v14, v12
	; CHECK-NEXT: vsetvli a0, zero, e8, m1, ta, ma			; CHECK-NEXT: vsetvli a0, zero, e8, m1, ta, ma
	; CHECK-NEXT: vand.vv v9, v11, v13			; CHECK-NEXT: vand.vv v9, v9, v11
	; CHECK-NEXT: vs1r.v v9, (zero)			; CHECK-NEXT: vs1r.v v9, (zero)
	; CHECK-NEXT: j .LBB2_1			; CHECK-NEXT: j .LBB2_1
	loopIR.preheader.i.i:			loopIR.preheader.i.i:
	%v15 = tail call <vscale x 1 x i16> @llvm.experimental.stepvector.nxv1i16()			%v15 = tail call <vscale x 1 x i16> @llvm.experimental.stepvector.nxv1i16()
	%v17 = tail call <vscale x 8 x i16> @llvm.vector.insert.nxv8i16.nxv1i16(<vscale x 8 x i16> poison, <vscale x 1 x i16> %v15, i64 0)			%v17 = tail call <vscale x 8 x i16> @llvm.vector.insert.nxv8i16.nxv1i16(<vscale x 8 x i16> poison, <vscale x 1 x i16> %v15, i64 0)
	%vs12.i.i.i = add <vscale x 1 x i16> %v15, shufflevector (<vscale x 1 x i16> insertelement (<vscale x 1 x i16> poison, i16 1, i32 0), <vscale x 1 x i16> poison, <vscale x 1 x i32> zeroinitializer)			%vs12.i.i.i = add <vscale x 1 x i16> %v15, shufflevector (<vscale x 1 x i16> insertelement (<vscale x 1 x i16> poison, i16 1, i32 0), <vscale x 1 x i16> poison, <vscale x 1 x i32> zeroinitializer)
	%v18 = tail call <vscale x 8 x i16> @llvm.vector.insert.nxv8i16.nxv1i16(<vscale x 8 x i16> poison, <vscale x 1 x i16> %vs12.i.i.i, i64 0)			%v18 = tail call <vscale x 8 x i16> @llvm.vector.insert.nxv8i16.nxv1i16(<vscale x 8 x i16> poison, <vscale x 1 x i16> %vs12.i.i.i, i64 0)
	%vs16.i.i.i = add <vscale x 1 x i16> %v15, shufflevector (<vscale x 1 x i16> insertelement (<vscale x 1 x i16> poison, i16 3, i32 0), <vscale x 1 x i16> poison, <vscale x 1 x i32> zeroinitializer)			%vs16.i.i.i = add <vscale x 1 x i16> %v15, shufflevector (<vscale x 1 x i16> insertelement (<vscale x 1 x i16> poison, i16 3, i32 0), <vscale x 1 x i16> poison, <vscale x 1 x i32> zeroinitializer)
	Show All 21 Lines

llvm/test/CodeGen/RISCV/rvv/vrgatherei16-subreg-liveness.ll

	Show All 38 Lines
	; SUBREG-NEXT: vmv.v.i v9, 0			; SUBREG-NEXT: vmv.v.i v9, 0
	; SUBREG-NEXT: vsetivli zero, 4, e8, m1, tu, ma			; SUBREG-NEXT: vsetivli zero, 4, e8, m1, tu, ma
	; SUBREG-NEXT: vmv1r.v v8, v9			; SUBREG-NEXT: vmv1r.v v8, v9
	; SUBREG-NEXT: vrgatherei16.vv v8, v9, v14			; SUBREG-NEXT: vrgatherei16.vv v8, v9, v14
	; SUBREG-NEXT: .LBB0_1: # %loopIR3.i.i			; SUBREG-NEXT: .LBB0_1: # %loopIR3.i.i
	; SUBREG-NEXT: # =>This Inner Loop Header: Depth=1			; SUBREG-NEXT: # =>This Inner Loop Header: Depth=1
	; SUBREG-NEXT: vl1r.v v9, (zero)			; SUBREG-NEXT: vl1r.v v9, (zero)
	; SUBREG-NEXT: vsetivli zero, 4, e8, m1, tu, ma			; SUBREG-NEXT: vsetivli zero, 4, e8, m1, tu, ma
	; SUBREG-NEXT: vmv1r.v v11, v12			; SUBREG-NEXT: vmv1r.v v13, v12
	; SUBREG-NEXT: vrgatherei16.vv v11, v9, v10			; SUBREG-NEXT: vrgatherei16.vv v13, v9, v10
	; SUBREG-NEXT: vsetvli a0, zero, e8, m1, ta, ma			; SUBREG-NEXT: vsetvli a0, zero, e8, m1, ta, ma
	; SUBREG-NEXT: vand.vv v9, v8, v11			; SUBREG-NEXT: vand.vv v9, v8, v13
	; SUBREG-NEXT: vs1r.v v9, (zero)			; SUBREG-NEXT: vs1r.v v9, (zero)
	; SUBREG-NEXT: j .LBB0_1			; SUBREG-NEXT: j .LBB0_1
	loopIR.preheader.i.i:			loopIR.preheader.i.i:
	%v18 = tail call <vscale x 8 x i16> @llvm.vector.insert.nxv8i16.nxv1i16(<vscale x 8 x i16> poison, <vscale x 1 x i16> %vs12.i.i.i, i64 0)			%v18 = tail call <vscale x 8 x i16> @llvm.vector.insert.nxv8i16.nxv1i16(<vscale x 8 x i16> poison, <vscale x 1 x i16> %vs12.i.i.i, i64 0)
	br label %loopIR3.i.i			br label %loopIR3.i.i

	loopIR3.i.i: ; preds = %loopIR3.i.i, %loopIR.preheader.i.i			loopIR3.i.i: ; preds = %loopIR3.i.i, %loopIR.preheader.i.i
	%v376 = load <vscale x 8 x i8>, ptr addrspace(1) null, align 8			%v376 = load <vscale x 8 x i8>, ptr addrspace(1) null, align 8
	Show All 15 Lines

Revision Contents

Diff 499452
