
Details

Reviewers

power-llvm-team
hfinkel
echristo
nemanjai
stefanp
lei
jsji

Group Reviewers

Restricted Project

Commits

rGc7be06797436: [PowerPC] Fix SH field overflow issue
rL373519: [PowerPC] Fix SH field overflow issue

Summary

Store rlwinm Rx, Ry, 32, 0, 31 as rlwinm Rx, Ry, 0, 0, 31 and store
rldicl Rx, Ry, 64, 0 as rldicl Rx, Ry, 0, 0. Otherwise the SH field overflows
and an assertion fails in the assembly printing stage.
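
The arithmetic behind this summary can be sketched in a few lines of C++. This is only an illustration under stated assumptions: `computeSH` is a hypothetical helper, not a function in LLVM, and it models only the SH operand of a shift-by-immediate that is implemented as a rotate-and-mask.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper for illustration only; it is not part of LLVM.
// Returns the SH (rotate amount) operand for a shift-by-immediate that is
// implemented as a rotate-and-mask on a register of Width bits (32 or 64).
uint64_t computeSH(uint64_t ShAmt, bool RightShift, unsigned Width) {
  assert((Width == 32 || Width == 64) && "unsupported register width");
  assert(ShAmt < Width && "shift amount must be smaller than the width");
  // A right shift by N is a left rotate by Width - N. For N == 0 that
  // would be Width itself, which does not fit in the SH field, so a zero
  // shift is stored as a rotate by 0.
  if (ShAmt == 0)
    return 0;
  return RightShift ? Width - ShAmt : ShAmt;
}
```

With this sketch, `computeSH(0, true, 32)` yields 0 rather than 32, and `computeSH(0, true, 64)` yields 0 rather than 64, which matches the two rewrites described in the summary.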

Diff Detail

Event Timeline

Yi-Hong.Lyu created this revision. Aug 29 2019, 9:56 PM

Herald added a project: Restricted Project. Aug 29 2019, 9:57 PM

Herald added subscribers: llvm-commits, shchenz, jsji and 3 others.

The fix looks good to me.
However, I feel that the test is over-specified. What you are looking for is a rotate or a shift with an immediate of zero, and you want to make sure that we can safely print those kinds of instructions to assembly. I feel that using the update_llc_test_checks.py script only produces a test that will fail when other changes are made later on, forcing the developer of that changeset to regenerate your test.

llvm/test/CodeGen/PowerPC/convert-rr-to-ri-instrs.ll
5 ↗	(On Diff #218023)	Should add a comment here to say what you are testing.
11 ↗	(On Diff #218023)	So, if I understand correctly, this is the instruction that we had trouble printing: `slwi r5, r3, 0`. I think the test should just look for this instruction and make sure that the zero is there correctly. The test as it stands is over-specified. The tool to generate tests is nice, but it creates overhead for the future: when scheduling or register allocation changes slightly, tests like this one would need to be updated. I think a set of simpler checks might work better. Look for the label (CHECK-LABEL) as you have done. Then look for the `slwi` with the zero, making sure you use some kind of wildcard for the register numbers. I would usually also look for the `blr`, but you don't have one for this case so that's fine.
47 ↗	(On Diff #218023)	Same goes for this. You can simplify the checks quite a bit here.

Herald added a subscriber: wuzish. Aug 30 2019, 12:57 PM

stefanp requested changes to this revision. Aug 30 2019, 1:01 PM

This revision now requires changes to proceed. Aug 30 2019, 1:01 PM

Yi-Hong.Lyu edited the summary of this revision. Aug 31 2019, 2:27 PM

Address Stefan's comments

ping

LGTM. I'll let Stefan give the final ack though.

Sorry it took me so long to get back to this.
LGTM.

This revision is now accepted and ready to land. Sep 12 2019, 7:10 AM

I think this also LGTM overall.

llvm/test/CodeGen/PowerPC/convert-rr-to-ri-instrs.ll
2 ↗	(On Diff #218252)	We don't need this line, do we?

Can you please use a MIR test instead? Thanks.

llvm/lib/Target/PowerPC/PPCInstrInfo.cpp
3593–3594	Comments should be updated too.
llvm/test/CodeGen/PowerPC/convert-rr-to-ri-instrs.ll
3 ↗	(On Diff #218252)	Remove this line as well.
15 ↗	(On Diff #218252)	run-pass may not trigger the assert, as we don't call the InstPrinters. But I think you should still be able to run a MIR test and trigger the assert with `start-before`, e.g.:

$ llc -O3 -stop-before ppc-mi-peepholes ../llvm-project/llvm/test/CodeGen/PowerPC/convert-rr-to-ri-instrs.ll -o t.mir
$ llc -O3 -start-before ppc-mi-peepholes t.mir
llc: .../llvm/lib/Target/PowerPC/MCTargetDesc/PPCInstPrinter.cpp:327: void llvm::PPCInstPrinter::printU5ImmOperand(const llvm::MCInst*, unsigned int, llvm::raw_ostream&): Assertion `Value <= 31 && "Invalid u5imm argument!"' failed.
Stack dump:
0. Program arguments: llc -O3 -start-before ppc-mi-peepholes t.mir
1. Running pass 'Function Pass Manager' on module 't.mir'.
2. Running pass 'Linux PPC Assembly Printer' on function '@special_right_shift32_0'
...
Aborted (core dumped)
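
The assertion in the trace above rejects any value that does not fit in an unsigned 5-bit immediate. A minimal sketch of that kind of range check follows; `fitsInUnsignedBits` is a hypothetical name chosen for illustration and is not the LLVM implementation.

```cpp
#include <cstdint>

// Hypothetical helper, not the LLVM implementation: returns true if Value
// can be encoded in an unsigned immediate field that is Bits wide.
bool fitsInUnsignedBits(uint64_t Value, unsigned Bits) {
  // Guard the shift: shifting a 64-bit value by 64 or more is undefined.
  return Bits >= 64 || Value < (uint64_t{1} << Bits);
}
```

Under this sketch, 31 fits in 5 bits but 32 does not, which is why a rotate amount of 32 trips the printer's check while a rotate amount of 0 does not.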

This revision now requires changes to proceed. Sep 13 2019, 8:23 AM

Address Jinsong's comment

Yi-Hong.Lyu marked 6 inline comments as done. Sep 17 2019, 10:12 PM

Yi-Hong.Lyu added inline comments.

llvm/test/CodeGen/PowerPC/convert-rr-to-ri-instrs.ll
2 ↗	(On Diff #218252)	No longer an issue since MIR doesn't have such field
3 ↗	(On Diff #218252)	No longer an issue since MIR doesn't have such field

Thanks for working on MIR test, I think we can reduce it a little further.

llvm/test/CodeGen/PowerPC/SH-field-overflow.mir
1 ↗	(On Diff #220612)	Add -mtriple please, don't rely on the `target triple` in optional embedded LLVM IR module.
10 ↗	(On Diff #220612)	We know exactly what we want to test in MIR, so why don't we reduce this MIR test further? E.g., a MIR with the following lines should be sufficient for 32 bit; you can add another module for 64 bits, and that should be all.

$ cat sh-overflow.mir
---
name: special_right_shift32_0
liveins:
  - { reg: '$x3'}
  - { reg: '$x4'}
tracksRegLiveness: true
body: |
  bb.0.entry:
    liveins: $r3, $r4
    renamable $r4 = LI 0
    renamable $r3 = SRW renamable $r3, renamable $r4
    BLR8 implicit $lr8, implicit $rm, implicit $x3
...

This revision now requires changes to proceed. Sep 18 2019, 11:16 AM

Yi-Hong.Lyu marked 4 inline comments as done. Sep 22 2019, 1:19 PM

Yi-Hong.Lyu added inline comments.

llvm/test/CodeGen/PowerPC/SH-field-overflow.mir
10 ↗	(On Diff #220612)	Multiple passes after ppc-mi-peepholes rely on SSA form. Apparently, the test case is not in SSA form, so we can't just use it.

Address Jinsong's comments

Yi-Hong.Lyu marked an inline comment as done. Sep 22 2019, 1:30 PM

I think the test can be simplified more, like what @jsji has mentioned and I think it should not be related to SSA form. Test is kinda like

# RUN: llc -O3 -mtriple=powerpc64le-unknown-linux-gnu -stop-after=ppc-pre-emit-peephole -verify-machineinstrs %s -o - | FileCheck %s
---
name: foo
alignment: 4
tracksRegLiveness: true
body: |
  bb.0.entry:
    liveins: $r3
    renamable $r4 = LI 0
    renamable $r5 = SRW renamable $r3, renamable killed $r4
    $r3 = COPY renamable killed $r5
    BLR implicit $lr, implicit $rm, implicit $r3
...

We can just use llc -stop-after=ppc-pre-emit-peephole here since we know the cause. Another nit: an NFC patch to pre-commit the test would also be preferable.

You are right, we should make sure we meet SSA constraint when writing the MIR for passes before RA.

However, this does NOT prevent us from further reducing the testcase.

llvm/test/CodeGen/PowerPC/SH-field-overflow.mir

10 ↗	(On Diff #220612)	Good point, sorry, I did not check SSA constraint for MIR carefully. My example above is mostly just to demo the idea of how to reduce the case.

How about something like these?

$ cat t2.mir
---
name:            special_right_shift32_0
liveins:
  - { reg: '$r4'}
  - { reg: '$r5'}
tracksRegLiveness: true
body:             |
  bb.0.entry:
    liveins: $r5, $r4
  
    renamable $r4 = LI 0
    renamable $r3 = SRW killed renamable $r5, killed renamable $r4, implicit-def $x3
    BLR8 implicit $lr8, implicit $rm, implicit killed $x3
...
$ cat t3.mir
---
name:            special_right_shift64_0
liveins:
  - { reg: '$r4'}
  - { reg: '$x5'}
tracksRegLiveness: true
body:             |
  bb.0.entry:
    liveins: $r4, $x5
  
    renamable $r4 = LI 0
    renamable $x3 = SRD killed renamable $x5, killed renamable $r4
    BLR8 implicit $lr8, implicit $rm, implicit killed $x3
...

In D66991#1679267, @jsji wrote:

You are right, we should make sure we meet SSA constraint when writing the MIR for passes before RA.

BTW: isSSA() in MIRParser will always return true for MIRs that do NOT have virtual regs,
I think this is a limitation for now, we might want to extend it to support MIR with all HWregs,
but for now, we should check the code manually and carefully. Thanks for pointing out.

Reduce the MIR test cases (Address Jinsong's comments)

Yi-Hong.Lyu marked 2 inline comments as done. Sep 23 2019, 12:12 PM

Yi-Hong.Lyu added inline comments.

llvm/test/CodeGen/PowerPC/SH-field-overflow.mir
10 ↗	(On Diff #220612)	Thanks for your test cases. Both of them work. One concern is that they are non-SSA MIR tests, but we apply some SSA MIR passes (e.g., Live Variable Analysis) to them. I came up with reduced SSA MIR tests; what do you think?

I am OK with you using virtual regs, and the reduced case looks mostly good to me, but I think we should try to avoid relying on other opts before the peephole.

llvm/test/CodeGen/PowerPC/SH-field-overflow.mir
10 ↗	(On Diff #220612)	Why are they non-SSA?
llvm/test/CodeGen/PowerPC/sh-overflow.mir
1	`-start-after=phi-node-elimination` ? Why not `start-before=ppc-mi-peepholes`? Does this imply that we rely on some other passes to generate the necessary input? Can you come up with one that works with either `start-before=ppc-mi-peepholes` or `-start-after ppc-mi-peepholes -ppc-late-peephole` ?
50	nit: `alignment` and `maxAlignment` seem random here. Any reason you want to use 1 here?

nemanjai added inline comments. Sep 24 2019, 3:33 AM

llvm/test/CodeGen/PowerPC/SH-field-overflow.mir
10 ↗	(On Diff #220612)	I don't understand this question. SSA == Static Single Assignment form. Since `r4` is live-in and defined in the entry block, it is not SSA. But this is really arguing semantics. I haven't checked, but it is entirely possible that the MachineRegisterInfo does not compute the defs and uses of physical registers when consuming MIR. So if we are providing an MIR test case that we are passing to pre-RA passes, we should write them without the registers already allocated.
llvm/test/CodeGen/PowerPC/sh-overflow.mir
1	Yes, let's change the test case to have 2 RUN lines. One starts before MI Peephole. The other starts after it. The checks should be the same for both as we should do the right thing in both places. Also, please do not talk about the "current state of things" or "what the patch fixes" in the source/tests. These statements make sense only in the context of the patch now, but later down the road, such statements are meaningless. It should suffice to say something like: "Ensure we do not attempt to transform this into `srwi $r3, $r3, 0` in the form specified by ISA 3.0b (`rlwinm $r3, $r3, 32 - 0, 0, 31`)"
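
Why a rotate amount of 0 is a valid stand-in for `32 - 0` can be checked with a small model of rotate-and-mask. This is a sketch under stated assumptions: `rotl32` and `shiftRightViaRotate` are hypothetical names used only for illustration, and the model covers only the 32-bit logical right shift.

```cpp
#include <cstdint>

// Rotate a 32-bit value left by Amount, taken modulo 32 so that a rotate
// by 32 is well defined and equals a rotate by 0.
uint32_t rotl32(uint32_t Value, unsigned Amount) {
  Amount %= 32;
  if (Amount == 0)
    return Value;
  return (Value << Amount) | (Value >> (32 - Amount));
}

// Model of a logical right shift by N (0 <= N < 32) expressed as a left
// rotate by 32 - N followed by clearing the top N bits.
uint32_t shiftRightViaRotate(uint32_t Value, unsigned N) {
  uint32_t Mask = 0xFFFFFFFFu >> N;
  return rotl32(Value, 32 - N) & Mask;
}
```

In this model a rotate by 32 and a rotate by 0 produce the same value, so writing 0 into the operand changes nothing about the result; it only keeps the operand within the range the field can hold.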

jsji added inline comments. Sep 24 2019, 7:16 AM

llvm/test/CodeGen/PowerPC/SH-field-overflow.mir
10 ↗	(On Diff #220612)	Ah, yes, I forgot to remove the live-in for $r4 here. Thanks @nemanjai. Yes, I agree that if we are providing a MIR test to pre-RA passes, we should try our best to write it without the registers already allocated. I think we shouldn't assume that a test case without virtual registers is definitely NOT SSA, although it is indeed really error-prone without isSSA() check support in MIRParser, and so should be avoided if possible. Thanks.
llvm/test/CodeGen/PowerPC/sh-overflow.mir
1	Good point.

Yi-Hong.Lyu marked 2 inline comments as done. Sep 27 2019, 10:03 AM

Yi-Hong.Lyu added inline comments.

llvm/test/CodeGen/PowerPC/sh-overflow.mir
50	The special_right_shift32_0 is derived from

unsigned int test(unsigned int a, unsigned int b) { return a >> b; }

generated by `clang --target=powerpc-unknown-unknown`. In contrast, the special_right_shift64_0 is derived from

unsigned long test(unsigned long a, unsigned long b) { return a >> b; }

generated by `clang --target=powerpc64-unknown-unknown`. I just left `alignment` and `maxAlignment` as they were. What do you think `alignment` and `maxAlignment` should be?

jsji added inline comments. Sep 27 2019, 3:16 PM

llvm/test/CodeGen/PowerPC/sh-overflow.mir
50	Thanks for the explanation. I don't have a specific number in mind; I was just wondering why it is 1, since it is smaller than the 2 above.

Yi-Hong.Lyu marked 2 inline comments as done. Sep 30 2019, 12:37 AM

Yi-Hong.Lyu added inline comments.

llvm/test/CodeGen/PowerPC/sh-overflow.mir

Given the MIR:

$ cat special_right_shift32_0.mir
---
name:            test
alignment:       2
tracksRegLiveness: true
registers:
  - { id: 0, class: gprc }
  - { id: 1, class: gprc }
  - { id: 2, class: gprc }
liveins:
  - { reg: '$r3', virtual-reg: '%0' }
machineFunctionInfo: {}
body:             |
  bb.0.entry:
    liveins: $r3

    %1:gprc = LI 0
    %0:gprc = COPY $r3
    %2:gprc = SRW %0, %1
    $r3 = COPY %2
    BLR implicit $lr, implicit $rm, implicit $r3

...

$ llc -O3 -mtriple=powerpc64-unknown-linux-gnu -start-before=ppc-mi-peepholes special_right_shift32_0.mir -o special_right_shift32_0.before.s
$ llc -O3 -mtriple=powerpc64-unknown-linux-gnu -start-after=ppc-mi-peepholes -ppc-late-peephole special_right_shift32_0.mir -o special_right_shift32_0.after.s
$ diff special_right_shift32_0.before.s special_right_shift32_0.after.s
15a16
>       slwi 3, 3, 0

All the assembly in special_right_shift32_0.before.s is optimized out and it contains only blr[1]. In contrast, special_right_shift32_0.after.s has expected output[2]. That is, we get different results for the 2 RUN lines.

[1]

$ cat special_right_shift32_0.before.s
        .text
        .file   "special_right_shift32_0.mir"
        .globl  test                    # -- Begin function test
        .p2align        2
        .type   test,@function
        .section        .opd,"aw",@progbits
test:                                   # @test
        .p2align        3
        .quad   .Lfunc_begin0
        .quad   .TOC.@tocbase
        .quad   0
        .text
.Lfunc_begin0:
        .cfi_startproc
# %bb.0:                                # %entry
        blr
        .long   0
        .quad   0
.Lfunc_end0:
        .size   test, .Lfunc_end0-.Lfunc_begin0
        .cfi_endproc
                                        # -- End function

        .section        ".note.GNU-stack","",@progbits

[2]

$ cat special_right_shift32_0.after.s
        .text
        .file   "special_right_shift32_0.mir"
        .globl  test                    # -- Begin function test
        .p2align        2
        .type   test,@function
        .section        .opd,"aw",@progbits
test:                                   # @test
        .p2align        3
        .quad   .Lfunc_begin0
        .quad   .TOC.@tocbase
        .quad   0
        .text
.Lfunc_begin0:
        .cfi_startproc
# %bb.0:                                # %entry
        slwi 3, 3, 0
        blr
        .long   0
        .quad   0
.Lfunc_end0:
        .size   test, .Lfunc_end0-.Lfunc_begin0
        .cfi_endproc
                                        # -- End function

        .section        ".note.GNU-stack","",@progbits

According to https://llvm.org/docs/MIRLangRef.html#simplifying-mir-files:

The whole frameInfo section is often unnecessary if there is no special frame usage in the function.

I will remove the whole frameInfo section and leave alignment: 2 there for both cases.

jsji added inline comments. Sep 30 2019, 8:32 AM

llvm/test/CodeGen/PowerPC/sh-overflow.mir
1	I believe we do different opts depending on `PostRA`; that may be why you cannot get the same results with the two RUN lines above.
50	OK for me. Thanks.

Address Jinsong and Nemanja's comments

Yi-Hong.Lyu marked 7 inline comments as done. Oct 2 2019, 7:15 AM

LGTM. Thanks for the patience during review.

llvm/test/CodeGen/PowerPC/sh-overflow.mir
20	This might be a little confusing: why avoid transforming into a form from ISA 3.0b, which normally is supposed to be correct? Yes, I think you are trying to express that ISA 3.0b also has the bug when describing the extended mnemonics of `srwi` and `srdi`. In that case, I think we should contact the ISA team to fix it. We don't need to emphasize that in the comments here.

jsji accepted this revision. Oct 2 2019, 7:29 AM

This revision is now accepted and ready to land. Oct 2 2019, 7:29 AM

Closed by commit rL373519: [PowerPC] Fix SH field overflow issue (authored by Yi-Hong.Lyu). Oct 2 2019, 1:27 PM

This revision was automatically updated to reflect the committed changes.

Diff 221381

llvm/lib/Target/PowerPC/PPCInstrInfo.cpp

@@ if (SpecialShift32 || SpecialShift64) {
       // just convert this to a COPY. Can't do this post-RA since we've already
       // cleaned up the copies.
       else if (!SetCR && ShAmt == 0 && !PostRA) {
         MI.RemoveOperand(2);
         MI.setDesc(get(PPC::COPY));
       } else {
         // The 32 bit and 64 bit instructions are quite different.
         if (SpecialShift32) {
-          // Left shifts use (N, 0, 31-N), right shifts use (32-N, N, 31).
-          uint64_t SH = RightShift ? 32 - ShAmt : ShAmt;
+          // Left shifts use (N, 0, 31-N).
+          // Right shifts use (32-N, N, 31) if 0 < N < 32.
+          // use (0, 0, 31) if N == 0.
+          uint64_t SH = ShAmt == 0 ? 0 : RightShift ? 32 - ShAmt : ShAmt;
           uint64_t MB = RightShift ? ShAmt : 0;
           uint64_t ME = RightShift ? 31 : 31 - ShAmt;
           replaceInstrOperandWithImm(MI, III.OpNoForForwarding, SH);
           MachineInstrBuilder(*MI.getParent()->getParent(), MI).addImm(MB)
             .addImm(ME);
         } else {
-          // Left shifts use (N, 63-N), right shifts use (64-N, N).
-          uint64_t SH = RightShift ? 64 - ShAmt : ShAmt;
+          // Left shifts use (N, 63-N).
+          // Right shifts use (64-N, N) if 0 < N < 64.
+          // use (0, 0) if N == 0.
+          uint64_t SH = ShAmt == 0 ? 0 : RightShift ? 64 - ShAmt : ShAmt;
           uint64_t ME = RightShift ? ShAmt : 63 - ShAmt;
           replaceInstrOperandWithImm(MI, III.OpNoForForwarding, SH);
           MachineInstrBuilder(*MI.getParent()->getParent(), MI).addImm(ME);
         }
       }
     } else
       replaceInstrOperandWithImm(MI, ConstantOpNo, Imm);
   }

llvm/test/CodeGen/PowerPC/sh-overflow.mir

This file was added.

# RUN: llc -O3 -mtriple=powerpc64le-unknown-linux-gnu -start-after=phi-node-elimination -ppc-asm-full-reg-names -verify-machineinstrs %s -o - | FileCheck %s

---
name: special_right_shift32_0
alignment: 2
tracksRegLiveness: true
registers:
  - { id: 0, class: gprc }
  - { id: 1, class: gprc }
  - { id: 2, class: gprc }
liveins:
  - { reg: '$r3', virtual-reg: '%0' }
frameInfo:
  maxAlignment: 4
machineFunctionInfo: {}
body: |
  bb.0.entry:
    liveins: $r3

    ; PowerPC Pre-Emit Peephole converts
    ; renamable $r4 = LI 0
    ; renamable $r3 = SRW killed renamable $r3, killed renamable $r4
    ; to
    ; renamable $r3 = RLWINM killed renamable $r3, 32, 0, 31
    ; so the assertion fails in assembly printing stage. The fix converts it to
    ; renamable $r3 = RLWINM killed renamable $r3, 0, 0, 31
    ; instead.

    ; CHECK-LABEL: special_right_shift32_0:
    ; CHECK: slwi r[[#]], r[[#]], 0

    %0:gprc = COPY killed $r3
    %1:gprc = LI 0
    %2:gprc = SRW killed %0, killed %1
    $r3 = COPY killed %2
    BLR implicit $lr, implicit $rm, implicit killed $r3

...
---
name: special_right_shift64_0
alignment: 2
tracksRegLiveness: true
registers:
  - { id: 0, class: g8rc }
  - { id: 1, class: gprc }
  - { id: 2, class: g8rc }
liveins:
  - { reg: '$x3', virtual-reg: '%0' }
frameInfo:
  maxAlignment: 1
machineFunctionInfo: {}
body: |
  bb.0.entry:
    liveins: $x3

    ; PowerPC Pre-Emit Peephole converts
    ; renamable $r4 = LI 0
    ; renamable $x3 = SRD killed renamable $x3, killed renamable $r4
    ; to
    ; renamable $x3 = RLDICL killed renamable $x3, 64, 0
    ; so the assertion fails in assembly printing stage. The fix converts it to
    ; renamable $x3 = RLDICL killed renamable $x3, 0, 0
    ; instead.

    ; CHECK-LABEL: special_right_shift64_0:
    ; CHECK: rotldi r[[#]], r[[#]], 0

    %0:g8rc = COPY killed $x3
    %1:gprc = LI 0
    %2:g8rc = SRD killed %0, killed %1
    $x3 = COPY killed %2
    BLR8 implicit $lr8, implicit $rm, implicit killed $x3

...

This is an archive of the discontinued LLVM Phabricator instance.

[PowerPC] Fix SH field overflow issue
ClosedPublic
