This is an archive of the discontinued LLVM Phabricator instance.

Differential D67698

[RISCV] Remove RA from reserved register to use as callee saved register
Closed, Public

Authored by shiva0217 on Sep 18 2019, 2:00 AM.

Details

Reviewers

asb
lenary
luismarques

Commits

rGc1498e37abe6: [RISCV] Remove RA from reserved register to use as callee saved register

Summary

Remove RA from the reserved register list so that it can be used as a callee-saved register.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

shiva0217 created this revision. Sep 18 2019, 2:00 AM

Herald added a project: Restricted Project. Sep 18 2019, 2:00 AM

Herald added subscribers: pzheng, s.egerton, Jim and 20 others.

We discussed this in the RISC-V meeting on 19 Sep 2019.

Pros: GCC for RISC-V and LLVM for ARM and AArch64 seem to do the same. It can help in situations with particularly bad register pressure requirements.
Cons: It can make debugging a lot harder, though this seems not to be an issue in GDB for RISC-V.

I think we don't want to have yet another configuration flag to control this.

It would be good to see some performance comparison, but I realise you may not be able to release internal benchmarks, and we have no public benchmarking system for RISC-V.

In reply to @lenary's comment above (D67698#1675370):

I can't reveal the details of the benchmark, but the most significant case improves by about 1%. It might be reasonable to free RA for register allocation, since most of the other backends already do so.

In CoreMark-Pro, with an execution model of 1 instr == 1 cycle, the result I get with this patch is that sha-test improves by 1.43% on RV64 (GC, LP64D). For all other sub-benchmarks the performance differences round to 0.00%. There is no change to sha-test on RV32 before and after the patch.

We had a discussion internally at lowRISC about this. I no longer think that debuggability is a concern, as FP/CFI-info is far more important than the return address. Alex has raised that preserving RA is useful for some lightweight tracing applications, but I feel that can be solved with the implementation of -ffixed-x1 (D67185).

With that in mind, we're happy to approve this patch, pending the following:

Please can you rebase it? It applies, but tests fail. Looks like a simple update caused by a change to the materialisation of the address of var.

Rebase the test case.

Good to know; -ffixed-x1 seems to be a great solution.

Herald added a subscriber: hiraditya. Oct 14 2019, 7:59 PM

LGTM, thanks @shiva0217

This revision is now accepted and ready to land. Oct 28 2019, 7:06 AM

Herald added a subscriber: sameer.abuasal. Oct 28 2019, 7:07 AM

Closed by commit rGc1498e37abe6: [RISCV] Remove RA from reserved register to use as callee saved register (authored by shiva0217). Oct 28 2019, 8:34 PM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path	Size
llvm/lib/Target/RISCV/RISCVRegisterInfo.cpp	1 line
llvm/test/CodeGen/RISCV/callee-saved-gprs.ll	54 lines

Diff 226834

llvm/lib/Target/RISCV/RISCVRegisterInfo.cpp

[71 lines not shown; enclosing function: BitVector RISCVRegisterInfo::getReservedRegs(const MachineFunction &MF) const]
   // Mark any registers requested to be reserved as such
   for (size_t Reg = 0; Reg < getNumRegs(); Reg++) {
     if (MF.getSubtarget<RISCVSubtarget>().isRegisterReservedByUser(Reg))
       markSuperRegs(Reserved, Reg);
   }

   // Use markSuperRegs to ensure any register aliases are also reserved
   markSuperRegs(Reserved, RISCV::X0); // zero
-  markSuperRegs(Reserved, RISCV::X1); // ra
   markSuperRegs(Reserved, RISCV::X2); // sp
   markSuperRegs(Reserved, RISCV::X3); // gp
   markSuperRegs(Reserved, RISCV::X4); // tp
   if (TFI->hasFP(MF))
     markSuperRegs(Reserved, RISCV::X8); // fp
   assert(checkAllSuperRegsMarked(Reserved));
   return Reserved;
 }
[89 lines not shown]
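
As a rough illustration of the change to getReservedRegs, here is a toy C++ model. It is not the LLVM API: the RegSet type, the Reg enum, and the reservedRegs and isAllocatable helpers are hypothetical names invented for this sketch. It assumes that a user-supplied set stands in for options such as -ffixed-x1 (D67185), and it shows that ra (x1) stays out of the reserved set unless the user asks for it.

```cpp
#include <bitset>
#include <cassert>
#include <cstddef>

// Toy model only: the names below mirror the patch but are NOT the LLVM API.
constexpr std::size_t NumGPRs = 32;
using RegSet = std::bitset<NumGPRs>;

// Hypothetical register numbers for the GPRs the patch mentions.
enum Reg : std::size_t { X0 = 0, X1 = 1, X2 = 2, X3 = 3, X4 = 4, X8 = 8 };

// Builds the reserved set in the same shape as the patched function:
// user-reserved registers first, then the always-reserved ones.
// x1 (ra) is deliberately absent from the unconditional list.
RegSet reservedRegs(const RegSet &UserReserved, bool HasFP) {
  RegSet Reserved = UserReserved;
  Reserved.set(X0); // zero
  Reserved.set(X2); // sp
  Reserved.set(X3); // gp
  Reserved.set(X4); // tp
  if (HasFP)
    Reserved.set(X8); // fp
  return Reserved;
}

// A register is available to the allocator only if it is not reserved.
bool isAllocatable(const RegSet &Reserved, Reg R) {
  assert(R < NumGPRs && "register number out of range");
  return !Reserved.test(R);
}
```

With an empty user set, isAllocatable(reservedRegs(RegSet{}, false), X1) is true. Setting bit X1 in the user set makes it false again, which corresponds to the lightweight tracing use case raised in the review.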

llvm/test/CodeGen/RISCV/callee-saved-gprs.ll

[22 lines not shown]
 ; This function tests that RISCVRegisterInfo::getCalleeSavedRegs returns
 ; something appropriate.

 define void @callee() nounwind {
 ; RV32I-LABEL: callee:
 ; RV32I: # %bb.0:
 ; RV32I-NEXT: addi sp, sp, -80
-; RV32I-NEXT: sw s0, 76(sp)
-; RV32I-NEXT: sw s1, 72(sp)
-; RV32I-NEXT: sw s2, 68(sp)
-; RV32I-NEXT: sw s3, 64(sp)
-; RV32I-NEXT: sw s4, 60(sp)
-; RV32I-NEXT: sw s5, 56(sp)
-; RV32I-NEXT: sw s6, 52(sp)
-; RV32I-NEXT: sw s7, 48(sp)
-; RV32I-NEXT: sw s8, 44(sp)
-; RV32I-NEXT: sw s9, 40(sp)
-; RV32I-NEXT: sw s10, 36(sp)
-; RV32I-NEXT: sw s11, 32(sp)
+; RV32I-NEXT: sw ra, 76(sp)
+; RV32I-NEXT: sw s0, 72(sp)
+; RV32I-NEXT: sw s1, 68(sp)
+; RV32I-NEXT: sw s2, 64(sp)
+; RV32I-NEXT: sw s3, 60(sp)
+; RV32I-NEXT: sw s4, 56(sp)
+; RV32I-NEXT: sw s5, 52(sp)
+; RV32I-NEXT: sw s6, 48(sp)
+; RV32I-NEXT: sw s7, 44(sp)
+; RV32I-NEXT: sw s8, 40(sp)
+; RV32I-NEXT: sw s9, 36(sp)
+; RV32I-NEXT: sw s10, 32(sp)
+; RV32I-NEXT: sw s11, 28(sp)
 ; RV32I-NEXT: lui a0, %hi(var)
 ; RV32I-NEXT: lw a1, %lo(var)(a0)
-; RV32I-NEXT: sw a1, 28(sp)
+; RV32I-NEXT: sw a1, 24(sp)
 ; RV32I-NEXT: addi a2, a0, %lo(var)
 ;
 ; RV32I-WITH-FP-LABEL: callee:
 ; RV32I-WITH-FP: # %bb.0:
 ; RV32I-WITH-FP-NEXT: addi sp, sp, -80
 ; RV32I-WITH-FP-NEXT: sw ra, 76(sp)
 ; RV32I-WITH-FP-NEXT: sw s0, 72(sp)
 ; RV32I-WITH-FP-NEXT: sw s1, 68(sp)
[11 lines not shown]
 ; RV32I-WITH-FP-NEXT: lui a0, %hi(var)
 ; RV32I-WITH-FP-NEXT: lw a1, %lo(var)(a0)
 ; RV32I-WITH-FP-NEXT: sw a1, -56(s0)
 ; RV32I-WITH-FP-NEXT: addi a2, a0, %lo(var)
 ;
 ; RV64I-LABEL: callee:
 ; RV64I: # %bb.0:
 ; RV64I-NEXT: addi sp, sp, -144
-; RV64I-NEXT: sd s0, 136(sp)
-; RV64I-NEXT: sd s1, 128(sp)
-; RV64I-NEXT: sd s2, 120(sp)
-; RV64I-NEXT: sd s3, 112(sp)
-; RV64I-NEXT: sd s4, 104(sp)
-; RV64I-NEXT: sd s5, 96(sp)
-; RV64I-NEXT: sd s6, 88(sp)
-; RV64I-NEXT: sd s7, 80(sp)
-; RV64I-NEXT: sd s8, 72(sp)
-; RV64I-NEXT: sd s9, 64(sp)
-; RV64I-NEXT: sd s10, 56(sp)
-; RV64I-NEXT: sd s11, 48(sp)
+; RV64I-NEXT: sd ra, 136(sp)
+; RV64I-NEXT: sd s0, 128(sp)
+; RV64I-NEXT: sd s1, 120(sp)
+; RV64I-NEXT: sd s2, 112(sp)
+; RV64I-NEXT: sd s3, 104(sp)
+; RV64I-NEXT: sd s4, 96(sp)
+; RV64I-NEXT: sd s5, 88(sp)
+; RV64I-NEXT: sd s6, 80(sp)
+; RV64I-NEXT: sd s7, 72(sp)
+; RV64I-NEXT: sd s8, 64(sp)
+; RV64I-NEXT: sd s9, 56(sp)
+; RV64I-NEXT: sd s10, 48(sp)
+; RV64I-NEXT: sd s11, 40(sp)
 ; RV64I-NEXT: lui a0, %hi(var)
 ; RV64I-NEXT: lw a1, %lo(var)(a0)
-; RV64I-NEXT: sd a1, 40(sp)
+; RV64I-NEXT: sd a1, 32(sp)
 ; RV64I-NEXT: addi a2, a0, %lo(var)
 ;
 ; RV64I-WITH-FP-LABEL: callee:
 ; RV64I-WITH-FP: # %bb.0:
 ; RV64I-WITH-FP-NEXT: addi sp, sp, -160
 ; RV64I-WITH-FP-NEXT: sd ra, 152(sp)
 ; RV64I-WITH-FP-NEXT: sd s0, 144(sp)
 ; RV64I-WITH-FP-NEXT: sd s1, 136(sp)
[151 lines not shown]
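
The offset shifts in the RV32I and RV64I checks follow from one extra save slot at the top of the frame: ra takes the highest slot, s0 to s11 each move down by one slot, and the first spill slot moves down with them. The C++ sketch below reproduces that arithmetic. The saveSlotOffsets helper is a hypothetical name for this illustration, and it assumes that the slots are packed contiguously downward from the top of the frame, as the checks shown suggest; it is not how LLVM's frame lowering is implemented.

```cpp
#include <cassert>
#include <vector>

// Hypothetical helper: sp-relative offsets of NumSaved callee-saved slots,
// packed downward from the top of a frame of FrameSize bytes.
std::vector<int> saveSlotOffsets(int FrameSize, int NumSaved, int SlotSize) {
  assert(SlotSize > 0 && NumSaved >= 0 && "invalid slot description");
  assert(NumSaved * SlotSize <= FrameSize && "slots do not fit in the frame");
  std::vector<int> Offsets;
  Offsets.reserve(static_cast<std::size_t>(NumSaved));
  for (int I = 1; I <= NumSaved; ++I)
    Offsets.push_back(FrameSize - I * SlotSize);
  return Offsets;
}
```

For RV32I, saveSlotOffsets(80, 12, 4) ends at 32 (s11 before the patch), while saveSlotOffsets(80, 13, 4) starts at 76 (ra) and ends at 28 (s11 after the patch). For RV64I, saveSlotOffsets(144, 13, 8) runs from 136 down to 40, which matches the updated checks.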