This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/Target/RISCV/
-
Target/
-
RISCV/
-
RISCVFrameLowering.h
-
RISCVRegisterInfo.h
25/32
RISCVRegisterInfo.cpp
-
test/CodeGen/RISCV/
-
CodeGen/
-
RISCV/
1
local-stack-slot-allocation.ll

Differential D98101

[RISCV] Enable the LocalStackSlotAllocation pass support
ClosedPublic

Authored by craig.topper on Mar 5 2021, 9:35 PM.

Download Raw Diff

Details

Reviewers

rogfer01
jrtc27
luismarques
HsiangKai
StephenFan
LiDongjin
craig.topper

Commits

rG4554663bc0da: Recommit "[RISCV] Enable the LocalStackSlotAllocation pass support"
rG180397cdded6: [RISCV] Enable the LocalStackSlotAllocation pass support.

Summary

For RISC-V, load/store(exclude vector load/store) instructions only has a 12 bit immediate operand. If the offset is out-of-range, it must make use of a temp register to make up this offset. If between these offsets, they have a small(IsInt<12>) relative offset, LocalStackSlotAllocation pass can find a value as frame base register's value, and replace the origin offset with this register's value plus the relative offset.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

StephenFan created this revision.Mar 5 2021, 9:35 PM

Herald added subscribers: vkmr, frasercrmck, evandro and 21 others. · View Herald TranscriptMar 5 2021, 9:35 PM

StephenFan requested review of this revision.Mar 5 2021, 9:35 PM

Herald added a project: Restricted Project. · View Herald TranscriptMar 5 2021, 9:35 PM

Herald added subscribers: llvm-commits, MaskRay. · View Herald Transcript

StephenFan edited the summary of this revision. (Show Details)Mar 5 2021, 9:36 PM

craig.topper added inline comments.Mar 5 2021, 9:58 PM

llvm/test/CodeGen/RISCV/local-stack-allocation.ll
1 ↗	(On Diff #328731)	Is it possible to pre-commit this test case and only show the changes here in this patch?

craig.topper added inline comments.Mar 5 2021, 10:03 PM

llvm/lib/Target/RISCV/RISCVRegisterInfo.cpp
519	I think vector load/store can have a frameindex operand. There's no immediate field so it will get split out into an ADDI in eliminateFrameIndex, but I think that happens later.

StephenFan added inline comments.Mar 5 2021, 10:10 PM

llvm/lib/Target/RISCV/RISCVRegisterInfo.cpp

519

In the RISCVIselDAGToDAG.cpp:

case ISD::FrameIndex: {
    SDValue Imm = CurDAG->getTargetConstant(0, DL, XLenVT);
    int FI = cast<FrameIndexSDNode>(Node)->getIndex();
    SDValue TFI = CurDAG->getTargetFrameIndex(FI, VT);
    ReplaceNode(Node, CurDAG->getMachineNode(RISCV::ADDI, DL, VT, TFI, Imm));
    return;
  }

So the llvm ir like this :

%addr = alloca i8
call void callee(i8* %addr)

the %addr will be selected to ADDI

craig.topper added inline comments.Mar 5 2021, 10:20 PM

llvm/lib/Target/RISCV/RISCVRegisterInfo.cpp
519	Right. I was just asking why the comment says vector load/store are different?

StephenFan added inline comments.Mar 5 2021, 10:25 PM

llvm/lib/Target/RISCV/RISCVRegisterInfo.cpp
519	Oh, sorry. You're right, the vector load/store can have a FrameIndex Operand. I made a mistake.

StephenFan marked 2 inline comments as done.Mar 5 2021, 10:25 PM

Pre-commit local-stack-allocation.ll test.

StephenFan marked an inline comment as done.Mar 5 2021, 10:32 PM

Harbormaster completed remote builds in B92463: Diff 328736.Mar 5 2021, 10:33 PM

StephenFan added inline comments.Mar 5 2021, 10:39 PM

llvm/test/CodeGen/RISCV/local-stack-allocation.ll
17 ↗	(On Diff #328736)	This instruction will be eliminated in D92479 which I will land it soon after.

Fix comment error.

Harbormaster completed remote builds in B92465: Diff 328738.Mar 5 2021, 10:52 PM

Harbormaster completed remote builds in B92458: Diff 328731.Mar 6 2021, 3:42 PM

Ping.

Could you please rebase this to account for D98716?

I _think_ the right fix is to change the call for needsStackRealignment to shouldRealignStack. But I'm seeing multiple runtime failures for the GCC torture suite with the patch applied. e.g. 20031012-1.c at O0 for rv32imafdc ilp32.

This revision now requires changes to proceed.Apr 1 2021, 3:37 AM

Fix error.

Harbormaster completed remote builds in B97879: Diff 336311.Apr 8 2021, 9:52 PM

In D98101#2663432, @asb wrote:

Could you please rebase this to account for D98716?

I _think_ the right fix is to change the call for needsStackRealignment to shouldRealignStack. But I'm seeing multiple runtime failures for the GCC torture suite with the patch applied. e.g. 20031012-1.c at O0 for rv32imafdc ilp32.

Hi @asb . I wrote a script to compile and run the gcc c torture tests on qemu. According to the test result and as you said, multiple runtime failures appeared. However, 20031012-1.c at O0 for rv32imafdc ilp32 didn't fail on qemu. And I have fixed the failures and updated the patch. Can you help me testing the gcc c torture test again? Thanks!

The GCC torture suite now gets a 100% pass rate for me - thanks for fixing. I've left various minor notes about comment phrasing and formatting etc.

I don't feel I've stepped through the logic of the patch carefully enough yet, but hopefully the suggested edits are actionable in the meantime.

llvm/lib/Target/RISCV/RISCVRegisterInfo.cpp
501	No need to copy this doc comment TargetRegisterInfo.h.
508	returns => Returns
510	I think this sentence needs rewriting - I can't quite follow it.
515	clang-format prefers this with a space after the first `;`
519	This thread is marked as done, but the comment still same to make the same claim that vector load/store don't have a frameindex?
538	I think: "the maximum possible offset relative to the frame pointer."?
544	bytes => byte and "maximum possible offset relative to the stack pointer." I think
560	Nit: end with full stop
579	Nit: re-wrap and end sentence in full stop.
589	Nit: end with full stop
597	Nit: end with full stop.

Address @asb 's comment

Harbormaster completed remote builds in B99300: Diff 338278.Apr 16 2021, 11:25 PM

Delete "that" in comment

Harbormaster completed remote builds in B99301: Diff 338279.Apr 16 2021, 11:27 PM

StephenFan marked 9 inline comments as done.Apr 16 2021, 11:50 PM

StephenFan added inline comments.

llvm/lib/Target/RISCV/RISCVRegisterInfo.cpp
510	emm, This sentence is copied from TargetRegisterInfo.h. It means that, For RISCV, if a frame index operand has a Offset that out-of-range 12 bits, this function will return true to indicate this frame index operand needs a frame base register.

Full end

Harbormaster completed remote builds in B99460: Diff 338495.Apr 19 2021, 5:43 AM

Ping.

asb added inline comments.May 13 2021, 12:50 AM

llvm/lib/Target/RISCV/RISCVRegisterInfo.cpp
510	Yes that was my bad - the sentence does indeed make sense as written.

I've benchmarked the impact of this patch with CoreMark and Embench and it looked good. In summary, there were no or minimal performance differences but there were various small improvements to code size, which are probably similar to your test case.
I was going to add other checks like running the llvm test suite but the patch no longer applies. Can you refresh it?

jrtc27 added inline comments.May 13 2021, 7:43 AM

llvm/lib/Target/RISCV/RISCVRegisterInfo.cpp
520–524	Can we not just look and see if it both has an FI and is an I or S type instruction? This isn't really flexible, and I don't see why it's needed either (and I'd have to more than double the size of this switch statement downstream in CHERI-LLVM to add both capability base and capability value instructions).
535–537	What about floating point registers? Can we also generalise this to iterate over the list of saved registers rather than copying the ABI to yet another place (and thus yet another place we'd have to change downstream in CHERI-LLVM)?
542	Why 128? A hard-coded constant common across all ISAs seems fishy.
582	We use the less-confusing name FIOperandNum elsewhere
584	The copies of this in RegisterScavenging and other backends have `assert(i < MI.getNumOperands() && "Instr doesn't have FrameIndex operand!");`
586–588	I don't see why the first part matters. Only include the relevant information, that "FrameIndex operands are always represented as a register followed by an immediate".
589	This variable seems pointless to me, just inline the +1
llvm/test/CodeGen/RISCV/local-stack-allocation.ll
30 ↗	(On Diff #338495)	Please fix this test, either before you pre-commit it or now as another pre-commit if you've already done so.

asb mentioned this in D134851: [RISCV][WIP] Enable the local stack allocation pass for RISC-V..Sep 29 2022, 3:32 AM

craig.topper added inline comments.Sep 29 2022, 1:09 PM

llvm/lib/Target/RISCV/RISCVRegisterInfo.cpp
519	I think vector load/store no longer have frame indices.
520–524	This is missing LBU, LHU, and LWU. I agree with @jrtc27 we should check the I or S type instead.
533	No need to say 'int' llvm almost always uses `unsigned` by itself.
535–537	Are X3 and X4 really callee saved to the stack? They're reserved so I don't think we ever spill them.

Herald added a project: Restricted Project. · View Herald TranscriptSep 29 2022, 1:09 PM

Herald added subscribers: sunshaoce, • pcwang-thead, eopXD and 2 others. · View Herald Transcript

Address Comments.

Harbormaster completed remote builds in B189701: Diff 464286.Sep 30 2022, 8:24 AM

StephenFan marked 9 inline comments as done.Sep 30 2022, 8:24 AM

Improve comments.

Harbormaster completed remote builds in B189702: Diff 464289.Sep 30 2022, 8:32 AM

StephenFan added inline comments.Sep 30 2022, 8:48 AM

llvm/lib/Target/RISCV/RISCVRegisterInfo.cpp
542	This value should be an experimental target-dependent value. But I don't know yet which value is appropriate for riscv.

craig.topper added inline comments.Sep 30 2022, 10:26 AM

llvm/lib/Target/RISCV/RISCVRegisterInfo.cpp
535	Is ReservedByUser not already cover by getReservedRegs?
542	Can we copy the comments from ARM and AArch64 here including the FIXME for 128?

Remove check user reserved regsiter list.
Improve comments.

Harbormaster completed remote builds in B189887: Diff 464553.Oct 2 2022, 7:39 AM

craig.topper added inline comments.Oct 4 2022, 9:52 PM

llvm/lib/Target/RISCV/RISCVRegisterInfo.cpp
602	Should we assert that it is I or S format?

Add assertion.

Harbormaster completed remote builds in B190479: Diff 465382.Oct 5 2022, 7:53 AM

I think we need this change from D134851 to match SVE from D83859

 bool isStackIdSafeForLocalArea(unsigned StackId) const override {
  // We don't support putting RVV objects into the pre-allocated local
  // frame block at the moment.
  return StackId != TargetStackID::ScalableVector;
}

Implement target hook.

Harbormaster completed remote builds in B191748: Diff 467161.Oct 12 2022, 9:26 AM

craig.topper added inline comments.Oct 12 2022, 10:48 AM

llvm/test/CodeGen/RISCV/local-stack-slot-allocation.ll
7–8	Remove TODO

Remove TODO.

Harbormaster completed remote builds in B191890: Diff 467357.Oct 12 2022, 10:06 PM

Ping :)

LGTM

This revision was not accepted when it landed; it landed in state Needs Review.Oct 19 2022, 1:16 AM

Closed by commit rG82c820b95cf7: [RISCV] Enable the LocalStackSlotAllocation pass support (authored by StephenFan). · Explain Why

This revision was automatically updated to reflect the committed changes.

StephenFan added a commit: rG82c820b95cf7: [RISCV] Enable the LocalStackSlotAllocation pass support.

I'm seeing failures in the llvm-testsuite on riscv64-unknown-linux-gnu with -O2 (no vectors) which git bisect attributes to this change.

Failed Tests (3):
  test-suite :: MultiSource/Benchmarks/DOE-ProxyApps-C++/CLAMR/CLAMR.test
  test-suite :: MultiSource/Benchmarks/tramp3d-v4/tramp3d-v4.test
  test-suite :: SingleSource/UnitTests/matrix-types-spec.test

Has anyone seen something similar?

In D98101#3883090, @rogfer01 wrote:
I'm seeing failures in the llvm-testsuite on riscv64-unknown-linux-gnu with -O2 (no vectors) which git bisect attributes to this change.
Failed Tests (3):
  test-suite :: MultiSource/Benchmarks/DOE-ProxyApps-C++/CLAMR/CLAMR.test
  test-suite :: MultiSource/Benchmarks/tramp3d-v4/tramp3d-v4.test
  test-suite :: SingleSource/UnitTests/matrix-types-spec.test
Has anyone seen something similar?

I think we're seeing those failures too.

craig.topper added a reverting change: rGdc452a76c27a: Revert "[RISCV] Enable the LocalStackSlotAllocation pass support".Nov 1 2022, 8:20 PM

craig.topper reopened this revision.Nov 1 2022, 8:21 PM

dnpetrov-sc added a subscriber: dnpetrov-sc.Nov 18 2022, 3:44 AM

This comment was removed by dnpetrov-sc.

iiiyours added a subscriber: iiiyours.Nov 20 2022, 10:07 PM

LiDongjin added a subscriber: LiDongjin.Nov 27 2022, 7:00 PM

LiDongjin removed a subscriber: LiDongjin.

Fix the failures in the llvm-testsuite on riscv64-unknown-linux-gnu with -O2 (no vectors).

For the failures in the llvm-testsuite, it seems some problems for Add instruction which generates wrong frame index.
Also in Arm and AArch64, No need to support for Add instruction.
Therefore load/store(exclude vector load/store) instruction is enough for LocalStackSlotAllocation pass.

Harbormaster completed remote builds in B199673: Diff 478127.Nov 27 2022, 7:24 PM

If it would be better to add a test case that tests we don't do local stack slot allocation on ADDI instruction?

LiDongjin removed a commit: rG82c820b95cf7: [RISCV] Enable the LocalStackSlotAllocation pass support.Nov 30 2022, 1:13 AM

LiDongjin updated this revision to Diff 478853.Nov 30 2022, 1:35 AM

Harbormaster completed remote builds in B200205: Diff 478853.Nov 30 2022, 2:27 AM

LGTM

Reverse ping. Can we commit this?

This revision was not accepted when it landed; it landed in state Needs Review.Dec 21 2022, 12:46 AM

Closed by commit rG180397cdded6: [RISCV] Enable the LocalStackSlotAllocation pass support. (authored by LiDongjin). · Explain Why

This revision was automatically updated to reflect the committed changes.

LiDongjin added a commit: rG180397cdded6: [RISCV] Enable the LocalStackSlotAllocation pass support..

I'm seeing failures on the llvm testsuite after this change (scalar only).

Failed Tests (3):
  test-suite :: MultiSource/Benchmarks/DOE-ProxyApps-C++/CLAMR/CLAMR.test
  test-suite :: MultiSource/Benchmarks/tramp3d-v4/tramp3d-v4.test
  test-suite :: SingleSource/UnitTests/matrix-types-spec.test

Does anyone else see them too? Thanks!

In D98101#4013228, @rogfer01 wrote:
I'm seeing failures on the llvm testsuite after this change (scalar only).
Failed Tests (3):
  test-suite :: MultiSource/Benchmarks/DOE-ProxyApps-C++/CLAMR/CLAMR.test
  test-suite :: MultiSource/Benchmarks/tramp3d-v4/tramp3d-v4.test
  test-suite :: SingleSource/UnitTests/matrix-types-spec.test
Does anyone else see them too? Thanks!

Are those the exact same tests you reported last time?

In D98101#4013423, @craig.topper wrote:
In D98101#4013228, @rogfer01 wrote:
I'm seeing failures on the llvm testsuite after this change (scalar only).
Failed Tests (3):
  test-suite :: MultiSource/Benchmarks/DOE-ProxyApps-C++/CLAMR/CLAMR.test
  test-suite :: MultiSource/Benchmarks/tramp3d-v4/tramp3d-v4.test
  test-suite :: SingleSource/UnitTests/matrix-types-spec.test
Does anyone else see them too? Thanks!
Are those the exact same tests you reported last time?

Yes, I see the same failures. I admit I'm a bit puzzled.

Might be a problem on our side, so if someone can reproduce it too, that'd be helpful.

craig.topper added a reverting change: rGdfec6f7e6230: Revert "[RISCV] Enable the LocalStackSlotAllocation pass support.".Dec 25 2022, 12:58 PM

I've reverted the patch so I'm reopening.

It looks like maybe we're somehow creating load/stores with negative immediates that are supposed to be large positive immediates.

For example,
I see s9 pointing to sp+2152. A store that is supposed to be accessing sp+4288 was created with s9-1960. If we consider that -1960 as a uimm12 it would be +2136. 2152+2136 is the 4288 we were after.

This is backed up by running -verify-machineinstrs which generates a ton of errors about immediates that are uimm12 instead of simm12.

I think we failed to consider the offset in the store itself in RISCVRegisterInfo::isFrameOffsetLegal

craig.topper commandeered this revision.Jan 5 2023, 3:17 PM

craig.topper edited reviewers, added: LiDongjin; removed: craig.topper.

Consider the instruction's local offset in isFrameOffsetLegal.

Harbormaster completed remote builds in B206009: Diff 486705.Jan 5 2023, 5:11 PM

This revision was not accepted when it landed; it landed in state Needs Review.Jan 6 2023, 9:54 AM

Closed by commit rG4554663bc0da: Recommit "[RISCV] Enable the LocalStackSlotAllocation pass support" (authored by LiDongjin, committed by craig.topper). · Explain Why

This revision was automatically updated to reflect the committed changes.

craig.topper added a commit: rG4554663bc0da: Recommit "[RISCV] Enable the LocalStackSlotAllocation pass support".

modify the isFrameOffsetLegal to (Offset <= maxIntN(12)) && (Offset >= minIntN(12)),
Simply constrain offset to be within the range of 12-bit unsigned integers.

@LiDongjin I made changes and re-commited this already. Are you proposing additional changes?

LiDongjin commandeered this revision.Jan 6 2023, 6:02 PM

LiDongjin edited reviewers, added: craig.topper; removed: LiDongjin.

LiDongjin updated this revision to Diff 487025.Jan 6 2023, 6:05 PM

This comment was removed by LiDongjin.

In D98101#4032848, @LiDongjin wrote:

modify the isFrameOffsetLegal to (Offset <= maxIntN(12)) && (Offset >= minIntN(12)),

That’s what isIntN<12>(Offset) does

Simply constrain offset to be within the range of 12-bit unsigned integers.

Signed, not unsigned, which is already done

Harbormaster completed remote builds in B206226: Diff 487025.Jan 6 2023, 6:06 PM

Sorry I miss the message, I have no additional changes @craig.topper .

Can we please have the diff reverted back to what was committed and closed? This is a mess now…

LiDongjin abandoned this revision.Jan 6 2023, 6:09 PM

Just change the isFrameOffsetLegal like:

return (Offset <= maxIntN(12)) && (Offset >= minIntN(12));

Harbormaster completed remote builds in B206227: Diff 487026.Jan 6 2023, 6:13 PM

LiDongjin abandoned this revision.Jan 6 2023, 6:14 PM

Commandeering so I can fix the history

craig.topper reclaimed this revision.Jan 6 2023, 6:24 PM

craig.topper accepted this revision.

Well I guess I can't reclose this properly as long as @asb has a blocking review.

Restoring to commited version

craig.topper removed a reviewer: asb.Jan 6 2023, 6:29 PM

This revision is now accepted and ready to land.Jan 6 2023, 6:29 PM

Herald added a subscriber: asb. · View Herald TranscriptJan 6 2023, 6:29 PM

Bumped @asb from reviewers to unblock.

This should be back to the version commited in 4554663bc0da71d61ab488641c95ef98430cb451

Harbormaster completed remote builds in B206228: Diff 487027.Jan 6 2023, 7:16 PM

Revision Contents

Path

Size

llvm/

lib/

Target/

RISCV/

RISCVFrameLowering.h

6 lines

RISCVRegisterInfo.h

16 lines

RISCVRegisterInfo.cpp

121 lines

test/

CodeGen/

RISCV/

local-stack-slot-allocation.ll

62 lines

Diff 487027

llvm/lib/Target/RISCV/RISCVFrameLowering.h

//===-- RISCVFrameLowering.h - Define frame lowering for RISCV -- C++ ---===//		//===-- RISCVFrameLowering.h - Define frame lowering for RISCV -- C++ ---===//
		Lint: Lint Inline Actions clang-format not found in user’s local PATH; not linting file. Lint: Lint: clang-format not found in user’s local PATH; not linting file.
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
▲ Show 20 Lines • Show All 59 Lines • ▼ Show 20 Lines	public:
bool canUseAsPrologue(const MachineBasicBlock &MBB) const override;		bool canUseAsPrologue(const MachineBasicBlock &MBB) const override;
bool canUseAsEpilogue(const MachineBasicBlock &MBB) const override;		bool canUseAsEpilogue(const MachineBasicBlock &MBB) const override;

bool enableShrinkWrapping(const MachineFunction &MF) const override;		bool enableShrinkWrapping(const MachineFunction &MF) const override;

bool isSupportedStackID(TargetStackID::Value ID) const override;		bool isSupportedStackID(TargetStackID::Value ID) const override;
TargetStackID::Value getStackIDForScalableVectors() const override;		TargetStackID::Value getStackIDForScalableVectors() const override;

		bool isStackIdSafeForLocalArea(unsigned StackId) const override {
		// We don't support putting RISCV Vector objects into the pre-allocated
		// local frame block at the moment.
		return StackId != TargetStackID::ScalableVector;
		}

protected:		protected:
const RISCVSubtarget &STI;		const RISCVSubtarget &STI;

private:		private:
void determineFrameLayout(MachineFunction &MF) const;		void determineFrameLayout(MachineFunction &MF) const;
void adjustStackForRVV(MachineFunction &MF, MachineBasicBlock &MBB,		void adjustStackForRVV(MachineFunction &MF, MachineBasicBlock &MBB,
MachineBasicBlock::iterator MBBI, const DebugLoc &DL,		MachineBasicBlock::iterator MBBI, const DebugLoc &DL,
int64_t Amount, MachineInstr::MIFlag Flag) const;		int64_t Amount, MachineInstr::MIFlag Flag) const;
std::pair<int64_t, Align>		std::pair<int64_t, Align>
assignRVVStackObjectOffsets(MachineFunction &MF) const;		assignRVVStackObjectOffsets(MachineFunction &MF) const;
};		};
} // namespace llvm		} // namespace llvm
#endif		#endif

llvm/lib/Target/RISCV/RISCVRegisterInfo.h

//===-- RISCVRegisterInfo.h - RISCV Register Information Impl ---- C++ --===//		//===-- RISCVRegisterInfo.h - RISCV Register Information Impl ---- C++ --===//
		Lint: Lint Inline Actions clang-format not found in user’s local PATH; not linting file. Lint: Lint: clang-format not found in user’s local PATH; not linting file.
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
Show All 37 Lines	void adjustReg(MachineBasicBlock &MBB, MachineBasicBlock::iterator II,
const DebugLoc &DL, Register DestReg, Register SrcReg,		const DebugLoc &DL, Register DestReg, Register SrcReg,
StackOffset Offset, MachineInstr::MIFlag Flag,		StackOffset Offset, MachineInstr::MIFlag Flag,
MaybeAlign RequiredAlign) const;		MaybeAlign RequiredAlign) const;

bool eliminateFrameIndex(MachineBasicBlock::iterator MI, int SPAdj,		bool eliminateFrameIndex(MachineBasicBlock::iterator MI, int SPAdj,
unsigned FIOperandNum,		unsigned FIOperandNum,
RegScavenger *RS = nullptr) const override;		RegScavenger *RS = nullptr) const override;

		bool requiresVirtualBaseRegisters(const MachineFunction &MF) const override;

		bool needsFrameBaseReg(MachineInstr *MI, int64_t Offset) const override;

		bool isFrameOffsetLegal(const MachineInstr *MI, Register BaseReg,
		int64_t Offset) const override;

		Register materializeFrameBaseRegister(MachineBasicBlock *MBB, int FrameIdx,
		int64_t Offset) const override;

		void resolveFrameIndex(MachineInstr &MI, Register BaseReg,
		int64_t Offset) const override;

		int64_t getFrameIndexInstrOffset(const MachineInstr *MI,
		int Idx) const override;

void lowerVSPILL(MachineBasicBlock::iterator II) const;		void lowerVSPILL(MachineBasicBlock::iterator II) const;
void lowerVRELOAD(MachineBasicBlock::iterator II) const;		void lowerVRELOAD(MachineBasicBlock::iterator II) const;

Register getFrameRegister(const MachineFunction &MF) const override;		Register getFrameRegister(const MachineFunction &MF) const override;

bool requiresRegisterScavenging(const MachineFunction &MF) const override {		bool requiresRegisterScavenging(const MachineFunction &MF) const override {
return true;		return true;
}		}
Show All 28 Lines

llvm/lib/Target/RISCV/RISCVRegisterInfo.cpp

//===-- RISCVRegisterInfo.cpp - RISCV Register Information ------- C++ --===//		//===-- RISCVRegisterInfo.cpp - RISCV Register Information ------- C++ --===//
		Lint: Lint Inline Actions clang-format not found in user’s local PATH; not linting file. Lint: Lint: clang-format not found in user’s local PATH; not linting file.
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
▲ Show 20 Lines • Show All 484 Lines • ▼ Show 20 Lines	bool RISCVRegisterInfo::eliminateFrameIndex(MachineBasicBlock::iterator II,
case RISCV::PseudoVRELOAD8_M1:		case RISCV::PseudoVRELOAD8_M1:
lowerVRELOAD(II);		lowerVRELOAD(II);
return true;		return true;
}		}

return false;		return false;
}		}

		bool RISCVRegisterInfo::requiresVirtualBaseRegisters(
		asbUnsubmitted Done Reply Inline Actions No need to copy this doc comment TargetRegisterInfo.h. asb: No need to copy this doc comment TargetRegisterInfo.h.
		const MachineFunction &MF) const {
		return true;
		}

		// Returns true if the instruction's frame index reference would be better
		// served by a base register other than FP or SP.
		// Used by LocalStackSlotAllocation pass to determine which frame index
		asbUnsubmitted Done Reply Inline Actions returns => Returns asb: returns => Returns
		// references it should create new base registers for.
		bool RISCVRegisterInfo::needsFrameBaseReg(MachineInstr *MI,
		asbUnsubmitted Not Done Reply Inline Actions I think this sentence needs rewriting - I can't quite follow it. asb: I think this sentence needs rewriting - I can't quite follow it.
		StephenFanUnsubmitted Done Reply Inline Actions emm, This sentence is copied from TargetRegisterInfo.h. It means that, For RISCV, if a frame index operand has a Offset that out-of-range 12 bits, this function will return true to indicate this frame index operand needs a frame base register. StephenFan: emm, This sentence is copied from TargetRegisterInfo.h. It means that, For RISCV, if a frame…
		asbUnsubmitted Not Done Reply Inline Actions Yes that was my bad - the sentence does indeed make sense as written. asb: Yes that was my bad - the sentence does indeed make sense as written.
		int64_t Offset) const {
		unsigned FIOperandNum = 0;
		for (; !MI->getOperand(FIOperandNum).isFI(); FIOperandNum++)
		assert(FIOperandNum < MI->getNumOperands() &&
		"Instr doesn't have FrameIndex operand");
		asbUnsubmitted Done Reply Inline Actions clang-format prefers this with a space after the first `;` asb: clang-format prefers this with a space after the first `;`

		// For RISC-V, The machine instructions that include a FrameIndex operand
		// are load/store, ADDI instructions.
		unsigned MIFrm = RISCVII::getFormat(MI->getDesc().TSFlags);
		craig.topperAuthorUnsubmitted Done Reply Inline Actions I think vector load/store can have a frameindex operand. There's no immediate field so it will get split out into an ADDI in eliminateFrameIndex, but I think that happens later. craig.topper: I think vector load/store can have a frameindex operand. There's no immediate field so it will…
		StephenFanUnsubmitted Done Reply Inline Actions In the RISCVIselDAGToDAG.cpp: case ISD::FrameIndex: { SDValue Imm = CurDAG->getTargetConstant(0, DL, XLenVT); int FI = cast<FrameIndexSDNode>(Node)->getIndex(); SDValue TFI = CurDAG->getTargetFrameIndex(FI, VT); ReplaceNode(Node, CurDAG->getMachineNode(RISCV::ADDI, DL, VT, TFI, Imm)); return; } So the llvm ir like this : %addr = alloca i8 call void callee(i8* %addr) the %addr will be selected to ADDI StephenFan: In the RISCVIselDAGToDAG.cpp: ``` case ISD::FrameIndex: { SDValue Imm = CurDAG…
		craig.topperAuthorUnsubmitted Done Reply Inline Actions Right. I was just asking why the comment says vector load/store are different? craig.topper: Right. I was just asking why the comment says vector load/store are different?
		StephenFanUnsubmitted Done Reply Inline Actions Oh, sorry. You're right, the vector load/store can have a FrameIndex Operand. I made a mistake. StephenFan: Oh, sorry. You're right, the vector load/store can have a FrameIndex Operand. I made a mistake.
		asbUnsubmitted Done Reply Inline Actions This thread is marked as done, but the comment still same to make the same claim that vector load/store don't have a frameindex? asb: This thread is marked as done, but the comment still same to make the same claim that vector…
		craig.topperAuthorUnsubmitted Not Done Reply Inline Actions I think vector load/store no longer have frame indices. craig.topper: I think vector load/store no longer have frame indices.
		if (MIFrm != RISCVII::InstFormatI && MIFrm != RISCVII::InstFormatS)
		return false;
		// We only generate virtual base registers for loads and stores, so
		// return false for everything else.
		if (!MI->mayLoad() && !MI->mayStore())
		jrtc27Unsubmitted Done Reply Inline Actions Can we not just look and see if it both has an FI and is an I or S type instruction? This isn't really flexible, and I don't see why it's needed either (and I'd have to more than double the size of this switch statement downstream in CHERI-LLVM to add both capability base and capability value instructions). jrtc27: Can we not just look and see if it both has an FI and is an I or S type instruction? This isn't…
		craig.topperAuthorUnsubmitted Done Reply Inline Actions This is missing LBU, LHU, and LWU. I agree with @jrtc27 we should check the I or S type instead. craig.topper: This is missing LBU, LHU, and LWU. I agree with @jrtc27 we should check the I or S type…
		return false;

		const MachineFunction &MF = *MI->getMF();
		const MachineFrameInfo &MFI = MF.getFrameInfo();
		const RISCVFrameLowering *TFI = getFrameLowering(MF);
		const MachineRegisterInfo &MRI = MF.getRegInfo();
		unsigned CalleeSavedSize = 0;
		Offset += getFrameIndexInstrOffset(MI, FIOperandNum);

		craig.topperAuthorUnsubmitted Done Reply Inline Actions No need to say 'int' llvm almost always uses `unsigned` by itself. craig.topper: No need to say 'int' llvm almost always uses `unsigned` by itself.
		// Estimate the stack size used to store callee saved registers(
		// excludes reserved registers).
		craig.topperAuthorUnsubmitted Not Done Reply Inline Actions Is ReservedByUser not already cover by getReservedRegs? craig.topper: Is ReservedByUser not already cover by getReservedRegs?
		BitVector ReservedRegs = getReservedRegs(MF);
		for (const MCPhysReg R = MRI.getCalleeSavedRegs(); MCPhysReg Reg = R; ++R) {
		jrtc27Unsubmitted Done Reply Inline Actions What about floating point registers? Can we also generalise this to iterate over the list of saved registers rather than copying the ABI to yet another place (and thus yet another place we'd have to change downstream in CHERI-LLVM)? jrtc27: What about floating point registers? Can we also generalise this to iterate over the list of…
		craig.topperAuthorUnsubmitted Done Reply Inline Actions Are X3 and X4 really callee saved to the stack? They're reserved so I don't think we ever spill them. craig.topper: Are X3 and X4 really callee saved to the stack? They're reserved so I don't think we ever spill…
		if (!ReservedRegs.test(Reg))
		asbUnsubmitted Done Reply Inline Actions I think: "the maximum possible offset relative to the frame pointer."? asb: I think: "the maximum possible offset relative to the frame pointer."?
		CalleeSavedSize += getSpillSize(*getMinimalPhysRegClass(Reg));
		}

		int64_t MaxFPOffset = Offset - CalleeSavedSize;
		jrtc27Unsubmitted Not Done Reply Inline Actions Why 128? A hard-coded constant common across all ISAs seems fishy. jrtc27: Why 128? A hard-coded constant common across all ISAs seems fishy.
		StephenFanUnsubmitted Done Reply Inline Actions This value should be an experimental target-dependent value. But I don't know yet which value is appropriate for riscv. StephenFan: This value should be an experimental target-dependent value. But I don't know yet which value…
		craig.topperAuthorUnsubmitted Not Done Reply Inline Actions Can we copy the comments from ARM and AArch64 here including the FIXME for 128? craig.topper: Can we copy the comments from ARM and AArch64 here including the FIXME for 128?
		if (TFI->hasFP(MF) && !shouldRealignStack(MF))
		return !isFrameOffsetLegal(MI, RISCV::X8, MaxFPOffset);
		asbUnsubmitted Done Reply Inline Actions bytes => byte and "maximum possible offset relative to the stack pointer." I think asb: bytes => byte and "maximum possible offset relative to the stack pointer." I think

		// Assume 128 bytes spill slots size to estimate the maximum possible
		// offset relative to the stack pointer.
		// FIXME: The 128 is copied from ARM. We should run some statistics and pick a
		// real one for RISC-V.
		int64_t MaxSPOffset = Offset + 128;
		MaxSPOffset += MFI.getLocalFrameSize();
		return !isFrameOffsetLegal(MI, RISCV::X2, MaxSPOffset);
		}

		// Determine whether a given base register plus offset immediate is
		// encodable to resolve a frame index.
		bool RISCVRegisterInfo::isFrameOffsetLegal(const MachineInstr *MI,
		Register BaseReg,
		int64_t Offset) const {
		unsigned FIOperandNum = 0;
		asbUnsubmitted Done Reply Inline Actions Nit: end with full stop asb: Nit: end with full stop
		while (!MI->getOperand(FIOperandNum).isFI()) {
		FIOperandNum++;
		assert(FIOperandNum < MI->getNumOperands() &&
		"Instr does not have a FrameIndex operand!");
		}

		Offset += getFrameIndexInstrOffset(MI, FIOperandNum);
		return isInt<12>(Offset);
		}

		// Insert defining instruction(s) for a pointer to FrameIdx before
		// insertion point I.
		// Return materialized frame pointer.
		Register RISCVRegisterInfo::materializeFrameBaseRegister(MachineBasicBlock *MBB,
		int FrameIdx,
		int64_t Offset) const {
		MachineBasicBlock::iterator MBBI = MBB->begin();
		DebugLoc DL;
		if (MBBI != MBB->end())
		asbUnsubmitted Done Reply Inline Actions Nit: re-wrap and end sentence in full stop. asb: Nit: re-wrap and end sentence in full stop.
		DL = MBBI->getDebugLoc();
		MachineFunction *MF = MBB->getParent();
		MachineRegisterInfo &MFI = MF->getRegInfo();
		jrtc27Unsubmitted Done Reply Inline Actions We use the less-confusing name FIOperandNum elsewhere jrtc27: We use the less-confusing name FIOperandNum elsewhere
		const TargetInstrInfo *TII = MF->getSubtarget().getInstrInfo();

		jrtc27Unsubmitted Done Reply Inline Actions The copies of this in RegisterScavenging and other backends have `assert(i < MI.getNumOperands() && "Instr doesn't have FrameIndex operand!");` jrtc27: The copies of this in RegisterScavenging and other backends have `assert(i < MI.getNumOperands…
		Register BaseReg = MFI.createVirtualRegister(&RISCV::GPRRegClass);
		BuildMI(*MBB, MBBI, DL, TII->get(RISCV::ADDI), BaseReg)
		.addFrameIndex(FrameIdx)
		.addImm(Offset);
		jrtc27Unsubmitted Done Reply Inline Actions I don't see why the first part matters. Only include the relevant information, that "FrameIndex operands are always represented as a register followed by an immediate". jrtc27: I don't see why the first part matters. Only include the relevant information, that "FrameIndex…
		return BaseReg;
		asbUnsubmitted Done Reply Inline Actions Nit: end with full stop asb: Nit: end with full stop
		jrtc27Unsubmitted Done Reply Inline Actions This variable seems pointless to me, just inline the +1 jrtc27: This variable seems pointless to me, just inline the +1
		}

		// Resolve a frame index operand of an instruction to reference the
		// indicated base register plus offset instead.
		void RISCVRegisterInfo::resolveFrameIndex(MachineInstr &MI, Register BaseReg,
		int64_t Offset) const {
		unsigned FIOperandNum = 0;
		while (!MI.getOperand(FIOperandNum).isFI()) {
		asbUnsubmitted Done Reply Inline Actions Nit: end with full stop. asb: Nit: end with full stop.
		FIOperandNum++;
		assert(FIOperandNum < MI.getNumOperands() &&
		"Instr does not have a FrameIndex operand!");
		}

		craig.topperAuthorUnsubmitted Not Done Reply Inline Actions Should we assert that it is I or S format? craig.topper: Should we assert that it is I or S format?
		Offset += getFrameIndexInstrOffset(&MI, FIOperandNum);
		// FrameIndex Operands are always represented as a
		// register followed by an immediate.
		MI.getOperand(FIOperandNum).ChangeToRegister(BaseReg, false);
		MI.getOperand(FIOperandNum + 1).ChangeToImmediate(Offset);
		}

		// Get the offset from the referenced frame index in the instruction,
		// if there is one.
		int64_t RISCVRegisterInfo::getFrameIndexInstrOffset(const MachineInstr *MI,
		int Idx) const {
		assert((RISCVII::getFormat(MI->getDesc().TSFlags) == RISCVII::InstFormatI \|\|
		RISCVII::getFormat(MI->getDesc().TSFlags) == RISCVII::InstFormatS) &&
		"The MI must be I or S format.");
		assert(MI->getOperand(Idx).isFI() && "The Idx'th operand of MI is not a "
		"FrameIndex operand");
		return MI->getOperand(Idx + 1).getImm();
		}

Register RISCVRegisterInfo::getFrameRegister(const MachineFunction &MF) const {		Register RISCVRegisterInfo::getFrameRegister(const MachineFunction &MF) const {
const TargetFrameLowering *TFI = getFrameLowering(MF);		const TargetFrameLowering *TFI = getFrameLowering(MF);
return TFI->hasFP(MF) ? RISCV::X8 : RISCV::X2;		return TFI->hasFP(MF) ? RISCV::X8 : RISCV::X2;
}		}

const uint32_t *		const uint32_t *
RISCVRegisterInfo::getCallPreservedMask(const MachineFunction & MF,		RISCVRegisterInfo::getCallPreservedMask(const MachineFunction & MF,
CallingConv::ID CC) const {		CallingConv::ID CC) const {
▲ Show 20 Lines • Show All 159 Lines • Show Last 20 Lines

llvm/test/CodeGen/RISCV/local-stack-slot-allocation.ll

	; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
	; RUN: llc -mtriple=riscv32 -verify-machineinstrs < %s \| FileCheck %s --check-prefix=RV32I			; RUN: llc -mtriple=riscv32 -verify-machineinstrs < %s \| FileCheck %s --check-prefix=RV32I
	; RUN: llc -mtriple=riscv64 -verify-machineinstrs < %s \| FileCheck %s --check-prefix=RV64I			; RUN: llc -mtriple=riscv64 -verify-machineinstrs < %s \| FileCheck %s --check-prefix=RV64I

	; This test case test the LocalStackSlotAllocation pass that use a base register			; This test case test the LocalStackSlotAllocation pass that use a base register
	; for the frame index that its offset is out-of-range (for RISC-V. the immediate			; for the frame index that its offset is out-of-range (for RISC-V. the immediate
	; is 12 bits for the load store instruction (excludes vector load / store))			; is 12 bits for the load store instruction (excludes vector load / store))
	; TODO: Enable LocalStackSlotAllocation pass.
	define void @use_frame_base_reg() {			define void @use_frame_base_reg() {
				craig.topperAuthorUnsubmitted Not Done Reply Inline Actions Remove TODO craig.topper: Remove TODO
	; RV32I-LABEL: use_frame_base_reg:			; RV32I-LABEL: use_frame_base_reg:
	; RV32I: # %bb.0:			; RV32I: # %bb.0:
	; RV32I-NEXT: lui a0, 24			; RV32I-NEXT: lui a0, 24
	; RV32I-NEXT: addi a0, a0, 1712			; RV32I-NEXT: addi a0, a0, 1712
	; RV32I-NEXT: sub sp, sp, a0			; RV32I-NEXT: sub sp, sp, a0
	; RV32I-NEXT: .cfi_def_cfa_offset 100016			; RV32I-NEXT: .cfi_def_cfa_offset 100016
	; RV32I-NEXT: lui a0, 24			; RV32I-NEXT: lui a0, 24
				; RV32I-NEXT: addi a0, a0, 1704
	; RV32I-NEXT: add a0, sp, a0			; RV32I-NEXT: add a0, sp, a0
	; RV32I-NEXT: lb a0, 1708(a0)			; RV32I-NEXT: lb a1, 4(a0)
	; RV32I-NEXT: lui a0, 24			; RV32I-NEXT: lb a0, 0(a0)
	; RV32I-NEXT: add a0, sp, a0
	; RV32I-NEXT: lb a0, 1704(a0)
	; RV32I-NEXT: lui a0, 24			; RV32I-NEXT: lui a0, 24
	; RV32I-NEXT: addi a0, a0, 1712			; RV32I-NEXT: addi a0, a0, 1712
	; RV32I-NEXT: add sp, sp, a0			; RV32I-NEXT: add sp, sp, a0
	; RV32I-NEXT: ret			; RV32I-NEXT: ret
	;			;
	; RV64I-LABEL: use_frame_base_reg:			; RV64I-LABEL: use_frame_base_reg:
	; RV64I: # %bb.0:			; RV64I: # %bb.0:
	; RV64I-NEXT: lui a0, 24			; RV64I-NEXT: lui a0, 24
	; RV64I-NEXT: addiw a0, a0, 1712			; RV64I-NEXT: addiw a0, a0, 1712
	; RV64I-NEXT: sub sp, sp, a0			; RV64I-NEXT: sub sp, sp, a0
	; RV64I-NEXT: .cfi_def_cfa_offset 100016			; RV64I-NEXT: .cfi_def_cfa_offset 100016
	; RV64I-NEXT: lui a0, 24			; RV64I-NEXT: lui a0, 24
				; RV64I-NEXT: addiw a0, a0, 1704
	; RV64I-NEXT: add a0, sp, a0			; RV64I-NEXT: add a0, sp, a0
	; RV64I-NEXT: lb a0, 1708(a0)			; RV64I-NEXT: lb a1, 4(a0)
	; RV64I-NEXT: lui a0, 24			; RV64I-NEXT: lb a0, 0(a0)
	; RV64I-NEXT: add a0, sp, a0
	; RV64I-NEXT: lb a0, 1704(a0)
	; RV64I-NEXT: lui a0, 24			; RV64I-NEXT: lui a0, 24
	; RV64I-NEXT: addiw a0, a0, 1712			; RV64I-NEXT: addiw a0, a0, 1712
	; RV64I-NEXT: add sp, sp, a0			; RV64I-NEXT: add sp, sp, a0
	; RV64I-NEXT: ret			; RV64I-NEXT: ret

	%va = alloca i8, align 4			%va = alloca i8, align 4
	%va1 = alloca i8, align 4			%va1 = alloca i8, align 4
	%large = alloca [ 100000 x i8 ]			%large = alloca [ 100000 x i8 ]
	%argp.cur = load volatile i8, ptr %va, align 4			%argp.cur = load volatile i8, ptr %va, align 4
	%argp.next = load volatile i8, ptr %va1, align 4			%argp.next = load volatile i8, ptr %va1, align 4
	ret void			ret void
	}			}

				; Test containing a load with its own local offset. Make sure isFrameOffsetLegal
				; considers it and does not create a virtual base register.
				define void @load_with_offset() {
				; RV32I-LABEL: load_with_offset:
				; RV32I: # %bb.0:
				; RV32I-NEXT: lui a0, 25
				; RV32I-NEXT: addi a0, a0, -1792
				; RV32I-NEXT: sub sp, sp, a0
				; RV32I-NEXT: .cfi_def_cfa_offset 100608
				; RV32I-NEXT: lui a0, 25
				; RV32I-NEXT: add a0, sp, a0
				; RV32I-NEXT: lb a0, -292(a0)
				; RV32I-NEXT: lui a0, 24
				; RV32I-NEXT: add a0, sp, a0
				; RV32I-NEXT: lb a0, 1704(a0)
				; RV32I-NEXT: lui a0, 25
				; RV32I-NEXT: addi a0, a0, -1792
				; RV32I-NEXT: add sp, sp, a0
				; RV32I-NEXT: ret
				;
				; RV64I-LABEL: load_with_offset:
				; RV64I: # %bb.0:
				; RV64I-NEXT: lui a0, 25
				; RV64I-NEXT: addiw a0, a0, -1792
				; RV64I-NEXT: sub sp, sp, a0
				; RV64I-NEXT: .cfi_def_cfa_offset 100608
				; RV64I-NEXT: lui a0, 25
				; RV64I-NEXT: add a0, sp, a0
				; RV64I-NEXT: lb a0, -292(a0)
				; RV64I-NEXT: lui a0, 24
				; RV64I-NEXT: add a0, sp, a0
				; RV64I-NEXT: lb a0, 1704(a0)
				; RV64I-NEXT: lui a0, 25
				; RV64I-NEXT: addiw a0, a0, -1792
				; RV64I-NEXT: add sp, sp, a0
				; RV64I-NEXT: ret

				%va = alloca [100 x i8], align 4
				%va1 = alloca [500 x i8], align 4
				%large = alloca [100000 x i8]
				%va_gep = getelementptr [100 x i8], ptr %va, i64 16
				%va1_gep = getelementptr [100 x i8], ptr %va1, i64 0
				%load = load volatile i8, ptr %va_gep, align 4
				%load1 = load volatile i8, ptr %va1_gep, align 4
				ret void
				}

This is an archive of the discontinued LLVM Phabricator instance.

[RISCV] Enable the LocalStackSlotAllocation pass supportClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 487027

llvm/lib/Target/RISCV/RISCVFrameLowering.h

llvm/lib/Target/RISCV/RISCVRegisterInfo.h

llvm/lib/Target/RISCV/RISCVRegisterInfo.cpp

llvm/test/CodeGen/RISCV/local-stack-slot-allocation.ll

[RISCV] Enable the LocalStackSlotAllocation pass support
ClosedPublic