This is an archive of the discontinued LLVM Phabricator instance.

Are these two generic changes (not specific to riscv)? If so can you make a patch for each. If it can be tested as well then great, if it's some theoretical corner case then should be fine without.

I presume that with this change a bunch of tests started passing? Would be good to summarise that in the commit message. "after this change a further N tests that use single stepping now pass".

Address review comments:

Add unittests for EmulateInstructionRISCV.
Split "thread error" and "nullptr dereference" as separate PRs.

Harbormaster completed remote builds in B181457: Diff 452909. Aug 16 2022, 1:30 AM

DavidSpickett added inline comments. Aug 16 2022, 2:32 AM

lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.cpp
36	Move this next to first use if possible.
38	What part of the encoding does this help you extract? Can you add a comment with the name of the field as the spec calls it.
39	Also what does this mean?
83	Here we know that reg_encode is not 0, and reg_encode is unsigned so it cannot be negative. So it could just be `if (reg_encode <= 31)`.
109	Does compressed come into this at any point? Just checking since I see a lot of +4. It's fine to say it's not supported at this time.
120	JALR clears the bottom bit, why is this? On Arm thumb we have bx which can change modes between arm and thumb code. In thumb code the bottom bit is used as a mode bit to say you're in thumb. Is it anything like that? Please add comments to explain.
126	Is funct3 the name of a field from the spec? If so can you add a comment like "funct3 is the type of the compare", whatever it means.
139	Do you ever expect to hit this? Add an assert if you don't.
146	Nit: put this next to the ReadPC call. Generally, order the declarations as they are used.
150	Do this check one line up immediately after the ReadPC.
164	I would just merge these into the CompareB below.
182	Can you explain in the comments what RVI and RVA are?
183	What is not certain at this point? It would be good to record that at any point where the spec being in flux is an issue. E.g. "this code may change once the spec is ratified, at the moment X and Y aspects are not fixed" Just in case we wanted to fixup individual bits as they get decided on later (or find out where we diverge from the final spec).
188	Just put the array_lengthof call in the for line.
190	I would write this `inst & pat.type_mask`. I know it works either way but it looks a bit strange in my opinion.
198	`0x%x` and `does not branch:`
223	This comment seems redundant given that the function is called DecodeAndExecute.
250	Please add comments to explain this. I think what you're doing is checking whether the compressed encoding could be valid. We do similar things for Thumb encodings for Arm.
251	Seems easier to write this as != 3.
260	I would change this into an early return style like this:
	bool success = false;
	m_addr = ReadPC(&success);
	if (!success)
	  m_addr = LLDB_INVALID_ADDRESS;
	Context ctx;
	ctx.type = eContextReadOpcode;
	ctx.SetNoArgs();
	uint32_t inst = (uint32_t)ReadMemoryUnsigned(ctx, m_addr, 4, 0, &success);
	uint16_t try_rvc = (uint16_t)(inst & 0x0000ffff);
	uint16_t mask = try_rvc & 0b11;
	if (try_rvc != 0 && (mask == 0 || mask == 1 || mask == 2)) {
	  m_opcode.SetOpcode16(try_rvc, GetByteOrder());
	} else {
	  m_opcode.SetOpcode32(inst, GetByteOrder());
	}
	return true;
	}
lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.h
56	Do we need this wrapper/forwarder? It just passes on the arg unchanged.
lldb/unittests/Instruction/CMakeLists.txt
4 ↗	(On Diff #452909)	Also for AArch64 this should be "AARCH64" or "ARM64" (I think it's the former). In this context ARM == 32 bit Arm and AArch64 == 64 bit Arm (where elsewhere Arm means both, and I agree, it's confusing).
16 ↗	(On Diff #452909)	Why do we need this conditional logic here? Surely unittests don't need a riscv or arm64 host and shouldn't clash with each other, or did you find some issue including both?
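
The encoding check discussed for lines 250 and 251 can be sketched in isolation. In the base RISC-V encoding scheme, an instruction whose two lowest bits are not `0b11` is a 16-bit compressed (RVC) instruction, which is why `!= 3` is equivalent to listing 0, 1 and 2. This is only a standalone sketch: `IsCompressedEncoding` and `InstructionByteSize` are hypothetical helper names, not functions from the patch.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper (not part of the patch): true if the low halfword
// looks like a 16-bit compressed (RVC) encoding. An all-zero halfword is
// rejected, mirroring the `try_rvc != 0` check in the patch.
constexpr bool IsCompressedEncoding(uint32_t inst) {
  const uint16_t half = static_cast<uint16_t>(inst & 0xffff);
  return half != 0 && (half & 0b11) != 0b11;
}

// Hypothetical helper: size in bytes of the instruction whose first
// (little-endian) bytes are held in `inst`.
constexpr uint32_t InstructionByteSize(uint32_t inst) {
  return IsCompressedEncoding(inst) ? 2 : 4;
}

// 0x0000006f is `jal x0, 0` (opcode 0b1101111), a 32-bit encoding.
static_assert(InstructionByteSize(0x0000006f) == 4, "JAL is 32 bits wide");
// 0x0001 is `c.nop`, a 16-bit encoding (low bits 0b01).
static_assert(InstructionByteSize(0x00000001) == 2, "c.nop is 16 bits wide");
```

Writing the test as a single comparison against `0b11` keeps the intent (anything that is not a 32-bit encoding) visible at a glance.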

I know this is a flood of comments but the overall impression is that this is mostly fine. A bunch of small things.

Thanks for splitting out the other patches and adding the test cases here.

lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.cpp
299	Assert here if you do not expect to hit this code. (even if that's just temporary until you emulate more instructions, it's better to know that something needs implementing)
lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.h
21	You have 2 public markers in the same class with no private between them. Intentional?
lldb/source/Plugins/Process/Linux/NativeProcessLinux.cpp
886	Could you use `isRISCV` here instead?
lldb/unittests/Instruction/RISCV/TestRISCVEmulator.cpp
9 ↗	(On Diff #452909)	This appears unused.
39 ↗	(On Diff #452909)	Please explain what this does. Just on the face of it, something a bit dodgy :) (maybe there is a better way to express it, even if it is legit)
45 ↗	(On Diff #452909)
	if (reg == gpr_x0_riscv)
	  reg_value.SetUInt(0, reg_info->byte_size);
	else
	  reg_value.SetUInt(tester->gpr.gpr[reg], reg_info->byte_size);
	return true;
55–59 ↗	(On Diff #452909)
	if (reg != gpr_x0_riscv)
	  tester->gpr.gpr[reg] = reg_value.GetAsUInt64();
	return true;
68 ↗	(On Diff #452909)	Assert that this was successful.
79 ↗	(On Diff #452909)	EncodeInstruction? I don't think we need to shorten the names quite this much.
103 ↗	(On Diff #452909)	You check the `~1` here but your input values don't have the bottom bit set. Does that mean this test is not covering that aspect of the emulation? Seems like it should be.
106 ↗	(On Diff #452909)	EncodeBranch
140 ↗	(On Diff #452909)	testBranch (you get the idea by now)
164 ↗	(On Diff #452909)	Add comments to explain what these numbers are. With the macro wrapper it's hard to tell what they mean and what this is actually checking.

Oh and please note in the commit message that this does not support compressed instructions (or does it, I assume no but make it obvious).

Emmmer marked 30 inline comments as done. Aug 16 2022, 5:00 AM

Emmmer added inline comments.

lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.cpp
83	Just to make the if condition explicitly exhaustive. Not relying on implicitly derived conditions helps the code survive larger refactorings that may break the implicit conditions. Sure it can be simplified to `reg_encode <= 31`, but we should let compilers do it for us rather than doing it manually.
109	No. `jal/jalr/b` are always 4-byte instructions. Supporting RVC does not need to change the +4 here.
120	It is like the mode bit in ARM but more flexible since riscv does not have two modes. The spec says auxiliary information can be stored to LSB in function addresses (like JIT compilers can use this bit). It is open to developers.
126	Yes. `funct3` is a widely used name in spec meaning `3-bits function selector`
183	It's a pity that the whole debug spec was not useable (not implemented by any emulator nor hardware). We don't use this code for platforms which implemented the debug spec. But for platforms without the debug spec we still need this emulation. We don't need to change the code here in either case in the future. So I guess it's fine to not record the issue.
223	A sense of ceremony haha. Will delete the comment accordingly.
250	Exactly! Comments are added now.
lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.h
21	Yes, to distinguish public static members from public instance members.
56	Yes. `SupportsThisInstructionType` is also used in `CreateInstance` which is a static method.
lldb/unittests/Instruction/CMakeLists.txt
4 ↗	(On Diff #452909)	This code confused me a lot and thank you for the explanation. I am going to check ARM and AARCH64 explicitly.
16 ↗	(On Diff #452909)	I don't know either. I see the original code checking TARGETS_TO_BUILD and there might be special consideration I suppose?
lldb/unittests/Instruction/RISCV/TestRISCVEmulator.cpp
9 ↗	(On Diff #452909)	nice catch!
39 ↗	(On Diff #452909)	It is actually an always-success dynamic_cast because:
- The `RISCVEmulatorTester` is derived from `EmulateInstruction`.
- These callbacks are set in the constructor of `RISCVEmulatorTester`.
- All `EmulateInstruction` instances created in this unittest are always `RISCVEmulatorTester`.
Signature of these callbacks is not polymorphic (like for Java we have `<T extends EmulateInstruction>` with bounded constraints.) and that's the reason why we need this cast. We may turn it into a template but it requires more effort. I also noticed that there's a `void *baton` in parameters. But using that field looks more terrible. I guess we could have a `userdata` whose type was parametrized.
79 ↗	(On Diff #452909)	The `I` stands for `I-Type`, a riscv instruction format. Like `EncodeB` below.
103 ↗	(On Diff #452909)	`0x1024 - 255` = 3877, and `(0x1024 - 255) & ~1` = 3876, so I think it is covered 😃
106 ↗	(On Diff #452909)	The `B` stands for `B-Type` which works for all B-type instructions. Branch instructions are all of B-type so renaming it to `EncodeBranch` seems Okay right now, but we may add more B-type instructions in the future.
140 ↗	(On Diff #452909)	Yes this should be `Branch` since it only tests branch instructions.
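
The arithmetic in the reply for line 103 can be checked with a small standalone sketch. It assumes the JALR semantics described in this thread (target = rs1 plus the sign-extended immediate, with the least-significant bit cleared); `JalrTarget` is a hypothetical name, not a function from the patch.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper: JALR target = (rs1 + imm) with bit 0 cleared.
// The addition is done on unsigned values so that a negative immediate
// wraps around in a well-defined way.
constexpr uint64_t JalrTarget(uint64_t rs1, int64_t imm) {
  return (rs1 + static_cast<uint64_t>(imm)) & ~uint64_t{1};
}

// 0x1024 is 4132, so 0x1024 - 255 is the odd value 3877 ...
static_assert(0x1024 - 255 == 3877, "sum before masking is odd");
// ... and clearing bit 0 gives 3876, so the mask is exercised.
static_assert(JalrTarget(0x1024, -255) == 3876, "bit 0 is cleared");
```

Because the sum is odd before masking, a test using these inputs would fail if the emulation forgot the `& ~1`.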

address review comments

Emmmer marked an inline comment as done. Aug 16 2022, 5:03 AM

Harbormaster completed remote builds in B181491: Diff 452959. Aug 16 2022, 5:05 AM

Are we expecting this code to only ever be given JAL or JALR at this time?

If it's the case that anything else will just return false or do nothing, it might be worth adding a couple of tests that check that encodings that aren't branches just do nothing or fail. Obviously not every encoding; maybe just flip 1 bit in the instruction type field as a smoke test for each of jal and jalr.
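
One way to picture the suggested smoke test: take a valid JAL or JALR encoding, flip a single bit in the 7-bit opcode field, and check that it no longer matches the pattern. The sketch below reuses the mask and opcode values from the patch but is otherwise standalone; `Matches`, `FlipBit`, `kJal` and `kJalr` are hypothetical names, and a real test would go through the emulator rather than compare masks directly.

```cpp
#include <cassert>
#include <cstdint>

constexpr uint32_t J_MASK = 0b000000001111111;  // opcode only
constexpr uint32_t I_MASK = 0b111000001111111;  // funct3 + opcode
constexpr uint32_t JAL_OPCODE = 0b1101111;
constexpr uint32_t JALR_OPCODE = 0b1100111;

// Hypothetical helper: the same mask-and-compare used by the pattern table.
constexpr bool Matches(uint32_t inst, uint32_t mask, uint32_t value) {
  return (inst & mask) == value;
}

// Hypothetical helper: toggle one bit of an encoding.
constexpr uint32_t FlipBit(uint32_t inst, unsigned bit) {
  return inst ^ (uint32_t{1} << bit);
}

constexpr uint32_t kJal = 0x000000ef;   // jal x1, 0
constexpr uint32_t kJalr = 0x00008067;  // jalr x0, 0(x1), i.e. ret

static_assert(Matches(kJal, J_MASK, JAL_OPCODE), "valid JAL matches");
static_assert(!Matches(FlipBit(kJal, 2), J_MASK, JAL_OPCODE),
              "opcode 0b1101011 is not JAL");
static_assert(Matches(kJalr, I_MASK, JALR_OPCODE), "valid JALR matches");
static_assert(!Matches(FlipBit(kJalr, 4), I_MASK, JALR_OPCODE),
              "opcode 0b1110111 is not JALR");
```

The bit has to be chosen with some care: flipping bit 3 of the JAL opcode yields the JALR opcode, and flipping bit 2 of the JALR opcode yields the branch opcode, so those flips would still land on an emulated instruction.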

lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.cpp
223	No worries. Usually I end up with these because I wrote out the steps as comments first and forget to delete them.
lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.h
21	Got it, I missed that.
lldb/unittests/Instruction/CMakeLists.txt
16 ↗	(On Diff #452909)	Sorry I completely missed that first line (the diff split at that point). Right, so you're just adding the riscv equivalent of what's already there. So actually just leave the ARM bit as is, I'll do some builds and find out whether it needs to change for AArch64.
lldb/unittests/Instruction/RISCV/TestRISCVEmulator.cpp
98 ↗	(On Diff #452959)	Add a comment like "JALR will always zero the bottom bit of the target".
161 ↗	(On Diff #452959)	Now I understand what's going on. I thought that this somehow included non branch encodings when in fact what it's doing is saying for a branch equals we should only branch if the operands are equal.
39 ↗	(On Diff #452909)	"But using that field looks more terrible. I guess we could have a userdata whose type was parametrized."
Yeah there's a lot of...not ideal patterns in this area but I wouldn't pay much attention to them. Without opting into llvm's rtti equivalent I guess this is the best way to do it.
79 ↗	(On Diff #452909)	Cool. It's mostly that EncodeB looks like the author got distracted half way through the name. But anyway I'm not familiar with the spec so if it matches that then great.
103 ↗	(On Diff #452909)	Doh, yes I wasn't looking at the offset.
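
The callback pattern discussed for line 39 can be illustrated with a generic sketch: the callback has a fixed signature taking a base-class pointer, so the derived tester casts it back to its own type. Everything here is hypothetical (`Emulator`, `Tester`, `ReadRegCallback`); it is not LLDB's `EmulateInstruction` API, only an illustration of why the cast always succeeds when every instance is created as the derived type.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for a base class that stores a plain
// function-pointer callback with a fixed, non-templated signature.
struct Emulator {
  using ReadRegFn = bool (*)(Emulator *self, uint32_t reg, uint64_t &out);
  ReadRegFn read_reg = nullptr;
  virtual ~Emulator() = default;

  bool ReadReg(uint32_t reg, uint64_t &out) {
    return read_reg != nullptr && read_reg(this, reg, out);
  }
};

// Hypothetical tester: owns the register file and installs the callback
// in its constructor.
struct Tester : Emulator {
  uint64_t gpr[32] = {};

  Tester() { read_reg = &Tester::ReadRegCallback; }

  static bool ReadRegCallback(Emulator *self, uint32_t reg, uint64_t &out) {
    if (reg >= 32)
      return false;
    // Safe only because every Emulator in this sketch is a Tester.
    auto *tester = static_cast<Tester *>(self);
    out = (reg == 0) ? 0 : tester->gpr[reg]; // x0 always reads as zero
    return true;
  }
};
```

The trade-off is the one raised in the thread: the cast is unchecked, so it relies on the invariant that the callback is only ever installed by the derived class.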

Emmmer added inline comments. Aug 16 2022, 6:35 AM

lldb/unittests/Instruction/CMakeLists.txt
16 ↗	(On Diff #452909)	Oh... thank you! I am gonna revert the change for AArch64.
lldb/unittests/Instruction/RISCV/TestRISCVEmulator.cpp
79 ↗	(On Diff #452909)	You're right, the name looks unfriendly to other developers. So I changed its name to `EncodeIType` (also for `EncodeBType`) to avoid confusion.

address review comments

Harbormaster completed remote builds in B181524: Diff 452997. Aug 16 2022, 7:16 AM

LGTM

This revision is now accepted and ready to land. Aug 16 2022, 7:22 AM

Closed by commit rG4fc7e9cba24b: [LLDB][RISCV] Make software single stepping work (authored by Emmmer). Aug 16 2022, 8:45 AM

This revision was automatically updated to reflect the committed changes.

Emmmer added a commit: rG4fc7e9cba24b: [LLDB][RISCV] Make software single stepping work.

Herald added a subscriber: lldb-commits. Aug 16 2022, 8:45 AM

Revision Contents

Path	Size
lldb/source/Plugins/Instruction/CMakeLists.txt	1 line
lldb/source/Plugins/Instruction/RISCV/CMakeLists.txt	11 lines
lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.h	72 lines
lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.cpp	330 lines
lldb/source/Plugins/Process/Linux/NativeProcessLinux.cpp	14 lines
lldb/source/Plugins/Process/Utility/NativeProcessSoftwareSingleStep.cpp	9 lines
lldb/tools/lldb-server/CMakeLists.txt	1 line
lldb/tools/lldb-server/SystemInitializerLLGS.cpp	11 lines

Diff 452128

lldb/source/Plugins/Instruction/CMakeLists.txt

	add_subdirectory(ARM)			add_subdirectory(ARM)
	add_subdirectory(ARM64)			add_subdirectory(ARM64)
	add_subdirectory(MIPS)			add_subdirectory(MIPS)
	add_subdirectory(MIPS64)			add_subdirectory(MIPS64)
	add_subdirectory(PPC64)			add_subdirectory(PPC64)
				add_subdirectory(RISCV)

lldb/source/Plugins/Instruction/RISCV/CMakeLists.txt

This file was added.

				add_lldb_library(lldbPluginInstructionRISCV PLUGIN
				EmulateInstructionRISCV.cpp

				LINK_LIBS
				lldbCore
				lldbInterpreter
				lldbSymbol
				lldbPluginProcessUtility
				LINK_COMPONENTS
				Support
				)

lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.h

This file was added.

				//===-- EmulateInstructionRISCV.h -----------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#ifndef LLDB_SOURCE_PLUGINS_INSTRUCTION_RISCV_EMULATEINSTRUCTIONRISCV_H
				#define LLDB_SOURCE_PLUGINS_INSTRUCTION_RISCV_EMULATEINSTRUCTIONRISCV_H

				#include "lldb/Core/EmulateInstruction.h"
				#include "lldb/Interpreter/OptionValue.h"
				#include "lldb/Utility/Log.h"
				#include "lldb/Utility/Status.h"

				namespace lldb_private {

				class EmulateInstructionRISCV : public EmulateInstruction {
				public:
				static llvm::StringRef GetPluginNameStatic() { return "riscv"; }

				static llvm::StringRef GetPluginDescriptionStatic() {
				return "Emulate instructions for the RISC-V architecture.";
				}

				static bool SupportsThisInstructionType(InstructionType inst_type) {
				switch (inst_type) {
				case eInstructionTypeAny:
				case eInstructionTypePCModifying:
				return true;
				case eInstructionTypePrologueEpilogue:
				case eInstructionTypeAll:
				default:
				return false;
				}
				}

				static bool SupportsThisArch(const ArchSpec &arch);

				static lldb_private::EmulateInstruction *
				CreateInstance(const lldb_private::ArchSpec &arch, InstructionType inst_type);

				static void Initialize();

				static void Terminate();

				public:
				EmulateInstructionRISCV(const ArchSpec &arch) : EmulateInstruction(arch) {}

				llvm::StringRef GetPluginName() override { return GetPluginNameStatic(); }

				bool SupportsEmulatingInstructionsOfType(InstructionType inst_type) override {
				return SupportsThisInstructionType(inst_type);
				}

				bool SetTargetTriple(const ArchSpec &arch) override;
				bool ReadInstruction() override;
				bool EvaluateInstruction(uint32_t options) override;
				bool TestEmulation(Stream *out_stream, ArchSpec &arch,
				OptionValueDictionary *test_data) override;
				bool GetRegisterInfo(lldb::RegisterKind reg_kind, uint32_t reg_num,
				RegisterInfo &reg_info) override;

				lldb::addr_t ReadPC(bool *success);
				bool WritePC(lldb::addr_t pc);
				bool DecodeAndExecute(uint32_t inst, bool ignore_cond);
				};

				} // namespace lldb_private

				#endif // LLDB_SOURCE_PLUGINS_INSTRUCTION_RISCV_EMULATEINSTRUCTIONRISCV_H

lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.cpp

This file was added.

				//===-- EmulateInstructionRISCV.cpp ---------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#include <cstdlib>

				#include "EmulateInstructionRISCV.h"
				#include "Plugins/Process/Utility/RegisterInfoPOSIX_riscv64.h"
				#include "Plugins/Process/Utility/lldb-riscv-register-enums.h"

				#include "lldb/Core/Address.h"
				#include "lldb/Core/PluginManager.h"
				#include "lldb/Interpreter/OptionValueArray.h"
				#include "lldb/Interpreter/OptionValueDictionary.h"
				#include "lldb/Symbol/UnwindPlan.h"
				#include "lldb/Utility/ArchSpec.h"
				#include "lldb/Utility/LLDBLog.h"
				#include "lldb/Utility/RegisterValue.h"
				#include "lldb/Utility/Stream.h"

				#include "llvm/ADT/STLExtras.h"
				#include "llvm/Support/MathExtras.h"

				using namespace lldb;
				using namespace lldb_private;

				LLDB_PLUGIN_DEFINE_ADV(EmulateInstructionRISCV, InstructionRISCV)

				namespace lldb_private {

				struct InstrPattern {
				const char *name;
				uint32_t type_mask;
				uint32_t eigen;
				bool (*exec)(EmulateInstructionRISCV *emulator, uint32_t inst,
				bool ignore_cond);
				};

				constexpr uint32_t I_MASK = 0b111000001111111;
				constexpr uint32_t J_MASK = 0b000000001111111;
				// no funct3 in the b-mask because the logic executing B<CMP> is quite similar.
				constexpr uint32_t B_MASK = 0b000000001111111;
				constexpr uint32_t BEQ = 0b000;
				constexpr uint32_t BNE = 0b001;
				constexpr uint32_t BLT = 0b100;
				constexpr uint32_t BGE = 0b101;
				constexpr uint32_t BLTU = 0b110;
				constexpr uint32_t BGEU = 0b111;

				constexpr uint32_t DecodeRD(uint32_t inst) { return (inst & 0xF80) >> 7; }
				constexpr uint32_t DecodeRS1(uint32_t inst) { return (inst & 0xF8000) >> 15; }
				constexpr uint32_t DecodeRS2(uint32_t inst) { return (inst & 0x1F00000) >> 20; }
				constexpr uint32_t DecodeFunct3(uint32_t inst) { return (inst & 0x7000) >> 12; }

				constexpr uint32_t DecodeJImm(uint32_t inst) {
				return (uint64_t(int64_t(int32_t(inst & 0x80000000)) >> 11)) // imm[20]
				| (inst & 0xff000) // imm[19:12]
				| ((inst >> 9) & 0x800) // imm[11]
				| ((inst >> 20) & 0x7fe); // imm[10:1]
				}

				constexpr uint32_t DecodeIImm(uint32_t inst) {
				return int64_t(int32_t(inst)) >> 20; // imm[11:0]
				}

				constexpr uint32_t DecodeBImm(uint32_t inst) {
				return (uint64_t(int64_t(int32_t(inst & 0x80000000)) >> 19)) // imm[12]
				| ((inst & 0x80) << 4) // imm[11]
				| ((inst >> 20) & 0x7e0) // imm[10:5]
				| ((inst >> 7) & 0x1e); // imm[4:1]
				}

				static uint32_t GPREncodingToLLDB(uint32_t reg_encode) {
				if (reg_encode == 0)
				return gpr_x0_riscv;
				if (reg_encode >= 1 && reg_encode <= 31)
				return gpr_x1_riscv + reg_encode - 1;
				return LLDB_INVALID_REGNUM;
				}

				static bool ReadRegister(EmulateInstructionRISCV *emulator, uint32_t reg_encode,
				RegisterValue &value) {
				uint32_t lldb_reg = GPREncodingToLLDB(reg_encode);
				return emulator->ReadRegister(eRegisterKindLLDB, lldb_reg, value);
				}

				static bool ExecJAL(EmulateInstructionRISCV *emulator, uint32_t inst, bool) {
				bool success = false;
				int64_t offset = DecodeJImm(inst);
				int64_t pc = emulator->ReadPC(&success);
				return success && emulator->WritePC(pc + offset);
				}

				static bool ExecJALR(EmulateInstructionRISCV *emulator, uint32_t inst, bool) {
				int64_t offset = DecodeIImm(inst);
				RegisterValue value;
				if (!ReadRegister(emulator, DecodeRS1(inst), value))
				return false;
				int64_t rs1 = int64_t(value.GetAsUInt64());
				return emulator->WritePC((rs1 + offset) & ~1);
				}

				static bool CompareB(uint64_t rs1, uint64_t rs2, uint32_t funct3) {
				switch (funct3) {
				case BEQ:
				return rs1 == rs2;
				case BNE:
				return rs1 != rs2;
				case BLT:
				return int64_t(rs1) < int64_t(rs2);
				case BGE:
				return int64_t(rs1) >= int64_t(rs2);
				case BLTU:
				return rs1 < rs2;
				case BGEU:
				return rs1 >= rs2;
				default:
				return false;
				}
				}

				static bool ExecB(EmulateInstructionRISCV *emulator, uint32_t inst,
				bool ignore_cond) {
				bool success = false;
				uint64_t offset = DecodeBImm(inst);
				uint64_t pc = emulator->ReadPC(&success);
				uint64_t target = pc + offset;
				if (!success)
				return false;
				if (ignore_cond)
				return emulator->WritePC(target);

				RegisterValue value1;
				RegisterValue value2;
				if (!ReadRegister(emulator, DecodeRS1(inst), value1) ||
				!ReadRegister(emulator, DecodeRS2(inst), value2))
				return false;

				uint32_t funct3 = DecodeFunct3(inst);

				uint64_t rs1 = value1.GetAsUInt64();
				uint64_t rs2 = value2.GetAsUInt64();

				if (CompareB(rs1, rs2, funct3))
				return emulator->WritePC(target);

				return true;
				}

				static InstrPattern PATTERNS[] = {
				{"JAL", J_MASK, 0b1101111, ExecJAL},
				{"JALR", I_MASK, 0b000000001100111, ExecJALR},
				{"B<CMP>", B_MASK, 0b1100011, ExecB},
				// TODO: {LR/SC}.{W/D} and ECALL
				};

				/// This function only determines the next instruction address for software
				/// single stepping by emulating branching instructions including:
				/// - from RVI: JAL, JALR, B<CMP>, ECALL
				/// - from RVA: LR -> BNE -> SC -> BNE
				/// We will get rid of this tedious code when the riscv debug spec is ratified.
				bool EmulateInstructionRISCV::DecodeAndExecute(uint32_t inst,
				bool ignore_cond) {
				Log *log = GetLog(LLDBLog::Process | LLDBLog::Breakpoints);
				int length = llvm::array_lengthof(PATTERNS);
				for (int i = 0; i < length; ++i) {
				const InstrPattern &pat = PATTERNS[i];
				if ((pat.type_mask & inst) == pat.eigen) {
				LLDB_LOGF(log, "EmulateInstructionRISCV::%s: inst(%x) was decoded to %s",
				__FUNCTION__, inst, pat.name);
				return pat.exec(this, inst, ignore_cond);
				}
				}

				LLDB_LOGF(log,
				"EmulateInstructionRISCV::%s: inst(%x) does not branching: "
				"no need to calculate the next pc address which is trivial.",
				__FUNCTION__, inst);
				return true;
				}

				bool EmulateInstructionRISCV::EvaluateInstruction(uint32_t options) {
				uint32_t inst_size = m_opcode.GetByteSize();
				uint32_t inst = m_opcode.GetOpcode32();
				bool increase_pc = options & eEmulateInstructionOptionAutoAdvancePC;
				bool ignore_cond = options & eEmulateInstructionOptionIgnoreConditions;
				bool success = false;

				lldb::addr_t old_pc = 0;
				if (increase_pc) {
				old_pc = ReadPC(&success);
				if (!success)
				return false;
				}

				if (inst_size == 2) {
				// TODO: execute RVC
				return false;
				}

				// Execute it.
				success = DecodeAndExecute(inst, ignore_cond);
				if (!success)
				return false;

				if (increase_pc) {
				lldb::addr_t new_pc = ReadPC(&success);
				if (!success)
				return false;

				if (new_pc == old_pc) {
				if (!WritePC(old_pc + inst_size))
				return false;
				}
				}
				return true;
				}

				bool EmulateInstructionRISCV::ReadInstruction() {
				bool success = false;
				m_addr = ReadPC(&success);
				if (success) {
				Context ctx;
				ctx.type = eContextReadOpcode;
				ctx.SetNoArgs();
				uint32_t inst = (uint32_t)ReadMemoryUnsigned(ctx, m_addr, 4, 0, &success);
				uint16_t try_rvc = (uint16_t)(inst & 0x0000ffff);
				uint16_t mask = try_rvc & 0b11;
				if (try_rvc != 0 && (mask == 0 || mask == 1 || mask == 2)) {
				m_opcode.SetOpcode16(try_rvc, GetByteOrder());
				} else {
				m_opcode.SetOpcode32(inst, GetByteOrder());
				}
				}
				if (!success)
				m_addr = LLDB_INVALID_ADDRESS;
				return success;
				}

				lldb::addr_t EmulateInstructionRISCV::ReadPC(bool *success) {
				return ReadRegisterUnsigned(eRegisterKindGeneric, LLDB_REGNUM_GENERIC_PC,
				LLDB_INVALID_ADDRESS, success);
				}

				bool EmulateInstructionRISCV::WritePC(lldb::addr_t pc) {
				EmulateInstruction::Context ctx;
				ctx.type = eContextAdvancePC;
				DavidSpickett: Seems easier to write this as != 3.
				ctx.SetNoArgs();
				return WriteRegisterUnsigned(ctx, eRegisterKindGeneric,
				LLDB_REGNUM_GENERIC_PC, pc);
				}

				bool EmulateInstructionRISCV::GetRegisterInfo(lldb::RegisterKind reg_kind,
				uint32_t reg_index,
				RegisterInfo &reg_info) {
				if (reg_kind == eRegisterKindGeneric) {
				DavidSpickett: I would change this into an early return style like this:
				```
				bool success = false;
				m_addr = ReadPC(&success);
				if (!success)
				  m_addr = LLDB_INVALID_ADDRESS;
				Context ctx;
				ctx.type = eContextReadOpcode;
				ctx.SetNoArgs();
				uint32_t inst = (uint32_t)ReadMemoryUnsigned(ctx, m_addr, 4, 0, &success);
				uint16_t try_rvc = (uint16_t)(inst & 0x0000ffff);
				uint16_t mask = try_rvc & 0b11;
				if (try_rvc != 0 && (mask == 0 || mask == 1 || mask == 2)) {
				  m_opcode.SetOpcode16(try_rvc, GetByteOrder());
				} else {
				  m_opcode.SetOpcode32(inst, GetByteOrder());
				}
				return true;
				}
				```
				switch (reg_index) {
				case LLDB_REGNUM_GENERIC_PC:
				reg_kind = eRegisterKindLLDB;
				reg_index = gpr_pc_riscv;
				break;
				case LLDB_REGNUM_GENERIC_SP:
				reg_kind = eRegisterKindLLDB;
				reg_index = gpr_sp_riscv;
				break;
				case LLDB_REGNUM_GENERIC_FP:
				reg_kind = eRegisterKindLLDB;
				reg_index = gpr_fp_riscv;
				break;
				case LLDB_REGNUM_GENERIC_RA:
				reg_kind = eRegisterKindLLDB;
				reg_index = gpr_ra_riscv;
				break;
				// We may handle LLDB_REGNUM_GENERIC_ARGx when more instructions are
				// supported.
				default:
				return false;
				}
				}

				const RegisterInfo *array =
				RegisterInfoPOSIX_riscv64::GetRegisterInfoPtr(m_arch);
				const uint32_t length =
				RegisterInfoPOSIX_riscv64::GetRegisterInfoCount(m_arch);

				if (reg_index >= length || reg_kind != eRegisterKindLLDB)
				return false;

				reg_info = array[reg_index];
				return true;
				}

				bool EmulateInstructionRISCV::SetTargetTriple(const ArchSpec &arch) {
				return SupportsThisArch(arch);
				}
				DavidSpickett: Assert here if you do not expect to hit this code. (even if that's just temporary until you emulate more instructions, it's better to know that something needs implementing)

				bool EmulateInstructionRISCV::TestEmulation(Stream *out_stream, ArchSpec &arch,
				OptionValueDictionary *test_data) {
				return false;
				}

				void EmulateInstructionRISCV::Initialize() {
				PluginManager::RegisterPlugin(GetPluginNameStatic(),
				GetPluginDescriptionStatic(), CreateInstance);
				}

				void EmulateInstructionRISCV::Terminate() {
				PluginManager::UnregisterPlugin(CreateInstance);
				}

				lldb_private::EmulateInstruction *
				EmulateInstructionRISCV::CreateInstance(const ArchSpec &arch,
				InstructionType inst_type) {
				if (EmulateInstructionRISCV::SupportsThisInstructionType(inst_type) &&
				SupportsThisArch(arch)) {
				return new EmulateInstructionRISCV(arch);
				}

				return nullptr;
				}

				bool EmulateInstructionRISCV::SupportsThisArch(const ArchSpec &arch) {
				return arch.GetTriple().isRISCV();
				}

				} // namespace lldb_private
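
The compressed-instruction check discussed in the inline comments above can be sketched in isolation. This is a minimal illustration, not the code from the patch: `LooksCompressed` and `InstructionLength` are hypothetical names, and the sketch assumes only 16-bit and 32-bit encodings exist (longer encodings are ignored). It folds in the reviewer's suggestion to compare the low two bits against 3.

```cpp
#include <cstdint>

// Hypothetical helper (not from the patch). Standard 32-bit RISC-V
// instructions have both low bits set (0b11); any other value in bits [1:0]
// marks a 16-bit compressed instruction. An all-zero halfword is not a
// valid compressed instruction, so it is rejected, as in the patch.
constexpr bool LooksCompressed(uint32_t inst) {
  const uint16_t low_half = static_cast<uint16_t>(inst & 0xffff);
  return low_half != 0 && (low_half & 0b11) != 0b11;
}

// Hypothetical helper: byte length of the instruction that starts with the
// fetched word, under the same 16/32-bit-only assumption.
constexpr unsigned InstructionLength(uint32_t inst) {
  return LooksCompressed(inst) ? 2 : 4;
}
```

Writing the test as `!= 0b11` rather than enumerating 0, 1 and 2 keeps the condition tied to the one bit pattern that identifies a 32-bit encoding.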

lldb/source/Plugins/Process/Linux/NativeProcessLinux.cpp

Show First 20 Lines • Show All 876 Lines • ▼ Show 20 Lines	bool NativeProcessLinux::MonitorClone(NativeThreadLinux &parent,
default:		default:
llvm_unreachable("unknown clone_info.event");		llvm_unreachable("unknown clone_info.event");
}		}

return true;		return true;
}		}

bool NativeProcessLinux::SupportHardwareSingleStepping() const {		bool NativeProcessLinux::SupportHardwareSingleStepping() const {
if (m_arch.GetMachine() == llvm::Triple::arm || m_arch.IsMIPS())		Triple::ArchType machine = m_arch.GetMachine();
		if (m_arch.IsMIPS() || machine == llvm::Triple::arm ||
		DavidSpickett: Could you use `isRISCV` here instead?
		machine == llvm::Triple::riscv32 \|\| machine == llvm::Triple::riscv64)
return false;		return false;
return true;		return true;
}		}

Status NativeProcessLinux::Resume(const ResumeActionList &resume_actions) {		Status NativeProcessLinux::Resume(const ResumeActionList &resume_actions) {
Log *log = GetLog(POSIXLog::Process);		Log *log = GetLog(POSIXLog::Process);
LLDB_LOG(log, "pid {0}", GetID());		LLDB_LOG(log, "pid {0}", GetID());

Show All 34 Lines	for (const auto &thread : m_threads) {
LLDB_LOG(log, "processing resume action state {0} for pid {1} tid {2}",		LLDB_LOG(log, "processing resume action state {0} for pid {1} tid {2}",
action->state, GetID(), thread->GetID());		action->state, GetID(), thread->GetID());

switch (action->state) {		switch (action->state) {
case eStateRunning:		case eStateRunning:
case eStateStepping: {		case eStateStepping: {
// Run the thread, possibly feeding it the signal.		// Run the thread, possibly feeding it the signal.
const int signo = action->signal;		const int signo = action->signal;
ResumeThread(static_cast<NativeThreadLinux &>(*thread), action->state,		Status error = ResumeThread(static_cast<NativeThreadLinux &>(*thread),
signo);		action->state, signo);
		if (error.Fail()) {
		return Status("NativeProcessLinux::%s: failed to resume thread "
		"for pid %" PRIu64 ", tid %" PRIu64 ", error = %s",
		__FUNCTION__, GetID(), thread->GetID(),
		error.AsCString());
		}
break;		break;
}		}

case eStateSuspended:		case eStateSuspended:
case eStateStopped:		case eStateStopped:
break;		break;

default:		default:
▲ Show 20 Lines • Show All 1,054 Lines • Show Last 20 Lines
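
The change to `Resume` above stops at the first thread that fails to resume and returns an error carrying the thread id and the underlying message, where the old code discarded the result. A minimal standalone sketch of that pattern follows; `FakeThread`, `ResumeOne` and `ResumeAll` are hypothetical stand-ins, and `std::optional<std::string>` is used in place of LLDB's `Status` purely to keep the example self-contained.

```cpp
#include <optional>
#include <string>
#include <vector>

// Hypothetical stand-in for a native thread; not an LLDB type.
struct FakeThread {
  int tid;
  bool resumable;
};

// Returns an error message on failure, std::nullopt on success.
std::optional<std::string> ResumeOne(const FakeThread &thread) {
  if (!thread.resumable)
    return std::string("ptrace failed");
  return std::nullopt;
}

// Resumes threads in order and returns the first failure, wrapped with the
// id of the thread that failed, instead of silently dropping the result.
std::optional<std::string> ResumeAll(const std::vector<FakeThread> &threads) {
  for (const FakeThread &thread : threads) {
    if (std::optional<std::string> err = ResumeOne(thread))
      return "failed to resume thread tid " + std::to_string(thread.tid) +
             ", error = " + *err;
  }
  return std::nullopt;
}
```

Threads after the failing one are not resumed in this sketch, which matches the early `return` in the diff.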

lldb/source/Plugins/Process/Utility/NativeProcessSoftwareSingleStep.cpp

Show First 20 Lines • Show All 122 Lines • ▼ Show 20 Lines	Status NativeProcessSoftwareSingleStep::SetupSoftwareSingleStepping(

const RegisterInfo *reg_info_pc = register_context.GetRegisterInfo(		const RegisterInfo *reg_info_pc = register_context.GetRegisterInfo(
eRegisterKindGeneric, LLDB_REGNUM_GENERIC_PC);		eRegisterKindGeneric, LLDB_REGNUM_GENERIC_PC);
const RegisterInfo *reg_info_flags = register_context.GetRegisterInfo(		const RegisterInfo *reg_info_flags = register_context.GetRegisterInfo(
eRegisterKindGeneric, LLDB_REGNUM_GENERIC_FLAGS);		eRegisterKindGeneric, LLDB_REGNUM_GENERIC_FLAGS);

auto pc_it =		auto pc_it =
baton.m_register_values.find(reg_info_pc->kinds[eRegisterKindDWARF]);		baton.m_register_values.find(reg_info_pc->kinds[eRegisterKindDWARF]);
auto flags_it =		auto flags_it = reg_info_flags == nullptr
baton.m_register_values.find(reg_info_flags->kinds[eRegisterKindDWARF]);		? baton.m_register_values.end()
		: baton.m_register_values.find(
		reg_info_flags->kinds[eRegisterKindDWARF]);

lldb::addr_t next_pc;		lldb::addr_t next_pc;
lldb::addr_t next_flags;		lldb::addr_t next_flags;
if (emulation_result) {		if (emulation_result) {
assert(pc_it != baton.m_register_values.end() &&		assert(pc_it != baton.m_register_values.end() &&
"Emulation was successfull but PC wasn't updated");		"Emulation was successfull but PC wasn't updated");
next_pc = pc_it->second.GetAsUInt64();		next_pc = pc_it->second.GetAsUInt64();

Show All 19 Lines	Status NativeProcessSoftwareSingleStep::SetupSoftwareSingleStepping(
if (arch.GetMachine() == llvm::Triple::arm) {		if (arch.GetMachine() == llvm::Triple::arm) {
if (next_flags & 0x20) {		if (next_flags & 0x20) {
// Thumb mode		// Thumb mode
size_hint = 2;		size_hint = 2;
} else {		} else {
// Arm mode		// Arm mode
size_hint = 4;		size_hint = 4;
}		}
} else if (arch.IsMIPS() || arch.GetTriple().isPPC64())		} else if (arch.IsMIPS() || arch.GetTriple().isPPC64() ||
		arch.GetTriple().isRISCV())
size_hint = 4;		size_hint = 4;
error = process.SetBreakpoint(next_pc, size_hint, /*hardware=*/false);		error = process.SetBreakpoint(next_pc, size_hint, /*hardware=*/false);

// If setting the breakpoint fails because next_pc is out of the address		// If setting the breakpoint fails because next_pc is out of the address
// space, ignore it and let the debugee segfault.		// space, ignore it and let the debugee segfault.
if (error.GetError() == EIO || error.GetError() == EFAULT) {		if (error.GetError() == EIO || error.GetError() == EFAULT) {
return Status();		return Status();
} else if (error.Fail())		} else if (error.Fail())
return error;		return error;

m_threads_stepping_with_breakpoint.insert({thread.GetID(), next_pc});		m_threads_stepping_with_breakpoint.insert({thread.GetID(), next_pc});

return Status();		return Status();
}		}
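
The `flags_it` change above guards against targets that have no generic flags register: when `GetRegisterInfo` returns a null pointer, the iterator is set to `end()` so that later code sees "not found" instead of dereferencing null. A minimal sketch of the same guard, using hypothetical `RegInfo` and `LookupRegister` names and a plain `std::map` in place of the baton's register table:

```cpp
#include <cstdint>
#include <map>
#include <optional>

// Hypothetical stand-in for RegisterInfo; only the field the lookup needs.
struct RegInfo {
  uint32_t dwarf_num;
};

// Looks up a register value by its DWARF number. A null `info` (register
// not defined for this target) is treated the same as a missing entry.
std::optional<uint64_t>
LookupRegister(const std::map<uint32_t, uint64_t> &values,
               const RegInfo *info) {
  auto it = info == nullptr ? values.end() : values.find(info->dwarf_num);
  if (it == values.end())
    return std::nullopt;
  return it->second;
}
```

Mapping the null case onto `end()` means every later `it != end()` check keeps working without a second null test.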

lldb/tools/lldb-server/CMakeLists.txt

Show First 20 Lines • Show All 47 Lines • ▼ Show 20 Lines	add_lldb_tool(lldb-server
LINK_LIBS		LINK_LIBS
lldbHost		lldbHost
lldbInitialization		lldbInitialization
lldbVersion		lldbVersion
${LLDB_PLUGINS}		${LLDB_PLUGINS}
lldbPluginInstructionARM		lldbPluginInstructionARM
lldbPluginInstructionMIPS		lldbPluginInstructionMIPS
lldbPluginInstructionMIPS64		lldbPluginInstructionMIPS64
		lldbPluginInstructionRISCV
${LLDB_SYSTEM_LIBS}		${LLDB_SYSTEM_LIBS}

LINK_COMPONENTS		LINK_COMPONENTS
Option		Option
Support		Support
)		)

add_dependencies(lldb-server		add_dependencies(lldb-server
LLGSOptionsTableGen		LLGSOptionsTableGen
${tablegen_deps}		${tablegen_deps}
)		)
target_include_directories(lldb-server PRIVATE "${LLDB_SOURCE_DIR}/source")		target_include_directories(lldb-server PRIVATE "${LLDB_SOURCE_DIR}/source")
target_link_libraries(lldb-server PRIVATE ${LLDB_SYSTEM_LIBS})		target_link_libraries(lldb-server PRIVATE ${LLDB_SYSTEM_LIBS})

lldb/tools/lldb-server/SystemInitializerLLGS.cpp

	Show All 35 Lines
	#endif			#endif

	#if defined(__mips__) || defined(mips) || defined(__mips) || \			#if defined(__mips__) || defined(mips) || defined(__mips) || \
	defined(__MIPS__) || defined(_M_MIPS) || defined(LLDB_TARGET_MIPS64)			defined(__MIPS__) || defined(_M_MIPS) || defined(LLDB_TARGET_MIPS64)
	#define LLDB_TARGET_MIPS			#define LLDB_TARGET_MIPS
	#include "Plugins/Instruction/MIPS/EmulateInstructionMIPS.h"			#include "Plugins/Instruction/MIPS/EmulateInstructionMIPS.h"
	#endif			#endif

				#if defined(__riscv)
				#define LLDB_TARGET_RISCV
				#include "Plugins/Instruction/RISCV/EmulateInstructionRISCV.h"
				#endif

	using namespace lldb_private;			using namespace lldb_private;

	llvm::Error SystemInitializerLLGS::Initialize() {			llvm::Error SystemInitializerLLGS::Initialize() {
	if (auto e = SystemInitializerCommon::Initialize())			if (auto e = SystemInitializerCommon::Initialize())
	return e;			return e;

	HostObjectFile::Initialize();			HostObjectFile::Initialize();

	#if defined(LLDB_TARGET_ARM) || defined(LLDB_TARGET_ARM64)			#if defined(LLDB_TARGET_ARM) || defined(LLDB_TARGET_ARM64)
	EmulateInstructionARM::Initialize();			EmulateInstructionARM::Initialize();
	#endif			#endif
	#if defined(LLDB_TARGET_MIPS) || defined(LLDB_TARGET_MIPS64)			#if defined(LLDB_TARGET_MIPS) || defined(LLDB_TARGET_MIPS64)
	EmulateInstructionMIPS::Initialize();			EmulateInstructionMIPS::Initialize();
	#endif			#endif
	#if defined(LLDB_TARGET_MIPS64)			#if defined(LLDB_TARGET_MIPS64)
	EmulateInstructionMIPS64::Initialize();			EmulateInstructionMIPS64::Initialize();
	#endif			#endif
				#if defined(LLDB_TARGET_RISCV)
				EmulateInstructionRISCV::Initialize();
				#endif

	return llvm::Error::success();			return llvm::Error::success();
	}			}

	void SystemInitializerLLGS::Terminate() {			void SystemInitializerLLGS::Terminate() {
	HostObjectFile::Terminate();			HostObjectFile::Terminate();

	#if defined(LLDB_TARGET_ARM) || defined(LLDB_TARGET_ARM64)			#if defined(LLDB_TARGET_ARM) || defined(LLDB_TARGET_ARM64)
	EmulateInstructionARM::Terminate();			EmulateInstructionARM::Terminate();
	#endif			#endif
	#if defined(LLDB_TARGET_MIPS) || defined(LLDB_TARGET_MIPS64)			#if defined(LLDB_TARGET_MIPS) || defined(LLDB_TARGET_MIPS64)
	EmulateInstructionMIPS::Terminate();			EmulateInstructionMIPS::Terminate();
	#endif			#endif
	#if defined(LLDB_TARGET_MIPS64)			#if defined(LLDB_TARGET_MIPS64)
	EmulateInstructionMIPS64::Terminate();			EmulateInstructionMIPS64::Terminate();
	#endif			#endif
				#if defined(LLDB_TARGET_RISCV)
				EmulateInstructionRISCV::Terminate();
				#endif

	SystemInitializerCommon::Terminate();			SystemInitializerCommon::Terminate();
	}			}

[LLDB][RISCV] Make software single stepping work (Closed, Public)

Diff 452128

lldb/source/Plugins/Instruction/CMakeLists.txt

lldb/source/Plugins/Instruction/RISCV/CMakeLists.txt

lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.h

lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.cpp

lldb/source/Plugins/Process/Linux/NativeProcessLinux.cpp

lldb/source/Plugins/Process/Utility/NativeProcessSoftwareSingleStep.cpp

lldb/tools/lldb-server/CMakeLists.txt

lldb/tools/lldb-server/SystemInitializerLLGS.cpp