This is an archive of the discontinued LLVM Phabricator instance.

Are these two generic changes (not specific to riscv)? If so, can you make a patch for each? If it can be tested as well then great; if it's some theoretical corner case then it should be fine without.

I presume that with this change a bunch of tests started passing? Would be good to summarise that in the commit message. "after this change a further N tests that use single stepping now pass".

Address review comments:

Add unittests for EmulateInstructionRISCV.
Split "thread error" and "nullptr dereference" as separate PRs.

Harbormaster completed remote builds in B181457: Diff 452909. Aug 16 2022, 1:30 AM

DavidSpickett added inline comments. Aug 16 2022, 2:32 AM

lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.cpp
35	Move this next to first use if possible.
37	What part of the encoding does this help you extract? Can you add a comment with the name of the field as the spec calls it.
38	Also what does this mean?
82	Here we know that reg_encode is not 0, and reg_encode is unsigned so it cannot be negative. So it could just be `if (reg_encode <= 31)`.
108	Does compressed come into this at any point? Just checking since I see a lot of +4. It's fine to say it's not supported at this time.
119	JALR clears the bottom bit, why is this? On Arm thumb we have bx which can change modes between arm and thumb code. In thumb code the bottom bit is used as a mode bit to say you're in thumb. Is it anything like that? Please add comments to explain.
125	Is funct3 the name of a field from the spec? If so can you add a comment like "funct3 is the type of the compare", whatever it means.
138	Do you ever expect to hit this? Add an assert if you don't.
145	Nit: put this next to the ReadPC call. Generally, order the declarations as they are used.
149	Do this check one line up immediately after the ReadPC.
163	I would just merge these into the CompareB below.
181	Can you explain in the comments what RVI and RVA are?
182	What is not certain at this point? It would be good to record that at any point where the spec being in flux is an issue. E.g. "this code may change once the spec is ratified, at the moment X and Y aspects are not fixed" Just in case we wanted to fixup individual bits as they get decided on later (or find out where we diverge from the final spec).
187	Just put the array_lengthof call in the for line.
189	I would write this `inst & pat.type_mask`. I know it works either way but it looks a bit strange in my opinion.
197	`0x%x` and `does not branch:`
222	This comment seems redundant given that the function is called DecodeAndExecute.
249	Please add comments to explain this. I think what you're doing is checking whether the compressed encoding could be valid. We do similar things for Thumb encodings for Arm.
250	Seems easier to write this as != 3.
259	I would change this into an early return style like this:
	bool success = false;
	m_addr = ReadPC(&success);
	if (!success)
	  m_addr = LLDB_INVALID_ADDRESS;
	Context ctx;
	ctx.type = eContextReadOpcode;
	ctx.SetNoArgs();
	uint32_t inst = (uint32_t)ReadMemoryUnsigned(ctx, m_addr, 4, 0, &success);
	uint16_t try_rvc = (uint16_t)(inst & 0x0000ffff);
	uint16_t mask = try_rvc & 0b11;
	if (try_rvc != 0 && (mask == 0 || mask == 1 || mask == 2)) {
	  m_opcode.SetOpcode16(try_rvc, GetByteOrder());
	} else {
	  m_opcode.SetOpcode32(inst, GetByteOrder());
	}
	return true;
	}
lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.h
55	Do we need this wrapper/forwarder? It just passes on the arg unchanged.
lldb/unittests/Instruction/CMakeLists.txt
4	Also for AArch64 this should be "AARCH64" or "ARM64" (I think it's the former). In this context ARM == 32 bit Arm and AArch64 == 64 bit Arm (where elsewhere Arm means both, and I agree, it's confusing).
16	Why do we need this conditional logic here? Surely unittests don't need a riscv or arm64 host and shouldn't clash with each other, or did you find some issue including both?
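As a side note on the `mask` check raised for lines 249 and 250: in the standard RISC-V encoding scheme, a 16-bit parcel whose two lowest bits are anything other than `0b11` starts a compressed (16-bit) instruction, and the all-zero parcel is defined as illegal. A minimal sketch of that test in the `!= 3` form follows. `IsCompressedParcel` is a hypothetical helper name, not something in the patch, and it assumes only 16-bit and 32-bit encodings matter.

```cpp
#include <cstdint>

// Hypothetical helper, not part of the patch: returns true when the low
// 16 bits of a fetched word look like a compressed (RVC) instruction.
// Assumption: only the standard 16-bit and 32-bit encodings are considered.
constexpr bool IsCompressedParcel(uint32_t inst) {
  // Bits [1:0] == 0b11 mark a 32-bit (or longer) instruction; an all-zero
  // parcel is an illegal instruction, so it is not treated as compressed.
  return (inst & 0xffff) != 0 && (inst & 0b11) != 0b11;
}

// 0x0001 is c.nop, a valid compressed instruction.
static_assert(IsCompressedParcel(0x0001), "c.nop is 16-bit");
// 0x0000006f is `jal x0, 0`; its low bits are 0b11, so it is 32-bit.
static_assert(!IsCompressedParcel(0x0000006f), "jal is 32-bit");
```

This is equivalent to the `mask == 0 || mask == 1 || mask == 2` form in the diff, since `mask` can only take the values 0 to 3.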

I know this is a flood of comments but the overall impression is that this is mostly fine. A bunch of small things.

Thanks for splitting out the other patches and adding the test cases here.

lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.cpp
298	Assert here if you do not expect to hit this code. (even if that's just temporary until you emulate more instructions, it's better to know that something needs implementing)
lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.h
20	You have 2 public markers in the same class with no private between them. Intentional?
lldb/source/Plugins/Process/Linux/NativeProcessLinux.cpp
886	Could you use `isRISCV` here instead?
lldb/unittests/Instruction/RISCV/TestRISCVEmulator.cpp
9	This appears unused.
39	Please explain what this does. Just on the face of it, something a bit dodgy :) (maybe there is a better way to express it, even if it is legit)
45	if (reg == gpr_x0_riscv) reg_value.SetUInt(0, reg_info->byte_size); else reg_value.SetUInt(tester->gpr.gpr[reg], reg_info->byte_size); return true;
55–59	if (reg != gpr_x0_riscv) tester->gpr.gpr[reg] = reg_value.GetAsUInt64(); return true;
68	Assert that this was successful.
79	EncodeInstruction? I don't think we need to shorten the names quite this much.
103	You check the `~1` here but your input values don't have the bottom bit set. Does that mean this test is not covering that aspect of the emulation? Seems like it should be.
106	EncodeBranch
140	testBranch (you get the idea by now)
164	Add comments to explain what these numbers are. With the macro wrapper it's hard to tell what they mean and what this is actually checking.

Oh and please note in the commit message that this does not support compressed instructions (or does it, I assume no but make it obvious).

Emmmer marked 30 inline comments as done. Aug 16 2022, 5:00 AM

Emmmer added inline comments.

lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.cpp
82	Just to make the if condition explicitly exhaustive. Not relying on implicitly derived conditions helps the code survive larger refactorings that may break the implicit conditions. Sure it can be simplified to `reg_encode <= 31`, but we should let compilers do it for us rather than doing it manually.
108	No. `jal/jalr/b` are always 4-byte instructions. Supporting RVC does not need to change the +4 here.
119	It is like the mode bit in ARM but more flexible since riscv does not have two modes. The spec says auxiliary information can be stored to LSB in function addresses (like JIT compilers can use this bit). It is open to developers.
125	Yes. `funct3` is a widely used name in spec meaning `3-bits function selector`
182	It's a pity that the whole debug spec was not useable (not implemented by any emulator nor hardware). We don't use this code for platforms which implemented the debug spec. But for platforms without the debug spec we still need this emulation. We don't need to change the code here in either case in the future. So I guess it's fine to not record the issue.
222	A sense of ceremony haha. Will delete the comment accordingly.
249	Exactly! Comments are added now.
lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.h
20	Yes, to distinguish public static members from public instance members.
55	Yes. `SupportsThisInstructionType` is also used in `CreateInstance` which is a static method.
lldb/unittests/Instruction/CMakeLists.txt
4	This code confused me a lot and thank you for the explanation. I am going to check ARM and AARCH64 explicitly.
16	I don't know either. I see the original code checking TARGETS_TO_BUILD and there might be special consideration I suppose?
lldb/unittests/Instruction/RISCV/TestRISCVEmulator.cpp
9	nice catch!
39	It is actually an always-success dynamic_cast because: (1) `RISCVEmulatorTester` is derived from `EmulateInstruction`; (2) these callbacks are set in the constructor of `RISCVEmulatorTester`; (3) all `EmulateInstruction` instances created in this unittest are `RISCVEmulatorTester`. The signature of these callbacks is not polymorphic (in Java, for example, we would have `<T extends EmulateInstruction>` with bounded constraints), and that is why we need this cast. We could turn it into a template, but that requires more effort. I also noticed that there's a `void *baton` in the parameters, but using that field looks even worse. I guess we could have a `userdata` whose type was parametrized.
79	The `I` stands for `I-Type`, a riscv instruction format. Like `EncodeB` below.
103	`0x1024 - 255` = 3877, and `(0x1024 - 255) & ~1` = 3876, so I think it is covered 😃
106	The `B` stands for `B-Type` which works for all B-type instructions. Branch instructions are all of B-type so renaming it to `EncodeBranch` seems Okay right now, but we may add more B-type instructions in the future.
140	Yes this should be `Branch` since it only tests branch instructions.
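The arithmetic in the line 103 reply can be checked directly. A small sketch, assuming a hypothetical `JalrTarget` helper that mirrors the `(rs1 + offset) & ~1` expression in `ExecJALR`:

```cpp
#include <cstdint>

// Hypothetical helper mirroring ExecJALR: JALR adds the sign-extended
// 12-bit immediate to rs1 and then clears bit 0 of the result.
constexpr uint64_t JalrTarget(uint64_t rs1, int64_t offset) {
  return (rs1 + static_cast<uint64_t>(offset)) & ~uint64_t(1);
}

// 0x1024 is 4132, so 0x1024 - 255 is 3877 (odd) and the target is 3876.
static_assert(JalrTarget(0x1024, -255) == 3876, "bottom bit is cleared");
// An already even sum is left unchanged.
static_assert(JalrTarget(0x1000, 8) == 0x1008, "even target unchanged");
```

Because the sum is odd before masking, a test with these inputs does exercise the bit-clearing step even though neither input has its own bottom bit set in isolation.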

address review comments

Emmmer marked an inline comment as done. Aug 16 2022, 5:03 AM

Harbormaster completed remote builds in B181491: Diff 452959. Aug 16 2022, 5:05 AM

Are we expecting this code to only ever be given JAL or JALR at this time?

If it's the case that anything else will just return false or do nothing, it might be worth adding a couple of tests that check that encodings that aren't branches just do nothing or fail. Obviously not every encoding; maybe just flip 1 bit in the instruction type field as a smoke test for each of jal and jalr.
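One way to picture the suggested smoke test, as a sketch only: the mask and opcode values below are copied from the `PATTERNS` table in this diff, while `MatchesJAL`, `MatchesJALR`, `kJal` and `kJalr` are hypothetical names introduced for illustration. Flipping a single bit in the 7-bit opcode field of a valid `jal` or `jalr` yields a word that neither pattern matches.

```cpp
#include <cstdint>

// Mask and opcode values as they appear in the diff.
constexpr uint32_t I_MASK = 0b111000001111111; // funct3 + opcode
constexpr uint32_t J_MASK = 0b000000001111111; // opcode only
constexpr uint32_t JAL_OPCODE = 0b1101111;
constexpr uint32_t JALR_OPCODE = 0b1100111; // funct3 must be 0

// Hypothetical helpers mirroring the (inst & mask) == pattern test.
constexpr bool MatchesJAL(uint32_t inst) {
  return (inst & J_MASK) == JAL_OPCODE;
}
constexpr bool MatchesJALR(uint32_t inst) {
  return (inst & I_MASK) == JALR_OPCODE;
}

constexpr uint32_t kJal = 0x0000006f;  // jal x0, 0
constexpr uint32_t kJalr = 0x00008067; // jalr x0, 0(x1)

static_assert(MatchesJAL(kJal) && MatchesJALR(kJalr), "valid encodings match");
// Flip bit 4 of the opcode field: neither word is a known jump any more.
static_assert(!MatchesJAL(kJal ^ 0x10) && !MatchesJALR(kJal ^ 0x10), "");
static_assert(!MatchesJAL(kJalr ^ 0x10) && !MatchesJALR(kJalr ^ 0x10), "");
```

Which bit gets flipped matters: flipping bit 3 of the `jal` opcode (`0x6f ^ 0x08`) produces `0x67`, which is the `jalr` opcode, so that particular choice would not make a useful negative test.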

lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.cpp
222	No worries. Usually I end up with these because I wrote out the steps as comments first and forget to delete them.
lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.h
20	Got it, I missed that.
lldb/unittests/Instruction/CMakeLists.txt
16	Sorry I completely missed that first line (the diff split at that point). Right, so you're just adding the riscv equivalent of what's already there. So actually just leave the ARM bit as is, I'll do some builds and find out whether it needs to change for AArch64.
lldb/unittests/Instruction/RISCV/TestRISCVEmulator.cpp
39	"But using that field looks more terrible. I guess we could have a userdata whose type was parametrized." Yeah, there's a lot of...not ideal patterns in this area but I wouldn't pay much attention to them. Without opting into llvm's rtti equivalent I guess this is the best way to do it.
79	Cool. It's mostly that EncodeB looks like the author got distracted half way through the name. But anyway I'm not familiar with the spec so if it matches that then great.
99	Add a comment like "JALR will always zero the bottom bit of the target".
103	Doh, yes I wasn't looking at the offset.
162	Now I understand what's going on. I thought that this somehow included non branch encodings when in fact what it's doing is saying for a branch equals we should only branch if the operands are equal.

Emmmer added inline comments. Aug 16 2022, 6:35 AM

lldb/unittests/Instruction/CMakeLists.txt
16	Oh... thank you! I am gonna revert the change for AArch64.
lldb/unittests/Instruction/RISCV/TestRISCVEmulator.cpp
79	You're right, the name looks unfriendly to other developers. So I changed its name to `EncodeIType` (also for `EncodeBType`) to avoid confusion.
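For readers unfamiliar with the format names: an I-type instruction packs `imm[11:0]`, `rs1`, `funct3`, `rd` and the opcode into bits 31:20, 19:15, 14:12, 11:7 and 6:0 respectively. A hedged sketch of what an encoder with that name could look like follows. The signature and argument order here are assumptions, and the test file's actual `EncodeIType` may differ.

```cpp
#include <cstdint>

// Hypothetical signature; field positions follow the RISC-V I-type format.
constexpr uint32_t EncodeIType(uint32_t opcode, uint32_t funct3, uint32_t rd,
                               uint32_t rs1, uint32_t imm) {
  return ((imm & 0xfff) << 20) | ((rs1 & 0x1f) << 15) |
         ((funct3 & 0x7) << 12) | ((rd & 0x1f) << 7) | (opcode & 0x7f);
}

// jalr x0, 0(x1), i.e. `ret`, encodes as 0x00008067.
static_assert(EncodeIType(0b1100111, 0, 0, 1, 0) == 0x00008067, "ret");
// jalr x1, -255(x2): the immediate is stored as 12-bit two's complement.
static_assert(EncodeIType(0b1100111, 0, 1, 2, uint32_t(-255)) == 0xf01100e7,
              "negative immediate");
```

Masking each field before shifting keeps an out-of-range argument from spilling into a neighbouring field, which makes mistakes in a test easier to spot.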

address review comments

Harbormaster completed remote builds in B181524: Diff 452997. Aug 16 2022, 7:16 AM

LGTM

This revision is now accepted and ready to land. Aug 16 2022, 7:22 AM

Closed by commit rG4fc7e9cba24b: [LLDB][RISCV] Make software single stepping work (authored by Emmmer). Aug 16 2022, 8:45 AM

This revision was automatically updated to reflect the committed changes.

Emmmer added a commit: rG4fc7e9cba24b: [LLDB][RISCV] Make software single stepping work.

Herald added a subscriber: lldb-commits. Aug 16 2022, 8:45 AM

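Revision Contents

Path	Size
lldb/source/Plugins/Instruction/CMakeLists.txt	1 line
lldb/source/Plugins/Instruction/RISCV/CMakeLists.txt	11 lines
lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.h	72 lines
lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.cpp	347 lines
lldb/source/Plugins/Process/Linux/NativeProcessLinux.cpp	4 lines
lldb/source/Plugins/Process/Utility/NativeProcessSoftwareSingleStep.cpp	3 lines
lldb/tools/lldb-server/CMakeLists.txt	1 line
lldb/tools/lldb-server/SystemInitializerLLGS.cpp	11 lines
lldb/unittests/Instruction/ARM64/TestAArch64Emulator.cpp
lldb/unittests/Instruction/CMakeLists.txt	39 lines
lldb/unittests/Instruction/RISCV/TestRISCVEmulator.cpp	171 lines
lldb/unittests/Instruction/TestAArch64Emulator.cpp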

Diff 452909

lldb/source/Plugins/Instruction/CMakeLists.txt

	add_subdirectory(ARM)			add_subdirectory(ARM)
	add_subdirectory(ARM64)			add_subdirectory(ARM64)
	add_subdirectory(MIPS)			add_subdirectory(MIPS)
	add_subdirectory(MIPS64)			add_subdirectory(MIPS64)
	add_subdirectory(PPC64)			add_subdirectory(PPC64)
				add_subdirectory(RISCV)

lldb/source/Plugins/Instruction/RISCV/CMakeLists.txt

This file was added.

				add_lldb_library(lldbPluginInstructionRISCV PLUGIN
				EmulateInstructionRISCV.cpp

				LINK_LIBS
				lldbCore
				lldbInterpreter
				lldbSymbol
				lldbPluginProcessUtility
				LINK_COMPONENTS
				Support
				)

lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.h

This file was added.

				//===-- EmulateInstructionRISCV.h -----------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#ifndef LLDB_SOURCE_PLUGINS_INSTRUCTION_RISCV_EMULATEINSTRUCTIONRISCV_H
				#define LLDB_SOURCE_PLUGINS_INSTRUCTION_RISCV_EMULATEINSTRUCTIONRISCV_H

				#include "lldb/Core/EmulateInstruction.h"
				#include "lldb/Interpreter/OptionValue.h"
				#include "lldb/Utility/Log.h"
				#include "lldb/Utility/Status.h"

				namespace lldb_private {

				class EmulateInstructionRISCV : public EmulateInstruction {
				public:
				DavidSpickett: You have 2 public markers in the same class with no private between them. Intentional?
				Emmmer: Yes, to distinguish public static members from public instance members.
				DavidSpickett: Got it, I missed that.
				static llvm::StringRef GetPluginNameStatic() { return "riscv"; }

				static llvm::StringRef GetPluginDescriptionStatic() {
				return "Emulate instructions for the RISC-V architecture.";
				}

				static bool SupportsThisInstructionType(InstructionType inst_type) {
				switch (inst_type) {
				case eInstructionTypeAny:
				case eInstructionTypePCModifying:
				return true;
				case eInstructionTypePrologueEpilogue:
				case eInstructionTypeAll:
				default:
				return false;
				}
				}

				static bool SupportsThisArch(const ArchSpec &arch);

				static lldb_private::EmulateInstruction *
				CreateInstance(const lldb_private::ArchSpec &arch, InstructionType inst_type);

				static void Initialize();

				static void Terminate();

				public:
				EmulateInstructionRISCV(const ArchSpec &arch) : EmulateInstruction(arch) {}

				llvm::StringRef GetPluginName() override { return GetPluginNameStatic(); }

				bool SupportsEmulatingInstructionsOfType(InstructionType inst_type) override {
				return SupportsThisInstructionType(inst_type);
				}
				DavidSpickett: Do we need this wrapper/forwarder? It just passes on the arg unchanged.
				Emmmer: Yes. `SupportsThisInstructionType` is also used in `CreateInstance` which is a static method.

				bool SetTargetTriple(const ArchSpec &arch) override;
				bool ReadInstruction() override;
				bool EvaluateInstruction(uint32_t options) override;
				bool TestEmulation(Stream *out_stream, ArchSpec &arch,
				OptionValueDictionary *test_data) override;
				bool GetRegisterInfo(lldb::RegisterKind reg_kind, uint32_t reg_num,
				RegisterInfo &reg_info) override;

				lldb::addr_t ReadPC(bool *success);
				bool WritePC(lldb::addr_t pc);
				bool DecodeAndExecute(uint32_t inst, bool ignore_cond);
				};

				} // namespace lldb_private

				#endif // LLDB_SOURCE_PLUGINS_INSTRUCTION_RISCV_EMULATEINSTRUCTIONRISCV_H

lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.cpp

This file was added.

				//===-- EmulateInstructionRISCV.cpp ---------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#include <cstdlib>

				#include "EmulateInstructionRISCV.h"
				#include "Plugins/Process/Utility/RegisterInfoPOSIX_riscv64.h"
				#include "Plugins/Process/Utility/lldb-riscv-register-enums.h"

				#include "lldb/Core/Address.h"
				#include "lldb/Core/PluginManager.h"
				#include "lldb/Interpreter/OptionValueArray.h"
				#include "lldb/Interpreter/OptionValueDictionary.h"
				#include "lldb/Symbol/UnwindPlan.h"
				#include "lldb/Utility/ArchSpec.h"
				#include "lldb/Utility/LLDBLog.h"
				#include "lldb/Utility/RegisterValue.h"
				#include "lldb/Utility/Stream.h"

				#include "llvm/ADT/STLExtras.h"
				#include "llvm/Support/MathExtras.h"

				using namespace lldb;
				using namespace lldb_private;

				LLDB_PLUGIN_DEFINE_ADV(EmulateInstructionRISCV, InstructionRISCV)

				namespace lldb_private {

				struct InstrPattern {
				DavidSpickett: Move this next to first use if possible.
				const char *name;
				uint32_t type_mask;
				DavidSpickett: What part of the encoding does this help you extract? Can you add a comment with the name of the field as the spec calls it.
				uint32_t eigen;
				DavidSpickett: Also what does this mean?
				bool (*exec)(EmulateInstructionRISCV *emulator, uint32_t inst,
				bool ignore_cond);
				};

				constexpr uint32_t I_MASK = 0b111000001111111;
				constexpr uint32_t J_MASK = 0b000000001111111;
				// no funct3 in the b-mask because the logic executing B<CMP> is quite similar.
				constexpr uint32_t B_MASK = 0b000000001111111;
				constexpr uint32_t BEQ = 0b000;
				constexpr uint32_t BNE = 0b001;
				constexpr uint32_t BLT = 0b100;
				constexpr uint32_t BGE = 0b101;
				constexpr uint32_t BLTU = 0b110;
				constexpr uint32_t BGEU = 0b111;

				constexpr uint32_t DecodeRD(uint32_t inst) { return (inst & 0xF80) >> 7; }
				constexpr uint32_t DecodeRS1(uint32_t inst) { return (inst & 0xF8000) >> 15; }
				constexpr uint32_t DecodeRS2(uint32_t inst) { return (inst & 0x1F00000) >> 20; }
				constexpr uint32_t DecodeFunct3(uint32_t inst) { return (inst & 0x7000) >> 12; }

				constexpr int32_t SignExt(uint32_t imm) { return int32_t(imm); }

				constexpr uint32_t DecodeJImm(uint32_t inst) {
				return (uint64_t(int64_t(int32_t(inst & 0x80000000)) >> 11)) // imm[20]
				| (inst & 0xff000) // imm[19:12]
				| ((inst >> 9) & 0x800) // imm[11]
				| ((inst >> 20) & 0x7fe); // imm[10:1]
				}

				constexpr uint32_t DecodeIImm(uint32_t inst) {
				return int64_t(int32_t(inst)) >> 20; // imm[11:0]
				}

				constexpr uint32_t DecodeBImm(uint32_t inst) {
				return (uint64_t(int64_t(int32_t(inst & 0x80000000)) >> 19)) // imm[12]
				| ((inst & 0x80) << 4) // imm[11]
				| ((inst >> 20) & 0x7e0) // imm[10:5]
				| ((inst >> 7) & 0x1e); // imm[4:1]
				}

				static uint32_t GPREncodingToLLDB(uint32_t reg_encode) {
				if (reg_encode == 0)
				return gpr_x0_riscv;
				if (reg_encode >= 1 && reg_encode <= 31)
				DavidSpickett: Here we know that reg_encode is not 0, and reg_encode is unsigned so it cannot be negative. So it could just be `if (reg_encode <= 31)`.
				Emmmer: Just to make the if condition explicitly exhaustive. Not relying on implicitly derived conditions helps the code survive larger refactorings that may break the implicit conditions. Sure it can be simplified to `reg_encode <= 31`, but we should let compilers do it for us rather than doing it manually.
				return gpr_x1_riscv + reg_encode - 1;
				return LLDB_INVALID_REGNUM;
				}

				static bool ReadRegister(EmulateInstructionRISCV *emulator, uint32_t reg_encode,
				RegisterValue &value) {
				uint32_t lldb_reg = GPREncodingToLLDB(reg_encode);
				return emulator->ReadRegister(eRegisterKindLLDB, lldb_reg, value);
				}

				static bool WriteRegister(EmulateInstructionRISCV *emulator,
				uint32_t reg_encode, const RegisterValue &value) {
				uint32_t lldb_reg = GPREncodingToLLDB(reg_encode);
				EmulateInstruction::Context ctx;
				ctx.type = EmulateInstruction::eContextRegisterStore;
				ctx.SetNoArgs();
				return emulator->WriteRegister(ctx, eRegisterKindLLDB, lldb_reg, value);
				}

				static bool ExecJAL(EmulateInstructionRISCV *emulator, uint32_t inst, bool) {
				bool success = false;
				int64_t offset = SignExt(DecodeJImm(inst));
				int64_t pc = emulator->ReadPC(&success);
				return success && emulator->WritePC(pc + offset) &&
				WriteRegister(emulator, DecodeRD(inst),
				RegisterValue(uint64_t(pc + 4)));
				DavidSpickett: Does compressed come into this at any point? Just checking since I see a lot of +4. It's fine to say it's not supported at this time.
				Emmmer: No. `jal/jalr/b` are always 4-byte instructions. Supporting RVC does not need to change the +4 here.
				}

				static bool ExecJALR(EmulateInstructionRISCV *emulator, uint32_t inst, bool) {
				int64_t offset = SignExt(DecodeIImm(inst));
				RegisterValue value;
				if (!ReadRegister(emulator, DecodeRS1(inst), value))
				return false;
				bool success = false;
				int64_t pc = emulator->ReadPC(&success);
				int64_t rs1 = int64_t(value.GetAsUInt64());
				return emulator->WritePC((rs1 + offset) & ~1) &&
				DavidSpickett: JALR clears the bottom bit, why is this? On Arm thumb we have bx which can change modes between arm and thumb code. In thumb code the bottom bit is used as a mode bit to say you're in thumb. Is it anything like that? Please add comments to explain.
				Emmmer: It is like the mode bit in ARM but more flexible since riscv does not have two modes. The spec says auxiliary information can be stored to LSB in function addresses (like JIT compilers can use this bit). It is open to developers.
				WriteRegister(emulator, DecodeRD(inst),
				RegisterValue(uint64_t(pc + 4)));
				}

				static bool CompareB(uint64_t rs1, uint64_t rs2, uint32_t funct3) {
				switch (funct3) {
				DavidSpickett: Is funct3 the name of a field from the spec? If so can you add a comment like "funct3 is the type of the compare", whatever it means.
				Emmmer: Yes. `funct3` is a widely used name in spec meaning `3-bits function selector`
				case BEQ:
				return rs1 == rs2;
				case BNE:
				return rs1 != rs2;
				case BLT:
				return int64_t(rs1) < int64_t(rs2);
				case BGE:
				return int64_t(rs1) >= int64_t(rs2);
				case BLTU:
				return rs1 < rs2;
				case BGEU:
				return rs1 >= rs2;
				default:
				DavidSpickett: Do you ever expect to hit this? Add an assert if you don't.
				return false;
				}
				}

				static bool ExecB(EmulateInstructionRISCV *emulator, uint32_t inst,
				bool ignore_cond) {
				bool success = false;
				DavidSpickett: Nit: put this next to the ReadPC call. Generally, order the declarations as they are used.
				uint64_t offset = SignExt(DecodeBImm(inst));
				uint64_t pc = emulator->ReadPC(&success);
				uint64_t target = pc + offset;
				if (!success)
				DavidSpickett: Do this check one line up immediately after the ReadPC.
				return false;
				if (ignore_cond)
				return emulator->WritePC(target);

				RegisterValue value1;
				RegisterValue value2;
				if (!ReadRegister(emulator, DecodeRS1(inst), value1) ||
				!ReadRegister(emulator, DecodeRS2(inst), value2))
				return false;

				uint32_t funct3 = DecodeFunct3(inst);

				uint64_t rs1 = value1.GetAsUInt64();
				uint64_t rs2 = value2.GetAsUInt64();
				DavidSpickett: I would just merge these into the CompareB below.

				if (CompareB(rs1, rs2, funct3))
				return emulator->WritePC(target);

				return true;
				}

				static InstrPattern PATTERNS[] = {
				{"JAL", J_MASK, 0b1101111, ExecJAL},
				{"JALR", I_MASK, 0b000000001100111, ExecJALR},
				{"B<CMP>", B_MASK, 0b1100011, ExecB},
				// TODO: {LR/SC}.{W/D} and ECALL
				};

				/// This function only determines the next instruction address for software
				/// single stepping by emulating branching instructions including:
				/// - from RVI: JAL, JALR, B<CMP>, ECALL
				/// - from RVA: LR -> BNE -> SC -> BNE
				DavidSpickett: Can you explain in the comments what RVI and RVA are?
				/// We will get rid of this tedious code when the riscv debug spec is ratified.
				DavidSpickett: What is not certain at this point? It would be good to record that at any point where the spec being in flux is an issue. E.g. "this code may change once the spec is ratified, at the moment X and Y aspects are not fixed" Just in case we wanted to fixup individual bits as they get decided on later (or find out where we diverge from the final spec).
				Emmmer: It's a pity that the whole debug spec was not useable (not implemented by any emulator nor hardware). We don't use this code for platforms which implemented the debug spec. But for platforms without the debug spec we still need this emulation. We don't need to change the code here in either case in the future. So I guess it's fine to not record the issue.
				bool EmulateInstructionRISCV::DecodeAndExecute(uint32_t inst,
				bool ignore_cond) {
				Log *log = GetLog(LLDBLog::Process | LLDBLog::Breakpoints);
				int length = llvm::array_lengthof(PATTERNS);
				for (int i = 0; i < length; ++i) {
				DavidSpickett: Just put the array_lengthof call in the for line.
				const InstrPattern &pat = PATTERNS[i];
				if ((pat.type_mask & inst) == pat.eigen) {
				DavidSpickett: I would write this `inst & pat.type_mask`. I know it works either way but it looks a bit strange in my opinion.
				LLDB_LOGF(log, "EmulateInstructionRISCV::%s: inst(%x) was decoded to %s",
				__FUNCTION__, inst, pat.name);
				return pat.exec(this, inst, ignore_cond);
				}
				}

				LLDB_LOGF(log,
				"EmulateInstructionRISCV::%s: inst(%x) does not branching: "
				DavidSpickett: `0x%x` and `does not branch:`
				"no need to calculate the next pc address which is trivial.",
				__FUNCTION__, inst);
				return true;
				}

				bool EmulateInstructionRISCV::EvaluateInstruction(uint32_t options) {
				uint32_t inst_size = m_opcode.GetByteSize();
				uint32_t inst = m_opcode.GetOpcode32();
				bool increase_pc = options & eEmulateInstructionOptionAutoAdvancePC;
				bool ignore_cond = options & eEmulateInstructionOptionIgnoreConditions;
				bool success = false;

				lldb::addr_t old_pc = 0;
				if (increase_pc) {
				old_pc = ReadPC(&success);
				if (!success)
				return false;
				}

				if (inst_size == 2) {
				// TODO: execute RVC
				return false;
				}

				// Execute it.
				DavidSpickett: This comment seems redundant given that the function is called DecodeAndExecute.
				Emmmer: A sense of ceremony haha. Will delete the comment accordingly.
				DavidSpickett: No worries. Usually I end up with these because I wrote out the steps as comments first and forget to delete them.
				success = DecodeAndExecute(inst, ignore_cond);
				if (!success)
				return false;

				if (increase_pc) {
				lldb::addr_t new_pc = ReadPC(&success);
				if (!success)
				return false;

				if (new_pc == old_pc) {
				if (!WritePC(old_pc + inst_size))
				return false;
				}
				}
				return true;
				}

				bool EmulateInstructionRISCV::ReadInstruction() {
				bool success = false;
				m_addr = ReadPC(&success);
				if (success) {
				Context ctx;
				ctx.type = eContextReadOpcode;
				ctx.SetNoArgs();
				uint32_t inst = (uint32_t)ReadMemoryUnsigned(ctx, m_addr, 4, 0, &success);
				uint16_t try_rvc = (uint16_t)(inst & 0x0000ffff);
				uint16_t mask = try_rvc & 0b11;
				DavidSpickett: Please add comments to explain this. I think what you're doing is checking whether the compressed encoding could be valid. We do similar things for Thumb encodings for Arm.
				Emmmer: Exactly! Comments are added now.
				if (try_rvc != 0 && (mask == 0 || mask == 1 || mask == 2)) {
				DavidSpickett: Seems easier to write this as != 3.
				m_opcode.SetOpcode16(try_rvc, GetByteOrder());
				} else {
				m_opcode.SetOpcode32(inst, GetByteOrder());
				}
				}
				if (!success)
				m_addr = LLDB_INVALID_ADDRESS;
				return success;
				}
				DavidSpickett: I would change this into an early return style like this:
				```
				bool success = false;
				m_addr = ReadPC(&success);
				if (!success)
				m_addr = LLDB_INVALID_ADDRESS;
				Context ctx;
				ctx.type = eContextReadOpcode;
				ctx.SetNoArgs();
				uint32_t inst = (uint32_t)ReadMemoryUnsigned(ctx, m_addr, 4, 0, &success);
				uint16_t try_rvc = (uint16_t)(inst & 0x0000ffff);
				uint16_t mask = try_rvc & 0b11;
				if (try_rvc != 0 && (mask == 0 || mask == 1 || mask == 2)) {
				m_opcode.SetOpcode16(try_rvc, GetByteOrder());
				} else {
				m_opcode.SetOpcode32(inst, GetByteOrder());
				}
				return true;
				}
				```
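The compressed-encoding check in `ReadInstruction` can be reduced to the `!= 3` form suggested in the review. The following is a sketch only, with hypothetical helper names (`IsCompressedParcel`, `InstructionLength`); it mirrors the patch in treating an all-zero parcel as not compressed:

```cpp
#include <cassert>
#include <cstdint>

// A 16-bit parcel whose low two bits are not 0b11 is a compressed (RVC)
// encoding. The all-zero parcel is excluded, as in the patch under review.
inline bool IsCompressedParcel(uint16_t parcel) {
  return parcel != 0 && (parcel & 0b11) != 0b11;
}

// Length in bytes of the instruction that starts at the low half of `word`.
inline unsigned InstructionLength(uint32_t word) {
  return IsCompressedParcel(static_cast<uint16_t>(word & 0xffff)) ? 2 : 4;
}

// Small usage example.
inline void DemoInstructionLength() {
  assert(InstructionLength(0x00004501) == 2); // low bits are 0b01
  assert(InstructionLength(0xFE9FF0EF) == 4); // low bits are 0b11
}
```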

				lldb::addr_t EmulateInstructionRISCV::ReadPC(bool *success) {
				return ReadRegisterUnsigned(eRegisterKindGeneric, LLDB_REGNUM_GENERIC_PC,
				LLDB_INVALID_ADDRESS, success);
				}

				bool EmulateInstructionRISCV::WritePC(lldb::addr_t pc) {
				EmulateInstruction::Context ctx;
				ctx.type = eContextAdvancePC;
				ctx.SetNoArgs();
				return WriteRegisterUnsigned(ctx, eRegisterKindGeneric,
				LLDB_REGNUM_GENERIC_PC, pc);
				}

				bool EmulateInstructionRISCV::GetRegisterInfo(lldb::RegisterKind reg_kind,
				uint32_t reg_index,
				RegisterInfo &reg_info) {
				if (reg_kind == eRegisterKindGeneric) {
				switch (reg_index) {
				case LLDB_REGNUM_GENERIC_PC:
				reg_kind = eRegisterKindLLDB;
				reg_index = gpr_pc_riscv;
				break;
				case LLDB_REGNUM_GENERIC_SP:
				reg_kind = eRegisterKindLLDB;
				reg_index = gpr_sp_riscv;
				break;
				case LLDB_REGNUM_GENERIC_FP:
				reg_kind = eRegisterKindLLDB;
				reg_index = gpr_fp_riscv;
				break;
				case LLDB_REGNUM_GENERIC_RA:
				reg_kind = eRegisterKindLLDB;
				reg_index = gpr_ra_riscv;
				break;
				// We may handle LLDB_REGNUM_GENERIC_ARGx when more instructions are
				// supported.
				default:
				return false;
				DavidSpickett: Assert here if you do not expect to hit this code. (Even if that's just temporary until you emulate more instructions, it's better to know that something needs implementing.)
				}
				}

				const RegisterInfo *array =
				RegisterInfoPOSIX_riscv64::GetRegisterInfoPtr(m_arch);
				const uint32_t length =
				RegisterInfoPOSIX_riscv64::GetRegisterInfoCount(m_arch);

				if (reg_index >= length || reg_kind != eRegisterKindLLDB)
				return false;

				reg_info = array[reg_index];
				return true;
				}

				bool EmulateInstructionRISCV::SetTargetTriple(const ArchSpec &arch) {
				return SupportsThisArch(arch);
				}

				bool EmulateInstructionRISCV::TestEmulation(Stream *out_stream, ArchSpec &arch,
				OptionValueDictionary *test_data) {
				return false;
				}

				void EmulateInstructionRISCV::Initialize() {
				PluginManager::RegisterPlugin(GetPluginNameStatic(),
				GetPluginDescriptionStatic(), CreateInstance);
				}

				void EmulateInstructionRISCV::Terminate() {
				PluginManager::UnregisterPlugin(CreateInstance);
				}

				lldb_private::EmulateInstruction *
				EmulateInstructionRISCV::CreateInstance(const ArchSpec &arch,
				InstructionType inst_type) {
				if (EmulateInstructionRISCV::SupportsThisInstructionType(inst_type) &&
				SupportsThisArch(arch)) {
				return new EmulateInstructionRISCV(arch);
				}

				return nullptr;
				}

				bool EmulateInstructionRISCV::SupportsThisArch(const ArchSpec &arch) {
				return arch.GetTriple().isRISCV();
				}

				} // namespace lldb_private

lldb/source/Plugins/Process/Linux/NativeProcessLinux.cpp

[...]	bool NativeProcessLinux::MonitorClone(NativeThreadLinux &parent,
default:		default:
llvm_unreachable("unknown clone_info.event");		llvm_unreachable("unknown clone_info.event");
}		}

return true;		return true;
}		}

bool NativeProcessLinux::SupportHardwareSingleStepping() const {		bool NativeProcessLinux::SupportHardwareSingleStepping() const {
if (m_arch.GetMachine() == llvm::Triple::arm || m_arch.IsMIPS())		Triple::ArchType machine = m_arch.GetMachine();
		if (m_arch.IsMIPS() || machine == llvm::Triple::arm ||
		DavidSpickett: Could you use `isRISCV` here instead?
		machine == llvm::Triple::riscv32 || machine == llvm::Triple::riscv64)
return false;		return false;
return true;		return true;
}		}

Status NativeProcessLinux::Resume(const ResumeActionList &resume_actions) {		Status NativeProcessLinux::Resume(const ResumeActionList &resume_actions) {
Log *log = GetLog(POSIXLog::Process);		Log *log = GetLog(POSIXLog::Process);
LLDB_LOG(log, "pid {0}", GetID());		LLDB_LOG(log, "pid {0}", GetID());

[...]

lldb/source/Plugins/Process/Utility/NativeProcessSoftwareSingleStep.cpp

[...]	Status NativeProcessSoftwareSingleStep::SetupSoftwareSingleStepping(
if (arch.GetMachine() == llvm::Triple::arm) {		if (arch.GetMachine() == llvm::Triple::arm) {
if (next_flags & 0x20) {		if (next_flags & 0x20) {
// Thumb mode		// Thumb mode
size_hint = 2;		size_hint = 2;
} else {		} else {
// Arm mode		// Arm mode
size_hint = 4;		size_hint = 4;
}		}
} else if (arch.IsMIPS() || arch.GetTriple().isPPC64())		} else if (arch.IsMIPS() || arch.GetTriple().isPPC64() ||
		arch.GetTriple().isRISCV())
size_hint = 4;		size_hint = 4;
error = process.SetBreakpoint(next_pc, size_hint, /*hardware=*/false);		error = process.SetBreakpoint(next_pc, size_hint, /*hardware=*/false);

// If setting the breakpoint fails because next_pc is out of the address		// If setting the breakpoint fails because next_pc is out of the address
// space, ignore it and let the debugee segfault.		// space, ignore it and let the debugee segfault.
if (error.GetError() == EIO || error.GetError() == EFAULT) {		if (error.GetError() == EIO || error.GetError() == EFAULT) {
return Status();		return Status();
} else if (error.Fail())		} else if (error.Fail())
return error;		return error;

m_threads_stepping_with_breakpoint.insert({thread.GetID(), next_pc});		m_threads_stepping_with_breakpoint.insert({thread.GetID(), next_pc});

return Status();		return Status();
}		}

lldb/tools/lldb-server/CMakeLists.txt

[...]	add_lldb_tool(lldb-server
LINK_LIBS		LINK_LIBS
lldbHost		lldbHost
lldbInitialization		lldbInitialization
lldbVersion		lldbVersion
${LLDB_PLUGINS}		${LLDB_PLUGINS}
lldbPluginInstructionARM		lldbPluginInstructionARM
lldbPluginInstructionMIPS		lldbPluginInstructionMIPS
lldbPluginInstructionMIPS64		lldbPluginInstructionMIPS64
		lldbPluginInstructionRISCV
${LLDB_SYSTEM_LIBS}		${LLDB_SYSTEM_LIBS}

LINK_COMPONENTS		LINK_COMPONENTS
Option		Option
Support		Support
)		)

add_dependencies(lldb-server		add_dependencies(lldb-server
LLGSOptionsTableGen		LLGSOptionsTableGen
${tablegen_deps}		${tablegen_deps}
)		)
target_include_directories(lldb-server PRIVATE "${LLDB_SOURCE_DIR}/source")		target_include_directories(lldb-server PRIVATE "${LLDB_SOURCE_DIR}/source")
target_link_libraries(lldb-server PRIVATE ${LLDB_SYSTEM_LIBS})		target_link_libraries(lldb-server PRIVATE ${LLDB_SYSTEM_LIBS})

lldb/tools/lldb-server/SystemInitializerLLGS.cpp

	[...]
	#endif			#endif

	#if defined(__mips__) || defined(mips) || defined(__mips) || \			#if defined(__mips__) || defined(mips) || defined(__mips) || \
	defined(__MIPS__) || defined(_M_MIPS) || defined(LLDB_TARGET_MIPS64)			defined(__MIPS__) || defined(_M_MIPS) || defined(LLDB_TARGET_MIPS64)
	#define LLDB_TARGET_MIPS			#define LLDB_TARGET_MIPS
	#include "Plugins/Instruction/MIPS/EmulateInstructionMIPS.h"			#include "Plugins/Instruction/MIPS/EmulateInstructionMIPS.h"
	#endif			#endif

				#if defined(__riscv)
				#define LLDB_TARGET_RISCV
				#include "Plugins/Instruction/RISCV/EmulateInstructionRISCV.h"
				#endif

	using namespace lldb_private;			using namespace lldb_private;

	llvm::Error SystemInitializerLLGS::Initialize() {			llvm::Error SystemInitializerLLGS::Initialize() {
	if (auto e = SystemInitializerCommon::Initialize())			if (auto e = SystemInitializerCommon::Initialize())
	return e;			return e;

	HostObjectFile::Initialize();			HostObjectFile::Initialize();

	#if defined(LLDB_TARGET_ARM) || defined(LLDB_TARGET_ARM64)			#if defined(LLDB_TARGET_ARM) || defined(LLDB_TARGET_ARM64)
	EmulateInstructionARM::Initialize();			EmulateInstructionARM::Initialize();
	#endif			#endif
	#if defined(LLDB_TARGET_MIPS) || defined(LLDB_TARGET_MIPS64)			#if defined(LLDB_TARGET_MIPS) || defined(LLDB_TARGET_MIPS64)
	EmulateInstructionMIPS::Initialize();			EmulateInstructionMIPS::Initialize();
	#endif			#endif
	#if defined(LLDB_TARGET_MIPS64)			#if defined(LLDB_TARGET_MIPS64)
	EmulateInstructionMIPS64::Initialize();			EmulateInstructionMIPS64::Initialize();
	#endif			#endif
				#if defined(LLDB_TARGET_RISCV)
				EmulateInstructionRISCV::Initialize();
				#endif

	return llvm::Error::success();			return llvm::Error::success();
	}			}

	void SystemInitializerLLGS::Terminate() {			void SystemInitializerLLGS::Terminate() {
	HostObjectFile::Terminate();			HostObjectFile::Terminate();

	#if defined(LLDB_TARGET_ARM) || defined(LLDB_TARGET_ARM64)			#if defined(LLDB_TARGET_ARM) || defined(LLDB_TARGET_ARM64)
	EmulateInstructionARM::Terminate();			EmulateInstructionARM::Terminate();
	#endif			#endif
	#if defined(LLDB_TARGET_MIPS) || defined(LLDB_TARGET_MIPS64)			#if defined(LLDB_TARGET_MIPS) || defined(LLDB_TARGET_MIPS64)
	EmulateInstructionMIPS::Terminate();			EmulateInstructionMIPS::Terminate();
	#endif			#endif
	#if defined(LLDB_TARGET_MIPS64)			#if defined(LLDB_TARGET_MIPS64)
	EmulateInstructionMIPS64::Terminate();			EmulateInstructionMIPS64::Terminate();
	#endif			#endif
				#if defined(LLDB_TARGET_RISCV)
				EmulateInstructionRISCV::Terminate();
				#endif

	SystemInitializerCommon::Terminate();			SystemInitializerCommon::Terminate();
	}			}

lldb/unittests/Instruction/ARM64/TestAArch64Emulator.cpp

This file was moved from lldb/unittests/Instruction/TestAArch64Emulator.cpp.

The contents of this file were not changed.

lldb/unittests/Instruction/CMakeLists.txt

				set(FILES "")
				set(DEPS "")

	if("ARM" IN_LIST LLVM_TARGETS_TO_BUILD)			if("ARM" IN_LIST LLVM_TARGETS_TO_BUILD)
				DavidSpickett: Also for AArch64 this should be "AARCH64" or "ARM64" (I think it's the former). In this context ARM == 32 bit Arm and AArch64 == 64 bit Arm (where elsewhere Arm means both, and I agree, it's confusing).
				Emmmer (Author): This code confused me a lot and thank you for the explanation. I am going to check ARM and AARCH64 explicitly.
				list(APPEND FILES
				ARM64/TestAArch64Emulator.cpp
				)
				list(APPEND DEPS lldbPluginInstructionARM64)
				endif()

				if("RISCV" IN_LIST LLVM_TARGETS_TO_BUILD)
				list(APPEND FILES
				RISCV/TestRISCVEmulator.cpp
				)
				list(APPEND DEPS lldbPluginInstructionRISCV)
				endif()
				DavidSpickett: Why do we need this conditional logic here? Surely unittests don't need a riscv or arm64 host and shouldn't clash with each other, or did you find some issue including both?
				Emmmer (Author): I don't know either. I see the original code checking TARGETS_TO_BUILD and there might be special consideration I suppose?
				DavidSpickett: Sorry, I completely missed that first line (the diff split at that point). Right, so you're just adding the riscv equivalent of what's already there. So actually just leave the ARM bit as is, I'll do some builds and find out whether it needs to change for AArch64.
				Emmmer (Author): Oh... thank you! I am going to revert the change for AArch64.

				list(LENGTH FILES LISTLEN)

				if (LISTLEN GREATER 0)
	add_lldb_unittest(EmulatorTests			add_lldb_unittest(EmulatorTests
	TestAArch64Emulator.cpp			${FILES}
	LINK_LIBS			LINK_LIBS
	lldbCore			lldbCore
	lldbSymbol			lldbSymbol
	lldbTarget			lldbTarget
	lldbPluginInstructionARM64			${DEPS}
	LINK_COMPONENTS			LINK_COMPONENTS
	Support			Support
	${LLVM_TARGETS_TO_BUILD})			${LLVM_TARGETS_TO_BUILD})
	endif()			endif ()

lldb/unittests/Instruction/RISCV/TestRISCVEmulator.cpp

This file was added.

				//===-- TestRISCVEmulator.cpp ---------------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#include <unordered_map>
				DavidSpickett: This appears unused.
				Emmmer (Author): Nice catch!

				#include "gtest/gtest.h"

				#include "lldb/Core/Address.h"
				#include "lldb/Core/Disassembler.h"
				#include "lldb/Core/PluginManager.h"
				#include "lldb/Target/ExecutionContext.h"
				#include "lldb/Utility/ArchSpec.h"
				#include "lldb/Utility/RegisterValue.h"

				#include "Plugins/Instruction/RISCV/EmulateInstructionRISCV.h"
				#include "Plugins/Process/Utility/RegisterInfoPOSIX_riscv64.h"
				#include "Plugins/Process/Utility/lldb-riscv-register-enums.h"

				using namespace lldb;
				using namespace lldb_private;

				struct RISCVEmulatorTester : public EmulateInstructionRISCV, testing::Test {
				RegisterInfoPOSIX_riscv64::GPR gpr;

				RISCVEmulatorTester()
				: EmulateInstructionRISCV(ArchSpec("riscv64-unknown-linux-gnu")) {
				EmulateInstruction::SetReadRegCallback(ReadRegisterCallback);
				EmulateInstruction::SetWriteRegCallback(WriteRegisterCallback);
				}

				static bool ReadRegisterCallback(EmulateInstruction *instruction, void *baton,
				const RegisterInfo *reg_info,
				RegisterValue &reg_value) {
				RISCVEmulatorTester *tester = (RISCVEmulatorTester *)instruction;
				DavidSpickett: Please explain what this does. Just on the face of it, something a bit dodgy :) (maybe there is a better way to express it, even if it is legit)
				Emmmer (Author): It is effectively a dynamic_cast that always succeeds, because:
				- The `RISCVEmulatorTester` is derived from `EmulateInstruction`.
				- These callbacks are set in the constructor of `RISCVEmulatorTester`.
				- All `EmulateInstruction` instances created in this unittest are always `RISCVEmulatorTester`.
				The signature of these callbacks is not polymorphic (in Java, for example, we would have `<T extends EmulateInstruction>` with bounded constraints), and that's the reason why we need this cast. We may turn it into a template but it requires more effort. I also noticed that there's a `void *baton` in the parameters, but using that field looks more terrible. I guess we could have a `userdata` whose type was parametrized.
				DavidSpickett: > But using that field looks more terrible. I guess we could have a userdata whose type was parametrized.
				Yeah, there are a lot of... not ideal patterns in this area but I wouldn't pay much attention to them. Without opting into llvm's rtti equivalent I guess this is the best way to do it.
				uint32_t reg = reg_info->kinds[eRegisterKindLLDB];
				if (reg == gpr_x0_riscv) {
				reg_value.SetUInt(0, reg_info->byte_size);
				return true;
				}
				reg_value.SetUInt(tester->gpr.gpr[reg], reg_info->byte_size);
				DavidSpickett:
				```
				if (reg == gpr_x0_riscv)
				reg_value.SetUInt(0, reg_info->byte_size);
				else
				reg_value.SetUInt(tester->gpr.gpr[reg], reg_info->byte_size);
				return true;
				```
				return true;
				}

				static bool WriteRegisterCallback(EmulateInstruction *instruction,
				void *baton, const Context &context,
				const RegisterInfo *reg_info,
				const RegisterValue &reg_value) {
				RISCVEmulatorTester *tester = (RISCVEmulatorTester *)instruction;
				uint32_t reg = reg_info->kinds[eRegisterKindLLDB];
				if (reg == gpr_x0_riscv) {
				return true;
				}
				tester->gpr.gpr[reg] = reg_value.GetAsUInt64();
				return true;
				DavidSpickett:
				```
				if (reg != gpr_x0_riscv)
				tester->gpr.gpr[reg] = reg_value.GetAsUInt64();
				return true;
				```
				}
				};
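The cast discussed in the comments above follows a common pattern: a base class stores a plain function-pointer callback that receives a base pointer, and the derived class casts it back to itself. A self-contained miniature of that pattern, using hypothetical `ToyEmulator` and `ToyTester` types rather than the real LLDB classes:

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical base class holding a non-polymorphic callback.
struct ToyEmulator {
  using ReadCallback = bool (*)(ToyEmulator *emulator, uint32_t reg,
                                uint64_t &value);
  ReadCallback read_callback = nullptr;

  bool Read(uint32_t reg, uint64_t &value) {
    return read_callback && read_callback(this, reg, value);
  }
};

struct ToyTester : ToyEmulator {
  uint64_t gpr[32] = {};

  ToyTester() { read_callback = ReadRegister; }

  static bool ReadRegister(ToyEmulator *emulator, uint32_t reg,
                           uint64_t &value) {
    assert(emulator != nullptr);
    // Valid only because every object that installs this callback is a
    // ToyTester; static_cast performs no runtime check.
    auto *tester = static_cast<ToyTester *>(emulator);
    if (reg >= 32)
      return false;
    value = (reg == 0) ? 0 : tester->gpr[reg]; // x0 always reads as zero
    return true;
  }
};
```

Writing the cast as `static_cast` rather than a C-style cast makes the intent explicit, although it is still unchecked.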

				TEST_F(RISCVEmulatorTester, testJAL) {
				lldb::addr_t old_pc = 0x114514;
				WritePC(old_pc);
				// jal x1, -6*4
				uint32_t inst = 0b11111110100111111111000011101111;
				DecodeAndExecute(inst, false);
				DavidSpickett: Assert that this was successful.
				auto x1 = gpr.gpr[1];

				bool success = false;
				auto pc = ReadPC(&success);

				ASSERT_TRUE(success);
				ASSERT_EQ(x1, old_pc + 4);
				ASSERT_EQ(pc, old_pc + (-6 * 4));
				}
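The binary literal in `testJAL` is easier to check when the J-type immediate is decoded back out of it. The helper below is a sketch written for illustration (`DecodeJTypeImm` is a hypothetical name, not part of the patch); the field positions follow the J-type format of the RISC-V base ISA:

```cpp
#include <cassert>
#include <cstdint>

// Decode the signed J-type immediate. Field layout in the instruction word:
// imm[20] at bit 31, imm[10:1] at bits 30:21, imm[11] at bit 20,
// imm[19:12] at bits 19:12. imm[0] is implicitly zero.
inline int32_t DecodeJTypeImm(uint32_t inst) {
  uint32_t imm = ((inst >> 31) & 0x1) << 20 | ((inst >> 21) & 0x3ff) << 1 |
                 ((inst >> 20) & 0x1) << 11 | ((inst >> 12) & 0xff) << 12;
  // Sign-extend from bit 20.
  if (imm & (1u << 20))
    imm |= 0xffe00000u;
  return static_cast<int32_t>(imm);
}

// Usage: the word from testJAL, 0b11111110100111111111000011101111.
inline void DemoDecodeJal() {
  const uint32_t inst = 0xFE9FF0EF;
  assert((inst & 0x7f) == 0b1101111);  // JAL opcode
  assert(((inst >> 7) & 0x1f) == 1);   // rd = x1
  assert(DecodeJTypeImm(inst) == -24); // -6 * 4
}
```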

				constexpr uint32_t EncodeI(uint32_t opcode, uint32_t funct3, uint32_t rd,
				DavidSpickett: EncodeInstruction? I don't think we need to shorten the names quite this much.
				Emmmer (Author): The `I` stands for `I-Type`, a riscv instruction format. Like `EncodeB` below.
				DavidSpickett: Cool. It's mostly that EncodeB looks like the author got distracted half way through the name. But anyway I'm not familiar with the spec so if it matches that then great.
				Emmmer (Author): You're right, the name looks unfriendly to other developers. So I changed its name to `EncodeIType` (also for `EncodeBType`) to avoid confusion.
				uint32_t rs1, uint32_t imm) {
				return imm << 20 | rs1 << 15 | funct3 << 12 | rd << 7 | opcode;
				}

				constexpr uint32_t JALR(uint32_t rd, uint32_t rs1, int32_t offset) {
				return EncodeI(0b1100111, 0, rd, rs1, uint32_t(offset));
				}
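To illustrate why passing a negative offset through `uint32_t` works in the I-type encoder, here is a sketch that encodes and then decodes the 12-bit immediate. The names follow the `EncodeIType` rename mentioned in the review; `DecodeITypeImm` and the explicit mask are additions made for this example only:

```cpp
#include <cassert>
#include <cstdint>

// I-type layout: imm[11:0] at bits 31:20, rs1 at 19:15, funct3 at 14:12,
// rd at 11:7, opcode at 6:0.
inline uint32_t EncodeIType(uint32_t opcode, uint32_t funct3, uint32_t rd,
                            uint32_t rs1, int32_t imm) {
  assert(imm >= -2048 && imm <= 2047); // must fit in 12 signed bits
  return (static_cast<uint32_t>(imm) & 0xfff) << 20 | rs1 << 15 |
         funct3 << 12 | rd << 7 | opcode;
}

// Recover the signed 12-bit immediate from an I-type instruction word.
inline int32_t DecodeITypeImm(uint32_t inst) {
  uint32_t imm = inst >> 20;
  if (imm & 0x800) // sign bit of the 12-bit field
    imm |= 0xfffff000u;
  return static_cast<int32_t>(imm);
}
```

For `jalr x1, -255(x2)` this produces the word `0xF01100E7`, and decoding the immediate returns -255 again.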

				TEST_F(RISCVEmulatorTester, testJALR) {
				lldb::addr_t old_pc = 0x114514;
				lldb::addr_t old_x2 = 0x1024;
				WritePC(old_pc);
				gpr.gpr[2] = old_x2;
				// jalr x1, x2(-255)
				uint32_t inst = JALR(1, 2, -255);
				DecodeAndExecute(inst, false);
				auto x1 = gpr.gpr[1];

				bool success = false;
				auto pc = ReadPC(&success);
				DavidSpickett: Add a comment like "JALR will always zero the bottom bit of the target".

				ASSERT_TRUE(success);
				ASSERT_EQ(x1, old_pc + 4);
				ASSERT_EQ(pc, (old_x2 + (-255)) & (~1));
				DavidSpickett: You check the `~1` here but your input values don't have the bottom bit set. Does that mean this test is not covering that aspect of the emulation? Seems like it should be.
				Emmmer (Author): `0x1024 - 255` = 3877 and `(0x1024 - 255) & ~1` = 3876, so I think it is covered 😃
				DavidSpickett: Doh, yes, I wasn't looking at the offset.
				}
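The arithmetic from the exchange above can be written out as a small function. This is a sketch with a hypothetical name (`JalrTarget`); it relies only on the rule that JALR adds the sign-extended offset to `rs1` and then clears bit 0 of the result:

```cpp
#include <cassert>
#include <cstdint>

// Target address of JALR: (rs1 + offset) with the least-significant bit
// cleared. Unsigned wraparound handles negative offsets.
inline uint64_t JalrTarget(uint64_t rs1_value, int64_t offset) {
  return (rs1_value + static_cast<uint64_t>(offset)) & ~uint64_t{1};
}

// Usage with the values from testJALR: 0x1024 - 255 = 3877, which is odd,
// so clearing bit 0 yields 3876.
inline void DemoJalrTarget() {
  assert(JalrTarget(0x1024, -255) == 3876);
}
```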

				constexpr uint32_t EncodeB(uint32_t opcode, uint32_t funct3, uint32_t rs1,
				DavidSpickett: EncodeBranch
				Emmmer (Author): The `B` stands for `B-Type`, which works for all B-type instructions. Branch instructions are all of B-type so renaming it to `EncodeBranch` seems okay right now, but we may add more B-type instructions in the future.
				uint32_t rs2, uint32_t imm) {
				uint32_t bimm = (imm & (0b1 << 11)) >> 4 | (imm & (0b11110)) << 7 |
				(imm & (0b111111 << 5)) << 20 | (imm & (0b1 << 12)) << 19;

				return rs2 << 20 | rs1 << 15 | funct3 << 12 | opcode | bimm;
				}

				constexpr uint32_t BEQ(uint32_t rs1, uint32_t rs2, int32_t offset) {
				return EncodeB(0b1100011, 0b000, rs1, rs2, uint32_t(offset));
				}

				constexpr uint32_t BNE(uint32_t rs1, uint32_t rs2, int32_t offset) {
				return EncodeB(0b1100011, 0b001, rs1, rs2, uint32_t(offset));
				}

				constexpr uint32_t BLT(uint32_t rs1, uint32_t rs2, int32_t offset) {
				return EncodeB(0b1100011, 0b100, rs1, rs2, uint32_t(offset));
				}

				constexpr uint32_t BGE(uint32_t rs1, uint32_t rs2, int32_t offset) {
				return EncodeB(0b1100011, 0b101, rs1, rs2, uint32_t(offset));
				}

				constexpr uint32_t BLTU(uint32_t rs1, uint32_t rs2, int32_t offset) {
				return EncodeB(0b1100011, 0b110, rs1, rs2, uint32_t(offset));
				}

				constexpr uint32_t BGEU(uint32_t rs1, uint32_t rs2, int32_t offset) {
				return EncodeB(0b1100011, 0b111, rs1, rs2, uint32_t(offset));
				}
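The B-type immediate is scattered across the instruction word, so a round trip is a convenient way to sanity-check the shifts used in the encoder above. The sketch below isolates the immediate part; `EncodeBTypeImm` and `DecodeBTypeImm` are hypothetical helpers written for this illustration:

```cpp
#include <cassert>
#include <cstdint>

// Scatter a branch offset into its B-type positions: imm[12] -> bit 31,
// imm[10:5] -> bits 30:25, imm[4:1] -> bits 11:8, imm[11] -> bit 7.
inline uint32_t EncodeBTypeImm(int32_t offset) {
  assert(offset % 2 == 0 && offset >= -4096 && offset <= 4094);
  uint32_t imm = static_cast<uint32_t>(offset);
  return (imm & (1u << 11)) >> 4 | (imm & 0b11110u) << 7 |
         (imm & (0b111111u << 5)) << 20 | (imm & (1u << 12)) << 19;
}

// Gather the fields back into a signed offset; imm[0] is implicitly zero.
inline int32_t DecodeBTypeImm(uint32_t inst) {
  uint32_t imm = ((inst >> 31) & 0x1) << 12 | ((inst >> 25) & 0x3f) << 5 |
                 ((inst >> 8) & 0xf) << 1 | ((inst >> 7) & 0x1) << 11;
  if (imm & (1u << 12)) // sign-extend from bit 12
    imm |= 0xffffe000u;
  return static_cast<int32_t>(imm);
}
```

For the offset -256 used by the branch tests, the scattered immediate is `0xF0000080`, and decoding it returns -256.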

				using EncoderB = uint32_t (*)(uint32_t rs1, uint32_t rs2, int32_t offset);

				void testB(RISCVEmulatorTester *tester, EncoderB encoder, bool branched,
				DavidSpickett: testBranch (you get the idea by now)
				Emmmer (Author): Yes, this should be `Branch` since it only tests branch instructions.
				uint64_t rs1, uint64_t rs2) {
				// prepare test registers
				lldb::addr_t old_pc = 0x114514;
				tester->WritePC(old_pc);
				tester->gpr.gpr[1] = rs1;
				tester->gpr.gpr[2] = rs2;
				// b<cmp> x1, x2, (-256)
				uint32_t inst = encoder(1, 2, -256);
				tester->DecodeAndExecute(inst, false);
				bool success = false;
				auto pc = tester->ReadPC(&success);
				ASSERT_TRUE(success);
				ASSERT_EQ(pc, old_pc + (branched ? (-256) : 0));
				}

				#define GEN_TEST_B(name, rs1, rs2_branched, rs2_continued) \
				TEST_F(RISCVEmulatorTester, test##name##Branched) { \
				testB(this, name, true, rs1, rs2_branched); \
				} \
				TEST_F(RISCVEmulatorTester, test##name##Continued) { \
				testB(this, name, false, rs1, rs2_continued); \
				}
				DavidSpickett: Now I understand what's going on. I thought that this somehow included non branch encodings when in fact what it's doing is saying for a branch equals we should only branch if the operands are equal.

				GEN_TEST_B(BEQ, 1, 1, 0)
				DavidSpickett: Add comments to explain what these numbers are. With the macro wrapper it's hard to tell what they mean and what this is actually checking.
				GEN_TEST_B(BNE, 1, 0, 1)

				GEN_TEST_B(BLT, -2, 1, -3)
				GEN_TEST_B(BGE, -2, -3, 1)

				GEN_TEST_B(BLTU, -2, -1, 1)
				GEN_TEST_B(BGEU, -2, 1, -1)
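The operands in the `GEN_TEST_B` lines are chosen so that the signed and unsigned comparisons disagree: -2 stored in a 64-bit register is a very large unsigned value. A sketch of the two "less than" conditions, with hypothetical helper names (`TakesBLT`, `TakesBLTU`) that are not part of the patch:

```cpp
#include <cassert>
#include <cstdint>

// BLT compares the register values as signed integers.
inline bool TakesBLT(uint64_t rs1, uint64_t rs2) {
  return static_cast<int64_t>(rs1) < static_cast<int64_t>(rs2);
}

// BLTU compares the same bits as unsigned integers.
inline bool TakesBLTU(uint64_t rs1, uint64_t rs2) { return rs1 < rs2; }

// Usage with the operands from GEN_TEST_B(BLT, -2, 1, -3) and
// GEN_TEST_B(BLTU, -2, -1, 1).
inline void DemoBranchConditions() {
  const uint64_t minus1 = static_cast<uint64_t>(-1);
  const uint64_t minus2 = static_cast<uint64_t>(-2);
  const uint64_t minus3 = static_cast<uint64_t>(-3);
  assert(TakesBLT(minus2, 1));       // -2 < 1: branch taken
  assert(!TakesBLT(minus2, minus3)); // -2 < -3 is false: falls through
  assert(TakesBLTU(minus2, minus1)); // 0xff..fe < 0xff..ff: branch taken
  assert(!TakesBLTU(minus2, 1));     // 0xff..fe < 1 is false: falls through
}
```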

lldb/unittests/Instruction/TestAArch64Emulator.cpp

This file was moved to lldb/unittests/Instruction/ARM64/TestAArch64Emulator.cpp.


[LLDB][RISCV] Make software single stepping work (Closed, Public)


Diff 452909

lldb/source/Plugins/Instruction/CMakeLists.txt

lldb/source/Plugins/Instruction/RISCV/CMakeLists.txt

lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.h

lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.cpp

lldb/source/Plugins/Process/Linux/NativeProcessLinux.cpp

lldb/source/Plugins/Process/Utility/NativeProcessSoftwareSingleStep.cpp

lldb/tools/lldb-server/CMakeLists.txt

lldb/tools/lldb-server/SystemInitializerLLGS.cpp

lldb/unittests/Instruction/ARM64/TestAArch64Emulator.cpp

lldb/unittests/Instruction/CMakeLists.txt

lldb/unittests/Instruction/RISCV/TestRISCVEmulator.cpp

lldb/unittests/Instruction/TestAArch64Emulator.cpp