This is an archive of the discontinued LLVM Phabricator instance.


Differential D156687

[lldb][AArch64] Add type marker to ReadAll/WriteALLRegisterValues data
ClosedPublic

Authored by DavidSpickett on Jul 31 2023, 6:28 AM.

Details

Reviewers

omjavaid

Commits

rG8f2ffb1cf31f: [lldb][AArch64] Add type marker to ReadAll/WriteALLRegisterValues data

Summary

While working on support for SME's ZA register, I found a circumstance
where restoring ZA after SVE, when the current SVE mode is non-streaming,
will kick the process back into FPSIMD mode. Meaning the SVE values that
you just wrote are now cut off at 128 bits.

The fix for that is to write ZA then SVE. Problem with that
is, the current ReadAll/WriteAll makes a lot of assumptions about the
saved data length.

This patch changes the format so there is a "type" written before
each data block. This tells WriteAllRegisterValues what it's looking at
without brittle checks on length, or assumptions about ordering.

If we want to change the order of restoration, all we now have to
do is change the order of saving.
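
As a rough illustration of the format described above (not the code from this patch), here is a minimal sketch of a buffer where every block is preceded by a type tag. `BlockType`, `Block`, `PayloadSize`, `SaveBlocks` and `RestoreOrder` are hypothetical names, and the fixed payload sizes are made up for the example.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical tags; the patch defines its own enum for this.
enum class BlockType : uint32_t { GPR, FPR, TLS };

struct Block {
  BlockType type;
  std::vector<uint8_t> data;
};

// Made-up fixed payload size per type, so the reader knows how far to advance.
static size_t PayloadSize(BlockType type) {
  switch (type) {
  case BlockType::GPR:
    return 8;
  case BlockType::FPR:
    return 4;
  case BlockType::TLS:
    return 2;
  }
  return 0;
}

// Save: emit each block as [type][payload], in the order given.
static std::vector<uint8_t> SaveBlocks(const std::vector<Block> &blocks) {
  std::vector<uint8_t> out;
  for (const Block &b : blocks) {
    assert(b.data.size() == PayloadSize(b.type));
    const uint32_t tag = static_cast<uint32_t>(b.type);
    const uint8_t *tag_bytes = reinterpret_cast<const uint8_t *>(&tag);
    out.insert(out.end(), tag_bytes, tag_bytes + sizeof(tag));
    out.insert(out.end(), b.data.begin(), b.data.end());
  }
  return out;
}

// Restore: the tag says what follows, so the reader makes no assumption
// about ordering. It simply visits blocks in the order they were saved.
static std::vector<BlockType> RestoreOrder(const std::vector<uint8_t> &saved) {
  std::vector<BlockType> order;
  size_t pos = 0;
  while (pos + sizeof(uint32_t) <= saved.size()) {
    uint32_t tag;
    std::memcpy(&tag, saved.data() + pos, sizeof(tag));
    pos += sizeof(tag);
    const BlockType type = static_cast<BlockType>(tag);
    order.push_back(type);
    pos += PayloadSize(type); // A real restore would write the payload here.
  }
  return order;
}
```

Saving TLS before GPR makes `RestoreOrder` report TLS before GPR, with no change to the reading side.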

This exposes a bug where the TLS registers are not restored.
This will be fixed by https://reviews.llvm.org/D156512 in some form,
depending on what lands first.

Existing SVE tests certainly check restoration and when I got this
wrong, many, many tests failed. So I think we have enough coverage
already, and more will be coming with future ZA changes.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

DavidSpickett created this revision. Jul 31 2023, 6:28 AM

Herald added a project: Restricted Project. Jul 31 2023, 6:28 AM

Herald added subscribers: ctetreau, kristof.beyls.

DavidSpickett requested review of this revision. Jul 31 2023, 6:28 AM

Herald added a project: Restricted Project. Jul 31 2023, 6:28 AM

Herald added a subscriber: lldb-commits.

Even if this turns out not to be needed for ZA handling, I think WriteAllRegisterValues not having to care about the layout is an improvement in itself.

Plus, this will give us a warning if we forget to save/restore new registers in future. It would have told me I had not done TLS.

Herald added a subscriber: JDevlieghere. Jul 31 2023, 6:30 AM

Harbormaster completed remote builds in B249196: Diff 545628. Jul 31 2023, 6:32 AM

And an important note, you don't need the size for each block because the scalable blocks have headers from ptrace embedded in them anyway (SVE/SSVE/ZA).
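
To illustrate that point, here is a minimal sketch of stepping over a block using only a size stored in its own header. `BlockHeader` and `SkipScalableBlock` are hypothetical stand-ins, not the real ptrace structures or anything from this patch; the only assumption carried over is that the header records the total size of the block.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>

// Hypothetical header, standing in for the ptrace headers mentioned above.
struct BlockHeader {
  uint32_t size; // Total size of header plus register data, in bytes.
  uint16_t vl;   // Vector length in bytes.
  uint16_t flags;
};

// Return a pointer just past the scalable block that starts at src, using
// only the size recorded in its embedded header.
static const uint8_t *SkipScalableBlock(const uint8_t *src) {
  BlockHeader header;
  std::memcpy(&header, src, sizeof(header));
  assert(header.size >= sizeof(header));
  return src + header.size;
}
```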

I did remove some checks that seemed redundant because all data read by WriteAll is going to have been written by ReadAll, so they were unlikely to fail.

I have some failing tests on Graviton to debug:

Failed Tests (2):
  lldb-api :: commands/register/register/aarch64_sve_registers/rw_access_static_config/TestSVERegisters.py
  lldb-api :: tools/lldb-server/TestGdbRemoteRegisterState.py

But the idea still stands.

Correct m_sve_state size to fix tests on SVE machines.

Harbormaster completed remote builds in B249213: Diff 545652. Jul 31 2023, 7:38 AM

Remove misleading comment. We save the whole SVE context, FPSIMD or not.

Harbormaster completed remote builds in B249217: Diff 545657. Jul 31 2023, 7:47 AM

DavidSpickett edited the summary of this revision. Jul 31 2023, 8:53 AM

DavidSpickett retitled this revision from [LLDB][AArch64] Add kind marker to ReadAll/WriteALLRegisterValues data to [lldb][AArch64] Add kind marker to ReadAll/WriteALLRegisterValues data. Jul 31 2023, 9:02 AM

Matt added a subscriber: Matt. Jul 31 2023, 12:12 PM

DavidSpickett added a parent revision: D157000: [lldb][AArch64] Check SIMD save/restore in SVE SIMD test. Aug 3 2023, 6:50 AM

Use uint8_t for kind, fix some places doing sizeof uint32_t not of the kind type.

Put back memcopy of fpr registers, which is now tested by the parent patch.

Oh, and fixed the FIXME now that the TLS fixes have landed.

Harbormaster completed remote builds in B250050: Diff 546842. Aug 3 2023, 6:55 AM

DavidSpickett edited parent revisions, added: D157488: [lldb][AArch64] Add testing of save/restore for Linux MTE control register; removed: D157000: [lldb][AArch64] Check SIMD save/restore in SVE SIMD test. Aug 9 2023, 3:25 AM

Rebase

DavidSpickett added a child revision: D157845: [lldb][AArch64] Remove bool return from UpdateARM64SVERegistersInfos. Aug 14 2023, 2:21 AM

Harbormaster completed remote builds in B252283: Diff 549836. Aug 14 2023, 2:25 AM

I have a feeling that expressions are becoming an expensive operation given the amount of data we need to move back and forth between various buffers. Is there a way we can optimise this, maybe by writing registers directly from the source buffer?
Also, what is the minimum amount of data we need to move when SME is enabled?

I have a feeling that expressions are becoming an expensive operation given the amount of data we need to move back and forth between various buffers. Is there a way we can optimise this, maybe by writing registers directly from the source buffer?

I don't know about expensive literally but I did wonder if we could rely on the NativeRegisterContextLinux_arm64 buffers not being modified while the expression runs. Then do what I think you mean which is just flush them all to the process.

I'll need to work out exactly when SaveAll/WriteAll is and can be called. Part of me wonders why we even have such methods if you could just flush the existing buffers, but it wouldn't be the first time we've duplicated things.

Also, what is the minimum amount of data we need to move when SME is enabled?

At minimum the SVE registers and the ZA header and tpidr2. So ~ (45*16) + 16 + 16 + 8. If ZA is on then you've got another 16*16 on top of that.

But to be clear, the amount of data is not the issue this change aims to address, it's purely the ordering of the restoration.
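
For reference, the arithmetic in that estimate works out as follows. The figures are copied from the comment as given and are approximate. The `16*16` term for ZA presumably corresponds to the smallest vector length, 16 bytes, squared. The constant names are made up for this sketch.

```cpp
#include <cassert>
#include <cstddef>

// Figures taken at face value from the estimate above; they are approximate.
constexpr size_t kMinimumBytes = (45 * 16) + 16 + 16 + 8;
// Additional data when ZA is on, per the same comment.
constexpr size_t kExtraWithZA = 16 * 16;

static_assert(kMinimumBytes == 760, "720 + 16 + 16 + 8");
static_assert(kMinimumBytes + kExtraWithZA == 1016, "760 + 256");
```

So the saved state is on the order of 1 KB per save/restore in the smallest configuration.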

Is there a way we can optimise this, maybe by writing registers directly from the source buffer?

Though now I wonder if when you say source here you mean the member buffers in the register context, or literally the src pointer we use in the code.

I don't know about expensive literally but I did wonder if we could rely on the NativeRegisterContextLinux_arm64 buffers not being modified while the expression runs. Then do what I think you mean which is just flush them all to the process.

I'll need to work out exactly when SaveAll/WriteAll is and can be called. Part of me wonders why we even have such methods if you could just flush the existing buffers, but it wouldn't be the first time we've duplicated things.

I guess it looks like an attractive solution, but the process/register state between two consecutive stops cannot be determined, hence we are stuck with the old-fashioned way of save/restore.

At minimum the SVE registers and the ZA header and tpidr2. So ~ (45*16) + 16 + 16 + 8. If ZA is on then you've got another 16*16 on top of that.

But to be clear, the amount of data is not the issue this change aims to address, it's purely the ordering of the restoration.

Agreed, I am just curious whether we can save some cycles by avoiding copying first from the source to the cache and then writing to the registers, especially when we are going to invalidate the cache anyway after the write.

I understand now. If we take WriteTLS for example, that ends with m_tls_is_valid = false;. So you're suggesting instead we have a WriteTLS whose source is not the usual TLS buffer, but the set of saved register data instead. I'll look into it.

clang-format some missing bits.

Replace a missing memcpy in the FPR case. This only works because the buffer happens to
still contain the previous state. If there is some route to restore arbitrary states,
this would be broken, but I don't know how that might happen.

First thing to note is that WriteRegister also behaves this way, but there it is more appropriate because it updates only part of the buffer before writing it out in its entirety. Useful to know where the pattern came from though.

You would need roughly the following per WriteXYZ:

-      error = WriteTLS();
+      error = WriteTLS(src);

-Status NativeRegisterContextLinux_arm64::WriteTLS() {
+Status NativeRegisterContextLinux_arm64::WriteTLS(const void* src/*=nullptr*/) {

-  ioVec.iov_base = GetTLSBuffer();
+  ioVec.iov_base = src ? const_cast<void*>(src) : GetTLSBuffer();

-  Status WriteTLS();
+  Status WriteTLS(const void* src=nullptr);

We can assume that the buffer is the same as the data to be written back if it's something static like TLS. For SVE/SME, we would have resized our buffer first, so it holds there too.

The added complexity isn't that much but I think it adds more thinking time for a developer than it potentially saves in copying time. I already feel like the separate xyz_is_valid is enough to think about and having a potential second data source just adds to that load.
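
The shape of that idea, outside of lldb, is sketched below. `FakeContext` and its members are hypothetical stand-ins for the register context, its TLS buffer and the traced process; nothing here is lldb API. It only shows an optional source pointer falling back to the cached buffer, with the cache marked invalid after the write either way.

```cpp
#include <array>
#include <cassert>
#include <cstdint>
#include <cstring>

// Hypothetical stand-in for a register context with one cached register set.
class FakeContext {
public:
  std::array<uint8_t, 8> cache{};   // Plays the role of the TLS buffer.
  std::array<uint8_t, 8> process{}; // Plays the role of the traced process.
  bool cache_is_valid = false;

  // Write from src if one is given, otherwise from the cache. The cache is
  // not updated from src, so it is marked invalid in both cases.
  void Write(const void *src = nullptr) {
    const void *from = src ? src : cache.data();
    std::memcpy(process.data(), from, process.size());
    cache_is_valid = false;
  }
};
```

The second data source is the extra thing a reader has to keep in mind: after `Write(saved)` the cache and the process disagree until the next read.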

From the header docs it seems I was right that this is used primarily for expression evaluation:

// These two functions are used to implement "push" and "pop" of register
// states.  They are used primarily for expression evaluation, where we need
// to push a new state (storing the old one in data_sp) and then restoring
// the original state by passing the data_sp we got from ReadAllRegisters to
// WriteAllRegisterValues.

Which you would be doing a lot of in a formatter for example, but you'd get better savings implementing a more efficient packet format to do all that at once, I guess.

QSaveRegisterState / QRestoreRegisterState packets call it as part of expression evaluation, though in theory it's not always that. That's an lldb extension anyway so we're in control of it at least. In theory this could be used to restore state that is not just the previous state but I don't know how you'd trigger that from lldb.

The other use is NativeProcessLinux::Syscall which is sufficiently rare we can ignore that.

I did do a very rough benchmark where I printed the same expression 2000 times, so each one is doing a save/restore. Once with the code in this review right now, and again with this potential optimisation added to GPR/FPR/TLS (I'm on a Mountain Jade machine without SVE). Caveat shared machine, made up benchmark, etc. but all runs of both hovered between 16 and 17 seconds. Neither seemed to be consistently lower or higher than the other. Doesn't mean this isn't a speedup in isolation but if it is, it's dwarfed by the syscalls and packets sent back and forth.

I mostly agree with what you have above. I was only thinking about the ever-increasing size of this buffer and wondered whether we could find a way around the duplication. Most probably we'll never have SVE/SME enabled on a machine slow or small enough for this to bother us. The only area of concern could be an LLDB running in a resource-constrained container, but we can ignore that for now.

The only area of concern could be an LLDB running in a resource-constrained container, but we can ignore that for now.

Ironically we're the ones who are most likely to see that issue :)

omjavaid removed a parent revision: D157488: [lldb][AArch64] Add testing of save/restore for Linux MTE control register. Sep 1 2023, 3:12 AM

omjavaid added inline comments. Sep 1 2023, 4:05 AM

lldb/source/Plugins/Process/Linux/NativeRegisterContextLinux_arm64.cpp
583–584	I think we have lost some of the code readability of the ReadAllRegisterValues function by introducing the AddSavedRegisters mechanism. It does remove some duplication of memcpy, but it also introduces some differences between the Read/WriteAllRegisterValues functions. I am inclined to think that we should put the memcpy into ReadAllRegisterValues to make it look similar to its Write counterpart.

DavidSpickett added inline comments. Sep 1 2023, 4:50 AM

lldb/source/Plugins/Process/Linux/NativeRegisterContextLinux_arm64.cpp
583–584	I added it to avoid repeating:

Add kind whatever
::memcpy(dst, src, size);
dst = dst + size;

Or rather, not repeating it because it was very easy to forget to advance the pointer. The same applies to WriteAllRegisterValues, but there is some more logic in between the steps so I didn't do it there.

If memcpy returned a pointer one beyond the end of the buffer I wouldn't mind doing:

Add kind
dst = memcpy(...)

But it only returns the original destination pointer. So I'd rather not inline all the memcopies here but I could try harder to add a convenience function for WriteAllRegisterValues, so they more closely match.
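
A small standalone sketch of the point about memcpy's return value. `AppendBytes` is a hypothetical name; it follows the same pattern as the helper in the diff, copying and then returning the advanced pointer so the caller cannot forget the increment.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>

// Copy size bytes to dst and return the position just past them.
// std::memcpy itself returns dst, not dst + size.
static uint8_t *AppendBytes(uint8_t *dst, const void *src, size_t size) {
  std::memcpy(dst, src, size);
  return dst + size;
}
```

Each call then reads as `dst = AppendBytes(dst, src, size);`, with no separate pointer arithmetic at the call site.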

So this is almost the opposite of what you asked for, but I made WriteAllRegisterValues
similar to ReadAll by giving it its own convenient wrapper function.

Which ensures you don't forget to set a valid bool, or increment the src pointer.

Also I realised I can dedupe a lot of if (error.Fail()). SVE is a bit special due
to writing header then header+register, so it does the header manually.

So hopefully it is more readable in both, in the sense that ReadAll/WriteAll is almost declarative. The details are all in the helpers so you can't forget one of them.

Harbormaster completed remote builds in B256244: Diff 555347. Sep 1 2023, 6:01 AM

omjavaid added inline comments. Sep 1 2023, 7:26 AM

lldb/source/Plugins/Process/Linux/NativeRegisterContextLinux_arm64.cpp
531	Now that we are trying to make both Read and Write look similar, let's pull out the size calculation into a separate helper function.
587–588	Another point that just came to my mind is the kind terminology used here. In LLDB we differentiate registers into kinds, for example LLDB kinds and DWARF kinds. For differentiating between registers in this case, an appropriate term would be "register set". So let's rename this to SaveRegisterSet/RestoreRegisterSet.

omjavaid added inline comments. Sep 1 2023, 7:29 AM

lldb/source/Plugins/Process/Linux/NativeRegisterContextLinux_arm64.cpp
503	Consider renaming this to RegisterSetName or RegisterSetType

Kind -> Type

Pull out the caching and size calculation into its own function.

DavidSpickett marked 2 inline comments as done. Sep 4 2023, 9:32 AM

DavidSpickett retitled this revision from [lldb][AArch64] Add kind marker to ReadAll/WriteALLRegisterValues data to [lldb][AArch64] Add type marker to ReadAll/WriteALLRegisterValues data.

DavidSpickett edited the summary of this revision.

DavidSpickett removed a child revision: D157845: [lldb][AArch64] Remove bool return from UpdateARM64SVERegistersInfos. Sep 5 2023, 2:03 AM

DavidSpickett added a child revision: D157846: [lldb][AArch64] Add tests for SME's SVE register state to TestArm64DynamicRegsets.

omjavaid accepted this revision. Sep 10 2023, 7:05 PM

This revision is now accepted and ready to land. Sep 10 2023, 7:05 PM

This revision was landed with ongoing or failed builds. Sep 10 2023, 11:57 PM

Closed by commit rG8f2ffb1cf31f: [lldb][AArch64] Add type marker to ReadAll/WriteALLRegisterValues data (authored by DavidSpickett).

This revision was automatically updated to reflect the committed changes.

DavidSpickett added a commit: rG8f2ffb1cf31f: [lldb][AArch64] Add type marker to ReadAll/WriteALLRegisterValues data.

omjavaid removed a child revision: D157846: [lldb][AArch64] Add tests for SME's SVE register state to TestArm64DynamicRegsets. Sep 12 2023, 1:15 AM

Revision Contents

Path	Size
lldb/source/Plugins/Process/Linux/NativeRegisterContextLinux_arm64.h	2 lines
lldb/source/Plugins/Process/Linux/NativeRegisterContextLinux_arm64.cpp	223 lines

Diff 556395

lldb/source/Plugins/Process/Linux/NativeRegisterContextLinux_arm64.h

Show First 20 Lines • Show All 175 Lines • ▼ Show 20 Lines	private:

uint32_t CalculateFprOffset(const RegisterInfo *reg_info) const;		uint32_t CalculateFprOffset(const RegisterInfo *reg_info) const;

RegisterInfoPOSIX_arm64 &GetRegisterInfo() const;		RegisterInfoPOSIX_arm64 &GetRegisterInfo() const;

void ConfigureRegisterContext();		void ConfigureRegisterContext();

uint32_t CalculateSVEOffset(const RegisterInfo *reg_info) const;		uint32_t CalculateSVEOffset(const RegisterInfo *reg_info) const;

		Status CacheAllRegisters(uint32_t &cached_size);
};		};

} // namespace process_linux		} // namespace process_linux
} // namespace lldb_private		} // namespace lldb_private

#endif // #ifndef lldb_NativeRegisterContextLinux_arm64_h		#endif // #ifndef lldb_NativeRegisterContextLinux_arm64_h

#endif // defined (__arm64__) || defined (__aarch64__)		#endif // defined (__arm64__) || defined (__aarch64__)

lldb/source/Plugins/Process/Linux/NativeRegisterContextLinux_arm64.cpp

Show First 20 Lines • Show All 493 Lines • ▼ Show 20 Lines	if (IsGPR(reg)) {
::memcpy(dst, reg_value.GetBytes(), reg_info->byte_size);		::memcpy(dst, reg_value.GetBytes(), reg_info->byte_size);

return WriteTLS();		return WriteTLS();
}		}

return Status("Failed to write register value");		return Status("Failed to write register value");
}		}

Status NativeRegisterContextLinux_arm64::ReadAllRegisterValues(		enum RegisterSetType : uint32_t {
lldb::WritableDataBufferSP &data_sp) {		GPR,
// AArch64 register data must contain GPRs and either FPR or SVE registers.		SVE, // Used for SVE and SSVE.
// SVE registers can be non-streaming (aka SVE) or streaming (aka SSVE).		FPR, // When there is no SVE, or SVE in FPSIMD mode.
// Finally an optional MTE register. Pointer Authentication (PAC) registers		MTE,
// are read-only and will be skipped.		TLS,
		};
// In order to create register data checkpoint we first read all register
// values if not done already and calculate total size of register set data.		static uint8_t *AddRegisterSetType(uint8_t *dst,
// We store all register values in data_sp by copying full PTrace data that		RegisterSetType register_set_type) {
// corresponds to register sets enabled by current register context.		*(reinterpret_cast<uint32_t *>(dst)) = register_set_type;
		return dst + sizeof(uint32_t);
		}

		static uint8_t *AddSavedRegistersData(uint8_t *dst, void *src, size_t size) {
		::memcpy(dst, src, size);
		return dst + size;
		}

		static uint8_t *AddSavedRegisters(uint8_t *dst,
		enum RegisterSetType register_set_type,
		void *src, size_t size) {
		dst = AddRegisterSetType(dst, register_set_type);
		return AddSavedRegistersData(dst, src, size);
		}

		Status
		NativeRegisterContextLinux_arm64::CacheAllRegisters(uint32_t &cached_size) {
Status error;		Status error;
uint32_t reg_data_byte_size = GetGPRBufferSize();		cached_size = sizeof(RegisterSetType) + GetGPRBufferSize();
error = ReadGPR();		error = ReadGPR();
if (error.Fail())		if (error.Fail())
return error;		return error;

// If SVE is enabled we need not copy FPR separately.		// If SVE is enabled we need not copy FPR separately.
if (GetRegisterInfo().IsSVEEnabled() || GetRegisterInfo().IsSSVEEnabled()) {		if (GetRegisterInfo().IsSVEEnabled() || GetRegisterInfo().IsSSVEEnabled()) {
reg_data_byte_size += GetSVEBufferSize();		// Store mode and register data.
// Also store the current SVE mode.		cached_size +=
reg_data_byte_size += sizeof(uint32_t);		sizeof(RegisterSetType) + sizeof(m_sve_state) + GetSVEBufferSize();
error = ReadAllSVE();		error = ReadAllSVE();
} else {		} else {
reg_data_byte_size += GetFPRSize();		cached_size += sizeof(RegisterSetType) + GetFPRSize();
error = ReadFPR();		error = ReadFPR();
}		}
if (error.Fail())		if (error.Fail())
return error;		return error;

if (GetRegisterInfo().IsMTEEnabled()) {		if (GetRegisterInfo().IsMTEEnabled()) {
reg_data_byte_size += GetMTEControlSize();		cached_size += sizeof(RegisterSetType) + GetMTEControlSize();
error = ReadMTEControl();		error = ReadMTEControl();
if (error.Fail())		if (error.Fail())
return error;		return error;
}		}

// tpidr is always present but tpidr2 depends on SME.		// tpidr is always present but tpidr2 depends on SME.
reg_data_byte_size += GetTLSBufferSize();		cached_size += sizeof(RegisterSetType) + GetTLSBufferSize();
error = ReadTLS();		error = ReadTLS();

		return error;
		}

		Status NativeRegisterContextLinux_arm64::ReadAllRegisterValues(
		lldb::WritableDataBufferSP &data_sp) {
		// AArch64 register data must contain GPRs and either FPR or SVE registers.
		// SVE registers can be non-streaming (aka SVE) or streaming (aka SSVE).
		// Finally an optional MTE register. Pointer Authentication (PAC) registers
		// are read-only and will be skipped.

		// In order to create register data checkpoint we first read all register
		// values if not done already and calculate total size of register set data.
		// We store all register values in data_sp by copying full PTrace data that
		// corresponds to register sets enabled by current register context.

		uint32_t reg_data_byte_size = 0;
		Status error = CacheAllRegisters(reg_data_byte_size);
if (error.Fail())		if (error.Fail())
return error;		return error;

data_sp.reset(new DataBufferHeap(reg_data_byte_size, 0));		data_sp.reset(new DataBufferHeap(reg_data_byte_size, 0));
uint8_t *dst = data_sp->GetBytes();		uint8_t *dst = data_sp->GetBytes();

::memcpy(dst, GetGPRBuffer(), GetGPRBufferSize());		dst = AddSavedRegisters(dst, RegisterSetType::GPR, GetGPRBuffer(),
dst += GetGPRBufferSize();		GetGPRBufferSize());

if (GetRegisterInfo().IsSVEEnabled() || GetRegisterInfo().IsSSVEEnabled()) {		if (GetRegisterInfo().IsSVEEnabled() || GetRegisterInfo().IsSSVEEnabled()) {
*dst = static_cast<uint8_t>(m_sve_state);		dst = AddRegisterSetType(dst, RegisterSetType::SVE);
		*(reinterpret_cast<SVEState *>(dst)) = m_sve_state;
dst += sizeof(m_sve_state);		dst += sizeof(m_sve_state);
::memcpy(dst, GetSVEBuffer(), GetSVEBufferSize());		dst = AddSavedRegistersData(dst, GetSVEBuffer(), GetSVEBufferSize());
dst += GetSVEBufferSize();
} else {		} else {
::memcpy(dst, GetFPRBuffer(), GetFPRSize());		dst = AddSavedRegisters(dst, RegisterSetType::FPR, GetFPRBuffer(),
dst += GetFPRSize();		GetFPRSize());
}		}

if (GetRegisterInfo().IsMTEEnabled()) {		if (GetRegisterInfo().IsMTEEnabled()) {
::memcpy(dst, GetMTEControl(), GetMTEControlSize());		dst = AddSavedRegisters(dst, RegisterSetType::MTE, GetMTEControl(),
dst += GetMTEControlSize();		GetMTEControlSize());
}		}

::memcpy(dst, GetTLSBuffer(), GetTLSBufferSize());		dst = AddSavedRegisters(dst, RegisterSetType::TLS, GetTLSBuffer(),
		GetTLSBufferSize());

return error;		return error;
}		}

		static Status RestoreRegisters(void *buffer, const uint8_t **src, size_t len,
		bool &is_valid, std::function<Status()> writer) {
		::memcpy(buffer, *src, len);
		is_valid = true;
		*src += len;
		return writer();
		}

Status NativeRegisterContextLinux_arm64::WriteAllRegisterValues(		Status NativeRegisterContextLinux_arm64::WriteAllRegisterValues(
const lldb::DataBufferSP &data_sp) {		const lldb::DataBufferSP &data_sp) {
// AArch64 register data must contain GPRs, either FPR or SVE registers		// AArch64 register data must contain GPRs, either FPR or SVE registers
// (which can be streaming or non-streaming) and optional MTE register.		// (which can be streaming or non-streaming) and optional MTE register.
// Pointer Authentication (PAC) registers are read-only and will be skipped.		// Pointer Authentication (PAC) registers are read-only and will be skipped.

// We store all register values in data_sp by copying full PTrace data that		// We store all register values in data_sp by copying full PTrace data that
// corresponds to register sets enabled by current register context. In order		// corresponds to register sets enabled by current register context. In order
Show All 14 Lines	Status NativeRegisterContextLinux_arm64::WriteAllRegisterValues(
if (src == nullptr) {		if (src == nullptr) {
error.SetErrorStringWithFormat("NativeRegisterContextLinux_arm64::%s "		error.SetErrorStringWithFormat("NativeRegisterContextLinux_arm64::%s "
"DataBuffer::GetBytes() returned a null "		"DataBuffer::GetBytes() returned a null "
"pointer",		"pointer",
__FUNCTION__);		__FUNCTION__);
return error;		return error;
}		}

uint64_t reg_data_min_size = GetGPRBufferSize() + GetFPRSize();		uint64_t reg_data_min_size =
		GetGPRBufferSize() + GetFPRSize() + 2 * (sizeof(RegisterSetType));
if (data_sp->GetByteSize() < reg_data_min_size) {		if (data_sp->GetByteSize() < reg_data_min_size) {
error.SetErrorStringWithFormat(		error.SetErrorStringWithFormat(
"NativeRegisterContextLinux_arm64::%s data_sp contained insufficient "		"NativeRegisterContextLinux_arm64::%s data_sp contained insufficient "
"register data bytes, expected at least %" PRIu64 ", actual %" PRIu64,		"register data bytes, expected at least %" PRIu64 ", actual %" PRIu64,
__FUNCTION__, reg_data_min_size, data_sp->GetByteSize());		__FUNCTION__, reg_data_min_size, data_sp->GetByteSize());
return error;		return error;
}		}

// Register data starts with GPRs		const uint8_t *end = src + data_sp->GetByteSize();
::memcpy(GetGPRBuffer(), src, GetGPRBufferSize());		while (src < end) {
m_gpr_is_valid = true;		const RegisterSetType kind =
		*reinterpret_cast<const RegisterSetType *>(src);
error = WriteGPR();		src += sizeof(RegisterSetType);
if (error.Fail())
return error;		switch (kind) {
		case RegisterSetType::GPR:
src += GetGPRBufferSize();		error = RestoreRegisters(
		GetGPRBuffer(), &src, GetGPRBufferSize(), m_gpr_is_valid,
// Verify if register data may contain SVE register values.		std::bind(&NativeRegisterContextLinux_arm64::WriteGPR, this));
bool contains_sve_reg_data =		break;
(data_sp->GetByteSize() > (reg_data_min_size + GetSVEHeaderSize()));		case RegisterSetType::SVE:

if (contains_sve_reg_data) {
// Restore to the correct mode, streaming or not.		// Restore to the correct mode, streaming or not.
m_sve_state = static_cast<SVEState>(*src);		m_sve_state = static_cast<SVEState>(*src);
src += sizeof(m_sve_state);		src += sizeof(m_sve_state);

// We have SVE register data first write SVE header.		// First write SVE header. We do not use RestoreRegisters because we do
		// not want src to be modified yet.
::memcpy(GetSVEHeader(), src, GetSVEHeaderSize());		::memcpy(GetSVEHeader(), src, GetSVEHeaderSize());
if (!sve::vl_valid(m_sve_header.vl)) {		if (!sve::vl_valid(m_sve_header.vl)) {
m_sve_header_is_valid = false;		m_sve_header_is_valid = false;
error.SetErrorStringWithFormat("NativeRegisterContextLinux_arm64::%s "		error.SetErrorStringWithFormat("NativeRegisterContextLinux_arm64::%s "
"Invalid SVE header in data_sp",		"Invalid SVE header in data_sp",
__FUNCTION__);		__FUNCTION__);
return error;		return error;
}		}
m_sve_header_is_valid = true;		m_sve_header_is_valid = true;
error = WriteSVEHeader();		error = WriteSVEHeader();
if (error.Fail())		if (error.Fail())
return error;		return error;

// SVE header has been written configure SVE vector length if needed.		// SVE header has been written configure SVE vector length if needed.
ConfigureRegisterContext();		ConfigureRegisterContext();

// Make sure data_sp contains sufficient data to write all SVE registers.		// Write header and register data, incrementing src this time.
reg_data_min_size = GetGPRBufferSize() + GetSVEBufferSize();		error = RestoreRegisters(
if (data_sp->GetByteSize() < reg_data_min_size) {		GetSVEBuffer(), &src, GetSVEBufferSize(), m_sve_buffer_is_valid,
error.SetErrorStringWithFormat(		std::bind(&NativeRegisterContextLinux_arm64::WriteAllSVE, this));
"NativeRegisterContextLinux_arm64::%s data_sp contained insufficient "		break;
"register data bytes, expected %" PRIu64 ", actual %" PRIu64,		case RegisterSetType::FPR:
__FUNCTION__, reg_data_min_size, data_sp->GetByteSize());		error = RestoreRegisters(
return error;		GetFPRBuffer(), &src, GetFPRSize(), m_fpu_is_valid,
}		std::bind(&NativeRegisterContextLinux_arm64::WriteFPR, this));
		break;
::memcpy(GetSVEBuffer(), src, GetSVEBufferSize());		case RegisterSetType::MTE:
m_sve_buffer_is_valid = true;		error = RestoreRegisters(
error = WriteAllSVE();		GetMTEControl(), &src, GetMTEControlSize(), m_mte_ctrl_is_valid,
src += GetSVEBufferSize();		std::bind(&NativeRegisterContextLinux_arm64::WriteMTEControl, this));
} else {		break;
::memcpy(GetFPRBuffer(), src, GetFPRSize());		case RegisterSetType::TLS:
m_fpu_is_valid = true;		error = RestoreRegisters(
error = WriteFPR();		GetTLSBuffer(), &src, GetTLSBufferSize(), m_tls_is_valid,
src += GetFPRSize();		std::bind(&NativeRegisterContextLinux_arm64::WriteTLS, this));
		break;
}		}

if (error.Fail())		if (error.Fail())
return error;		return error;

if (GetRegisterInfo().IsMTEEnabled() &&
data_sp->GetByteSize() > reg_data_min_size) {
::memcpy(GetMTEControl(), src, GetMTEControlSize());
m_mte_ctrl_is_valid = true;
error = WriteMTEControl();
if (error.Fail())
return error;
src += GetMTEControlSize();
}		}

// There is always a TLS set. It changes size based on system properties, it's
// not something an expression can change.
::memcpy(GetTLSBuffer(), src, GetTLSBufferSize());
m_tls_is_valid = true;
error = WriteTLS();

return error;		return error;
}		}

bool NativeRegisterContextLinux_arm64::IsGPR(unsigned reg) const {		bool NativeRegisterContextLinux_arm64::IsGPR(unsigned reg) const {
if (GetRegisterInfo().GetRegisterSetFromRegisterIndex(reg) ==		if (GetRegisterInfo().GetRegisterSetFromRegisterIndex(reg) ==
RegisterInfoPOSIX_arm64::GPRegSet)		RegisterInfoPOSIX_arm64::GPRegSet)
return true;		return true;
return false;		return false;