This is an archive of the discontinued LLVM Phabricator instance.

[intelpt] Refactoring instruction decoding for flexibility
ClosedPublic

Authored by zrthxn on Mar 23 2022, 12:18 AM.

Details

Summary

Refactored (but not yet tested) version of the instruction decoding code

Diff Detail

Event Timeline

zrthxn created this revision.Mar 23 2022, 12:18 AM
Herald added a project: Restricted Project. · View Herald TranscriptMar 23 2022, 12:19 AM
zrthxn requested review of this revision.Mar 23 2022, 12:19 AM
wallace added inline comments.Mar 23 2022, 11:16 AM
lldb/source/Plugins/Trace/intel-pt/DecodedThread.h
181

you need to have something like

std::unordered_map<uint64_t, llvm::Error> m_errors;

that way, you'll be able to quickly look up the error associated with an instruction index. The IntelPTInstruction in this case, instead of storing the Error, can just store one bit of information: has_error = true/false.
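
For reference, a minimal sketch of that layout (member names are hypothetical, and the map type switches to llvm::DenseMap later in this review):

#include <cstdint>
#include <unordered_map>
#include <vector>
#include "llvm/Support/Error.h"

class IntelPTInstruction {
  // ...
  bool m_has_error = false; // full error details live in DecodedThread::m_errors
};

class DecodedThread {
  // ...
  std::vector<IntelPTInstruction> m_instructions;
  // Error keyed by the index of the corresponding entry in m_instructions.
  std::unordered_map<uint64_t, llvm::Error> m_errors;
};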

zrthxn updated this revision to Diff 417709.Mar 23 2022, 12:19 PM

Introduced unordered map for errors in DecodedThread

jj10306 requested changes to this revision.Mar 23 2022, 12:35 PM
jj10306 added inline comments.
lldb/source/Plugins/Trace/intel-pt/DecodedThread.h
181

nit: from https://llvm.org/docs/CodingStandards.html#c-standard-library
prefer llvm::DenseMap over std::map unless there's a specific reason not to.

Don't forget to update CalculateApproximateMemoryUsage() as well! Also, besides being in line with the coding standards I linked above, using llvm::DenseMap here has the practical advantage that it exposes its approximate size via getMemorySize(), whereas there is no easy way to get the size of std::map.
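
A sketch of how the size accounting could use that (member names are hypothetical; a sketch only, not the actual patch):

size_t DecodedThread::CalculateApproximateMemoryUsage() const {
  // m_errors is an llvm::DenseMap keyed by instruction index.
  return sizeof(IntelPTInstruction) * m_instructions.size() +
         m_errors.getMemorySize();
}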

lldb/source/Plugins/Trace/intel-pt/IntelPTDecoder.cpp
114
This revision now requires changes to proceed.Mar 23 2022, 12:35 PM
zrthxn added inline comments.Mar 23 2022, 12:50 PM
lldb/source/Plugins/Trace/intel-pt/DecodedThread.h
181

DenseMap looks interesting; we should try that.

Yes, I will update the memory calculation and make it more accurate, so some refactoring will be needed. That'll be the next small patch once this works.

jj10306 added inline comments.Mar 23 2022, 12:56 PM
lldb/source/Plugins/Trace/intel-pt/DecodedThread.h
147

Return a reference here to avoid a potentially expensive copy when returning.

Something else to consider is whether we need/want an API exposing the entire error map, or whether something like:
llvm::Error GetErrorByInstructionIndex(uint64_t insn_index) const
that allows the caller to specify the key into the map would make more sense?
This also has the advantage that it hides the implementation detail of what data type is being used under the hood to represent the error map!
@wallace @zrthxn wdyt?

zrthxn added inline comments.Mar 23 2022, 1:00 PM
lldb/source/Plugins/Trace/intel-pt/DecodedThread.h
147

Yeah, I did think about that. In my opinion, returning a reference would be good, since with that you can get the size and the error at each index, and we don't need a function for each operation, which would have long, clunky names that make the code less readable...

wallace requested changes to this revision.Mar 23 2022, 5:15 PM
wallace added inline comments.
lldb/source/Plugins/Trace/intel-pt/DecodedThread.cpp
102–104

remove this

lldb/source/Plugins/Trace/intel-pt/DecodedThread.h
147

ahh!! DenseMap is for sure the way to go. The getMemorySize() method will help us a good deal. Thanks @jj10306

Besides that, we shouldn't publicly expose the internal representation of an object whose data structure might change. For example, sooner rather than later we might prefer to store the errors in a different way. So, to avoid breaking callers if we change this data structure, let's go with what Jakob proposes:

llvm::Error GetErrorByInstructionIndex(uint64_t insn_index) const

which returns Error::success() in case the instruction is not an actual error
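
On the caller side, that contract would look roughly like this (a sketch only; variable names are made up, and note that the returned llvm::Error must still be checked or consumed):

llvm::Error err = decoded_thread.GetErrorByInstructionIndex(insn_index);
if (err) {
  // The entry at insn_index is an error; handling or consuming it is mandatory.
  llvm::consumeError(std::move(err));
} else {
  // Error::success() was returned; the bool check above marks it as handled.
}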

149

Omit the thread, because you are appending to a DecodedThread. In any case, it's redundant to mention where you are appending.

152

you should be able to omit the insn_index by using m_instructions.size() + m_errors.size() as the index

lldb/source/Plugins/Trace/intel-pt/IntelPTDecoder.cpp
103–105

update the documentation

107

don't call it thread, call it decoded_thread. The thread is the one passed as a parameter

107

use make_shared here and return a DecodedThreadSP (alias for shared_ptr<DecodedThread>). That way, you'll avoid having to invoke shared_from_this() below
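
A minimal sketch of that shape (the function name and signature here are hypothetical):

DecodedThreadSP DecodeToDecodedThread(Thread &thread) {
  DecodedThreadSP decoded_thread_sp = std::make_shared<DecodedThread>();
  // ... decode the trace, appending instructions and errors to *decoded_thread_sp ...
  return decoded_thread_sp; // callers share ownership; no shared_from_this() needed
}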

119

remove this

145

This will create a copy of the IntelPTInstruction before storing it in the vector. Instead, you should use the same semantics as vector::emplace_back(), which uses parameter packs/variadic templates. You can even rename Append to Emplace in this case

175–207

see below. We don't need to pass the raw_buffer size because it can be obtained from the buffer itself. Also, return the DecodedThreadSP directly

211–224

Now we don't need to return the raw_trace_size as an out parameter because we are creating the decoded thread deeper in the stack.

214–215

we don't need raw_trace_size to be an out parameter anymore. We can directly use it deep inside our decoding logic

249

remove raw_trace_size as an out parameter

255–256

the trace_size doesn't need to be passed

zrthxn marked 8 inline comments as done.Mar 23 2022, 11:07 PM
zrthxn added inline comments.
lldb/source/Plugins/Trace/intel-pt/IntelPTDecoder.cpp
145

Yeah, I was doing that before; the idea was to send those variadic args to emplace_back, but that wasn't working. I introduced this to avoid having two Appends, since we already have two constructors which fulfill that requirement, and I can change this to std::move to avoid copies if that's a concern

wallace added inline comments.Mar 23 2022, 11:08 PM
lldb/source/Plugins/Trace/intel-pt/IntelPTDecoder.cpp
145

Yes, use the same pattern that emplace_back uses with a parameter pack
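
For reference, the emplace_back-style pattern being suggested looks roughly like this (names are illustrative, not the actual patch):

// In DecodedThread.h: construct the IntelPTInstruction in place,
// dispatching to whichever constructor matches the arguments.
template <typename... Ts> void AppendInstruction(Ts &&... args) {
  m_instructions.emplace_back(args...);
}

// Call sites:
decoded_thread.AppendInstruction(insn);            // uses IntelPTInstruction(insn)
decoded_thread.AppendInstruction(insn, timestamp); // uses IntelPTInstruction(insn, timestamp)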

zrthxn updated this revision to Diff 417832.Mar 23 2022, 11:11 PM

A few changes to remove redundant things

wallace added inline comments.Mar 24 2022, 8:30 AM
lldb/source/Plugins/Trace/intel-pt/DecodedThread.cpp
102–104

Errors can only be moved, not copied; that's why we need to create a new instance of the error that carries the same information as the original one. We can draw inspiration from IntelPTInstruction::ToError(), which can now be deleted

lldb/source/Plugins/Trace/intel-pt/DecodedThread.h
143–146

we can make the documentation clearer

150

this has to use the new parameter pack semantics, so that you can pass either {pt_insn} or {pt_insn, timestamp} without having to create copies of IntelPTInstruction objects

152
lldb/source/Plugins/Trace/intel-pt/IntelPTDecoder.cpp
107

make this a shared pointer from the beginning. Use make_shared here

wallace added inline comments.Mar 24 2022, 8:30 AM
lldb/source/Plugins/Trace/intel-pt/IntelPTDecoder.cpp
145

ideally you should be able to do decoded_thread.AppendInstruction({insn}) here

wallace added inline comments.Mar 24 2022, 8:33 AM
lldb/source/Plugins/Trace/intel-pt/IntelPTDecoder.cpp
211

don't use Expected. The DecodedThread object can already store errors. Just create a DecodedThreadSP, assign it a single error, and return it. If you need to return the failed DecodedThread before you know the size of the buffer, just pass 0 as the size

214

remove the *, because we will make DecodeLiveThread not return an Expected

zrthxn updated this revision to Diff 417969.Mar 24 2022, 10:10 AM

Error getter method

wallace requested changes to this revision.Mar 24 2022, 10:15 AM

there are many comments from the previous versions of this diff that you didn't apply. Go through all of them first :)

lldb/source/Plugins/Trace/intel-pt/DecodedThread.h
85–87

delete it. We don't want to leave old code as comments

117

now that you changed this, could you share in the description of this diff the difference in byte size between the old and new code when tracing the same number of instructions?

143–146

you didn't apply these changes

147

or GetErrorByInstructionIndex

152

same here

This revision now requires changes to proceed.Mar 24 2022, 10:15 AM
zrthxn marked 17 inline comments as done.Mar 24 2022, 11:05 AM
zrthxn updated this revision to Diff 418000.Mar 24 2022, 11:47 AM

Incorporate other comments

zrthxn retitled this revision from [wip][intelpt] Refactoring instruction decoding for flexibility to [intelpt] Refactoring instruction decoding for flexibility.Mar 24 2022, 11:48 AM
zrthxn updated this revision to Diff 418154.EditedMar 25 2022, 1:23 AM

Resolved many runtime errors.
One small thing still remains with the post-mortem decoder

zrthxn updated this revision to Diff 418249.Mar 25 2022, 9:10 AM

Refactor to use more templates and param packs

wallace requested changes to this revision.Mar 25 2022, 9:40 AM

much closer! I'm glad you are starting to understand the patterns we use for this kind of code

lldb/source/Plugins/Trace/intel-pt/DecodedThread.cpp
13

delete this

14

you don't need to include it. Maybe your C++ VSCode extension has been auto-including this one, in which case it's better to disable that feature.

128–131

in order to have correct formatting throughout, you need to use git clang-format: https://llvm.org/docs/Contributing.html#format-patches

Follow that guide. Whenever you are going to submit a patch, first run git clang-format and it will format your code correctly, so that you never have to lose time doing that again. It can even format comments

129–130

here you also need to ask for the size of the DenseMap

136–139

emplace will prevent unnecessary copies and also doesn't require you to pass a pair
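
For example, with llvm::DenseMap the value can be constructed in place instead of building a std::pair first (variable names here are made up):

// Instead of: m_errors.insert({insn_index, std::move(error_message)});
m_errors.try_emplace(insn_index, std::move(error_message));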

lldb/source/Plugins/Trace/intel-pt/DecodedThread.h
43–46

delete commented code

72

Pass the error by const reference, because we don't modify it

135–137
146
151

remove this one. We need the version that accepts a parameter pack, as we discussed offline

153–154

same here

lldb/source/Plugins/Trace/intel-pt/IntelPTDecoder.cpp
108–109

the magic of make_shared is that it uses parameter packs, so that it only constructs the object right where it'll store it, thus preventing unnecessary copies

175

don't pass the process, because we can get it from the thread_sp object

199–201

if the compiler complains that the Process pointer you are passing is const, you can do an unsafe cast that removes the const qualifier and write a comment here. I hope you don't need it anyway

206–207

use make_shared instead of shared_from_this. That will be much more performant

210

don't pass the process, it's not necessary anymore

214–218

same here

245–246

just return a DecodedThreadSP directly, holding any errors you might find here, just like LiveThreadDecoder::DoDecode()

256–261

this will become simpler once DecodeTraceFile returns a DecodedThreadSP directly

This revision now requires changes to proceed.Mar 25 2022, 9:40 AM
zrthxn marked 15 inline comments as done.Mar 25 2022, 10:17 AM

Before refactor

thread #1: tid = 37275
  Raw trace size: 4 KiB
  Total number of instructions: 21
  Total approximate memory usage: 5.38 KiB

After refactor

(lldb) thread trace dump info
Trace technology: intel-pt

thread #1: tid = 13690
  Raw trace size: 4 KiB
  Total number of instructions: 20
  Total approximate memory usage: 5.34 KiB
zrthxn marked 2 inline comments as done.Mar 25 2022, 12:37 PM
zrthxn updated this revision to Diff 418299.Mar 25 2022, 12:38 PM

Incorporate more feedback.
Only the parameter pack isn't done yet.

zrthxn marked 4 inline comments as done.Mar 25 2022, 1:03 PM
zrthxn updated this revision to Diff 418306.Mar 25 2022, 1:03 PM

Finalize diff

zrthxn updated this revision to Diff 418309.Mar 25 2022, 1:14 PM

Added average memory per instruction

jj10306 added inline comments.Mar 25 2022, 2:02 PM
lldb/source/Plugins/Trace/intel-pt/DecodedThread.cpp
38

Feels kinda weird that the err param isn't being used? Not sure if it would be preferred, but another option would be to make the default constructor construct an error instruction.
Passing the error makes it clearer that this constructor will create an error instruction, but then it feels a bit unnecessary to pass it since it's now unused. I'm a little torn, so I'd be curious to hear others' opinions (:

41

Is this boolean necessary? In the case of an error, the other two fields also indicate an error so this doesn't seem to add much information.
If we remove it, you can just update IsError to check the other two fields accordingly.

lldb/source/Plugins/Trace/intel-pt/DecodedThread.h
152

Should we be using std::forward() here? Same question for the AppendError function
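
For context on the question: std::forward preserves the value category of each forwarded argument, so rvalues keep being moved instead of copied. A minimal sketch (names are illustrative):

template <typename... Ts> void AppendInstruction(Ts &&... args) {
  // Without std::forward, the named parameters are treated as lvalues and the
  // in-place constructor would copy them; with it, rvalue arguments are still moved.
  m_instructions.emplace_back(std::forward<Ts>(args)...);
}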

lldb/source/Plugins/Trace/intel-pt/TraceCursorIntelPT.cpp
77

nit: should we update this to use the error map? I don't think there's a significant difference performance-wise, but the code would be a little cleaner imo and consistent with how GetError() works.

zrthxn added inline comments.Mar 25 2022, 2:22 PM
lldb/source/Plugins/Trace/intel-pt/TraceCursorIntelPT.cpp
77

That would sort of look like this, I think:

if (m_decoded_thread_sp->GetErrorByInstructionIndex(m_pos).isA<ErrorSuccess>())
  return false;
else
  return true;
zrthxn updated this revision to Diff 418327.Mar 25 2022, 2:28 PM

Updated tests

jj10306 added inline comments.Mar 25 2022, 3:28 PM
lldb/source/Plugins/Trace/intel-pt/TraceCursorIntelPT.cpp
77

What about
return (bool)m_decoded_thread_sp->GetErrorByInstructionIndex(m_pos);
Another idea is to just remove the IsError() function entirely, since calling GetError() tells you if it's an error. IIRC all error checks actually use GetError except for the checks inside of TraceHtr, which is soon going to be deleted by @wallace in new patches, so you could just change those couple of instances of IsError and remove it altogether. Definitely not necessary, just spitballing ideas (:
@wallace what do you think?

wallace added inline comments.Mar 25 2022, 3:56 PM
lldb/source/Plugins/Trace/intel-pt/DecodedThread.cpp
25
101–107

let's improve this method a bit

lldb/source/Plugins/Trace/intel-pt/DecodedThread.h
85–87

good

151

just call it args or instruction_args instead of __args

152

the goal is to avoid explicitly creating an IntelPTInstruction, so you should be able to achieve something like

m_instructions.emplace_back(args...);
lldb/source/Plugins/Trace/intel-pt/IntelPTDecoder.cpp
199–201

delete this

lldb/source/Plugins/Trace/intel-pt/TraceIntelPT.cpp
122–123

print it in bytes instead

wallace added inline comments.Mar 25 2022, 3:59 PM
lldb/source/Plugins/Trace/intel-pt/TraceCursorIntelPT.cpp
77

I don't think that's a good idea. The problem with calling GetErrorByInstructionIndex is that you then have an Error object that you need to consume. There's also the cost of creating this object even if you just want to know if there's an error or not and you don't want to do anything with the actual error message. It's better, then, to create the Error object only when needed.
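
In other words, even a yes/no check would still have to consume the returned object, roughly like this (a sketch illustrating the point, not the actual patch; assertion-enabled builds abort if a failure Error is destroyed unchecked):

llvm::Error err = m_decoded_thread_sp->GetErrorByInstructionIndex(m_pos);
bool is_error = static_cast<bool>(err); // true if it holds a failure
llvm::consumeError(std::move(err));     // still required before err goes out of scope
return is_error;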

zrthxn marked 13 inline comments as done.Mar 25 2022, 10:12 PM
zrthxn added inline comments.
lldb/source/Plugins/Trace/intel-pt/DecodedThread.cpp
25

This is needed at DecodedThread.h:66, not here

zrthxn marked an inline comment as done.Mar 25 2022, 10:15 PM

Test program info (llvm-project/lldb/test/API/commands/trace/intelpt-trace/a.out)

(lldb) thread trace dump info
Trace technology: intel-pt

thread #1: tid = 3842849
  Raw trace size: 4 KiB
  Total number of instructions: 22
  Total approximate memory usage: 6.38 KiB
  Average memory usage per instruction: 296 bytes
zrthxn updated this revision to Diff 418376.Mar 25 2022, 10:17 PM

Clean up and finalize

wallace commandeered this revision.Mar 26 2022, 10:53 AM
wallace edited reviewers, added: zrthxn; removed: wallace.
wallace updated this revision to Diff 418404.Mar 26 2022, 10:55 AM
  • make tests pass
  • simplified the error handling. In fact, using Error objects might be too expensive and potentially provides little value in the API, because the user needs to consume the Error forcefully. Besides that, once we expose this to Python, the error will be a plain string; therefore, I'm now storing the error as a string. Errors won't be that frequent, so the cost of that is okay.
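
A minimal sketch of that final shape (member and function names as discussed above; the exact code in the landed patch may differ):

// Errors stored as plain strings, keyed by the index of the failed instruction.
llvm::DenseMap<uint64_t, std::string> m_errors;

llvm::Error DecodedThread::GetErrorByInstructionIndex(uint64_t insn_index) {
  auto it = m_errors.find(insn_index);
  if (it == m_errors.end())
    return llvm::Error::success();
  // Build a fresh llvm::Error from the stored message on each call.
  return llvm::createStringError(llvm::inconvertibleErrorCode(), "%s",
                                 it->second.c_str());
}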

wallace accepted this revision.Mar 26 2022, 11:03 AM
zrthxn commandeered this revision.Mar 26 2022, 11:04 AM
zrthxn removed a reviewer: zrthxn.
jj10306 accepted this revision.Mar 26 2022, 11:14 AM
This revision is now accepted and ready to land.Mar 26 2022, 11:14 AM