This is an archive of the discontinued LLVM Phabricator instance.

Differential D129257

[trace][intel pt] Add a cgroup filter
ClosedPublic

Authored by wallace on Jul 7 2022, 12:50 AM.

Details

Reviewers

jj10306
persona0220

Commits

rGd30fd5c3a17b: [trace][intel pt] Add a cgroup filter

Summary

Thanks to Gaurav Gaur (gaur@fb.com), who did most of the investigation to make this happen.

It turns out that cgroup filtering is relatively trivial and works
really nicely. This diff adds automatic cgroup filtering when in
per-cpu mode, unless a new --disable-cgroup-filtering flag is passed in
the start command. At least on Meta machines, all processes are spawned
inside a cgroup by default, which comes in super handy, because per-cpu
tracing is now much more precise.

A manual test gave me these results:

Without filtering:

Total number of trace items: 36083
Total number of continuous executions found: 229
Number of continuous executions for this thread: 2
Total number of PSB blocks found: 98
Number of PSB blocks for this thread 2
Total number of unattributed PSB blocks found: 38

With filtering:

Total number of trace items: 87756
Total number of continuous executions found: 123
Number of continuous executions for this thread: 2
Total number of PSB blocks found: 10
Number of PSB blocks for this thread 3
Total number of unattributed PSB blocks found: 2

Filtering gives us great results. The number of instructions collected
more than doubles (probably because we have less noise in the trace), and
we have far fewer unattributed PSB blocks and unrelated PSBs in
general. The ones that are unrelated probably belong to other processes
in the same cgroup.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

wallace created this revision. Jul 7 2022, 12:50 AM

Herald added a project: Restricted Project. Jul 7 2022, 12:50 AM

wallace requested review of this revision. Jul 7 2022, 12:50 AM

Herald added a subscriber: lldb-commits.

wallace edited the summary of this revision. Jul 7 2022, 12:50 AM

Herald added a subscriber: JDevlieghere. Jul 7 2022, 12:50 AM

wallace edited the summary of this revision. Jul 7 2022, 12:52 AM

Harbormaster completed remote builds in B174075: Diff 442807. Jul 7 2022, 12:53 AM

looks good overall, just a couple questions from my end

lldb/source/Plugins/Process/Linux/IntelPTCollector.cpp
87–90	isn't the cgroup_file path going to have two slashes since slice starts with a slash? in the case of the image above, wouldn't cgroup_file be "/sys/fs/cgroup//foo.slice/bar.service" instead of "/sys/fs/cgroup/foo.slice/bar.service"
lldb/source/Plugins/Trace/intel-pt/TraceIntelPTMultiCpuDecoder.cpp
122	Help me understand this. To me this seems like it is counting all of the PSBs that don't belong to the current thread, whereas I would expect this to only count the PSBs that don't belong to any thread? So based on my understanding we would be severely overcounting the number of unattributed PSB, but I think I'm just misunderstanding how this code flows.

This revision now requires changes to proceed. Jul 13 2022, 8:15 AM

wallace added inline comments. Jul 13 2022, 12:00 PM

lldb/source/Plugins/Process/Linux/IntelPTCollector.cpp
87–90	I think that's fine for system paths. You can have multiple consecutive //// and the system collapses them
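
The answer above matches POSIX pathname resolution, which treats a run of slashes inside a path as a single separator. A minimal sketch using only the C++17 standard library and the example path from the question; `SameLexicalPath` is a hypothetical helper, not part of the patch, and it only illustrates the equivalence lexically (the kernel does the real resolution when `open` is called):

```cpp
#include <filesystem>
#include <string>

// Hypothetical helper (not part of the patch): reports whether two path
// spellings name the same location after purely lexical normalization,
// which collapses runs of '/' into a single separator.
bool SameLexicalPath(const std::string &a, const std::string &b) {
  namespace fs = std::filesystem;
  return fs::path(a).lexically_normal() == fs::path(b).lexically_normal();
}
```

Calling `SameLexicalPath("/sys/fs/cgroup//foo.slice/bar.service", "/sys/fs/cgroup/foo.slice/bar.service")` is expected to return true, so the doubled slash produced by the format string should not change which directory gets opened.
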
lldb/source/Plugins/Trace/intel-pt/TraceIntelPTMultiCpuDecoder.cpp
122	the current code is correct, I'll explain:

on_new_thread_execution is executed on all threads of the same CPU in chronological order. As on_new_thread_execution is being invoked repeatedly, the `it` iterator is being traversed and is always moving forwards. When on_new_thread_execution is invoked, the `it` iterator will look for PSB blocks that happened before the given execution. These blocks do not belong to any thread execution.

Graphically, we have

    ----exec 1----         ---exec 2----         ---exec 3----
    PSB1      PSB2   PSB3      PSB4       PSB5       PSB6

When on_new_thread_execution is invoked for exec 2, `it` will be pointing at PSB3, which is the first PSB after exec 1. PSB3 comes before exec 2, so it'll be unattributed, then it will move to PSB4 and so on.
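
The forward-only cursor described above can be sketched in isolation. This is not the decoder's code: `Execution`, `CountUnattributedPSBs`, and the use of plain TSC numbers are hypothetical stand-ins, and treating PSBs after the last execution as unattributed is an assumption of the sketch.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a thread execution on one CPU; the name and
// fields are illustrative and do not come from the patch.
struct Execution {
  uint64_t start_tsc;
  uint64_t end_tsc;
};

// Counts the PSBs on one CPU that fall outside every execution. Both inputs
// must be sorted chronologically, mirroring the order in which the callback
// is invoked in the explanation above.
size_t CountUnattributedPSBs(const std::vector<Execution> &executions,
                             const std::vector<uint64_t> &psb_tscs) {
  size_t unattributed = 0;
  auto it = psb_tscs.begin(); // Shared cursor: it only ever moves forward.

  for (const Execution &exec : executions) {
    // PSBs strictly before this execution were not claimed by any earlier
    // execution, so they belong to no thread.
    while (it != psb_tscs.end() && *it < exec.start_tsc) {
      ++unattributed;
      ++it;
    }
    // PSBs inside the execution are attributed to its thread.
    while (it != psb_tscs.end() && *it <= exec.end_tsc)
      ++it;
  }
  // Assumption of this sketch: PSBs after the last execution are also
  // counted as unattributed.
  unattributed += static_cast<size_t>(psb_tscs.end() - it);
  return unattributed;
}
```

With three executions covering TSCs 10 to 20, 30 to 40, and 50 to 60 (made-up numbers) and PSBs at 12, 18, 25, 35, 45, and 55, only the PSBs at 25 and 45 fall in a gap, so the count is 2. Because the cursor never rewinds, a PSB that belongs to another thread's execution is skipped by that execution and is never counted against a later one.
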

wallace requested review of this revision. Jul 13 2022, 12:00 PM

thanks for answering those questions, lgtm

lldb/source/Plugins/Process/Linux/IntelPTCollector.cpp
87–90	TIL
lldb/source/Plugins/Trace/intel-pt/TraceIntelPTMultiCpuDecoder.cpp
122	makes sense, thanks for the explanation!

This revision is now accepted and ready to land. Jul 13 2022, 12:25 PM

Closed by commit rGd30fd5c3a17b: [trace][intel pt] Add a cgroup filter (authored by Gaurav Gaur <gaur@fb.com>, committed by Walter Erquinigo <wallace@fb.com>). Jul 13 2022, 12:26 PM

This revision was automatically updated to reflect the committed changes.

Walter Erquinigo <wallace@fb.com> added a commit: rGd30fd5c3a17b: [trace][intel pt] Add a cgroup filter.

Revision Contents

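Path	Size
lldb/include/lldb/Utility/TraceIntelPTGDBRemotePackets.h	5 lines
lldb/source/Plugins/Process/Linux/IntelPTCollector.cpp	43 lines
lldb/source/Plugins/Process/Linux/IntelPTMultiCoreTrace.h	14 lines
lldb/source/Plugins/Process/Linux/IntelPTMultiCoreTrace.cpp	8 lines
lldb/source/Plugins/Process/Linux/IntelPTSingleBufferTrace.h	7 lines
lldb/source/Plugins/Process/Linux/IntelPTSingleBufferTrace.cpp	15 lines
lldb/source/Plugins/Trace/intel-pt/CommandObjectTraceStartIntelPT.h	1 line
lldb/source/Plugins/Trace/intel-pt/CommandObjectTraceStartIntelPT.cpp	13 lines
lldb/source/Plugins/Trace/intel-pt/TraceIntelPT.h	6 lines
lldb/source/Plugins/Trace/intel-pt/TraceIntelPT.cpp	21 lines
lldb/source/Plugins/Trace/intel-pt/TraceIntelPTConstants.h	1 line
lldb/source/Plugins/Trace/intel-pt/TraceIntelPTMultiCpuDecoder.h	16 lines
lldb/source/Plugins/Trace/intel-pt/TraceIntelPTMultiCpuDecoder.cpp	24 lines
lldb/source/Plugins/Trace/intel-pt/TraceIntelPTOptions.td	5 lines
lldb/source/Utility/TraceIntelPTGDBRemotePackets.cpp	12 lines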

Diff 442807

lldb/include/lldb/Utility/TraceIntelPTGDBRemotePackets.h

Show First 20 Lines • Show All 44 Lines • ▼ Show 20 Lines	struct TraceIntelPTStartRequest : TraceStartRequest {
/// Limit in bytes on all the thread traces started by this "process trace"		/// Limit in bytes on all the thread traces started by this "process trace"
/// instance. When a thread is about to be traced and the limit would be hit,		/// instance. When a thread is about to be traced and the limit would be hit,
/// then a "tracing" stop event is triggered.		/// then a "tracing" stop event is triggered.
llvm::Optional<uint64_t> process_buffer_size_limit;		llvm::Optional<uint64_t> process_buffer_size_limit;

/// Whether to have a trace buffer per thread or per cpu cpu.		/// Whether to have a trace buffer per thread or per cpu cpu.
llvm::Optional<bool> per_cpu_tracing;		llvm::Optional<bool> per_cpu_tracing;

		/// Disable the cgroup filtering that is automatically applied in per cpu
		/// mode.
		llvm::Optional<bool> disable_cgroup_filtering;

bool IsPerCpuTracing() const;		bool IsPerCpuTracing() const;
};		};

bool fromJSON(const llvm::json::Value &value, TraceIntelPTStartRequest &packet,		bool fromJSON(const llvm::json::Value &value, TraceIntelPTStartRequest &packet,
llvm::json::Path path);		llvm::json::Path path);

llvm::json::Value toJSON(const TraceIntelPTStartRequest &packet);		llvm::json::Value toJSON(const TraceIntelPTStartRequest &packet);
/// \}		/// \}
▲ Show 20 Lines • Show All 41 Lines • ▼ Show 20 Lines	struct LinuxPerfZeroTscConversion {
uint32_t time_mult;		uint32_t time_mult;
uint16_t time_shift;		uint16_t time_shift;
JSONUINT64 time_zero;		JSONUINT64 time_zero;
};		};

struct TraceIntelPTGetStateResponse : TraceGetStateResponse {		struct TraceIntelPTGetStateResponse : TraceGetStateResponse {
/// The TSC to wall time conversion if it exists, otherwise \b nullptr.		/// The TSC to wall time conversion if it exists, otherwise \b nullptr.
llvm::Optional<LinuxPerfZeroTscConversion> tsc_perf_zero_conversion;		llvm::Optional<LinuxPerfZeroTscConversion> tsc_perf_zero_conversion;
		bool using_cgroup_filtering = false;
};		};

bool fromJSON(const llvm::json::Value &value,		bool fromJSON(const llvm::json::Value &value,
LinuxPerfZeroTscConversion &packet, llvm::json::Path path);		LinuxPerfZeroTscConversion &packet, llvm::json::Path path);

llvm::json::Value toJSON(const LinuxPerfZeroTscConversion &packet);		llvm::json::Value toJSON(const LinuxPerfZeroTscConversion &packet);

bool fromJSON(const llvm::json::Value &value,		bool fromJSON(const llvm::json::Value &value,
TraceIntelPTGetStateResponse &packet, llvm::json::Path path);		TraceIntelPTGetStateResponse &packet, llvm::json::Path path);

llvm::json::Value toJSON(const TraceIntelPTGetStateResponse &packet);		llvm::json::Value toJSON(const TraceIntelPTGetStateResponse &packet);
/// \}		/// \}

} // namespace lldb_private		} // namespace lldb_private

#endif // LLDB_UTILITY_TRACEINTELPTGDBREMOTEPACKETS_H		#endif // LLDB_UTILITY_TRACEINTELPTGDBREMOTEPACKETS_H
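
Because `disable_cgroup_filtering` is optional in the packet, an absent field has to mean "do not disable". A minimal sketch of that decision, assuming `IsPerCpuTracing()` also treats a missing `per_cpu_tracing` as false; `StartRequestSketch` and `ShouldAttemptCgroupFiltering` are hypothetical names, and `std::optional` stands in for `llvm::Optional`:

```cpp
#include <optional>

// Hypothetical mirror of the two optional request fields touched by this
// diff; std::optional stands in for llvm::Optional.
struct StartRequestSketch {
  std::optional<bool> per_cpu_tracing;
  std::optional<bool> disable_cgroup_filtering;
};

// Cgroup filtering is attempted only in per-cpu mode and only when the
// client did not explicitly disable it. An absent field counts as false.
bool ShouldAttemptCgroupFiltering(const StartRequestSketch &request) {
  return request.per_cpu_tracing.value_or(false) &&
         !request.disable_cgroup_filtering.value_or(false);
}
```

Whether filtering actually ends up active is a separate question: the response's `using_cgroup_filtering` field reports it, since the attempt can still fail if no cgroup file descriptor is found.
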

lldb/source/Plugins/Process/Linux/IntelPTCollector.cpp

Show All 15 Lines
#include "lldb/Utility/StreamString.h"		#include "lldb/Utility/StreamString.h"

#include "llvm/ADT/StringRef.h"		#include "llvm/ADT/StringRef.h"
#include "llvm/Support/Error.h"		#include "llvm/Support/Error.h"
#include "llvm/Support/MathExtras.h"		#include "llvm/Support/MathExtras.h"

#include <algorithm>		#include <algorithm>
#include <cstddef>		#include <cstddef>
		#include <fcntl.h>
#include <fstream>		#include <fstream>
#include <linux/perf_event.h>		#include <linux/perf_event.h>
#include <sstream>		#include <sstream>
#include <sys/ioctl.h>		#include <sys/ioctl.h>
#include <sys/syscall.h>		#include <sys/syscall.h>

using namespace lldb;		using namespace lldb;
using namespace lldb_private;		using namespace lldb_private;
Show All 28 Lines	if (request.IsProcessTracing()) {
Error error = Error::success();		Error error = Error::success();
for (int64_t tid : *request.tids)		for (int64_t tid : *request.tids)
error = joinErrors(std::move(error),		error = joinErrors(std::move(error),
TraceStop(static_cast<lldb::tid_t>(tid)));		TraceStop(static_cast<lldb::tid_t>(tid)));
return error;		return error;
}		}
}		}

		/// \return
		/// some file descriptor in /sys/fs/ associated with the cgroup of the given
		/// pid, or \a llvm::None if the pid is not part of a cgroup.
		static Optional<int> GetCGroupFileDescriptor(lldb::pid_t pid) {
		static Optional<int> fd;
		if (fd)
		return fd;

		std::ifstream ifile;
		ifile.open(formatv("/proc/{0}/cgroup", pid));
		if (!ifile)
		return None;

		std::string line;
		while (std::getline(ifile, line)) {
		if (line.find("0:") != 0)
		continue;

		std::string slice = line.substr(line.find_first_of("/"));
		if (slice.empty())
		return None;
		std::string cgroup_file = formatv("/sys/fs/cgroup/{0}", slice);
		// This cgroup should for the duration of the target, so we don't need to
		// invoke close ourselves.
		int maybe_fd = open(cgroup_file.c_str(), O_RDONLY);
		if (maybe_fd != -1) {
		fd = maybe_fd;
		return fd;
		}
		}
		return None;
		}

Error IntelPTCollector::TraceStart(const TraceIntelPTStartRequest &request) {		Error IntelPTCollector::TraceStart(const TraceIntelPTStartRequest &request) {
if (request.IsProcessTracing()) {		if (request.IsProcessTracing()) {
if (m_process_trace_up) {		if (m_process_trace_up) {
return createStringError(		return createStringError(
inconvertibleErrorCode(),		inconvertibleErrorCode(),
"Process currently traced. Stop process tracing first");		"Process currently traced. Stop process tracing first");
}		}
if (request.IsPerCpuTracing()) {		if (request.IsPerCpuTracing()) {
if (m_thread_traces.GetTracedThreadsCount() > 0)		if (m_thread_traces.GetTracedThreadsCount() > 0)
return createStringError(		return createStringError(
inconvertibleErrorCode(),		inconvertibleErrorCode(),
"Threads currently traced. Stop tracing them first.");		"Threads currently traced. Stop tracing them first.");
// CPU tracing is useless if we can't convert tsc to nanos.		// CPU tracing is useless if we can't convert tsc to nanos.
Expected<LinuxPerfZeroTscConversion &> tsc_conversion =		Expected<LinuxPerfZeroTscConversion &> tsc_conversion =
FetchPerfTscConversionParameters();		FetchPerfTscConversionParameters();
if (!tsc_conversion)		if (!tsc_conversion)
return tsc_conversion.takeError();		return tsc_conversion.takeError();

// We force the enabledment of TSCs, which is needed for correlating the		// We force the enablement of TSCs, which is needed for correlating the
// cpu traces.		// cpu traces.
TraceIntelPTStartRequest effective_request = request;		TraceIntelPTStartRequest effective_request = request;
effective_request.enable_tsc = true;		effective_request.enable_tsc = true;

		// We try to use cgroup filtering whenever possible
		Optional<int> cgroup_fd;
		if (!request.disable_cgroup_filtering.getValueOr(false))
		cgroup_fd = GetCGroupFileDescriptor(m_process.GetID());

if (Expected<IntelPTProcessTraceUP> trace =		if (Expected<IntelPTProcessTraceUP> trace =
IntelPTMultiCoreTrace::StartOnAllCores(effective_request,		IntelPTMultiCoreTrace::StartOnAllCores(effective_request,
m_process)) {		m_process, cgroup_fd)) {
m_process_trace_up = std::move(*trace);		m_process_trace_up = std::move(*trace);
return Error::success();		return Error::success();
} else {		} else {
return trace.takeError();		return trace.takeError();
}		}
} else {		} else {
std::vector<lldb::tid_t> process_threads;		std::vector<lldb::tid_t> process_threads;
for (NativeThreadProtocol &thread : m_process.Threads())		for (NativeThreadProtocol &thread : m_process.Threads())
▲ Show 20 Lines • Show All 128 Lines • Show Last 20 Lines
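
The lookup in `GetCGroupFileDescriptor` can be separated into a pure parsing step that is easy to exercise without a live process. The sketch below is not the patch's function: `CgroupDirFromProcText` is a hypothetical helper that takes the text of `/proc/<pid>/cgroup`, picks the entry with hierarchy ID 0 (the cgroup v2 entry, formatted as `0::/path`), and returns the matching directory under `/sys/fs/cgroup`. Unlike the code in the diff, it stops at the first matching line and checks for a missing slash before calling `substr`.

```cpp
#include <cstddef>
#include <optional>
#include <sstream>
#include <string>

// Hypothetical helper, not the patch's function: given the text of
// /proc/<pid>/cgroup, returns the directory under /sys/fs/cgroup for the
// "0:" entry, or std::nullopt if there is none.
std::optional<std::string> CgroupDirFromProcText(const std::string &text) {
  std::istringstream stream(text);
  std::string line;
  while (std::getline(stream, line)) {
    if (line.rfind("0:", 0) != 0) // Keep only lines that start with "0:".
      continue;
    const std::size_t slash = line.find('/');
    if (slash == std::string::npos) // Guard: substr(npos) would throw.
      return std::nullopt;
    // The slice already starts with '/', so no extra separator is added.
    return "/sys/fs/cgroup" + line.substr(slash);
  }
  return std::nullopt;
}
```

For the input `0::/foo.slice/bar.service` this yields `/sys/fs/cgroup/foo.slice/bar.service`. The resulting string would then be passed to `open` with `O_RDONLY`, as the diff does, to obtain the descriptor.
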

lldb/source/Plugins/Process/Linux/IntelPTMultiCoreTrace.h

Show All 29 Lines	public:
/// Start tracing all CPU cores.		/// Start tracing all CPU cores.
///		///
/// \param[in] request		/// \param[in] request
/// Intel PT configuration parameters.		/// Intel PT configuration parameters.
///		///
/// \param[in] process		/// \param[in] process
/// The process being debugged.		/// The process being debugged.
///		///
		/// \param[in] cgroup_fd
		/// A file descriptor in /sys/fs associated with the cgroup of the process to
		/// trace. If not \a llvm::None, then the trace sesion will use cgroup
		/// filtering.
		///
/// \return		/// \return
/// An \a IntelPTMultiCoreTrace instance if tracing was successful, or		/// An \a IntelPTMultiCoreTrace instance if tracing was successful, or
/// an \a llvm::Error otherwise.		/// an \a llvm::Error otherwise.
static llvm::Expected<std::unique_ptr<IntelPTMultiCoreTrace>>		static llvm::Expected<std::unique_ptr<IntelPTMultiCoreTrace>>
StartOnAllCores(const TraceIntelPTStartRequest &request,		StartOnAllCores(const TraceIntelPTStartRequest &request,
NativeProcessProtocol &process);		NativeProcessProtocol &process,
		llvm::Optional<int> cgroup_fd = llvm::None);

/// Execute the provided callback on each core that is being traced.		/// Execute the provided callback on each core that is being traced.
///		///
/// \param[in] callback.cpu_id		/// \param[in] callback.cpu_id
/// The core id that is being traced.		/// The core id that is being traced.
///		///
/// \param[in] callback.core_trace		/// \param[in] callback.core_trace
/// The single-buffer trace instance for the given core.		/// The single-buffer trace instance for the given core.
Show All 33 Lines

private:		private:
/// This assumes that all underlying perf_events for each core are part of the		/// This assumes that all underlying perf_events for each core are part of the
/// same perf event group.		/// same perf event group.
IntelPTMultiCoreTrace(		IntelPTMultiCoreTrace(
llvm::DenseMap<lldb::cpu_id_t,		llvm::DenseMap<lldb::cpu_id_t,
std::pair<IntelPTSingleBufferTrace, ContextSwitchTrace>>		std::pair<IntelPTSingleBufferTrace, ContextSwitchTrace>>
&&traces_per_core,		&&traces_per_core,
NativeProcessProtocol &process)		NativeProcessProtocol &process, bool using_cgroup_filtering)
: m_traces_per_core(std::move(traces_per_core)), m_process(process) {}		: m_traces_per_core(std::move(traces_per_core)), m_process(process),
		m_using_cgroup_filtering(using_cgroup_filtering) {}

llvm::DenseMap<lldb::cpu_id_t,		llvm::DenseMap<lldb::cpu_id_t,
std::pair<IntelPTSingleBufferTrace, ContextSwitchTrace>>		std::pair<IntelPTSingleBufferTrace, ContextSwitchTrace>>
m_traces_per_core;		m_traces_per_core;

/// The target process.		/// The target process.
NativeProcessProtocol &m_process;		NativeProcessProtocol &m_process;
		bool m_using_cgroup_filtering;
};		};

} // namespace process_linux		} // namespace process_linux
} // namespace lldb_private		} // namespace lldb_private

#endif // liblldb_IntelPTMultiCoreTrace_H_		#endif // liblldb_IntelPTMultiCoreTrace_H_

lldb/source/Plugins/Process/Linux/IntelPTMultiCoreTrace.cpp

Show All 29 Lines	return createStringError(
inconvertibleErrorCode(),		inconvertibleErrorCode(),
"%s\nYou might need to rerun as sudo or to set "		"%s\nYou might need to rerun as sudo or to set "
"/proc/sys/kernel/perf_event_paranoid to a value of 0 or -1.",		"/proc/sys/kernel/perf_event_paranoid to a value of 0 or -1.",
toString(std::move(error)).c_str());		toString(std::move(error)).c_str());
}		}

Expected<std::unique_ptr<IntelPTMultiCoreTrace>>		Expected<std::unique_ptr<IntelPTMultiCoreTrace>>
IntelPTMultiCoreTrace::StartOnAllCores(const TraceIntelPTStartRequest &request,		IntelPTMultiCoreTrace::StartOnAllCores(const TraceIntelPTStartRequest &request,
NativeProcessProtocol &process) {		NativeProcessProtocol &process,
		Optional<int> cgroup_fd) {
Expected<ArrayRef<cpu_id_t>> cpu_ids = GetAvailableLogicalCoreIDs();		Expected<ArrayRef<cpu_id_t>> cpu_ids = GetAvailableLogicalCoreIDs();
if (!cpu_ids)		if (!cpu_ids)
return cpu_ids.takeError();		return cpu_ids.takeError();

if (IsTotalBufferLimitReached(*cpu_ids, request))		if (IsTotalBufferLimitReached(*cpu_ids, request))
return createStringError(		return createStringError(
inconvertibleErrorCode(),		inconvertibleErrorCode(),
"The process can't be traced because the process trace size limit "		"The process can't be traced because the process trace size limit "
"has been reached. Consider retracing with a higher limit.");		"has been reached. Consider retracing with a higher limit.");

DenseMap<cpu_id_t, std::pair<IntelPTSingleBufferTrace, ContextSwitchTrace>>		DenseMap<cpu_id_t, std::pair<IntelPTSingleBufferTrace, ContextSwitchTrace>>
traces;		traces;

for (cpu_id_t cpu_id : *cpu_ids) {		for (cpu_id_t cpu_id : *cpu_ids) {
Expected<IntelPTSingleBufferTrace> core_trace =		Expected<IntelPTSingleBufferTrace> core_trace =
IntelPTSingleBufferTrace::Start(request, /*tid=*/None, cpu_id,		IntelPTSingleBufferTrace::Start(request, /*tid=*/None, cpu_id,
/*disabled=*/true);		/*disabled=*/true, cgroup_fd);
if (!core_trace)		if (!core_trace)
return IncludePerfEventParanoidMessageInError(core_trace.takeError());		return IncludePerfEventParanoidMessageInError(core_trace.takeError());

if (Expected<PerfEvent> context_switch_trace =		if (Expected<PerfEvent> context_switch_trace =
CreateContextSwitchTracePerfEvent(cpu_id,		CreateContextSwitchTracePerfEvent(cpu_id,
&core_trace->GetPerfEvent())) {		&core_trace->GetPerfEvent())) {
traces.try_emplace(cpu_id,		traces.try_emplace(cpu_id,
std::make_pair(std::move(*core_trace),		std::make_pair(std::move(*core_trace),
std::move(*context_switch_trace)));		std::move(*context_switch_trace)));
} else {		} else {
return context_switch_trace.takeError();		return context_switch_trace.takeError();
}		}
}		}

return std::unique_ptr<IntelPTMultiCoreTrace>(		return std::unique_ptr<IntelPTMultiCoreTrace>(
new IntelPTMultiCoreTrace(std::move(traces), process));		new IntelPTMultiCoreTrace(std::move(traces), process, (bool)cgroup_fd));
}		}

void IntelPTMultiCoreTrace::ForEachCore(		void IntelPTMultiCoreTrace::ForEachCore(
std::function<void(cpu_id_t cpu_id, IntelPTSingleBufferTrace &core_trace)>		std::function<void(cpu_id_t cpu_id, IntelPTSingleBufferTrace &core_trace)>
callback) {		callback) {
for (auto &it : m_traces_per_core)		for (auto &it : m_traces_per_core)
callback(it.first, it.second.first);		callback(it.first, it.second.first);
}		}
Show All 21 Lines	if (Error err = core_trace.Resume()) {
LLDB_LOG_ERROR(GetLog(POSIXLog::Trace), std::move(err),		LLDB_LOG_ERROR(GetLog(POSIXLog::Trace), std::move(err),
"Unable to resume the core trace for core {0}", cpu_id);		"Unable to resume the core trace for core {0}", cpu_id);
}		}
});		});
}		}

TraceIntelPTGetStateResponse IntelPTMultiCoreTrace::GetState() {		TraceIntelPTGetStateResponse IntelPTMultiCoreTrace::GetState() {
TraceIntelPTGetStateResponse state;		TraceIntelPTGetStateResponse state;
		state.using_cgroup_filtering = m_using_cgroup_filtering;

for (NativeThreadProtocol &thread : m_process.Threads())		for (NativeThreadProtocol &thread : m_process.Threads())
state.traced_threads.push_back(		state.traced_threads.push_back(
TraceThreadState{thread.GetID(), {}});		TraceThreadState{thread.GetID(), {}});

state.cpus.emplace();		state.cpus.emplace();
ForEachCore([&](lldb::cpu_id_t cpu_id,		ForEachCore([&](lldb::cpu_id_t cpu_id,
const IntelPTSingleBufferTrace &core_trace,		const IntelPTSingleBufferTrace &core_trace,
▲ Show 20 Lines • Show All 48 Lines • Show Last 20 Lines

lldb/source/Plugins/Process/Linux/IntelPTSingleBufferTrace.h

Show All 38 Lines	public:
/// \param[in] cpu_id		/// \param[in] cpu_id
/// The CPU core id where to trace. If \b None, then this traces all CPUs.		/// The CPU core id where to trace. If \b None, then this traces all CPUs.
///		///
/// \param[in] disabled		/// \param[in] disabled
/// If \b true, then no data is collected until \a Resume is invoked.		/// If \b true, then no data is collected until \a Resume is invoked.
/// Similarly, if \b false, data is collected right away until \a Pause is		/// Similarly, if \b false, data is collected right away until \a Pause is
/// invoked.		/// invoked.
///		///
		/// \param[in] cgroup_fd
		/// A file descriptor in /sys/fs associated with the cgroup of the process
		/// to trace. If not \a llvm::None, then the trace sesion will use cgroup
		/// filtering.
		///
/// \return		/// \return
/// A \a IntelPTSingleBufferTrace instance if tracing was successful, or		/// A \a IntelPTSingleBufferTrace instance if tracing was successful, or
/// an \a llvm::Error otherwise.		/// an \a llvm::Error otherwise.
static llvm::Expected<IntelPTSingleBufferTrace>		static llvm::Expected<IntelPTSingleBufferTrace>
Start(const TraceIntelPTStartRequest &request,		Start(const TraceIntelPTStartRequest &request,
llvm::Optional<lldb::tid_t> tid,		llvm::Optional<lldb::tid_t> tid,
llvm::Optional<lldb::cpu_id_t> cpu_id = llvm::None,		llvm::Optional<lldb::cpu_id_t> cpu_id = llvm::None,
bool disabled = false);		bool disabled = false, llvm::Optional<int> cgroup_fd = llvm::None);

/// \return		/// \return
/// The bytes requested by a jLLDBTraceGetBinaryData packet that was routed		/// The bytes requested by a jLLDBTraceGetBinaryData packet that was routed
/// to this trace instace.		/// to this trace instace.
llvm::Expected<std::vector<uint8_t>>		llvm::Expected<std::vector<uint8_t>>
GetBinaryData(const TraceGetBinaryDataRequest &request) const;		GetBinaryData(const TraceGetBinaryDataRequest &request) const;

/// Read the intel pt trace buffer managed by this trace instance. To ensure		/// Read the intel pt trace buffer managed by this trace instance. To ensure
▲ Show 20 Lines • Show All 52 Lines • Show Last 20 Lines

lldb/source/Plugins/Process/Linux/IntelPTSingleBufferTrace.cpp

Show First 20 Lines • Show All 225 Lines • ▼ Show 20 Lines	Expected<std::vector<uint8_t>> IntelPTSingleBufferTrace::GetIptTrace() {
// clearing IA32_RTIT_CTL.TraceEn (see “Disabling Packet Generation” in		// clearing IA32_RTIT_CTL.TraceEn (see “Disabling Packet Generation” in
// Section 35.2.7.2).		// Section 35.2.7.2).
//		//
// This is achieved by the PERF_EVENT_IOC_DISABLE ioctl request, as		// This is achieved by the PERF_EVENT_IOC_DISABLE ioctl request, as
// mentioned in the man page of perf_event_open.		// mentioned in the man page of perf_event_open.
return m_perf_event.GetReadOnlyAuxBuffer();		return m_perf_event.GetReadOnlyAuxBuffer();
}		}

Expected<IntelPTSingleBufferTrace>		Expected<IntelPTSingleBufferTrace> IntelPTSingleBufferTrace::Start(
IntelPTSingleBufferTrace::Start(const TraceIntelPTStartRequest &request,		const TraceIntelPTStartRequest &request, Optional<lldb::tid_t> tid,
Optional<lldb::tid_t> tid,		Optional<cpu_id_t> cpu_id, bool disabled, Optional<int> cgroup_fd) {
Optional<cpu_id_t> cpu_id, bool disabled) {
#ifndef PERF_ATTR_SIZE_VER5		#ifndef PERF_ATTR_SIZE_VER5
return createStringError(inconvertibleErrorCode(),		return createStringError(inconvertibleErrorCode(),
"Intel PT Linux perf event not supported");		"Intel PT Linux perf event not supported");
#else		#else
Log *log = GetLog(POSIXLog::Trace);		Log *log = GetLog(POSIXLog::Trace);

LLDB_LOG(log, "Will start tracing thread id {0} and cpu id {1}", tid, cpu_id);		LLDB_LOG(log, "Will start tracing thread id {0} and cpu id {1}", tid, cpu_id);

Show All 14 Lines	Expected<perf_event_attr> attr = CreateIntelPTPerfEventConfiguration(
return static_cast<uint64_t>(value);		return static_cast<uint64_t>(value);
}));		}));
if (!attr)		if (!attr)
return attr.takeError();		return attr.takeError();
attr->disabled = disabled;		attr->disabled = disabled;

LLDB_LOG(log, "Will create intel pt trace buffer of size {0}",		LLDB_LOG(log, "Will create intel pt trace buffer of size {0}",
request.ipt_trace_size);		request.ipt_trace_size);
		unsigned long flags = 0;
		if (cgroup_fd) {
		tid = *cgroup_fd;
		flags |= PERF_FLAG_PID_CGROUP;
		}

if (Expected<PerfEvent> perf_event = PerfEvent::Init(*attr, tid, cpu_id)) {		if (Expected<PerfEvent> perf_event =
		PerfEvent::Init(*attr, tid, cpu_id, -1, flags)) {
if (Error mmap_err = perf_event->MmapMetadataAndBuffers(		if (Error mmap_err = perf_event->MmapMetadataAndBuffers(
/*num_data_pages=*/0, aux_buffer_numpages,		/*num_data_pages=*/0, aux_buffer_numpages,
/*data_buffer_write=*/true)) {		/*data_buffer_write=*/true)) {
return std::move(mmap_err);		return std::move(mmap_err);
}		}
return IntelPTSingleBufferTrace(std::move(*perf_event));		return IntelPTSingleBufferTrace(std::move(*perf_event));
} else {		} else {
return perf_event.takeError();		return perf_event.takeError();
}		}
#endif		#endif
}		}

const PerfEvent &IntelPTSingleBufferTrace::GetPerfEvent() const {		const PerfEvent &IntelPTSingleBufferTrace::GetPerfEvent() const {
return m_perf_event;		return m_perf_event;
}		}

lldb/source/Plugins/Trace/intel-pt/CommandObjectTraceStartIntelPT.h

Show First 20 Lines • Show All 73 Lines • ▼ Show 20 Lines	public:

llvm::ArrayRef<OptionDefinition> GetDefinitions() override;		llvm::ArrayRef<OptionDefinition> GetDefinitions() override;

uint64_t m_ipt_trace_size;		uint64_t m_ipt_trace_size;
uint64_t m_process_buffer_size_limit;		uint64_t m_process_buffer_size_limit;
bool m_enable_tsc;		bool m_enable_tsc;
llvm::Optional<uint64_t> m_psb_period;		llvm::Optional<uint64_t> m_psb_period;
bool m_per_cpu_tracing;		bool m_per_cpu_tracing;
		bool m_disable_cgroup_filtering;
};		};

CommandObjectProcessTraceStartIntelPT(TraceIntelPT &trace,		CommandObjectProcessTraceStartIntelPT(TraceIntelPT &trace,
CommandInterpreter &interpreter)		CommandInterpreter &interpreter)
: CommandObjectParsed(		: CommandObjectParsed(
interpreter, "process trace start",		interpreter, "process trace start",
"Start tracing this process with intel-pt, including future "		"Start tracing this process with intel-pt, including future "
"threads. If --per-cpu-tracing is not provided, this traces each "		"threads. If --per-cpu-tracing is not provided, this traces each "
Show All 26 Lines

lldb/source/Plugins/Trace/intel-pt/CommandObjectTraceStartIntelPT.cpp

Show First 20 Lines • Show All 120 Lines • ▼ Show 20 Lines	Status CommandObjectProcessTraceStartIntelPT::CommandOptions::SetOptionValue(
case 't': {		case 't': {
m_enable_tsc = true;		m_enable_tsc = true;
break;		break;
}		}
case 'c': {		case 'c': {
m_per_cpu_tracing = true;		m_per_cpu_tracing = true;
break;		break;
}		}
		case 'd': {
		m_disable_cgroup_filtering = true;
		break;
		}
case 'p': {		case 'p': {
int64_t psb_period;		int64_t psb_period;
if (option_arg.empty() || option_arg.getAsInteger(0, psb_period) ||		if (option_arg.empty() || option_arg.getAsInteger(0, psb_period) ||
psb_period < 0)		psb_period < 0)
error.SetErrorStringWithFormat("invalid integer value for option '%s'",		error.SetErrorStringWithFormat("invalid integer value for option '%s'",
option_arg.str().c_str());		option_arg.str().c_str());
else		else
m_psb_period = psb_period;		m_psb_period = psb_period;
break;		break;
}		}
default:		default:
llvm_unreachable("Unimplemented option");		llvm_unreachable("Unimplemented option");
}		}
return error;		return error;
}		}

void CommandObjectProcessTraceStartIntelPT::CommandOptions::		void CommandObjectProcessTraceStartIntelPT::CommandOptions::
OptionParsingStarting(ExecutionContext *execution_context) {		OptionParsingStarting(ExecutionContext *execution_context) {
m_ipt_trace_size = kDefaultIptTraceSize;		m_ipt_trace_size = kDefaultIptTraceSize;
m_process_buffer_size_limit = kDefaultProcessBufferSizeLimit;		m_process_buffer_size_limit = kDefaultProcessBufferSizeLimit;
m_enable_tsc = kDefaultEnableTscValue;		m_enable_tsc = kDefaultEnableTscValue;
m_psb_period = kDefaultPsbPeriod;		m_psb_period = kDefaultPsbPeriod;
m_per_cpu_tracing = kDefaultPerCpuTracing;		m_per_cpu_tracing = kDefaultPerCpuTracing;
		m_disable_cgroup_filtering = kDefaultDisableCgroupFiltering;
}		}

llvm::ArrayRef<OptionDefinition>		llvm::ArrayRef<OptionDefinition>
CommandObjectProcessTraceStartIntelPT::CommandOptions::GetDefinitions() {		CommandObjectProcessTraceStartIntelPT::CommandOptions::GetDefinitions() {
return llvm::makeArrayRef(g_process_trace_start_intel_pt_options);		return llvm::makeArrayRef(g_process_trace_start_intel_pt_options);
}		}

bool CommandObjectProcessTraceStartIntelPT::DoExecute(		bool CommandObjectProcessTraceStartIntelPT::DoExecute(
Args &command, CommandReturnObject &result) {		Args &command, CommandReturnObject &result) {
if (Error err = m_trace.Start(m_options.m_ipt_trace_size,		if (Error err = m_trace.Start(
m_options.m_process_buffer_size_limit,		m_options.m_ipt_trace_size, m_options.m_process_buffer_size_limit,
m_options.m_enable_tsc, m_options.m_psb_period,		m_options.m_enable_tsc, m_options.m_psb_period,
m_options.m_per_cpu_tracing))		m_options.m_per_cpu_tracing, m_options.m_disable_cgroup_filtering))
result.SetError(Status(std::move(err)));		result.SetError(Status(std::move(err)));
else		else
result.SetStatus(eReturnStatusSuccessFinishResult);		result.SetStatus(eReturnStatusSuccessFinishResult);

return result.Succeeded();		return result.Succeeded();
}		}

lldb/source/Plugins/Trace/intel-pt/TraceIntelPT.h

Show First 20 Lines • Show All 98 Lines • ▼ Show 20 Lines	public:
///		///
/// \param[in] psb_period		/// \param[in] psb_period
/// This value defines the period in which PSB packets will be generated.		/// This value defines the period in which PSB packets will be generated.
///		///
/// \param[in] per_cpu_tracing		/// \param[in] per_cpu_tracing
/// This value defines whether to have an intel pt trace buffer per thread		/// This value defines whether to have an intel pt trace buffer per thread
/// or per cpu core.		/// or per cpu core.
///		///
		/// \param[in] disable_cgroup_filtering
		/// Disable the cgroup filtering that is automatically applied when doing
		/// per cpu tracing.
		///
/// \return		/// \return
/// \a llvm::Error::success if the operation was successful, or		/// \a llvm::Error::success if the operation was successful, or
/// \a llvm::Error otherwise.		/// \a llvm::Error otherwise.
llvm::Error Start(uint64_t ipt_trace_size, uint64_t total_buffer_size_limit,		llvm::Error Start(uint64_t ipt_trace_size, uint64_t total_buffer_size_limit,
bool enable_tsc, llvm::Optional<uint64_t> psb_period,		bool enable_tsc, llvm::Optional<uint64_t> psb_period,
bool m_per_cpu_tracing);		bool m_per_cpu_tracing, bool disable_cgroup_filtering);

/// \copydoc Trace::Start		/// \copydoc Trace::Start
llvm::Error Start(StructuredData::ObjectSP configuration =		llvm::Error Start(StructuredData::ObjectSP configuration =
StructuredData::ObjectSP()) override;		StructuredData::ObjectSP()) override;

/// Start tracing live threads.		/// Start tracing live threads.
///		///
/// More information on the parameters below can be found in the		/// More information on the parameters below can be found in the
▲ Show 20 Lines • Show All 116 Lines • Show Last 20 Lines

lldb/source/Plugins/Trace/intel-pt/TraceIntelPT.cpp

Show First 20 Lines • Show All 220 Lines • ▼ Show 20 Lines	void TraceIntelPT::DumpTraceInfo(Thread &thread, Stream &s, bool verbose) {

if (storage.multicpu_decoder) {		if (storage.multicpu_decoder) {
s << "\n Multi-cpu decoding:\n";		s << "\n Multi-cpu decoding:\n";
s.Format(" Total number of continuous executions found: {0}\n",		s.Format(" Total number of continuous executions found: {0}\n",
storage.multicpu_decoder->GetTotalContinuousExecutionsCount());		storage.multicpu_decoder->GetTotalContinuousExecutionsCount());
s.Format(		s.Format(
" Number of continuous executions for this thread: {0}\n",		" Number of continuous executions for this thread: {0}\n",
storage.multicpu_decoder->GetNumContinuousExecutionsForThread(tid));		storage.multicpu_decoder->GetNumContinuousExecutionsForThread(tid));
		s.Format(" Total number of PSB blocks found: {0}\n",
		storage.multicpu_decoder->GetTotalPSBBlocksCount());
		s.Format(" Number of PSB blocks for this thread {0}\n",
		storage.multicpu_decoder->GePSBBlocksCountForThread(tid));
		s.Format(" Total number of unattributed PSB blocks found: {0}\n",
		storage.multicpu_decoder->GetUnattributedPSBBlocksCount());
}		}

// Errors		// Errors
{		{
s << "\n Errors:\n";		s << "\n Errors:\n";
const DecodedThread::LibiptErrorsStats &tsc_errors_stats =		const DecodedThread::LibiptErrorsStats &tsc_errors_stats =
decoded_thread_sp->GetTscErrorsStats();		decoded_thread_sp->GetTscErrorsStats();
s.Format(" Number of TSC decoding errors: {0}\n",		s.Format(" Number of TSC decoding errors: {0}\n",
▲ Show 20 Lines • Show All 166 Lines • ▼ Show 20 Lines	const char *TraceIntelPT::GetStartConfigurationHelp() {

- int psbPeriod (defaults to {2}):		- int psbPeriod (defaults to {2}):
[process and thread tracing]		[process and thread tracing]

- boolean perCpuTracing (default to {3}):		- boolean perCpuTracing (default to {3}):
[process tracing only]		[process tracing only]

- int processBufferSizeLimit (defaults to {4} MiB):		- int processBufferSizeLimit (defaults to {4} MiB):
		[process tracing only]

		- boolean disableCgroupFiltering (default to {5}):
[process tracing only])",		[process tracing only])",
kDefaultIptTraceSize, kDefaultEnableTscValue,		kDefaultIptTraceSize, kDefaultEnableTscValue,
kDefaultPsbPeriod, kDefaultPerCpuTracing,		kDefaultPsbPeriod, kDefaultPerCpuTracing,
kDefaultProcessBufferSizeLimit / 1024 / 1024));		kDefaultProcessBufferSizeLimit / 1024 / 1024,
		kDefaultDisableCgroupFiltering));
}		}
return message->c_str();		return message->c_str();
}		}

Error TraceIntelPT::Start(uint64_t ipt_trace_size,		Error TraceIntelPT::Start(uint64_t ipt_trace_size,
uint64_t total_buffer_size_limit, bool enable_tsc,		uint64_t total_buffer_size_limit, bool enable_tsc,
Optional<uint64_t> psb_period, bool per_cpu_tracing) {		Optional<uint64_t> psb_period, bool per_cpu_tracing,
		bool disable_cgroup_filtering) {
TraceIntelPTStartRequest request;		TraceIntelPTStartRequest request;
request.ipt_trace_size = ipt_trace_size;		request.ipt_trace_size = ipt_trace_size;
request.process_buffer_size_limit = total_buffer_size_limit;		request.process_buffer_size_limit = total_buffer_size_limit;
request.enable_tsc = enable_tsc;		request.enable_tsc = enable_tsc;
request.psb_period = psb_period;		request.psb_period = psb_period;
request.type = GetPluginName().str();		request.type = GetPluginName().str();
request.per_cpu_tracing = per_cpu_tracing;		request.per_cpu_tracing = per_cpu_tracing;
		request.disable_cgroup_filtering = disable_cgroup_filtering;
return Trace::Start(toJSON(request));		return Trace::Start(toJSON(request));
}		}

Error TraceIntelPT::Start(StructuredData::ObjectSP configuration) {		Error TraceIntelPT::Start(StructuredData::ObjectSP configuration) {
uint64_t ipt_trace_size = kDefaultIptTraceSize;		uint64_t ipt_trace_size = kDefaultIptTraceSize;
uint64_t process_buffer_size_limit = kDefaultProcessBufferSizeLimit;		uint64_t process_buffer_size_limit = kDefaultProcessBufferSizeLimit;
bool enable_tsc = kDefaultEnableTscValue;		bool enable_tsc = kDefaultEnableTscValue;
Optional<uint64_t> psb_period = kDefaultPsbPeriod;		Optional<uint64_t> psb_period = kDefaultPsbPeriod;
bool per_cpu_tracing = kDefaultPerCpuTracing;		bool per_cpu_tracing = kDefaultPerCpuTracing;
		bool disable_cgroup_filtering = kDefaultDisableCgroupFiltering;

if (configuration) {		if (configuration) {
if (StructuredData::Dictionary *dict = configuration->GetAsDictionary()) {		if (StructuredData::Dictionary *dict = configuration->GetAsDictionary()) {
dict->GetValueForKeyAsInteger("iptTraceSize", ipt_trace_size);		dict->GetValueForKeyAsInteger("iptTraceSize", ipt_trace_size);
dict->GetValueForKeyAsInteger("processBufferSizeLimit",		dict->GetValueForKeyAsInteger("processBufferSizeLimit",
process_buffer_size_limit);		process_buffer_size_limit);
dict->GetValueForKeyAsBoolean("enableTsc", enable_tsc);		dict->GetValueForKeyAsBoolean("enableTsc", enable_tsc);
dict->GetValueForKeyAsInteger("psbPeriod", psb_period);		dict->GetValueForKeyAsInteger("psbPeriod", psb_period);
dict->GetValueForKeyAsBoolean("perCpuTracing", per_cpu_tracing);		dict->GetValueForKeyAsBoolean("perCpuTracing", per_cpu_tracing);
		dict->GetValueForKeyAsBoolean("disableCgroupFiltering",
		disable_cgroup_filtering);
} else {		} else {
return createStringError(inconvertibleErrorCode(),		return createStringError(inconvertibleErrorCode(),
"configuration object is not a dictionary");		"configuration object is not a dictionary");
}		}
}		}

return Start(ipt_trace_size, process_buffer_size_limit, enable_tsc,		return Start(ipt_trace_size, process_buffer_size_limit, enable_tsc,
psb_period, per_cpu_tracing);		psb_period, per_cpu_tracing, disable_cgroup_filtering);
}		}

llvm::Error TraceIntelPT::Start(llvm::ArrayRef<lldb::tid_t> tids,		llvm::Error TraceIntelPT::Start(llvm::ArrayRef<lldb::tid_t> tids,
uint64_t ipt_trace_size, bool enable_tsc,		uint64_t ipt_trace_size, bool enable_tsc,
Optional<uint64_t> psb_period) {		Optional<uint64_t> psb_period) {
TraceIntelPTStartRequest request;		TraceIntelPTStartRequest request;
request.ipt_trace_size = ipt_trace_size;		request.ipt_trace_size = ipt_trace_size;
request.enable_tsc = enable_tsc;		request.enable_tsc = enable_tsc;
Show All 34 Lines

lldb/source/Plugins/Trace/intel-pt/TraceIntelPTConstants.h

	Show All 15 Lines
	namespace lldb_private {			namespace lldb_private {
	namespace trace_intel_pt {			namespace trace_intel_pt {

	const size_t kDefaultIptTraceSize = 4 * 1024; // 4KB			const size_t kDefaultIptTraceSize = 4 * 1024; // 4KB
	const size_t kDefaultProcessBufferSizeLimit = 5 * 1024 * 1024; // 5MB			const size_t kDefaultProcessBufferSizeLimit = 5 * 1024 * 1024; // 5MB
	const bool kDefaultEnableTscValue = false;			const bool kDefaultEnableTscValue = false;
	const llvm::Optional<size_t> kDefaultPsbPeriod = llvm::None;			const llvm::Optional<size_t> kDefaultPsbPeriod = llvm::None;
	const bool kDefaultPerCpuTracing = false;			const bool kDefaultPerCpuTracing = false;
				const bool kDefaultDisableCgroupFiltering = false;

	} // namespace trace_intel_pt			} // namespace trace_intel_pt
	} // namespace lldb_private			} // namespace lldb_private

	#endif // LLDB_SOURCE_PLUGINS_TRACE_INTEL_PT_CONSTANTS_H			#endif // LLDB_SOURCE_PLUGINS_TRACE_INTEL_PT_CONSTANTS_H

lldb/source/Plugins/Trace/intel-pt/TraceIntelPTMultiCpuDecoder.h

Show First 20 Lines • Show All 44 Lines • ▼ Show 20 Lines	public:
/// whether there's tracing data associated to it or not.		/// whether there's tracing data associated to it or not.
bool TracesThread(lldb::tid_t tid) const;		bool TracesThread(lldb::tid_t tid) const;

/// \return		/// \return
/// The number of continuous executions found for the given \p tid.		/// The number of continuous executions found for the given \p tid.
size_t GetNumContinuousExecutionsForThread(lldb::tid_t tid) const;		size_t GetNumContinuousExecutionsForThread(lldb::tid_t tid) const;

/// \return		/// \return
		/// The number of PSB blocks for a given thread in all cores.
		size_t GePSBBlocksCountForThread(lldb::tid_t tid) const;

		/// \return
/// The total number of continuous executions found across CPUs.		/// The total number of continuous executions found across CPUs.
size_t GetTotalContinuousExecutionsCount() const;		size_t GetTotalContinuousExecutionsCount() const;

		/// \return
		/// The number of psb blocks in all cores that couldn't be matched with a
		/// thread execution coming from context switch traces.
		size_t GetUnattributedPSBBlocksCount() const;

		/// \return
		/// The total number of PSB blocks in all cores.
		size_t GetTotalPSBBlocksCount() const;

private:		private:
/// Traverse the context switch traces and the basic intel pt continuous		/// Traverse the context switch traces and the basic intel pt continuous
/// subtraces and produce a list of continuous executions for each process and		/// subtraces and produce a list of continuous executions for each process and
/// thread.		/// thread.
///		///
/// See \a DoCorrelateContextSwitchesAndIntelPtTraces.		/// See \a DoCorrelateContextSwitchesAndIntelPtTraces.
///		///
/// Any errors are stored in \a m_setup_error.		/// Any errors are stored in \a m_setup_error.
Show All 11 Lines	private:
std::set<lldb::tid_t> m_tids;		std::set<lldb::tid_t> m_tids;
llvm::Optional<		llvm::Optional<
llvm::DenseMap<lldb::tid_t, std::vector<IntelPTThreadContinousExecution>>>		llvm::DenseMap<lldb::tid_t, std::vector<IntelPTThreadContinousExecution>>>
m_continuous_executions_per_thread;		m_continuous_executions_per_thread;
llvm::DenseMap<lldb::tid_t, DecodedThreadSP> m_decoded_threads;		llvm::DenseMap<lldb::tid_t, DecodedThreadSP> m_decoded_threads;
/// This variable will be non-None if a severe error happened during the setup		/// This variable will be non-None if a severe error happened during the setup
/// of the decoder and we don't want decoding to be reattempted.		/// of the decoder and we don't want decoding to be reattempted.
llvm::Optional<std::string> m_setup_error;		llvm::Optional<std::string> m_setup_error;
uint64_t m_unattributed_intelpt_subtraces;		uint64_t m_unattributed_psb_blocks = 0;
		uint64_t m_total_psb_blocks = 0;
};		};

} // namespace trace_intel_pt		} // namespace trace_intel_pt
} // namespace lldb_private		} // namespace lldb_private

#endif // LLDB_SOURCE_PLUGINS_TRACE_INTEL_PT_TRACEINTELPTMULTICPUDECODER_H		#endif // LLDB_SOURCE_PLUGINS_TRACE_INTEL_PT_TRACEINTELPTMULTICPUDECODER_H

lldb/source/Plugins/Trace/intel-pt/TraceIntelPTMultiCpuDecoder.cpp

Show First 20 Lines • Show All 99 Lines • ▼ Show 20 Lines	TraceIntelPTMultiCpuDecoder::DoCorrelateContextSwitchesAndIntelPtTraces() {
LinuxPerfZeroTscConversion tsc_conversion = *conv_opt;		LinuxPerfZeroTscConversion tsc_conversion = *conv_opt;

for (cpu_id_t cpu_id : trace_sp->GetTracedCpus()) {		for (cpu_id_t cpu_id : trace_sp->GetTracedCpus()) {
Expected<std::vector<IntelPTThreadSubtrace>> intel_pt_subtraces =		Expected<std::vector<IntelPTThreadSubtrace>> intel_pt_subtraces =
GetIntelPTSubtracesForCpu(*trace_sp, cpu_id);		GetIntelPTSubtracesForCpu(*trace_sp, cpu_id);
if (!intel_pt_subtraces)		if (!intel_pt_subtraces)
return intel_pt_subtraces.takeError();		return intel_pt_subtraces.takeError();

		m_total_psb_blocks += intel_pt_subtraces->size();
// We'll be iterating through the thread continuous executions and the intel		// We'll be iterating through the thread continuous executions and the intel
// pt subtraces sorted by time.		// pt subtraces sorted by time.
auto it = intel_pt_subtraces->begin();		auto it = intel_pt_subtraces->begin();
auto on_new_thread_execution =		auto on_new_thread_execution =
[&](const ThreadContinuousExecution &thread_execution) {		[&](const ThreadContinuousExecution &thread_execution) {
IntelPTThreadContinousExecution execution(thread_execution);		IntelPTThreadContinousExecution execution(thread_execution);

for (; it != intel_pt_subtraces->end() &&		for (; it != intel_pt_subtraces->end() &&
it->tsc < thread_execution.GetEndTSC();		it->tsc < thread_execution.GetEndTSC();
it++) {		it++) {
if (it->tsc > thread_execution.GetStartTSC()) {		if (it->tsc > thread_execution.GetStartTSC()) {
execution.intelpt_subtraces.push_back(*it);		execution.intelpt_subtraces.push_back(*it);
} else {		} else {
m_unattributed_intelpt_subtraces++;		m_unattributed_psb_blocks++;
		jj10306: Help me understand this. To me this seems like it is counting all of the PSBs that don't belong to the current thread, whereas I would expect this to only count the PSBs that don't belong to any thread? So based on my understanding we would be severely overcounting the number of unattributed PSB, but I think I'm just misunderstanding how this code flows.
		wallace: the current code is correct, I'll explain:
		- on_new_thread_execution is executed on all threads of the same CPU in chronological order
		- as on_new_thread_execution is being invoked repeatedly, the `it` iterator is being traversed and is always moving forwards
		- when on_new_thread_execution is invoked, the `it` iterator will look for psb blocks that happened before the given execution. These blocks do not belong to any thread execution.
		Graphically, we have
		----exec 1---- ---exec 2---- ---exec 3----
		PSB1 PSB2 PSB3 PSB4 PSB5 PSB6
		when on_new_thread_execution is invoked for exec2, `it` will be pointing at PSB3, which is the first PSB after exec 1. PSB3 comes before exec 2, so it'll be unattributed, then it will move to PSB4 and so on
		jj10306: makes sense, thanks for the explanation!
}		}
}		}
continuous_executions_per_thread[thread_execution.tid].push_back(		continuous_executions_per_thread[thread_execution.tid].push_back(
execution);		execution);
};		};
Error err = trace_sp->OnCpuBinaryDataRead(		Error err = trace_sp->OnCpuBinaryDataRead(
cpu_id, IntelPTDataKinds::kPerfContextSwitchTrace,		cpu_id, IntelPTDataKinds::kPerfContextSwitchTrace,
[&](ArrayRef<uint8_t> data) -> Error {		[&](ArrayRef<uint8_t> data) -> Error {
Expected<std::vector<ThreadContinuousExecution>> executions =		Expected<std::vector<ThreadContinuousExecution>> executions =
DecodePerfContextSwitchTrace(data, cpu_id, tsc_conversion);		DecodePerfContextSwitchTrace(data, cpu_id, tsc_conversion);
if (!executions)		if (!executions)
return executions.takeError();		return executions.takeError();
for (const ThreadContinuousExecution &exec : *executions)		for (const ThreadContinuousExecution &exec : *executions)
on_new_thread_execution(exec);		on_new_thread_execution(exec);
return Error::success();		return Error::success();
});		});
if (err)		if (err)
return std::move(err);		return std::move(err);

		m_unattributed_psb_blocks += intel_pt_subtraces->end() - it;
}		}
// We now sort the executions of each thread to have them ready for		// We now sort the executions of each thread to have them ready for
// instruction decoding		// instruction decoding
for (auto &tid_executions : continuous_executions_per_thread)		for (auto &tid_executions : continuous_executions_per_thread)
std::sort(tid_executions.second.begin(), tid_executions.second.end());		std::sort(tid_executions.second.begin(), tid_executions.second.end());

return continuous_executions_per_thread;		return continuous_executions_per_thread;
}		}
Show All 34 Lines
size_t TraceIntelPTMultiCpuDecoder::GetTotalContinuousExecutionsCount() const {		size_t TraceIntelPTMultiCpuDecoder::GetTotalContinuousExecutionsCount() const {
if (!m_continuous_executions_per_thread)		if (!m_continuous_executions_per_thread)
return 0;		return 0;
size_t count = 0;		size_t count = 0;
for (const auto &kv : *m_continuous_executions_per_thread)		for (const auto &kv : *m_continuous_executions_per_thread)
count += kv.second.size();		count += kv.second.size();
return count;		return count;
}		}

		size_t
		TraceIntelPTMultiCpuDecoder::GePSBBlocksCountForThread(lldb::tid_t tid) const {
		if (!m_continuous_executions_per_thread)
		return 0;
		size_t count = 0;
		auto it = m_continuous_executions_per_thread->find(tid);
		for (const IntelPTThreadContinousExecution &execution : it->second)
		count += execution.intelpt_subtraces.size();
		return count;
		}

		size_t TraceIntelPTMultiCpuDecoder::GetUnattributedPSBBlocksCount() const {
		return m_unattributed_psb_blocks;
		}

		size_t TraceIntelPTMultiCpuDecoder::GetTotalPSBBlocksCount() const {
		return m_total_psb_blocks;
		}

lldb/source/Plugins/Trace/intel-pt/TraceIntelPTOptions.td

Show First 20 Lines • Show All 77 Lines • ▼ Show 20 Lines	Desc<"This value defines the period in which PSB packets will be "
"the PSB period will be defined by the processor. If supported, valid "		"the PSB period will be defined by the processor. If supported, valid "
"values for this period can be found in "		"values for this period can be found in "
"/sys/bus/event_source/devices/intel_pt/caps/psb_periods which "		"/sys/bus/event_source/devices/intel_pt/caps/psb_periods which "
"contains a hexadecimal number, whose bits represent valid values "		"contains a hexadecimal number, whose bits represent valid values "
"e.g. if bit 2 is set, then value 2 is valid. The psb_period value is "		"e.g. if bit 2 is set, then value 2 is valid. The psb_period value is "
"converted to the approximate number of raw trace bytes between PSB "		"converted to the approximate number of raw trace bytes between PSB "
"packets as: 2 ^ (value + 11), e.g. value 3 means 16KiB between PSB "		"packets as: 2 ^ (value + 11), e.g. value 3 means 16KiB between PSB "
"packets. Defaults to 0 if supported.">;		"packets. Defaults to 0 if supported.">;
		def process_trace_start_intel_pt_disable_cgroup_filtering:
		Option<"disable-cgroup-filtering", "d">,
		Desc<"Disable the automatic cgroup filtering that is applied if --per-cpu "
		"is provided. Cgroup filtering allows collecting intel pt data "
		"exclusively of processes of the same cgroup as the target.">;
}		}

let Command = "process trace save intel pt" in {		let Command = "process trace save intel pt" in {
def process_trace_save_intel_directory: Option<"directory", "d">,		def process_trace_save_intel_directory: Option<"directory", "d">,
Group<1>,		Group<1>,
Arg<"Value">, Required,		Arg<"Value">, Required,
Desc<"This value defines the directory where the trace will be saved."		Desc<"This value defines the directory where the trace will be saved."
"It will be created if it does not exist. It will also create a "		"It will be created if it does not exist. It will also create a "
"trace files with the trace data and a trace.json with the main "		"trace files with the trace data and a trace.json with the main "
"properties of the trace session.">;		"properties of the trace session.">;
}		}

lldb/source/Utility/TraceIntelPTGDBRemotePackets.cpp

Show First 20 Lines • Show All 47 Lines • ▼ Show 20 Lines	bool fromJSON(const json::Value &value, TraceIntelPTStartRequest &packet,
if (!(o && fromJSON(value, (TraceStartRequest &)packet, path) &&		if (!(o && fromJSON(value, (TraceStartRequest &)packet, path) &&
o.map("enableTsc", packet.enable_tsc) &&		o.map("enableTsc", packet.enable_tsc) &&
o.map("psbPeriod", packet.psb_period) &&		o.map("psbPeriod", packet.psb_period) &&
o.map("iptTraceSize", packet.ipt_trace_size)))		o.map("iptTraceSize", packet.ipt_trace_size)))
return false;		return false;

if (packet.IsProcessTracing()) {		if (packet.IsProcessTracing()) {
if (!o.map("processBufferSizeLimit", packet.process_buffer_size_limit) ||		if (!o.map("processBufferSizeLimit", packet.process_buffer_size_limit) ||
!o.map("perCpuTracing", packet.per_cpu_tracing))		!o.map("perCpuTracing", packet.per_cpu_tracing) ||
		!o.map("disableCgroupTracing", packet.disable_cgroup_filtering))
return false;		return false;
}		}
return true;		return true;
}		}

json::Value toJSON(const TraceIntelPTStartRequest &packet) {		json::Value toJSON(const TraceIntelPTStartRequest &packet) {
json::Value base = toJSON((const TraceStartRequest &)packet);		json::Value base = toJSON((const TraceStartRequest &)packet);
json::Object &obj = *base.getAsObject();		json::Object &obj = *base.getAsObject();
obj.try_emplace("iptTraceSize", packet.ipt_trace_size);		obj.try_emplace("iptTraceSize", packet.ipt_trace_size);
obj.try_emplace("processBufferSizeLimit", packet.process_buffer_size_limit);		obj.try_emplace("processBufferSizeLimit", packet.process_buffer_size_limit);
obj.try_emplace("psbPeriod", packet.psb_period);		obj.try_emplace("psbPeriod", packet.psb_period);
obj.try_emplace("enableTsc", packet.enable_tsc);		obj.try_emplace("enableTsc", packet.enable_tsc);
obj.try_emplace("perCpuTracing", packet.per_cpu_tracing);		obj.try_emplace("perCpuTracing", packet.per_cpu_tracing);
		obj.try_emplace("disableCgroupTracing", packet.disable_cgroup_filtering);
return base;		return base;
}		}

uint64_t LinuxPerfZeroTscConversion::ToNanos(uint64_t tsc) const {		uint64_t LinuxPerfZeroTscConversion::ToNanos(uint64_t tsc) const {
uint64_t quot = tsc >> time_shift;		uint64_t quot = tsc >> time_shift;
uint64_t rem_flag = (((uint64_t)1 << time_shift) - 1);		uint64_t rem_flag = (((uint64_t)1 << time_shift) - 1);
uint64_t rem = tsc & rem_flag;		uint64_t rem = tsc & rem_flag;
return time_zero.value + quot * time_mult + ((rem * time_mult) >> time_shift);		return time_zero.value + quot * time_mult + ((rem * time_mult) >> time_shift);
Show All 25 Lines	bool fromJSON(const json::Value &value, LinuxPerfZeroTscConversion &packet,
packet.time_shift = time_shift;		packet.time_shift = time_shift;
return true;		return true;
}		}

bool fromJSON(const json::Value &value, TraceIntelPTGetStateResponse &packet,		bool fromJSON(const json::Value &value, TraceIntelPTGetStateResponse &packet,
json::Path path) {		json::Path path) {
ObjectMapper o(value, path);		ObjectMapper o(value, path);
return o && fromJSON(value, (TraceGetStateResponse &)packet, path) &&		return o && fromJSON(value, (TraceGetStateResponse &)packet, path) &&
o.map("tscPerfZeroConversion", packet.tsc_perf_zero_conversion);		o.map("tscPerfZeroConversion", packet.tsc_perf_zero_conversion) &&
		o.map("usingCgroupFiltering", packet.using_cgroup_filtering);
}		}

json::Value toJSON(const TraceIntelPTGetStateResponse &packet) {		json::Value toJSON(const TraceIntelPTGetStateResponse &packet) {
json::Value base = toJSON((const TraceGetStateResponse &)packet);		json::Value base = toJSON((const TraceGetStateResponse &)packet);
base.getAsObject()->insert(		json::Object &obj = *base.getAsObject();
{"tscPerfZeroConversion", packet.tsc_perf_zero_conversion});		obj.insert({"tscPerfZeroConversion", packet.tsc_perf_zero_conversion});
		obj.insert({"usingCgroupFiltering", packet.using_cgroup_filtering});
return base;		return base;
}		}

} // namespace lldb_private		} // namespace lldb_private