This is an archive of the discontinued LLVM Phabricator instance.

Rate limit progress reporting
AbandonedPublic

Authored by saugustine on May 17 2023, 11:12 AM.

Details

Summary

Reporting progress for every DIE read turns out to be very slow when
run over a remote connection such as ssh. We have a report of it
taking over 30 minutes to load the DWARF for Chrome via ssh (which
transfers every single write), but only about a minute over
Chrome Remote Desktop, which is a video-conferencing-style link and
so doesn't update nearly as often.

For a 7k-DIE target, this improves the speed of reading on my personal
machine (entirely local) by about 3%; data below. Over slower, remote
connections the improvement is likely much greater.

Top of trunk:
(lldb) target create "crash_test"
Current executable set to 'crash_test' (aarch64).
(lldb) log timers dump
12.509606661 sec (total: 12.510s; child: 0.000s; count: 7826) for void DWARFUnit::ExtractDIEsRWLocked()
...

With this change:
(lldb) target create "crash_test"
Current executable set to 'crash_test' (aarch64).
(lldb) log timers dump
12.139054862 sec (total: 12.139s; child: 0.000s; count: 7826) for void DWARFUnit::ExtractDIEsRWLocked()

Diff Detail

Event Timeline

saugustine created this revision. · May 17 2023, 11:12 AM
Herald added a project: Restricted Project. · View Herald Transcript · May 17 2023, 11:12 AM
saugustine requested review of this revision. · May 17 2023, 11:12 AM

This is a nice proof of concept, but I think we should go with a time-based approach to rate limit this. (Anyone else in LLDB know if we have some utils around to help with this?)

e.g. if the first 1000 files are small and the last 1000 are large, the user will see this quickly get to 1000/2000 and then appear to hang, until it magically/surprisingly finishes.

With a time-based rate limit, you can watch it slowly chug along: 1001/2000, 1002/2000, etc. If there is a hanging problem, you'll deterministically see it hang on the same file number.
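
For illustration, here is a minimal sketch of that time-based approach. DoUnitOfWork, ReportProgress, and the 100 ms interval are placeholders for this discussion, not existing LLDB API:

#include <chrono>
#include <cstddef>

void DoUnitOfWork(size_t index);                     // hypothetical work item
void ReportProgress(size_t completed, size_t total); // hypothetical reporter

void IndexAll(size_t total) {
  using clock = std::chrono::steady_clock;
  constexpr auto kInterval = std::chrono::milliseconds(100); // assumed value
  auto last_report = clock::now() - kInterval; // so the first item reports too
  for (size_t i = 0; i < total; ++i) {
    DoUnitOfWork(i);
    const auto now = clock::now();
    if (now - last_report >= kInterval || i + 1 == total) {
      ReportProgress(i + 1, total); // the final update is never suppressed
      last_report = now;
    }
  }
}

Note that the final update is reported unconditionally, so the bar always ends at total/total regardless of the interval.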

Given that here we know the total_progress in advance, and assuming the operation is relatively evenly distributed, would it make sense to report on every percentage point? We could do this in ManualDWARFIndex::Index, or we could add something like a "Policy" or "Granularity" to the progress class (similar to what report_increment is doing in this class). This seems preferable to the rather arbitrary value of reporting every 1000.
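
A similar sketch for the percentage-granularity idea, reusing the same hypothetical helpers as above; only the reporting condition changes:

#include <cstddef>
#include <cstdint>

void DoUnitOfWork(size_t index);                     // hypothetical work item
void ReportProgress(size_t completed, size_t total); // hypothetical reporter

void IndexAll(size_t total) {
  uint64_t last_percent = UINT64_MAX;
  for (size_t i = 0; i < total; ++i) {
    DoUnitOfWork(i);
    const uint64_t percent = uint64_t(i + 1) * 100 / total;
    if (percent != last_percent) { // at most ~100 reports per operation
      ReportProgress(i + 1, total);
      last_percent = percent;
    }
  }
}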

I also like Jordan's rate limiting idea. In my mind that should be a property of the broadcaster. Different tools (e.g. vscode vs the command line) could specify different values when registering a listener.

Switch rate-limiting to a time-based mechanism

This update switches to a time-based approach as suggested by Jordan. However, the timing is about the same as the original, I believe because calling getCurrentTime on every iteration is about as slow as printing the progress report itself.

It is probably still a win over very slow connections, where printing is even slower while the cost of checking the time stays the same.

What would be ideal is a timing thread that wakes up every X seconds and prints the results, but there isn't a good mechanism for that, and doing that portably is way out of scope for this.
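
For what it's worth, a sketch of what such a timing thread could look like, assuming workers only bump an atomic counter; every name here is illustrative rather than existing LLDB code:

#include <atomic>
#include <chrono>
#include <condition_variable>
#include <cstdint>
#include <cstdio>
#include <mutex>
#include <thread>

class PeriodicProgressPrinter {
public:
  PeriodicProgressPrinter(uint64_t total, std::chrono::milliseconds period)
      : m_total(total), m_period(period), m_thread([this] { Run(); }) {}

  ~PeriodicProgressPrinter() {
    {
      std::lock_guard<std::mutex> guard(m_mutex);
      m_done = true;
    }
    m_cv.notify_one();
    m_thread.join();
    Print(); // always show the final count
  }

  // Called from the worker thread(s); cheap, no locking or printing.
  void Increment() { m_completed.fetch_add(1, std::memory_order_relaxed); }

private:
  void Run() {
    std::unique_lock<std::mutex> lock(m_mutex);
    // Wake every period and print; exit once the destructor sets m_done.
    while (!m_cv.wait_for(lock, m_period, [this] { return m_done; }))
      Print();
  }

  void Print() const {
    // '\r' redraws the same terminal line on each update.
    std::fprintf(stderr, "\r%llu/%llu",
                 (unsigned long long)m_completed.load(std::memory_order_relaxed),
                 (unsigned long long)m_total);
  }

  std::atomic<uint64_t> m_completed{0};
  const uint64_t m_total;
  const std::chrono::milliseconds m_period;
  std::mutex m_mutex;
  std::condition_variable m_cv;
  bool m_done = false;
  std::thread m_thread; // declared last so it starts after everything else
};

A worker would call Increment() once per unit of work, so the hot path stays a single relaxed atomic add; only the timer thread ever touches the output stream.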

Shall we just switch to a percentage, printing an update every percent?

That has the issues Jordan described, where things appear to progress quickly, and then may grind to a halt due to some big DIE.

But I think the perfect shouldn't be the enemy of the good here.

> This update switches to a time-based approach as suggested by Jordan. However, the timing is about the same as the original, I believe because calling getCurrentTime on every iteration is about as slow as printing the progress report itself.
>
> It is probably still a win over very slow connections, where printing is even slower while the cost of checking the time stays the same.
>
> What would be ideal is a timing thread that wakes up every X seconds and prints the results, but there isn't a good mechanism for that, and doing that portably is way out of scope for this.

As I said in my previous comment, if we want to do anything timeout-based it should be done in the broadcaster/listener machinery so that other types of events can benefit from it too. I think the issue you're describing can be solved at that level, for example by blocking for the desired time and flushing all but the last event received until that point.

> Shall we just switch to a percentage, printing an update every percent?
>
> That has the issues Jordan described, where things appear to progress quickly, and then may grind to a halt due to some big DIE.
>
> But I think the perfect shouldn't be the enemy of the good here.

Yes, that's definitely a shortcoming of this approach. What makes this somewhat less bad in my mind is that the inaccuracy is bounded by the granularity: e.g. if you report every percentage point, the error can never be more than a percentage point. It would be nice if the consumer could set this property though, but I don't think there's a straightforward way to do that.

Adding support for a generic, user-specified, rate limit to the listeners would be my preferred solution, but I don't know how much work that would be.

Moved the rate-limiting to Debugger.[cpp|h]

Also wrote a custom getCurrentTime function, which doesn't do much of
the extra work the Timer.h version does.

With this change, the timing is much better:

On my local machine, for a 93k DIE application, I get the following
timings:

1 second rate limit
(lldb) log timers dump
580.971832328 sec (total: 580.972s; child: 0.000s; count: 93007) for void DWARFUnit::ExtractDIEsRWLocked()

0 second rate limit, but with this change in place
663.114765369 sec (total: 663.115s; child: 0.000s; count: 93007) for void DWARFUnit::ExtractDIEsRWLocked()

Without this change in place
651.826884735 sec (total: 651.827s; child: 0.000s; count: 93007) for void DWARFUnit::ExtractDIEsRWLocked()

saugustine retitled this revision from "Proof of concept for reducing progress-reporting frequency" to "Rate limit progress reporting". · May 19 2023, 3:41 PM

Any more thoughts on this from the reviewers?

My suggestion was to add rate limiting support to the listener so that all events could benefit from this. The current patch seems unnecessarily ad hoc: it's limited to (1) progress events and (2) the default event handler. I expect that lldb-vscode users suffer from this as well, for example.

saugustine abandoned this revision. · May 26 2023, 9:33 AM

I'll let someone with a better understanding of the proper implementation take it from here.

> I also like Jordan's rate limiting idea. In my mind that should be a property of the broadcaster. Different tools (e.g. vscode vs the command line) could specify different values when registering a listener.

This makes sense: we could augment lldb_private::Listener with additional members that keep track of when the last broadcast time was, and if we're rate limiting. Then we could change the implementation of Listener::GetEvent(lldb::EventSP &event_sp, const Timeout<std::micro> &timeout) to continuously churn through m_events, returning the most recent one by the time the rate limiting window is over, and discarding any intermediate ones in between.

One thing I'm not sure of though is how we'll avoid an unnecessary pause for rate limiting on the last item. This patch avoids that because it checks data->GetCompleted() != data->GetTotal() to decide if we should actually rate limit. In the generic case, how does the listener know that an event it returns is the final one, and that it should ignore the rate limiting delay?

I think we could address that by adding a bool m_important member to lldb::Event, and then it would be up to the broadcaster to set that to true for the last one (or any intermediate ones that are similarly important & should be immediately shown, e.g. warnings/errors). Would that be reasonable?
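
A standalone model of that idea might look like the following. This is not LLDB's actual Listener; the class, the queue, and the `important` field (standing in for the proposed m_important flag) are simplified stand-ins:

#include <chrono>
#include <condition_variable>
#include <deque>
#include <memory>
#include <mutex>

struct Event {
  bool important = false; // stands in for the proposed m_important flag
};
using EventSP = std::shared_ptr<Event>;

class RateLimitedListener {
public:
  explicit RateLimitedListener(std::chrono::milliseconds rate_limit)
      : m_rate_limit(rate_limit) {}

  void Post(EventSP event) {
    {
      std::lock_guard<std::mutex> guard(m_mutex);
      m_events.push_back(std::move(event));
    }
    m_cv.notify_one();
  }

  // Drain the queue for one rate-limit window and return only the newest
  // event; important events (final update, warnings) return immediately.
  EventSP GetEvent() {
    const auto deadline = std::chrono::steady_clock::now() + m_rate_limit;
    EventSP newest;
    std::unique_lock<std::mutex> lock(m_mutex);
    for (;;) {
      while (!m_events.empty()) {
        newest = std::move(m_events.front());
        m_events.pop_front();
        if (newest->important)
          return newest; // skip the remaining delay
      }
      if (m_cv.wait_until(lock, deadline) == std::cv_status::timeout)
        return newest; // may be null if nothing arrived in the window
    }
  }

private:
  const std::chrono::milliseconds m_rate_limit;
  std::mutex m_mutex;
  std::condition_variable m_cv;
  std::deque<EventSP> m_events;
};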

> I also like Jordan's rate limiting idea. In my mind that should be a property of the broadcaster. Different tools (e.g. vscode vs the command line) could specify different values when registering a listener.
>
> This makes sense: we could augment lldb_private::Listener with additional members that keep track of when the last broadcast time was, and if we're rate limiting. Then we could change the implementation of Listener::GetEvent(lldb::EventSP &event_sp, const Timeout<std::micro> &timeout) to continuously churn through m_events, returning the most recent one by the time the rate limiting window is over, and discarding any intermediate ones in between.
>
> One thing I'm not sure of though is how we'll avoid an unnecessary pause for rate limiting on the last item. This patch avoids that because it checks data->GetCompleted() != data->GetTotal() to decide if we should actually rate limit. In the generic case, how does the listener know that an event it returns is the final one, and that it should ignore the rate limiting delay?
>
> I think we could address that by adding a bool m_important member to lldb::Event, and then it would be up to the broadcaster to set that to true for the last one (or any intermediate ones that are similarly important & should be immediately shown, e.g. warnings/errors). Would that be reasonable?

The later the decision not to update is made, the more work is wasted. Even the fairly simple solution that checked time in a somewhat expensive way (via the misnamed getCurrentTime, which also gets memory usage) ended up being slower overall. In my opinion, the problem is that a single DIE is too small a unit of work to be worth reporting on.

The other tricky part I missed before is this bit in between pulling the event off the listener queue and deciding to show it:

auto *data = ProgressEventData::GetEventDataFromEvent(event_sp.get());
if (!data)
  return;

// Do some bookkeeping for the current event, regardless of whether we're
// going to show the progress.
const uint64_t id = data->GetID();
if (m_current_event_id) {
  ...
  if (id != *m_current_event_id)
    return;
  if (data->GetCompleted() == data->GetTotal())
    m_current_event_id.reset();
} else {
  m_current_event_id = id;
}

When we pull the event off the listener queue, we need to do post-processing ourselves to decide if it's even a progress event at all, and if it's meant for us. If we put the listener in charge of rate limiting and returning only the most recent event, it needs to know that the event it's returning is interesting. Otherwise the rate limiting might hide all the interesting events. On top of that, there are events that are *interesting* even if we don't want to show them.

Another option is to provide Listener::GetRateLimitedEvents (name TBD) instead. It (potentially) blocks for the rate limiting period and then returns a *list* of all the events that have happened within that time frame. Then we let the caller process it as it pleases and display only the most recent relevant one. It feels a little weird at first but might actually make sense compared to alternatives?
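
One possible shape for that API, again modeled standalone rather than against the real class; the blocking sleep stands in for whatever waiting mechanism an actual implementation would use, and the member names are assumptions:

#include <chrono>
#include <memory>
#include <mutex>
#include <thread>
#include <vector>

struct Event; // payload omitted in this sketch
using EventSP = std::shared_ptr<Event>;

class Listener {
public:
  // Block for the whole rate-limiting window, then hand back everything
  // that arrived; the caller filters and displays what it cares about.
  std::vector<EventSP> GetRateLimitedEvents(std::chrono::milliseconds window) {
    std::this_thread::sleep_for(window);
    std::lock_guard<std::mutex> guard(m_events_mutex);
    std::vector<EventSP> events(m_events.begin(), m_events.end());
    m_events.clear();
    return events;
  }

private:
  std::mutex m_events_mutex;     // assumed to mirror the existing queue's mutex
  std::vector<EventSP> m_events; // filled by the broadcaster side
};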

I think it would be nice to abstract the rate limiting somewhere, although I'm not sure anymore if having it directly in the broadcast/listen classes makes sense.

labath added a comment. · Jun 7 2023, 5:10 AM

I agree that we should have a rate limiting mechanism very close to the source, to avoid wasting work for events that aren't going to be used. This is particularly important for debug info parsing, where we have multiple threads working in parallel and taking a lock even just to check whether we should report something could be a choke point.

> What would be ideal is a timing thread that wakes up every X seconds and prints the results, but there isn't a good mechanism for that, and doing that portably is way out of scope for this.

I've implemented something like that in D152364. Let me know what you think. I like it because the act of reporting progress doesn't block the reporting thread in any way. (At least for update-string-free updates, that is, but I expect that we won't be sending update strings for extremely high-frequency events.) However, I'm not entirely sure whether it meets everyone's use cases, mainly because I don't know what those use cases are (e.g. this implementation can "lose" an update string if they are coming too fast -- is that acceptable?)

> In my opinion, the problem is that a single DIE is too small a unit of work to be worth reporting on.

I don't think this is correct. The unit of reporting is a single DWARF unit, which feels OK, assuming we don't do anything egregious for each update. What might have confused you is this code here, which tries to parse DIE trees for all units (and updates progress after each unit). However, in my test at least, the DWARF units had all their DIEs extracted by the time we got to this point, which meant that code was essentially doing nothing else except generating progress reports. I'd be tempted to just remove progress reporting from this step altogether, though if we go with something like the patch above (where a single update just increments an atomic var), then I guess keeping it in would not be such a problem either.