This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
lld/
-
ELF/
-
Driver.cpp
-
Options.td
-
tools/lld/
-
lld/
-
lld.cpp

Differential D102304

[WIP][lld] Implement crash reproducer
Needs ReviewPublic

Authored by haowei on May 11 2021, 8:47 PM.

Download Raw Diff

Details

Reviewers

phosek
mcgrathr
MaskRay

Summary

This is still work in progress to test my ideas. Not ready for review yet.

In this patch, I tried to use CrashRecoveryContext to re-run lldMain with '--reproduce' when the first run crashes. Flag 'reproduce-on-crash' is added to optionally turn on this feature. Flag '--debug-crash' to allow lld crash on purpose to test crash reproducer.

The reason I choose this approach is that it has a few advantages:

It is quite simple.
It works with every lld driver which already support '--reproduce'.

However, during testing, I saw a few issues:

If the crash happens before the driver initialize the TarWriter for reproducer, this implementation will not catch it.
lldMain is not designed to run twice internally. It will write unexpected errors to stderr. (e.g. I saw 'lld/ELF/InputSection.cpp:1402: void lld::elf::MergeInputSection::splitIntoPieces(): Assertion `pieces.empty()' failed.' error in tests)

Therefore, I am wondering if I should consider another approach: reimplement the reproducer each driver (start with ELF driver) to make it independent from the lld driver. In this case the reproducer can be directly invoked from CrashRecoveryContext in the lld/tools/lld/lld.cpp instead of being invoked from the rerun lld driver.

Discussion thread: https://lists.llvm.org/pipermail/llvm-dev/2021-April/149853.html

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

haowei created this revision.May 11 2021, 8:47 PM

Herald added a reviewer: MaskRay. · View Herald TranscriptMay 11 2021, 8:47 PM

Herald added subscribers: dang, arichardson, emaste. · View Herald Transcript

haowei requested review of this revision.May 11 2021, 8:47 PM

Herald added a project: Restricted Project. · View Herald TranscriptMay 11 2021, 8:47 PM

Herald added a subscriber: llvm-commits. · View Herald Transcript

Harbormaster completed remote builds in B103929: Diff 344639.May 11 2021, 9:02 PM

I don't understand the purpose. Didn't we agree that implementing this on a different process is a correct approach?

(it'd be good to link the description to the discussion thread on the issues, whatever way this goes)

haowei mentioned this in D102892: [WIP][lld] Implement crash reproducer for ELF.May 20 2021, 5:30 PM

haowei edited the summary of this revision. (Show Details)May 20 2021, 5:34 PM

In D102304#2762298, @MaskRay wrote:

I don't understand the purpose. Didn't we agree that implementing this on a different process is a correct approach?

Sorry for the confusion, I created this change so I can share the code with other people. It's not in a reviewable state and I mainly use it as a baseline for testing.

If I understood correctly, we agreed to implement a lld crash reproducer in a similar way like clang with "-fintegrated-cc1" as the first step, which relies on CrashRecoveryContext instead of relying on a second process. The crash reproducer can be further improved by implementing a reproducer that works on a different process if the first step is not good enough.

D102304 added a createCrashReproduceTar to the lld elf driver, which generates a reproducer tarball without re-invoke the lldMain and will be used by CrashRecoveryContext in case lld recovered from a crash. The principle is to record the file names used by lld and use them to generate the reproducer in case lld crashes. I tried the other approach which makes the reproducer to determine the file needs to be included but it turns out very easy to miss some files and hard to maintain. So I just record the filenames, which also seems to be easier to be extended to other drivers. Please let me know your opinions on this.

Revision Contents

Path

Size

lld/

ELF/

Driver.cpp

4 lines

Options.td

6 lines

tools/

lld/

lld.cpp

36 lines

Diff 344639

lld/ELF/Driver.cpp

Show First 20 Lines • Show All 576 Lines • ▼ Show 20 Lines	if (auto E = timeTraceProfilerWrite(args.getLastArgValue(OPT_time_trace_file_eq).str(),
handleAllErrors(std::move(E), [&](const StringError &SE) {		handleAllErrors(std::move(E), [&](const StringError &SE) {
error(SE.getMessage());		error(SE.getMessage());
});		});
return;		return;
}		}

timeTraceProfilerCleanup();		timeTraceProfilerCleanup();
}		}

		if (args.getLastArg(OPT_debug_crash)) {
		LLVM_BUILTIN_TRAP;
		}
}		}

static std::string getRpath(opt::InputArgList &args) {		static std::string getRpath(opt::InputArgList &args) {
std::vector<StringRef> v = args::getStrings(args, OPT_rpath);		std::vector<StringRef> v = args::getStrings(args, OPT_rpath);
return llvm::join(v.begin(), v.end(), ":");		return llvm::join(v.begin(), v.end(), ":");
}		}

// Determines what we should do if there are remaining unresolved		// Determines what we should do if there are remaining unresolved
▲ Show 20 Lines • Show All 1,822 Lines • Show Last 20 Lines

lld/ELF/Options.td

	Show First 20 Lines • Show All 345 Lines • ▼ Show 20 Lines

	def print_map: F<"print-map">,			def print_map: F<"print-map">,
	HelpText<"Print a link map to the standard output">;			HelpText<"Print a link map to the standard output">;

	defm reproduce:			defm reproduce:
	Eq<"reproduce",			Eq<"reproduce",
	"Write tar file containing inputs and command to reproduce link">;			"Write tar file containing inputs and command to reproduce link">;

				def reproduce_on_crash: F<"reproduce-on-crash">,
				HelpText<"Generate tar file containing inputs and command when program crashes">;

				def debug_crash: F<"debug-crash">,
				HelpText<"Debug flag to crash lld on purpose">;

	defm rosegment: BB<"rosegment",			defm rosegment: BB<"rosegment",
	"Put read-only non-executable sections in their own segment (default)",			"Put read-only non-executable sections in their own segment (default)",
	"Do not put read-only non-executable sections in their own segment">;			"Do not put read-only non-executable sections in their own segment">;

	defm rpath: Eq<"rpath", "Add a DT_RUNPATH to the output">;			defm rpath: Eq<"rpath", "Add a DT_RUNPATH to the output">;

	def relocatable: F<"relocatable">, HelpText<"Create relocatable object file">;			def relocatable: F<"relocatable">, HelpText<"Create relocatable object file">;

	▲ Show 20 Lines • Show All 351 Lines • Show Last 20 Lines

lld/tools/lld/lld.cpp

Show First 20 Lines • Show All 167 Lines • ▼ Show 20 Lines	SafeReturn lld::safeLldMain(int argc, const char **argv,
int r = 0;		int r = 0;
{		{
// The crash recovery is here only to be able to recover from arbitrary		// The crash recovery is here only to be able to recover from arbitrary
// control flow when fatal() is called (through setjmp/longjmp or		// control flow when fatal() is called (through setjmp/longjmp or
// __try/__except).		// __try/__except).
llvm::CrashRecoveryContext crc;		llvm::CrashRecoveryContext crc;
if (!crc.RunSafely([&]() {		if (!crc.RunSafely([&]() {
r = lldMain(argc, argv, stdoutOS, stderrOS, /exitEarly=/false);		r = lldMain(argc, argv, stdoutOS, stderrOS, /exitEarly=/false);
}))		})) {
return {crc.RetCode, /canRunAgain=/false};		// lld crashed, save return code.
		SafeReturn ret{crc.RetCode, /canRunAgain=/false};
		// Generate reproducer if related flag is specified.
		std::vector<const char *> args(argv, argv + argc);
		bool hasReproducer = false;
		bool hasReproduceOnCrash = false;
		for (const char *arg : args) {
		std::string arg_str(arg);
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for variable 'arg_str' [readability-identifier-naming] not useful Lint: Pre-merge checks: clang-tidy: warning: invalid case style for variable 'arg_str' [readability-identifier-naming]…
		if (arg_str == "--reproduce-on-crash")
		hasReproduceOnCrash = true;
		if (StringRef(arg_str).startswith("--reproduce="))
		hasReproducer = true;
		}
		if (hasReproduceOnCrash && !hasReproducer) {
		// Creat temp file for reproducer.
		SmallString<128> reproducerPath;
		std::error_code EC =
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for variable 'EC' [readability-identifier-naming] not useful Lint: Pre-merge checks: clang-tidy: warning: invalid case style for variable 'EC' [readability-identifier-naming]…
		llvm::sys::fs::createTemporaryFile("lld", "tar", reproducerPath);
		Lint: Pre-merge checks Inline Actions clang-tidy: error: no member named 'createTemporaryFile' in namespace 'llvm::sys::fs' [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: no member named 'createTemporaryFile' in namespace 'llvm::sys::fs' [clang…
		if (EC) {
		stderrOS << "failed to create temporary file for reproducer\n";
		return ret;
		}
		stdoutOS << "reproducer writes to \"" << reproducerPath << "\"\n";
		std::string reproduceArg = "--reproduce=" + std::string(reproducerPath);
		args.push_back(reproduceArg.c_str());
		llvm::CrashRecoveryContext crcReproducer;
		crcReproducer.RunSafely([&]() {
		lldMain(args.size(), args.data(), stdoutOS, stderrOS,
		/exitEarly=/false);
		});
		}
		return ret;
		}
}		}

// Cleanup memory and reset everything back in pristine condition. This path		// Cleanup memory and reset everything back in pristine condition. This path
// is only taken when LLD is in test, or when it is used as a library.		// is only taken when LLD is in test, or when it is used as a library.
llvm::CrashRecoveryContext crc;		llvm::CrashRecoveryContext crc;
if (!crc.RunSafely([&]() { errorHandler().reset(); })) {		if (!crc.RunSafely([&]() { errorHandler().reset(); })) {
// The memory is corrupted beyond any possible recovery.		// The memory is corrupted beyond any possible recovery.
return {r, /canRunAgain=/false};		return {r, /canRunAgain=/false};
▲ Show 20 Lines • Show All 44 Lines • Show Last 20 Lines