This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
compiler-rt/test/sanitizer_common/TestCases/
-
test/
-
sanitizer_common/
-
TestCases/
1/3
symbolize_stack.cpp

Differential D126102

[compiler-rt][test] Fix flake in symbolize_stack test
ClosedPublic

Authored by paulkirth on May 20 2022, 4:13 PM.

Download Raw Diff

Details

Reviewers

vitalybuka
phosek
thakis
leonardchan

Commits

rG7f5439945b1f: [compiler-rt][test] Fix flake in symbolize_stack test

Summary

Addresses tests flakes described in
https://github.com/llvm/llvm-project/issues/55460

The test being updated can fail in FileCheck to match when given long
enough stack traces. This can be problematic when file system paths
become long enough to cause the majority of the long function name to
become truncated. We found in our CI that the truncated output would
often fail to match, thereby causing the test to fail when it should not.

Here we change the test to match on sybolizer output that should be more
reliable than matching inside the long function name.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

paulkirth created this revision.May 20 2022, 4:13 PM

Herald added a project: Restricted Project. · View Herald TranscriptMay 20 2022, 4:13 PM

Herald added a subscriber: dberris. · View Herald Transcript

paulkirth requested review of this revision.May 20 2022, 4:13 PM

Herald added a project: Restricted Project. · View Herald TranscriptMay 20 2022, 4:13 PM

Herald added a subscriber: Restricted Project. · View Herald Transcript

paulkirth added reviewers: vitalybuka, phosek, thakis, leonardchan.May 20 2022, 4:20 PM

For context: https://discourse.llvm.org/t/post-mortem-on-test-flake-in-sanitizer-common/62657 describes how I got here.

I'm open to other suggestions, bu this flake is occuring quite frequently in Fuchsia's clang CI, and the only purpose of the test is to ensure it doesn't crash when function names are long.

I also considered using GTest, but that has its own issues, since this test links against all the various sanitizers, and I didn't see a clean way to do that.

Removing the FileCheck bit altogether is an option but may be a step too far.

Harbormaster completed remote builds in B165612: Diff 431090.May 20 2022, 4:35 PM

Just match on any frame in the symbolizer output.

Harbormaster completed remote builds in B165970: Diff 431545.May 23 2022, 6:08 PM

leonardchan added inline comments.May 23 2022, 7:40 PM

compiler-rt/test/sanitizer_common/TestCases/symbolize_stack.cpp
35	Will this just match on any digit we see in the output?

vitalybuka added inline comments.May 24 2022, 4:49 PM

compiler-rt/test/sanitizer_common/TestCases/symbolize_stack.cpp
35	From https://discourse.llvm.org/t/post-mortem-on-test-flake-in-sanitizer-common/62657 it looks like a bug which needs to be fixed

paulkirth added inline comments.May 24 2022, 6:21 PM

compiler-rt/test/sanitizer_common/TestCases/symbolize_stack.cpp
35	Will this just match on any digit we see in the output? it will match when the symbolizer prints a stack frame, which is `#<frame number>`, so it should match that.

@vitalybuka I'm hoping to just address the flaky test for now. There's some value in ensuring that long output doesn't crash symbolization w/in sanitizer runtimes, so I've opted to keep the test, and try to make it less prone to flake. For now, I'm happy just to remove the flake so we can update our Toolchain.

Is there something specific you think we should do to address the underlying problem?

In D126102#3536037, @paulkirth wrote:

@vitalybuka I'm hoping to just address the flaky test for now. There's some value in ensuring that long output doesn't crash symbolization w/in sanitizer runtimes, so I've opted to keep the test, and try to make it less prone to flake. For now, I'm happy just to remove the flake so we can update our Toolchain.

I would prefer you just disable the test for your platform.
We still want to see that the tool produce some meaningful output.

Is there something specific you think we should do to address the underlying problem?

No particular ideas yet. That's just my impression from your investigation.

In D126102#3536038, @vitalybuka wrote:

I would prefer you just disable the test for your platform.

The failures we see are for AArch64 and x86_64 Linux... so are you saying we should disable the test for Linux?

We still want to see that the tool produce some meaningful output.

I'm not convinced that output is meaningful at all. The correctness of the symbolizer's output is already tested elsewhere in a more reliable way. Can you explain why matching part of the long function name makes the test better? Especially given that we see it isn't reliably output?

In D126102#3536097, @paulkirth wrote:

In D126102#3536038, @vitalybuka wrote:

I would prefer you just disable the test for your platform.

The failures we see are for AArch64 and x86_64 Linux... so are you saying we should disable the test for Linux?

Yes, even "// UNSUPPORTED: *" with FIXME is fine

We still want to see that the tool produce some meaningful output.

I'm not convinced that output is meaningful at all. The correctness of the symbolizer's output is already tested elsewhere in a more reliable way. Can you explain why matching part of the long function name makes the test better? Especially given that we see it isn't reliably output?

We test the bad case where output is huge. E.g. another way to fail that is to output nothing without crash.

Restore exisiting checks, and mark the test unsupported on Linux

@vitalybuka what if we just changed the implementation to return whatever it had read so far?

so in https://github.com/llvm/llvm-project/blob/06fee478d217a9fbd2ba31f92bc595ed327635a5/compiler-rt/lib/sanitizer_common/sanitizer_symbolizer_libcdep.cpp#L535, we just break instead of setting the read_len = 0, and allow the filled buffer to be output?

Harbormaster completed remote builds in B166285: Diff 432012.May 25 2022, 9:19 AM

In D126102#3537519, @paulkirth wrote:

@vitalybuka what if we just changed the implementation to return whatever it had read so far?

so in https://github.com/llvm/llvm-project/blob/06fee478d217a9fbd2ba31f92bc595ed327635a5/compiler-rt/lib/sanitizer_common/sanitizer_symbolizer_libcdep.cpp#L535, we just break instead of setting the read_len = 0, and allow the filled buffer to be output?

It's internal API, we probably can just use InternalScopedString and append as needed

vitalybuka accepted this revision.May 25 2022, 10:51 AM

This revision is now accepted and ready to land.May 25 2022, 10:51 AM

Closed by commit rG7f5439945b1f: [compiler-rt][test] Fix flake in symbolize_stack test (authored by paulkirth). · Explain WhyMay 25 2022, 12:02 PM

This revision was automatically updated to reflect the committed changes.

paulkirth added a commit: rG7f5439945b1f: [compiler-rt][test] Fix flake in symbolize_stack test.

paulkirth mentioned this in D126580: [compiler-rt] Avoid truncating Symbolizer output.Jun 6 2022, 10:24 AM

Revision Contents

Path

Size

compiler-rt/

test/

sanitizer_common/

TestCases/

symbolize_stack.cpp

5 lines

Diff 432072

compiler-rt/test/sanitizer_common/TestCases/symbolize_stack.cpp

	// RUN: %clangxx -O0 %s -o %t && %run %t 2>&1 \| FileCheck %s			// RUN: %clangxx -O0 %s -o %t && %run %t 2>&1 \| FileCheck %s

	// Test that symbolizer does not crash on frame with large function name.			// Test that symbolizer does not crash on frame with large function name.

	// On Darwin LSan reports a false positive			// On Darwin LSan reports a false positive
	// XFAIL: darwin && lsan			// XFAIL: darwin && lsan

				// FIXME: https://github.com/llvm/llvm-project/issues/55460
				// On Linux its possible for symbolizer output to be truncated and to match the
				// check below. Remove when the underlying problem has been addressed.
				// UNSUPPORTED: linux

	#include <sanitizer/common_interface_defs.h>			#include <sanitizer/common_interface_defs.h>
	#include <vector>			#include <vector>

	template <int N> struct A {			template <int N> struct A {
	template <class T> void RecursiveTemplateFunction(const T &t);			template <class T> void RecursiveTemplateFunction(const T &t);
	};			};

	template <int N>			template <int N>
	template <class T>			template <class T>
	__attribute__((noinline)) void A<N>::RecursiveTemplateFunction(const T &) {			__attribute__((noinline)) void A<N>::RecursiveTemplateFunction(const T &) {
	std::vector<T> t;			std::vector<T> t;
	return A<N - 1>().RecursiveTemplateFunction(t);			return A<N - 1>().RecursiveTemplateFunction(t);
	}			}

	template <>			template <>
	template <class T>			template <class T>
	__attribute__((noinline)) void A<0>::RecursiveTemplateFunction(const T &) {			__attribute__((noinline)) void A<0>::RecursiveTemplateFunction(const T &) {
	__sanitizer_print_stack_trace();			__sanitizer_print_stack_trace();
	}			}

	int main() {			int main() {
	// CHECK: {{vector<.vector<.vector<.vector<.vector<}}			// CHECK: {{vector<.vector<.vector<.vector<.vector<}}
	A<10>().RecursiveTemplateFunction(0);			A<10>().RecursiveTemplateFunction(0);
				leonardchanUnsubmitted Not Done Reply Inline Actions Will this just match on any digit we see in the output? leonardchan: Will this just match on any digit we see in the output?
				vitalybukaUnsubmitted Not Done Reply Inline Actions From https://discourse.llvm.org/t/post-mortem-on-test-flake-in-sanitizer-common/62657 it looks like a bug which needs to be fixed vitalybuka: From https://discourse.llvm.org/t/post-mortem-on-test-flake-in-sanitizer-common/62657 it looks…
				paulkirthAuthorUnsubmitted Done Reply Inline Actions Will this just match on any digit we see in the output? it will match when the symbolizer prints a stack frame, which is `#<frame number>`, so it should match that. paulkirth: > Will this just match on any digit we see in the output? it will match when the symbolizer…
	}			}