This is an archive of the discontinued LLVM Phabricator instance.

Add new test for stress testing stack unwinding
ClosedPublic

Authored by tberghammer on Jun 15 2015, 11:46 AM.

Details

Summary

Add new test for stress testing stack unwinding

This test case generates new tests from the source files dropped into
its directory. For stress testing stack unwinding it steps through the
code line by line and then tests unwinding from each instruction.

Note: Source files will be added separately. The exact source of them is still under investigation/discussion.
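Roughly, the per-instruction check described in the summary boils down to something like the following sketch using the LLDB Python API. The function name, step limit, and success criterion here are illustrative assumptions, not the actual test code:

# A minimal sketch, assuming an already-launched SBProcess that is stopped
# at a breakpoint; check_unwind_at_every_instruction and MAX_STEPS are
# hypothetical names, not part of the patch.
import lldb

MAX_STEPS = 1000

def check_unwind_at_every_instruction(process):
    thread = process.GetThreadAtIndex(0)
    steps = 0
    while process.GetState() == lldb.eStateStopped and steps < MAX_STEPS:
        # Unwinding is exercised by walking the full frame list; a broken
        # unwinder typically produces a truncated or garbled backtrace.
        names = [thread.GetFrameAtIndex(i).GetFunctionName()
                 for i in range(thread.GetNumFrames())]
        assert len(names) > 0, "empty backtrace"
        # Hypothetical success criterion: the outermost frame should be a
        # known entry point.
        assert names[-1] in ("main", "_start"), "incomplete backtrace: %r" % names
        thread.StepInstruction(False)  # step one instruction, stepping into calls
        steps += 1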

Diff Detail

Event Timeline

tberghammer retitled this revision from to Add new test for stress testing stack unwinding.
tberghammer updated this object.
tberghammer edited the test plan for this revision. (Show Details)
tberghammer added a subscriber: Unknown Object (MLST).

Fix test case name generation

labath edited edge metadata.Jun 15 2015, 12:52 PM

I think having tests like these will help our unwinding reliability. However, I think it can be done without python introspection. Also, I'm not sure it makes sense to commit this while we don't have any tests to run it on.

test/functionalities/unwind/standard/TestStandardUnwind.py
33

s/it/hit/?

57

AssertTrue(process != None) ?

65

I am confused by this comment. How will using step-out instead of step-inst help us unstick a program which is waiting for STDIO? If it is waiting for an input event, it will keep waiting no matter how we step it...

74

s/0/i/ ?

99

This design, besides being confusing to non-python experts, makes it impossible for someone to selectively enable tests. I think this is important, as there will surely be many failures once somebody starts bringing this up on a new type of platform. I would prefer refactoring this so that every source file can be defined using two lines:

def test_unwind_foo(self):
  run_test("foo.cpp")

These lines can be generated by a script on the initial import.
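Such a generator could be as simple as the sketch below; the directory path, the naming scheme, and the bare run_test helper are assumptions following the two-line example above:

# Hypothetical one-off generator for the two-line test methods; run it once
# on the initial import and paste its output into the test file.
import glob
import os
import re

def emit_test_methods(source_dir):
    for path in sorted(glob.glob(os.path.join(source_dir, "*.cpp"))):
        base = os.path.splitext(os.path.basename(path))[0]
        name = re.sub(r"\W+", "_", base)
        print("    def test_unwind_%s(self):" % name)
        print("        run_test(\"%s\")" % os.path.basename(path))
        print("")

emit_test_methods("test/functionalities/unwind/standard")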

116

What kind and how many compile errors were you encountering? I'm a bit worried that these skips everywhere will make it hard to figure out which tests are actually being run.

tberghammer added a reviewer: ovyalov.

A few general notes about this test case and why I prefer to generate test functions automatically (a rough sketch of the generation approach follows the list below):

  • It won't be run as part of the default test suite because the running time is far too high (~1 minute for a simple source file, and I plan to have a lot of source files). As a consequence, only people working on stack unwinding have to touch this file (and even they can do most things without knowing how the tests are generated)
  • Adding a new chunk of source files should be very easy, so we can get high coverage with little effort, and we can run tests where the source files come from a separate repository (e.g. llvm nightly tests, the gcc/g++ test suite, etc.) that we don't want to copy into the lldb one
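For context, the introspection-based generation amounts to something like the sketch below; the class and helper names are placeholders and the actual patch may differ in its details:

# Rough sketch of generating one test method per source file at import time;
# StandardUnwindTest and _run_unwind_test are illustrative names only.
import glob
import os
import re

class StandardUnwindTest(object):  # the real test derives from lldbtest.TestBase
    def _run_unwind_test(self, source_path):
        # build source_path, launch it, then step through it instruction by
        # instruction, verifying the backtrace at every stop
        pass

def _generate_tests():
    test_dir = os.path.dirname(os.path.abspath(__file__))
    for path in sorted(glob.glob(os.path.join(test_dir, "*.cpp"))):
        name = "test_unwind_" + re.sub(r"\W+", "_", os.path.basename(path))
        def make_test(source_path):
            def test(self):
                self._run_unwind_test(source_path)
            return test
        # bind the path via make_test to avoid the usual late-binding pitfall
        setattr(StandardUnwindTest, name, make_test(path))

_generate_tests()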
test/functionalities/unwind/standard/TestStandardUnwind.py
33

Done

57

Done

65

Fixed. The issue is caused by a bug in LLDB where single stepping an inferior on ARM changes the behavior of the inferior.

74

Done. Nice catch

116

Primarily I expect two types of compile errors because of the nature of the test suite:

  • Features not supported by the compiler we are currently testing with
  • Source code collected from elsewhere with copy/paste or with a script may contain files that don't compile

Ok, I can sort of see where you are going with this. I guess auto-generating test cases makes sense for this use-case. I'd be interested to hear what the others make of it though.

In any case, before checking this in, I would propose to add at least a couple of (hand-written?) test cases, so that it is possible to verify that the test logic actually works. I propose two test cases: one single-threaded, and one multi-threaded.

test/functionalities/unwind/standard/TestStandardUnwind.py
45

As I understand it, the idea is to use this list as a sort of XFAIL list. In that case, maybe we should add a link to the relevant bug, where applicable?
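Something along these lines, perhaps; the variable name, symbol names, and bug references below are placeholders, not real entries:

# Hypothetical shape of the XFAIL-style skip list with bug references.
functions_to_skip_unwinding = [
    "__some_libc_helper",  # llvm.org/pr<NNNNN>: missing unwind info on android
    "__another_stub",      # llvm.org/pr<NNNNN>: epilogue unwinding broken
]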

49

interferes

116

Features not supported by the compiler we are currently testing with

Ok, that makes sense.

Source code collected from elsewhere with copy/paste or with a script may contain files that don't compile

Makes sense if testing against a foreign file collection, but if you are going to check those files in, I would expect you to go through them (automatically if you want) and make sure they make sense, so that we don't end up with a bunch of garbage in the repository.

jasonmolenda edited edge metadata.Jun 16 2015, 7:03 PM

I don't have any objections to this test idea. Trying to encapsulate tricky unwind scenarios in an arch-independent manner is very hard. IMO the only way to do this is hand-written platform-specific assembly or platform-specific corefiles that capture a problematic program state.

Realistically, the unwinder doesn't fail on C/C++ compiled code -- I mean, if it does, there are some big problems that we need to address. The tricky stuff is always dealing with hand-written assembly code, or trying to backtrace through an asynchronous signal handler like sigtramp()/sigtrap(), or backtracing from address 0, or backtracing as we step through an assembly stub routine that jumps to another real destination function, or backtracing through jitted code that has no associated Module at all in lldb. Sure, turn on -fomit-frame-pointer and see if lldb follows the eh_frame correctly as you stepi through a function (prologues and epilogues are always the most likely to get you the wrong caller func) but I don't think it'll be a rich source for regression detection.

I don't mean to discourage this, please do this. I've been thinking about the problem of testing unwinds for a while now, and I'm not happy with any of the obvious approaches. And it's such a critical component of the debugger, and so easy to break, that we really do need to work on this more.

I have been using this approach for the last few weeks to find issues, and unfortunately I see a lot of cases where we can't unwind. Most of the failures come from libc.so and/or libc++.so, and I think those functions contain some quite tricky and possibly hand-crafted code (I haven't seen a failure in user code so far).
At the current stage I would be happy if we could unwind from any code generated by the compiler, but we are very far from that (at least on Android).

Excellent, then I'm all for it. :) I didn't think it would turn up anything but it's great to hear that it is. Thanks for working on this.

ovyalov edited edge metadata.Jun 17 2015, 12:29 PM

Having a couple of hand-written tests sounds like a good idea to me - it will serve as a kind of proof of concept.

test/functionalities/unwind/standard/TestStandardUnwind.py
40

Add __start_thread to track unwinding in a thread?

66

What if an inferior exits immediately after launch - for example, reporting a 126 or 127 exit code?
Do we need an assert here to check that the process is indeed in a stopped state?
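One way to guard against that case, as a sketch; the helper name is hypothetical and the exact check may differ from what the patch ends up doing:

# Sketch: fail fast if the inferior exited right after launch instead of
# stopping at the expected location.
import lldb

def assert_stopped_after_launch(test, process):
    test.assertTrue(process.IsValid(), "invalid process")
    state = process.GetState()
    if state == lldb.eStateExited:
        test.fail("inferior exited immediately with status %d" %
                  process.GetExitStatus())
    test.assertEqual(state, lldb.eStateStopped, "process is not stopped")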

72

print "INDEX: %d, THREAD %d" % (index, i) ?

107

f should contain a full path - do we need this join here?

117

Could you log the exception if Trace is on?

tberghammer edited edge metadata.
  • Remove multi-threading support (it doesn't fit into the current concept because SBThread::StepInstruction blocks if we try to step over a blocking syscall)
  • Address other review comments
tberghammer added inline comments.Jun 17 2015, 3:08 PM
test/functionalities/unwind/standard/TestStandardUnwind.py
45

Done

49

Done

66

Added check

72

Done

107

Fixed (I don't know why it worked before)

117

Done

labath accepted this revision.Jun 17 2015, 3:14 PM
labath edited edge metadata.

lgtm, thanks

This revision is now accepted and ready to land.Jun 17 2015, 3:14 PM
ovyalov accepted this revision.Jun 17 2015, 5:24 PM
ovyalov edited edge metadata.

LGTM