This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
source/
-
Core/
-
Module.cpp
-
Target/
2/4
Process.cpp

Differential D58405

Merge target triple into module triple when constructing module from memory
ClosedPublic

Authored by xiaobai on Feb 19 2019, 1:46 PM.

Download Raw Diff

Details

Reviewers

clayborg
zturner
JDevlieghere
compnerd
aprantl
labath

Commits

rGa07287ecc5e3: Merge target triple into module triple when constructing module from memory
rLLDB354526: Merge target triple into module triple when constructing module from memory
rL354526: Merge target triple into module triple when constructing module from memory

Summary

While debugging an android process remotely from a windows machine, I
noticed that the modules constructed from an object file in memory only had
information about the architecture. Without knowledge of the OS or environment,
expression evaluation sometimes leads to incorrectly generated code or a
debugger crash. While we cannot know for certain what triple a module
constructed from an in-memory object file will have, we can use the triple
from the target to try and fill in the missing details.

Diff Detail

Build Status

Buildable 28294
Build 28293: arc lint + arc unit

Event Timeline

xiaobai created this revision.Feb 19 2019, 1:46 PM

Herald added a subscriber: jdoerfert. · View Herald TranscriptFeb 19 2019, 1:46 PM

Harbormaster completed remote builds in B28294: Diff 187438.Feb 19 2019, 1:46 PM

a good guess is the triple that the target executable was compiled for.

What does this do when the executable has multiple slices, such as a Mach-O universal binary?

In D58405#1402908, @aprantl wrote:

a good guess is the triple that the target executable was compiled for.

What does this do when the executable has multiple slices, such as a Mach-O universal binary?

Hmm, I hadn't considered that case. I'll test that and see what happens. Thanks for pointing that out.

clayborg added inline comments.Feb 19 2019, 2:05 PM

source/Target/Process.cpp
2638–2639	Is the MergeFrom in the first part not enough? I am worried about the case where we don't have even an executable, no one has set the architecture on the target, or worse yet, they have set the wrong architecture on the target. We want to correct the architecture on the target if we didn't specify it or the target was wrong. I am worried if we do this here we might hose up things in those cases.

xiaobai marked an inline comment as done.Feb 19 2019, 2:10 PM

xiaobai added inline comments.

source/Target/Process.cpp
2638–2639	MergeFrom is not enough. When debugging an android-aarch64 binary, the triple was just set to `aarch64---` for modules constructed from in-memory object files, which is not enough info to do anything meaningful. However, thinking about this further, MergeFrom might not even be what we want here. Specifically, this is just a guess and the information from the in-memory object file is likely more reliable. It would probably better not to merge but to overwrite when information is available.

clayborg added inline comments.Feb 19 2019, 2:28 PM

source/Target/Process.cpp
2639	So if you have a target that is set to "aarch64-linux-android', and you have "aarch64---" in your object file/module, you would want to merge the missing bits from the target to augment the module's architecture so it grabs the "linux-android" bits. So I do believe MergeFrom should be used to augment the Module's architecture.

xiaobai marked an inline comment as done.Feb 19 2019, 2:35 PM

xiaobai added inline comments.

source/Target/Process.cpp
2639	Ah, yes! Merging from the other direction is definitely what I wanted. Thanks for pointing that out.

Updating based on feedback from clayborg

Minor change

xiaobai retitled this revision from Make educated guess when constructing module from memory to Merge target triple into module triple when constructing module from memory.Feb 19 2019, 3:30 PM

xiaobai edited the summary of this revision. (Show Details)

Harbormaster completed remote builds in B28300: Diff 187458.Feb 19 2019, 3:30 PM

Harbormaster completed remote builds in B28302: Diff 187460.

Can you think of a way to test this?

Probably just a replay of GDB protocol packets connected to a testing server would work. But, I don't know if there is infrastructure for that yet.

Perhaps a unit test?

Augmenting the architecture with the target information sounds right to me. If we're reading a module from memory, then we already have a running target, and I consider that information more reliable (since it comes from os/lldb-server) than the object file architecture (which is just random bits in process memory). I might even go so far as to say that if we detect that the module and target ArchSpecs substantially differ (for some meaning of "substantially"), we should reject the attempt to create the module in the first place. Maybe we even do that already at some level?

Unfortunately, I think testing this is going to be hard. The infrastructure for testing in-memory object files is very under-developed. I think the best attempt would be to have a test program which creates some object files in memory (via llvm jit, or perhaps by loading pre-prepared binaries from disk), and then using the gdb jit interface to notify lldb about it. But I don't think we have anything similar to this currently.

In D58405#1402908, @aprantl wrote:

a good guess is the triple that the target executable was compiled for.

What does this do when the executable has multiple slices, such as a Mach-O universal binary?

I don't think this should be an issue since a Module object always points to a specific slice in the fat binary.

This revision is now accepted and ready to land.Feb 19 2019, 11:50 PM

Looks good.

Closed by commit rL354526: Merge target triple into module triple when constructing module from memory (authored by xiaobai). · Explain WhyFeb 20 2019, 3:12 PM

This revision was automatically updated to reflect the committed changes.

Herald added a project: Restricted Project. · View Herald TranscriptFeb 20 2019, 3:12 PM

Herald added a subscriber: llvm-commits. · View Herald Transcript

Revision Contents

Path

Size

source/

Core/

Module.cpp

2 lines

Target/

Process.cpp

3 lines

Diff 187438

source/Core/Module.cpp

Show First 20 Lines • Show All 302 Lines • ▼ Show 20 Lines	if (process_sp) {
if (m_objfile_sp) {		if (m_objfile_sp) {
StreamString s;		StreamString s;
s.Printf("0x%16.16" PRIx64, header_addr);		s.Printf("0x%16.16" PRIx64, header_addr);
m_object_name.SetString(s.GetString());		m_object_name.SetString(s.GetString());

// Once we get the object file, update our module with the object		// Once we get the object file, update our module with the object
// file's architecture since it might differ in vendor/os if some		// file's architecture since it might differ in vendor/os if some
// parts were unknown.		// parts were unknown.
m_arch = m_objfile_sp->GetArchitecture();		m_arch.MergeFrom(m_objfile_sp->GetArchitecture());
} else {		} else {
error.SetErrorString("unable to find suitable object file plug-in");		error.SetErrorString("unable to find suitable object file plug-in");
}		}
} else {		} else {
error.SetErrorStringWithFormat("unable to read header from memory: %s",		error.SetErrorStringWithFormat("unable to read header from memory: %s",
readmem_error.AsCString());		readmem_error.AsCString());
}		}
} else {		} else {
▲ Show 20 Lines • Show All 1,338 Lines • Show Last 20 Lines

source/Target/Process.cpp

	Show First 20 Lines • Show All 2,629 Lines • ▼ Show 20 Lines
	ModuleSP Process::ReadModuleFromMemory(const FileSpec &file_spec,			ModuleSP Process::ReadModuleFromMemory(const FileSpec &file_spec,
	lldb::addr_t header_addr,			lldb::addr_t header_addr,
	size_t size_to_read) {			size_t size_to_read) {
	Log *log = lldb_private::GetLogIfAllCategoriesSet(LIBLLDB_LOG_HOST);			Log *log = lldb_private::GetLogIfAllCategoriesSet(LIBLLDB_LOG_HOST);
	if (log) {			if (log) {
	log->Printf("Process::ReadModuleFromMemory reading %s binary from memory",			log->Printf("Process::ReadModuleFromMemory reading %s binary from memory",
	file_spec.GetPath().c_str());			file_spec.GetPath().c_str());
	}			}
	ModuleSP module_sp(new Module(file_spec, ArchSpec()));			ModuleSP module_sp(new Module(
				file_spec, GetTarget().GetExecutableModule()->GetArchitecture()));
				clayborgUnsubmitted Not Done Reply Inline Actions Is the MergeFrom in the first part not enough? I am worried about the case where we don't have even an executable, no one has set the architecture on the target, or worse yet, they have set the wrong architecture on the target. We want to correct the architecture on the target if we didn't specify it or the target was wrong. I am worried if we do this here we might hose up things in those cases. clayborg: Is the MergeFrom in the first part not enough? I am worried about the case where we don't have…
				xiaobaiAuthorUnsubmitted Done Reply Inline Actions MergeFrom is not enough. When debugging an android-aarch64 binary, the triple was just set to `aarch64---` for modules constructed from in-memory object files, which is not enough info to do anything meaningful. However, thinking about this further, MergeFrom might not even be what we want here. Specifically, this is just a guess and the information from the in-memory object file is likely more reliable. It would probably better not to merge but to overwrite when information is available. xiaobai: MergeFrom is not enough. When debugging an android-aarch64 binary, the triple was just set to…
				clayborgUnsubmitted Not Done Reply Inline Actions So if you have a target that is set to "aarch64-linux-android', and you have "aarch64---" in your object file/module, you would want to merge the missing bits from the target to augment the module's architecture so it grabs the "linux-android" bits. So I do believe MergeFrom should be used to augment the Module's architecture. clayborg: So if you have a target that is set to "aarch64-linux-android', and you have "aarch64---" in…
				xiaobaiAuthorUnsubmitted Done Reply Inline Actions Ah, yes! Merging from the other direction is definitely what I wanted. Thanks for pointing that out. xiaobai: Ah, yes! Merging from the other direction is definitely what I wanted. Thanks for pointing that…
	if (module_sp) {			if (module_sp) {
	Status error;			Status error;
	ObjectFile *objfile = module_sp->GetMemoryObjectFile(			ObjectFile *objfile = module_sp->GetMemoryObjectFile(
	shared_from_this(), header_addr, error, size_to_read);			shared_from_this(), header_addr, error, size_to_read);
	if (objfile)			if (objfile)
	return module_sp;			return module_sp;
	}			}
	return ModuleSP();			return ModuleSP();
	▲ Show 20 Lines • Show All 3,550 Lines • Show Last 20 Lines