This is an archive of the discontinued LLVM Phabricator instance.

Differential D83023

[lldb/ObjectFileMachO] Fetch shared cache images from our own shared cache
ClosedPublic

Authored by friss on Jul 1 2020, 10:15 PM.

Details

Reviewers

jasonmolenda
labath

Commits

rG8113a8bb7934: [lldb/ObjectFileMachO] Fetch shared cache images from our own shared cache

Summary

On macOS 11, the libraries that have been integrated in the system
shared cache are not present on the filesystem anymore. LLDB was
using those files to get access to the symbols of those libraries.
LLDB can get the images from the target process memory though.

This has 2 consequences:

1. LLDB cannot load the images before the process starts, reporting an error if someone tries to break on a system symbol.
2. Loading the symbols by downloading the data from the inferior is super slow. It takes tens of seconds at the start of the debug session to populate the Module list.

To fix this, we can use the library images LLDB has in its own
mapping of the shared cache. To do this, the patch extends ModuleSpec
so that it can store a DataBuffer for the Module. The macOS platform
provides that buffer by querying a new HostInfo utility which
describes the contents of the shared cache.

This patch fixes a number of test failures on macOS 11 due to the
first problem described above.
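
The lookup-then-attach flow described in the summary can be sketched roughly as follows. This is only an illustration with stand-in types: `ImageInfoSketch`, `ModuleSpecSketch`, `LookupImage` and `AttachSharedCacheData` are hypothetical names, not LLDB APIs, and a `std::map` stands in for whatever container the real utility uses.

```cpp
#include <cassert>
#include <cstdint>
#include <map>
#include <memory>
#include <string>
#include <vector>

// Shared, read-only bytes of an image (stand-in for a data buffer).
using BytesSP = std::shared_ptr<const std::vector<uint8_t>>;

// Hypothetical description of one image found in the host's shared cache.
struct ImageInfoSketch {
  std::string uuid;
  BytesSP data_sp; // null when the image is not in the cache
};

// Hypothetical module spec that can optionally carry in-memory data.
struct ModuleSpecSketch {
  std::string path;
  std::string uuid;
  BytesSP data_sp;
};

using ImageMap = std::map<std::string, ImageInfoSketch>;

// Returns the entry for `path`, or a default (empty) entry if it is unknown.
ImageInfoSketch LookupImage(const ImageMap &images, const std::string &path) {
  auto it = images.find(path);
  if (it == images.end())
    return ImageInfoSketch{};
  return it->second;
}

// Attaches cached data to the spec. Returns false when the image is not in
// the cache, in which case the caller would fall back to the file system.
bool AttachSharedCacheData(const ImageMap &images, ModuleSpecSketch &spec) {
  ImageInfoSketch info = LookupImage(images, spec.path);
  if (!info.data_sp)
    return false;
  spec.uuid = info.uuid;
  spec.data_sp = info.data_sp;
  return true;
}
```

The point of the default-constructed result is that callers need no special "not found" path beyond checking whether data is present.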

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

friss created this revision. Jul 1 2020, 10:15 PM

Herald added projects: Restricted Project, Restricted Project. Jul 1 2020, 10:15 PM

Herald added a subscriber: llvm-commits.

@labath I tagged you mostly for the generic parts of the patch. I don't suppose you care a lot about ObjectFileMachO.cpp

I think this is a very interesting feature (lldb being able to load modules from memory; the mac shared cache thingy is interesting too, but in a different way). We have a feature request from people who are downloading modules from a network (from a proprietary symbol server, etc.) and would like to pass them to lldb without having to serialize them to disk. This would be a step towards making that happen. It could also be useful for our own unit tests which now have to do a similar thing.

However, I think this could use some clean up. There's a lot of juggling of data/file/object offsets going on, and it all seems inconsistent and avoidable to me. Please see inline comments for details.

The patch is also quite light on testing. If done right, I believe this should make it possible to yaml2obj a file into memory in a unit test and then create a Module from that. That would enable us to test the Module(Spec) changes in isolation, and move them to a separate patch.

In D83023#2127232, @friss wrote:

I don't suppose you care a lot about ObjectFileMachO.cpp

I care about ObjectFileMachO to the extent that I need to occasionally touch it when working on generic interfaces. And I gotta say that changing anything in there is pretty hard right now... :/

lldb/include/lldb/Host/macosx/HostInfoMacOSX.h
21–42	The way we've done this elsewhere is to add the interface to the base class with a default stubbed-out implementation. That way, you don't have to put `#ifdef __APPLE__` into all of the code which tries to use this. `HostInfo::GetXcodeSDKPath` is the latest example of that.
37	I think this could just be an ArrayRef<uint8_t>, or a void* or something, and then you could create an appropriately sized DataBufferHostMemory (or whatever it ends up being called) when working with a specific module.
lldb/include/lldb/Utility/DataBuffer.h
82 ↗	(On Diff #275004)	All of our other DataBuffers also point to host memory (how could they not do that?). I guess what really makes this one special is that it does not own the storage that it points to...
lldb/source/Core/Module.cpp
154–159 ↗	(On Diff #275004)	I think this overloads the meaning of `module_spec.GetObjectOffset()` in a fairly confusing way. Normally, I believe `ModuleSpec::GetObject{Name,Offset,Size}` is used to refer to the name/offset/size of an object file located in an archive (.a). However, here it seems you are reusing it for something else. It seems unfortunate that the meaning of this field should change depending on the "data" field being present. What exactly is the purpose of this field? Could we avoid this by just creating an appropriately-sized DataBuffer?
1265–1272 ↗	(On Diff #275004)	With an appropriately sized data_sp, I'm hoping most of this could go away...
lldb/source/Plugins/ObjectFile/Mach-O/ObjectFileMachO.cpp
934–947	Wouldn't this be better handled by adjusting the file offsets during Section creation?
lldb/source/Plugins/Platform/MacOSX/PlatformDarwin.cpp
42–44	Leftovers from an earlier implementation?
lldb/source/Symbol/ObjectFile.cpp
217 ↗	(On Diff #275004)	this data/file_offset business would be nice to get rid of too...
lldb/unittests/Host/HostInfoTest.cpp
74 ↗	(On Diff #275004)	EXPECT_LT
llvm/include/llvm/BinaryFormat/MachO.h
86 ↗	(On Diff #275004)	Maybe just commit this, and all the 0x80000000u replacements straight away.
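
The "stub in the base class" pattern suggested in the first inline comment can be illustrated with a small standalone sketch. All names here (`ImageInfo`, `HostInfoBaseSketch`, `HostInfoMacSketch`, `HostInfoSketch`) are hypothetical stand-ins rather than the LLDB classes, and the macOS body is a placeholder.

```cpp
#include <cassert>
#include <string>

// Hypothetical result type; an empty uuid means "not found".
struct ImageInfo {
  std::string uuid;
};

class HostInfoBaseSketch {
public:
  // Default stub: hosts without a shared cache simply report nothing.
  static ImageInfo GetSharedCacheImageInfo(const std::string &) { return {}; }
};

class HostInfoMacSketch : public HostInfoBaseSketch {
public:
  // Platform-specific version hides the base-class stub.
  static ImageInfo GetSharedCacheImageInfo(const std::string &name) {
    return {"uuid-of-" + name}; // placeholder for a real lookup
  }
};

// A single alias selects the host class, so call sites need no #ifdef.
#if defined(__APPLE__)
using HostInfoSketch = HostInfoMacSketch;
#else
using HostInfoSketch = HostInfoBaseSketch;
#endif
```

Generic code can then call `HostInfoSketch::GetSharedCacheImageInfo(name)` on every platform and treat an empty result as "nothing available".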

In D83023#2128298, @labath wrote:

I think this is a very interesting feature (lldb being able to load modules from memory; the mac shared cache thingy is interesting too, but in a different way). We have a feature request from people who are downloading modules from a network (from a proprietary symbol server, etc.) and would like to pass them to lldb without having to serialize them to disk. This would be a step towards making that happen. It could also be useful for our own unit tests which now have to do a similar thing.

However, I think this could use some clean up. There's a lot of juggling of data/file/object offsets going on, and it all seems inconsistent and avoidable to me. Please see inline comments for details.

I'll see what can be done. My main goal while working on this was to avoid changing the semantics outside of the shared cache usecase. I understand fairly well the codepath that I added and then just moved some other bits around to keep the existing semantics for the rest. Happy to rework this.

The patch is also quite light on testing. If done right, I believe this should make it possible to yaml2obj a file into memory in a unit test and then create a Module from that. That would enable us to test the Module(Spec) changes in isolation, and move them to a separate patch.

I agree about the light testing. FWIW, this got significant living-on testing on Apple platforms, but some more targeted testing would be great.

In D83023#2127232, @friss wrote:

I don't suppose you care a lot about ObjectFileMachO.cpp

I care about ObjectFileMachO to the extent that I need to occasionally touch it when working on generic interfaces. And I gotta say that changing anything in there is pretty hard right now... :/

Tell me about it...

I'll do a couple experiments in response to your comments. Thanks for the review!

lldb/include/lldb/Host/macosx/HostInfoMacOSX.h
21–42	Yeah. When I wrote this patch, GetXcodeSDKPath didn't exist and I did it this way to avoid putting a very non-generic API at the top-level. As we're fine with this, it's easy to change.
lldb/include/lldb/Utility/DataBuffer.h
82 ↗	(On Diff #275004)	Good point. What about `DataBufferUnowned` ? (and adding a comment to explain what it's used for)
lldb/source/Core/Module.cpp
154–159 ↗	(On Diff #275004)	So the shared cache is some kind of a container, in some ways similar to a `.a` file, in some ways very different. The main difference is that the contained dylibs are not each in their own subpart of the file. For example, the __LINKEDIT segment is shared by all of them. The load commands that describe the images are relative to the shared cache itself, not to one specific image. So I reused the object_offset field in its "offset from the beginning of a container" sense. I think I tried having the SharedCacheInfo return only the subpart of the cache that contains the image, and ran into issues with this. But I don't remember exactly what, and I have a much better understanding now, so I should try it again.
lldb/source/Plugins/Platform/MacOSX/PlatformDarwin.cpp
42–44	yep, good catch
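
The idea behind a `DataBufferUnowned`-style class, a buffer that points at storage it does not own, can be sketched as below. `BufferSketch` and `UnownedBufferSketch` are hypothetical names, and the interface is a simplified assumption, not the actual lldb `DataBuffer` API.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical minimal buffer interface, loosely modelled on the idea of a
// data buffer abstraction; not the real lldb class.
class BufferSketch {
public:
  virtual ~BufferSketch() = default;
  virtual const uint8_t *GetBytes() const = 0;
  virtual size_t GetByteSize() const = 0;
};

// Non-owning view: it neither copies nor frees the bytes, so the caller
// must keep the underlying storage alive for as long as the view is used.
class UnownedBufferSketch : public BufferSketch {
public:
  UnownedBufferSketch(const uint8_t *bytes, size_t size)
      : m_bytes(bytes), m_size(size) {}

  const uint8_t *GetBytes() const override { return m_bytes; }
  size_t GetByteSize() const override { return m_size; }

private:
  const uint8_t *m_bytes;
  size_t m_size;
};
```

This fits memory such as a mapped shared cache, which stays valid for the life of the process and must never be freed by the buffer.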

In D83023#2128475, @friss wrote:

I'll see what can be done. My main goal while working on this was to avoid changing the semantics outside of the shared cache usecase. I understand fairly well the codepath that I added and then just moved some other bits around to keep the existing semantics for the rest. Happy to rework this.

So, if an object file needs to access some data which is outside of "its" image then my idea about using sliced data buffers will probably not work. In that case, using the "object offset" field to communicate the location of the "object" might not be a bad idea (it's still different than the use in .a files, but maybe we can stretch that definition). The part that bugs me then is having this functionality key off of the "data" field being set. Ideally, these would be two orthogonal features:

the "data" would control whether you read the file system to obtain the object contents
the "object offset" would tell you where to locate the desired object inside these "jumbo" objects

I think that might be doable by adding one more argument to the ObjectFile::GetModuleSpecifications interface, which says "this buffer contains a lot of stuff, but the object I am really interested in is at <offset>". Then ObjectFileMachO could do what it needs to read the object from that offset, but it would still have access to the entire buffer to read the __LINKEDIT thingy.
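
A rough sketch of that "whole buffer plus object offset" idea: the reader is handed the entire buffer and a separate offset, so it can parse the object at that offset while still being able to reach data elsewhere in the buffer. `ReadMagicAt` is a hypothetical helper, not part of the ObjectFile interface.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>

// Reads a 32-bit magic value located `object_offset` bytes into a larger
// buffer. The buffer itself is left untouched, so other parts of it remain
// reachable by the caller. Returns false if the read would go out of bounds.
bool ReadMagicAt(const uint8_t *data, size_t size, size_t object_offset,
                 uint32_t &magic) {
  if (object_offset > size || size - object_offset < sizeof(uint32_t))
    return false;
  std::memcpy(&magic, data + object_offset, sizeof(magic));
  return true;
}
```

Whether the bytes came from a file or from memory is decided before this call, which keeps the two concerns separate.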

Or.... we could try to do something completely different, and avoid touching the common interfaces altogether. The way I see it, this shared cache thing is unlikely to be reusable for anything else, and the reason we're adding these interfaces is so that we can communicate information from DynamicLoaderDarwin to ObjectFileMachO through Target::GetOrCreateModule. But that is silly, because DynamicLoaderDarwin already knows that the request is going to go to ObjectFileMachO (and indeed things would break if it doesn't). If there was a way for these two to communicate directly, then all of this would be unneeded, because DynamicLoaderDarwin could use an ObjectFileMachO interface, which would be specially crafted for this purpose.

It also seems to me we are only really interested in the "Get" part of Target::GetOrCreateModule -- i.e., ensuring we don't create multiple module instances for the same object. The rest of the function deals with various ways to try to find a module, but that's the thing we actually want to avoid, as we already know the data buffer that contains it. And all of the checks about matching uuids/architectures/etc. don't seem useful either as DynamicLoaderDarwin is repeating most of these anyway.

We actually already have an interface that almost does this: Module::CreateModuleFromObjectFile. It allows one to create a Module while directly specifying the ObjectFile class to use, and passing arbitrary constructor arguments. The part it does *not* do is register this module into the global module cache. That's because so far, this function is used to create "weird" modules that we wouldn't want to cache anyway. But, what if we created a version of this function which does that? Could DynamicLoaderDarwin then instead of target.GetOrCreateModule(shared_cache_spec) do something like:

module_sp = target.GetImages().FindFirstModule(shared_cache_spec);
// If target already contains the module, we're done.
if (!module_sp) {
  // This checks the global module cache for a matching module. If it finds
  // one, it returns it. Otherwise, it creates a module through
  // CreateModuleFromObjectFile, and registers it into the cache.
  module_sp = Module::GetOrCreateModuleFromObjectFile<ObjectFileMachO>(shared_cache_spec, whatever_args_we_need_to_get_macho_to_load_properly);
  target.GetImages().Append(module_sp, /*notify=*/false);
}
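
The get-or-create behaviour proposed in the snippet above (look in a global cache first, otherwise build the module and register it) can be sketched in standalone form. `ModuleSketch` and `GetOrCreateModuleSketch` are hypothetical; the string key stands in for a real module spec, and none of this is existing LLDB API.

```cpp
#include <cassert>
#include <functional>
#include <map>
#include <memory>
#include <mutex>
#include <string>

struct ModuleSketch {
  std::string key;
};
using ModuleSketchSP = std::shared_ptr<ModuleSketch>;

// Returns the cached module for `key` if there is one. Otherwise calls
// `create`, stores a non-null result in the process-wide cache, and
// returns it. The mutex makes concurrent callers share one instance.
ModuleSketchSP
GetOrCreateModuleSketch(const std::string &key,
                        const std::function<ModuleSketchSP()> &create) {
  static std::mutex g_mutex;
  static std::map<std::string, ModuleSketchSP> g_cache;

  std::lock_guard<std::mutex> guard(g_mutex);
  auto it = g_cache.find(key);
  if (it != g_cache.end())
    return it->second;

  ModuleSketchSP module_sp = create();
  if (module_sp)
    g_cache.emplace(key, module_sp);
  return module_sp;
}
```

Taking the factory as a callable is one way to let the caller pick the concrete object-file class and its constructor arguments, which is the "escape hatch" aspect of the proposal.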

I think an interface like this could be useful in the future as an escape hatch, because there are a lot of situations where one already knows one is dealing with a specific kind of object file, but is not able to take advantage of that. For example, DynamicLoaderPOSIXDYLD "knows" it's going to load an ELF file, but that knowledge does not help it in any way. So far, we haven't needed to do anything super weird there, but that class already does contain a bunch of fallbacks/workarounds for bugs in various dynamic loaders and other platform features (e.g. the VDSO pseudo-module). I can certainly see how being able to create a module more directly could be helpful at times.

The patch is also quite light on testing. If done right, I believe this should make it possible to yaml2obj a file into memory in a unit test and then create a Module from that. That would enable us to test the Module(Spec) changes in isolation, and move them to a separate patch.

I agree about the light testing. FWIW, this got significant living-on testing on Apple platforms, but some more targeted testing would be great.

Yeah, I'm sure it works now. I'm more worried about it continuing to work in face of other random changes. :)

lldb/include/lldb/Utility/DataBuffer.h
82 ↗	(On Diff #275004)	Sounds good. I don't think that the comment really needs to mention the shared cache. I can see this being useful in other circumstances too...
lldb/source/Core/Module.cpp
154–159 ↗	(On Diff #275004)	Hmm... interesting. I'm going to touch on this more in the main comment.

In D83023#2129985, @labath wrote:

So, if an object file needs to access some data which is outside of "its" image then my idea about using sliced data buffers will probably not work. In that case, using the "object offset" field to communicate the location of the "object" might not be a bad idea (it's still different than the use in .a files, but maybe we can stretch that definition). The part that bugs me then is having this functionality key off of the "data" field being set. Ideally, these would be two orthogonal features:

the "data" would control whether you read the file system to obtain the object contents

the "object offset" would tell you where to locate the desired object inside these "jumbo" objects

I think this will work. And I can hide the ugliness inside ObjectFileMachO. A shared cache image only ever needs to access data after its start, so I can model images as stretching from their starting point to the end of the shared cache. I remember doing it this way first, and the reason I changed my mind was because of some checks in ObjectFileMachO::CreateSections which fired because the load commands were relative to the full shared cache instead of just the image. This can be dealt with locally in this function (the rest of the code has to deal with it anyway, because once an ObjectFile plugin claims an input, the data gets clamped to not have a starting offset anymore).
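
Modelling an image as the range from its start to the end of the cache amounts to a simple slice computation. A minimal sketch, assuming the cache is one contiguous mapping; `ImageViewSketch` and `MakeImageView` are hypothetical names used only for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// A non-owning view of the bytes that belong to (or follow) one image.
struct ImageViewSketch {
  const uint8_t *start;
  size_t size;
};

// Builds a view covering [text_offset, cache_size) of the cache mapping.
// Everything after the image start stays reachable, including segments
// that several images share. Out-of-range offsets yield an empty view.
ImageViewSketch MakeImageView(const uint8_t *cache_start, size_t cache_size,
                              uint64_t text_offset) {
  if (text_offset >= cache_size)
    return {nullptr, 0};
  return {cache_start + text_offset,
          cache_size - static_cast<size_t>(text_offset)};
}
```

Because the view never extends before the image start, offsets inside it are always non-negative, which is what lets the rest of the code treat it like an ordinary buffer.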

Take a look at https://reviews.llvm.org/D83512, it implements the generic part and it required basically no work to support ELF in-memory files.

Rebase on top of D83512
Change the ObjectFileMachO pieces to rewrite offsets to look like a standard Mach-O image instead of adding a bunch of conditionals to handle the new cases.

Herald added a subscriber: mgorny. Jul 15 2020, 3:40 PM

Harbormaster failed remote builds in B64435: Diff 278331! Jul 15 2020, 4:04 PM

The rewrite of the ObjectFileMachO parts is very nice. LGTM.

lldb/source/Host/macosx/objcxx/HostInfoMacOSX.mm
463	not important but #include "Utility/UuidCompatibility.h" would get you this.
lldb/source/Plugins/ObjectFile/Mach-O/ObjectFileMachO.cpp
2346	this is a bool, maybe !is_shared_cache_image would be clearer? The original code was comparing a bitfield to 0 so it made a little more sense.

This revision is now accepted and ready to land. Jul 15 2020, 6:01 PM

I like how this has turned out. Some small requests inline.

lldb/include/lldb/Host/HostInfoBase.h
107	Some doxygen here? "Try to find a module with the given name in the address space of the current process?"
lldb/source/Host/macosx/objcxx/HostInfoMacOSX.mm
494–495	The class is already in an implementation file (you could emphasize that by making that an anonymous namespace: http://llvm.org/docs/CodingStandards.html#anonymous-namespaces). I don't think you need to go through all that trouble to avoid someone instantiating it...
lldb/unittests/Host/HostInfoTest.cpp
81–117 ↗	(On Diff #278331)	This is mostly about checking the ObjectFileMachO functionality, is it not? Could you make that an ObjectFileMachO unittest? I'm mainly trying to avoid pulling in lots of libraries into the host unittest binary. The unittest binary is a good way to ensure that the host module does not grow external dependencies (as the unittest wouldn't link), but this would pull in pretty much everything into that binary.

Address review feedback

labath accepted this revision. Jul 16 2020, 8:27 AM

labath added inline comments.

lldb/source/Host/macosx/objcxx/HostInfoMacOSX.mm
474–481	extern "C" in an anonymous namespace looks weird, even if it does work. Best move this part out...

I'll commit after I've re-built top of tree and fully retested

Harbormaster failed remote builds in B64530: Diff 278495! Jul 16 2020, 8:59 AM

Closed by commit rG8113a8bb7934: [lldb/ObjectFileMachO] Fetch shared cache images from our own shared cache (authored by friss). Jul 16 2020, 10:40 AM

This revision was automatically updated to reflect the committed changes.

jasonmolenda mentioned this in D100164: Don't treat corefile binaries like dylibs in the shared cache, even if they say they are. Apr 9 2021, 12:16 AM

Revision Contents

Path	Size
lldb/include/lldb/Host/HostInfoBase.h	13 lines
lldb/include/lldb/Host/macosx/HostInfoMacOSX.h	5 lines
lldb/source/Host/macosx/objcxx/HostInfoMacOSX.mm	63 lines
lldb/source/Plugins/DynamicLoader/MacOSX-DYLD/DynamicLoaderDarwin.cpp	43 lines
lldb/source/Plugins/ObjectFile/Mach-O/ObjectFileMachO.h	2 lines
lldb/source/Plugins/ObjectFile/Mach-O/ObjectFileMachO.cpp	110 lines
lldb/source/Plugins/Platform/MacOSX/PlatformDarwin.cpp	24 lines
lldb/unittests/ObjectFile/CMakeLists.txt	1 line
lldb/unittests/ObjectFile/MachO/CMakeLists.txt	10 lines
lldb/unittests/ObjectFile/MachO/TestObjectFileMachO.cpp	79 lines

Diff 278537

lldb/include/lldb/Host/HostInfoBase.h

 //===-- HostInfoBase.h ------------------------------------------*- C++ -*-===//
 //
 // Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
 // See https://llvm.org/LICENSE.txt for license information.
 // SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
 //
 //===----------------------------------------------------------------------===//

 #ifndef LLDB_HOST_HOSTINFOBASE_H
 #define LLDB_HOST_HOSTINFOBASE_H

 #include "lldb/Utility/ArchSpec.h"
 #include "lldb/Utility/FileSpec.h"
+#include "lldb/Utility/UUID.h"
 #include "lldb/Utility/UserIDResolver.h"
 #include "lldb/Utility/XcodeSDK.h"
 #include "lldb/lldb-enumerations.h"
 #include "llvm/ADT/StringRef.h"

 #include <stdint.h>

 #include <string>

 namespace lldb_private {

 class FileSpec;

+struct SharedCacheImageInfo {
+  UUID uuid;
+  lldb::DataBufferSP data_sp;
+};
+
 class HostInfoBase {
 private:
   // Static class, unconstructable.
   HostInfoBase() {}
   ~HostInfoBase() {}

 public:
   static void Initialize();
[58 lines not shown]
   static bool ComputePathRelativeToLibrary(FileSpec &file_spec,
                                            llvm::StringRef dir);

   static FileSpec GetXcodeContentsDirectory() { return {}; }
   static FileSpec GetXcodeDeveloperDirectory() { return {}; }

   /// Return the directory containing a specific Xcode SDK.
   static llvm::StringRef GetXcodeSDKPath(XcodeSDK sdk) { return {}; }

+  /// Return information about module \p image_name if it is loaded in
+  /// the current process's address space.
+  static SharedCacheImageInfo
+  GetSharedCacheImageInfo(llvm::StringRef image_name) {
+    return {};
+  }
+
 protected:
   static bool ComputeSharedLibraryDirectory(FileSpec &file_spec);
   static bool ComputeSupportExeDirectory(FileSpec &file_spec);
   static bool ComputeProcessTempFileDirectory(FileSpec &file_spec);
   static bool ComputeGlobalTempFileDirectory(FileSpec &file_spec);
   static bool ComputeTempFileBaseDirectory(FileSpec &file_spec);
   static bool ComputeHeaderDirectory(FileSpec &file_spec);
   static bool ComputeSystemPluginsDirectory(FileSpec &file_spec);
   static bool ComputeUserPluginsDirectory(FileSpec &file_spec);

   static void ComputeHostArchitectureSupport(ArchSpec &arch_32,
                                              ArchSpec &arch_64);
 };
 }

 #endif

lldb/include/lldb/Host/macosx/HostInfoMacOSX.h

[12 lines not shown]
 #include "lldb/Utility/FileSpec.h"
 #include "lldb/Utility/XcodeSDK.h"
 #include "llvm/Support/VersionTuple.h"

 namespace lldb_private {

 class ArchSpec;

 class HostInfoMacOSX : public HostInfoPosix {
   friend class HostInfoBase;

 private:
   // Static class, unconstructable.
   HostInfoMacOSX() = delete;
   ~HostInfoMacOSX() = delete;

 public:
   static llvm::VersionTuple GetOSVersion();
   static llvm::VersionTuple GetMacCatalystVersion();
   static bool GetOSBuildString(std::string &s);
   static bool GetOSKernelDescription(std::string &s);
   static FileSpec GetProgramFileSpec();
   static FileSpec GetXcodeContentsDirectory();
   static FileSpec GetXcodeDeveloperDirectory();

   /// Query xcrun to find an Xcode SDK directory.
   static llvm::StringRef GetXcodeSDKPath(XcodeSDK sdk);

+  /// Shared cache utilities
+  static SharedCacheImageInfo
+  GetSharedCacheImageInfo(llvm::StringRef image_name);

 protected:
   static bool ComputeSupportExeDirectory(FileSpec &file_spec);
   static void ComputeHostArchitectureSupport(ArchSpec &arch_32,
                                              ArchSpec &arch_64);
   static bool ComputeHeaderDirectory(FileSpec &file_spec);
   static bool ComputeSystemPluginsDirectory(FileSpec &file_spec);
   static bool ComputeUserPluginsDirectory(FileSpec &file_spec);
 };
 }

 #endif

lldb/source/Host/macosx/objcxx/HostInfoMacOSX.mm

//===-- HostInfoMacOSX.mm ---------------------------------------- C++ --===//		//===-- HostInfoMacOSX.mm ---------------------------------------- C++ --===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "lldb/Host/macosx/HostInfoMacOSX.h"		#include "lldb/Host/macosx/HostInfoMacOSX.h"
#include "lldb/Host/FileSystem.h"		#include "lldb/Host/FileSystem.h"
#include "lldb/Host/Host.h"		#include "lldb/Host/Host.h"
#include "lldb/Host/HostInfo.h"		#include "lldb/Host/HostInfo.h"
#include "lldb/Utility/Args.h"		#include "lldb/Utility/Args.h"
#include "lldb/Utility/Log.h"		#include "lldb/Utility/Log.h"
		#include "Utility/UuidCompatibility.h"

#include "llvm/ADT/SmallString.h"		#include "llvm/ADT/SmallString.h"
		#include "llvm/ADT/StringMap.h"
#include "llvm/Support/FileSystem.h"		#include "llvm/Support/FileSystem.h"
#include "llvm/Support/Path.h"		#include "llvm/Support/Path.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"

// C++ Includes		// C++ Includes
#include <string>		#include <string>

// C inclues		// C inclues
▲ Show 20 Lines • Show All 427 Lines • ▼ Show 20 Lines	llvm::StringRef HostInfoMacOSX::GetXcodeSDKPath(XcodeSDK sdk) {

std::lock_guard<std::mutex> guard(g_sdk_path_mutex);		std::lock_guard<std::mutex> guard(g_sdk_path_mutex);
auto it = g_sdk_path.find(sdk.GetString());		auto it = g_sdk_path.find(sdk.GetString());
if (it != g_sdk_path.end())		if (it != g_sdk_path.end())
return it->second;		return it->second;
auto it_new = g_sdk_path.insert({sdk.GetString(), GetXcodeSDK(sdk)});		auto it_new = g_sdk_path.insert({sdk.GetString(), GetXcodeSDK(sdk)});
return it_new.first->second;		return it_new.first->second;
}		}

		namespace {
		jasonmolendaUnsubmitted Not Done Reply Inline Actions not important but #include "Utility/UuidCompatibility.h" would get you this. jasonmolenda: not important but #include "Utility/UuidCompatibility.h" would get you this.
		struct dyld_shared_cache_dylib_text_info {
		uint64_t version; // current version 1
		// following fields all exist in version 1
		uint64_t loadAddressUnslid;
		uint64_t textSegmentSize;
		uuid_t dylibUuid;
		const char *path; // pointer invalid at end of iterations
		// following fields all exist in version 2
		uint64_t textSegmentOffset; // offset from start of cache
		};
		typedef struct dyld_shared_cache_dylib_text_info
		dyld_shared_cache_dylib_text_info;
		}

		extern "C" int dyld_shared_cache_iterate_text(
		const uuid_t cacheUuid,
		void (^callback)(const dyld_shared_cache_dylib_text_info *info));
		extern "C" uint8_t _dyld_get_shared_cache_range(size_t length);
		labathUnsubmitted Not Done Reply Inline Actions extern "C" in an anonymous namespace looks weird, even if it does work. Best move this part out... labath: extern "C" in an anonymous namespace looks weird, even if it does work. Best move this part out.
		extern "C" bool _dyld_get_shared_cache_uuid(uuid_t uuid);

		namespace {
		class SharedCacheInfo {
		public:
		const UUID &GetUUID() const { return m_uuid; };
		const llvm::StringMap<SharedCacheImageInfo> &GetImages() const {
		return m_images;
		};

		SharedCacheInfo();

		private:
		llvm::StringMap<SharedCacheImageInfo> m_images;
		labath: The class is already in an implementation file (you could emphasize that by putting it in an anonymous namespace: http://llvm.org/docs/CodingStandards.html#anonymous-namespaces). I don't think you need to go through all that trouble to avoid someone instantiating it...
		UUID m_uuid;
		};
		}

		SharedCacheInfo::SharedCacheInfo() {
		size_t shared_cache_size;
		uint8_t *shared_cache_start =
		_dyld_get_shared_cache_range(&shared_cache_size);
		uuid_t dsc_uuid;
		_dyld_get_shared_cache_uuid(dsc_uuid);
		m_uuid = UUID::fromData(dsc_uuid);

		dyld_shared_cache_iterate_text(
		dsc_uuid, ^(const dyld_shared_cache_dylib_text_info *info) {
		m_images[info->path] = SharedCacheImageInfo{
		UUID::fromData(info->dylibUuid, 16),
		std::make_shared<DataBufferUnowned>(
		shared_cache_start + info->textSegmentOffset,
		shared_cache_size - info->textSegmentOffset)};
		});
		}

		SharedCacheImageInfo
		HostInfoMacOSX::GetSharedCacheImageInfo(llvm::StringRef image_name) {
		static SharedCacheInfo g_shared_cache_info;
		return g_shared_cache_info.GetImages().lookup(image_name);
		}
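
Note on the buffer arithmetic in the constructor above: each image gets a view that starts at its __TEXT offset inside the cache mapping and runs to the end of the cache, not just to the end of the image. A minimal, self-contained sketch of that slicing is shown here. `ImageSlice` and `SliceImage` are hypothetical names used only for illustration (they are not LLDB or dyld APIs), and a plain byte array is assumed in place of the real mapping.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical stand-in for an unowned view into the shared cache mapping.
struct ImageSlice {
  const uint8_t *start;
  size_t size;
};

// Returns a view that begins at the image's __TEXT segment and extends to
// the end of the cache, so the shared __LINKEDIT data stays reachable.
inline ImageSlice SliceImage(const uint8_t *cache_start, size_t cache_size,
                             uint64_t text_segment_offset) {
  assert(text_segment_offset <= cache_size && "offset must lie in the cache");
  return ImageSlice{cache_start + text_segment_offset,
                    cache_size - static_cast<size_t>(text_segment_offset)};
}
```

With a 64-byte stand-in cache and a text offset of 16, the view would start 16 bytes in and cover the remaining 48 bytes.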

lldb/source/Plugins/DynamicLoader/MacOSX-DYLD/DynamicLoaderDarwin.cpp

Show All 10 Lines
#include "lldb/Breakpoint/StoppointCallbackContext.h"		#include "lldb/Breakpoint/StoppointCallbackContext.h"
#include "lldb/Core/Debugger.h"		#include "lldb/Core/Debugger.h"
#include "lldb/Core/Module.h"		#include "lldb/Core/Module.h"
#include "lldb/Core/ModuleSpec.h"		#include "lldb/Core/ModuleSpec.h"
#include "lldb/Core/PluginManager.h"		#include "lldb/Core/PluginManager.h"
#include "lldb/Core/Section.h"		#include "lldb/Core/Section.h"
#include "lldb/Expression/DiagnosticManager.h"		#include "lldb/Expression/DiagnosticManager.h"
#include "lldb/Host/FileSystem.h"		#include "lldb/Host/FileSystem.h"
		#include "lldb/Host/HostInfo.h"
#include "lldb/Symbol/Function.h"		#include "lldb/Symbol/Function.h"
#include "lldb/Symbol/ObjectFile.h"		#include "lldb/Symbol/ObjectFile.h"
#include "lldb/Target/ABI.h"		#include "lldb/Target/ABI.h"
#include "lldb/Target/RegisterContext.h"		#include "lldb/Target/RegisterContext.h"
#include "lldb/Target/StackFrame.h"		#include "lldb/Target/StackFrame.h"
#include "lldb/Target/Target.h"		#include "lldb/Target/Target.h"
#include "lldb/Target/Thread.h"		#include "lldb/Target/Thread.h"
#include "lldb/Target/ThreadPlanCallFunction.h"		#include "lldb/Target/ThreadPlanCallFunction.h"
▲ Show 20 Lines • Show All 91 Lines • ▼ Show 20 Lines	if (module_sp && !module_spec.GetUUID().IsValid() &&
!module_sp->GetUUID().IsValid()) {		!module_sp->GetUUID().IsValid()) {
// No UUID, we must rely upon the cached module modification time and the		// No UUID, we must rely upon the cached module modification time and the
// modification time of the file on disk		// modification time of the file on disk
if (module_sp->GetModificationTime() !=		if (module_sp->GetModificationTime() !=
FileSystem::Instance().GetModificationTime(module_sp->GetFileSpec()))		FileSystem::Instance().GetModificationTime(module_sp->GetFileSpec()))
module_sp.reset();		module_sp.reset();
}		}

if (!module_sp) {		if (module_sp || !can_create)
if (can_create) {		return module_sp;

		if (HostInfo::GetArchitecture().IsCompatibleMatch(target.GetArchitecture())) {
		// When debugging on the host, we are most likely using the same shared
		// cache as our inferior. The dylibs from the shared cache might not
		// exist on the filesystem, so let's use the images in our own memory
		// to create the modules.
		// Check if the requested image is in our shared cache.
		SharedCacheImageInfo image_info =
		HostInfo::GetSharedCacheImageInfo(module_spec.GetFileSpec().GetPath());

		// If we found it and it has the correct UUID, let's proceed with
		// creating a module from the memory contents.
		if (image_info.uuid &&
		(!module_spec.GetUUID() || module_spec.GetUUID() == image_info.uuid)) {
		ModuleSpec shared_cache_spec(module_spec.GetFileSpec(), image_info.uuid,
		image_info.data_sp);
		module_sp =
		target.GetOrCreateModule(shared_cache_spec, false /* notify */);
		}
		}
// We'll call Target::ModulesDidLoad after all the modules have been		// We'll call Target::ModulesDidLoad after all the modules have been
// added to the target, don't let it be called for every one.		// added to the target, don't let it be called for every one.
		if (!module_sp)
module_sp = target.GetOrCreateModule(module_spec, false /* notify */);		module_sp = target.GetOrCreateModule(module_spec, false /* notify */);
if (!module_sp || module_sp->GetObjectFile() == nullptr)		if (!module_sp || module_sp->GetObjectFile() == nullptr)
module_sp = m_process->ReadModuleFromMemory(image_info.file_spec,		module_sp = m_process->ReadModuleFromMemory(image_info.file_spec,
image_info.address);		image_info.address);

if (did_create_ptr)		if (did_create_ptr)
*did_create_ptr = (bool)module_sp;		*did_create_ptr = (bool)module_sp;
}
}
return module_sp;		return module_sp;
}		}

void DynamicLoaderDarwin::UnloadImages(		void DynamicLoaderDarwin::UnloadImages(
const std::vector<lldb::addr_t> &solib_addresses) {		const std::vector<lldb::addr_t> &solib_addresses) {
std::lock_guard<std::recursive_mutex> guard(m_mutex);		std::lock_guard<std::recursive_mutex> guard(m_mutex);
if (m_process->GetStopID() == m_dyld_image_infos_stop_id)		if (m_process->GetStopID() == m_dyld_image_infos_stop_id)
return;		return;
▲ Show 20 Lines • Show All 1,029 Lines • Show Last 20 Lines
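
Note on the UUID check in the hunk above: the host shared cache image is used only when the lookup found an image, and the requested module either carries no UUID or carries the same one. The sketch below restates that rule in isolation. `ShouldUseSharedCacheImage` is a hypothetical helper, and UUIDs are modeled as strings where an empty string is assumed to mean "no valid UUID"; LLDB's own `UUID` class is not used here.

```cpp
#include <string>

// Hypothetical helper mirroring the acceptance rule: the cache lookup must
// have produced a UUID, and the requested UUID (if any) must match it.
// An empty string stands for an invalid or missing UUID.
inline bool ShouldUseSharedCacheImage(const std::string &cache_uuid,
                                      const std::string &requested_uuid) {
  if (cache_uuid.empty())
    return false; // image is not in the host shared cache
  return requested_uuid.empty() || requested_uuid == cache_uuid;
}
```

When the rule rejects the cached image, the surrounding code falls back to the regular module lookup and, failing that, to reading the module from the inferior's memory.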

lldb/source/Plugins/ObjectFile/Mach-O/ObjectFileMachO.h

Show First 20 Lines • Show All 219 Lines • ▼ Show 20 Lines	protected:
llvm::MachO::dysymtab_command m_dysymtab;		llvm::MachO::dysymtab_command m_dysymtab;
std::vector<llvm::MachO::segment_command_64> m_mach_segments;		std::vector<llvm::MachO::segment_command_64> m_mach_segments;
std::vector<llvm::MachO::section_64> m_mach_sections;		std::vector<llvm::MachO::section_64> m_mach_sections;
llvm::Optional<llvm::VersionTuple> m_min_os_version;		llvm::Optional<llvm::VersionTuple> m_min_os_version;
llvm::Optional<llvm::VersionTuple> m_sdk_versions;		llvm::Optional<llvm::VersionTuple> m_sdk_versions;
typedef lldb_private::RangeVector<uint32_t, uint32_t> FileRangeArray;		typedef lldb_private::RangeVector<uint32_t, uint32_t> FileRangeArray;
lldb_private::Address m_entry_point_address;		lldb_private::Address m_entry_point_address;
FileRangeArray m_thread_context_offsets;		FileRangeArray m_thread_context_offsets;
		lldb::offset_t m_linkedit_original_offset;
		lldb::addr_t m_text_address;
bool m_thread_context_offsets_valid;		bool m_thread_context_offsets_valid;
lldb_private::FileSpecList m_reexported_dylibs;		lldb_private::FileSpecList m_reexported_dylibs;
bool m_allow_assembly_emulation_unwind_plans;		bool m_allow_assembly_emulation_unwind_plans;
};		};

#endif // LLDB_SOURCE_PLUGINS_OBJECTFILE_MACH_O_OBJECTFILEMACHO_H		#endif // LLDB_SOURCE_PLUGINS_OBJECTFILE_MACH_O_OBJECTFILEMACHO_H

lldb/source/Plugins/ObjectFile/Mach-O/ObjectFileMachO.cpp


Show First 20 Lines • Show All 41 Lines • ▼ Show 20 Lines
#include "lldb/Utility/UUID.h"		#include "lldb/Utility/UUID.h"

#include "lldb/Host/SafeMachO.h"		#include "lldb/Host/SafeMachO.h"

#include "llvm/Support/MemoryBuffer.h"		#include "llvm/Support/MemoryBuffer.h"

#include "ObjectFileMachO.h"		#include "ObjectFileMachO.h"

#if defined(__APPLE__) && \		#if defined(__APPLE__)
(defined(__arm__) || defined(__arm64__) || defined(__aarch64__))		#include <TargetConditionals.h>
// GetLLDBSharedCacheUUID() needs to call dlsym()		// GetLLDBSharedCacheUUID() needs to call dlsym()
#include <dlfcn.h>		#include <dlfcn.h>
#endif		#endif

#ifndef __APPLE__		#ifndef __APPLE__
#include "Utility/UuidCompatibility.h"		#include "Utility/UuidCompatibility.h"
#else		#else
#include <uuid/uuid.h>		#include <uuid/uuid.h>
▲ Show 20 Lines • Show All 866 Lines • ▼ Show 20 Lines
bool ObjectFileMachO::ParseHeader(DataExtractor &data,		bool ObjectFileMachO::ParseHeader(DataExtractor &data,
lldb::offset_t *data_offset_ptr,		lldb::offset_t *data_offset_ptr,
llvm::MachO::mach_header &header) {		llvm::MachO::mach_header &header) {
data.SetByteOrder(endian::InlHostByteOrder());		data.SetByteOrder(endian::InlHostByteOrder());
// Leave magic in the original byte order		// Leave magic in the original byte order
header.magic = data.GetU32(data_offset_ptr);		header.magic = data.GetU32(data_offset_ptr);
bool can_parse = false;		bool can_parse = false;
bool is_64_bit = false;		bool is_64_bit = false;
switch (header.magic) {		switch (header.magic) {
case MH_MAGIC:		case MH_MAGIC:
data.SetByteOrder(endian::InlHostByteOrder());		data.SetByteOrder(endian::InlHostByteOrder());
data.SetAddressByteSize(4);		data.SetAddressByteSize(4);
can_parse = true;		can_parse = true;
break;		break;

case MH_MAGIC_64:		case MH_MAGIC_64:
data.SetByteOrder(endian::InlHostByteOrder());		data.SetByteOrder(endian::InlHostByteOrder());
data.SetAddressByteSize(8);		data.SetAddressByteSize(8);
can_parse = true;		can_parse = true;
is_64_bit = true;		is_64_bit = true;
break;		break;

		labath: Wouldn't this be better handled by adjusting the file offsets during Section creation?
case MH_CIGAM:		case MH_CIGAM:
data.SetByteOrder(endian::InlHostByteOrder() == eByteOrderBig		data.SetByteOrder(endian::InlHostByteOrder() == eByteOrderBig
? eByteOrderLittle		? eByteOrderLittle
: eByteOrderBig);		: eByteOrderBig);
data.SetAddressByteSize(4);		data.SetAddressByteSize(4);
can_parse = true;		can_parse = true;
break;		break;

▲ Show 20 Lines • Show All 367 Lines • ▼ Show 20 Lines	ObjectFileMachO::EncryptedFileRanges ObjectFileMachO::GetEncryptedFileRanges() {
return result;		return result;
}		}

void ObjectFileMachO::SanitizeSegmentCommand(segment_command_64 &seg_cmd,		void ObjectFileMachO::SanitizeSegmentCommand(segment_command_64 &seg_cmd,
uint32_t cmd_idx) {		uint32_t cmd_idx) {
if (m_length == 0 || seg_cmd.filesize == 0)		if (m_length == 0 || seg_cmd.filesize == 0)
return;		return;

		if ((m_header.flags & MH_DYLIB_IN_CACHE) && !IsInMemory()) {
		// In shared cache images, the load commands are relative to the
		// shared cache file, and not the specific image we are
		// examining. Let's fix this up so that it looks like a normal
		// image.
		if (strncmp(seg_cmd.segname, "__TEXT", sizeof(seg_cmd.segname)) == 0)
		m_text_address = seg_cmd.vmaddr;
		if (strncmp(seg_cmd.segname, "__LINKEDIT", sizeof(seg_cmd.segname)) == 0)
		m_linkedit_original_offset = seg_cmd.fileoff;

		seg_cmd.fileoff = seg_cmd.vmaddr - m_text_address;
		}

if (seg_cmd.fileoff > m_length) {		if (seg_cmd.fileoff > m_length) {
// We have a load command that says it extends past the end of the file.		// We have a load command that says it extends past the end of the file.
// This is likely a corrupt file. We don't have any way to return an error		// This is likely a corrupt file. We don't have any way to return an error
// condition here (this method was likely invoked from something like		// condition here (this method was likely invoked from something like
// ObjectFile::GetSectionList()), so we just null out the section contents,		// ObjectFile::GetSectionList()), so we just null out the section contents,
// and dump a message to stdout. The most common case here is core file		// and dump a message to stdout. The most common case here is core file
// debugging with a truncated file.		// debugging with a truncated file.
const char *lc_segment_name =		const char *lc_segment_name =
▲ Show 20 Lines • Show All 320 Lines • ▼ Show 20 Lines	if (m_data.GetU8(&offset, (uint8_t *)sect64.segname,
sizeof(sect64.segname)) == nullptr)		sizeof(sect64.segname)) == nullptr)
break;		break;
sect64.addr = m_data.GetAddress(&offset);		sect64.addr = m_data.GetAddress(&offset);
sect64.size = m_data.GetAddress(&offset);		sect64.size = m_data.GetAddress(&offset);

if (m_data.GetU32(&offset, &sect64.offset, num_u32s) == nullptr)		if (m_data.GetU32(&offset, &sect64.offset, num_u32s) == nullptr)
break;		break;

		if ((m_header.flags & MH_DYLIB_IN_CACHE) && !IsInMemory()) {
		sect64.offset = sect64.addr - m_text_address;
		}

// Keep a list of mach sections around in case we need to get at data that		// Keep a list of mach sections around in case we need to get at data that
// isn't stored in the abstracted Sections.		// isn't stored in the abstracted Sections.
m_mach_sections.push_back(sect64);		m_mach_sections.push_back(sect64);

if (add_section) {		if (add_section) {
ConstString section_name(		ConstString section_name(
sect64.sectname, strnlen(sect64.sectname, sizeof(sect64.sectname)));		sect64.sectname, strnlen(sect64.sectname, sizeof(sect64.sectname)));
if (!const_segname) {		if (!const_segname) {
▲ Show 20 Lines • Show All 584 Lines • ▼ Show 20 Lines	const addr_t nlist_data_byte_size =
symtab_load_command.nsyms * nlist_byte_size;		symtab_load_command.nsyms * nlist_byte_size;
const addr_t strtab_data_byte_size = symtab_load_command.strsize;		const addr_t strtab_data_byte_size = symtab_load_command.strsize;
addr_t strtab_addr = LLDB_INVALID_ADDRESS;		addr_t strtab_addr = LLDB_INVALID_ADDRESS;

ProcessSP process_sp(m_process_wp.lock());		ProcessSP process_sp(m_process_wp.lock());
Process *process = process_sp.get();		Process *process = process_sp.get();

uint32_t memory_module_load_level = eMemoryModuleLoadLevelComplete;		uint32_t memory_module_load_level = eMemoryModuleLoadLevelComplete;
		bool is_shared_cache_image = m_header.flags & MH_DYLIB_IN_CACHE;
		bool is_local_shared_cache_image = is_shared_cache_image && !IsInMemory();
		SectionSP linkedit_section_sp(
		section_list->FindSectionByName(GetSegmentNameLINKEDIT()));

if (process && m_header.filetype != llvm::MachO::MH_OBJECT) {		if (process && m_header.filetype != llvm::MachO::MH_OBJECT &&
		!is_local_shared_cache_image) {
Target &target = process->GetTarget();		Target &target = process->GetTarget();

memory_module_load_level = target.GetMemoryModuleLoadLevel();		memory_module_load_level = target.GetMemoryModuleLoadLevel();

SectionSP linkedit_section_sp(
section_list->FindSectionByName(GetSegmentNameLINKEDIT()));
// Reading mach file from memory in a process or core file...		// Reading mach file from memory in a process or core file...

if (linkedit_section_sp) {		if (linkedit_section_sp) {
addr_t linkedit_load_addr =		addr_t linkedit_load_addr =
linkedit_section_sp->GetLoadBaseAddress(&target);		linkedit_section_sp->GetLoadBaseAddress(&target);
if (linkedit_load_addr == LLDB_INVALID_ADDRESS) {		if (linkedit_load_addr == LLDB_INVALID_ADDRESS) {
// We might be trying to access the symbol table before the		// We might be trying to access the symbol table before the
// __LINKEDIT's load address has been set in the target. We can't		// __LINKEDIT's load address has been set in the target. We can't
// fail to read the symbol table, so calculate the right address		// fail to read the symbol table, so calculate the right address
// manually		// manually
linkedit_load_addr = CalculateSectionLoadAddressForMemoryImage(		linkedit_load_addr = CalculateSectionLoadAddressForMemoryImage(
m_memory_addr, GetMachHeaderSection(), linkedit_section_sp.get());		m_memory_addr, GetMachHeaderSection(), linkedit_section_sp.get());
}		}

const addr_t linkedit_file_offset = linkedit_section_sp->GetFileOffset();		const addr_t linkedit_file_offset = linkedit_section_sp->GetFileOffset();
const addr_t symoff_addr = linkedit_load_addr +		const addr_t symoff_addr = linkedit_load_addr +
symtab_load_command.symoff -		symtab_load_command.symoff -
linkedit_file_offset;		linkedit_file_offset;
strtab_addr = linkedit_load_addr + symtab_load_command.stroff -		strtab_addr = linkedit_load_addr + symtab_load_command.stroff -
linkedit_file_offset;		linkedit_file_offset;

bool data_was_read = false;

#if defined(__APPLE__) && \
(defined(__arm__) || defined(__arm64__) || defined(__aarch64__))
if (m_header.flags & MH_DYLIB_IN_CACHE &&
process->GetAddressByteSize() == sizeof(void *)) {
// This mach-o memory file is in the dyld shared cache. If this
// program is not remote and this is iOS, then this process will
// share the same shared cache as the process we are debugging and we
// can read the entire __LINKEDIT from the address space in this
// process. This is a needed optimization that is used for local iOS
// debugging only since all shared libraries in the shared cache do
// not have corresponding files that exist in the file system of the
// device. They have been combined into a single file. This means we
// always have to load these files from memory. All of the symbol and
// string tables from all of the __LINKEDIT sections from the shared
// libraries in the shared cache have been merged into a single large
// symbol and string table. Reading all of this symbol and string
// table data across can slow down debug launch times, so we optimize
// this by reading the memory for the __LINKEDIT section from this
// process.

UUID lldb_shared_cache;
addr_t lldb_shared_cache_addr;
GetLLDBSharedCacheUUID(lldb_shared_cache_addr, lldb_shared_cache);
UUID process_shared_cache;
addr_t process_shared_cache_addr;
GetProcessSharedCacheUUID(process, process_shared_cache_addr,
process_shared_cache);
bool use_lldb_cache = true;
if (lldb_shared_cache.IsValid() && process_shared_cache.IsValid() &&
(lldb_shared_cache != process_shared_cache ||
process_shared_cache_addr != lldb_shared_cache_addr)) {
use_lldb_cache = false;
}

PlatformSP platform_sp(target.GetPlatform());
if (platform_sp && platform_sp->IsHost() && use_lldb_cache) {
data_was_read = true;
nlist_data.SetData((void *)symoff_addr, nlist_data_byte_size,
eByteOrderLittle);
strtab_data.SetData((void *)strtab_addr, strtab_data_byte_size,
eByteOrderLittle);
if (function_starts_load_command.cmd) {
const addr_t func_start_addr =
linkedit_load_addr + function_starts_load_command.dataoff -
linkedit_file_offset;
function_starts_data.SetData((void *)func_start_addr,
function_starts_load_command.datasize,
eByteOrderLittle);
}
}
}
#endif

if (!data_was_read) {
// Always load dyld - the dynamic linker - from memory if we didn't		// Always load dyld - the dynamic linker - from memory if we didn't
// find a binary anywhere else. lldb will not register		// find a binary anywhere else. lldb will not register
// dylib/framework/bundle loads/unloads if we don't have the dyld		// dylib/framework/bundle loads/unloads if we don't have the dyld
// symbols, we force dyld to load from memory despite the user's		// symbols, we force dyld to load from memory despite the user's
// target.memory-module-load-level setting.		// target.memory-module-load-level setting.
if (memory_module_load_level == eMemoryModuleLoadLevelComplete ||		if (memory_module_load_level == eMemoryModuleLoadLevelComplete ||
m_header.filetype == llvm::MachO::MH_DYLINKER) {		m_header.filetype == llvm::MachO::MH_DYLINKER) {
DataBufferSP nlist_data_sp(		DataBufferSP nlist_data_sp(
Show All 14 Lines	if (linkedit_section_sp) {
// cache the string table.		// cache the string table.
// Binaries in the shared cache all share a giant string table,		// Binaries in the shared cache all share a giant string table,
// and we can't share the string tables across multiple		// and we can't share the string tables across multiple
// ObjectFileMachO's, so we'd end up re-reading this mega-strtab		// ObjectFileMachO's, so we'd end up re-reading this mega-strtab
// for every binary in the shared cache - it would be a big perf		// for every binary in the shared cache - it would be a big perf
// problem. For binaries outside the shared cache, it's faster to		// problem. For binaries outside the shared cache, it's faster to
// read the entire strtab at once instead of piece-by-piece as we		// read the entire strtab at once instead of piece-by-piece as we
// process the nlist records.		// process the nlist records.
if ((m_header.flags & MH_DYLIB_IN_CACHE) == 0) {		if (!is_shared_cache_image) {
		jasonmolenda: This is a bool, maybe !is_shared_cache_image would be clearer? The original code was comparing a bitfield to 0, so it made a little more sense.
DataBufferSP strtab_data_sp(		DataBufferSP strtab_data_sp(
ReadMemory(process_sp, strtab_addr, strtab_data_byte_size));		ReadMemory(process_sp, strtab_addr, strtab_data_byte_size));
if (strtab_data_sp) {		if (strtab_data_sp) {
strtab_data.SetData(strtab_data_sp, 0,		strtab_data.SetData(strtab_data_sp, 0,
strtab_data_sp->GetByteSize());		strtab_data_sp->GetByteSize());
}		}
}		}
}		}
}
if (memory_module_load_level >= eMemoryModuleLoadLevelPartial) {		if (memory_module_load_level >= eMemoryModuleLoadLevelPartial) {
if (function_starts_load_command.cmd) {		if (function_starts_load_command.cmd) {
const addr_t func_start_addr =		const addr_t func_start_addr =
linkedit_load_addr + function_starts_load_command.dataoff -		linkedit_load_addr + function_starts_load_command.dataoff -
linkedit_file_offset;		linkedit_file_offset;
DataBufferSP func_start_data_sp(		DataBufferSP func_start_data_sp(
ReadMemory(process_sp, func_start_addr,		ReadMemory(process_sp, func_start_addr,
function_starts_load_command.datasize));		function_starts_load_command.datasize));
if (func_start_data_sp)		if (func_start_data_sp)
function_starts_data.SetData(func_start_data_sp, 0,		function_starts_data.SetData(func_start_data_sp, 0,
func_start_data_sp->GetByteSize());		func_start_data_sp->GetByteSize());
}		}
}		}
}		}
}		}
} else {		} else {
		if (is_local_shared_cache_image) {
		// The load commands in shared cache images are relative to the
		// beginning of the shared cache, not the library image. The
		// data we get handed when creating the ObjectFileMachO starts
		// at the beginning of a specific library and spans to the end
		// of the cache to be able to reach the shared LINKEDIT
		// segments. We need to convert the load command offsets to be
		// relative to the beginning of our specific image.
		lldb::addr_t linkedit_offset = linkedit_section_sp->GetFileOffset();
		lldb::offset_t linkedit_slide =
		linkedit_offset - m_linkedit_original_offset;
		symtab_load_command.symoff += linkedit_slide;
		symtab_load_command.stroff += linkedit_slide;
		dyld_info.export_off += linkedit_slide;
		m_dysymtab.indirectsymoff += linkedit_slide;
		function_starts_load_command.dataoff += linkedit_slide;
		}

nlist_data.SetData(m_data, symtab_load_command.symoff,		nlist_data.SetData(m_data, symtab_load_command.symoff,
nlist_data_byte_size);		nlist_data_byte_size);
strtab_data.SetData(m_data, symtab_load_command.stroff,		strtab_data.SetData(m_data, symtab_load_command.stroff,
strtab_data_byte_size);		strtab_data_byte_size);

if (dyld_info.export_size > 0) {		if (dyld_info.export_size > 0) {
dyld_trie_data.SetData(m_data, dyld_info.export_off,		dyld_trie_data.SetData(m_data, dyld_info.export_off,
dyld_info.export_size);		dyld_info.export_size);
▲ Show 20 Lines • Show All 3,386 Lines • ▼ Show 20 Lines
// errors. So we need to use the actual underlying types of task_t and		// errors. So we need to use the actual underlying types of task_t and
// kern_return_t below.		// kern_return_t below.
extern "C" unsigned int /*task_t*/ mach_task_self();		extern "C" unsigned int /*task_t*/ mach_task_self();

void ObjectFileMachO::GetLLDBSharedCacheUUID(addr_t &base_addr, UUID &uuid) {		void ObjectFileMachO::GetLLDBSharedCacheUUID(addr_t &base_addr, UUID &uuid) {
uuid.Clear();		uuid.Clear();
base_addr = LLDB_INVALID_ADDRESS;		base_addr = LLDB_INVALID_ADDRESS;

#if defined(__APPLE__) && \		#if defined(__APPLE__)
(defined(__arm__) || defined(__arm64__) || defined(__aarch64__))
uint8_t *(*dyld_get_all_image_infos)(void);		uint8_t *(*dyld_get_all_image_infos)(void);
dyld_get_all_image_infos =		dyld_get_all_image_infos =
(uint8_t * (*)()) dlsym(RTLD_DEFAULT, "_dyld_get_all_image_infos");		(uint8_t * (*)()) dlsym(RTLD_DEFAULT, "_dyld_get_all_image_infos");
if (dyld_get_all_image_infos) {		if (dyld_get_all_image_infos) {
uint8_t *dyld_all_image_infos_address = dyld_get_all_image_infos();		uint8_t *dyld_all_image_infos_address = dyld_get_all_image_infos();
if (dyld_all_image_infos_address) {		if (dyld_all_image_infos_address) {
uint32_t *version = (uint32_t *)		uint32_t *version = (uint32_t *)
dyld_all_image_infos_address; // version <mach-o/dyld_images.h>		dyld_all_image_infos_address; // version <mach-o/dyld_images.h>
▲ Show 20 Lines • Show All 611 Lines • Show Last 20 Lines
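
Note on the offset fix-ups in the ObjectFileMachO.cpp hunks above: for a shared cache image read from LLDB's own memory, each segment's file offset is rewritten to be relative to the image's __TEXT segment, and offsets that point into __LINKEDIT (symoff, stroff and so on) are shifted by however far __LINKEDIT itself moved. The sketch below works through that arithmetic under simplifying assumptions. `Segment`, `RebaseSegment` and `RebaseLinkeditOffset` are hypothetical names, not part of LLDB.

```cpp
#include <cstdint>

// Hypothetical, simplified view of a segment load command.
struct Segment {
  uint64_t vmaddr;
  uint64_t fileoff;
};

// Make the segment's file offset relative to the image's __TEXT segment
// instead of the start of the shared cache file.
inline void RebaseSegment(Segment &seg, uint64_t text_vmaddr) {
  seg.fileoff = seg.vmaddr - text_vmaddr;
}

// Shift a __LINKEDIT-relative offset (e.g. symoff) by the same amount the
// __LINKEDIT segment itself moved. Unsigned wraparound keeps the result
// correct even when the new offset is smaller than the original one.
inline uint64_t RebaseLinkeditOffset(uint64_t offset,
                                     uint64_t linkedit_original_fileoff,
                                     uint64_t linkedit_new_fileoff) {
  return offset + (linkedit_new_fileoff - linkedit_original_fileoff);
}
```

For example, if __LINKEDIT originally sat at file offset 0x5000000 and ends up at 0x300000 after rebasing, an offset of 0x5000100 becomes 0x300100, which preserves its distance from the start of __LINKEDIT.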

lldb/source/Plugins/Platform/MacOSX/PlatformDarwin.cpp

Show All 33 Lines
#include "lldb/Utility/Log.h"		#include "lldb/Utility/Log.h"
#include "lldb/Utility/ProcessInfo.h"		#include "lldb/Utility/ProcessInfo.h"
#include "lldb/Utility/Status.h"		#include "lldb/Utility/Status.h"
#include "lldb/Utility/Timer.h"		#include "lldb/Utility/Timer.h"
#include "llvm/ADT/STLExtras.h"		#include "llvm/ADT/STLExtras.h"
#include "llvm/Support/FileSystem.h"		#include "llvm/Support/FileSystem.h"
#include "llvm/Support/Threading.h"		#include "llvm/Support/Threading.h"
#include "llvm/Support/VersionTuple.h"		#include "llvm/Support/VersionTuple.h"

#if defined(__APPLE__)		#if defined(__APPLE__)
#include <TargetConditionals.h>		#include <TargetConditionals.h>
		labath: Leftovers from an earlier implementation?
		friss: Yep, good catch.
#endif		#endif

using namespace lldb;		using namespace lldb;
using namespace lldb_private;		using namespace lldb_private;

/// Default Constructor		/// Default Constructor
PlatformDarwin::PlatformDarwin(bool is_host) : PlatformPOSIX(is_host) {}		PlatformDarwin::PlatformDarwin(bool is_host) : PlatformPOSIX(is_host) {}

▲ Show 20 Lines • Show All 179 Lines • ▼ Show 20 Lines	LLDB_LOGF(log,
module_spec.GetFileSpec().GetFilename().AsCString(),		module_spec.GetFileSpec().GetFilename().AsCString(),
module_spec.GetPlatformFileSpec().GetDirectory().AsCString(),		module_spec.GetPlatformFileSpec().GetDirectory().AsCString(),
module_spec.GetPlatformFileSpec().GetFilename().AsCString(),		module_spec.GetPlatformFileSpec().GetFilename().AsCString(),
module_spec.GetSymbolFileSpec().GetDirectory().AsCString(),		module_spec.GetSymbolFileSpec().GetDirectory().AsCString(),
module_spec.GetSymbolFileSpec().GetFilename().AsCString());		module_spec.GetSymbolFileSpec().GetFilename().AsCString());

Status err;		Status err;

		if (IsHost()) {
		// When debugging on the host, we are most likely using the same shared
		// cache as our inferior. The dylibs from the shared cache might not
		// exist on the filesystem, so let's use the images in our own memory
		// to create the modules.

		// Check if the requested image is in our shared cache.
		SharedCacheImageInfo image_info =
		HostInfo::GetSharedCacheImageInfo(module_spec.GetFileSpec().GetPath());

		// If we found it and it has the correct UUID, let's proceed with
		// creating a module from the memory contents.
		if (image_info.uuid &&
		(!module_spec.GetUUID() || module_spec.GetUUID() == image_info.uuid)) {
		ModuleSpec shared_cache_spec(module_spec.GetFileSpec(), image_info.uuid,
		image_info.data_sp);
		err = ModuleList::GetSharedModule(shared_cache_spec, module_sp,
		module_search_paths_ptr,
		old_module_sp_ptr, did_create_ptr);
		if (module_sp)
		return err;
		}
		}

err = ModuleList::GetSharedModule(module_spec, module_sp,		err = ModuleList::GetSharedModule(module_spec, module_sp,
module_search_paths_ptr, old_module_sp_ptr,		module_search_paths_ptr, old_module_sp_ptr,
did_create_ptr);		did_create_ptr);
if (module_sp)		if (module_sp)
return err;		return err;

if (!IsHost()) {		if (!IsHost()) {
std::string cache_path(GetLocalCacheDirectory());		std::string cache_path(GetLocalCacheDirectory());
▲ Show 20 Lines • Show All 1,493 Lines • Show Last 20 Lines

lldb/unittests/ObjectFile/CMakeLists.txt

	add_subdirectory(Breakpad)			add_subdirectory(Breakpad)
	add_subdirectory(ELF)			add_subdirectory(ELF)
				add_subdirectory(MachO)
	add_subdirectory(PECOFF)			add_subdirectory(PECOFF)

lldb/unittests/ObjectFile/MachO/CMakeLists.txt

This file was added.

				add_lldb_unittest(ObjectFileMachOTests
				TestObjectFileMachO.cpp

				LINK_LIBS
				lldbPluginObjectFileMachO
				lldbPluginSymbolFileSymtab
				lldbCore
				lldbUtilityHelpers
				LLVMTestingSupport
				)

lldb/unittests/ObjectFile/MachO/TestObjectFileMachO.cpp

This file was added.

				//===-- ObjectFileMachOTest.cpp -------------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#include "lldb/Host/HostInfo.h"
				#include "Plugins/ObjectFile/Mach-O/ObjectFileMachO.h"
				#include "TestingSupport/SubsystemRAII.h"
				#include "TestingSupport/TestUtilities.h"
				#include "lldb/Core/Module.h"
				#include "lldb/Host/FileSystem.h"
				#include "lldb/lldb-defines.h"
				#include "gtest/gtest.h"

				#ifdef __APPLE__
				#include <dlfcn.h>
				#endif

				using namespace lldb_private;
				using namespace llvm;

				namespace {
				class ObjectFileMachOTest : public ::testing::Test {
				SubsystemRAII<FileSystem, HostInfo, ObjectFileMachO> subsystems;
				};
				} // namespace

				#if defined(__APPLE__)
				TEST_F(ObjectFileMachOTest, ModuleFromSharedCacheInfo) {
				SharedCacheImageInfo image_info =
				HostInfo::GetSharedCacheImageInfo("/usr/lib/libobjc.A.dylib");
				EXPECT_TRUE(image_info.uuid);
				EXPECT_TRUE(image_info.data_sp);

				ModuleSpec spec(FileSpec(), UUID(), image_info.data_sp);
				lldb::ModuleSP module = std::make_shared<Module>(spec);
				ObjectFile *OF = module->GetObjectFile();
				ASSERT_TRUE(llvm::isa<ObjectFileMachO>(OF));
				EXPECT_TRUE(
				OF->GetArchitecture().IsCompatibleMatch(HostInfo::GetArchitecture()));
				Symtab *symtab = OF->GetSymtab();
				ASSERT_NE(symtab, nullptr);
				void *libobjc = dlopen("/usr/lib/libobjc.A.dylib", RTLD_LAZY);
				ASSERT_NE(libobjc, nullptr);

				// This function checks that if we read something from the
				// ObjectFile we get through the shared cache in-memory
				// buffer, it matches what we get by reading directly the
				// memory of the symbol.
				auto check_symbol = [&](const char *sym_name) {
				std::vector<uint32_t> symbol_indices;
				symtab->FindAllSymbolsWithNameAndType(ConstString(sym_name),
				lldb::eSymbolTypeAny, symbol_indices);
				EXPECT_EQ(symbol_indices.size(), 1u);

				Symbol *sym = symtab->SymbolAtIndex(symbol_indices[0]);
				ASSERT_NE(sym, nullptr);
				Address base = sym->GetAddress();
				size_t size = sym->GetByteSize();
				ASSERT_NE(size, 0u);
				uint8_t buffer[size];
				EXPECT_EQ(OF->ReadSectionData(base.GetSection().get(), base.GetOffset(),
				buffer, size),
				size);

				void *sym_addr = dlsym(libobjc, sym_name);
				ASSERT_NE(sym_addr, nullptr);
				EXPECT_EQ(memcmp(buffer, sym_addr, size), 0);
				};

				// Read a symbol from the __TEXT segment...
				check_symbol("objc_msgSend");
				// ... and one from the __DATA segment
				check_symbol("OBJC_CLASS_$_NSObject");
				}
				#endif