This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
include/lldb/
-
lldb/
-
Core/
1/1
Mangled.h
-
RichManglingInfo.h
-
Symbol/
6/13
Symtab.h
-
lldb-forward.h
-
source/
-
Core/
-
CMakeLists.txt
4
Mangled.cpp
-
RichManglingInfo.cpp
-
Symbol/
3/8
Symtab.cpp

Differential D49990

Use rich mangling information in Symtab::InitNameIndexes()
AbandonedPublic

Authored by sgraenitz on Jul 30 2018, 8:07 AM.

Download Raw Diff

Details

Reviewers

labath
jingham
JDevlieghere
erik.pilkington

Summary

This review is about getting your feedback for the patch. If it doesn't make it in this form, I can adjust everything that's necessary and open a new review once I am done. So please don't hesitate to share your honest opinions.

In preparation for this review, there were two earlier ones:

https://reviews.llvm.org/D49612 introduced the ItaniumPartialDemangler to LLDB demangling without conceptual changes
https://reviews.llvm.org/D49909 added a unit test that covers all relevant code paths in the InitNameIndexes() function

Primary goals for this patch are:
(1) Use ItaniumPartialDemangler's rich mangling info for building LLDB's name index.
(2) Make a uniform interface so that Symtab doesn't get involved with mangling details too much.
(3) Improve indexing performance.

In order to achive (1) and (2) I added two classes:

RichManglingInfo offers a uniform interface to query symbol properties like getFunctionDeclContextName() or isCtorOrDtor(). It can switch between different providers internally. At the moment it supports llvm::ItaniumPartialDemangler and lldb_private::CPlusPlusLanguage::MethodName (legacy/fallback implementation).
RichManglingSpec handles configuration and lifetime of RichManglingInfos. It is likely stack-allocated and can be reused for multiple queries during batch processing.

These classes are used for wrapping the input and output of DemangleWithRichManglingInfo(), our new function for explicit demangling. It will return a properly initialized RichManglingInfo on success, or otherwise null:

RichManglingInfo *Mangled::DemangleWithRichManglingInfo(RichManglingSpec, SkipMangledNameFn)

Thus RichManglingInfo does not need to support a None-state (it's not accessible in this state). In order to avoid an extra heap allocation per invocation for storing the result of DemangleWithRichManglingInfo(), the actual instance is owned by RichManglingSpec. This aids (3) and we want to use a single RichManglingSpec instance for the entire index anyway (it also owns the IPD). An efficient filtering function SkipMangledNameFn contributes here too and helps to mimic the original behavior of InitNameIndexes.

The old implementation only parsed and indexed Itanium mangled names. The new RichManglingInfo can be easily extended for various mangling schemes and languages.

One problem with the implementation of RichManglingInfo is the inaccessibility of class CPlusPlusLanguage::MethodName (defined in source/Plugins/Language/..), from within any header in the Core components of LLDB. The rather hacky solution is to store a type erased pointer and cast it to the correct type on access in the cpp - see RichManglingInfo::get<ParserT>(). Not sure if there's a better way to do it. IMHO CPlusPlusLanguage::MethodName should be a top-level class in order to enable forward delcarations (but that is a rather big change I guess).

I also found a few minor bugs/smells, which I will mark with inline comments. First simple profiling shows a good speedup. target create clang now takes 0.64s on average (over 5 runs). Before the change I observed runtimes between 0.76s an 1.01s. This is still no bulletproof data (I only ran it on one machine!), but it's a promising indicator I think.

What do you think?
Is this too unconventional?
Do you have ideas for improvements?

Diff Detail

Build Status

Buildable 20895
Build 20895: arc lint + arc unit

Event Timeline

sgraenitz created this revision.Jul 30 2018, 8:07 AM

Harbormaster completed remote builds in B20839: Diff 157967.Jul 30 2018, 8:07 AM

sgraenitz edited the summary of this revision. (Show Details)Jul 30 2018, 8:10 AM

sgraenitz added inline comments.Jul 30 2018, 8:24 AM

include/lldb/Symbol/Symtab.h
42	We don't need a None-case here.
57	This is the hackiest point I guess.
105	^^^^^ May have its own header & cpp
include/lldb/Utility/ConstString.h
357 ↗	(On Diff #157967)	Fixing a related issue: There is no way to determine whether or not the internal string is null or empty. In fact, `operator!` does the same as the above `IsEmpty()`. The `Mangled::GetDemangledName()` function, however, thinks there would be a difference and wants to benefit from it. The fixed version should be correct now.
source/Core/Mangled.cpp
199	Fixing bug: This is no dead code, but well, maybe in a rare branch.
367	Using the difference between null and empty.
398	Using the difference between null and empty.
source/Symbol/Symtab.cpp
220	This uses a raw C-string instead of `llvm::StringRef` in order to achieve `O(1)` runtime.

I haven't read through this in detail yet, but I think this is a good start!

The part I'm not sure about is whether the RichManglingInfo vs. RichManglingSpec distinction brings any value. I mean, the lifetime of the first is tied to the lifetime of the second, and the Spec class can only have one active Info instance at any given moment. So you might as well just have one class, pass that to DemangleWithRichManglingInfo, and then query the same object when the call returns. The current interface with createItaniumInfo et al. makes it seem like one could call it multiple times in sequence, stashing the results, and then doing some post-processing on them.

I'll have to think about the C++::MethodName issue a bit more, but in general, I don't think moving that class to a separate file is a too disruptive change. If it means we don't have to mess with untyped pointers, then we should just do it. (Ideally, I wouldn't want the common code to reference that plugin at all, but that ship has already sailed, so I don't think this patch should be predicated on fixing that.)

include/lldb/Symbol/Symtab.h
80–82	This is implied by the deleted copy operations.
source/Symbol/Symtab.cpp
220	If you changed the caller to use StringRef too (it seems possible at a first glance) then this would still be O(1)
271–289	Could these return StringRef instead of C strings?

zturner added a subscriber: zturner.Jul 30 2018, 9:23 AM

zturner added inline comments.

include/lldb/Symbol/Symtab.h
57	We have `llvm::Any`. Perhaps you want to use that here instead of `void*`?

sgraenitz added inline comments.Jul 30 2018, 9:41 AM

source/Symbol/Symtab.cpp
274	@erik.pilkington Is it acceptable/good practice to pass `(nullptr, 0)` here? At the moment this safes some lines of initialization checks for `m_IPD_buf` and `m_IPD_size`.

clayborg added a subscriber: clayborg.Jul 30 2018, 9:50 AM

clayborg added inline comments.

include/lldb/Core/Mangled.h
23–32	move any forward decls to lldb-forward.h and remove all manual forward declarations in lldb_private from here.
include/lldb/Symbol/Symtab.h
25	move to separate files RichManglingInfo.h and RichManglingInfo.cpp
92	move to separate files RichManglingInfo.h and RichManglingInfo.cpp

Thanks or the quick reviews! Follow-ups inline.

include/lldb/Symbol/Symtab.h
57	Thanks. I will check that.
80–82	Which are implicitly deleted too, due to the existence of the destructor right? Does LLVM/LLDB have some kind of convention for it? I like to be explicit on ctors&assignment ("rule of 5"), because it aids error messages, but I would be fine with following the existing convention here.
source/Symbol/Symtab.cpp
220	Right, thanks there's a `ConstString::GetStringRef()`. Perfect.
271–289	Yes. So far it's simply the closest superset of the two interfaces, but I will try using `StringRef` where possible.

erik.pilkington added inline comments.Jul 30 2018, 10:00 AM

source/Symbol/Symtab.cpp
274	Sure, thats fine! Those parameters act the same way as `buf` and `size` in __cxa_demangle. `getFunctionBaseName` will return nullptr if the mangled name isn't a function. Is it a precondition of this function that m_IPD stores a function? If not, it looks like you'll leak the buffer.

sgraenitz added inline comments.Jul 30 2018, 10:27 AM

include/lldb/Symbol/Symtab.h
57	@zturner Where is `llvm::Any`? Expected it in ADT or Support, but can't find it. IIUC `llvm::Optional` does something similar, but uses its own `optional_detail::OptionalStorage`. Same for `llvm::Expected`. Or is it a very recent addition?

zturner added inline comments.Jul 30 2018, 10:29 AM

include/lldb/Symbol/Symtab.h
57	It's pretty recent. I was actually the one who added it, about maybe 2 weeks ago. It's in `include/llvm/ADT/Any.h`

sgraenitz added inline comments.Jul 30 2018, 11:41 AM

source/Symbol/Symtab.cpp
274	Oh that is a very good note. I had it as a precondition in the client function in Symtab. When I removed that and started to just check the result for `nullptr`, I didn't think about the buffer. Gonna fix it, the generalized interface shouldn't have that precondition anyway. Thanks!

Simple fixes

Herald added a subscriber: mgorny. · View Herald TranscriptJul 30 2018, 12:18 PM

Moved forward decls

Sorry, I accidentally added the tests from https://reviews.llvm.org/D49909 also to this review. I will clean this up tomorrow.

Remove test code, that I added accidentally. Move RichManglingInfo and RichManglingSpec to their own header and cpp.

The part I'm not sure about is whether the RichManglingInfo vs. RichManglingSpec distinction brings any value. [...] So you might as well just have one class, pass that to DemangleWithRichManglingInfo, and then query the same object when the call returns.

The idea here was that DemangleWithRichManglingInfo() acts like a gate keeper. If it succeeds it provides read-only access to the updated RichManglingInfo in RichManglingSpec, otherwise it returns null. IMHO the value behind it is that RichManglingInfo does not need to handle a NoInfo case next to ItaniumPartialDemangler and PluginCxxLanguage in every single getter. Instead it's just not accessible in that state. (Plus: there is no maintenance functions that confuse the public interface of RichManglingInfo.) I don't know how to do this with only one class.

Maybe RichManglingSpec is not the perfect name. What about renaming it to RichManglingContext?

I mean, the lifetime of the first is tied to the lifetime of the second, and the Spec class can only have one active Info instance at any given moment.

Yes, this was handy and avoids extra heap-allocations. const RichManglingInfo * was intended to clarify: lifetimes are handled elsewhere.

The current interface with createItaniumInfo et al. makes it seem like one could call it multiple times in sequence, stashing the results, and then doing some post-processing on them.

Yes, I can see that this is implicit knowledge. Do you have an idea how to make this more explicit? Rename to SetItaniumInfo() maybe?

In the end, I definitely prefer this approach over having a NoInfo state in RichManglingInfo. What do you think?

I think there is still something wrong with the diff. I can't see any of the callers of e.g. createItaniumInfo but I can see the function on both LHS and RHS of the diff (which shouldn't be the case as it's a new function). It looks like you uploaded just an ammending patch instead of the entire work. Can you fix that?

In D49990#1182003, @sgraenitz wrote:

The part I'm not sure about is whether the RichManglingInfo vs. RichManglingSpec distinction brings any value. [...] So you might as well just have one class, pass that to DemangleWithRichManglingInfo, and then query the same object when the call returns.

The idea here was that DemangleWithRichManglingInfo() acts like a gate keeper. If it succeeds it provides read-only access to the updated RichManglingInfo in RichManglingSpec, otherwise it returns null. IMHO the value behind it is that RichManglingInfo does not need to handle a NoInfo case next to ItaniumPartialDemangler and PluginCxxLanguage in every single getter. Instead it's just not accessible in that state. (Plus: there is no maintenance functions that confuse the public interface of RichManglingInfo.) I don't know how to do this with only one class.

Maybe RichManglingSpec is not the perfect name. What about renaming it to RichManglingContext?

I mean, the lifetime of the first is tied to the lifetime of the second, and the Spec class can only have one active Info instance at any given moment.

Yes, this was handy and avoids extra heap-allocations. const RichManglingInfo * was intended to clarify: lifetimes are handled elsewhere.

The current interface with createItaniumInfo et al. makes it seem like one could call it multiple times in sequence, stashing the results, and then doing some post-processing on them.

Yes, I can see that this is implicit knowledge. Do you have an idea how to make this more explicit? Rename to SetItaniumInfo() maybe?

In the end, I definitely prefer this approach over having a NoInfo state in RichManglingInfo. What do you think?

Yes, I can see what you mean here. Neither of the solutions is particularly appealing. I guess if I were implementing this, I'd go with the "invalid state" option, though I am not sure why, as usually I am opposed to invalid states. Maybe we can leave this to the discretion of the implementor (you).

include/lldb/Symbol/Symtab.h
80–82	As far as I know, the presence of a destructor has no impact on the state of copy/move operations, so you still need to delete the copy operations explicitly. I don't know if there is an official policy on explicitly deleting move operations, but I don't remember seeing that style anywhere. However, I don't care much about that either.

sgraenitz added inline comments.Jul 31 2018, 5:28 AM

include/lldb/Symbol/Symtab.h
80–82	The generation of the implicitly-defined copy constructor is deprecated if T has a user-defined destructor or user-defined copy assignment operator. https://en.cppreference.com/w/cpp/language/copy_constructor Actually, copy has no implications here and move won't work on the const pointer. Thus I will just remove it :)
source/Core/Mangled.cpp
325	I think there is still something wrong with the diff. I can't see any of the callers of e.g. createItaniumInfo Weird. The caller is here, but not shown as a change anymore..

Fix potential leak of m_IPD_buf. Use llvm::Any instead of void*.
Rename: RichManglingSpec -> RichManglingContext, RichManglingContext::CreateXyInfo() -> RichManglingContext::SetXyInfo()

Harbormaster completed remote builds in B20895: Diff 158231.Jul 31 2018, 5:52 AM

sgraenitz marked 4 inline comments as done.Jul 31 2018, 5:57 AM

I think there is still something wrong with the diff. I can't see any of the callers of e.g. createItaniumInfo

Weird. The caller is here, but not shown as a change anymore..

I created a new review where all my changes are marked in green and red: https://reviews.llvm.org/D50071
If you have any more feedback, please let me know. I will keep the new one open for a few days, so Jim can review it when he is back from vacation.

Revision Contents

Path

Size

include/

lldb/

Core/

Mangled.h

18 lines

RichManglingInfo.h

95 lines

Symbol/

Symtab.h

84 lines

lldb-forward.h

2 lines

source/

Core/

CMakeLists.txt

1 line

Mangled.cpp

9 lines

RichManglingInfo.cpp

93 lines

Symbol/

Symtab.cpp

89 lines

Diff 158231

include/lldb/Core/Mangled.h

//===-- Mangled.h ------------------------------------------------ C++ --===//		//===-- Mangled.h ------------------------------------------------ C++ --===//
//		//
// The LLVM Compiler Infrastructure		// The LLVM Compiler Infrastructure
//		//
// This file is distributed under the University of Illinois Open Source		// This file is distributed under the University of Illinois Open Source
// License. See LICENSE.TXT for details.		// License. See LICENSE.TXT for details.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef liblldb_Mangled_h_		#ifndef liblldb_Mangled_h_
#define liblldb_Mangled_h_		#define liblldb_Mangled_h_
#if defined(__cplusplus)		#if defined(__cplusplus)

#include "lldb/Utility/ConstString.h"
#include "lldb/lldb-enumerations.h"		#include "lldb/lldb-enumerations.h"
		#include "lldb/lldb-forward.h"
		#include "lldb/Utility/ConstString.h"
#include "llvm/ADT/StringRef.h"		#include "llvm/ADT/StringRef.h"

#include <memory>		#include <memory>
#include <stddef.h>		#include <stddef.h>

namespace lldb_private {		namespace lldb_private {

//----------------------------------------------------------------------		//----------------------------------------------------------------------
/// @class Mangled Mangled.h "lldb/Core/Mangled.h"		/// @class Mangled Mangled.h "lldb/Core/Mangled.h"
/// A class that handles mangled names.		/// A class that handles mangled names.
///		///
/// Designed to handle mangled names. The demangled version of any names will		/// Designed to handle mangled names. The demangled version of any names will
/// be computed when the demangled name is accessed through the Demangled()		/// be computed when the demangled name is accessed through the Demangled()
/// acccessor. This class can also tokenize the demangled version of the name		/// acccessor. This class can also tokenize the demangled version of the name
/// for powerful searches. Functions and symbols could make instances of this		/// for powerful searches. Functions and symbols could make instances of this
/// class for their mangled names. Uniqued string pools are used for the		/// class for their mangled names. Uniqued string pools are used for the
		clayborgUnsubmitted Done Reply Inline Actions move any forward decls to lldb-forward.h and remove all manual forward declarations in lldb_private from here. clayborg: move any forward decls to lldb-forward.h and remove all manual forward declarations in…
/// mangled, demangled, and token string values to allow for faster		/// mangled, demangled, and token string values to allow for faster
/// comparisons and for efficient memory use.		/// comparisons and for efficient memory use.
//----------------------------------------------------------------------		//----------------------------------------------------------------------
class Mangled {		class Mangled {
public:		public:
enum NamePreference {		enum NamePreference {
ePreferMangled,		ePreferMangled,
ePreferDemangled,		ePreferDemangled,
▲ Show 20 Lines • Show All 261 Lines • ▼ Show 20 Lines	public:
/// optimized for batch processing while populating a name index. To get the		/// optimized for batch processing while populating a name index. To get the
/// pure demangled name string for a single entity, use GetDemangledName()		/// pure demangled name string for a single entity, use GetDemangledName()
/// instead.		/// instead.
///		///
/// For names that match the Itanium mangling scheme, this uses LLVM's		/// For names that match the Itanium mangling scheme, this uses LLVM's
/// ItaniumPartialDemangler. All other names fall back to LLDB's builtin		/// ItaniumPartialDemangler. All other names fall back to LLDB's builtin
/// parser currently.		/// parser currently.
///		///
/// @param[in] spec		/// This function is thread-safe when used with different \a context
/// The RichManglingSpec that provides the context for this function. One		/// instances in different threads.
/// instance can be used for multiple calls. Should be stack-allocated in		///
/// the caller's frame.		/// @param[in] context
		/// The context for this function. A single instance can be stack-
		/// allocated in the caller's frame and used for multiple calls.
///		///
/// @param[in] skip_mangled_name		/// @param[in] skip_mangled_name
/// A filtering function for skipping entities based on name and mangling		/// A filtering function for skipping entities based on name and mangling
/// scheme. This can be null if unused.		/// scheme. This can be null if unused.
///		///
/// @return		/// @return
/// The rich mangling info on success, null otherwise.		/// The rich mangling info on success, null otherwise. Expect the pointer
		/// to be valid only until the next call to this funtion.
//----------------------------------------------------------------------		//----------------------------------------------------------------------
const RichManglingInfo *		const RichManglingInfo *
DemangleWithRichManglingInfo(RichManglingSpec &spec,		DemangleWithRichManglingInfo(RichManglingContext &context,
SkipMangledNameFn *skip_mangled_name);		SkipMangledNameFn *skip_mangled_name);

private:		private:
//----------------------------------------------------------------------		//----------------------------------------------------------------------
/// Mangled member variables.		/// Mangled member variables.
//----------------------------------------------------------------------		//----------------------------------------------------------------------
ConstString m_mangled; ///< The mangled version of the name		ConstString m_mangled; ///< The mangled version of the name
mutable ConstString m_demangled; ///< Mutable so we can get it on demand with		mutable ConstString m_demangled; ///< Mutable so we can get it on demand with
Show All 9 Lines

include/lldb/Core/RichManglingInfo.h

This file was added.

				//===-- RichManglingInfo.h --------------------------------------- C++ --===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//

				#ifndef liblldb_RichManglingInfo_h_
				#define liblldb_RichManglingInfo_h_

				#include "lldb/lldb-private.h"
				#include "llvm/ADT/Any.h"
				#include "llvm/Demangle/Demangle.h"

				namespace lldb_private {

				/// Uniform wrapper for access to rich mangling information from different
				/// providers. See Mangled::DemangleWithRichManglingInfo()
				class RichManglingInfo {
				public:
				/// If this symbol describes a constructor or destructor.
				bool IsCtorOrDtor() const;

				/// If this symbol describes a function.
				bool IsFunction() const;

				/// Get the base name of a function. This doesn't include trailing template
				/// arguments, ie for "a::b<int>" this function returns "b".
				const char *GetFunctionBaseName() const;

				/// Get the context name for a function. For "a::b::c", this function returns
				/// "a::b".
				const char *GetFunctionDeclContextName() const;

				private:
				enum InfoProvider { ItaniumPartialDemangler, PluginCxxLanguage };

				/// Selects the rich mangling info provider. Initially undefined. Configured
				/// from RichManglingContext::SetX (instance not accessible before).
				InfoProvider m_provider;

				/// Members for ItaniumPartialDemangler
				llvm::ItaniumPartialDemangler *m_IPD = nullptr;
				mutable size_t m_IPD_size = 0;
				mutable char *m_IPD_buf = nullptr;

				/// Members for PluginCxxLanguage
				/// Cannot forward declare inner class CPlusPlusLanguage::MethodName. The
				/// respective header is in Plugins and including it from here causes cyclic
				/// dependency. Instead keep a llvm::Any and cast it on-access in the cpp.
				mutable llvm::Any m_legacy_parser;

				/// Obtain the legacy parser casted to the given type. Ideally we had a type
				/// trait to deduce \a ParserT from a given InfoProvider, but unfortunately we
				/// can't access CPlusPlusLanguage::MethodName from within the header.
				template <class ParserT> ParserT *get() const {
				assert(m_legacy_parser.hasValue());
				assert(llvm::any_isa<ParserT>(m_legacy_parser));
				return llvm::any_cast<ParserT *>(m_legacy_parser);
				}

				/// Reset the provider and clean up memory before reassigning/destroying.
				void ResetProvider();

				// Default construction in undefined state from RichManglingContext.
				RichManglingInfo() = default;

				// Destruction from RichManglingContext.
				~RichManglingInfo();

				// Declare RichManglingContext as friend so it can access the default ctor and
				// assign to members in its SetX methods.
				friend class RichManglingContext;
				};

				//----------------------------------------------------------------------

				/// Unique owner of RichManglingInfo. Handles configuration and lifetime.
				class RichManglingContext {
				public:
				RichManglingInfo *SetItaniumInfo();
				RichManglingInfo *SetLegacyCxxParserInfo(const ConstString &mangled);

				llvm::ItaniumPartialDemangler &GetIPD() { return m_IPD; }

				private:
				RichManglingInfo m_info;
				llvm::ItaniumPartialDemangler m_IPD;
				};

				} // namespace lldb_private

				#endif

include/lldb/Symbol/Symtab.h

	Show All 11 Lines

	#include <mutex>			#include <mutex>
	#include <vector>			#include <vector>

	#include "lldb/Core/RangeMap.h"			#include "lldb/Core/RangeMap.h"
	#include "lldb/Core/UniqueCStringMap.h"			#include "lldb/Core/UniqueCStringMap.h"
	#include "lldb/Symbol/Symbol.h"			#include "lldb/Symbol/Symbol.h"
	#include "lldb/lldb-private.h"			#include "lldb/lldb-private.h"
	#include "llvm/Demangle/Demangle.h"

	namespace lldb_private {			namespace lldb_private {

	/// Uniform wrapper for access to rich mangling information from different
	/// providers. See Mangled::DemangleWithRichManglingInfo()
	class RichManglingInfo {
	public:
	/// If this symbol describes a constructor or destructor.
	bool isCtorOrDtor() const;

	/// If this symbol describes a function.
	bool isFunction() const;

	/// Get the base name of a function. This doesn't include trailing template
	/// arguments, ie for "a::b<int>" this function returns "b".
	const char *getFunctionBaseName() const;

	/// Get the context name for a function. For "a::b::c", this function returns
	/// "a::b".
	const char *getFunctionDeclContextName() const;

	private:
	enum InfoProvider { ItaniumPartialDemangler, PluginCxxLanguage };

	/// Selects the rich mangling info provider. Initially undefined, but
	/// initialized in RichManglingSpec::CreateX (instance not accessible before).
	InfoProvider m_provider;

	/// Members for ItaniumPartialDemangler
	llvm::ItaniumPartialDemangler *m_IPD = nullptr;
	mutable size_t m_IPD_size = 0;
	mutable char *m_IPD_buf = nullptr;

	/// Members for PluginCxxLanguage
	/// Cannot forward declare inner class CPlusPlusLanguage::MethodName. The
	/// respective header is in Plugins and including it from here causes cyclic
	/// dependency. Keep a void* here instead and cast it on-demand on the cpp.
	void *m_legacy_parser = nullptr;

	/// Obtain the legacy parser casted to the given type. Ideally we had a type
	/// trait to deduce \a ParserT from a given InfoProvider, but unfortunately we
	/// can't access CPlusPlusLanguage::MethodName from within the header.
	template <class ParserT> ParserT *get() const {
	assert(m_legacy_parser);
	return reinterpret_cast<ParserT *>(m_legacy_parser);
	}

	/// Reset the provider and clean up memory before reassigning/destroying.
	void ResetProvider();

	// Default construction in undefined state from RichManglingSpec.
	RichManglingInfo() = default;

	// Destruction from RichManglingSpec.
	~RichManglingInfo();

	// No copy
	RichManglingInfo(const RichManglingInfo &) = delete;
	RichManglingInfo &operator=(const RichManglingInfo &) = delete;

	// No move
	RichManglingInfo(RichManglingInfo &&) = delete;
	RichManglingInfo &operator=(RichManglingInfo &&) = delete;

	// Declare RichManglingSpec as friend so it can access the default ctor and
	// assign to members in its CreateX methods.
	friend class RichManglingSpec;
	};

	//----------------------------------------------------------------------

	/// Unique owner of RichManglingInfo. Handles initialization and lifetime.
	class RichManglingSpec {
	public:
	RichManglingInfo *CreateItaniumInfo();
	RichManglingInfo *CreateLegacyCxxParserInfo(const ConstString &mangled);

	llvm::ItaniumPartialDemangler &GetIPD() { return m_IPD; }

	private:
	RichManglingInfo m_info;
	llvm::ItaniumPartialDemangler m_IPD;
	};

	//----------------------------------------------------------------------

	class Symtab {			class Symtab {
	public:			public:
	typedef std::vector<uint32_t> IndexCollection;			typedef std::vector<uint32_t> IndexCollection;
				clayborgUnsubmitted Done Reply Inline Actions move to separate files RichManglingInfo.h and RichManglingInfo.cpp clayborg: move to separate files RichManglingInfo.h and RichManglingInfo.cpp
	typedef UniqueCStringMap<uint32_t> NameToIndexMap;			typedef UniqueCStringMap<uint32_t> NameToIndexMap;

	typedef enum Debug {			typedef enum Debug {
	eDebugNo, // Not a debug symbol			eDebugNo, // Not a debug symbol
	eDebugYes, // A debug symbol			eDebugYes, // A debug symbol
	eDebugAny			eDebugAny
	} Debug;			} Debug;

	typedef enum Visibility {			typedef enum Visibility {
	eVisibilityAny,			eVisibilityAny,
	eVisibilityExtern,			eVisibilityExtern,
	eVisibilityPrivate			eVisibilityPrivate
	} Visibility;			} Visibility;

	Symtab(ObjectFile *objfile);			Symtab(ObjectFile *objfile);
	~Symtab();			~Symtab();

				sgraenitzAuthorUnsubmitted Not Done Reply Inline Actions We don't need a None-case here. sgraenitz: We don't need a None-case here.
	void PreloadSymbols();			void PreloadSymbols();
	void Reserve(size_t count);			void Reserve(size_t count);
	Symbol *Resize(size_t count);			Symbol *Resize(size_t count);
	uint32_t AddSymbol(const Symbol &symbol);			uint32_t AddSymbol(const Symbol &symbol);
	size_t GetNumSymbols() const;			size_t GetNumSymbols() const;
	void SectionFileAddressesChanged();			void SectionFileAddressesChanged();
	void Dump(Stream s, Target target, SortOrder sort_type);			void Dump(Stream s, Target target, SortOrder sort_type);
	void Dump(Stream s, Target target, std::vector<uint32_t> &indexes) const;			void Dump(Stream s, Target target, std::vector<uint32_t> &indexes) const;
	uint32_t GetIndexForSymbol(const Symbol *symbol) const;			uint32_t GetIndexForSymbol(const Symbol *symbol) const;
	std::recursive_mutex &GetMutex() { return m_mutex; }			std::recursive_mutex &GetMutex() { return m_mutex; }
	Symbol *FindSymbolByID(lldb::user_id_t uid) const;			Symbol *FindSymbolByID(lldb::user_id_t uid) const;
	Symbol *SymbolAtIndex(size_t idx);			Symbol *SymbolAtIndex(size_t idx);
	const Symbol *SymbolAtIndex(size_t idx) const;			const Symbol *SymbolAtIndex(size_t idx) const;
	Symbol *FindSymbolWithType(lldb::SymbolType symbol_type,			Symbol *FindSymbolWithType(lldb::SymbolType symbol_type,
	Debug symbol_debug_type,			Debug symbol_debug_type,
				sgraenitzAuthorUnsubmitted Not Done Reply Inline Actions This is the hackiest point I guess. sgraenitz: This is the hackiest point I guess.
				zturnerUnsubmitted Done Reply Inline Actions We have `llvm::Any`. Perhaps you want to use that here instead of `void`? zturner:* We have `llvm::Any`. Perhaps you want to use that here instead of `void*`?
				sgraenitzAuthorUnsubmitted Not Done Reply Inline Actions Thanks. I will check that. sgraenitz: Thanks. I will check that.
				sgraenitzAuthorUnsubmitted Not Done Reply Inline Actions @zturner Where is `llvm::Any`? Expected it in ADT or Support, but can't find it. IIUC `llvm::Optional` does something similar, but uses its own `optional_detail::OptionalStorage`. Same for `llvm::Expected`. Or is it a very recent addition? sgraenitz: @zturner Where is `llvm::Any`? Expected it in ADT or Support, but can't find it. IIUC `llvm…
				zturnerUnsubmitted Not Done Reply Inline Actions It's pretty recent. I was actually the one who added it, about maybe 2 weeks ago. It's in `include/llvm/ADT/Any.h` zturner: It's pretty recent. I was actually the one who added it, about maybe 2 weeks ago. It's in…
	Visibility symbol_visibility, uint32_t &start_idx);			Visibility symbol_visibility, uint32_t &start_idx);
	//----------------------------------------------------------------------			//----------------------------------------------------------------------
	/// Get the parent symbol for the given symbol.			/// Get the parent symbol for the given symbol.
	///			///
	/// Many symbols in symbol tables are scoped by other symbols that			/// Many symbols in symbol tables are scoped by other symbols that
	/// contain one or more symbol. This function will look for such a			/// contain one or more symbol. This function will look for such a
	/// containing symbol and return it if there is one.			/// containing symbol and return it if there is one.
	//----------------------------------------------------------------------			//----------------------------------------------------------------------
	const Symbol GetParent(Symbol symbol) const;			const Symbol GetParent(Symbol symbol) const;
	uint32_t AppendSymbolIndexesWithType(lldb::SymbolType symbol_type,			uint32_t AppendSymbolIndexesWithType(lldb::SymbolType symbol_type,
	std::vector<uint32_t> &indexes,			std::vector<uint32_t> &indexes,
	uint32_t start_idx = 0,			uint32_t start_idx = 0,
	uint32_t end_index = UINT32_MAX) const;			uint32_t end_index = UINT32_MAX) const;
	uint32_t AppendSymbolIndexesWithTypeAndFlagsValue(			uint32_t AppendSymbolIndexesWithTypeAndFlagsValue(
	lldb::SymbolType symbol_type, uint32_t flags_value,			lldb::SymbolType symbol_type, uint32_t flags_value,
	std::vector<uint32_t> &indexes, uint32_t start_idx = 0,			std::vector<uint32_t> &indexes, uint32_t start_idx = 0,
	uint32_t end_index = UINT32_MAX) const;			uint32_t end_index = UINT32_MAX) const;
	uint32_t AppendSymbolIndexesWithType(lldb::SymbolType symbol_type,			uint32_t AppendSymbolIndexesWithType(lldb::SymbolType symbol_type,
	Debug symbol_debug_type,			Debug symbol_debug_type,
	Visibility symbol_visibility,			Visibility symbol_visibility,
	std::vector<uint32_t> &matches,			std::vector<uint32_t> &matches,
	uint32_t start_idx = 0,			uint32_t start_idx = 0,
	uint32_t end_index = UINT32_MAX) const;			uint32_t end_index = UINT32_MAX) const;
	uint32_t AppendSymbolIndexesWithName(const ConstString &symbol_name,			uint32_t AppendSymbolIndexesWithName(const ConstString &symbol_name,
	std::vector<uint32_t> &matches);			std::vector<uint32_t> &matches);
				labathUnsubmitted Done Reply Inline Actions This is implied by the deleted copy operations. labath: This is implied by the deleted copy operations.
				sgraenitzAuthorUnsubmitted Not Done Reply Inline Actions Which are implicitly deleted too, due to the existence of the destructor right? Does LLVM/LLDB have some kind of convention for it? I like to be explicit on ctors&assignment ("rule of 5"), because it aids error messages, but I would be fine with following the existing convention here. sgraenitz: Which are implicitly deleted too, due to the existence of the destructor right? Does LLVM/LLDB…
				labathUnsubmitted Not Done Reply Inline Actions As far as I know, the presence of a destructor has no impact on the state of copy/move operations, so you still need to delete the copy operations explicitly. I don't know if there is an official policy on explicitly deleting move operations, but I don't remember seeing that style anywhere. However, I don't care much about that either. labath: As far as I know, the presence of a destructor has no impact on the state of copy/move…
				sgraenitzAuthorUnsubmitted Done Reply Inline Actions The generation of the implicitly-defined copy constructor is deprecated if T has a user-defined destructor or user-defined copy assignment operator. https://en.cppreference.com/w/cpp/language/copy_constructor Actually, copy has no implications here and move won't work on the const pointer. Thus I will just remove it :) sgraenitz: > The generation of the implicitly-defined copy constructor is deprecated if T has a user…
	uint32_t AppendSymbolIndexesWithName(const ConstString &symbol_name,			uint32_t AppendSymbolIndexesWithName(const ConstString &symbol_name,
	Debug symbol_debug_type,			Debug symbol_debug_type,
	Visibility symbol_visibility,			Visibility symbol_visibility,
	std::vector<uint32_t> &matches);			std::vector<uint32_t> &matches);
	uint32_t AppendSymbolIndexesWithNameAndType(const ConstString &symbol_name,			uint32_t AppendSymbolIndexesWithNameAndType(const ConstString &symbol_name,
	lldb::SymbolType symbol_type,			lldb::SymbolType symbol_type,
	std::vector<uint32_t> &matches);			std::vector<uint32_t> &matches);
	uint32_t AppendSymbolIndexesWithNameAndType(const ConstString &symbol_name,			uint32_t AppendSymbolIndexesWithNameAndType(const ConstString &symbol_name,
	lldb::SymbolType symbol_type,			lldb::SymbolType symbol_type,
	Debug symbol_debug_type,			Debug symbol_debug_type,
				clayborgUnsubmitted Done Reply Inline Actions move to separate files RichManglingInfo.h and RichManglingInfo.cpp clayborg: move to separate files RichManglingInfo.h and RichManglingInfo.cpp
	Visibility symbol_visibility,			Visibility symbol_visibility,
	std::vector<uint32_t> &matches);			std::vector<uint32_t> &matches);
	uint32_t			uint32_t
	AppendSymbolIndexesMatchingRegExAndType(const RegularExpression &regex,			AppendSymbolIndexesMatchingRegExAndType(const RegularExpression &regex,
	lldb::SymbolType symbol_type,			lldb::SymbolType symbol_type,
	std::vector<uint32_t> &indexes);			std::vector<uint32_t> &indexes);
	uint32_t AppendSymbolIndexesMatchingRegExAndType(			uint32_t AppendSymbolIndexesMatchingRegExAndType(
	const RegularExpression &regex, lldb::SymbolType symbol_type,			const RegularExpression &regex, lldb::SymbolType symbol_type,
	Debug symbol_debug_type, Visibility symbol_visibility,			Debug symbol_debug_type, Visibility symbol_visibility,
	std::vector<uint32_t> &indexes);			std::vector<uint32_t> &indexes);
	size_t FindAllSymbolsWithNameAndType(const ConstString &name,			size_t FindAllSymbolsWithNameAndType(const ConstString &name,
	lldb::SymbolType symbol_type,			lldb::SymbolType symbol_type,
	std::vector<uint32_t> &symbol_indexes);			std::vector<uint32_t> &symbol_indexes);
				sgraenitzAuthorUnsubmitted Done Reply Inline Actions ^^^^^ May have its own header & cpp sgraenitz: ^^^^^ May have its own header & cpp
	size_t FindAllSymbolsWithNameAndType(const ConstString &name,			size_t FindAllSymbolsWithNameAndType(const ConstString &name,
	lldb::SymbolType symbol_type,			lldb::SymbolType symbol_type,
	Debug symbol_debug_type,			Debug symbol_debug_type,
	Visibility symbol_visibility,			Visibility symbol_visibility,
	std::vector<uint32_t> &symbol_indexes);			std::vector<uint32_t> &symbol_indexes);
	size_t FindAllSymbolsMatchingRexExAndType(			size_t FindAllSymbolsMatchingRexExAndType(
	const RegularExpression &regex, lldb::SymbolType symbol_type,			const RegularExpression &regex, lldb::SymbolType symbol_type,
	Debug symbol_debug_type, Visibility symbol_visibility,			Debug symbol_debug_type, Visibility symbol_visibility,
	▲ Show 20 Lines • Show All 97 Lines • Show Last 20 Lines

include/lldb/lldb-forward.h

	Show First 20 Lines • Show All 186 Lines • ▼ Show 20 Lines
	class RegisterCheckpoint;			class RegisterCheckpoint;
	class RegisterContext;			class RegisterContext;
	class RegisterLocation;			class RegisterLocation;
	class RegisterLocationList;			class RegisterLocationList;
	class RegisterValue;			class RegisterValue;
	class RegularExpression;			class RegularExpression;
	class REPL;			class REPL;
	class RichManglingInfo;			class RichManglingInfo;
	class RichManglingSpec;			class RichManglingContext;
	class Scalar;			class Scalar;
	class ScriptInterpreter;			class ScriptInterpreter;
	class ScriptInterpreterLocker;			class ScriptInterpreterLocker;
	struct ScriptSummaryFormat;			struct ScriptSummaryFormat;
	class SearchFilter;			class SearchFilter;
	class Section;			class Section;
	class SectionImpl;			class SectionImpl;
	class SectionList;			class SectionList;
	▲ Show 20 Lines • Show All 304 Lines • Show Last 20 Lines

source/Core/CMakeLists.txt

Show All 28 Lines	add_lldb_library(lldbCore
Listener.cpp		Listener.cpp
Mangled.cpp		Mangled.cpp
Module.cpp		Module.cpp
ModuleChild.cpp		ModuleChild.cpp
ModuleList.cpp		ModuleList.cpp
Opcode.cpp		Opcode.cpp
PluginManager.cpp		PluginManager.cpp
RegisterValue.cpp		RegisterValue.cpp
		RichManglingInfo.cpp
Scalar.cpp		Scalar.cpp
SearchFilter.cpp		SearchFilter.cpp
Section.cpp		Section.cpp
SourceManager.cpp		SourceManager.cpp
State.cpp		State.cpp
StreamAsynchronousIO.cpp		StreamAsynchronousIO.cpp
StreamFile.cpp		StreamFile.cpp
UserSettingsController.cpp		UserSettingsController.cpp
Show All 39 Lines

source/Core/Mangled.cpp

Show All 10 Lines

#if defined(_WIN32)		#if defined(_WIN32)
#include "lldb/Host/windows/windows.h"		#include "lldb/Host/windows/windows.h"

#include <dbghelp.h>		#include <dbghelp.h>
#pragma comment(lib, "dbghelp.lib")		#pragma comment(lib, "dbghelp.lib")
#endif		#endif

		#include "lldb/Core/RichManglingInfo.h"
#include "lldb/Utility/ConstString.h"		#include "lldb/Utility/ConstString.h"
#include "lldb/Utility/Log.h"		#include "lldb/Utility/Log.h"
#include "lldb/Utility/Logging.h"		#include "lldb/Utility/Logging.h"
#include "lldb/Utility/RegularExpression.h"		#include "lldb/Utility/RegularExpression.h"
#include "lldb/Utility/Stream.h"		#include "lldb/Utility/Stream.h"
#include "lldb/Utility/Timer.h"		#include "lldb/Utility/Timer.h"
#include "lldb/lldb-enumerations.h"		#include "lldb/lldb-enumerations.h"

▲ Show 20 Lines • Show All 163 Lines • ▼ Show 20 Lines
}		}

//----------------------------------------------------------------------		//----------------------------------------------------------------------
// Compare the string values.		// Compare the string values.
//----------------------------------------------------------------------		//----------------------------------------------------------------------
int Mangled::Compare(const Mangled &a, const Mangled &b) {		int Mangled::Compare(const Mangled &a, const Mangled &b) {
return ConstString::Compare(		return ConstString::Compare(
a.GetName(lldb::eLanguageTypeUnknown, ePreferMangled),		a.GetName(lldb::eLanguageTypeUnknown, ePreferMangled),
b.GetName(lldb::eLanguageTypeUnknown, ePreferMangled));		b.GetName(lldb::eLanguageTypeUnknown, ePreferMangled));
		sgraenitzAuthorUnsubmitted Not Done Reply Inline Actions Fixing bug: This is no dead code, but well, maybe in a rare branch. sgraenitz: Fixing bug: This is no dead code, but well, maybe in a rare branch.
}		}

//----------------------------------------------------------------------		//----------------------------------------------------------------------
// Set the string value in this objects. If "mangled" is true, then the mangled		// Set the string value in this objects. If "mangled" is true, then the mangled
// named is set with the new value in "s", else the demangled name is set.		// named is set with the new value in "s", else the demangled name is set.
//----------------------------------------------------------------------		//----------------------------------------------------------------------
void Mangled::SetValue(const ConstString &s, bool mangled) {		void Mangled::SetValue(const ConstString &s, bool mangled) {
if (s) {		if (s) {
▲ Show 20 Lines • Show All 77 Lines • ▼ Show 20 Lines
}		}
} // namespace		} // namespace

//----------------------------------------------------------------------		//----------------------------------------------------------------------
// Explicit demangling for scheduled requests during batch processing. This		// Explicit demangling for scheduled requests during batch processing. This
// makes use of ItaniumPartialDemangler's rich demangle info		// makes use of ItaniumPartialDemangler's rich demangle info
//----------------------------------------------------------------------		//----------------------------------------------------------------------
const RichManglingInfo *		const RichManglingInfo *
Mangled::DemangleWithRichManglingInfo(RichManglingSpec &spec,		Mangled::DemangleWithRichManglingInfo(RichManglingContext &context,
SkipMangledNameFn *skip_mangled_name) {		SkipMangledNameFn *skip_mangled_name) {
// We need to generate and cache the demangled name.		// We need to generate and cache the demangled name.
static Timer::Category func_cat(LLVM_PRETTY_FUNCTION);		static Timer::Category func_cat(LLVM_PRETTY_FUNCTION);
Timer scoped_timer(func_cat,		Timer scoped_timer(func_cat,
"Mangled::DemangleWithRichNameIndexInfo (m_mangled = %s)",		"Mangled::DemangleWithRichNameIndexInfo (m_mangled = %s)",
m_mangled.GetCString());		m_mangled.GetCString());

// Others are not meant to arrive here. ObjC names or C's main() for example		// Others are not meant to arrive here. ObjC names or C's main() for example
Show All 9 Lines	Mangled::DemangleWithRichManglingInfo(RichManglingContext &context,
switch (S) {		switch (S) {
case eManglingSchemeNone:		case eManglingSchemeNone:
// The current mangled_name_filter would allow llvm_unreachable here.		// The current mangled_name_filter would allow llvm_unreachable here.
return nullptr;		return nullptr;

case eManglingSchemeItanium:		case eManglingSchemeItanium:
// We want the rich mangling info here, so we don't care whether or not		// We want the rich mangling info here, so we don't care whether or not
// there is a demangled string in the pool already.		// there is a demangled string in the pool already.
if (char *D = GetItaniumRichDemangleInfo(M.data(), spec.GetIPD())) {		if (char *D = GetItaniumRichDemangleInfo(M.data(), context.GetIPD())) {
// Connect the counterparts in the string pool to accelerate subsequent		// Connect the counterparts in the string pool to accelerate subsequent
// access in GetDemangledName().		// access in GetDemangledName().
m_demangled.SetCStringWithMangledCounterpart(D, m_mangled);		m_demangled.SetCStringWithMangledCounterpart(D, m_mangled);
std::free(D);		std::free(D);

return spec.CreateItaniumInfo();		return context.SetItaniumInfo();
		sgraenitzAuthorUnsubmitted Not Done Reply Inline Actions I think there is still something wrong with the diff. I can't see any of the callers of e.g. createItaniumInfo Weird. The caller is here, but not shown as a change anymore.. sgraenitz: > I think there is still something wrong with the diff. I can't see any of the callers of e.g.
} else {		} else {
m_demangled.SetCString("");		m_demangled.SetCString("");
return nullptr;		return nullptr;
}		}

case eManglingSchemeMSVC: {		case eManglingSchemeMSVC: {
// We have no rich mangling for MSVC-mangled names yet, so first try to		// We have no rich mangling for MSVC-mangled names yet, so first try to
// demangle it if necessary.		// demangle it if necessary.
Show All 9 Lines	case eManglingSchemeMSVC: {
}		}

if (m_demangled.IsEmpty()) {		if (m_demangled.IsEmpty()) {
// Cannot demangle it, so don't try parsing.		// Cannot demangle it, so don't try parsing.
return nullptr;		return nullptr;
} else {		} else {
// Demangled successfully, we can try and parse it with		// Demangled successfully, we can try and parse it with
// CPlusPlusLanguage::MethodName.		// CPlusPlusLanguage::MethodName.
return spec.CreateLegacyCxxParserInfo(m_mangled);		return context.SetLegacyCxxParserInfo(m_mangled);
}		}
}		}
}		}
}		}

//----------------------------------------------------------------------		//----------------------------------------------------------------------
// Generate the demangled name on demand using this accessor. Code in this		// Generate the demangled name on demand using this accessor. Code in this
// class will need to use this accessor if it wishes to decode the demangled		// class will need to use this accessor if it wishes to decode the demangled
// name. The result is cached and will be kept until a new string value is		// name. The result is cached and will be kept until a new string value is
// supplied to this object, or until the end of the object's lifetime.		// supplied to this object, or until the end of the object's lifetime.
//----------------------------------------------------------------------		//----------------------------------------------------------------------
const ConstString &		const ConstString &
Mangled::GetDemangledName(lldb::LanguageType language) const {		Mangled::GetDemangledName(lldb::LanguageType language) const {
// Check to make sure we have a valid mangled name and that we haven't		// Check to make sure we have a valid mangled name and that we haven't
// already decoded our mangled name.		// already decoded our mangled name.
if (m_mangled && m_demangled.IsNull()) {		if (m_mangled && m_demangled.IsNull()) {
		sgraenitzAuthorUnsubmitted Not Done Reply Inline Actions Using the difference between null and empty. sgraenitz: Using the difference between null and empty.
// We need to generate and cache the demangled name.		// We need to generate and cache the demangled name.
static Timer::Category func_cat(LLVM_PRETTY_FUNCTION);		static Timer::Category func_cat(LLVM_PRETTY_FUNCTION);
Timer scoped_timer(func_cat, "Mangled::GetDemangledName (m_mangled = %s)",		Timer scoped_timer(func_cat, "Mangled::GetDemangledName (m_mangled = %s)",
m_mangled.GetCString());		m_mangled.GetCString());

// Don't bother running anything that isn't mangled		// Don't bother running anything that isn't mangled
const char *mangled_name = m_mangled.GetCString();		const char *mangled_name = m_mangled.GetCString();
ManglingScheme mangling_scheme{cstring_mangling_scheme(mangled_name)};		ManglingScheme mangling_scheme{cstring_mangling_scheme(mangled_name)};
Show All 14 Lines	if (mangling_scheme != eManglingSchemeNone &&
case eManglingSchemeNone:		case eManglingSchemeNone:
llvm_unreachable("eManglingSchemeNone was handled already");		llvm_unreachable("eManglingSchemeNone was handled already");
}		}
if (demangled_name) {		if (demangled_name) {
m_demangled.SetCStringWithMangledCounterpart(demangled_name, m_mangled);		m_demangled.SetCStringWithMangledCounterpart(demangled_name, m_mangled);
free(demangled_name);		free(demangled_name);
}		}
}		}
if (m_demangled.IsNull()) {		if (m_demangled.IsNull()) {
		sgraenitzAuthorUnsubmitted Not Done Reply Inline Actions Using the difference between null and empty. sgraenitz: Using the difference between null and empty.
// Set the demangled string to the empty string to indicate we tried to		// Set the demangled string to the empty string to indicate we tried to
// parse it once and failed.		// parse it once and failed.
m_demangled.SetCString("");		m_demangled.SetCString("");
}		}
}		}

return m_demangled;		return m_demangled;
}		}
▲ Show 20 Lines • Show All 120 Lines • Show Last 20 Lines

source/Core/RichManglingInfo.cpp

This file was added.

				//===-- RichManglingInfo.cpp ------------------------------------- C++ --===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//

				#include "lldb/Core/RichManglingInfo.h"
				#include "lldb/Utility/ConstString.h"

				#include "Plugins/Language/CPlusPlus/CPlusPlusLanguage.h"

				using namespace lldb;
				using namespace lldb_private;

				void RichManglingInfo::ResetProvider() {
				// If we want to support parsers for other languages some day, we need a
				// switch here to delete the correct parser type.
				if (m_legacy_parser.hasValue()) {
				assert(m_provider == RichManglingInfo::PluginCxxLanguage);
				delete get<CPlusPlusLanguage::MethodName>();
				m_legacy_parser = nullptr;
				}
				}

				RichManglingInfo *RichManglingContext::SetItaniumInfo() {
				m_info.ResetProvider();
				m_info.m_provider = RichManglingInfo::ItaniumPartialDemangler;
				m_info.m_IPD = &m_IPD;
				return &m_info;
				}

				RichManglingInfo *
				RichManglingContext::SetLegacyCxxParserInfo(const ConstString &mangled) {
				m_info.ResetProvider();
				m_info.m_provider = RichManglingInfo::PluginCxxLanguage;
				m_info.m_legacy_parser = new CPlusPlusLanguage::MethodName(mangled);
				return &m_info;
				}

				RichManglingInfo::~RichManglingInfo() {
				ResetProvider();
				delete m_IPD_buf;
				}

				bool RichManglingInfo::IsCtorOrDtor() const {
				switch (m_provider) {
				case ItaniumPartialDemangler:
				return m_IPD->isCtorOrDtor();
				case PluginCxxLanguage: {
				// We can only check for destructors here.
				auto base_name = get<CPlusPlusLanguage::MethodName>()->GetBasename();
				return base_name.front() == '~';
				}
				}
				}

				bool RichManglingInfo::IsFunction() const {
				switch (m_provider) {
				case ItaniumPartialDemangler:
				return m_IPD->isFunction();
				case PluginCxxLanguage:
				return get<CPlusPlusLanguage::MethodName>()->IsValid();
				}
				}

				const char *RichManglingInfo::GetFunctionBaseName() const {
				switch (m_provider) {
				case ItaniumPartialDemangler:
				if (auto buf = m_IPD->getFunctionBaseName(m_IPD_buf, &m_IPD_size)) {
				m_IPD_buf = buf;
				return buf;
				}
				return nullptr;
				case PluginCxxLanguage:
				return get<CPlusPlusLanguage::MethodName>()->GetBasename().data();
				}
				}

				const char *RichManglingInfo::GetFunctionDeclContextName() const {
				switch (m_provider) {
				case ItaniumPartialDemangler:
				if (auto buf = m_IPD->getFunctionDeclContextName(m_IPD_buf, &m_IPD_size)) {
				m_IPD_buf = buf;
				return buf;
				}
				return nullptr;
				case PluginCxxLanguage:
				return get<CPlusPlusLanguage::MethodName>()->GetContext().data();
				}
				}

source/Symbol/Symtab.cpp

//===-- Symtab.cpp ----------------------------------------------- C++ --===//		//===-- Symtab.cpp ----------------------------------------------- C++ --===//
//		//
// The LLVM Compiler Infrastructure		// The LLVM Compiler Infrastructure
//		//
// This file is distributed under the University of Illinois Open Source		// This file is distributed under the University of Illinois Open Source
// License. See LICENSE.TXT for details.		// License. See LICENSE.TXT for details.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include <map>		#include <map>
#include <set>		#include <set>

#include "Plugins/Language/CPlusPlus/CPlusPlusLanguage.h"
#include "Plugins/Language/ObjC/ObjCLanguage.h"		#include "Plugins/Language/ObjC/ObjCLanguage.h"

#include "lldb/Core/Module.h"		#include "lldb/Core/Module.h"
		#include "lldb/Core/RichManglingInfo.h"
#include "lldb/Core/STLUtils.h"		#include "lldb/Core/STLUtils.h"
#include "lldb/Core/Section.h"		#include "lldb/Core/Section.h"
#include "lldb/Symbol/ObjectFile.h"		#include "lldb/Symbol/ObjectFile.h"
#include "lldb/Symbol/Symbol.h"		#include "lldb/Symbol/Symbol.h"
#include "lldb/Symbol/SymbolContext.h"		#include "lldb/Symbol/SymbolContext.h"
#include "lldb/Symbol/Symtab.h"		#include "lldb/Symbol/Symtab.h"
#include "lldb/Utility/RegularExpression.h"		#include "lldb/Utility/RegularExpression.h"
#include "lldb/Utility/Stream.h"		#include "lldb/Utility/Stream.h"
#include "lldb/Utility/Timer.h"		#include "lldb/Utility/Timer.h"
#include "llvm/Demangle/Demangle.h"

using namespace lldb;		using namespace lldb;
using namespace lldb_private;		using namespace lldb_private;

Symtab::Symtab(ObjectFile *objfile)		Symtab::Symtab(ObjectFile *objfile)
: m_objfile(objfile), m_symbols(), m_file_addr_to_index(),		: m_objfile(objfile), m_symbols(), m_file_addr_to_index(),
m_name_to_index(), m_mutex(), m_file_addr_to_index_computed(false),		m_name_to_index(), m_mutex(), m_file_addr_to_index_computed(false),
m_name_indexes_computed(false) {}		m_name_indexes_computed(false) {}
▲ Show 20 Lines • Show All 175 Lines • ▼ Show 20 Lines	const Symbol *Symtab::SymbolAtIndex(size_t idx) const {
// Clients should grab the mutex from this symbol table and lock it manually		// Clients should grab the mutex from this symbol table and lock it manually
// when calling this function to avoid performance issues.		// when calling this function to avoid performance issues.
if (idx < m_symbols.size())		if (idx < m_symbols.size())
return &m_symbols[idx];		return &m_symbols[idx];
return nullptr;		return nullptr;
}		}

//----------------------------------------------------------------------		//----------------------------------------------------------------------
// RichManglingInfo
//----------------------------------------------------------------------

void RichManglingInfo::ResetProvider() {
// If we want to support parsers for other languages some day, we need a
// switch here to delete the correct parser type.
if (m_legacy_parser) {
assert(m_provider == RichManglingInfo::PluginCxxLanguage);
delete get<CPlusPlusLanguage::MethodName>();
m_legacy_parser = nullptr;
}
}

RichManglingInfo *RichManglingSpec::CreateItaniumInfo() {
m_info.ResetProvider();
m_info.m_provider = RichManglingInfo::ItaniumPartialDemangler;
m_info.m_IPD = &m_IPD;
return &m_info;
}

RichManglingInfo *
RichManglingSpec::CreateLegacyCxxParserInfo(const ConstString &mangled) {
m_info.ResetProvider();
m_info.m_provider = RichManglingInfo::PluginCxxLanguage;
m_info.m_legacy_parser = new CPlusPlusLanguage::MethodName(mangled);
return &m_info;
}

RichManglingInfo::~RichManglingInfo() {
ResetProvider();
delete m_IPD_buf;
}

bool RichManglingInfo::isCtorOrDtor() const {
switch (m_provider) {
case ItaniumPartialDemangler:
return m_IPD->isCtorOrDtor();
case PluginCxxLanguage: {
// We can only check for destructors here.
auto base_name = get<CPlusPlusLanguage::MethodName>()->GetBasename();
return base_name.front() == '~';
}
}
}

bool RichManglingInfo::isFunction() const {
switch (m_provider) {
case ItaniumPartialDemangler:
return m_IPD->isFunction();
case PluginCxxLanguage:
return get<CPlusPlusLanguage::MethodName>()->IsValid();
}
}

const char *RichManglingInfo::getFunctionBaseName() const {
switch (m_provider) {
case ItaniumPartialDemangler:
m_IPD_buf = m_IPD->getFunctionBaseName(m_IPD_buf, &m_IPD_size);
return m_IPD_buf;
case PluginCxxLanguage:
return get<CPlusPlusLanguage::MethodName>()->GetBasename().data();
}
}

const char *RichManglingInfo::getFunctionDeclContextName() const {
switch (m_provider) {
case ItaniumPartialDemangler:
m_IPD_buf = m_IPD->getFunctionDeclContextName(m_IPD_buf, &m_IPD_size);
return m_IPD_buf;
case PluginCxxLanguage:
return get<CPlusPlusLanguage::MethodName>()->GetContext().data();
}
}

//----------------------------------------------------------------------
// InitNameIndexes		// InitNameIndexes
//----------------------------------------------------------------------		//----------------------------------------------------------------------
namespace {		namespace {
bool lldb_skip_name(llvm::StringRef mangled, Mangled::ManglingScheme scheme) {		bool lldb_skip_name(llvm::StringRef mangled, Mangled::ManglingScheme scheme) {
		sgraenitzAuthorUnsubmitted Not Done Reply Inline Actions This uses a raw C-string instead of `llvm::StringRef` in order to achieve `O(1)` runtime. sgraenitz: This uses a raw C-string instead of `llvm::StringRef` in order to achieve `O(1)` runtime.
		labathUnsubmitted Done Reply Inline Actions If you changed the caller to use StringRef too (it seems possible at a first glance) then this would still be O(1) labath: If you changed the caller to use StringRef too (it seems possible at a first glance) then this…
		sgraenitzAuthorUnsubmitted Not Done Reply Inline Actions Right, thanks there's a `ConstString::GetStringRef()`. Perfect. sgraenitz: Right, thanks there's a `ConstString::GetStringRef()`. Perfect.
switch (scheme) {		switch (scheme) {
case Mangled::eManglingSchemeItanium: {		case Mangled::eManglingSchemeItanium: {
if (mangled.size() < 3 \|\| !mangled.startswith("_Z"))		if (mangled.size() < 3 \|\| !mangled.startswith("_Z"))
return true;		return true;

// Avoid the following types of symbols in the index.		// Avoid the following types of symbols in the index.
switch (mangled[2]) {		switch (mangled[2]) {
case 'G': // guard variables		case 'G': // guard variables
Show All 34 Lines
#else		#else
// TODO: benchmark this to see if we save any memory. Otherwise we		// TODO: benchmark this to see if we save any memory. Otherwise we
// will always keep the memory reserved in the vector unless we pull some		// will always keep the memory reserved in the vector unless we pull some
// STL swap magic and then recopy...		// STL swap magic and then recopy...
uint32_t actual_count = 0;		uint32_t actual_count = 0;
for (const_iterator pos = m_symbols.begin(), end = m_symbols.end();		for (const_iterator pos = m_symbols.begin(), end = m_symbols.end();
pos != end; ++pos) {		pos != end; ++pos) {
const Mangled &mangled = pos->GetMangled();		const Mangled &mangled = pos->GetMangled();
if (mangled.GetMangledName())		if (mangled.GetMangledName())
++actual_count;		++actual_count;

if (mangled.GetDemangledName())		if (mangled.GetDemangledName())
		sgraenitzAuthorUnsubmitted Not Done Reply Inline Actions @erik.pilkington Is it acceptable/good practice to pass `(nullptr, 0)` here? At the moment this safes some lines of initialization checks for `m_IPD_buf` and `m_IPD_size`. sgraenitz: @erik.pilkington Is it acceptable/good practice to pass `(nullptr, 0)` here? At the moment this…
		erik.pilkingtonUnsubmitted Done Reply Inline Actions Sure, thats fine! Those parameters act the same way as `buf` and `size` in __cxa_demangle. `getFunctionBaseName` will return nullptr if the mangled name isn't a function. Is it a precondition of this function that m_IPD stores a function? If not, it looks like you'll leak the buffer. erik.pilkington: Sure, thats fine! Those parameters act the same way as `buf` and `size` in __cxa_demangle.
		sgraenitzAuthorUnsubmitted Not Done Reply Inline Actions Oh that is a very good note. I had it as a precondition in the client function in Symtab. When I removed that and started to just check the result for `nullptr`, I didn't think about the buffer. Gonna fix it, the generalized interface shouldn't have that precondition anyway. Thanks! sgraenitz: Oh that is a very good note. I had it as a precondition in the client function in Symtab. When…
++actual_count;		++actual_count;
}		}

m_name_to_index.Reserve(actual_count);		m_name_to_index.Reserve(actual_count);
#endif		#endif

// The "const char *" in "class_contexts" must come from a		// The "const char *" in "class_contexts" must come from a
// ConstString::GetCString()		// ConstString::GetCString()
std::set<const char *> class_contexts;		std::set<const char *> class_contexts;
UniqueCStringMap<uint32_t> mangled_name_to_index;		UniqueCStringMap<uint32_t> mangled_name_to_index;
std::vector<const char *> symbol_contexts(num_symbols, nullptr);		std::vector<const char *> symbol_contexts(num_symbols, nullptr);

// Instantiation of the demangler is expensive, so better use a single one		// Instantiation of the demangler is expensive, so better use a single one
// for all entries during batch processing.		// for all entries during batch processing.
RichManglingSpec spec;		RichManglingContext MC;
		labathUnsubmitted Done Reply Inline Actions Could these return StringRef instead of C strings? labath: Could these return StringRef instead of C strings?
		sgraenitzAuthorUnsubmitted Not Done Reply Inline Actions Yes. So far it's simply the closest superset of the two interfaces, but I will try using `StringRef` where possible. sgraenitz: Yes. So far it's simply the closest superset of the two interfaces, but I will try using…
NameToIndexMap::Entry entry;		NameToIndexMap::Entry entry;

for (entry.value = 0; entry.value < num_symbols; ++entry.value) {		for (entry.value = 0; entry.value < num_symbols; ++entry.value) {
Symbol *symbol = &m_symbols[entry.value];		Symbol *symbol = &m_symbols[entry.value];

// Don't let trampolines get into the lookup by name map If we ever need		// Don't let trampolines get into the lookup by name map If we ever need
// the trampoline symbols to be searchable by name we can remove this and		// the trampoline symbols to be searchable by name we can remove this and
// then possibly add a new bool to any of the Symtab functions that		// then possibly add a new bool to any of the Symtab functions that
Show All 14 Lines	for (entry.value = 0; entry.value < num_symbols; ++entry.value) {
entry.cstring = ConstString(m_objfile->StripLinkerSymbolAnnotations(		entry.cstring = ConstString(m_objfile->StripLinkerSymbolAnnotations(
entry.cstring.GetStringRef()));		entry.cstring.GetStringRef()));
m_name_to_index.Append(entry);		m_name_to_index.Append(entry);
}		}

const SymbolType type = symbol->GetType();		const SymbolType type = symbol->GetType();
if (type == eSymbolTypeCode \|\| type == eSymbolTypeResolver) {		if (type == eSymbolTypeCode \|\| type == eSymbolTypeResolver) {
if (const RichManglingInfo *info =		if (const RichManglingInfo *info =
mangled.DemangleWithRichManglingInfo(spec, lldb_skip_name))		mangled.DemangleWithRichManglingInfo(MC, lldb_skip_name))
RegisterMangledNameEntry(entry, class_contexts,		RegisterMangledNameEntry(entry, class_contexts,
mangled_name_to_index, symbol_contexts,		mangled_name_to_index, symbol_contexts,
*info);		*info);
}		}
}		}

// Symbol name strings that didn't match a Mangled::ManglingScheme, are		// Symbol name strings that didn't match a Mangled::ManglingScheme, are
// stored in the demangled field.		// stored in the demangled field.
▲ Show 20 Lines • Show All 56 Lines • ▼ Show 20 Lines	#endif
}		}
}		}

void Symtab::RegisterMangledNameEntry(		void Symtab::RegisterMangledNameEntry(
NameToIndexMap::Entry &entry, std::set<const char *> &class_contexts,		NameToIndexMap::Entry &entry, std::set<const char *> &class_contexts,
UniqueCStringMap<uint32_t> &mangled_name_to_index,		UniqueCStringMap<uint32_t> &mangled_name_to_index,
std::vector<const char *> &symbol_contexts, const RichManglingInfo &info) {		std::vector<const char *> &symbol_contexts, const RichManglingInfo &info) {
// Only register functions that have a base name.		// Only register functions that have a base name.
const char *base_name_cstr = info.getFunctionBaseName();		const char *base_name_cstr = info.GetFunctionBaseName();
if (base_name_cstr == nullptr)		if (base_name_cstr == nullptr)
return;		return;

// The base name will be our entry's name.		// The base name will be our entry's name.
entry.cstring = ConstString(base_name_cstr);		entry.cstring = ConstString(base_name_cstr);

// Register functions with no context.		// Register functions with no context.
ConstString decl_context(info.getFunctionDeclContextName());		ConstString decl_context(info.GetFunctionDeclContextName());
if (decl_context.IsEmpty()) {		if (decl_context.IsEmpty()) {
// This has to be a basename		// This has to be a basename
m_basename_to_index.Append(entry);		m_basename_to_index.Append(entry);
// If there is no context (no namespaces or class scopes that come before		// If there is no context (no namespaces or class scopes that come before
// the function name) then this also could be a fullname.		// the function name) then this also could be a fullname.
m_name_to_index.Append(entry);		m_name_to_index.Append(entry);
return;		return;
}		}

// See if we already know this context.		// See if we already know this context.
auto it = class_contexts.find(decl_context.GetCString());		auto it = class_contexts.find(decl_context.GetCString());

// Register constructors and destructors. They are methods and create		// Register constructors and destructors. They are methods and create
// declaration contexts.		// declaration contexts.
if (info.isCtorOrDtor()) {		if (info.IsCtorOrDtor()) {
m_method_to_index.Append(entry);		m_method_to_index.Append(entry);
if (it == class_contexts.end())		if (it == class_contexts.end())
class_contexts.insert(it, decl_context.GetCString());		class_contexts.insert(it, decl_context.GetCString());
return;		return;
}		}

// Register regular methods with a known declaration context.		// Register regular methods with a known declaration context.
if (it != class_contexts.end()) {		if (it != class_contexts.end()) {
▲ Show 20 Lines • Show All 753 Lines • Show Last 20 Lines