This is an archive of the discontinued LLVM Phabricator instance.


Differential D123420

[demangler] Rust demangler buffer reuse
Closed, Public

Authored by urnathan on Apr 8 2022, 12:13 PM.

Details

Reviewers

dblaikie
iains
tmiasko

Commits

rG201c4b9cc4a6: [demangler] Rust demangler buffer return

Summary

The Rust demangler has some odd buffer-handling code: it copies the demangled string into the caller-provided buffer if it fits, and otherwise returns the buffer it allocated itself. But the length of the incoming buffer will have come from a previous call, which set it to the length of the demangled string, not the buffer size. And of course, we're unconditionally allocating a temporary buffer in the first place. So we don't actually get buffer reuse, and we get a memcpy in some cases.
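To make the problem concrete, here is a minimal, self-contained model of the pre-patch tail logic. It is a sketch, not the LLVM code: `finishOldStyle`, `Outcome`, and the `Text` parameter (standing in for whatever the demangler produced) are hypothetical names introduced for illustration.

```cpp
#include <cstddef>
#include <cstdlib>
#include <cstring>

// Hypothetical model, not the LLVM implementation. `Text` stands in for the
// string the demangler produced; `Buf` and `N` mirror the old parameters.
struct Outcome {
  char *Result;            // malloc'ed string owned by the caller, or nullptr
  bool ReusedCallerBuffer; // true only if Result is the caller's Buf
};

Outcome finishOldStyle(const char *Text, char *Buf, size_t *N) {
  if (Buf != nullptr && N == nullptr)
    return {nullptr, false}; // invalid arguments; the caller keeps Buf

  // A temporary buffer is allocated unconditionally, as in the old code.
  size_t Len = std::strlen(Text) + 1; // includes the terminating NUL
  char *Fresh = static_cast<char *>(std::malloc(Len));
  if (Fresh == nullptr)
    return {nullptr, false};
  std::memcpy(Fresh, Text, Len);

  bool Reused = false;
  if (Buf != nullptr) {
    if (Len <= *N) { // fits: copy over and drop the temporary
      std::memcpy(Buf, Fresh, Len);
      std::free(Fresh);
      Fresh = Buf;
      Reused = true;
    } else { // does not fit: the caller's buffer is freed
      std::free(Buf);
    }
  }

  if (N != nullptr)
    *N = Len; // the string length, NOT the capacity of the returned buffer
  return {Fresh, Reused};
}
```

Starting from a 1024-byte buffer, a first call that produces `a::main` sets `*N` to 8. A second call that produces anything longer than 7 characters then frees the buffer, even though its real capacity is still 1024 bytes.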

However, nothing in LLVM ever passes in a non-null buffer pointer. Nor does anything pass in a status pointer and then make use of the result. These parameters are exercised only by the test suite.

So let's just make the Rust demangler have the same API as the D demangler.

[this is part of a series trying to clean up the demangler's APIs]

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

urnathan created this revision. Apr 8 2022, 12:13 PM

Herald added a project: Restricted Project. Apr 8 2022, 12:13 PM

Herald added subscribers: JDevlieghere, hiraditya.

urnathan requested review of this revision. Apr 8 2022, 12:13 PM

Herald added a subscriber: llvm-commits.

Harbormaster completed remote builds in B158766: Diff 421611. Apr 8 2022, 1:20 PM

The API thus far was intended to match that of __cxa_demangle, where the caller retains the ownership of the provided buffer when demangling fails. This is no longer the case in the new implementation. I would rather avoid diverging from __cxa_demangle in such a subtle way.

The Itanium demangler gets away with the direct use of the buffer because it separates fallible parsing from infallible printing.

the length of the incoming buffer will have come from a previous call, which was the length of the demangled string

I think this is a bug in the implementation. The *N should have been updated to reflect allocated memory size. From Demangler API in Itanium C++ ABI:

In either case, the new buffer size will be stored in *n.
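For reference, the contract quoted here can be exercised directly with `abi::__cxa_demangle` from `<cxxabi.h>`, which GCC and Clang toolchains provide. The sketch below is illustrative only; `demangleReusing` is a hypothetical helper name, and the exact value stored in the length argument may differ between runtime libraries.

```cpp
#include <cxxabi.h>

#include <cstddef>
#include <cstdlib>
#include <string>

// Hypothetical helper: demangles an Itanium name while reusing one buffer.
// `Buf` must be nullptr or malloc'ed; `Len` holds the size reported by the
// previous call (or anything when Buf is nullptr).
bool demangleReusing(const char *Mangled, char *&Buf, size_t &Len,
                     std::string &Out) {
  int Status = 0;
  char *Result = abi::__cxa_demangle(Mangled, Buf, &Len, &Status);
  if (Result == nullptr || Status != 0)
    return false; // on failure the caller still owns Buf
  Buf = Result;   // the buffer may have been reallocated
  Out = Result;
  return true;
}
```

The caller keeps `Buf` and `Len` across calls and frees `Buf` once at the end. A failed call leaves both untouched, which is the ownership rule the old `rustDemangle` signature was meant to follow.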

In D123420#3440296, @tmiasko wrote:

The API thus far was intended to match that of __cxa_demangle, where the caller retains the ownership of the provided buffer when demangling fails. This is no longer the case in the new implementation. I would rather avoid diverging from __cxa_demangle in such a subtle way.

Ah, it took me a while to figure out where that's happening, I see it now. Awkward. I suppose the rust demangler could always return the new buffer (and free any incoming buffer) in the success case? Something like if (Buf) std::free(Buf); instead of that memcpy/free dance?

The Itanium demangler gets away with the direct use of the buffer because it separates fallible parsing from infallible printing.

Any thoughts about doing similar in the Rust demangler -- passing an output buffer reference around or something?
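As a rough illustration of what "separating fallible parsing from infallible printing" could look like, here is a toy sketch. It does not implement the Rust v0 grammar or reflect the Itanium demangler's internals: it parses a made-up format of length-prefixed identifiers, and `parsePath` and `printPath` are hypothetical names.

```cpp
#include <cctype>
#include <cstddef>
#include <optional>
#include <string>
#include <string_view>
#include <vector>

// Phase 1 (fallible): parse "<len><ident><len><ident>..." into components.
// Nothing is written to any output buffer until parsing has succeeded.
std::optional<std::vector<std::string>> parsePath(std::string_view In) {
  std::vector<std::string> Parts;
  size_t Pos = 0;
  while (Pos < In.size()) {
    if (!std::isdigit(static_cast<unsigned char>(In[Pos])))
      return std::nullopt;
    size_t Len = 0;
    while (Pos < In.size() &&
           std::isdigit(static_cast<unsigned char>(In[Pos]))) {
      Len = Len * 10 + static_cast<size_t>(In[Pos] - '0');
      if (Len > In.size())
        return std::nullopt; // also keeps Len from overflowing
      ++Pos;
    }
    if (Len == 0 || Len > In.size() - Pos)
      return std::nullopt;
    Parts.emplace_back(In.substr(Pos, Len));
    Pos += Len;
  }
  if (Parts.empty())
    return std::nullopt;
  return Parts;
}

// Phase 2 (no parse errors possible): print the already-validated components.
std::string printPath(const std::vector<std::string> &Parts) {
  std::string Out;
  for (size_t I = 0; I < Parts.size(); ++I) {
    if (I != 0)
      Out += "::";
    Out += Parts[I];
  }
  return Out;
}
```

Because the print phase only runs after a successful parse, it could write straight into a caller-provided buffer without any risk of having to hand back a half-written buffer on a parse failure.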

the length of the incoming buffer will have come from a previous call, which was the length of the demangled string

I think this is a bug in the implementation. The *N should have been updated to reflect allocated memory size. From Demangler API in Itanium C++ ABI:

In either case, the new buffer size will be stored in *n.

Indeed, that's the direction I'm trying to go in :)

FWIW, this is part of a series trying to make buffer ownership transfer clearer (and other issues)

In D123420#3442384, @urnathan wrote:

In D123420#3440296, @tmiasko wrote:

The API thus far was intended to match that of __cxa_demangle, where the caller retains the ownership of the provided buffer when demangling fails. This is no longer the case in the new implementation. I would rather avoid diverging from __cxa_demangle in such a subtle way.

Ah, it took me a while to figure out where that's happening, I see it now. Awkward. I suppose the rust demangler could always return the new buffer (and free any incoming buffer) in the success case? Something like if (Buf) std::free(Buf); instead of that memcpy/free dance?

Sounds good. At that point we might remove N and Buf parameters altogether? This functionality is not used anywhere in LLVM.

In D123420#3442902, @tmiasko wrote:

Sounds good. At that point we might remove N and Buf parameters altogether? This functionality is not used anywhere in LLVM.

good point, updated to do exactly that.
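With `Buf`, `N`, and `Status` removed, the contract reduces to "returns a malloc'ed string, or nullptr on failure". A caller-side sketch of that ownership rule follows. `demangleStandIn` is a hypothetical placeholder that only strips a leading `_R` instead of demangling, so that the block builds without LLVM; `FreeDeleter` and `demangleOrOriginal` are likewise illustrative names.

```cpp
#include <cstddef>
#include <cstdlib>
#include <cstring>
#include <memory>
#include <string>

// Hypothetical stand-in for a malloc-returning demangler such as the
// single-argument rustDemangle. It does NOT demangle: it just copies the text
// after a leading "_R" so the example builds without LLVM.
char *demangleStandIn(const char *Mangled) {
  if (Mangled == nullptr || std::strncmp(Mangled, "_R", 2) != 0)
    return nullptr;
  size_t Len = std::strlen(Mangled + 2) + 1; // includes the terminating NUL
  char *Out = static_cast<char *>(std::malloc(Len));
  if (Out != nullptr)
    std::memcpy(Out, Mangled + 2, Len);
  return Out;
}

// The returned buffer came from malloc, so it must be released with free.
struct FreeDeleter {
  void operator()(char *P) const { std::free(P); }
};

// Returns the demangled text, or a copy of the input on failure.
// `Mangled` must not be nullptr.
std::string demangleOrOriginal(const char *Mangled) {
  std::unique_ptr<char, FreeDeleter> Demangled(demangleStandIn(Mangled));
  return Demangled ? std::string(Demangled.get()) : std::string(Mangled);
}
```

The `unique_ptr` with a `free`-based deleter is one way to make the single ownership transfer explicit; copying into a `std::string` and calling `std::free` by hand, as `nonMicrosoftDemangle` does in this diff, is equivalent.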

LGTM. Thanks!

This revision is now accepted and ready to land. Apr 13 2022, 5:00 AM

Harbormaster completed remote builds in B159412: Diff 422463. Apr 13 2022, 5:05 AM

This revision was landed with ongoing or failed builds. Apr 13 2022, 8:50 AM

Closed by commit rG201c4b9cc4a6: [demangler] Rust demangler buffer return (authored by urnathan).

This revision was automatically updated to reflect the committed changes.

urnathan added a commit: rG201c4b9cc4a6: [demangler] Rust demangler buffer return.

Herald added a project: Restricted Project. Apr 13 2022, 8:50 AM

Herald added a subscriber: lldb-commits.

urnathan mentioned this in D122604: [demangler] Simplify OutputBuffer initialization. Apr 13 2022, 11:15 AM

Revision Contents

Path                                                                 Size
lldb/source/Core/Mangled.cpp                                         2 lines
llvm/include/llvm/Demangle/Demangle.h                                4 lines
llvm/lib/Demangle/Demangle.cpp                                       2 lines
llvm/lib/Demangle/RustDemangle.cpp                                   40 lines
llvm/tools/llvm-rust-demangle-fuzzer/llvm-rust-demangle-fuzzer.cpp   4 lines
llvm/unittests/Demangle/RustDemangleTest.cpp                         65 lines

Diff 422531

lldb/source/Core/Mangled.cpp

@@ ... @@ if (Log *log = GetLog(LLDBLog::Demangle)) {
     else
       LLDB_LOGF(log, "demangled itanium: %s -> error: failed to demangle", M);
   }

   return demangled_cstr;
 }

 static char *GetRustV0DemangledStr(const char *M) {
-  char *demangled_cstr = llvm::rustDemangle(M, nullptr, nullptr, nullptr);
+  char *demangled_cstr = llvm::rustDemangle(M);

   if (Log *log = GetLog(LLDBLog::Demangle)) {
     if (demangled_cstr && demangled_cstr[0])
       LLDB_LOG(log, "demangled rustv0: {0} -> \"{1}\"", M, demangled_cstr);
     else
       LLDB_LOG(log, "demangled rustv0: {0} -> error: failed to demangle", M);
   }

...

llvm/include/llvm/Demangle/Demangle.h

@@ ... @@
 /// *n_buf stores the size of buf on input if buf is non-nullptr, and it
 /// receives the size of the demangled string on output if n_buf is not nullptr.
 /// status receives one of the demangle_ enum entries above if it's not nullptr.
 /// Flags controls various details of the demangled representation.
 char *microsoftDemangle(const char *mangled_name, size_t *n_read, char *buf,
                         size_t *n_buf, int *status,
                         MSDemangleFlags Flags = MSDF_None);

-// Demangles a Rust v0 mangled symbol. The API follows that of __cxa_demangle.
-char *rustDemangle(const char *MangledName, char *Buf, size_t *N, int *Status);
+// Demangles a Rust v0 mangled symbol.
+char *rustDemangle(const char *MangledName);

 // Demangles a D mangled symbol.
 char *dlangDemangle(const char *MangledName);

 /// Attempt to demangle a string using different demangling schemes.
 /// The function uses heuristics to determine which demangling scheme to use.
 /// \param MangledName - reference to string to demangle.
 /// \returns - the demangled string, or a copy of the input string if no
...

llvm/lib/Demangle/Demangle.cpp

@@ ... @@ std::string llvm::demangle(const std::string &MangledName) {
   return MangledName;
 }

 bool llvm::nonMicrosoftDemangle(const char *MangledName, std::string &Result) {
   char *Demangled = nullptr;
   if (isItaniumEncoding(MangledName))
     Demangled = itaniumDemangle(MangledName, nullptr, nullptr, nullptr);
   else if (isRustEncoding(MangledName))
-    Demangled = rustDemangle(MangledName, nullptr, nullptr, nullptr);
+    Demangled = rustDemangle(MangledName);
   else if (isDLangEncoding(MangledName))
     Demangled = dlangDemangle(MangledName);

   if (!Demangled)
     return false;

   Result = Demangled;
   std::free(Demangled);
   return true;
 }

llvm/lib/Demangle/RustDemangle.cpp

@@ ... @@ private:
   bool consumeIf(char Prefix);

   bool addAssign(uint64_t &A, uint64_t B);
   bool mulAssign(uint64_t &A, uint64_t B);
 };

 } // namespace

-char *llvm::rustDemangle(const char *MangledName, char *Buf, size_t *N,
-                         int *Status) {
-  if (MangledName == nullptr || (Buf != nullptr && N == nullptr)) {
-    if (Status != nullptr)
-      *Status = demangle_invalid_args;
+char *llvm::rustDemangle(const char *MangledName) {
+  if (MangledName == nullptr)
     return nullptr;
-  }

   // Return early if mangled name doesn't look like a Rust symbol.
   StringView Mangled(MangledName);
-  if (!Mangled.startsWith("_R")) {
-    if (Status != nullptr)
-      *Status = demangle_invalid_mangled_name;
+  if (!Mangled.startsWith("_R"))
     return nullptr;
-  }

   Demangler D;
-  if (!initializeOutputBuffer(nullptr, nullptr, D.Output, 1024)) {
-    if (Status != nullptr)
-      *Status = demangle_memory_alloc_failure;
+  if (!initializeOutputBuffer(nullptr, nullptr, D.Output, 1024))
     return nullptr;
-  }

   if (!D.demangle(Mangled)) {
-    if (Status != nullptr)
-      *Status = demangle_invalid_mangled_name;
     std::free(D.Output.getBuffer());
     return nullptr;
   }

   D.Output += '\0';
-  char *Demangled = D.Output.getBuffer();
-  size_t DemangledLen = D.Output.getCurrentPosition();
-
-  if (Buf != nullptr) {
-    if (DemangledLen <= *N) {
-      std::memcpy(Buf, Demangled, DemangledLen);
-      std::free(Demangled);
-      Demangled = Buf;
-    } else {
-      std::free(Buf);
-    }
-  }
-
-  if (N != nullptr)
-    *N = DemangledLen;
-
-  if (Status != nullptr)
-    *Status = demangle_success;

-  return Demangled;
+  return D.Output.getBuffer();
 }

 Demangler::Demangler(size_t MaxRecursionLevel)
     : MaxRecursionLevel(MaxRecursionLevel) {}

 static inline bool isDigit(const char C) { return '0' <= C && C <= '9'; }

 static inline bool isHexDigit(const char C) {
...

llvm/tools/llvm-rust-demangle-fuzzer/llvm-rust-demangle-fuzzer.cpp

 //===--- llvm-demangle-fuzzer.cpp - Fuzzer for the Rust Demangler ---------===//
 //
 // Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
 // See https://llvm.org/LICENSE.txt for license information.
 // SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
 //
 //===----------------------------------------------------------------------===//

 #include "llvm/Demangle/Demangle.h"
 #include <cstdint>
 #include <cstdlib>
 #include <string>

 extern "C" int LLVMFuzzerTestOneInput(const uint8_t *Data, size_t Size) {
   std::string NullTerminatedString((const char *)Data, Size);
-  int Status = 0;
-  char *Demangled = llvm::rustDemangle(NullTerminatedString.c_str(), nullptr,
-                                       nullptr, &Status);
+  char *Demangled = llvm::rustDemangle(NullTerminatedString.c_str());
   std::free(Demangled);
   return 0;
 }

llvm/unittests/Demangle/RustDemangleTest.cpp

 //===------------------ RustDemangleTest.cpp ------------------------------===//
 //
 // Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
 // See https://llvm.org/LICENSE.txt for license information.
 // SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
 //
 //===----------------------------------------------------------------------===//

 #include "llvm/Demangle/Demangle.h"
 #include "gmock/gmock.h"
 #include "gtest/gtest.h"

 #include <cstdlib>

 TEST(RustDemangle, Success) {
-  char *Demangled =
-      llvm::rustDemangle("_RNvC1a4main", nullptr, nullptr, nullptr);
-  EXPECT_STREQ(Demangled, "a::main");
-  std::free(Demangled);
-
-  // With status.
-  int Status = 0;
-  Demangled = llvm::rustDemangle("_RNvC1a4main", nullptr, nullptr, &Status);
-  EXPECT_EQ(Status, llvm::demangle_success);
-  EXPECT_STREQ(Demangled, "a::main");
-  std::free(Demangled);
-
-  // With status and length.
-  size_t N = 0;
-  Demangled = llvm::rustDemangle("_RNvC1a4main", nullptr, &N, &Status);
-  EXPECT_EQ(Status, llvm::demangle_success);
-  EXPECT_EQ(N, 8u);
+  char *Demangled = llvm::rustDemangle("_RNvC1a4main");
   EXPECT_STREQ(Demangled, "a::main");
   std::free(Demangled);
 }

 TEST(RustDemangle, Invalid) {
-  int Status = 0;
   char *Demangled = nullptr;

   // Invalid prefix.
-  Demangled = llvm::rustDemangle("_ABCDEF", nullptr, nullptr, &Status);
-  EXPECT_EQ(Status, llvm::demangle_invalid_mangled_name);
+  Demangled = llvm::rustDemangle("_ABCDEF");
   EXPECT_EQ(Demangled, nullptr);

   // Correct prefix but still invalid.
-  Demangled = llvm::rustDemangle("_RRR", nullptr, nullptr, &Status);
-  EXPECT_EQ(Status, llvm::demangle_invalid_mangled_name);
+  Demangled = llvm::rustDemangle("_RRR");
   EXPECT_EQ(Demangled, nullptr);
 }
-
-TEST(RustDemangle, OutputBufferWithoutLength) {
-  char *Buffer = static_cast<char *>(std::malloc(1024));
-  ASSERT_NE(Buffer, nullptr);
-
-  int Status = 0;
-  char *Demangled =
-      llvm::rustDemangle("_RNvC1a4main", Buffer, nullptr, &Status);
-
-  EXPECT_EQ(Status, llvm::demangle_invalid_args);
-  EXPECT_EQ(Demangled, nullptr);
-  std::free(Buffer);
-}
-
-TEST(RustDemangle, OutputBuffer) {
-  size_t N = 1024;
-  char *Buffer = static_cast<char *>(std::malloc(N));
-  ASSERT_NE(Buffer, nullptr);
-
-  int Status = 0;
-  char *Demangled = llvm::rustDemangle("_RNvC1a4main", Buffer, &N, &Status);
-
-  EXPECT_EQ(Status, llvm::demangle_success);
-  EXPECT_EQ(Demangled, Buffer);
-  EXPECT_STREQ(Demangled, "a::main");
-  std::free(Demangled);
-}
-
-TEST(RustDemangle, SmallOutputBuffer) {
-  size_t N = 1;
-  char *Buffer = static_cast<char *>(std::malloc(N));
-  ASSERT_NE(Buffer, nullptr);
-
-  int Status = 0;
-  char *Demangled = llvm::rustDemangle("_RNvC1a4main", Buffer, &N, &Status);
-
-  EXPECT_EQ(Status, llvm::demangle_success);
-  EXPECT_STREQ(Demangled, "a::main");
-  std::free(Demangled);
-}
