This is an archive of the discontinued LLVM Phabricator instance.

Handle bit fields on big-endian systems correctly
Closed, Public

Authored by uweigand on Apr 11 2016, 11:44 AM.

Details

Reviewers

clayborg
granata.enrico
labath

Commits

rG461bd680c33f: Handle bit fields on big-endian systems correctly
rLLDB266312: Handle bit fields on big-endian systems correctly
rL266312: Handle bit fields on big-endian systems correctly

Summary

Currently, the DataExtractor::GetMaxU64Bitfield and GetMaxS64Bitfield
routines assume the incoming "bitfield_bit_offset" parameter uses
little-endian bit numbering, i.e. a bitfield_bit_offset 0 refers to
a bitfield whose least-significant bit coincides with the least-
significant bit of the surrounding integer.

On many big-endian systems, however, the big-endian bit numbering
is used for bit fields. Here, a bitfield_bit_offset 0 refers to
a bitfield whose most-significant bit coincides with the most-
significant bit of the surrounding integer.

Now, in principle LLDB could arbitrarily choose which semantics of
bitfield_bit_offset to use. However, there are two problems with
the current approach:

- When parsing DWARF, LLDB decodes bit offsets in little-endian bit numbering on LE systems, but in big-endian bit numbering on BE systems. Passing those offsets later on into the DataExtractor routines gives incorrect results on BE.

- In the interim, LLDB's type layer combines byte and bit offsets into a single number. I.e. instead of recording bitfields by specifying the byte offset and byte size of the surrounding integer *plus* the bit offset of the bit field within that field, it simply records a single bit offset number.

Now, note that converting from byte offset + bit offset to a single offset value and back is well-defined if we either use little-endian byte order *and* little-endian bit numbering, or use big-endian byte order *and* big-endian bit numbering. Any other combination will yield incorrect results.
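The point about matched conventions can be checked with a small standalone sketch. This is not LLDB code: `ByteOrder`, `load`, `read_field` and `check_container_independence` are hypothetical names introduced only for illustration, and the sketch assumes the field is narrower than 64 bits and fits inside the chosen container. It reads the same field, identified by one absolute bit offset, through two containers of different size and position.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical stand-in for lldb::ByteOrder; not the LLDB type.
enum class ByteOrder { Little, Big };

// Load `size` bytes at `data` as an unsigned integer in the given byte order.
uint64_t load(const uint8_t *data, size_t size, ByteOrder order) {
    uint64_t v = 0;
    for (size_t i = 0; i < size; ++i) {
        size_t idx = (order == ByteOrder::Big) ? i : size - 1 - i;
        v = (v << 8) | data[idx];
    }
    return v;
}

// Read a `width`-bit field at absolute bit offset `abs_bit`, going through a
// container of `csize` bytes at byte offset `cbyte`. The bit numbering matches
// `order`. Assumes width < 64 and that the field fits inside the container.
uint64_t read_field(const uint8_t *buf, size_t abs_bit, size_t width,
                    size_t cbyte, size_t csize, ByteOrder order) {
    uint64_t v = load(buf + cbyte, csize, order);
    size_t rel = abs_bit - 8 * cbyte;  // bit offset inside the container
    size_t shift = (order == ByteOrder::Big) ? 8 * csize - rel - width : rel;
    return (v >> shift) & ((uint64_t{1} << width) - 1);
}

// Same field, two different containers: the results agree when the bit
// numbering matches the byte order.
void check_container_independence() {
    const uint8_t buf[] = {0x01, 0x23, 0x45, 0x67};
    assert(read_field(buf, 12, 8, 0, 4, ByteOrder::Big) ==
           read_field(buf, 12, 8, 1, 2, ByteOrder::Big));
    assert(read_field(buf, 12, 8, 0, 4, ByteOrder::Little) ==
           read_field(buf, 12, 8, 1, 2, ByteOrder::Little));
}
```

With mismatched conventions the container-relative offset would depend on the container size, so a single stored number could no longer be converted back unambiguously.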

Therefore, the simplest approach would seem to be to always use
the bit numbering that matches the system byte order. This makes
storing a single bit offset valid, and makes the existing DWARF
code correct. The only place to fix is to teach DataExtractor
to use big-endian bit numbering on big-endian systems.
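As a rough illustration of what "big-endian bit numbering" means for the extraction step, the following standalone sketch mirrors the shift computation on an integer that has already been loaded. It is a simplified model, not the DataExtractor implementation: `ByteOrder`, `extract_bitfield` and `check_extract_bitfield` are hypothetical names, and the guard against shifts of 64 or more bits is an addition of this sketch.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical stand-in for lldb::ByteOrder; not the LLDB type.
enum class ByteOrder { Little, Big };

// Extract an unsigned bitfield from an already-loaded integer that is `size`
// bytes wide. `bit_offset` counts from the LSB for little-endian data and
// from the MSB for big-endian data.
uint64_t extract_bitfield(uint64_t value, size_t size, ByteOrder order,
                          uint32_t bit_size, uint32_t bit_offset) {
    if (bit_size == 0)
        return value;  // zero means "the entire integer"
    // Number of bits that sit to the right of the field.
    int64_t lsbcount = bit_offset;
    if (order == ByteOrder::Big)
        lsbcount = static_cast<int64_t>(size) * 8 - bit_offset - bit_size;
    if (lsbcount >= 64)
        return 0;  // sketch-only guard: avoid an undefined shift
    if (lsbcount > 0)
        value >>= lsbcount;
    if (bit_size >= 64)
        return value;
    return value & ((uint64_t{1} << bit_size) - 1);
}

void check_extract_bitfield() {
    // 4-byte integer 0x01234567, 8-bit field at bit offset 8.
    assert(extract_bitfield(0x01234567, 4, ByteOrder::Little, 8, 8) == 0x45);
    assert(extract_bitfield(0x01234567, 4, ByteOrder::Big, 8, 8) == 0x23);
}
```

The same offset of 8 selects the second-lowest byte under little-endian numbering and the second-highest byte under big-endian numbering.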

However, there is one additional caveat: we also get bit offsets
from LLDB synthetic bitfields. While the exact semantics of those
don't seem to be well-defined, from test cases it appears that
the intent was for the user-provided synthetic bitfield offset to
always use little-endian bit numbering. Therefore, on a big-endian
system we now have to convert those to big-endian bit numbering
to remain consistent.
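The conversion for synthetic bitfields amounts to measuring the same field from the other end of the integer. A minimal sketch, assuming the field lies entirely inside the integer (`flip_bit_offset` and `check_flip_bit_offset` are hypothetical names, not LLDB functions):

```cpp
#include <cassert>
#include <cstdint>

// Convert a bit offset counted from one end of a `byte_size`-byte integer
// into the offset of the same field counted from the other end.
// Assumes bit_size + bit_offset <= byte_size * 8.
constexpr uint32_t flip_bit_offset(uint32_t byte_size, uint32_t bit_size,
                                   uint32_t bit_offset) {
    return byte_size * 8 - bit_size - bit_offset;
}

void check_flip_bit_offset() {
    // A synthetic bitfield [0-3] on a 4-byte integer: from = 0, to = 3.
    uint32_t from = 0, to = 3;
    uint32_t bit_size = to - from + 1;
    assert(flip_bit_offset(4, bit_size, from) == 28);
    // Applying the conversion twice returns the original offset.
    assert(flip_bit_offset(4, bit_size, 28) == from);
}
```

Because the mapping is its own inverse, the same formula converts in either direction.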

Diff Detail

Repository: rL LLVM

Event Timeline

uweigand updated this revision to Diff 53298. Apr 11 2016, 11:44 AM

uweigand retitled this revision from (empty) to Handle bit fields on big-endian systems correctly.

uweigand updated this object.

uweigand added reviewers: granata.enrico, clayborg.

uweigand added a subscriber: lldb-commits.

Looks good as long as all tests still pass on all other systems.

This revision is now accepted and ready to land. Apr 11 2016, 1:57 PM

It would be worthwhile to add a unit test for the DataExtractor fix (we don't have many of those, but we're trying to build them up).

Updated interface documentation in DataExtractor.h.

Added unit test case for the DataExtractor::GetMaxU64Bitfield and GetMaxS64Bitfield routines.

Retested on System z and Intel.

labath accepted this revision. Apr 14 2016, 1:44 AM

labath added a reviewer: labath.

mamai added a subscriber: mamai. Apr 14 2016, 6:58 AM

Closed by commit rL266312: Handle bit fields on big-endian systems correctly (authored by uweigand). Apr 14 2016, 7:38 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path                                              Size
lldb/trunk/include/lldb/Core/DataExtractor.h      12 lines
lldb/trunk/source/Core/DataExtractor.cpp          14 lines
lldb/trunk/source/Core/ValueObject.cpp            8 lines
lldb/trunk/unittests/Core/CMakeLists.txt          1 line
lldb/trunk/unittests/Core/DataExtractorTest.cpp   39 lines

Diff 53711

lldb/trunk/include/lldb/Core/DataExtractor.h

[757 lines not shown; enclosing context: public:]
     /// The size in byte of the integer to extract.
     ///
     /// @param[in] bitfield_bit_size
     /// The size in bits of the bitfield value to extract, or zero
     /// to just extract the entire integer value.
     ///
     /// @param[in] bitfield_bit_offset
     /// The bit offset of the bitfield value in the extracted
-    /// integer (the number of bits to shift the integer to the
-    /// right).
+    /// integer. For little-endian data, this is the offset of
+    /// the LSB of the bitfield from the LSB of the integer.
+    /// For big-endian data, this is the offset of the MSB of the
+    /// bitfield from the MSB of the integer.
     ///
     /// @return
     /// The unsigned bitfield integer value that was extracted, or
     /// zero on failure.
     //------------------------------------------------------------------
     uint64_t
     GetMaxU64Bitfield (lldb::offset_t *offset_ptr,
                        size_t size,
[24 lines not shown; enclosing context: public:]
     /// The size in bytes of the integer to extract.
     ///
     /// @param[in] bitfield_bit_size
     /// The size in bits of the bitfield value to extract, or zero
     /// to just extract the entire integer value.
     ///
     /// @param[in] bitfield_bit_offset
     /// The bit offset of the bitfield value in the extracted
-    /// integer (the number of bits to shift the integer to the
-    /// right).
+    /// integer. For little-endian data, this is the offset of
+    /// the LSB of the bitfield from the LSB of the integer.
+    /// For big-endian data, this is the offset of the MSB of the
+    /// bitfield from the MSB of the integer.
     ///
     /// @return
     /// The signed bitfield integer value that was extracted, or
     /// zero on failure.
     //------------------------------------------------------------------
     int64_t
     GetMaxS64Bitfield (lldb::offset_t *offset_ptr,
                        size_t size,
[523 lines not shown]

lldb/trunk/source/Core/DataExtractor.cpp

[727 lines not shown]
 }

 uint64_t
 DataExtractor::GetMaxU64Bitfield (offset_t *offset_ptr, size_t size, uint32_t bitfield_bit_size, uint32_t bitfield_bit_offset) const
 {
     uint64_t uval64 = GetMaxU64 (offset_ptr, size);
     if (bitfield_bit_size > 0)
     {
-        if (bitfield_bit_offset > 0)
-            uval64 >>= bitfield_bit_offset;
+        int32_t lsbcount = bitfield_bit_offset;
+        if (m_byte_order == eByteOrderBig)
+            lsbcount = size * 8 - bitfield_bit_offset - bitfield_bit_size;
+        if (lsbcount > 0)
+            uval64 >>= lsbcount;
         uint64_t bitfield_mask = ((1ul << bitfield_bit_size) - 1);
         if (!bitfield_mask && bitfield_bit_offset == 0 && bitfield_bit_size == 64)
             return uval64;
         uval64 &= bitfield_mask;
     }
     return uval64;
 }

 int64_t
 DataExtractor::GetMaxS64Bitfield (offset_t *offset_ptr, size_t size, uint32_t bitfield_bit_size, uint32_t bitfield_bit_offset) const
 {
     int64_t sval64 = GetMaxS64 (offset_ptr, size);
     if (bitfield_bit_size > 0)
     {
-        if (bitfield_bit_offset > 0)
-            sval64 >>= bitfield_bit_offset;
+        int32_t lsbcount = bitfield_bit_offset;
+        if (m_byte_order == eByteOrderBig)
+            lsbcount = size * 8 - bitfield_bit_offset - bitfield_bit_size;
+        if (lsbcount > 0)
+            sval64 >>= lsbcount;
         uint64_t bitfield_mask = (((uint64_t)1) << bitfield_bit_size) - 1;
         sval64 &= bitfield_mask;
         // sign extend if needed
         if (sval64 & (((uint64_t)1) << (bitfield_bit_size - 1)))
             sval64 |= ~bitfield_mask;
     }
     return sval64;
 }
[1,506 lines not shown]

lldb/trunk/source/Core/ValueObject.cpp

[2,140 lines not shown; enclosing context: if (IsScalarType ())]
         char index_str[64];
         snprintf(index_str, sizeof(index_str), "[%i-%i]", from, to);
         ConstString index_const_str(index_str);
         // Check if we have already created a synthetic array member in this
         // valid object. If we have we will re-use it.
         synthetic_child_sp = GetSyntheticChild (index_const_str);
         if (!synthetic_child_sp)
         {
+            uint32_t bit_field_size = to - from + 1;
+            uint32_t bit_field_offset = from;
+            if (GetDataExtractor().GetByteOrder() == eByteOrderBig)
+                bit_field_offset = GetByteSize() * 8 - bit_field_size - bit_field_offset;
             // We haven't made a synthetic array member for INDEX yet, so
             // lets make one and cache it for any future reference.
             ValueObjectChild *synthetic_child = new ValueObjectChild (*this,
                 GetCompilerType(),
                 index_const_str,
                 GetByteSize(),
                 0,
-                to-from+1,
-                from,
+                bit_field_size,
+                bit_field_offset,
                 false,
                 false,
                 eAddressTypeInvalid,
                 0);

             // Cache the value if we got one back...
             if (synthetic_child)
             {
[2,167 lines not shown]

lldb/trunk/unittests/Core/CMakeLists.txt

 add_lldb_unittest(LLDBCoreTests
+  DataExtractorTest.cpp
   ScalarTest.cpp
   )

lldb/trunk/unittests/Core/DataExtractorTest.cpp

+//===-- DataExtractorTest.cpp -----------------------------------*- C++ -*-===//
+//
+// The LLVM Compiler Infrastructure
+//
+// This file is distributed under the University of Illinois Open Source
+// License. See LICENSE.TXT for details.
+//
+//===----------------------------------------------------------------------===//
+
+#if defined(_MSC_VER) && (_HAS_EXCEPTIONS == 0)
+// Workaround for MSVC standard library bug, which fails to include <thread> when
+// exceptions are disabled.
+#include <eh.h>
+#endif
+
+#include "gtest/gtest.h"
+
+#include "lldb/Core/DataExtractor.h"
+
+using namespace lldb_private;
+
+TEST(DataExtractorTest, GetBitfield)
+{
+    char buffer[] = { 0x01, 0x23, 0x45, 0x67 };
+    DataExtractor LE(buffer, sizeof(buffer), lldb::eByteOrderLittle, sizeof(void *));
+    DataExtractor BE(buffer, sizeof(buffer), lldb::eByteOrderBig, sizeof(void *));
+
+    lldb::offset_t offset;
+
+    offset = 0;
+    ASSERT_EQ(buffer[1], LE.GetMaxU64Bitfield(&offset, sizeof(buffer), 8, 8));
+    offset = 0;
+    ASSERT_EQ(buffer[1], BE.GetMaxU64Bitfield(&offset, sizeof(buffer), 8, 8));
+
+    offset = 0;
+    ASSERT_EQ(buffer[1], LE.GetMaxS64Bitfield(&offset, sizeof(buffer), 8, 8));
+    offset = 0;
+    ASSERT_EQ(buffer[1], BE.GetMaxS64Bitfield(&offset, sizeof(buffer), 8, 8));
+}
