This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/ADT/
-
llvm/
-
ADT/
4
SmallVector.h
-
lib/Support/
-
Support/
-
SmallVector.cpp

Differential D49163

ADT: Shrink SmallVector size 0 to 16B on 64-bit platforms
ClosedPublic

Authored by dexonsmith on Jul 10 2018, 5:14 PM.

Download Raw Diff

Details

Reviewers

rsmith
rnk

Summary

SmallVectorTemplateCommon wants to know the address of the first element so it can detect whether it's in "small size" mode.

The old implementation split the small array, creating the storage for the first element in SmallVectorTemplateCommon, and pulling the rest into SmallVectorStorage where we know the size of the array. This bloats SmallVector small-size 0 by the larger of sizeof(void*) and sizeof(T) unnecessarily.

The new implementation leaves the full small storage to SmallVectorStorage. To calculate the offset of the first element in SmallVectorTemplateCommon, we just need to know how far to jump, which we can calculate out-of-band. One subtlety is that we need SmallVectorStorage to be properly aligned even when the size is 0, to be sure that (for large alignments) we actually have the padding and it's well defined to do the pointer math.

Diff Detail

Event Timeline

dexonsmith created this revision.Jul 10 2018, 5:14 PM

Herald added a subscriber: hiraditya. · View Herald TranscriptJul 10 2018, 5:14 PM

dexonsmith added a parent revision: D48518: ADT: Shrink SmallVector by 8B on 64-bit platforms.Jul 10 2018, 5:14 PM

@rsmith, what's the least bad way to do what SmallVector is trying to do?

llvm/include/llvm/ADT/SmallVector.h
90	This assumes that base classes are laid out more or less the same as fields, but we were already assuming that there wouldn't be padding between FirstEl and InlineElts, so this seems like an improvement.

dexonsmith added inline comments.Jul 19 2018, 3:37 PM

llvm/include/llvm/ADT/SmallVector.h

What about this?

struct SmallVectorAlignmentAndSizeBase {
  AlignedCharArrayUnion<SmallVectorBase> Base;
};
template <class T>
struct SmallVectorAlignmentAndSize : SmallVectorAlignmentAndSizeBase {
  AlignedCharArrayUnion<T> FirstEl;
};

// still doing this later:
  void *getFirstEl() const {
    return const_cast<void *>(reinterpret_cast<const void *>(
        reinterpret_cast<const char *>(this) +
        offsetof(SmallVectorAlignmentAndSize<T>, FirstEl)));
  }

dexonsmith added inline comments.Jul 19 2018, 5:35 PM

llvm/include/llvm/ADT/SmallVector.h
90	we were already assuming that there wouldn't be padding between FirstEl and InlineElts BTW, I don't think we had to assuming that there wouldn't be padding. As long as there's enough otherwise-unused storage after `FirstEl`, we won't overflow the allocation.

lgtm, I was hoping @rsmith would have a better suggestion, but we don't need to wait on it.

llvm/include/llvm/ADT/SmallVector.h
90	I think there are interesting differences between multiple inheritance and single inheritance, so I don't think this is an improvement over the simpler `offsetof` calculation that you have now.

This revision is now accepted and ready to land.Jul 23 2018, 11:41 AM

Committed in r337820.

Revision Contents

Path

Size

llvm/

include/

llvm/

ADT/

SmallVector.h

46 lines

lib/

Support/

SmallVector.cpp

16 lines

Diff 154907

llvm/include/llvm/ADT/SmallVector.h

Show First 20 Lines • Show All 64 Lines • ▼ Show 20 Lines	public:
/// update the size later. This avoids the cost of value initializing elements		/// update the size later. This avoids the cost of value initializing elements
/// which will only be overwritten.		/// which will only be overwritten.
void set_size(size_t Size) {		void set_size(size_t Size) {
assert(Size <= capacity());		assert(Size <= capacity());
this->Size = Size;		this->Size = Size;
}		}
};		};

		/// Figure out the offset of the first element.
		template <class T, typename = void> struct SmallVectorAlignmentAndSize {
		AlignedCharArrayUnion<SmallVectorBase> Base;
		AlignedCharArrayUnion<T> FirstEl;
		};

/// This is the part of SmallVectorTemplateBase which does not depend on whether		/// This is the part of SmallVectorTemplateBase which does not depend on whether
/// the type T is a POD. The extra dummy template argument is used by ArrayRef		/// the type T is a POD. The extra dummy template argument is used by ArrayRef
/// to avoid unnecessarily requiring T to be complete.		/// to avoid unnecessarily requiring T to be complete.
template <typename T, typename = void>		template <typename T, typename = void>
class SmallVectorTemplateCommon : public SmallVectorBase {		class SmallVectorTemplateCommon : public SmallVectorBase {
private:		/// Find the address of the first element. For this pointer math to be valid
template <typename, unsigned> friend struct SmallVectorStorage;		/// with small-size of 0 for T with lots of alignment, it's important that
		/// SmallVectorStorage is properly-aligned even for small-size of 0.
// Allocate raw space for N elements of type T. If T has a ctor or dtor, we		void *getFirstEl() const {
// don't want it to be automatically run, so we need to represent the space as		return const_cast<void >(reinterpret_cast<const void >(
// something else. Use an array of char of sufficient alignment.		reinterpret_cast<const char *>(this) +
using U = AlignedCharArrayUnion<T>;		offsetof(SmallVectorAlignmentAndSize<T>, FirstEl)));
		rnkUnsubmitted Not Done Reply Inline Actions This assumes that base classes are laid out more or less the same as fields, but we were already assuming that there wouldn't be padding between FirstEl and InlineElts, so this seems like an improvement. rnk: This assumes that base classes are laid out more or less the same as fields, but we were…
		dexonsmithAuthorUnsubmitted Not Done Reply Inline Actions What about this? struct SmallVectorAlignmentAndSizeBase { AlignedCharArrayUnion<SmallVectorBase> Base; }; template <class T> struct SmallVectorAlignmentAndSize : SmallVectorAlignmentAndSizeBase { AlignedCharArrayUnion<T> FirstEl; }; // still doing this later: void getFirstEl() const { return const_cast<void >(reinterpret_cast<const void >( reinterpret_cast<const char >(this) + offsetof(SmallVectorAlignmentAndSize<T>, FirstEl))); } dexonsmith: What about this? ``` struct SmallVectorAlignmentAndSizeBase {…
		dexonsmithAuthorUnsubmitted Not Done Reply Inline Actions we were already assuming that there wouldn't be padding between FirstEl and InlineElts BTW, I don't think we had to assuming that there wouldn't be padding. As long as there's enough otherwise-unused storage after `FirstEl`, we won't overflow the allocation. dexonsmith: > we were already assuming that there wouldn't be padding between FirstEl and InlineElts BTW…
		rnkUnsubmitted Not Done Reply Inline Actions I think there are interesting differences between multiple inheritance and single inheritance, so I don't think this is an improvement over the simpler `offsetof` calculation that you have now. rnk: I think there are interesting differences between multiple inheritance and single inheritance…
U FirstEl;		}
// Space after 'FirstEl' is clobbered, do not add any instance vars after it.		// Space after 'FirstEl' is clobbered, do not add any instance vars after it.

protected:		protected:
SmallVectorTemplateCommon(size_t Size) : SmallVectorBase(&FirstEl, Size) {}		SmallVectorTemplateCommon(size_t Size)
		: SmallVectorBase(getFirstEl(), Size) {}

void grow_pod(size_t MinCapacity, size_t TSize) {		void grow_pod(size_t MinCapacity, size_t TSize) {
SmallVectorBase::grow_pod(&FirstEl, MinCapacity, TSize);		SmallVectorBase::grow_pod(getFirstEl(), MinCapacity, TSize);
}		}

/// Return true if this is a smallvector which has not had dynamic		/// Return true if this is a smallvector which has not had dynamic
/// memory allocated for it.		/// memory allocated for it.
bool isSmall() const {		bool isSmall() const { return BeginX == getFirstEl(); }
return BeginX == static_cast<const void*>(&FirstEl);
}

/// Put this vector in a state of being small.		/// Put this vector in a state of being small.
void resetToSmall() {		void resetToSmall() {
BeginX = &FirstEl;		BeginX = getFirstEl();
Size = Capacity = 0; // FIXME: Setting Capacity to 0 is suspect.		Size = Capacity = 0; // FIXME: Setting Capacity to 0 is suspect.
}		}

public:		public:
using size_type = size_t;		using size_type = size_t;
using difference_type = ptrdiff_t;		using difference_type = ptrdiff_t;
using value_type = T;		using value_type = T;
using iterator = T *;		using iterator = T *;
▲ Show 20 Lines • Show All 701 Lines • ▼ Show 20 Lines	SmallVectorImpl<T> &SmallVectorImpl<T>::operator=(SmallVectorImpl<T> &&RHS) {

// Set end.		// Set end.
this->set_size(RHSSize);		this->set_size(RHSSize);

RHS.clear();		RHS.clear();
return *this;		return *this;
}		}

/// Storage for the SmallVector elements which aren't contained in		/// Storage for the SmallVector elements. This is specialized for the N=0 case
/// SmallVectorTemplateCommon. There are 'N-1' elements here. The remaining '1'
/// element is in the base class. This is specialized for the N=1 and N=0 cases
/// to avoid allocating unnecessary storage.		/// to avoid allocating unnecessary storage.
template <typename T, unsigned N>		template <typename T, unsigned N>
struct SmallVectorStorage {		struct SmallVectorStorage {
typename SmallVectorTemplateCommon<T>::U InlineElts[N - 1];		AlignedCharArrayUnion<T> InlineElts[N];
};		};
template <typename T> struct SmallVectorStorage<T, 1> {};
template <typename T> struct SmallVectorStorage<T, 0> {};		/// We need the storage to be properly aligned even for small-size of 0 so that
		/// the pointer math in \a SmallVectorTemplateCommon::getFirstEl() is
		/// well-defined.
		template <typename T> struct alignas(alignof(T)) SmallVectorStorage<T, 0> {};

/// This is a 'vector' (really, a variable-sized array), optimized		/// This is a 'vector' (really, a variable-sized array), optimized
/// for the case when the array is small. It contains some number of elements		/// for the case when the array is small. It contains some number of elements
/// in-place, which allows it to avoid heap allocation when the actual number of		/// in-place, which allows it to avoid heap allocation when the actual number of
/// elements is below that threshold. This allows normal "small" cases to be		/// elements is below that threshold. This allows normal "small" cases to be
/// fast without losing generality for large inputs.		/// fast without losing generality for large inputs.
///		///
/// Note that this does not attempt to be exception safe.		/// Note that this does not attempt to be exception safe.
▲ Show 20 Lines • Show All 96 Lines • Show Last 20 Lines

llvm/lib/Support/SmallVector.cpp

	//===- llvm/ADT/SmallVector.cpp - 'Normally small' vectors ----------------===//			//===- llvm/ADT/SmallVector.cpp - 'Normally small' vectors ----------------===//
	//			//
	// The LLVM Compiler Infrastructure			// The LLVM Compiler Infrastructure
	//			//
	// This file is distributed under the University of Illinois Open Source			// This file is distributed under the University of Illinois Open Source
	// License. See LICENSE.TXT for details.			// License. See LICENSE.TXT for details.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//
	// This file implements the SmallVector class.			// This file implements the SmallVector class.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "llvm/ADT/SmallVector.h"			#include "llvm/ADT/SmallVector.h"
	using namespace llvm;			using namespace llvm;

	// Check that no bytes are wasted.			// Check that no bytes are wasted and everything is well-aligned.
				namespace {
				struct Struct16B {
				alignas(16) void *X;
				};
				}
				static_assert(sizeof(SmallVector<void *, 0>) ==
				sizeof(unsigned) * 2 + sizeof(void *),
				"wasted space in SmallVector size 0");
				static_assert(alignof(SmallVector<Struct16B, 0>) == 16,
				"wrong alignment with no small size");
				static_assert(sizeof(SmallVector<Struct16B, 0>) == 16,
				"missing padding with no small size");
	static_assert(sizeof(SmallVector<void *, 1>) ==			static_assert(sizeof(SmallVector<void *, 1>) ==
	sizeof(unsigned) * 2 + sizeof(void ) 2,			sizeof(unsigned) * 2 + sizeof(void ) 2,
	"wasted space in SmallVector size 1; missing EBO?");			"wasted space in SmallVector size 1");

	/// grow_pod - This is an implementation of the grow() method which only works			/// grow_pod - This is an implementation of the grow() method which only works
	/// on POD-like datatypes and is out of line to reduce code duplication.			/// on POD-like datatypes and is out of line to reduce code duplication.
	void SmallVectorBase::grow_pod(void *FirstEl, size_t MinCapacity,			void SmallVectorBase::grow_pod(void *FirstEl, size_t MinCapacity,
	size_t TSize) {			size_t TSize) {
	// Ensure we can fit the new capacity in 32 bits.			// Ensure we can fit the new capacity in 32 bits.
	if (MinCapacity > UINT32_MAX)			if (MinCapacity > UINT32_MAX)
	report_bad_alloc_error("SmallVector capacity overflow during allocation");			report_bad_alloc_error("SmallVector capacity overflow during allocation");
	Show All 19 Lines

This is an archive of the discontinued LLVM Phabricator instance.

ADT: Shrink SmallVector size 0 to 16B on 64-bit platformsClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 154907

llvm/include/llvm/ADT/SmallVector.h

llvm/lib/Support/SmallVector.cpp

ADT: Shrink SmallVector size 0 to 16B on 64-bit platforms
ClosedPublic