This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/Support/
-
llvm/
-
Support/
3/6
OptimalLayout.h
-
lib/Support/
-
Support/
-
CMakeLists.txt
-
OptimalLayout.cpp
-
unittests/Support/
-
Support/
-
CMakeLists.txt
-
OptimalLayoutTest.cpp

Differential D76526

Add an algorithm for performing "optimal" layout of a struct.
AcceptedPublic

Authored by rjmccall on Mar 20 2020, 2:03 PM.

Download Raw Diff

Details

Reviewers

echristo

Summary

The algorithm supports both assigning a fixed offset to a field prior to layout and allowing fields to have sizes that aren't multiples of their required alignments. This means that the well-known algorithm of sorting by decreasing alignment isn't always good enough. Still, we start with that, and only if that leaves padding around do we fall back on a greedy padding-minimizing algorithm.

There is no known efficient algorithm for producing a guaranteed-minimal layout in all cases. In fact, allowing arbitrary fixed-offset fields means there's a straightforward reduction from bin-packing, making this NP-hard. But as usual with such problems, we can still efficiently produce adequate solutions to the cases that matter most to us.

I intend to use this in coroutine frame layout, where the retcon lowerings very badly want to minimize total space usage, and where the switch lowering can indeed produce a header with interior padding if the promise field is highly-aligned. But it may be useful in a much wider variety of situations.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

rjmccall created this revision.Mar 20 2020, 2:03 PM

Herald added a project: Restricted Project. · View Herald TranscriptMar 20 2020, 2:03 PM

Herald added subscribers: llvm-commits, dexonsmith, modocache and 2 others. · View Herald Transcript

rjmccall edited the summary of this revision. (Show Details)Mar 20 2020, 2:09 PM

rjmccall added a reviewer: echristo.

Harbormaster completed remote builds in B49954: Diff 251755.Mar 20 2020, 2:40 PM

Looks great to me as is. Thanks.

-eric

This revision is now accepted and ready to land.Apr 7 2020, 11:02 AM

Thanks. Chris asked me offline to rename it to something like OptimizedStructLayout, which seems reasonable to me; I just haven't had the time to do that yet.

atmnpatel mentioned this in D90550: [Coroutines][Docs] Remove frame packing as a TODO.Nov 1 2020, 3:52 AM

atmnpatel mentioned this in rGeed8df6a1314: [Coroutines][Docs] Remove frame packing as a TODO.Nov 2 2020, 12:57 PM

aykevl added a subscriber: aykevl.Apr 10 2021, 3:24 PM

aykevl added inline comments.

llvm/include/llvm/Support/OptimalLayout.h
40	This line is triggered in the coroutine lowering passes, see: https://bugs.llvm.org/show_bug.cgi?id=49916 I don't know whether this is a bug in OptimizedStructLayout or in the coroutine lowering passes, but I'm noting it here just in case.

Herald added a subscriber: lxfind. · View Herald TranscriptApr 10 2021, 3:24 PM

rjmccall added inline comments.Apr 10 2021, 9:15 PM

llvm/include/llvm/Support/OptimalLayout.h
40	Hmm. Abstractly it seems like a reasonable precondition, so it's probably best to fix in the coroutine lowering passes.

lxfind added inline comments.Apr 10 2021, 10:13 PM

llvm/include/llvm/Support/OptimalLayout.h
40	I guess we could explicitly check for the size of Alloca in corosplit, and if it's 0, don't put it on the frame?

rjmccall added inline comments.Apr 10 2021, 11:13 PM

llvm/include/llvm/Support/OptimalLayout.h
40	It depends on what semantics we need to provide for the alloca. If the address needs to be consistent and unique throughout the coroutine, which I think is the usual `alloca` guarantee, then coroutine lowering should probably ask layout for something as if the object had size 1.

Out of curiosity, what's the motivating use case for the fixed offset fields in this tool?

In D76526#2688002, @dblaikie wrote:

Out of curiosity, what's the motivating use case for the fixed offset fields in this tool?

There is a need for this in coroutines. In the coroutine frame layout, the first few fields are fixed according to C++ standard.

In D76526#2689041, @lxfind wrote:

In D76526#2688002, @dblaikie wrote:

Out of curiosity, what's the motivating use case for the fixed offset fields in this tool?

There is a need for this in coroutines. In the coroutine frame layout, the first few fields are fixed according to C++ standard.

Ah, makes sense - is that only for prefixes, or can that result in arbitrary non-initial-sequence fixed offset fields too?

In D76526#2689767, @dblaikie wrote:

In D76526#2689041, @lxfind wrote:

In D76526#2688002, @dblaikie wrote:

Out of curiosity, what's the motivating use case for the fixed offset fields in this tool?

There is a need for this in coroutines. In the coroutine frame layout, the first few fields are fixed according to C++ standard.

Ah, makes sense - is that only for prefixes, or can that result in arbitrary non-initial-sequence fixed offset fields too?

It does support filling holes between fixed fields, yes. This is actually currently used in coroutines with over-aligned promises that require padding. (Apparently we are currently putting the padding in the wrong place, but that doesn't deeply matter, since in either case there's padding we can fill.)

lxfind added inline comments.May 3 2021, 4:47 PM

llvm/include/llvm/Support/OptimalLayout.h
40	Curious, would it just work if we remove this assertion? That would still allow the alloca to have a consistent address and I cannot think of a reason this algorithm will fail because a field is 0 size?

rjmccall added inline comments.May 3 2021, 5:26 PM

llvm/include/llvm/Support/OptimalLayout.h
40	The assertion is there because I don't want this library to have to start making policy decisions like that. You can very easily handle zero-sized allocas that don't have to have a unique address by assigning them an offset of zero within the frame. It looks like `alloca` doesn't guarantee a unique address, so that should be fine.

Matt added a subscriber: Matt.May 31 2021, 9:52 AM

Revision Contents

Path

Size

llvm/

include/

llvm/

Support/

OptimalLayout.h

130 lines

lib/

Support/

CMakeLists.txt

1 line

OptimalLayout.cpp

452 lines

unittests/

Support/

CMakeLists.txt

1 line

OptimalLayoutTest.cpp

132 lines

Diff 251755

llvm/include/llvm/Support/OptimalLayout.h

This file was added.

				//===-- OptimalLayout.h - Optimal data layout algorithm ------------ C++ --=//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				///
				/// This file provides an interface for laying out a sequence of fields
				/// as a struct in a way that attempts to minimizes the total space
				/// requirements of the struct.
				///
				/// The word "optimal" is a misnomer in several ways. First, minimizing
				/// space usage doesn't necessarily yield optimal performance because it
				/// may decrease locality. Second, there is no known efficient algorithm
				/// that guarantees a minimal layout for arbitrary inputs. Nonetheless,
				/// this algorithm is likely to produce much more compact layouts than
				/// would be produced by just allocating space in a buffer.
				///
				//===----------------------------------------------------------------------===//

				#ifndef LLVM_SUPPORT_OPTIMALLAYOUT_H
				#define LLVM_SUPPORT_OPTIMALLAYOUT_H

				#include "llvm/Support/Alignment.h"
				#include "llvm/ADT/ArrayRef.h"
				#include <utility>

				namespace llvm {

				/// A field in a structure.
				struct OptimalLayoutField {
				/// A special value for Offset indicating that the field can be moved
				/// anywhere.
				static constexpr uint64_t FlexibleOffset = ~(uint64_t)0;

				OptimalLayoutField(const void *Id, uint64_t Size, Align Alignment,
				uint64_t FixedOffset = FlexibleOffset)
				: Offset(FixedOffset), Size(Size), Id(Id), Alignment(Alignment) {
				assert(Size > 0 && "adding an empty field to the layout");
				aykevlUnsubmitted Not Done Reply Inline Actions This line is triggered in the coroutine lowering passes, see: https://bugs.llvm.org/show_bug.cgi?id=49916 I don't know whether this is a bug in OptimizedStructLayout or in the coroutine lowering passes, but I'm noting it here just in case. aykevl: This line is triggered in the coroutine lowering passes, see: https://bugs.llvm.org/show_bug.
				rjmccallAuthorUnsubmitted Done Reply Inline Actions Hmm. Abstractly it seems like a reasonable precondition, so it's probably best to fix in the coroutine lowering passes. rjmccall: Hmm. Abstractly it seems like a reasonable precondition, so it's probably best to fix in the…
				lxfindUnsubmitted Not Done Reply Inline Actions I guess we could explicitly check for the size of Alloca in corosplit, and if it's 0, don't put it on the frame? lxfind: I guess we could explicitly check for the size of Alloca in corosplit, and if it's 0, don't put…
				rjmccallAuthorUnsubmitted Done Reply Inline Actions It depends on what semantics we need to provide for the alloca. If the address needs to be consistent and unique throughout the coroutine, which I think is the usual `alloca` guarantee, then coroutine lowering should probably ask layout for something as if the object had size 1. rjmccall: It depends on what semantics we need to provide for the alloca. If the address needs to be…
				lxfindUnsubmitted Not Done Reply Inline Actions Curious, would it just work if we remove this assertion? That would still allow the alloca to have a consistent address and I cannot think of a reason this algorithm will fail because a field is 0 size? lxfind: Curious, would it just work if we remove this assertion? That would still allow the alloca to…
				rjmccallAuthorUnsubmitted Done Reply Inline Actions The assertion is there because I don't want this library to have to start making policy decisions like that. You can very easily handle zero-sized allocas that don't have to have a unique address by assigning them an offset of zero within the frame. It looks like `alloca` doesn't guarantee a unique address, so that should be fine. rjmccall: The assertion is there because I don't want this library to have to start making policy…
				}

				/// The offset of this field in the final layout. If this is
				/// initialized to FlexibleOffset, layout will overwrite it with
				/// the assigned offset of the field.
				uint64_t Offset;

				/// The required size of this field in bytes. Does not have to be
				/// a multiple of Alignment. Must be non-zero.
				uint64_t Size;

				/// A opaque value which uniquely identifies this field.
				const void *Id;

				/// Private scratch space for the algorithm. The implementation
				/// must treat this as uninitialized memory on entry.
				void *Scratch;

				/// The required alignment of this field.
				Align Alignment;

				/// Return true if this field has been assigned a fixed offset.
				/// After layout, this will be true of all the fields.
				bool hasFixedOffset() const {
				return (Offset != FlexibleOffset);
				}

				/// Given that this field has a fixed offset, return the offset
				/// of the first byte following it.
				uint64_t getEndOffset() const {
				assert(hasFixedOffset());
				return Offset + Size;
				}
				};

				/// Compute a layout for a struct containing the given fields, making a
				/// best-effort attempt to minimize the amount of space required.
				///
				/// Two features are supported which require a more careful solution
				/// than the well-known "sort by decreasing alignment" solution:
				///
				/// - Fields may be assigned a fixed offset in the layout. If there are
				/// gaps among the fixed-offset fields, the algorithm may attempt
				/// to allocate flexible-offset fields into those gaps. If that's
				/// undesirable, the caller should "block out" those gaps by e.g.
				/// just creating a single fixed-offset field that represents the
				/// entire "header".
				///
				/// - The size of a field is not required to be a multiple of, or even
				/// greater than, the field's required alignment. The only constraint
				/// on fields is that they must not be zero-sized.
				///
				/// To simplify the implementation, any fixed-offset fields in the
				/// layout must appear at the start of the field array, and they must
				/// be ordered by increasing offset.
				///
				/// The algorithm will produce a guaranteed-minimal layout with no
				/// interior padding in the following "C-style" case:
				///
				/// - every field's size is a multiple of its required alignment and
				/// - either no fields have initially fixed offsets, or the fixed-offset
				/// fields have no interior padding and end at an offset that is at
				/// least as aligned as all the flexible-offset fields.
				///
				/// Otherwise, while the algorithm will make a best-effort attempt to
				/// avoid padding, it cannot guarantee a minimal layout, as there is
				/// no known efficient algorithm for doing so.
				///
				/// The layout produced by this algorithm may not be stable across LLVM
				/// releases. Do not use this anywhere where ABI stability is required.
				///
				/// Flexible-offset fields with the same size and alignment will be ordered
				/// the same way they were in the initial array. Otherwise the current
				/// algorithm makes no effort to preserve the initial order of
				/// flexible-offset fields.
				///
				/// On return, all fields will have been assigned a fixed offset, and the
				/// array will be sorted in order of ascending offsets. Note that this
				/// means that the fixed-offset fields may no longer form a strict prefix
				/// if there's any padding before they end.
				///
				/// The return value is the total size of the struct and its required
				/// alignment. Note that the total size is not rounded up to a multiple
				/// of the required alignment; clients which require this can do so easily.
				std::pair<uint64_t, Align>
				performOptimalLayout(MutableArrayRef<OptimalLayoutField> Fields);

				} // namespace llvm

				#endif

llvm/lib/Support/CMakeLists.txt

Context not available.
	MemoryBuffer.cpp	MemoryBuffer.cpp
	MD5.cpp	MD5.cpp
	NativeFormatting.cpp	NativeFormatting.cpp
		OptimalLayout.cpp
	Optional.cpp	Optional.cpp
	Parallel.cpp	Parallel.cpp
	PluginLoader.cpp	PluginLoader.cpp
Context not available.

llvm/lib/Support/OptimalLayout.cpp

This file was added.

				//===--- OptimalLayout.cpp - Optimal data layout algorithm ----------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// This file implements the performOptimalLayout interface.
				//
				//===----------------------------------------------------------------------===//

				#include "llvm/Support/OptimalLayout.h"

				using namespace llvm;

				#ifndef NDEBUG
				void checkValidLayout(ArrayRef<OptimalLayoutField> Fields,
				uint64_t Size, Align MaxAlign) {
				uint64_t LastEnd = 0;
				Align ComputedMaxAlign;
				for (auto &Field : Fields) {
				assert(Field.hasFixedOffset() &&
				"didn't assign a fixed offset to field");
				assert(isAligned(Field.Alignment, Field.Offset) &&
				"didn't assign a correctly-aligned offset to field");
				assert(Field.Offset >= LastEnd &&
				"didn't assign offsets in ascending order");
				LastEnd = Field.getEndOffset();
				assert(Field.Alignment <= MaxAlign &&
				"didn't compute MaxAlign correctly");
				ComputedMaxAlign = std::max(Field.Alignment, MaxAlign);
				}
				assert(LastEnd == Size && "didn't compute LastEnd correctly");
				assert(ComputedMaxAlign == MaxAlign && "didn't compute MaxAlign correctly");
				}
				#endif

				std::pair<uint64_t, Align>
				llvm::performOptimalLayout(MutableArrayRef<OptimalLayoutField> Fields) {
				#ifndef NDEBUG
				// Do some simple precondition checks.
				{
				bool InFixedPrefix = true;
				size_t LastEnd = 0;
				for (auto &Field : Fields) {
				assert(Field.Size > 0 && "field of zero size");
				if (Field.hasFixedOffset()) {
				assert(InFixedPrefix &&
				"fixed-offset fields are not a strict prefix of array");
				assert(LastEnd <= Field.Offset &&
				"fixed-offset fields overlap or are not in order");
				LastEnd = Field.getEndOffset();
				assert(LastEnd > Field.Offset &&
				"overflow in fixed-offset end offset");
				} else {
				InFixedPrefix = false;
				}
				}
				}
				#endif

				// Do an initial pass over the fields.
				Align MaxAlign;

				// Find the first flexible-offset field, tracking MaxAlign.
				auto FirstFlexible = Fields.begin(), E = Fields.end();
				while (FirstFlexible != E && FirstFlexible->hasFixedOffset()) {
				MaxAlign = std::max(MaxAlign, FirstFlexible->Alignment);
				++FirstFlexible;
				}

				// If there are no flexible fields, we're done.
				if (FirstFlexible == E) {
				uint64_t Size = 0;
				if (!Fields.empty())
				Size = Fields.back().getEndOffset();

				#ifndef NDEBUG
				checkValidLayout(Fields, Size, MaxAlign);
				#endif
				return std::make_pair(Size, MaxAlign);
				}

				// Walk over the flexible-offset fields, tracking MaxAlign and
				// assigning them a unique number in order of their appearance.
				// We'll use this unique number in the comparison below so that
				// we can use array_pod_sort, which isn't stable. We won't use it
				// past that point.
				{
				uintptr_t UniqueNumber = 0;
				for (auto I = FirstFlexible; I != E; ++I) {
				I->Scratch = reinterpret_cast<void*>(UniqueNumber++);
				MaxAlign = std::max(MaxAlign, I->Alignment);
				}
				}

				// Sort the flexible elements in order of decreasing alignment,
				// then decreasing size, and then the original order as recorded
				// in Scratch. The decreasing-size aspect of this is only really
				// important if we get into the gap-filling stage below, but it
				// doesn't hurt here.
				array_pod_sort(FirstFlexible, E,
				[](const OptimalLayoutField *lhs,
				const OptimalLayoutField *rhs) -> int {
				// Decreasing alignment.
				if (lhs->Alignment != rhs->Alignment)
				return (lhs->Alignment < rhs->Alignment ? 1 : -1);

				// Decreasing size.
				if (lhs->Size != rhs->Size)
				return (lhs->Size < rhs->Size ? 1 : -1);

				// Original order.
				auto lhsNumber = reinterpret_cast<uintptr_t>(lhs->Scratch);
				auto rhsNumber = reinterpret_cast<uintptr_t>(rhs->Scratch);
				if (lhsNumber != rhsNumber)
				return (lhsNumber < rhsNumber ? -1 : 1);

				return 0;
				});

				// Do a quick check for whether that sort alone has given us a perfect
				// layout with no interior padding. This is very common: if the
				// fixed-layout fields have no interior padding, and they end at a
				// sufficiently-aligned offset for all the flexible-layout fields,
				// and the flexible-layout fields all have sizes that are multiples
				// of their alignment, then this will reliably trigger.
				{
				bool HasPadding = false;
				uint64_t LastEnd = 0;

				// Walk the fixed-offset fields.
				for (auto I = Fields.begin(); I != FirstFlexible; ++I) {
				assert(I->hasFixedOffset());
				if (LastEnd != I->Offset) {
				HasPadding = true;
				break;
				}
				LastEnd = I->getEndOffset();
				}

				// Walk the flexible-offset fields, optimistically assigning fixed
				// offsets. Note that we maintain a strict division between the
				// fixed-offset and flexible-offset fields, so if we end up
				// discovering padding later in this loop, we can just abandon this
				// work and we'll ignore the offsets we already assigned.
				if (!HasPadding) {
				for (auto I = FirstFlexible; I != E; ++I) {
				auto Offset = alignTo(LastEnd, I->Alignment);
				if (LastEnd != Offset) {
				HasPadding = true;
				break;
				}
				I->Offset = Offset;
				LastEnd = I->getEndOffset();
				}
				}

				// If we already have a perfect layout, we're done.
				if (!HasPadding) {
				#ifndef NDEBUG
				checkValidLayout(Fields, LastEnd, MaxAlign);
				#endif
				return std::make_pair(LastEnd, MaxAlign);
				}
				}

				// The algorithm sketch at this point is as follows.
				//
				// Consider the padding gaps between fixed-offset fields in ascending
				// order. Let LastEnd be the offset of the first byte following the
				// field before the gap, or 0 if the gap is at the beginning of the
				// structure. Find the "best" flexible-offset field according to the
				// criteria below. If no such field exists, proceed to the next gap.
				// Otherwise, add the field at the first properly-aligned offset for
				// that field that is >= LastEnd, then update LastEnd and repeat in
				// order to fill any remaining gap following that field.
				//
				// Next, let LastEnd to be the offset of the first byte following the
				// last fixed-offset field, or 0 if there are no fixed-offset fields.
				// While there are flexible-offset fields remaining, find the "best"
				// flexible-offset field according to the criteria below, add it at
				// the first properly-aligned offset for that field that is >= LastEnd,
				// and update LastEnd to the first byte following the field.
				//
				// The "best" field is chosen by the following criteria, considered
				// strictly in order:
				//
				// - When filling a gap betweeen fields, the field must fit.
				// - A field is preferred if it requires less padding following LastEnd.
				// - A field is preferred if it is more aligned.
				// - A field is preferred if it is larger.
				// - A field is preferred if it appeared earlier in the initial order.
				//
				// Minimizing leading padding is a greedy attempt to avoid padding
				// entirely. Preferring more-aligned fields is an attempt to eliminate
				// stricter constraints earlier, with the idea that weaker alignment
				// constraints may be resolvable with less padding elsewhere. These
				// These two rules are sufficient to ensure that we get the optimal
				// layout in the "C-style" case. Preferring larger fields tends to take
				// better advantage of large gaps and may be more likely to have a size
				// that's a multiple of a useful alignment. Preferring the initial
				// order may help somewhat with locality but is mostly just a way of
				// ensuring deterministic output.
				//
				// Note that this algorithm does not guarantee a minimal layout. Picking
				// a larger object greedily may leave a gap that cannot be filled as
				// efficiently. Unfortunately, solving this perfectly is an NP-complete
				// problem (by reduction from bin-packing: let B_i be the bin sizes and
				// O_j be the object sizes; add fixed-offset fields such that the gaps
				// between them have size B_i, and add flexible-offset fields with
				// alignment 1 and size O_j; if the layout size is equal to the end of
				// the last fixed-layout field, the objects fit in the bins; note that
				// this doesn't even require the complexity of alignment).

				// The implementation below is essentially just an optimized version of
				// scanning the list of remaining fields looking for the best, which
				// would be O(n^2). In the worst case, it doesn't improve on that.
				// However, in practice it'll just scan the array of alignment bins
				// and consider the first few elements from one or two bins. The
				// number of bins is bounded by a small constant: alignments are powers
				// of two that are vanishingly unlikely to be over 64 and fairly unlikely
				// to be over 8. And multiple elements only need to be considered when
				// filling a gap between fixed-offset fields, which doesn't happen very
				// often. We could use a data structure within bins that optimizes for
				// finding the best-sized match, but it would require allocating memory
				// and copying data, so it's unlikely to be worthwhile.


				// Start by organizing the flexible-offset fields into bins according to
				// their alignment. We expect a small enough number of bins that we
				// don't care about the asymptotic costs of walking this.
				struct AlignmentQueue {
				/// The minimum size of anything currently in this queue.
				uint64_t MinSize;

				/// The head of the queue. A singly-linked list. The order here should
				/// be consistent with the earlier sort, i.e. the elements should be
				/// monotonically descending in size and otherwise in the original order.
				///
				/// We remove the queue from the array as soon as this is empty.
				OptimalLayoutField *Head;

				/// The alignment requirement of the queue.
				Align Alignment;

				static OptimalLayoutField getNext(OptimalLayoutField Cur) {
				return static_cast<OptimalLayoutField*>(Cur->Scratch);
				}
				};
				SmallVector<AlignmentQueue, 8> FlexibleFieldsByAlignment;
				for (auto I = FirstFlexible; I != E; ) {
				auto Head = I;
				auto Alignment = I->Alignment;

				uint64_t MinSize = I->Size;
				auto LastInQueue = I;
				for (++I; I != E && I->Alignment == Alignment; ++I) {
				LastInQueue->Scratch = I;
				LastInQueue = I;
				MinSize = std::min(MinSize, I->Size);
				}
				LastInQueue->Scratch = nullptr;

				FlexibleFieldsByAlignment.push_back({MinSize, Head, Alignment});
				}

				#ifndef NDEBUG
				// Verify that we set the queues up correctly.
				auto checkQueues = [&]{
				bool FirstQueue = true;
				Align LastQueueAlignment;
				for (auto &Queue : FlexibleFieldsByAlignment) {
				assert((FirstQueue \|\| Queue.Alignment < LastQueueAlignment) &&
				"bins not in order of descending alignment");
				LastQueueAlignment = Queue.Alignment;
				FirstQueue = false;

				assert(Queue.Head && "queue was empty");
				uint64_t LastSize = ~(uint64_t)0;
				for (auto I = Queue.Head; I; I = Queue.getNext(I)) {
				assert(I->Alignment == Queue.Alignment && "bad field in queue");
				assert(I->Size <= LastSize && "queue not in descending size order");
				LastSize = I->Size;
				}
				}
				};
				checkQueues();
				#endif

				/// Helper function to remove a field from a queue.
				auto spliceFromQueue = [&](AlignmentQueue *Queue,
				OptimalLayoutField *Last,
				OptimalLayoutField *Cur) {
				assert(Last ? Queue->getNext(Last) == Cur : Queue->Head == Cur);

				// If we're removing Cur from a non-initial position, splice it out
				// of the linked list.
				if (Last) {
				Last->Scratch = Cur->Scratch;

				// If Cur was the last field in the list, we need to update MinSize.
				// We can just use the last field's size because the list is in
				// descending order of size.
				if (!Cur->Scratch)
				Queue->MinSize = Last->Size;

				// Otherwise, replace the head.
				} else {
				if (auto NewHead = Queue->getNext(Cur))
				Queue->Head = NewHead;

				// If we just emptied the queue, destroy its bin.
				else
				FlexibleFieldsByAlignment.erase(Queue);
				}
				};

				// Do layout into a local array. Doing this in-place on Fields is
				// not really feasible.
				SmallVector<OptimalLayoutField, 16> Layout;
				Layout.reserve(Fields.size());

				// The offset that we're currently looking to insert at (or after).
				uint64_t LastEnd = 0;

				// Helper function to splice Cur out of the given queue and add it
				// to the layout at the given offset.
				auto addToLayout = [&](AlignmentQueue *Queue,
				OptimalLayoutField *Last,
				OptimalLayoutField *Cur,
				uint64_t Offset) -> bool {
				assert(Offset == alignTo(LastEnd, Cur->Alignment));

				// Splice out. This potentially invalidates Queue.
				spliceFromQueue(Queue, Last, Cur);

				// Add Cur to the layout.
				Layout.push_back(*Cur);
				Layout.back().Offset = Offset;
				LastEnd = Layout.back().getEndOffset();

				// Always return true so that we can be tail-called.
				return true;
				};

				// Helper function to try to find a field in the given queue that'll
				// fit starting at StartOffset but before EndOffset (if present).
				// Note that this never fails if EndOffset is not provided.
				auto tryAddFillerFromQueue = [&](AlignmentQueue *Queue,
				uint64_t StartOffset,
				Optional<uint64_t> EndOffset) -> bool {
				assert(Queue->Head);
				assert(StartOffset == alignTo(LastEnd, Queue->Alignment));

				// Figure out the maximum size that a field can be, and ignore this
				// queue if there's nothing in it that small.
				auto MaxViableSize =
				(EndOffset ? *EndOffset - StartOffset : ~(uint64_t)0);
				if (Queue->MinSize > MaxViableSize) return false;

				// Find the matching field. Note that this should always find
				// something because of the MinSize check above.
				for (OptimalLayoutField Cur = Queue->Head, Last = nullptr;
				true; Last = Cur, Cur = Queue->getNext(Cur)) {
				assert(Cur && "didn't find a match in queue despite its MinSize");
				if (Cur->Size <= MaxViableSize)
				return addToLayout(Queue, Last, Cur, StartOffset);
				}

				llvm_unreachable("didn't find a match in queue despite its MinSize");
				};

				// Helper function to find the "best" flexible-offset field according
				// to the criteria described above.
				auto tryAddBestField = [&](Optional<uint64_t> BeforeOffset) -> bool {
				auto QueueB = FlexibleFieldsByAlignment.begin();
				auto QueueE = FlexibleFieldsByAlignment.end();

				// Start by looking for the most-aligned queue that doesn't need any
				// leading padding after LastEnd.
				auto FirstQueueToSearch = QueueB;
				for (; FirstQueueToSearch != QueueE; ++FirstQueueToSearch) {
				if (isAligned(FirstQueueToSearch->Alignment, LastEnd))
				break;
				}

				uint64_t Offset = LastEnd;
				while (true) {
				// Invariant: all of the queues in [FirstQueueToSearch, QueueE)
				// require the same initial padding offset.

				// Search those queues in descending order of alignment for a
				// satisfactory field.
				for (auto Queue = FirstQueueToSearch; Queue != QueueE; ++Queue) {
				if (tryAddFillerFromQueue(Queue, Offset, BeforeOffset))
				return true;
				}

				// Okay, we don't need to scan those again.
				QueueE = FirstQueueToSearch;

				// If we started from the first queue, we're done.
				if (FirstQueueToSearch == QueueB)
				return false;

				// Otherwise, scan backwards to find the most-aligned queue that
				// still has minimal leading padding after LastEnd.
				--FirstQueueToSearch;
				Offset = alignTo(LastEnd, FirstQueueToSearch->Alignment);
				while (FirstQueueToSearch != QueueB &&
				Offset == alignTo(LastEnd, FirstQueueToSearch[-1].Alignment))
				--FirstQueueToSearch;
				}
				};

				// Phase 1: fill the gaps between fixed-offset fields with the best
				// flexible-offset field that fits.
				for (auto I = Fields.begin(); I != FirstFlexible; ++I) {
				while (LastEnd != I->Offset) {
				if (!tryAddBestField(I->Offset))
				break;
				}
				Layout.push_back(*I);
				LastEnd = I->getEndOffset();
				}

				#ifndef NDEBUG
				checkQueues();
				#endif

				// Phase 2: repeatedly add the best flexible-offset field until
				// they're all gone.
				while (!FlexibleFieldsByAlignment.empty()) {
				bool Success = tryAddBestField(None);
				assert(Success && "didn't find a field with no fixed limit?");
				(void) Success;
				}

				// Copy the layout back into place.
				assert(Layout.size() == Fields.size());
				memcpy(Fields.data(), Layout.data(),
				Fields.size() * sizeof(OptimalLayoutField));

				#ifndef NDEBUG
				// Make a final check that the layout is valid.
				checkValidLayout(Fields, LastEnd, MaxAlign);
				#endif

				return std::make_pair(LastEnd, MaxAlign);
				}

llvm/unittests/Support/CMakeLists.txt

Context not available.
	MemoryBufferTest.cpp	MemoryBufferTest.cpp
	MemoryTest.cpp	MemoryTest.cpp
	NativeFormatTests.cpp	NativeFormatTests.cpp
		OptimalLayoutTest.cpp
	ParallelTest.cpp	ParallelTest.cpp
	Path.cpp	Path.cpp
	ProcessTest.cpp	ProcessTest.cpp
Context not available.

llvm/unittests/Support/OptimalLayoutTest.cpp

This file was added.

				//=== - llvm/unittest/Support/OptimalLayoutTest.cpp - Layout tests --------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#include "llvm/Support/OptimalLayout.h"
				#include "gtest/gtest.h"

				using namespace llvm;

				namespace {

				class LayoutTest {
				struct Field {
				uint64_t Size;
				Align Alignment;
				uint64_t ForcedOffset;
				uint64_t ExpectedOffset;
				};

				SmallVector<Field, 16> Fields;
				bool Verified = false;

				public:
				LayoutTest() {}
				LayoutTest(const LayoutTest &) = delete;
				LayoutTest &operator=(const LayoutTest &) = delete;
				~LayoutTest() { assert(Verified); }

				LayoutTest &flexible(uint64_t Size, uint64_t Alignment,
				uint64_t ExpectedOffset) {
				Fields.push_back({Size, Align(Alignment),
				OptimalLayoutField::FlexibleOffset, ExpectedOffset});
				return *this;
				}

				LayoutTest &fixed(uint64_t Size, uint64_t Alignment, uint64_t Offset) {
				Fields.push_back({Size, Align(Alignment), Offset, Offset});
				return *this;
				}

				void verify(uint64_t ExpectedSize, uint64_t ExpectedAlignment) {
				SmallVector<OptimalLayoutField, 8> LayoutFields;
				LayoutFields.reserve(Fields.size());
				for (auto &F : Fields)
				LayoutFields.emplace_back(&F, F.Size, F.Alignment, F.ForcedOffset);

				auto SizeAndAlign = performOptimalLayout(LayoutFields);

				EXPECT_EQ(SizeAndAlign.first, ExpectedSize);
				EXPECT_EQ(SizeAndAlign.second, Align(ExpectedAlignment));

				for (auto &LF : LayoutFields) {
				auto &F = static_cast<const Field >(LF.Id);
				EXPECT_EQ(LF.Offset, F.ExpectedOffset);
				}

				Verified = true;
				}
				};

				}

				TEST(OptimalLayoutTest, Basic) {
				LayoutTest()
				.flexible(12, 4, 8)
				.flexible(8, 8, 0)
				.flexible(4, 4, 20)
				.verify(24, 8);
				}

				TEST(OptimalLayoutTest, OddSize) {
				LayoutTest()
				.flexible(8, 8, 16)
				.flexible(4, 4, 12)
				.flexible(1, 1, 10)
				.flexible(10, 8, 0)
				.verify(24, 8);
				}

				TEST(OptimalLayoutTest, Gaps) {
				LayoutTest()
				.fixed(4, 4, 8)
				.fixed(4, 4, 16)
				.flexible(4, 4, 0)
				.flexible(4, 4, 4)
				.flexible(4, 4, 12)
				.flexible(4, 4, 20)
				.verify(24, 4);
				}

				TEST(OptimalLayoutTest, Greed) {
				// The greedy algorithm doesn't find the optimal layout here, which
				// would be to put the 5-byte field at the end.
				LayoutTest()
				.fixed(4, 4, 8)
				.flexible(5, 4, 0)
				.flexible(4, 4, 12)
				.flexible(4, 4, 16)
				.flexible(4, 4, 20)
				.verify(24, 4);
				}

				TEST(OptimalLayoutTest, Jagged) {
				LayoutTest()
				.flexible(1, 2, 18)
				.flexible(13, 8, 0)
				.flexible(3, 2, 14)
				.verify(19, 8);
				}

				TEST(OptimalLayoutTest, GardenPath) {
				// The 4-byte-aligned field is our highest priority, but the less-aligned
				// fields keep leaving the end offset mis-aligned.
				LayoutTest()
				.fixed(7, 4, 0)
				.flexible(4, 4, 44)
				.flexible(6, 1, 7)
				.flexible(5, 1, 13)
				.flexible(7, 2, 18)
				.flexible(4, 1, 25)
				.flexible(4, 1, 29)
				.flexible(1, 1, 33)
				.flexible(4, 2, 34)
				.flexible(4, 2, 38)
				.flexible(2, 2, 42)
				.flexible(2, 2, 48)
				.verify(50, 4);
				}
				No newline at end of file

This is an archive of the discontinued LLVM Phabricator instance.

Add an algorithm for performing "optimal" layout of a struct.AcceptedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 251755

llvm/include/llvm/Support/OptimalLayout.h

llvm/lib/Support/CMakeLists.txt

llvm/lib/Support/OptimalLayout.cpp

llvm/unittests/Support/CMakeLists.txt

llvm/unittests/Support/OptimalLayoutTest.cpp

Add an algorithm for performing "optimal" layout of a struct.
AcceptedPublic