This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/Support/
-
llvm/
-
Support/
41/41
TypeSize.h
-
unittests/Support/
-
Support/
-
CMakeLists.txt
-
LinearPolyBaseTest.cpp

Differential D88982

[NFCI] Add StackOffset class and base classes for ElementCount, TypeSize.
ClosedPublic

Authored by sdesmalen on Oct 7 2020, 10:03 AM.

Download Raw Diff

Details

Reviewers

ctetreau
efriedma
david-arm
paulwalker-arm
vkmr
ftynse
DavidTruby
georges

Commits

rG1667d23e585c: [NFCI] Add StackOffset class and base classes for ElementCount, TypeSize.

Summary

This patch adds a linear polynomial base class, called LinearPolyBase, which
serves as a base class for StackOffset. It tries to represent a linear
polynomial like:

c0 * scale0 + c1 * scale1 + ... + cK * scaleK

where the scale is implicit, meaning that only the coefficients are
encoded.

This patch also adds a univariate linear polynomial, which serves as
a base class for ElementCount and TypeSize. This tries to represent a
linear polynomial where only one dimension can be set at any one time,
i.e. a TypeSize is either fixed-sized, or scalable-sized, but cannot be
a combination of the two.

class LinearPolyBase
   ^
   |
   +---- class StackOffset  (dimensions = 2 (fixed/scalable), type = int64_t)

class UnivariateLinearPolyBase
   |
   |
   +---- class LinearPolySize (dimensions = 2 (fixed/scalable))
                ^
                |
                +-------- class ElementCount  (type = unsigned)
                |
                |
                +-------- class TypeSize      (type = uint64_t)

Diff Detail

Event Timeline

sdesmalen created this revision.Oct 7 2020, 10:03 AM

Herald added a reviewer: ftynse. · View Herald TranscriptOct 7 2020, 10:03 AM

Herald added projects: Restricted Project, Restricted Project. · View Herald Transcript

Herald added subscribers: tatianashp, msifontes, jurahul and 12 others. · View Herald Transcript

sdesmalen requested review of this revision.Oct 7 2020, 10:03 AM

Herald added subscribers: stephenneuendorffer, nicolasvasilache. · View Herald TranscriptOct 7 2020, 10:03 AM

Harbormaster completed remote builds in B74302: Diff 296720.Oct 7 2020, 10:04 AM

sdesmalen added a parent revision: D88409: [SVE] Make ElementCount and TypeSize use a new PolySize class.Oct 7 2020, 10:04 AM

sdesmalen added a child revision: D88983: [NFCI] Replace AArch64StackOffset by StackOffset..

This isn't really a true polynomial because it doesn't support multiplication of two terms: 2x * 2x = 4x^2

If what we have here is all we need, then I think it's fine (having not actually code reviewed the implementation at the time of this writing), but we should call it like it is, and ideally implement it in such a way that it would be straightforward to implement a true polynomial.

sdesmalen mentioned this in D88409: [SVE] Make ElementCount and TypeSize use a new PolySize class.Oct 7 2020, 11:16 AM

In D88982#2317443, @ctetreau wrote:

This isn't really a true polynomial because it doesn't support multiplication of two terms: 2x * 2x = 4x^2

If what we have here is all we need, then I think it's fine (having not actually code reviewed the implementation at the time of this writing), but we should call it like it is, and ideally implement it in such a way that it would be straightforward to implement a true polynomial.

Are you suggesting rename PolyBase or just to remove the term 'polynomial' from the commit message and any of the comments?

In D88982#2317488, @sdesmalen wrote:

Are you suggesting rename PolyBase or just to remove the term 'polynomial' from the commit message and any of the comments?

I'm suggesting a rename. According to wikipedia, a polynomial of first degree is called a linear polynomial. So, I propose the following simplified hierarchy:

LinearPolyBase<Type, Dimensions, IsUnivariate>
   ^   
   |   
   +---- StackOffset : LinearPolyBase<int64_t, 2, false>
   ^
   |   
   +---- PolySize : LinearPolyBase<Type, 2, true>
                ^   
                |   
                +------------- ElementCount (typedef with Type=unsigned)
                |   
                +------------- TypeSize (subclass with Type=uint64_t)

If we do this, and don't define operator* with another linear polynomial on the right, then we should be good to go. I don't think MixedPolyBase2D bought us anything, all the new things you added were "scalable" specific anyway. Similarly, for SinglePolyBase2D: might as well just let it be a "scalable size" IMO.

ctetreau added inline comments.Oct 7 2020, 2:27 PM

llvm/include/llvm/Support/TypeSize.h
34	NIT: might as well use std::array
40	NIT: it would be more immediately obvious what this is if you used `std::numeric_limits<unsigned>::max()`
42	NIT: Nomenclature: suggest changing to IsUnivariate to match the terminology of the mathematical concept
64	I don't think the call to verify() is necessary in any of the arithmetic operators. Just assert that RHS and LHS have the same value set for ExclusiveDim. You could even special case: if (DimIsExclusive) { Coefficients[ExclusiveDim] += RHS.Coefficients[ExclusiveDim]; } else { ... }

Renamed PolyBase -> LinearPolyBase, DimIsExclusive -> IsUnivariate, etc.
Removed verify() method.
Replaced regular array with std::array.
Changed name of new StackOffset class in TypeSize.h to NewStackOffset, so not to clash with the one defined in AArch64StackOffset.h. This is renamed to StackOffset in D88983 when it removes AArch64StackOffset.h entirely.
Moved TestStackOffset.cpp from D88983 to this patch.
Updated commit message.

Herald added a subscriber: mgorny. · View Herald TranscriptOct 8 2020, 9:57 AM

Rebased patch.
Added class LinearPolyBaseOperators which provides a set of overloaded operators for derived classes.
Made ElementCount a deriving class, because it needs 'isScalar' and 'isVector' methods which cannot be part of its parent class.

sdesmalen edited the summary of this revision. (Show Details)Oct 12 2020, 10:03 AM

david-arm added inline comments.Oct 13 2020, 5:01 AM

llvm/include/llvm/Support/TypeSize.h
37	Can you remove the need for ScalarTy by having Ty provide the scalar type? i.e. Ty::ScalarTy, where ScalarTy would be a typedef or something inside LinearPolyBase? Instead of using multiple inheritance to get the operators, is it not possible to simply have the operators outside of any class? I think C++ allows you to do this provided the operators are friends of the relevant classes, i.e. just have template <Ty> LinearPolyBase<Ty> operator+(const LinearPolyBase<Ty> LHS, const LinearPolyBase<Ty> RHS) { for (unsigned I = 0; I < Dimensions; ++I) LHS.Coefficients[I] -= RHS.Coefficients[I]; return LHS; } although I realise the example above only works if you have a simplified LinearPolyBase that assumes 2 dimensions, rather than the 3 template parameters you currently have for LinearPolyBase.
79	Do we actually need to support this right now? I just wonder if we can simplify things by just assuming it's always univariate for now. I guess I'm just worried that we're trying to provide theoretical support for something with no proof as yet that it's needed. We're always going to be targeting one backend, so when switching backends vscale will mean whatever is appropriate for the given backend. I think this code is trying to support a single backend having multiple vscales, unless I've misunderstood something? Also, what happens when the user specifies 3 dimensions and IsUnivariate=true? I guess this is technically allowed, but looks a little odd that's all. This is just a suggestion, but I wonder if the code might be made simpler by always assuming coefficient 0 is unscaled. Then 2 dimensions always implies both univariate and that coefficient 1 is scaled? By definition, 3 dimensions automatically implies IsUnivariate=false too I think? In this case you may actually only need two template parameters. Also, if we assume for now we're always dealing with exactly two dimensions then the enable_if stuff goes away too as then you only need one two-arg constructor.

sdesmalen added inline comments.Oct 13 2020, 7:18 AM

llvm/include/llvm/Support/TypeSize.h
37	Instead of using multiple inheritance to get the operators, is it not possible to simply have the operators outside of any class That is kind of what this class tries to do. It defines the functions as friend functions, where it takes the result/operand types from the derived class, like TypeSize, ElementCount or StackOffset. The problem with using inheritance for the operators is that it inherits the parent's operators, which return the parent class' type. That requires implicit conversion of parent -> derived class. By inheriting from this class like this: class ElementCount : public LinearPolyBase<...>, public LinearPolyBaseOperators<ElementCount, ...> { }; it will automatically instantiate those operators for the derived class.
79	Do we actually need to support this right now? I just wonder if we can simplify things by just assuming it's always univariate for now. We need this for StackOffset, which has IsUnivariate = false, because it needs to support a combination of fixed and scalable elements. I think this code is trying to support a single backend having multiple vscales, unless I've misunderstood something? I've made it generic enough in case this is needed in the future, mostly because it didn't really make things more complicated. I guess we could fix the `Dimensions` to 2, and remove this template operand from the class and explicitly name these dimensions 'Fixed' and 'Scalable', although I'm not sure if that would actually simplify these classes much. Also, what happens when the user specifies 3 dimensions and IsUnivariate=true? I guess this is technically allowed, but looks a little odd that's all. This is just a suggestion, but I wonder if the code might be made simpler by always assuming coefficient 0 is unscaled. Then 2 dimensions always implies both univariate and that coefficient 1 is scaled? By definition, 3 dimensions automatically implies IsUnivariate=false too I think? In this case you may actually only need two template parameters The dimensions don't have a meaning in this class, it's just a representation for: `c0 * scale0 + c1 * scale1 + ... + cK * scaleK`. This abstract class could represent a K dimensionsal linear polynomial, where IsUnivariate would ensure that only one of {c0, c1, ..., ck} is non-zero. Also, if we assume for now we're always dealing with exactly two dimensions then the enable_if stuff goes away too as then you only need one two-arg constructor The enable_if is there to distinguish constructors for the Univariate and non-Univariate cases. i.e. If only one dimension is allowed to be set, you call it with `(cK, #dimK)`, otherwise you call it with `({c0, c1, ..., cK})`.

@sdesmalen I'm generally ok with this approach. This patch is pretty complicated, so I haven't had the time to fully digest it and decide if it's actually correct. I plan to try to find time to do this soon, but I'd feel more comfortable if others took a look as well.

llvm/include/llvm/Support/TypeSize.h
79	I like the template parameter for the number of different variables, and would prefer that we didn't fix it to 2. I think it doesn't really increase the complexity, and makes supporting more in a hypothetical future where some target has multiple vscale like things much easier. It is my personal opinion that N+1 template parameters aren't more complex than N template parameters unless N is zero.

sdesmalen added a reviewer: DavidTruby.Oct 21 2020, 10:08 AM

ctetreau added inline comments.Oct 21 2020, 10:16 AM

llvm/include/llvm/Support/TypeSize.h
37	Is it ever valid for the operator to have a different scalar type than the polynomial? If not, then I agree with David that LinearPolyBase should publicly expose a `using ScalarType = T`, and that this type should only 1 template parameter that is the polynomial type. It might also be nice to do some type_traits magic to static_assert that the polynomial type actually inherits from LinearPolyBase so the user doesn't get 20 pages of template garbage if they mess it up.
79	NIT: add static_assert that Dimensions is not unsigned max
160	why "`NewStackOffset`"?
161	Shouldn't the first template parameter be `NewStackOffset`?
177	What's the point of this? NewStackOffset is already basically a pair of int64_t. Also, the user must read the implementation to know which member of the pair is the fixed coefficient and which is the scalable one. At the very least, this should be documented.
192	If we name these dims, we can just use them instead of mapping bool -> unsigned
193–194
199–200
202
208
llvm/unittests/Support/StackOffsetTest.cpp
1 ↗	(On Diff #297619)	We should probably just write unit tests for LinearPolyBase, and get rid of tests for ElementCount, TypeSize, and StackOffset unless they test actual new features.

Removed need for LinearPolyBaseOperators. The code now passes the derived leaf type (i.e. ElementCount/TypeSize) directly to its parent classes and the operators in the base class are now defined to take/return the leaf type.
The scalar type is no longer passed separately, but derived from the Leaf type using a type traits class.
Added new tests based directly on LinearPolyBase.

Thanks for the feedback @ctetreau!

llvm/include/llvm/Support/TypeSize.h
37	Is it ever valid for the operator to have a different scalar type than the polynomial? It is not. If not, then I agree with David that LinearPolyBase should publicly expose a using ScalarType = T, and that this type should only 1 template parameter that is the polynomial type. It might also be nice to do some type_traits magic to static_assert that the polynomial type actually inherits from LinearPolyBase so the user doesn't get 20 pages of template garbage if they mess it up. Okay I've changed things a bit in the latest update, by using a Traits class to get the ScalarType, and passing the Leaf type (i.e. StackOffset/TypeSize/ElementCount) directly to the base class. That also rendered the `LinearPolyBaseOperators` class unnecessary, as this can now move directly into the base class. I've also tried adding some `static_assert(std::is_base_of<LinearPolyBase, LeafTy>)`, but that falls over because LeafTy is incomplete at the point where this is being evaluated.
160	That was to avoid a name clash with `StackOffset` defined in `AArch64StackOffset.h` when both that header and `TypeSize.h` are included. This got renamed back in D88983. But it's better to put this in a separate namespace, and remove only that namespace in D88983.
161	Yes. Not sure how this happened.
177	The NewStackOffset is indeed a pair, but it's not a `std::pair`, so I thought it would be nice to have this as convenience function. But it's probably better to remove this interface for now, as I don't use it in any of the follow-up patches.
192	That's a great suggestion, thanks!

ctetreau added inline comments.Oct 22 2020, 2:46 PM

llvm/include/llvm/Support/TypeSize.h
34	Couldn't we just require that any type used as `LeafTy` have a `ScalarTy` typedef?
158–161	Does this need to be inside the `NewStackOffset` namespace? I believe that you are forward declaring `llvm::StackOffset` and then creating the type traits for it, not `llvm::NewStackOffset::StackOffset`

Fixed namespace for StackOffset traits.
Noticed that StackOffset was still using int64_t instead of it's ScalarTy, so fixed it.
Changed instance of std::all_of -> llvm::all_of

sdesmalen added inline comments.Oct 23 2020, 2:49 AM

llvm/include/llvm/Support/TypeSize.h
34	When I try that, I get: error: invalid use of incomplete type 'class llvm::StackOffset' 55 \| using SomeOtherTy = typename LeafTy::SomeTy; \| ^~~~~~~~~~~
158–161	Good catch! I'm not sure why this still built. I can't move the specialization into the NewStackOffset namespace, as this needs to happen in the same namespace as it's declaration, but I can move the forward declaration of StackOffset into a namespace, and specify the traits for `NewStackOffset::StackOffset`.

sdesmalen added a reviewer: georges.Oct 26 2020, 3:29 AM

georges added inline comments.Oct 26 2020, 5:30 AM

llvm/include/llvm/Support/TypeSize.h
47–48	Not a suggestion, but just an observation that we're passing some arguments to the class via template parameters and others via the traits object. Seems inconsistent?
56–67	Is it worth specializing this struct template for IsUnivariate=true/false, since with the number of enable_if's scattered around they begin to look like quite different classes? It also seems a bit wasteful to be allocating Dimensions elements when all but one of them will be zero in the Univariate case?
118	nit: I think you can use the _v/_t versions of these functions here and elsewhere since we're in C++14 now, e.g. `std::enable_if_t<std::is_signed_v<U>, LeafTy>`.

ctetreau added inline comments.Oct 26 2020, 12:04 PM

llvm/include/llvm/Support/TypeSize.h
125–128	Out of bounds array access.

ctetreau added inline comments.Oct 26 2020, 1:00 PM

llvm/include/llvm/Support/TypeSize.h
34	I played around with it for a bit today, and I was unable to make it any nicer.

sdesmalen added inline comments.Oct 26 2020, 2:08 PM

llvm/include/llvm/Support/TypeSize.h
56–67	@ctetreau do you have any thoughts on this? I guess I could split this up and end up with a two base classes, as I agree with @georges that for >2 dimensions the code isn't optimal. By having two base classes, the univariate case can be optimized. The downside is some duplication, but it would avoid the need for `enable_if` and probably simplify the code.
125–128	Eek, good spot!

ctetreau added inline comments.Oct 26 2020, 4:17 PM

llvm/include/llvm/Support/TypeSize.h
56–67	I'm not so much worried about wasted space because Dimensions is likely to be low. However, my general feeling is that this class getting way too fancy with the template metaprogramming. I wasn't going to complain about it, but since you're asking I think it would be more straightforward if they were separate classes. I've also wondered on more than on occasion if we ought to just use a linear algebra lib for this. This class smells a lot like a vector to me...

Separated out the base classes, one specialized for StackOffset and another specialized for LinearPolySize.
Moved dimensions to Traits class.
Use enable_if_t instead of enable_if

Okay, I have addressed all comments. Specializing the two base classes for Mixed and Univariate cases and moving the dimension to the Traits class improves the code quite a bit.
Let me know what you think!

llvm/include/llvm/Support/TypeSize.h
56–67	Using a linear algebra lib is probably a bit overkill here as the few supported operations are simple enough to implement. Unless there is already something in LLVM for this?

georges added inline comments.Oct 28 2020, 7:14 AM

llvm/include/llvm/Support/TypeSize.h
60	worth an assert that UniveriateDim is in range?

Other than the nit, LGTM. I'll let others have a look and accept.

Looks good to me once you add that assert.

llvm/include/llvm/Support/TypeSize.h
56–67	Yeah, that was my thought too, which is why I didn't actually suggest it. I'm surprised we don't have a linear algebra lib already (either homegrown or not) though. Seems like it'd be useful for doing things like constexpr matrix multiplications.
60	I think this would be a useful assert

Herald added a subscriber: dexonsmith. · View Herald TranscriptOct 28 2020, 9:50 AM

Added assert.

sdesmalen marked an inline comment as done.Oct 28 2020, 10:40 AM

sdesmalen added inline comments.

llvm/include/llvm/Support/TypeSize.h
60	Agreed, good catch!

sdesmalen added a child revision: D90342: [POC][LoopVectorizer] Propagate ElementCount to interfaces in preparation for scalable auto-vec. .Oct 28 2020, 2:12 PM

LGTM and a lot tidier now, provided you're also happy with this @ctetreau?

This revision is now accepted and ready to land.Oct 30 2020, 10:14 AM

Yeah, I think this is good to go

Closed by commit rG1667d23e585c: [NFCI] Add StackOffset class and base classes for ElementCount, TypeSize. (authored by sdesmalen). · Explain WhyNov 3 2020, 1:44 AM

This revision was automatically updated to reflect the committed changes.

sdesmalen added a commit: rG1667d23e585c: [NFCI] Add StackOffset class and base classes for ElementCount, TypeSize..

Thanks for the reviews and great feedback, I feel like the patch has much improved since its first revision!

gchatelet mentioned this in D140263: [NFC] Vastly simplifies TypeSize.Jan 3 2023, 7:49 AM

Revision Contents

Path

Size

llvm/

include/

llvm/

Support/

TypeSize.h

454 lines

unittests/

Support/

CMakeLists.txt

1 line

LinearPolyBaseTest.cpp

163 lines

Diff 300031

llvm/include/llvm/Support/TypeSize.h

Show All 9 Lines

// which may be scalable vectors. It provides convenience operators so that // which may be scalable vectors. It provides convenience operators so that

// it can be used in much the same way as a single scalar value. // it can be used in much the same way as a single scalar value.

// //

//===----------------------------------------------------------------------===// //===----------------------------------------------------------------------===//

#ifndef LLVM_SUPPORT_TYPESIZE_H #ifndef LLVM_SUPPORT_TYPESIZE_H

#define LLVM_SUPPORT_TYPESIZE_H #define LLVM_SUPPORT_TYPESIZE_H

#include "llvm/ADT/ArrayRef.h"

#include "llvm/Support/MathExtras.h" #include "llvm/Support/MathExtras.h"

#include "llvm/Support/WithColor.h" #include "llvm/Support/WithColor.h"

#include <cstdint> #include <array>

#include <cassert> #include <cassert>

#include <cstdint>

#include <type_traits>

namespace llvm { namespace llvm {

template <typename T> struct DenseMapInfo; //===----------------------------------------------------------------------===//

// LinearPolyBase - the main base class for ElementCount, TypeSize and

// StackOffset.

//===----------------------------------------------------------------------===//

// TODO: This class will be redesigned in a later patch that introduces full template <typename LeafTy> struct LinearPolyBaseTypeTraits {};

ctetreauUnsubmitted

Done

NIT: might as well use std::array

ctetreau: NIT: might as well use std::array

ctetreauUnsubmitted

Done

Couldn't we just require that any type used as LeafTy have a ScalarTy typedef?

ctetreau: Couldn't we just require that any type used as `LeafTy` have a `ScalarTy` typedef?

sdesmalenAuthorUnsubmitted

Done

When I try that, I get:

error: invalid use of incomplete type 'class llvm::StackOffset'
   55 |   using SomeOtherTy = typename LeafTy::SomeTy;
      |         ^~~~~~~~~~~

sdesmalen: When I try that, I get: ```error: invalid use of incomplete type 'class llvm::StackOffset'…

ctetreauUnsubmitted

Done

I played around with it for a bit today, and I was unable to make it any nicer.

ctetreau: I played around with it for a bit today, and I was unable to make it any nicer.

// polynomial behaviour, i.e. the ability to have composites made up of both

// fixed and scalable sizes.

template <typename T> class PolySize {

protected:

T MinVal; // The minimum value that it could be.

bool IsScalable; // If true, the total value is determined by multiplying

// 'MinVal' by a runtime determinded quantity, 'vscale'.

constexpr PolySize(T MinVal, bool IsScalable) /// LinearPolyBase is a base class for ElementCount, TypeSize and StackOffset.

: MinVal(MinVal), IsScalable(IsScalable) {} /// It tries to represent a linear polynomial, e.g.

david-armUnsubmitted

Done

Can you remove the need for ScalarTy by having Ty provide the scalar type? i.e. Ty::ScalarTy, where ScalarTy would be a typedef or something inside LinearPolyBase?

Instead of using multiple inheritance to get the operators, is it not possible to simply have the operators outside of any class? I think C++ allows you to do this provided the operators are friends of the relevant classes, i.e. just have

template <Ty>
LinearPolyBase<Ty> operator+(const LinearPolyBase<Ty> LHS, const LinearPolyBase<Ty> RHS) {

for (unsigned I = 0; I < Dimensions; ++I)
  LHS.Coefficients[I] -= RHS.Coefficients[I];
return LHS;

}

although I realise the example above only works if you have a simplified LinearPolyBase that assumes 2 dimensions, rather than the 3 template parameters you currently have for LinearPolyBase.

david-arm: Can you remove the need for ScalarTy by having Ty provide the scalar type? i.e. Ty::ScalarTy…

sdesmalenAuthorUnsubmitted

Done

Instead of using multiple inheritance to get the operators, is it not possible to simply have the operators outside of any class

That is kind of what this class tries to do. It defines the functions as friend functions, where it takes the result/operand types from the derived class, like TypeSize, ElementCount or StackOffset.

The problem with using inheritance for the operators is that it inherits the parent's operators, which return the parent class' type. That requires implicit conversion of parent -> derived class. By inheriting from this class like this:

class ElementCount : public LinearPolyBase<...>,
                     public LinearPolyBaseOperators<ElementCount, ...> {
};

it will automatically instantiate those operators for the derived class.

sdesmalen: > Instead of using multiple inheritance to get the operators, is it not possible to simply have…

ctetreauUnsubmitted

Done

Is it ever valid for the operator to have a different scalar type than the polynomial?

If not, then I agree with David that LinearPolyBase should publicly expose a using ScalarType = T, and that this type should only 1 template parameter that is the polynomial type.

It might also be nice to do some type_traits magic to static_assert that the polynomial type actually inherits from LinearPolyBase so the user doesn't get 20 pages of template garbage if they mess it up.

ctetreau: Is it ever valid for the operator to have a different scalar type than the polynomial? If not…

sdesmalenAuthorUnsubmitted

Done

Is it ever valid for the operator to have a different scalar type than the polynomial?

It is not.

If not, then I agree with David that LinearPolyBase should publicly expose a using ScalarType = T, and that this type should only 1 template parameter that is the polynomial type.

It might also be nice to do some type_traits magic to static_assert that the polynomial type actually inherits from LinearPolyBase so the user doesn't get 20 pages of template garbage if they mess it up.

Okay I've changed things a bit in the latest update, by using a Traits class to get the ScalarType, and passing the Leaf type (i.e. StackOffset/TypeSize/ElementCount) directly to the base class. That also rendered the LinearPolyBaseOperators class unnecessary, as this can now move directly into the base class.

I've also tried adding some static_assert(std::is_base_of<LinearPolyBase, LeafTy>), but that falls over because LeafTy is incomplete at the point where this is being evaluated.

sdesmalen: > Is it ever valid for the operator to have a different scalar type than the polynomial? It is…

/// c0 * scale0 + c1 * scale1 + ... + cK * scaleK

/// where the scale is implicit. Only the coefficients are encoded.

///

ctetreauUnsubmitted

Done

NIT: it would be more immediately obvious what this is if you used std::numeric_limits<unsigned>::max()

ctetreau: NIT: it would be more immediately obvious what this is if you used `std…

/// \param LeafTy is the derived leaf class, like StackOffset or TypeSize.

/// \param Dimensions are the number of dimensions of the linear polynomial.

ctetreauUnsubmitted

Done

NIT: Nomenclature: suggest changing to IsUnivariate to match the terminology of the mathematical concept

ctetreau: NIT: Nomenclature: suggest changing to IsUnivariate to match the terminology of the…

/// \param IsUnivariate lets LinearPolyBase represent a linear polynomial with

/// coefficients for only a single dimension when true, or multiple coefficients

/// when false.

/// The LinearPolyBaseTypeTraits are used to infer the scalar type from the leaf

/// class.

template <typename LeafTy, unsigned Dimensions, bool IsUnivariate>

georgesUnsubmitted

Done

Not a suggestion, but just an observation that we're passing some arguments to the class via template parameters and others via the traits object. Seems inconsistent?

georges: Not a suggestion, but just an observation that we're passing some arguments to the class via…

class LinearPolyBase {

static_assert(Dimensions != std::numeric_limits<unsigned>::max(),

"Dimensions out of range");

public: public:

using ScalarTy = typename LinearPolyBaseTypeTraits<LeafTy>::ScalarTy;

protected:

std::array<ScalarTy, Dimensions> Coefficients;

unsigned UnivariateDim;

// Create a LinearPolyBase for multiple dimensions.

georgesUnsubmitted

Done

worth an assert that UniveriateDim is in range?

georges: worth an assert that UniveriateDim is in range?

ctetreauUnsubmitted

Done

I think this would be a useful assert

ctetreau: I think this would be a useful assert

sdesmalenAuthorUnsubmitted

Done

Agreed, good catch!

sdesmalen: Agreed, good catch!

template <bool U = IsUnivariate>

LinearPolyBase(ArrayRef<typename std::enable_if<!U, ScalarTy>::type> Values)

: UnivariateDim(std::numeric_limits<unsigned>::max()) {

assert(Values.size() == Dimensions && "Incorrect number of values");

ctetreauUnsubmitted

Done

I don't think the call to verify() is necessary in any of the arithmetic operators. Just assert that RHS and LHS have the same value set for ExclusiveDim.

You could even special case:

if (DimIsExclusive) {
   Coefficients[ExclusiveDim] += RHS.Coefficients[ExclusiveDim];
}
else {
   ...
}

ctetreau: I don't think the call to verify() is necessary in any of the arithmetic operators. Just assert…

for (unsigned I = 0; I < Dimensions; ++I)

Coefficients[I] = Values[I];

}

georgesUnsubmitted

Done

Is it worth specializing this struct template for IsUnivariate=true/false, since with the number of enable_if's scattered around they begin to look like quite different classes? It also seems a bit wasteful to be allocating Dimensions elements when all but one of them will be zero in the Univariate case?

georges: Is it worth specializing this struct template for IsUnivariate=true/false, since with the…

sdesmalenAuthorUnsubmitted

Done

@ctetreau do you have any thoughts on this? I guess I could split this up and end up with a two base classes, as I agree with @georges that for >2 dimensions the code isn't optimal. By having two base classes, the univariate case can be optimized. The downside is some duplication, but it would avoid the need for enable_if and probably simplify the code.

sdesmalen: @ctetreau do you have any thoughts on this? I guess I could split this up and end up with a two…

ctetreauUnsubmitted

Done

I'm not so much worried about wasted space because Dimensions is likely to be low.

However, my general feeling is that this class getting way too fancy with the template metaprogramming. I wasn't going to complain about it, but since you're asking I think it would be more straightforward if they were separate classes.

I've also wondered on more than on occasion if we ought to just use a linear algebra lib for this. This class smells a lot like a vector to me...

ctetreau: I'm not so much worried about wasted space because Dimensions is likely to be low. However, my…

sdesmalenAuthorUnsubmitted

Done

Using a linear algebra lib is probably a bit overkill here as the few supported operations are simple enough to implement. Unless there is already something in LLVM for this?

sdesmalen: Using a linear algebra lib is probably a bit overkill here as the few supported operations are…

ctetreauUnsubmitted

Done

Yeah, that was my thought too, which is why I didn't actually suggest it. I'm surprised we don't have a linear algebra lib already (either homegrown or not) though. Seems like it'd be useful for doing things like constexpr matrix multiplications.

ctetreau: Yeah, that was my thought too, which is why I didn't actually suggest it. I'm surprised we…

// Create a LinearPolyBase for a single dimension.

template <bool U = IsUnivariate>

LinearPolyBase(const typename std::enable_if<(U), ScalarTy>::type &Val,

unsigned UnivariateDim)

: UnivariateDim(UnivariateDim) {

assert(UnivariateDim < Dimensions && "Incorrect dimension");

Coefficients.fill(0);

Coefficients[UnivariateDim] = Val;

}

// Operators for subclasses.

david-armUnsubmitted

Done

Do we actually need to support this right now? I just wonder if we can simplify things by just assuming it's always univariate for now. I guess I'm just worried that we're trying to provide theoretical support for something with no proof as yet that it's needed. We're always going to be targeting one backend, so when switching backends vscale will mean whatever is appropriate for the given backend. I think this code is trying to support a single backend having multiple vscales, unless I've misunderstood something?

Also, what happens when the user specifies 3 dimensions and IsUnivariate=true? I guess this is technically allowed, but looks a little odd that's all.

This is just a suggestion, but I wonder if the code might be made simpler by always assuming coefficient 0 is unscaled. Then 2 dimensions always implies both univariate and that coefficient 1 is scaled? By definition, 3 dimensions automatically implies IsUnivariate=false too I think? In this case you may actually only need two template parameters. Also, if we assume for now we're always dealing with exactly two dimensions then the enable_if stuff goes away too as then you only need one two-arg constructor.

david-arm: Do we actually need to support this right now? I just wonder if we can simplify things by just…

sdesmalenAuthorUnsubmitted

Done

Do we actually need to support this right now? I just wonder if we can simplify things by just assuming it's always univariate for now.

We need this for StackOffset, which has IsUnivariate = false, because it needs to support a combination of fixed and scalable elements.

I think this code is trying to support a single backend having multiple vscales, unless I've misunderstood something?

I've made it generic enough in case this is needed in the future, mostly because it didn't really make things more complicated. I guess we could fix the Dimensions to 2, and remove this template operand from the class and explicitly name these dimensions 'Fixed' and 'Scalable', although I'm not sure if that would actually simplify these classes much.

Also, what happens when the user specifies 3 dimensions and IsUnivariate=true? I guess this is technically allowed, but looks a little odd that's all.
This is just a suggestion, but I wonder if the code might be made simpler by always assuming coefficient 0 is unscaled. Then 2 dimensions always implies both univariate and that coefficient 1 is scaled? By definition, 3 dimensions automatically implies IsUnivariate=false too I think? In this case you may actually only need two template parameters

The dimensions don't have a meaning in this class, it's just a representation for: c0 * scale0 + c1 * scale1 + ... + cK * scaleK. This abstract class could represent a K dimensionsal linear polynomial, where IsUnivariate would ensure that only one of {c0, c1, ..., ck} is non-zero.

Also, if we assume for now we're always dealing with exactly two dimensions then the enable_if stuff goes away too as then you only need one two-arg constructor

The enable_if is there to distinguish constructors for the Univariate and non-Univariate cases. i.e. If only one dimension is allowed to be set, you call it with (cK, #dimK), otherwise you call it with ({c0, c1, ..., cK}).

sdesmalen: > Do we actually need to support this right now? I just wonder if we can simplify things by…

ctetreauUnsubmitted

Done

I like the template parameter for the number of different variables, and would prefer that we didn't fix it to 2. I think it doesn't really increase the complexity, and makes supporting more in a hypothetical future where some target has multiple vscale like things much easier. It is my personal opinion that N+1 template parameters aren't more complex than N template parameters unless N is zero.

ctetreau: I like the template parameter for the number of different variables, and would prefer that we…

ctetreauUnsubmitted

Done

NIT: add static_assert that Dimensions is not unsigned max

ctetreau: NIT: add static_assert that Dimensions is not unsigned max

friend LeafTy &operator+=(LeafTy &LHS, const LeafTy &RHS) {

assert((!IsUnivariate || LHS.UnivariateDim == RHS.UnivariateDim) &&

"Invalid dimensions");

for (unsigned I = 0; I < Dimensions; ++I)

LHS.Coefficients[I] += RHS.Coefficients[I];

return LHS;

}

friend LeafTy operator+(const LeafTy &LHS, const LeafTy &RHS) {

LeafTy Copy = LHS;

return Copy += RHS;

}

friend LeafTy &operator-=(LeafTy &LHS, const LeafTy &RHS) {

assert((!IsUnivariate || LHS.UnivariateDim == RHS.UnivariateDim) &&

"Invalid dimensions");

for (unsigned I = 0; I < Dimensions; ++I)

LHS.Coefficients[I] -= RHS.Coefficients[I];

return LHS;

}

friend LeafTy operator-(const LeafTy &LHS, const LeafTy &RHS) {

LeafTy Copy = LHS;

return Copy -= RHS;

}

friend LeafTy &operator*=(LeafTy &LHS, ScalarTy RHS) {

for (unsigned I = 0; I < Dimensions; ++I)

LHS.Coefficients[I] *= RHS;

return LHS;

}

friend LeafTy operator*(const LeafTy &LHS, ScalarTy RHS) {

LeafTy Copy = LHS;

return Copy *= RHS;

}

template <typename U = ScalarTy>

friend typename std::enable_if<std::is_signed<U>::value, LeafTy>::type

georgesUnsubmitted

Done

nit: I think you can use the _v/_t versions of these functions here and elsewhere since we're in C++14 now, e.g. std::enable_if_t<std::is_signed_v<U>, LeafTy>.

georges: nit: I think you can use the _v/_t versions of these functions here and elsewhere since we're…

operator-(const LeafTy &LHS) {

LeafTy Copy = LHS;

return Copy *= -1;

}

static constexpr PolySize getFixed(T MinVal) { return {MinVal, false}; } public:

static constexpr PolySize getScalable(T MinVal) { return {MinVal, true}; } bool operator==(const LinearPolyBase &RHS) const {

static constexpr PolySize get(T MinVal, bool IsScalable) { return std::equal(&Coefficients[0], &Coefficients[Dimensions],

return {MinVal, IsScalable}; &RHS.Coefficients[0]);

} }

ctetreauUnsubmitted

Done

public:

bool operator==(const LinearPolyBase &RHS) const {

- return std::equal(&Coefficients[0], &Coefficients[Dimensions],

- &RHS.Coefficients[0]);

+ return std::equal(Coefficients.begin(), Coefficients.end(),

+ RHS.Coefficients.begin());

}

bool operator!=(const LinearPolyBase &RHS) const { return !(*this == RHS); }

Out of bounds array access.

ctetreau: Out of bounds array access.

sdesmalenAuthorUnsubmitted

Done

Eek, good spot!

sdesmalen: Eek, good spot!

static constexpr PolySize getNull() { return {0, false}; } bool operator!=(const LinearPolyBase &RHS) const { return !(*this == RHS); }

bool isZero() const {

return std::all_of(Coefficients.begin(), Coefficients.end(),

[](const ScalarTy &C) { return C == 0; });

}

/// Counting predicates. /// Counting predicates.

/// ///

///@{ No elements.. ///@{ No elements..

bool isZero() const { return MinVal == 0; }

/// At least one element.

bool isNonZero() const { return !isZero(); } bool isNonZero() const { return !isZero(); }

/// At least one element.

explicit operator bool() const { return isNonZero(); }

ScalarTy getValue(unsigned Dim) const { return Coefficients[Dim]; }

template <bool U = IsUnivariate>

typename std::enable_if<(U), ScalarTy>::type getExclusiveValue() const {

return Coefficients[UnivariateDim];

}

};

//===----------------------------------------------------------------------===//

// StackOffset - Represent a two-dimensional stack offset with fixed and

// scalable component. Leaf class that derives from LinearPolyBase directly.

//===----------------------------------------------------------------------===//

class StackOffset;

template <> struct LinearPolyBaseTypeTraits<StackOffset> {

using ScalarTy = int64_t;

ctetreauUnsubmitted

Done

why "NewStackOffset"?

ctetreau: why "`NewStackOffset`"?

sdesmalenAuthorUnsubmitted

Done

That was to avoid a name clash with StackOffset defined in AArch64StackOffset.h when both that header and TypeSize.h are included. This got renamed back in D88983. But it's better to put this in a separate namespace, and remove only that namespace in D88983.

sdesmalen: That was to avoid a name clash with `StackOffset` defined in `AArch64StackOffset.h` when both…

};

ctetreauUnsubmitted

Done

Shouldn't the first template parameter be NewStackOffset?

ctetreau: Shouldn't the first template parameter be `NewStackOffset`?

sdesmalenAuthorUnsubmitted

Done

Yes. Not sure how this happened.

sdesmalen: Yes. Not sure how this happened.

ctetreauUnsubmitted

Done

Does this need to be inside the NewStackOffset namespace? I believe that you are forward declaring llvm::StackOffset and then creating the type traits for it, not llvm::NewStackOffset::StackOffset

ctetreau: Does this need to be inside the `NewStackOffset` namespace? I believe that you are forward…

sdesmalenAuthorUnsubmitted

Done

Good catch! I'm not sure why this still built. I can't move the specialization into the NewStackOffset namespace, as this needs to happen in the same namespace as it's declaration, but I can move the forward declaration of StackOffset into a namespace, and specify the traits for NewStackOffset::StackOffset.

sdesmalen: Good catch! I'm not sure why this still built. I can't move the specialization into the…

namespace NewStackOffset {

using StackOffsetBase =

LinearPolyBase<StackOffset, /*Dimensions=*/2, /*IsUnivariate=*/false>;

/// StackOffset is a class to represent an offset with 2 dimensions,

/// named fixed and scalable, respectively. This class allows a value for both

/// dimensions to depict e.g. "8 bytes and 16 scalable bytes", which is needed

/// to represent stack offsets.

class StackOffset : public StackOffsetBase {

protected:

StackOffset(int64_t Fixed, int64_t Scalable)

: StackOffsetBase({Fixed, Scalable}) {}

public:

ctetreauUnsubmitted

Done

What's the point of this? NewStackOffset is already basically a pair of int64_t.

Also, the user must read the implementation to know which member of the pair is the fixed coefficient and which is the scalable one. At the very least, this should be documented.

ctetreau: What's the point of this? NewStackOffset is already basically a pair of int64_t. Also, the…

sdesmalenAuthorUnsubmitted

Done

The NewStackOffset is indeed a pair, but it's not a std::pair, so I thought it would be nice to have this as convenience function. But it's probably better to remove this interface for now, as I don't use it in any of the follow-up patches.

sdesmalen: The NewStackOffset is indeed a pair, but it's not a `std::pair`, so I thought it would be nice…

StackOffset() : StackOffset({0, 0}) {}

StackOffset(const StackOffsetBase &Other) : StackOffsetBase(Other) {}

static StackOffset getFixed(int64_t Fixed) { return {Fixed, 0}; }

static StackOffset getScalable(int64_t Scalable) { return {0, Scalable}; }

static StackOffset get(int64_t Fixed, int64_t Scalable) {

return {Fixed, Scalable};

}

int64_t getFixed() const { return this->getValue(0); }

int64_t getScalable() const { return this->getValue(1); }

};

} // end namespace NewStackOffset

//===----------------------------------------------------------------------===//

ctetreauUnsubmitted

Done

public LinearPolyBaseOperators<LinearPolySize<T>, T> {

protected:

+ enum Dims : unsigned {

+ FixedDim,

+ ScalableDim

+ };

LinearPolySize(T MinVal, bool IsScalable)

If we name these dims, we can just use them instead of mapping bool -> unsigned

ctetreau: If we name these dims, we can just use them instead of mapping bool -> unsigned

sdesmalenAuthorUnsubmitted

Done

That's a great suggestion, thanks!

sdesmalen: That's a great suggestion, thanks!

// LinearPolySizeBase - base class for two-dimensional sizes

// ^ ^ (either fixed or scalable)

ctetreauUnsubmitted

Done

protected:

- LinearPolySize(T MinVal, bool IsScalable)

- : LinearPolySizeBase<T>(MinVal, !IsScalable ? 0 : 1) {}

+ LinearPolySize(T MinVal, Dims D)

+ : LinearPolySizeBase<T>(MinVal, D) {}

public:

ctetreau:

// | |

// | +----- ElementCount - Leaf class to represent an element count

// | (vscale x unsigned)

// |

// +-------- TypeSize - Leaf class to represent a type size

// (vscale x uint64_t)

ctetreauUnsubmitted

Done

: LinearPolySizeBase<T>(Other) {}

- static LinearPolySize getFixed(T MinVal) { return {MinVal, false}; }

- static LinearPolySize getScalable(T MinVal) { return {MinVal, true}; }

+ static LinearPolySize getFixed(T MinVal) { return {MinVal, FixedDim}; }

+ static LinearPolySize getScalable(T MinVal) { return {MinVal, ScalableDim}; }

static LinearPolySize get(T MinVal, bool Scalable) {

ctetreau:

//===----------------------------------------------------------------------===//

ctetreauUnsubmitted

Done

static LinearPolySize get(T MinVal, bool Scalable) {

- return {MinVal, Scalable};

+ return {MinVal, Scalable ? ScalableDim : FixedDim};

}

/// Returns the minimum value this size can represent.

ctetreau:

template <typename LeafTy>

using LinearPolySizeBase =

LinearPolyBase<LeafTy, /*Dimensions=*/2, /*IsUnivariate=*/true>;

/// LinearPolySize is a base class to represent 2 dimensions, where the

/// base class can only represent 1 dimension exclusively at a time, i.e. it is

ctetreauUnsubmitted

Done

/// Returns whether the size is scaled by a runtime quantity (vscale).

- bool isScalable() const { return this->UnivariateDim == 1; }

+ bool isScalable() const { return this->UnivariateDim == ScalableDim; }

/// A return value of true indicates we know at compile time that the number

ctetreau:

/// either fixed-sized or it is scalable-sized, but it cannot be both.

template <typename LeafTy>

class LinearPolySize : public LinearPolySizeBase<LeafTy> {

public:

using ScalarTy = typename LinearPolySizeBase<LeafTy>::ScalarTy;

enum Dims : unsigned { FixedDim = 0, ScalableDim = 1 };

protected:

LinearPolySize(ScalarTy MinVal, Dims D)

: LinearPolySizeBase<LeafTy>(MinVal, D) {}

public:

static LeafTy getFixed(ScalarTy MinVal) {

return static_cast<LeafTy>(LinearPolySize(MinVal, FixedDim));

}

static LeafTy getScalable(ScalarTy MinVal) {

return static_cast<LeafTy>(LinearPolySize(MinVal, ScalableDim));

}

static LeafTy get(ScalarTy MinVal, bool Scalable) {

return static_cast<LeafTy>(

LinearPolySize(MinVal, Scalable ? ScalableDim : FixedDim));

}

/// Returns the minimum value this size can represent.

ScalarTy getKnownMinValue() const { return this->getExclusiveValue(); }

/// Returns whether the size is scaled by a runtime quantity (vscale).

bool isScalable() const { return this->UnivariateDim == ScalableDim; }

/// A return value of true indicates we know at compile time that the number /// A return value of true indicates we know at compile time that the number

/// of elements (vscale * Min) is definitely even. However, returning false /// of elements (vscale * Min) is definitely even. However, returning false

/// does not guarantee that the total number of elements is odd. /// does not guarantee that the total number of elements is odd.

bool isKnownEven() const { return (MinVal & 0x1) == 0; } bool isKnownEven() const { return (getKnownMinValue() & 0x1) == 0; }

///@} /// This function tells the caller whether the element count is known at

/// compile time to be a multiple of the scalar value RHS.

T getKnownMinValue() const { return MinVal; } bool isKnownMultipleOf(ScalarTy RHS) const {

return getKnownMinValue() % RHS == 0;

}

// Return the minimum value with the assumption that the count is exact. // Return the minimum value with the assumption that the count is exact.

// Use in places where a scalable count doesn't make sense (e.g. non-vector // Use in places where a scalable count doesn't make sense (e.g. non-vector

// types, or vectors in backends which don't support scalable vectors). // types, or vectors in backends which don't support scalable vectors).

T getFixedValue() const { ScalarTy getFixedValue() const {

assert(!IsScalable && assert(!isScalable() &&

"Request for a fixed element count on a scalable object"); "Request for a fixed element count on a scalable object");

return MinVal; return getKnownMinValue();

}

bool isScalable() const { return IsScalable; }

bool operator==(const PolySize &RHS) const {

return MinVal == RHS.MinVal && IsScalable == RHS.IsScalable;

} }

bool operator!=(const PolySize &RHS) const { return !(*this == RHS); }

// For some cases, size ordering between scalable and fixed size types cannot // For some cases, size ordering between scalable and fixed size types cannot

// be determined at compile time, so such comparisons aren't allowed. // be determined at compile time, so such comparisons aren't allowed.

// //

// e.g. <vscale x 2 x i16> could be bigger than <4 x i32> with a runtime // e.g. <vscale x 2 x i16> could be bigger than <4 x i32> with a runtime

// vscale >= 5, equal sized with a vscale of 4, and smaller with // vscale >= 5, equal sized with a vscale of 4, and smaller with

// a vscale <= 3. // a vscale <= 3.

// //

// All the functions below make use of the fact vscale is always >= 1, which // All the functions below make use of the fact vscale is always >= 1, which

// means that <vscale x 4 x i32> is guaranteed to be >= <4 x i32>, etc. // means that <vscale x 4 x i32> is guaranteed to be >= <4 x i32>, etc.

static bool isKnownLT(const PolySize &LHS, const PolySize &RHS) { static bool isKnownLT(const LinearPolySize &LHS, const LinearPolySize &RHS) {

if (!LHS.IsScalable || RHS.IsScalable) if (!LHS.isScalable() || RHS.isScalable())

return LHS.MinVal < RHS.MinVal; return LHS.getKnownMinValue() < RHS.getKnownMinValue();

// LHS.IsScalable = true, RHS.IsScalable = false

return false; return false;

} }

static bool isKnownGT(const PolySize &LHS, const PolySize &RHS) { static bool isKnownGT(const LinearPolySize &LHS, const LinearPolySize &RHS) {

if (LHS.IsScalable || !RHS.IsScalable) if (LHS.isScalable() || !RHS.isScalable())

return LHS.MinVal > RHS.MinVal; return LHS.getKnownMinValue() > RHS.getKnownMinValue();

// LHS.IsScalable = false, RHS.IsScalable = true

return false; return false;

} }

static bool isKnownLE(const PolySize &LHS, const PolySize &RHS) { static bool isKnownLE(const LinearPolySize &LHS, const LinearPolySize &RHS) {

if (!LHS.IsScalable || RHS.IsScalable) if (!LHS.isScalable() || RHS.isScalable())

return LHS.MinVal <= RHS.MinVal; return LHS.getKnownMinValue() <= RHS.getKnownMinValue();

// LHS.IsScalable = true, RHS.IsScalable = false

return false; return false;

} }

static bool isKnownGE(const PolySize &LHS, const PolySize &RHS) { static bool isKnownGE(const LinearPolySize &LHS, const LinearPolySize &RHS) {

if (LHS.IsScalable || !RHS.IsScalable) if (LHS.isScalable() || !RHS.isScalable())

return LHS.MinVal >= RHS.MinVal; return LHS.getKnownMinValue() >= RHS.getKnownMinValue();

// LHS.IsScalable = false, RHS.IsScalable = true

return false; return false;

} }

PolySize operator*(T RHS) { return {MinVal * RHS, IsScalable}; }

PolySize &operator*=(T RHS) {

MinVal *= RHS;

return *this;

}

friend PolySize operator-(const PolySize &LHS, const PolySize &RHS) {

assert(LHS.IsScalable == RHS.IsScalable &&

"Arithmetic using mixed scalable and fixed types");

return {LHS.MinVal - RHS.MinVal, LHS.IsScalable};

}

/// This function tells the caller whether the element count is known at

/// compile time to be a multiple of the scalar value RHS.

bool isKnownMultipleOf(T RHS) const { return MinVal % RHS == 0; }

/// We do not provide the '/' operator here because division for polynomial /// We do not provide the '/' operator here because division for polynomial

/// types does not work in the same way as for normal integer types. We can /// types does not work in the same way as for normal integer types. We can

/// only divide the minimum value (or coefficient) by RHS, which is not the /// only divide the minimum value (or coefficient) by RHS, which is not the

/// same as /// same as

/// (Min * Vscale) / RHS /// (Min * Vscale) / RHS

/// The caller is recommended to use this function in combination with /// The caller is recommended to use this function in combination with

/// isKnownMultipleOf(RHS), which lets the caller know if it's possible to /// isKnownMultipleOf(RHS), which lets the caller know if it's possible to

/// perform a lossless divide by RHS. /// perform a lossless divide by RHS.

PolySize divideCoefficientBy(T RHS) const { LeafTy divideCoefficientBy(ScalarTy RHS) const {

return PolySize(MinVal / RHS, IsScalable); return static_cast<LeafTy>(

LinearPolySize::get(getKnownMinValue() / RHS, isScalable()));

} }

PolySize coefficientNextPowerOf2() const { LeafTy coefficientNextPowerOf2() const {

return PolySize(static_cast<T>(llvm::NextPowerOf2(MinVal)), IsScalable); return static_cast<LeafTy>(LinearPolySize::get(

static_cast<ScalarTy>(llvm::NextPowerOf2(getKnownMinValue())),

isScalable()));

} }

/// Printing function. /// Printing function.

void print(raw_ostream &OS) const { void print(raw_ostream &OS) const {

if (IsScalable) if (isScalable())

OS << "vscale x "; OS << "vscale x ";

OS << MinVal; OS << getKnownMinValue();

} }

}; };

/// Stream operator function for `PolySize`. class ElementCount;

template <typename T> template <> struct LinearPolyBaseTypeTraits<ElementCount> {

inline raw_ostream &operator<<(raw_ostream &OS, const PolySize<T> &PS) { using ScalarTy = unsigned;

PS.print(OS); };

return OS;

}

class ElementCount : public PolySize<unsigned> { class ElementCount : public LinearPolySize<ElementCount> {

public: public:

using ScalarTy = typename LinearPolySize<ElementCount>::ScalarTy;

constexpr ElementCount(PolySize<unsigned> V) : PolySize(V) {} ElementCount(const LinearPolySize<ElementCount> &V) : LinearPolySize(V) {}

/// Counting predicates. /// Counting predicates.

/// ///

/// Notice that MinVal = 1 and IsScalable = true is considered more than ///@{ Number of elements..

/// one element.

///

///@{ No elements..

/// Exactly one element. /// Exactly one element.

bool isScalar() const { return !IsScalable && MinVal == 1; } bool isScalar() const { return !isScalable() && getKnownMinValue() == 1; }

/// One or more elements. /// One or more elements.

bool isVector() const { return (IsScalable && MinVal != 0) || MinVal > 1; } bool isVector() const {

return (isScalable() && getKnownMinValue() != 0) || getKnownMinValue() > 1;

}

///@} ///@}

}; };

// This class is used to represent the size of types. If the type is of fixed class TypeSize;

template <> struct LinearPolyBaseTypeTraits<TypeSize> {

using ScalarTy = uint64_t;

};

// TODO: Most functionality in this class will gradually be phased out

// so it will resemble LinearPolySize as much as possible.

// TypeSize is used to represent the size of types. If the type is of fixed

// size, it will represent the exact size. If the type is a scalable vector, // size, it will represent the exact size. If the type is a scalable vector,

// it will represent the known minimum size. // it will represent the known minimum size.

class TypeSize : public PolySize<uint64_t> { class TypeSize : public LinearPolySize<TypeSize> {

public: public:

constexpr TypeSize(PolySize<uint64_t> V) : PolySize(V) {} using ScalarTy = typename LinearPolySize<TypeSize>::ScalarTy;

constexpr TypeSize(uint64_t MinVal, bool IsScalable) TypeSize(const LinearPolySize<TypeSize> &V) : LinearPolySize(V) {}

: PolySize(MinVal, IsScalable) {} TypeSize(ScalarTy MinVal, bool IsScalable)

: LinearPolySize(LinearPolySize::get(MinVal, IsScalable)) {}

static constexpr TypeSize Fixed(uint64_t MinVal) { static TypeSize Fixed(ScalarTy MinVal) { return TypeSize(MinVal, false); }

return TypeSize(MinVal, false); static TypeSize Scalable(ScalarTy MinVal) { return TypeSize(MinVal, true); }

}

static constexpr TypeSize Scalable(uint64_t MinVal) { ScalarTy getFixedSize() const { return getFixedValue(); }

return TypeSize(MinVal, true); ScalarTy getKnownMinSize() const { return getKnownMinValue(); }

}

uint64_t getFixedSize() const { return getFixedValue(); } // The comparison operators are in the process of being phased out

uint64_t getKnownMinSize() const { return getKnownMinValue(); } // in favour of isKnownLT/isKnownLE of its parent class.

friend bool operator<(const TypeSize &LHS, const TypeSize &RHS) { friend bool operator<(const TypeSize &LHS, const TypeSize &RHS) {

assert(LHS.IsScalable == RHS.IsScalable && assert(LHS.isScalable() == RHS.isScalable() &&

"Ordering comparison of scalable and fixed types"); "Ordering comparison of scalable and fixed types");

return LHS.MinVal < RHS.MinVal; return LHS.getKnownMinValue() < RHS.getKnownMinValue();

} }

friend bool operator>(const TypeSize &LHS, const TypeSize &RHS) { friend bool operator>(const TypeSize &LHS, const TypeSize &RHS) {

return RHS < LHS; return RHS < LHS;

} }

friend bool operator<=(const TypeSize &LHS, const TypeSize &RHS) { friend bool operator<=(const TypeSize &LHS, const TypeSize &RHS) {

return !(RHS < LHS); return !(RHS < LHS);

} }

friend bool operator>=(const TypeSize &LHS, const TypeSize& RHS) { friend bool operator>=(const TypeSize &LHS, const TypeSize &RHS) {

return !(LHS < RHS); return !(LHS < RHS);

} }

TypeSize &operator-=(TypeSize RHS) { // All code for this class below this point is needed because of the

assert(IsScalable == RHS.IsScalable && // temporary implicit conversion to uint64_t. The operator overloads are

"Subtraction using mixed scalable and fixed types"); // needed because otherwise the conversion of the parent class LinearPolyBase

MinVal -= RHS.MinVal; // -> TypeSize is ambiguous.

return *this; // TODO: Remove the implicit conversion.

}

TypeSize &operator+=(TypeSize RHS) {

assert(IsScalable == RHS.IsScalable &&

"Addition using mixed scalable and fixed types");

MinVal += RHS.MinVal;

return *this;

}

friend TypeSize operator-(const TypeSize &LHS, const TypeSize &RHS) {

assert(LHS.IsScalable == RHS.IsScalable &&

"Arithmetic using mixed scalable and fixed types");

return {LHS.MinVal - RHS.MinVal, LHS.IsScalable};

}

// Casts to a uint64_t if this is a fixed-width size. // Casts to a uint64_t if this is a fixed-width size.

// //

// This interface is deprecated and will be removed in a future version // This interface is deprecated and will be removed in a future version

// of LLVM in favour of upgrading uses that rely on this implicit conversion // of LLVM in favour of upgrading uses that rely on this implicit conversion

// to uint64_t. Calls to functions that return a TypeSize should use the // to uint64_t. Calls to functions that return a TypeSize should use the

// proper interfaces to TypeSize. // proper interfaces to TypeSize.

// In practice this is mostly calls to MVT/EVT::getSizeInBits(). // In practice this is mostly calls to MVT/EVT::getSizeInBits().

// //

// To determine how to upgrade the code: // To determine how to upgrade the code:

// //

// if (<algorithm works for both scalable and fixed-width vectors>) // if (<algorithm works for both scalable and fixed-width vectors>)

// use getKnownMinValue() // use getKnownMinValue()

// else if (<algorithm works only for fixed-width vectors>) { // else if (<algorithm works only for fixed-width vectors>) {

// if <algorithm can be adapted for both scalable and fixed-width vectors> // if <algorithm can be adapted for both scalable and fixed-width vectors>

// update the algorithm and use getKnownMinValue() // update the algorithm and use getKnownMinValue()

// else // else

// bail out early for scalable vectors and use getFixedValue() // bail out early for scalable vectors and use getFixedValue()

// } // }

operator uint64_t() const { operator ScalarTy() const {

#ifdef STRICT_FIXED_SIZE_VECTORS #ifdef STRICT_FIXED_SIZE_VECTORS

return getFixedValue(); return getFixedValue();

#else #else

if (isScalable()) if (isScalable())

WithColor::warning() << "Compiler has made implicit assumption that " WithColor::warning() << "Compiler has made implicit assumption that "

"TypeSize is not scalable. This may or may not " "TypeSize is not scalable. This may or may not "

"lead to broken code.\n"; "lead to broken code.\n";

return getKnownMinValue(); return getKnownMinValue();

#endif #endif

} }

// Convenience operators to obtain relative sizes independently of // Additional operators needed to avoid ambiguous parses

// the scalable flag. // because of the implicit conversion hack.

TypeSize operator*(unsigned RHS) const { return {MinVal * RHS, IsScalable}; } friend TypeSize operator*(const TypeSize &LHS, const int RHS) {

return LHS * (ScalarTy)RHS;

friend TypeSize operator*(const unsigned LHS, const TypeSize &RHS) {

return {LHS * RHS.MinVal, RHS.IsScalable};

} }

friend TypeSize operator*(const TypeSize &LHS, const unsigned RHS) {

// Additional convenience operators needed to avoid ambiguous parses. return LHS * (ScalarTy)RHS;

// TODO: Make uint64_t the default operator? }

TypeSize operator*(uint64_t RHS) const { return {MinVal * RHS, IsScalable}; } friend TypeSize operator*(const TypeSize &LHS, const int64_t RHS) {

return LHS * (ScalarTy)RHS;

TypeSize operator*(int RHS) const { return {MinVal * RHS, IsScalable}; }

TypeSize operator*(int64_t RHS) const { return {MinVal * RHS, IsScalable}; }

friend TypeSize operator*(const uint64_t LHS, const TypeSize &RHS) {

return {LHS * RHS.MinVal, RHS.IsScalable};

} }

friend TypeSize operator*(const int LHS, const TypeSize &RHS) { friend TypeSize operator*(const int LHS, const TypeSize &RHS) {

return {LHS * RHS.MinVal, RHS.IsScalable}; return RHS * LHS;

}

friend TypeSize operator*(const unsigned LHS, const TypeSize &RHS) {

return RHS * LHS;

} }

friend TypeSize operator*(const int64_t LHS, const TypeSize &RHS) { friend TypeSize operator*(const int64_t LHS, const TypeSize &RHS) {

return {LHS * RHS.MinVal, RHS.IsScalable}; return RHS * LHS;

}

friend TypeSize operator*(const uint64_t LHS, const TypeSize &RHS) {

return RHS * LHS;

} }

}; };

//===----------------------------------------------------------------------===//

// Utilities

//===----------------------------------------------------------------------===//

/// Returns a TypeSize with a known minimum size that is the next integer /// Returns a TypeSize with a known minimum size that is the next integer

/// (mod 2**64) that is greater than or equal to \p Value and is a multiple /// (mod 2**64) that is greater than or equal to \p Value and is a multiple

/// of \p Align. \p Align must be non-zero. /// of \p Align. \p Align must be non-zero.

/// ///

/// Similar to the alignTo functions in MathExtras.h /// Similar to the alignTo functions in MathExtras.h

inline TypeSize alignTo(TypeSize Size, uint64_t Align) { inline TypeSize alignTo(TypeSize Size, uint64_t Align) {

assert(Align != 0u && "Align must be non-zero"); assert(Align != 0u && "Align must be non-zero");

return {(Size.getKnownMinValue() + Align - 1) / Align * Align, return {(Size.getKnownMinValue() + Align - 1) / Align * Align,

Size.isScalable()}; Size.isScalable()};

} }

/// Stream operator function for `LinearPolySize`.

template <typename LeafTy>

inline raw_ostream &operator<<(raw_ostream &OS,

const LinearPolySize<LeafTy> &PS) {

PS.print(OS);

return OS;

}

template <typename T> struct DenseMapInfo;

template <> struct DenseMapInfo<ElementCount> { template <> struct DenseMapInfo<ElementCount> {

static inline ElementCount getEmptyKey() { static inline ElementCount getEmptyKey() {

return ElementCount::getScalable(~0U); return ElementCount::getScalable(~0U);

} }

static inline ElementCount getTombstoneKey() { static inline ElementCount getTombstoneKey() {

return ElementCount::getFixed(~0U - 1); return ElementCount::getFixed(~0U - 1);

} }

static unsigned getHashValue(const ElementCount& EltCnt) { static unsigned getHashValue(const ElementCount &EltCnt) {

unsigned HashVal = EltCnt.getKnownMinValue() * 37U; unsigned HashVal = EltCnt.getKnownMinValue() * 37U;

if (EltCnt.isScalable()) if (EltCnt.isScalable())

return (HashVal - 1U); return (HashVal - 1U);

return HashVal; return HashVal;

} }

static bool isEqual(const ElementCount& LHS, const ElementCount& RHS) { static bool isEqual(const ElementCount &LHS, const ElementCount &RHS) {

return LHS == RHS; return LHS == RHS;

} }

}; };

} // end namespace llvm } // end namespace llvm

#endif // LLVM_SUPPORT_TypeSize_H #endif // LLVM_SUPPORT_TypeSize_H

llvm/unittests/Support/CMakeLists.txt

Show All 38 Lines	add_llvm_unittest(SupportTests
FormatVariadicTest.cpp		FormatVariadicTest.cpp
GlobPatternTest.cpp		GlobPatternTest.cpp
Host.cpp		Host.cpp
IndexedAccessorTest.cpp		IndexedAccessorTest.cpp
ItaniumManglingCanonicalizerTest.cpp		ItaniumManglingCanonicalizerTest.cpp
JSONTest.cpp		JSONTest.cpp
KnownBitsTest.cpp		KnownBitsTest.cpp
LEB128Test.cpp		LEB128Test.cpp
		LinearPolyBaseTest.cpp
LineIteratorTest.cpp		LineIteratorTest.cpp
LockFileManagerTest.cpp		LockFileManagerTest.cpp
MatchersTest.cpp		MatchersTest.cpp
MD5Test.cpp		MD5Test.cpp
ManagedStatic.cpp		ManagedStatic.cpp
MathExtrasTest.cpp		MathExtrasTest.cpp
MemoryBufferTest.cpp		MemoryBufferTest.cpp
MemoryTest.cpp		MemoryTest.cpp
▲ Show 20 Lines • Show All 67 Lines • Show Last 20 Lines

llvm/unittests/Support/LinearPolyBaseTest.cpp

This file was added.

				//===- TestPoly3D.cpp - Poly3D unit tests------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#include "llvm/Support/TypeSize.h"
				#include "gtest/gtest.h"

				using namespace llvm;

				class Poly3D;
				template <> struct llvm::LinearPolyBaseTypeTraits<Poly3D> {
				using ScalarTy = int64_t;
				};

				using Poly3DBase = LinearPolyBase<Poly3D, /Dims=/3, /IsUnivariate=/ false>;
				class Poly3D : public Poly3DBase {
				public:
				using ScalarTy = Poly3DBase::ScalarTy;
				Poly3D(ScalarTy x, ScalarTy y, ScalarTy z) : Poly3DBase({x, y, z}) {}
				Poly3D(const LinearPolyBase &Convert) : Poly3DBase(Convert) {}
				};

				TEST(LinearPolyBase, Poly3D_isZero) {
				EXPECT_TRUE(Poly3D(0, 0, 0).isZero());
				EXPECT_TRUE(Poly3D(0, 0, 1).isNonZero());
				EXPECT_TRUE(Poly3D(0, 0, 1));
				}

				TEST(LinearPolyBase, Poly3D_Equality) {
				EXPECT_EQ(Poly3D(1, 2, 3), Poly3D(1, 2, 3));
				EXPECT_NE(Poly3D(1, 2, 3), Poly3D(1, 2, 4));
				}

				TEST(LinearPolyBase, Poly3D_GetValue) {
				EXPECT_EQ(Poly3D(1, 2, 3).getValue(0), 1);
				EXPECT_EQ(Poly3D(1, 2, 3).getValue(1), 2);
				EXPECT_EQ(Poly3D(1, 2, 3).getValue(2), 3);
				}

				TEST(LinearPolyBase, Poly3D_Add) {
				// Test operator+
				EXPECT_EQ(Poly3D(42, 0, 0) + Poly3D(0, 42, 0) + Poly3D(0, 0, 42),
				Poly3D(42, 42, 42));

				// Test operator+=
				Poly3D X(42, 0, 0);
				X += Poly3D(0, 42, 0);
				X += Poly3D(0, 0, 42);
				EXPECT_EQ(X, Poly3D(42, 42, 42));
				}

				TEST(LinearPolyBase, Poly3D_Sub) {
				// Test operator-
				EXPECT_EQ(Poly3D(42, 42, 42) - Poly3D(42, 0, 0) - Poly3D(0, 42, 0) -
				Poly3D(0, 0, 42),
				Poly3D(0, 0, 0));

				// Test operator-=
				Poly3D X(42, 42, 42);
				X -= Poly3D(42, 0, 0);
				X -= Poly3D(0, 42, 0);
				X -= Poly3D(0, 0, 42);
				EXPECT_EQ(X, Poly3D(0, 0, 0));
				}

				TEST(LinearPolyBase, Poly3D_Scale) {
				// Test operator*
				EXPECT_EQ(Poly3D(1, 2, 4) * 2, Poly3D(2, 4, 8));
				EXPECT_EQ(Poly3D(1, 2, 4) * -2, Poly3D(-2, -4, -8));
				}

				TEST(LinearPolyBase, Poly3D_Invert) {
				// Test operator-
				EXPECT_EQ(-Poly3D(2, 4, 8), Poly3D(-2, -4, -8));
				}

				class Univariate3D;
				template <> struct llvm::LinearPolyBaseTypeTraits<Univariate3D> {
				using ScalarTy = int64_t;
				};

				using Univariate3DBase =
				LinearPolyBase<Univariate3D, /Dims=/3, /IsUnivariate=/true>;
				class Univariate3D : public Univariate3DBase {
				public:
				using ScalarTy = Univariate3DBase::ScalarTy;
				Univariate3D(ScalarTy x, unsigned Dim) : Univariate3DBase(x, Dim) {}
				Univariate3D(const Univariate3DBase &Convert) : Univariate3DBase(Convert) {}
				};

				TEST(LinearPolyBase, Univariate3D_isZero) {
				EXPECT_TRUE(Univariate3D(0, 0).isZero());
				EXPECT_TRUE(Univariate3D(0, 1).isZero());
				EXPECT_TRUE(Univariate3D(0, 2).isZero());
				EXPECT_TRUE(Univariate3D(1, 0).isNonZero());
				EXPECT_TRUE(Univariate3D(1, 1).isNonZero());
				EXPECT_TRUE(Univariate3D(1, 2).isNonZero());
				EXPECT_TRUE(Univariate3D(1, 0));
				}

				TEST(LinearPolyBase, Univariate3D_Equality) {
				EXPECT_EQ(Univariate3D(1, 0), Univariate3D(1, 0));
				EXPECT_NE(Univariate3D(1, 0), Univariate3D(1, 2));
				EXPECT_NE(Univariate3D(1, 0), Univariate3D(1, 1));
				EXPECT_NE(Univariate3D(1, 0), Univariate3D(2, 0));
				EXPECT_NE(Univariate3D(1, 0), Univariate3D(0, 0));
				}

				TEST(LinearPolyBase, Univariate3D_GetValue) {
				EXPECT_EQ(Univariate3D(42, 0).getValue(0), 42);
				EXPECT_EQ(Univariate3D(42, 0).getValue(1), 0);
				EXPECT_EQ(Univariate3D(42, 0).getValue(2), 0);

				EXPECT_EQ(Univariate3D(42, 1).getValue(0), 0);
				EXPECT_EQ(Univariate3D(42, 1).getValue(1), 42);
				EXPECT_EQ(Univariate3D(42, 1).getValue(2), 0);

				EXPECT_EQ(Univariate3D(42, 0).getExclusiveValue(), 42);
				EXPECT_EQ(Univariate3D(42, 1).getExclusiveValue(), 42);
				}

				TEST(LinearPolyBase, Univariate3D_Add) {
				// Test operator+
				EXPECT_EQ(Univariate3D(42, 0) + Univariate3D(42, 0), Univariate3D(84, 0));
				EXPECT_EQ(Univariate3D(42, 1) + Univariate3D(42, 1), Univariate3D(84, 1));
				EXPECT_DEBUG_DEATH(Univariate3D(42, 0) + Univariate3D(42, 1),
				"Invalid dimensions");

				// Test operator+=
				Univariate3D X(42, 0);
				X += Univariate3D(42, 0);
				EXPECT_EQ(X, Univariate3D(84, 0));
				}

				TEST(LinearPolyBase, Univariate3D_Sub) {
				// Test operator+
				EXPECT_EQ(Univariate3D(84, 0) - Univariate3D(42, 0), Univariate3D(42, 0));
				EXPECT_EQ(Univariate3D(84, 1) - Univariate3D(42, 1), Univariate3D(42, 1));
				EXPECT_DEBUG_DEATH(Univariate3D(84, 0) - Univariate3D(42, 1),
				"Invalid dimensions");

				// Test operator+=
				Univariate3D X(84, 0);
				X -= Univariate3D(42, 0);
				EXPECT_EQ(X, Univariate3D(42, 0));
				}

				TEST(LinearPolyBase, Univariate3D_Scale) {
				// Test operator*
				EXPECT_EQ(Univariate3D(4, 0) * 2, Univariate3D(8, 0));
				EXPECT_EQ(Univariate3D(4, 1) * -2, Univariate3D(-8, 1));
				}

				TEST(LinearPolyBase, Univariate3D_Invert) {
				// Test operator-
				EXPECT_EQ(-Univariate3D(4, 0), Univariate3D(-4, 0));
				EXPECT_EQ(-Univariate3D(4, 1), Univariate3D(-4, 1));
				}

This is an archive of the discontinued LLVM Phabricator instance.

[NFCI] Add StackOffset class and base classes for ElementCount, TypeSize.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 300031

llvm/include/llvm/Support/TypeSize.h

llvm/unittests/Support/CMakeLists.txt

llvm/unittests/Support/LinearPolyBaseTest.cpp

[NFCI] Add StackOffset class and base classes for ElementCount, TypeSize.
ClosedPublic