This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/lib/CodeGen/
-
lib/
-
CodeGen/
-
CGCall.cpp
-
CGStmt.cpp
-
CodeGenFunction.cpp
-
llvm/
-
include/llvm/
-
llvm/
-
ADT/
-
DenseMapInfo.h
-
IR/
11/19
DataLayout.h
-
DerivedTypes.h
-
InstrTypes.h
-
Type.h
-
Support/
6/6
MachineValueType.h
-
ScalableSize.h
3
TypeSize.h
-
lib/
-
Analysis/
-
InlineCost.cpp
-
CodeGen/
-
Analysis.cpp
-
IR/
1/1
DataLayout.cpp
5
Instructions.cpp
-
Type.cpp
-
Target/AArch64/
-
AArch64/
-
AArch64ISelLowering.cpp
-
Transforms/Scalar/
-
Scalar/
-
SROA.cpp
-
test/Other/
-
Other/
-
scalable-vectors-core-ir.ll
-
unittests/
-
CodeGen/
-
ScalableVectorMVTsTest.cpp
-
IR/
-
VectorTypesTest.cpp

Differential D53137

Scalable vector core instruction support + size queries
ClosedPublic

Authored by huntergr on Oct 11 2018, 6:59 AM.

Download Raw Diff

Details

Reviewers

hfinkel
lattner
rkruppe
greened
rovka
rengolin
sdesmalen

Commits

rL374042: [SVE][IR] Scalable Vector size queries and IR instruction support
rC374042: [SVE][IR] Scalable Vector size queries and IR instruction support
rGb302561b763a: [SVE][IR] Scalable Vector size queries and IR instruction support

Summary

Implements basic size queries to support scalable vectors in LLVM IR.

Adds simple tests for IR instructions usable with scalable vector types to ensure that they can be parsed and printed back out without dropping the scalable flag; used the size queries and getElementCount in a couple of places to get it working.

Adds a few scalable size query calls in codegen and tablegen to keep existing tests working, including the recent SVE calling convention test.

Adds checks in various backends which don't support scalable vectors to skip over them when registering legalization and DAG actions.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

huntergr created this revision.Oct 11 2018, 6:59 AM

Herald added subscribers: jsji, kbarton, nhaehnle and 3 others. · View Herald TranscriptOct 11 2018, 6:59 AM

huntergr added a child revision: D53138: Scalable type size queries (clang).Oct 11 2018, 7:02 AM

huntergr added a parent revision: D32530: [SVE][IR] Scalable Vector IR Type.

rogfer01 added a subscriber: rogfer01.Oct 11 2018, 7:08 AM

Unified ScalableSize representation
Changed to uint64_t + boolean, since we no longer allow scalable vectors in aggregates
Removed aggregate and mixed unit tests

Herald added a subscriber: javed.absar. · View Herald TranscriptNov 2 2018, 5:27 PM

This looks ok to me, but I'd rather more people look at it before approving.

Thanks!

rengolin added a reviewer: greened.Mar 7 2019, 11:34 AM

Herald added a subscriber: jdoerfert. · View Herald TranscriptMar 7 2019, 11:34 AM

I know this isn't ready for merge, but since the mailing list discussion has died down it seems like maybe we should move the discussion here. If so, it would be helpful to have comments on all the routines explaining what they do and how they differ from the existing routines, in order to aid discussion.

simoll added a subscriber: simoll.Mar 8 2019, 9:00 AM

hsaito added a subscriber: hsaito.Mar 8 2019, 2:28 PM

huntergr planned changes to this revision.Mar 22 2019, 3:06 AM

tstellar added a subscriber: tstellar.May 20 2019, 9:11 AM

vkmr added a subscriber: vkmr.Jun 14 2019, 5:10 AM

Updated based on where scalable size queries were required when running with an SVE-capable loop vectorizer on several codebases (all of LNT, various flavours of SpecCPU, lots of HPC and embedded benchmarks).

Herald added a project: Restricted Project. · View Herald TranscriptJul 16 2019, 6:59 AM

Herald added subscribers: • wuzish, jfb, hiraditya. · View Herald Transcript

Hi Graham,

I think this patch is difficult to review. It covers many different source files with only a small unit test to check the correctness. This isn't very robust against future changes and it makes it hard to know exactly what is and isn't supported.

I would find it much easier to review with an incremental strategy based on regression tests. For instance, with ToT opt, the attached testcase fails (error: '%r' defined with type '<4 x i1>' but expected '<vscale x 4 x i1>'). I would add a patch to fix that, and maybe other similar, really simple cases. We could then proceed to more complex examples, run some of the passes that come after the vectorizer on them, and progressively fix the places required to make them pass, with focused tests for each hurdle that we run into. It shouldn't be too hard to reduce such snippets from the tests you've already been running.

We would eventually end up with something like this patch, but I think that an incremental approach would have the advantage of quicker reviews with more focused discussions, more regression tests, and we would know how much was supported at any point. Does anyone else have a similar opinion?

simplest.ll415 BDownload

rengolin added a reviewer: rovka.Jul 23 2019, 3:24 AM

Hi Diana,

Thanks for the comments.

In D53137#1597022, @rovka wrote:

I think this patch is difficult to review. It covers many different source files with only a small unit test to check the correctness. This isn't very robust against future changes and it makes it hard to know exactly what is and isn't supported.

Yeah, I was worried about that -- this is basically the size queries alone without anything actually using scalable vectors. It demonstrates roughly where changes will be needed, but doesn't actually change the surrounding code to use e.g. getElementCount instead of getNumElements.

I would find it much easier to review with an incremental strategy based on regression tests. For instance, with ToT opt, the attached testcase fails (error: '%r' defined with type '<4 x i1>' but expected '<vscale x 4 x i1>'). I would add a patch to fix that, and maybe other similar, really simple cases. We could then proceed to more complex examples, run some of the passes that come after the vectorizer on them, and progressively fix the places required to make them pass, with focused tests for each hurdle that we run into. It shouldn't be too hard to reduce such snippets from the tests you've already been running.

An incremental approach sounds good; assuming nobody objects, I'll remove most of the code in this patch and just leave the core mechanism behind (in enforcing mode) and add in that test case. We can fill in the other cases as we enable codegen/acle/autovec in separate patches.

In D53137#1597022, @rovka wrote:

I would find it much easier to review with an incremental strategy based on regression tests. For instance, with ToT opt, the attached testcase fails (error: '%r' defined with type '<4 x i1>' but expected '<vscale x 4 x i1>'). I would add a patch to fix that, and maybe other similar, really simple cases. We could then proceed to more complex examples, run some of the passes that come after the vectorizer on them, and progressively fix the places required to make them pass, with focused tests for each hurdle that we run into. It shouldn't be too hard to reduce such snippets from the tests you've already been running.

+1 to incremental approach and more tests!

This change is mostly mechanical, but you're absolutely right we need to be aware of unwanted side effects. I wrongly assumed mechanical == NFC, but this clearly isn't. Thanks for the thorough review!

I think tests need to be strict on what it should support. Not necessarily test for *all* errors, but add test for the supported cases and some negative tests for the obvious unsupported stuff.

Each step of the way, revert negative tests when new features are added (and adding more tests, too!), we can make sure it's stable and robust.

In D53137#1597088, @huntergr wrote:

Hi Diana,

Thanks for the comments.

In D53137#1597022, @rovka wrote:

I think this patch is difficult to review. It covers many different source files with only a small unit test to check the correctness. This isn't very robust against future changes and it makes it hard to know exactly what is and isn't supported.

Yeah, I was worried about that -- this is basically the size queries alone without anything actually using scalable vectors. It demonstrates roughly where changes will be needed, but doesn't actually change the surrounding code to use e.g. getElementCount instead of getNumElements.

I would find it much easier to review with an incremental strategy based on regression tests. For instance, with ToT opt, the attached testcase fails (error: '%r' defined with type '<4 x i1>' but expected '<vscale x 4 x i1>'). I would add a patch to fix that, and maybe other similar, really simple cases. We could then proceed to more complex examples, run some of the passes that come after the vectorizer on them, and progressively fix the places required to make them pass, with focused tests for each hurdle that we run into. It shouldn't be too hard to reduce such snippets from the tests you've already been running.

An incremental approach sounds good; assuming nobody objects, I'll remove most of the code in this patch and just leave the core mechanism behind (in enforcing mode) and add in that test case. We can fill in the other cases as we enable codegen/acle/autovec in separate patches.

Actually my test case falls in the "replace getNumElements with getElementCount" category, rather than size queries per se. I don't think either of them is more important than the other, but you might run into the former while writing tests for the latter.

Removed most of the changes in favour of reintroducing them in separate patches later with appropriate tests.
Added tests for core IR instructions to make sure they don't drop the scalable flag.
Fixed up a couple places which broke the new tests.

greened added inline comments.Jul 31 2019, 1:05 PM

llvm/include/llvm/CodeGen/ValueTypes.h
299 ↗	(On Diff #212548)	Needs a comment about what this returns.
305 ↗	(On Diff #212548)	Needs a comment about what this returns.
321 ↗	(On Diff #212548)	Needs a comment about what this returns.
326 ↗	(On Diff #212548)	Needs a comment about what this returns.
336 ↗	(On Diff #212548)	Needs a comment about what this returns.
340 ↗	(On Diff #212548)	Needs a comment about what this returns.
llvm/include/llvm/IR/DataLayout.h
448	Needs comments about what these return.
460	Needs a comment about what this returns.
466	Needs a comment about what this returns.
471	Needs a comment about what this returns.
476	Needs a comment about what this returns.
492	Needs a comment about what this returns.
499	Needs a comment about what this returns.
504	Needs a comment about what this returns.

greened added inline comments.Jul 31 2019, 1:05 PM

llvm/include/llvm/IR/DataLayout.h
509	Needs a comment about what this returns.
llvm/include/llvm/Support/MachineValueType.h
674	Not sure why the other methods here don't have comments but we should probably have one here to say what this does.
722	Needs a comment about what this returns.
836	Needs a comment about what this returns.
841	Needs a comment about what this returns.
842	Needs a comment about what this returns.
847	Needs a comment about what this returns.
llvm/include/llvm/Support/ScalableSize.h
41	Needs comments about what this is and what the fields represent.
100	This needs an explanatory comment.
105	Needs a comment about what this returns.
llvm/lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp
4612 ↗	(On Diff #212548)	This comment should probably be moved right before the `if` below.
llvm/lib/IR/DataLayout.cpp
749–752	I might help to clarify that this comment only applies to scalable types, at least as far as I understand this changes here.
llvm/lib/Target/Hexagon/HexagonISelLowering.cpp
1438 ↗	(On Diff #212548)	What about other backends? Do they need similar checks (both here and below)?

Several more general comments:

Should all the getXSize assert when called on a scalable type? I see that for MVT::getSizeInBits and for Type::getPrimitiveSize, but not for the others. This should also be made clear in the comments for each of them.
Great test for the IR, thanks!
I don't see any test for the CodeGen stuff though. Is it possible to add one? (If not, maybe add the changes to EVT etc when we can actually test them).
Ditto for TableGen (or if that's too difficult/hairy to test, just update the commit message to explain exactly why the change belongs in this patch).

llvm/include/llvm/CodeGen/ValueTypes.h
214 ↗	(On Diff #212548)	Why does this one need to change? We're only looking at MinSize anyway, isn't getSizeInBits good enough?
llvm/include/llvm/IR/DataLayout.h
504	Where is this used?
llvm/include/llvm/Support/ScalableSize.h
41	Maybe make this a class, so people can't just get at the MinSize directly? Otherwise, it doesn't make much sense to have accessors for the fields (and especially getFixedSize, which can be completely circumvented via direct access).
48	Isn't this already deleted because we have a constructor just above?
51	I think it's more canonical to say return std::tie(MinSize, Scalable) == std::tie(RHS.MinSize, RHS.Scalable).
57	Should be implemented in terms of operator==.
64	Needs a comment saying that you can't compare scalable sizes and fixed sizes. You can simplify the code by replacing the check for Scalable with an assert and removing the unreachable at the end. Also, all the other comparison operators below should be implemented in terms of this.
92	Would be nice to also have a non-member one for symmetry (so you can write 2 * SomeSize, not just SomeSize * 2).
llvm/lib/CodeGen/ValueTypes.cpp
106 ↗	(On Diff #212548)	Should also assert that it's not called on a scalable type.

Thanks for the reviews; I left a couple of inline comments, and will make the requested changes.

llvm/include/llvm/CodeGen/ValueTypes.h
214 ↗	(On Diff #212548)	getSizeInBits will assert (actually, the underlying MVT version for it will) for a scalable vector type, so can't be used. However, I introduced the 'getMinSizeInBits' function after writing this, so it should be used here.
llvm/lib/Target/Hexagon/HexagonISelLowering.cpp
1438 ↗	(On Diff #212548)	I'll take a look, but the Hexagon backend was the only one which asserted when running `make check-all`.

Added comments explaining the new methods
Added tests for the MVT/EVT/DataLayout interfaces
Refactored ScalableSize class operators to build on '==' and '<'
Added checks in various backends that don't support scalable vectors to prevent legalizing operations involving scalable MVTs

Herald added a reviewer: rengolin. · View Herald TranscriptAug 6 2019, 5:54 AM

Herald added subscribers: atanasyan, MaskRay, jrtc27 and 7 others. · View Herald Transcript

huntergr marked 34 inline comments as done.Aug 6 2019, 5:59 AM

Thanks @huntergr for working on this!

This patch can probably be split into two separate patches, which make them easier to review;

One that fixes the Targets to ignore scalable types (see comment)
Another one that adds the interface for scalable size queries.

On the interfaces itself, I personally find ScalableSize a bit of a misnomer (see comment for other suggestions) because it describes both scalable and fixed-width sizes, which is not what the name suggests. But perhaps more importantly, my concern is that with the interface as defined in this patch, there is little incentive to move the LLVM codebase to work on ScalableSize. Especially for new code that gets added, the interface should let developers make a conscious decision whether their code is fixed-width only, scalable-width only, or agnostic to this property. I would much rather see getSizeInBits() return a ScalableSize object, which in turn only has getFixedSize() and getMinScalableSize() as query methods. If the code is agnostic to whether the size of an object is fixed or scalable, the code should simply operate on the ScalableSize object itself, rather than operating on MinSize.

I realise that the entire code-base currently expects 'unsigned' or 'int', but this can easily be fixed by adding a (hopefully temporary) overloaded cast operator to the struct that produces a scalar value, like:

/// Casts to a uint64_t if this is a fixed-width size.
///
/// NOTE: This interface is obsolescent and will be removed
/// in a future version of LLVM in favour of getFixedSize().
operator uint64_t() const {
  assert(isFixed() && "Expected fixed size data type");
  return getFixedSize();
}

We can then update the codebase piece by piece, incrementally making use of ScalableSize or its interfaces. When all that is done, the codebase should compile without errors when building for LLVM_TARGETS_TO_BUILD=AArch64 after we remove the overloaded operator.

llvm/include/llvm/Support/ScalableSize.h
83	All the comparison operators assert that the types are both fixed-width or both scalable. Is there value in also adding the following interfaces? // Returns true if A is known to be at least as big as B, e.g. // If A is scalable, and B is not, returns A.MinSize >= B.MinSize // If A is not scalable, and B is, returns false regardless of MinSize. bool knownGreaterOrEqual(const ScalableSize &B) const { ... } // or alternatively: "atLeastAsBigAs"? and // Returns true if A is known to always be larger than B, e.g. // If A is scalable, and B is not, returns A.MinSize > B.MinSize // If A is not scalable, and B is, returns false regardless of MinSize. bool knownGreater(const ScalableSize &B) const { ... }
120	I think this only really makes sense in the context of scalable vectors, and don't think we want to expose it as a property for both fixed/scalable vectors, e.g.: uint64_t getMinSize() const { assert (isScalable() && "MinSize only makes sense in the context of a scalable vector," " use getFixedSize() instead"); return MinSize; } For cases where we need to use it in a comparison (to know if it is at least a given number of bytes) we can do this in a separate comparison interface.
125	`bool ScalableSize::isScalable()` proves to me that the name `ScalableSize` is a misnomer. If you have a ScalableSize object, I'd expect it to always represent a scalable size. Can we rename this to something more generic like `ObjectSize` or `PrimitiveSize`? (suggestions welcome)
llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
9177 ↗	(On Diff #213589)	`OutputArg::PartOffset` is defined explicitly as a byte offset, so until we change that, this should be `getFixedSize()`.
llvm/lib/IR/Instructions.cpp
3053	MinSize suggests that it could be larger at runtime, so `SrcBits.getMinSize() == 0` would always be true. If you instead overload `operator bool()` which checks for Fixed size and Scalable size to be 0, you can rewrite this as `(!SrcBits \|\| !DestBits)` .
llvm/lib/Target/X86/X86ISelLowering.cpp
721 ↗	(On Diff #213589)	Adding `MVT::fixedwidth_[integer_\|fp_]vector_valuetypes()` seems like a more natural interface than skipping based on `isScalableVector()`. There already seem to be iterators for `MVT::(integer_\|fp_)_scalable_vector_valuetypes`. You can do that in a separate patch, so this patch can focus solely on scalable size queries.

huntergr added inline comments.Aug 9 2019, 3:15 AM

llvm/include/llvm/Support/ScalableSize.h
83	If you can think of an immediate use case, sure. Otherwise I'd leave it to another patch which requires that information.
120	This is used for the alignment checks at the moment, where we just need to know the minimum for scalable and exact for fixed. I'd rather not have code that looks like unsigned Align; if (VTy->isScalable()) Align = getMinSize(VTy); else Align = getFixedSize(VTy); in several places, when it could be done with one. We can certainly bikeshed the names, though, and come up with a more explicit one which acknowledges it can represent a known quantity from fixed or scalable vectors. I would also need to convert a larger part of the codebase to use scalable types in order to use this right now, and I've already had pushback on the size of the changes.
125	Yeah, I figured it needed a rename, but plain 'Size' was likely to cause problems. ObjectSize or TypeSize might work.
llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
9177 ↗	(On Diff #213589)	Then we would assert when running unit tests. I'm trying to juggle between an enormous patch that fixes everything 'properly', and something which gets us partway and keeps us running but needs fixes elsewhere later (which seemed to be what other reviewers would prefer). I could certainly add a comment here to explain that changes are needed, though.
llvm/lib/IR/Instructions.cpp
3053	No. `0 * vscale` is still 0 for any value of vscale. The ScalableSize class says the Scalable flag indicates the total size is an integer multiple of the known minimum size. I suspect a better way of doing this test would be to explicitly check for the elements of one being pointer type and not the other, instead of relying on a hack with the size.
llvm/lib/Target/X86/X86ISelLowering.cpp
721 ↗	(On Diff #213589)	Yeah, I thought of that after I'd submitted the patch. Will do.

In D53137#1621351, @sdesmalen wrote:

Thanks @huntergr for working on this!

This patch can probably be split into two separate patches, which make them easier to review;

One that fixes the Targets to ignore scalable types (see comment)

Another one that adds the interface for scalable size queries.

Ok, will do.

On the interfaces itself, I personally find ScalableSize a bit of a misnomer (see comment for other suggestions) because it describes both scalable and fixed-width sizes, which is not what the name suggests. But perhaps more importantly, my concern is that with the interface as defined in this patch, there is little incentive to move the LLVM codebase to work on ScalableSize. Especially for new code that gets added, the interface should let developers make a conscious decision whether their code is fixed-width only, scalable-width only, or agnostic to this property. I would much rather see getSizeInBits() return a ScalableSize object, which in turn only has getFixedSize() and getMinScalableSize() as query methods. If the code is agnostic to whether the size of an object is fixed or scalable, the code should simply operate on the ScalableSize object itself, rather than operating on MinSize.

See some of the inline comments -- there are a few places where we'd just end up duplicating code if used that way. The names can certainly be improved for clarity, though, and we could state that the (scalable|fixed)-only interfaces should be used in preference to a joint one.

I realise that the entire code-base currently expects 'unsigned' or 'int', but this can easily be fixed by adding a (hopefully temporary) overloaded cast operator to the struct that produces a scalar value, like:
/// Casts to a uint64_t if this is a fixed-width size.
///
/// NOTE: This interface is obsolescent and will be removed
/// in a future version of LLVM in favour of getFixedSize().
operator uint64_t() const {
  assert(isFixed() && "Expected fixed size data type");
  return getFixedSize();
}
We can then update the codebase piece by piece, incrementally making use of ScalableSize or its interfaces. When all that is done, the codebase should compile without errors when building for LLVM_TARGETS_TO_BUILD=AArch64 after we remove the overloaded operator.

I'm fine with that approach if others approve; I was trying to minimize the overall impact on the codebase though.

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
9177 ↗	(On Diff #213589)	A followup thought on this; would it be preferable to split the EVT/MVT side of this patch out into a separate one, and fix this properly there?

In D53137#1622713, @huntergr wrote:

See some of the inline comments -- there are a few places where we'd just end up duplicating code if used that way. The names can certainly be improved for clarity, though, and we could state that the (scalable|fixed)-only interfaces should be used in preference to a joint one.

Yes, adding some words to state that in the description of the method would be good. There are indeed cases where MinSize is needed directly (like in DAGCombiner.cpp where it uses MaximumLegalStoreInBits = MinSize), but such a case is very explicit (it needs to specifically query the known part of the size), which I think is not very common. We should strive to reduce those uses as much as possible. For Alignment for example, I think we can use a separate method (see comment).

I realise that the entire code-base currently expects 'unsigned' or 'int', but this can easily be fixed by adding a (hopefully temporary) overloaded cast operator to the struct that produces a scalar value, like:
/// Casts to a uint64_t if this is a fixed-width size.
///
/// NOTE: This interface is obsolescent and will be removed
/// in a future version of LLVM in favour of getFixedSize().
operator uint64_t() const {
  assert(isFixed() && "Expected fixed size data type");
  return getFixedSize();
}
We can then update the codebase piece by piece, incrementally making use of ScalableSize or its interfaces. When all that is done, the codebase should compile without errors when building for LLVM_TARGETS_TO_BUILD=AArch64 after we remove the overloaded operator.
I'm fine with that approach if others approve; I was trying to minimize the overall impact on the codebase though.

Great! Looking forward to hear how others feel about it.

llvm/include/llvm/Support/ScalableSize.h
83	One use-case is in DAGCombiner.cpp, where it currently has: VT.getMinSizeInBits() >= MaximumLegalStoreInBits which would become: VT.getSizeInBits().isKnownGreaterOrEqual(MaximumLegalStoreInBits) This removes another use-case for using `MinSize` directly.
120	For Alignment, because it is so common, I think it is worth adding a separate method: unsigned getNaturalAlignment() const { return (unsigned) getKnownSize(); };
llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
9177 ↗	(On Diff #213589)	Right, I see. At some point this class should probably use StackOffset to represent such offsets. For now, I'd say this line warrants a "FIXME" comment.
9177 ↗	(On Diff #213589)	A followup thought on this; would it be preferable to split the EVT/MVT side of this patch out into a separate one, and fix this properly there? I think so.
llvm/lib/IR/Instructions.cpp
3053	Perhaps I am a little pedantic with how I read this, because I expect the minimum size of every object to be 0, always :) So perhaps instead of using the name `MinSize`, a name like `KnownSize` would be better suited. (which would also nicely match with the `isKnownGreaterOrEqual()` suggestion for DAGCombiner.cpp).
llvm/lib/Target/X86/X86ISelLowering.cpp
721 ↗	(On Diff #213589)	Thanks!

Created D66339 to split out the code to skip scalable vector types in other backends.

huntergr mentioned this in D66339: [SVE] Fixed-length vector MVT ranges.Aug 16 2019, 4:10 AM

Split out backend code into separate patches
Renamed 'Min' to 'KnownMin' in method names.
Added a few more comments.

I tried to replace the comparison of minimum sizes to 0 in Instructions.cpp (for the bitcast checks), but that statement represents quite a lot. Could be scalar pointers, could be a vector of pointers (both of which I checked for in experiments), but there are several other types which report a size of 0 and which are checked for in some unit tests, so the number of checks ended up being fairly substantial. I've left it alone for now, but if reviewers would prefer I extract a method to explicitly check for all conditions represented by the size check I can do so.

huntergr mentioned this in D66871: [SVE] MVT scalable size queries.Aug 28 2019, 4:15 AM

huntergr added a child revision: D66871: [SVE] MVT scalable size queries.

Does anyone like Sander's suggestion to make ScalableSize (or whatever we end up naming it) the return value for all size queries and provide an overloaded cast operator to transparently work with existing code comparing against unsigned values? Or is it preferable to keep the current split?

In D53137#1650774, @huntergr wrote:

Does anyone like Sander's suggestion to make ScalableSize (or whatever we end up naming it) the return value for all size queries and provide an overloaded cast operator to transparently work with existing code comparing against unsigned values? Or is it preferable to keep the current split?

+1 from me, I definitely like Sander's suggestion. I thought we were heading towards ScalableSize-only functions anyway and that this was just a temporary path until we cleaned up the codebase from unsigned. Using a cast operator from the start seems like an even better way of getting there, since that way at least we hopefully won't get that many new uses of unsigned in the interim.

In D53137#1652189, @rovka wrote:

+1 from me, I definitely like Sander's suggestion. I thought we were heading towards ScalableSize-only functions anyway and that this was just a temporary path until we cleaned up the codebase from unsigned. Using a cast operator from the start seems like an even better way of getting there, since that way at least we hopefully won't get that many new uses of unsigned in the interim.

Changed existing interface to return ScalableSize objects and added a (mostly) transparent conversion, as per Sander's suggestion.
Removed new interfaces for DataLayout and Type.
Fixed cases where the transparent conversion doesn't quite work (e.g. std::max/min, where the types must be the same)

I suspect 'ScalableSize' is the wrong term now; 'TypeSize' may be better. Thoughts?

Herald added a project: Restricted Project. · View Herald TranscriptSep 23 2019, 4:22 AM

Herald added subscribers: cfe-commits, haicheng, eraman. · View Herald Transcript

I suspect 'ScalableSize' is the wrong term now; 'TypeSize' may be better. Thoughts?

I agree, TypeSize sounds better. Maybe we can replace the public constructor with 2 static methods, TypeSize::Fixed(Size) and TypeSize::Scalable(Size), so we don't always have to spell out /* Scalable =*/.

llvm/include/llvm/IR/DataLayout.h
457	We already overload operator /, why not overload + as well so we don't have to change the body of this method?
490	Can we add a version of alignTo that works with ScalableSize instead?
655	Maybe just return VTy->getElementCount() * getTypeSizeInBits(VTy->getElementType()).getFixedSize().

huntergr added a child revision: D47775: [AArch64][SVE] Add SPLAT_VECTOR ISD Node.Sep 30 2019, 4:22 AM

huntergr added inline comments.Oct 1 2019, 5:09 AM

llvm/include/llvm/IR/DataLayout.h
457	Scaling a size with * or / has a clear meaning to me, since it's independent of vscale; getting a vector that's half the size or four times larger just works. Using + (or -) on the other hand doesn't seem to be as clear; I wasn't sure if a standalone int should be automatically treated as being the same as the TypeSize, or always considered Fixed. If we try for the former I can imagine quite a few bugs arising. I could add a roundBitsToNearestByteSize method to move the arithmetic elsewhere if that would be acceptable?
655	There's no support for generating a TypeSize from an ElementCount in that way; is that an interface you feel is useful? (I'll certainly change the `getKnownMinSize` to `getFixedSize` though, since we're just referring to a scalar)

Renamed ScalableSize to TypeSize, including header name.
added alignTo function that takes and returns a TypeSize. I wasn't sure if this should be added to MathExtras.h where the other variants live, so just kept it in TypeSize.h for now

Herald added a subscriber: dexonsmith. · View Herald TranscriptOct 1 2019, 5:15 AM

This looks good to me, maybe wait a while to see if anyone else has any further comments.

llvm/include/llvm/IR/DataLayout.h
457	You're right, + on TypeSizes would be confusing. This looks ok as-is then, no need to fiddle with it more.
655	Actually, no, now that I think about it a bit more it might be clearer to spell it out this way.
llvm/include/llvm/Support/TypeSize.h
123	Microscopic nit: punctuation.
144	Ditto.
149	Ditto.

This revision is now accepted and ready to land.Oct 2 2019, 3:01 AM

Thanks @huntergr , I think this interface looks really nice. LGTM!

llvm/include/llvm/IR/DataLayout.h
456	nit: use TypeSize instead of auto.
622	Nice, I like the interface of `TypeSize::Fixed()` for fixed-size types.
llvm/lib/IR/Instructions.cpp
2986–2987	nit: auto -> TypeSize ?
3048–3049	nit: auto -> TypeSize ?

Closed by commit rGb302561b763a: [SVE][IR] Scalable Vector size queries and IR instruction support (authored by huntergr). · Explain WhyOct 8 2019, 5:56 AM

This revision was automatically updated to reflect the committed changes.

Hmm, forgot to add the last round of minor fixes before committing. Sorry about that, will push them as well.

huntergr mentioned this in rL375013: [SVE][IR] Small TypeSize improvements left out of initial commit.Oct 16 2019, 9:32 AM

huntergr mentioned this in rGf998fa2924c7: [SVE][IR] Small TypeSize improvements left out of initial commit.

huntergr mentioned this in rG3f08ad611aa2: [SVE][CodeGen] Scalable vector MVT size queries.Nov 18 2019, 4:35 AM

Revision Contents

Path

Size

clang/

lib/

CodeGen/

CGCall.cpp

8 lines

CGStmt.cpp

12 lines

CodeGenFunction.cpp

8 lines

llvm/

include/

llvm/

ADT/

DenseMapInfo.h

2 lines

IR/

55 lines

2 lines

2 lines

6 lines

Support/

MachineValueType.h

2 lines

ScalableSize.h

TypeSize.h

200 lines

lib/

Analysis/

InlineCost.cpp

6 lines

CodeGen/

Analysis.cpp

3 lines

IR/

DataLayout.cpp

6 lines

Instructions.cpp

17 lines

Type.cpp

27 lines

Target/

AArch64/

AArch64ISelLowering.cpp

2 lines

Transforms/

Scalar/

SROA.cpp

6 lines

test/

Other/

scalable-vectors-core-ir.ll

393 lines

unittests/

CodeGen/

ScalableVectorMVTsTest.cpp

2 lines

IR/

VectorTypesTest.cpp

116 lines

Diff 223835

clang/lib/CodeGen/CGCall.cpp

Show First 20 Lines • Show All 4,271 Lines • ▼ Show 20 Lines	for (unsigned i = 0; i < IRCallArgs.size(); ++i) {
if (i < IRFuncTy->getNumParams())		if (i < IRFuncTy->getNumParams())
assert(IRCallArgs[i]->getType() == IRFuncTy->getParamType(i));		assert(IRCallArgs[i]->getType() == IRFuncTy->getParamType(i));
}		}
#endif		#endif

// Update the largest vector width if any arguments have vector types.		// Update the largest vector width if any arguments have vector types.
for (unsigned i = 0; i < IRCallArgs.size(); ++i) {		for (unsigned i = 0; i < IRCallArgs.size(); ++i) {
if (auto *VT = dyn_cast<llvm::VectorType>(IRCallArgs[i]->getType()))		if (auto *VT = dyn_cast<llvm::VectorType>(IRCallArgs[i]->getType()))
LargestVectorWidth = std::max(LargestVectorWidth,		LargestVectorWidth = std::max((uint64_t)LargestVectorWidth,
VT->getPrimitiveSizeInBits());		VT->getPrimitiveSizeInBits().getFixedSize());
}		}

// Compute the calling convention and attributes.		// Compute the calling convention and attributes.
unsigned CallingConv;		unsigned CallingConv;
llvm::AttributeList Attrs;		llvm::AttributeList Attrs;
CGM.ConstructAttributeList(CalleePtr->getName(), CallInfo,		CGM.ConstructAttributeList(CalleePtr->getName(), CallInfo,
Callee.getAbstractInfo(), Attrs, CallingConv,		Callee.getAbstractInfo(), Attrs, CallingConv,
/AttrOnCallSite=/true);		/AttrOnCallSite=/true);
▲ Show 20 Lines • Show All 66 Lines • ▼ Show 20 Lines	#endif

// Apply various metadata.		// Apply various metadata.

if (!CI->getType()->isVoidTy())		if (!CI->getType()->isVoidTy())
CI->setName("call");		CI->setName("call");

// Update largest vector width from the return type.		// Update largest vector width from the return type.
if (auto *VT = dyn_cast<llvm::VectorType>(CI->getType()))		if (auto *VT = dyn_cast<llvm::VectorType>(CI->getType()))
LargestVectorWidth = std::max(LargestVectorWidth,		LargestVectorWidth = std::max((uint64_t)LargestVectorWidth,
VT->getPrimitiveSizeInBits());		VT->getPrimitiveSizeInBits().getFixedSize());

// Insert instrumentation or attach profile metadata at indirect call sites.		// Insert instrumentation or attach profile metadata at indirect call sites.
// For more details, see the comment before the definition of		// For more details, see the comment before the definition of
// IPVK_IndirectCallTarget in InstrProfData.inc.		// IPVK_IndirectCallTarget in InstrProfData.inc.
if (!CI->getCalledFunction())		if (!CI->getCalledFunction())
PGO.valueProfile(Builder, llvm::IPVK_IndirectCallTarget,		PGO.valueProfile(Builder, llvm::IPVK_IndirectCallTarget,
CI, CalleePtr);		CI, CalleePtr);

▲ Show 20 Lines • Show All 216 Lines • Show Last 20 Lines

clang/lib/CodeGen/CGStmt.cpp

Show First 20 Lines • Show All 2,067 Lines • ▼ Show 20 Lines	if (!Info.allowsMemory() && (hasScalarEvaluationKind(OutExpr->getType()) \|\|
else {		else {
CGM.getDiags().Report(S.getAsmLoc(),		CGM.getDiags().Report(S.getAsmLoc(),
diag::err_asm_invalid_type_in_input)		diag::err_asm_invalid_type_in_input)
<< OutExpr->getType() << OutputConstraint;		<< OutExpr->getType() << OutputConstraint;
}		}

// Update largest vector width for any vector types.		// Update largest vector width for any vector types.
if (auto *VT = dyn_cast<llvm::VectorType>(ResultRegTypes.back()))		if (auto *VT = dyn_cast<llvm::VectorType>(ResultRegTypes.back()))
LargestVectorWidth = std::max(LargestVectorWidth,		LargestVectorWidth = std::max((uint64_t)LargestVectorWidth,
VT->getPrimitiveSizeInBits());		VT->getPrimitiveSizeInBits().getFixedSize());
} else {		} else {
ArgTypes.push_back(Dest.getAddress().getType());		ArgTypes.push_back(Dest.getAddress().getType());
Args.push_back(Dest.getPointer());		Args.push_back(Dest.getPointer());
Constraints += "=*";		Constraints += "=*";
Constraints += OutputConstraint;		Constraints += OutputConstraint;
ReadOnly = ReadNone = false;		ReadOnly = ReadNone = false;
}		}

if (Info.isReadWrite()) {		if (Info.isReadWrite()) {
InOutConstraints += ',';		InOutConstraints += ',';

const Expr *InputExpr = S.getOutputExpr(i);		const Expr *InputExpr = S.getOutputExpr(i);
llvm::Value *Arg = EmitAsmInputLValue(Info, Dest, InputExpr->getType(),		llvm::Value *Arg = EmitAsmInputLValue(Info, Dest, InputExpr->getType(),
InOutConstraints,		InOutConstraints,
InputExpr->getExprLoc());		InputExpr->getExprLoc());

if (llvm::Type* AdjTy =		if (llvm::Type* AdjTy =
getTargetHooks().adjustInlineAsmType(*this, OutputConstraint,		getTargetHooks().adjustInlineAsmType(*this, OutputConstraint,
Arg->getType()))		Arg->getType()))
Arg = Builder.CreateBitCast(Arg, AdjTy);		Arg = Builder.CreateBitCast(Arg, AdjTy);

// Update largest vector width for any vector types.		// Update largest vector width for any vector types.
if (auto *VT = dyn_cast<llvm::VectorType>(Arg->getType()))		if (auto *VT = dyn_cast<llvm::VectorType>(Arg->getType()))
LargestVectorWidth = std::max(LargestVectorWidth,		LargestVectorWidth = std::max((uint64_t)LargestVectorWidth,
VT->getPrimitiveSizeInBits());		VT->getPrimitiveSizeInBits().getFixedSize());
if (Info.allowsRegister())		if (Info.allowsRegister())
InOutConstraints += llvm::utostr(i);		InOutConstraints += llvm::utostr(i);
else		else
InOutConstraints += OutputConstraint;		InOutConstraints += OutputConstraint;

InOutArgTypes.push_back(Arg->getType());		InOutArgTypes.push_back(Arg->getType());
InOutArgs.push_back(Arg);		InOutArgs.push_back(Arg);
}		}
▲ Show 20 Lines • Show All 69 Lines • ▼ Show 20 Lines	if (llvm::Type* AdjTy =
Arg->getType()))		Arg->getType()))
Arg = Builder.CreateBitCast(Arg, AdjTy);		Arg = Builder.CreateBitCast(Arg, AdjTy);
else		else
CGM.getDiags().Report(S.getAsmLoc(), diag::err_asm_invalid_type_in_input)		CGM.getDiags().Report(S.getAsmLoc(), diag::err_asm_invalid_type_in_input)
<< InputExpr->getType() << InputConstraint;		<< InputExpr->getType() << InputConstraint;

// Update largest vector width for any vector types.		// Update largest vector width for any vector types.
if (auto *VT = dyn_cast<llvm::VectorType>(Arg->getType()))		if (auto *VT = dyn_cast<llvm::VectorType>(Arg->getType()))
LargestVectorWidth = std::max(LargestVectorWidth,		LargestVectorWidth = std::max((uint64_t)LargestVectorWidth,
VT->getPrimitiveSizeInBits());		VT->getPrimitiveSizeInBits().getFixedSize());

ArgTypes.push_back(Arg->getType());		ArgTypes.push_back(Arg->getType());
Args.push_back(Arg);		Args.push_back(Arg);
Constraints += InputConstraint;		Constraints += InputConstraint;
}		}

// Append the "input" part of inout constraints last.		// Append the "input" part of inout constraints last.
for (unsigned i = 0, e = InOutArgs.size(); i != e; i++) {		for (unsigned i = 0, e = InOutArgs.size(); i != e; i++) {
▲ Show 20 Lines • Show All 247 Lines • Show Last 20 Lines

clang/lib/CodeGen/CodeGenFunction.cpp

Show First 20 Lines • Show All 425 Lines • ▼ Show 20 Lines	if (NormalCleanupDest.isValid() && isCoroutine()) {
llvm::PromoteMemToReg(		llvm::PromoteMemToReg(
cast<llvm::AllocaInst>(NormalCleanupDest.getPointer()), DT);		cast<llvm::AllocaInst>(NormalCleanupDest.getPointer()), DT);
NormalCleanupDest = Address::invalid();		NormalCleanupDest = Address::invalid();
}		}

// Scan function arguments for vector width.		// Scan function arguments for vector width.
for (llvm::Argument &A : CurFn->args())		for (llvm::Argument &A : CurFn->args())
if (auto *VT = dyn_cast<llvm::VectorType>(A.getType()))		if (auto *VT = dyn_cast<llvm::VectorType>(A.getType()))
LargestVectorWidth = std::max(LargestVectorWidth,		LargestVectorWidth = std::max((uint64_t)LargestVectorWidth,
VT->getPrimitiveSizeInBits());		VT->getPrimitiveSizeInBits().getFixedSize());

// Update vector width based on return type.		// Update vector width based on return type.
if (auto *VT = dyn_cast<llvm::VectorType>(CurFn->getReturnType()))		if (auto *VT = dyn_cast<llvm::VectorType>(CurFn->getReturnType()))
LargestVectorWidth = std::max(LargestVectorWidth,		LargestVectorWidth = std::max((uint64_t)LargestVectorWidth,
VT->getPrimitiveSizeInBits());		VT->getPrimitiveSizeInBits().getFixedSize());

// Add the required-vector-width attribute. This contains the max width from:		// Add the required-vector-width attribute. This contains the max width from:
// 1. min-vector-width attribute used in the source program.		// 1. min-vector-width attribute used in the source program.
// 2. Any builtins used that have a vector width specified.		// 2. Any builtins used that have a vector width specified.
// 3. Values passed in and out of inline assembly.		// 3. Values passed in and out of inline assembly.
// 4. Width of vector arguments and return types for this function.		// 4. Width of vector arguments and return types for this function.
// 5. Width of vector aguments and return types for functions called by this		// 5. Width of vector aguments and return types for functions called by this
// function.		// function.
▲ Show 20 Lines • Show All 1,959 Lines • Show Last 20 Lines

llvm/include/llvm/ADT/DenseMapInfo.h

	Show All 11 Lines

	#ifndef LLVM_ADT_DENSEMAPINFO_H			#ifndef LLVM_ADT_DENSEMAPINFO_H
	#define LLVM_ADT_DENSEMAPINFO_H			#define LLVM_ADT_DENSEMAPINFO_H

	#include "llvm/ADT/ArrayRef.h"			#include "llvm/ADT/ArrayRef.h"
	#include "llvm/ADT/Hashing.h"			#include "llvm/ADT/Hashing.h"
	#include "llvm/ADT/StringRef.h"			#include "llvm/ADT/StringRef.h"
	#include "llvm/Support/PointerLikeTypeTraits.h"			#include "llvm/Support/PointerLikeTypeTraits.h"
	#include "llvm/Support/ScalableSize.h"			#include "llvm/Support/TypeSize.h"
	#include <cassert>			#include <cassert>
	#include <cstddef>			#include <cstddef>
	#include <cstdint>			#include <cstdint>
	#include <utility>			#include <utility>

	namespace llvm {			namespace llvm {

	template<typename T>			template<typename T>
	▲ Show 20 Lines • Show All 261 Lines • Show Last 20 Lines

llvm/include/llvm/IR/DataLayout.h

Show All 24 Lines
#include "llvm/ADT/StringRef.h"		#include "llvm/ADT/StringRef.h"
#include "llvm/IR/DerivedTypes.h"		#include "llvm/IR/DerivedTypes.h"
#include "llvm/IR/Type.h"		#include "llvm/IR/Type.h"
#include "llvm/Pass.h"		#include "llvm/Pass.h"
#include "llvm/Support/Casting.h"		#include "llvm/Support/Casting.h"
#include "llvm/Support/ErrorHandling.h"		#include "llvm/Support/ErrorHandling.h"
#include "llvm/Support/MathExtras.h"		#include "llvm/Support/MathExtras.h"
#include "llvm/Support/Alignment.h"		#include "llvm/Support/Alignment.h"
		#include "llvm/Support/TypeSize.h"
#include <cassert>		#include <cassert>
#include <cstdint>		#include <cstdint>
#include <string>		#include <string>

// This needs to be outside of the namespace, to avoid conflict with llvm-c		// This needs to be outside of the namespace, to avoid conflict with llvm-c
// decl.		// decl.
using LLVMTargetDataRef = struct LLVMOpaqueTargetData *;		using LLVMTargetDataRef = struct LLVMOpaqueTargetData *;

▲ Show 20 Lines • Show All 391 Lines • ▼ Show 20 Lines	public:
/// Double 64 64 64		/// Double 64 64 64
/// X86_FP80 80 80 96		/// X86_FP80 80 80 96
///		///
/// [*] The alloc size depends on the alignment, and thus on the target.		/// [*] The alloc size depends on the alignment, and thus on the target.
/// These values are for x86-32 linux.		/// These values are for x86-32 linux.

/// Returns the number of bits necessary to hold the specified type.		/// Returns the number of bits necessary to hold the specified type.
///		///
		/// If Ty is a scalable vector type, the scalable property will be set and
		/// the runtime size will be a positive integer multiple of the base size.
		///
/// For example, returns 36 for i36 and 80 for x86_fp80. The type passed must		/// For example, returns 36 for i36 and 80 for x86_fp80. The type passed must
/// have a size (Type::isSized() must return true).		/// have a size (Type::isSized() must return true).
uint64_t getTypeSizeInBits(Type *Ty) const;		TypeSize getTypeSizeInBits(Type *Ty) const;

/// Returns the maximum number of bytes that may be overwritten by		/// Returns the maximum number of bytes that may be overwritten by
		greenedUnsubmitted Done Reply Inline Actions Needs comments about what these return. greened: Needs comments about what these return.
/// storing the specified type.		/// storing the specified type.
///		///
		/// If Ty is a scalable vector type, the scalable property will be set and
		/// the runtime size will be a positive integer multiple of the base size.
		///
/// For example, returns 5 for i36 and 10 for x86_fp80.		/// For example, returns 5 for i36 and 10 for x86_fp80.
uint64_t getTypeStoreSize(Type *Ty) const {		TypeSize getTypeStoreSize(Type *Ty) const {
return (getTypeSizeInBits(Ty) + 7) / 8;		auto BaseSize = getTypeSizeInBits(Ty);
		sdesmalenUnsubmitted Not Done Reply Inline Actions nit: use TypeSize instead of auto. sdesmalen: nit: use TypeSize instead of auto.
		return { (BaseSize.getKnownMinSize() + 7) / 8, BaseSize.isScalable() };
		rovkaUnsubmitted Not Done Reply Inline Actions We already overload operator /, why not overload + as well so we don't have to change the body of this method? rovka: We already overload operator /, why not overload + as well so we don't have to change the body…
		huntergrAuthorUnsubmitted Not Done Reply Inline Actions Scaling a size with * or / has a clear meaning to me, since it's independent of vscale; getting a vector that's half the size or four times larger just works. Using + (or -) on the other hand doesn't seem to be as clear; I wasn't sure if a standalone int should be automatically treated as being the same as the TypeSize, or always considered Fixed. If we try for the former I can imagine quite a few bugs arising. I could add a roundBitsToNearestByteSize method to move the arithmetic elsewhere if that would be acceptable? huntergr: Scaling a size with * or / has a clear meaning to me, since it's independent of vscale; getting…
		rovkaUnsubmitted Done Reply Inline Actions You're right, + on TypeSizes would be confusing. This looks ok as-is then, no need to fiddle with it more. rovka: You're right, + on TypeSizes would be confusing. This looks ok as-is then, no need to fiddle…
}		}

/// Returns the maximum number of bits that may be overwritten by		/// Returns the maximum number of bits that may be overwritten by
		greenedUnsubmitted Done Reply Inline Actions Needs a comment about what this returns. greened: Needs a comment about what this returns.
/// storing the specified type; always a multiple of 8.		/// storing the specified type; always a multiple of 8.
///		///
		/// If Ty is a scalable vector type, the scalable property will be set and
		/// the runtime size will be a positive integer multiple of the base size.
		///
/// For example, returns 40 for i36 and 80 for x86_fp80.		/// For example, returns 40 for i36 and 80 for x86_fp80.
		greenedUnsubmitted Done Reply Inline Actions Needs a comment about what this returns. greened: Needs a comment about what this returns.
uint64_t getTypeStoreSizeInBits(Type *Ty) const {		TypeSize getTypeStoreSizeInBits(Type *Ty) const {
return 8 * getTypeStoreSize(Ty);		return 8 * getTypeStoreSize(Ty);
}		}

/// Returns true if no extra padding bits are needed when storing the		/// Returns true if no extra padding bits are needed when storing the
		greenedUnsubmitted Done Reply Inline Actions Needs a comment about what this returns. greened: Needs a comment about what this returns.
/// specified type.		/// specified type.
///		///
/// For example, returns false for i19 that has a 24-bit store size.		/// For example, returns false for i19 that has a 24-bit store size.
bool typeSizeEqualsStoreSize(Type *Ty) const {		bool typeSizeEqualsStoreSize(Type *Ty) const {
return getTypeSizeInBits(Ty) == getTypeStoreSizeInBits(Ty);		return getTypeSizeInBits(Ty) == getTypeStoreSizeInBits(Ty);
		greenedUnsubmitted Done Reply Inline Actions Needs a comment about what this returns. greened: Needs a comment about what this returns.
}		}

/// Returns the offset in bytes between successive objects of the		/// Returns the offset in bytes between successive objects of the
/// specified type, including alignment padding.		/// specified type, including alignment padding.
///		///
		/// If Ty is a scalable vector type, the scalable property will be set and
		/// the runtime size will be a positive integer multiple of the base size.
		///
/// This is the amount that alloca reserves for this type. For example,		/// This is the amount that alloca reserves for this type. For example,
/// returns 12 or 16 for x86_fp80, depending on alignment.		/// returns 12 or 16 for x86_fp80, depending on alignment.
uint64_t getTypeAllocSize(Type *Ty) const {		TypeSize getTypeAllocSize(Type *Ty) const {
// Round up to the next alignment boundary.		// Round up to the next alignment boundary.
return alignTo(getTypeStoreSize(Ty), getABITypeAlignment(Ty));		return alignTo(getTypeStoreSize(Ty), getABITypeAlignment(Ty));
}		}
		rovkaUnsubmitted Not Done Reply Inline Actions Can we add a version of alignTo that works with ScalableSize instead? rovka: Can we add a version of alignTo that works with ScalableSize instead?

/// Returns the offset in bits between successive objects of the		/// Returns the offset in bits between successive objects of the
		greenedUnsubmitted Done Reply Inline Actions Needs a comment about what this returns. greened: Needs a comment about what this returns.
/// specified type, including alignment padding; always a multiple of 8.		/// specified type, including alignment padding; always a multiple of 8.
///		///
		/// If Ty is a scalable vector type, the scalable property will be set and
		/// the runtime size will be a positive integer multiple of the base size.
		///
/// This is the amount that alloca reserves for this type. For example,		/// This is the amount that alloca reserves for this type. For example,
/// returns 96 or 128 for x86_fp80, depending on alignment.		/// returns 96 or 128 for x86_fp80, depending on alignment.
		greenedUnsubmitted Done Reply Inline Actions Needs a comment about what this returns. greened: Needs a comment about what this returns.
uint64_t getTypeAllocSizeInBits(Type *Ty) const {		TypeSize getTypeAllocSizeInBits(Type *Ty) const {
return 8 * getTypeAllocSize(Ty);		return 8 * getTypeAllocSize(Ty);
}		}

/// Returns the minimum ABI-required alignment for the specified type.		/// Returns the minimum ABI-required alignment for the specified type.
		greenedUnsubmitted Done Reply Inline Actions Needs a comment about what this returns. greened: Needs a comment about what this returns.
		rovkaUnsubmitted Not Done Reply Inline Actions Where is this used? rovka: Where is this used?
unsigned getABITypeAlignment(Type *Ty) const;		unsigned getABITypeAlignment(Type *Ty) const;

/// Returns the minimum ABI-required alignment for an integer type of		/// Returns the minimum ABI-required alignment for an integer type of
/// the specified bitwidth.		/// the specified bitwidth.
Align getABIIntegerTypeAlignment(unsigned BitWidth) const;		Align getABIIntegerTypeAlignment(unsigned BitWidth) const;
		greenedUnsubmitted Done Reply Inline Actions Needs a comment about what this returns. greened: Needs a comment about what this returns.

/// Returns the preferred stack/global alignment for the specified		/// Returns the preferred stack/global alignment for the specified
/// type.		/// type.
///		///
/// This is always at least as good as the ABI alignment.		/// This is always at least as good as the ABI alignment.
unsigned getPrefTypeAlignment(Type *Ty) const;		unsigned getPrefTypeAlignment(Type *Ty) const;

/// Returns an integer type with size at least as big as that of a		/// Returns an integer type with size at least as big as that of a
▲ Show 20 Lines • Show All 92 Lines • ▼ Show 20 Lines
private:		private:
friend class DataLayout; // Only DataLayout can create this class		friend class DataLayout; // Only DataLayout can create this class

StructLayout(StructType *ST, const DataLayout &DL);		StructLayout(StructType *ST, const DataLayout &DL);
};		};

// The implementation of this method is provided inline as it is particularly		// The implementation of this method is provided inline as it is particularly
// well suited to constant folding when called on a specific Type subclass.		// well suited to constant folding when called on a specific Type subclass.
inline uint64_t DataLayout::getTypeSizeInBits(Type *Ty) const {		inline TypeSize DataLayout::getTypeSizeInBits(Type *Ty) const {
assert(Ty->isSized() && "Cannot getTypeInfo() on a type that is unsized!");		assert(Ty->isSized() && "Cannot getTypeInfo() on a type that is unsized!");
switch (Ty->getTypeID()) {		switch (Ty->getTypeID()) {
case Type::LabelTyID:		case Type::LabelTyID:
return getPointerSizeInBits(0);		return TypeSize::Fixed(getPointerSizeInBits(0));
		sdesmalenUnsubmitted Not Done Reply Inline Actions Nice, I like the interface of `TypeSize::Fixed()` for fixed-size types. sdesmalen: Nice, I like the interface of `TypeSize::Fixed()` for fixed-size types.
case Type::PointerTyID:		case Type::PointerTyID:
return getPointerSizeInBits(Ty->getPointerAddressSpace());		return TypeSize::Fixed(getPointerSizeInBits(Ty->getPointerAddressSpace()));
case Type::ArrayTyID: {		case Type::ArrayTyID: {
ArrayType *ATy = cast<ArrayType>(Ty);		ArrayType *ATy = cast<ArrayType>(Ty);
return ATy->getNumElements() *		return ATy->getNumElements() *
getTypeAllocSizeInBits(ATy->getElementType());		getTypeAllocSizeInBits(ATy->getElementType());
}		}
case Type::StructTyID:		case Type::StructTyID:
// Get the layout annotation... which is lazily created on demand.		// Get the layout annotation... which is lazily created on demand.
return getStructLayout(cast<StructType>(Ty))->getSizeInBits();		return TypeSize::Fixed(
		getStructLayout(cast<StructType>(Ty))->getSizeInBits());
case Type::IntegerTyID:		case Type::IntegerTyID:
return Ty->getIntegerBitWidth();		return TypeSize::Fixed(Ty->getIntegerBitWidth());
case Type::HalfTyID:		case Type::HalfTyID:
return 16;		return TypeSize::Fixed(16);
case Type::FloatTyID:		case Type::FloatTyID:
return 32;		return TypeSize::Fixed(32);
case Type::DoubleTyID:		case Type::DoubleTyID:
case Type::X86_MMXTyID:		case Type::X86_MMXTyID:
return 64;		return TypeSize::Fixed(64);
case Type::PPC_FP128TyID:		case Type::PPC_FP128TyID:
case Type::FP128TyID:		case Type::FP128TyID:
return 128;		return TypeSize::Fixed(128);
// In memory objects this is always aligned to a higher boundary, but		// In memory objects this is always aligned to a higher boundary, but
// only 80 bits contain information.		// only 80 bits contain information.
case Type::X86_FP80TyID:		case Type::X86_FP80TyID:
return 80;		return TypeSize::Fixed(80);
case Type::VectorTyID: {		case Type::VectorTyID: {
VectorType *VTy = cast<VectorType>(Ty);		VectorType *VTy = cast<VectorType>(Ty);
return VTy->getNumElements() * getTypeSizeInBits(VTy->getElementType());		auto EltCnt = VTy->getElementCount();
		uint64_t MinBits = EltCnt.Min *
		getTypeSizeInBits(VTy->getElementType()).getFixedSize();
		return TypeSize(MinBits, EltCnt.Scalable);
		rovkaUnsubmitted Not Done Reply Inline Actions Maybe just return VTy->getElementCount() * getTypeSizeInBits(VTy->getElementType()).getFixedSize(). rovka: Maybe just return VTy->getElementCount() * getTypeSizeInBits(VTy->getElementType()).
		huntergrAuthorUnsubmitted Not Done Reply Inline Actions There's no support for generating a TypeSize from an ElementCount in that way; is that an interface you feel is useful? (I'll certainly change the `getKnownMinSize` to `getFixedSize` though, since we're just referring to a scalar) huntergr: There's no support for generating a TypeSize from an ElementCount in that way; is that an…
		rovkaUnsubmitted Done Reply Inline Actions Actually, no, now that I think about it a bit more it might be clearer to spell it out this way. rovka: Actually, no, now that I think about it a bit more it might be clearer to spell it out this way.
}		}
default:		default:
llvm_unreachable("DataLayout::getTypeSizeInBits(): Unsupported type");		llvm_unreachable("DataLayout::getTypeSizeInBits(): Unsupported type");
}		}
}		}

} // end namespace llvm		} // end namespace llvm

#endif // LLVM_IR_DATALAYOUT_H		#endif // LLVM_IR_DATALAYOUT_H

llvm/include/llvm/IR/DerivedTypes.h

	Show All 17 Lines
	#define LLVM_IR_DERIVEDTYPES_H			#define LLVM_IR_DERIVEDTYPES_H

	#include "llvm/ADT/ArrayRef.h"			#include "llvm/ADT/ArrayRef.h"
	#include "llvm/ADT/STLExtras.h"			#include "llvm/ADT/STLExtras.h"
	#include "llvm/ADT/StringRef.h"			#include "llvm/ADT/StringRef.h"
	#include "llvm/IR/Type.h"			#include "llvm/IR/Type.h"
	#include "llvm/Support/Casting.h"			#include "llvm/Support/Casting.h"
	#include "llvm/Support/Compiler.h"			#include "llvm/Support/Compiler.h"
	#include "llvm/Support/ScalableSize.h"			#include "llvm/Support/TypeSize.h"
	#include <cassert>			#include <cassert>
	#include <cstdint>			#include <cstdint>

	namespace llvm {			namespace llvm {

	class Value;			class Value;
	class APInt;			class APInt;
	class LLVMContext;			class LLVMContext;
	▲ Show 20 Lines • Show All 593 Lines • Show Last 20 Lines

llvm/include/llvm/IR/InstrTypes.h

Show First 20 Lines • Show All 969 Lines • ▼ Show 20 Lines	public:
static bool classof(const Value *V) {		static bool classof(const Value *V) {
return isa<Instruction>(V) && classof(cast<Instruction>(V));		return isa<Instruction>(V) && classof(cast<Instruction>(V));
}		}

/// Create a result type for fcmp/icmp		/// Create a result type for fcmp/icmp
static Type* makeCmpResultType(Type* opnd_type) {		static Type* makeCmpResultType(Type* opnd_type) {
if (VectorType* vt = dyn_cast<VectorType>(opnd_type)) {		if (VectorType* vt = dyn_cast<VectorType>(opnd_type)) {
return VectorType::get(Type::getInt1Ty(opnd_type->getContext()),		return VectorType::get(Type::getInt1Ty(opnd_type->getContext()),
vt->getNumElements());		vt->getElementCount());
}		}
return Type::getInt1Ty(opnd_type->getContext());		return Type::getInt1Ty(opnd_type->getContext());
}		}

private:		private:
// Shadow Value::setValueSubclassData with a private forwarding method so that		// Shadow Value::setValueSubclassData with a private forwarding method so that
// subclasses cannot accidentally use it.		// subclasses cannot accidentally use it.
void setValueSubclassData(unsigned short D) {		void setValueSubclassData(unsigned short D) {
▲ Show 20 Lines • Show All 1,206 Lines • Show Last 20 Lines

llvm/include/llvm/IR/Type.h

Show All 15 Lines

#include "llvm/ADT/APFloat.h"		#include "llvm/ADT/APFloat.h"
#include "llvm/ADT/ArrayRef.h"		#include "llvm/ADT/ArrayRef.h"
#include "llvm/ADT/SmallPtrSet.h"		#include "llvm/ADT/SmallPtrSet.h"
#include "llvm/Support/CBindingWrapping.h"		#include "llvm/Support/CBindingWrapping.h"
#include "llvm/Support/Casting.h"		#include "llvm/Support/Casting.h"
#include "llvm/Support/Compiler.h"		#include "llvm/Support/Compiler.h"
#include "llvm/Support/ErrorHandling.h"		#include "llvm/Support/ErrorHandling.h"
		#include "llvm/Support/TypeSize.h"
#include <cassert>		#include <cassert>
#include <cstdint>		#include <cstdint>
#include <iterator>		#include <iterator>

namespace llvm {		namespace llvm {

template<class GraphType> struct GraphTraits;		template<class GraphType> struct GraphTraits;
class IntegerType;		class IntegerType;
▲ Show 20 Lines • Show All 244 Lines • ▼ Show 20 Lines	bool isSized(SmallPtrSetImpl<Type> Visited = nullptr) const {
return isSizedDerivedType(Visited);		return isSizedDerivedType(Visited);
}		}

/// Return the basic size of this type if it is a primitive type. These are		/// Return the basic size of this type if it is a primitive type. These are
/// fixed by LLVM and are not target-dependent.		/// fixed by LLVM and are not target-dependent.
/// This will return zero if the type does not have a size or is not a		/// This will return zero if the type does not have a size or is not a
/// primitive type.		/// primitive type.
///		///
		/// If this is a scalable vector type, the scalable property will be set and
		/// the runtime size will be a positive integer multiple of the base size.
		///
/// Note that this may not reflect the size of memory allocated for an		/// Note that this may not reflect the size of memory allocated for an
/// instance of the type or the number of bytes that are written when an		/// instance of the type or the number of bytes that are written when an
/// instance of the type is stored to memory. The DataLayout class provides		/// instance of the type is stored to memory. The DataLayout class provides
/// additional query functions to provide this information.		/// additional query functions to provide this information.
///		///
unsigned getPrimitiveSizeInBits() const LLVM_READONLY;		TypeSize getPrimitiveSizeInBits() const LLVM_READONLY;

/// If this is a vector type, return the getPrimitiveSizeInBits value for the		/// If this is a vector type, return the getPrimitiveSizeInBits value for the
/// element type. Otherwise return the getPrimitiveSizeInBits value for this		/// element type. Otherwise return the getPrimitiveSizeInBits value for this
/// type.		/// type.
unsigned getScalarSizeInBits() const LLVM_READONLY;		unsigned getScalarSizeInBits() const LLVM_READONLY;

/// Return the width of the mantissa of this type. This is only valid on		/// Return the width of the mantissa of this type. This is only valid on
/// floating-point types. If the FP type does not have a stable mantissa (e.g.		/// floating-point types. If the FP type does not have a stable mantissa (e.g.
▲ Show 20 Lines • Show All 192 Lines • Show Last 20 Lines

llvm/include/llvm/Support/MachineValueType.h

Show All 11 Lines
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef LLVM_SUPPORT_MACHINEVALUETYPE_H		#ifndef LLVM_SUPPORT_MACHINEVALUETYPE_H
#define LLVM_SUPPORT_MACHINEVALUETYPE_H		#define LLVM_SUPPORT_MACHINEVALUETYPE_H

#include "llvm/ADT/iterator_range.h"		#include "llvm/ADT/iterator_range.h"
#include "llvm/Support/ErrorHandling.h"		#include "llvm/Support/ErrorHandling.h"
#include "llvm/Support/MathExtras.h"		#include "llvm/Support/MathExtras.h"
#include "llvm/Support/ScalableSize.h"		#include "llvm/Support/TypeSize.h"
#include <cassert>		#include <cassert>

namespace llvm {		namespace llvm {

class Type;		class Type;

/// Machine Value Type. Every type that is supported natively by some		/// Machine Value Type. Every type that is supported natively by some
/// processor targeted by LLVM occurs here. This means that any legal value		/// processor targeted by LLVM occurs here. This means that any legal value
▲ Show 20 Lines • Show All 637 Lines • ▼ Show 20 Lines	unsigned getVectorNumElements() const {
case nxv1f64: return 1;		case nxv1f64: return 1;
}		}
}		}

ElementCount getVectorElementCount() const {		ElementCount getVectorElementCount() const {
return { getVectorNumElements(), isScalableVector() };		return { getVectorNumElements(), isScalableVector() };
}		}

unsigned getSizeInBits() const {		unsigned getSizeInBits() const {
		greenedUnsubmitted Done Reply Inline Actions Not sure why the other methods here don't have comments but we should probably have one here to say what this does. greened: Not sure why the other methods here don't have comments but we should probably have one here to…
switch (SimpleTy) {		switch (SimpleTy) {
default:		default:
llvm_unreachable("getSizeInBits called on extended MVT.");		llvm_unreachable("getSizeInBits called on extended MVT.");
case Other:		case Other:
llvm_unreachable("Value type is non-standard value, Other.");		llvm_unreachable("Value type is non-standard value, Other.");
case iPTR:		case iPTR:
llvm_unreachable("Value type size is target-dependent. Ask TLI.");		llvm_unreachable("Value type size is target-dependent. Ask TLI.");
case iPTRAny:		case iPTRAny:
Show All 31 Lines	unsigned getSizeInBits() const {
case i32 :		case i32 :
case v32i1:		case v32i1:
case v4i8:		case v4i8:
case v2i16:		case v2i16:
case v2f16:		case v2f16:
case v1f32:		case v1f32:
case v1i32:		case v1i32:
case nxv32i1:		case nxv32i1:
case nxv4i8:		case nxv4i8:
		greenedUnsubmitted Done Reply Inline Actions Needs a comment about what this returns. greened: Needs a comment about what this returns.
case nxv2i16:		case nxv2i16:
case nxv1i32:		case nxv1i32:
case nxv2f16:		case nxv2f16:
case nxv1f32: return 32;		case nxv1f32: return 32;
case v3i16:		case v3i16:
case v3f16: return 48;		case v3f16: return 48;
case x86mmx:		case x86mmx:
case f64 :		case f64 :
▲ Show 20 Lines • Show All 97 Lines • ▼ Show 20 Lines	public:
}		}

/// Return the number of bytes overwritten by a store of the specified value		/// Return the number of bytes overwritten by a store of the specified value
/// type.		/// type.
unsigned getStoreSize() const {		unsigned getStoreSize() const {
return (getSizeInBits() + 7) / 8;		return (getSizeInBits() + 7) / 8;
}		}

/// Return the number of bits overwritten by a store of the specified value		/// Return the number of bits overwritten by a store of the specified value
		greenedUnsubmitted Done Reply Inline Actions Needs a comment about what this returns. greened: Needs a comment about what this returns.
/// type.		/// type.
unsigned getStoreSizeInBits() const {		unsigned getStoreSizeInBits() const {
return getStoreSize() * 8;		return getStoreSize() * 8;
}		}

		greenedUnsubmitted Done Reply Inline Actions Needs a comment about what this returns. greened: Needs a comment about what this returns.
/// Return true if this has more bits than VT.		/// Return true if this has more bits than VT.
		greenedUnsubmitted Done Reply Inline Actions Needs a comment about what this returns. greened: Needs a comment about what this returns.
bool bitsGT(MVT VT) const {		bool bitsGT(MVT VT) const {
return getSizeInBits() > VT.getSizeInBits();		return getSizeInBits() > VT.getSizeInBits();
}		}

/// Return true if this has no less bits than VT.		/// Return true if this has no less bits than VT.
		greenedUnsubmitted Done Reply Inline Actions Needs a comment about what this returns. greened: Needs a comment about what this returns.
bool bitsGE(MVT VT) const {		bool bitsGE(MVT VT) const {
return getSizeInBits() >= VT.getSizeInBits();		return getSizeInBits() >= VT.getSizeInBits();
}		}

/// Return true if this has less bits than VT.		/// Return true if this has less bits than VT.
bool bitsLT(MVT VT) const {		bool bitsLT(MVT VT) const {
return getSizeInBits() < VT.getSizeInBits();		return getSizeInBits() < VT.getSizeInBits();
}		}
▲ Show 20 Lines • Show All 309 Lines • Show Last 20 Lines

llvm/include/llvm/Support/ScalableSize.h

This file was deleted.

	//===- ScalableSize.h - Scalable vector size info ---------------- C++ --===//
	//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//
	//===----------------------------------------------------------------------===//
	//
	// This file provides a struct that can be used to query the size of IR types
	// which may be scalable vectors. It provides convenience operators so that
	// it can be used in much the same way as a single scalar value.
	//
	//===----------------------------------------------------------------------===//

	#ifndef LLVM_SUPPORT_SCALABLESIZE_H
	#define LLVM_SUPPORT_SCALABLESIZE_H

	namespace llvm {

	class ElementCount {
	public:
	unsigned Min; // Minimum number of vector elements.
	bool Scalable; // If true, NumElements is a multiple of 'Min' determined
	// at runtime rather than compile time.

	ElementCount(unsigned Min, bool Scalable)
	: Min(Min), Scalable(Scalable) {}

	ElementCount operator*(unsigned RHS) {
	return { Min * RHS, Scalable };
	}
	ElementCount operator/(unsigned RHS) {
	return { Min / RHS, Scalable };
	}

	bool operator==(const ElementCount& RHS) const {
	return Min == RHS.Min && Scalable == RHS.Scalable;
	}
	bool operator!=(const ElementCount& RHS) const {
	return !(*this == RHS);
	}
	};

	} // end namespace llvm

	#endif // LLVM_SUPPORT_SCALABLESIZE_H

llvm/include/llvm/Support/TypeSize.h

This file was added.

				//===- TypeSize.h - Wrapper around type sizes -------------------- C++ --===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// This file provides a struct that can be used to query the size of IR types
				// which may be scalable vectors. It provides convenience operators so that
				// it can be used in much the same way as a single scalar value.
				//
				//===----------------------------------------------------------------------===//

				#ifndef LLVM_SUPPORT_TYPESIZE_H
				#define LLVM_SUPPORT_TYPESIZE_H

				#include <tuple>

				namespace llvm {

				class ElementCount {
				public:
				unsigned Min; // Minimum number of vector elements.
				bool Scalable; // If true, NumElements is a multiple of 'Min' determined
				// at runtime rather than compile time.

				ElementCount(unsigned Min, bool Scalable)
				: Min(Min), Scalable(Scalable) {}

				ElementCount operator*(unsigned RHS) {
				return { Min * RHS, Scalable };
				}
				ElementCount operator/(unsigned RHS) {
				return { Min / RHS, Scalable };
				}

				bool operator==(const ElementCount& RHS) const {
				return Min == RHS.Min && Scalable == RHS.Scalable;
				}
				bool operator!=(const ElementCount& RHS) const {
				return !(*this == RHS);
				}
				};

				// This class is used to represent the size of types. If the type is of fixed
				// size, it will represent the exact size. If the type is a scalable vector,
				// it will represent the known minimum size.
				class TypeSize {
				uint64_t MinSize; // The known minimum size.
				bool IsScalable; // If true, then the runtime size is an integer multiple
				// of MinSize.

				public:
				constexpr TypeSize(uint64_t MinSize, bool Scalable)
				: MinSize(MinSize), IsScalable(Scalable) {}

				static constexpr TypeSize Fixed(uint64_t Size) {
				return TypeSize(Size, /IsScalable=/false);
				}

				static constexpr TypeSize Scalable(uint64_t MinSize) {
				return TypeSize(MinSize, /IsScalable=/true);
				}

				// Scalable vector types with the same minimum size as a fixed size type are
				// not guaranteed to be the same size at runtime, so they are never
				// considered to be equal.
				friend bool operator==(const TypeSize &LHS, const TypeSize &RHS) {
				return std::tie(LHS.MinSize, LHS.IsScalable) ==
				std::tie(RHS.MinSize, RHS.IsScalable);
				}

				friend bool operator!=(const TypeSize &LHS, const TypeSize &RHS) {
				return !(LHS == RHS);
				}

				// For many cases, size ordering between scalable and fixed size types cannot
				// be determined at compile time, so such comparisons aren't allowed.
				//
				// e.g. <vscale x 2 x i16> could be bigger than <4 x i32> with a runtime
				// vscale >= 5, equal sized with a vscale of 4, and smaller with
				// a vscale <= 3.
				//
				// If the scalable flags match, just perform the requested comparison
				// between the minimum sizes.
				friend bool operator<(const TypeSize &LHS, const TypeSize &RHS) {
				assert(LHS.IsScalable == RHS.IsScalable &&
				"Ordering comparison of scalable and fixed types");

				return LHS.MinSize < RHS.MinSize;
				}

				friend bool operator>(const TypeSize &LHS, const TypeSize &RHS) {
				return RHS < LHS;
				}

				friend bool operator<=(const TypeSize &LHS, const TypeSize &RHS) {
				return !(RHS < LHS);
				}

				friend bool operator>=(const TypeSize &LHS, const TypeSize& RHS) {
				return !(LHS < RHS);
				}

				// Convenience operators to obtain relative sizes independently of
				// the scalable flag.
				TypeSize operator*(unsigned RHS) const {
				return { MinSize * RHS, IsScalable };
				}

				friend TypeSize operator*(const unsigned LHS, const TypeSize &RHS) {
				return { LHS * RHS.MinSize, RHS.IsScalable };
				}

				TypeSize operator/(unsigned RHS) const {
				return { MinSize / RHS, IsScalable };
				}

				// Return the minimum size with the assumption that the size is exact.
				// Use in places where a scalable size doesn't make sense (e.g. non-vector
				// types, or vectors in backends which don't support scalable vectors)
				uint64_t getFixedSize() const {
				rovkaUnsubmitted Not Done Reply Inline Actions Microscopic nit: punctuation. rovka: Microscopic nit: punctuation.
				assert(!IsScalable && "Request for a fixed size on a scalable object");
				return MinSize;
				}

				// Return the known minimum size. Use in places where the scalable property
				// doesn't matter (e.g. determining alignment) or in conjunction with the
				// isScalable method below.
				uint64_t getKnownMinSize() const {
				return MinSize;
				}

				// Return whether or not the size is scalable.
				bool isScalable() const {
				return IsScalable;
				}

				// Casts to a uint64_t if this is a fixed-width size.
				//
				// NOTE: This interface is obsolete and will be removed in a future version
				// of LLVM in favour of calling getFixedSize() directly
				operator uint64_t() const {
				rovkaUnsubmitted Not Done Reply Inline Actions Ditto. rovka: Ditto.
				return getFixedSize();
				}

				// Additional convenience operators needed to avoid ambiguous parses
				// TODO: Make uint64_t the default operator?
				rovkaUnsubmitted Not Done Reply Inline Actions Ditto. rovka: Ditto.
				TypeSize operator*(uint64_t RHS) const {
				return { MinSize * RHS, IsScalable };
				}

				TypeSize operator*(int RHS) const {
				return { MinSize * RHS, IsScalable };
				}

				TypeSize operator*(int64_t RHS) const {
				return { MinSize * RHS, IsScalable };
				}

				friend TypeSize operator*(const uint64_t LHS, const TypeSize &RHS) {
				return { LHS * RHS.MinSize, RHS.IsScalable };
				}

				friend TypeSize operator*(const int LHS, const TypeSize &RHS) {
				return { LHS * RHS.MinSize, RHS.IsScalable };
				}

				friend TypeSize operator*(const int64_t LHS, const TypeSize &RHS) {
				return { LHS * RHS.MinSize, RHS.IsScalable };
				}

				TypeSize operator/(uint64_t RHS) const {
				return { MinSize / RHS, IsScalable };
				}

				TypeSize operator/(int RHS) const {
				return { MinSize / RHS, IsScalable };
				}

				TypeSize operator/(int64_t RHS) const {
				return { MinSize / RHS, IsScalable };
				}
				};

				/// Returns a TypeSize with a known minimum size that is the next integer
				/// (mod 2**64) that is greater than or equal to \p Value and is a multiple
				/// of \p Align. \p Align must be non-zero.
				///
				/// Similar to the alignTo functions in MathExtras.h
				inline TypeSize alignTo(TypeSize Size, uint64_t Align) {
				assert(Align != 0u && "Align must be non-zero");
				return {(Size.getKnownMinSize() + Align - 1) / Align * Align,
				Size.isScalable()};
				}

				} // end namespace llvm

				#endif // LLVM_SUPPORT_TypeSize_H

llvm/lib/Analysis/InlineCost.cpp

	Show First 20 Lines • Show All 430 Lines • ▼ Show 20 Lines
	bool CallAnalyzer::visitAlloca(AllocaInst &I) {			bool CallAnalyzer::visitAlloca(AllocaInst &I) {
	// Check whether inlining will turn a dynamic alloca into a static			// Check whether inlining will turn a dynamic alloca into a static
	// alloca and handle that case.			// alloca and handle that case.
	if (I.isArrayAllocation()) {			if (I.isArrayAllocation()) {
	Constant *Size = SimplifiedValues.lookup(I.getArraySize());			Constant *Size = SimplifiedValues.lookup(I.getArraySize());
	if (auto *AllocSize = dyn_cast_or_null<ConstantInt>(Size)) {			if (auto *AllocSize = dyn_cast_or_null<ConstantInt>(Size)) {
	Type *Ty = I.getAllocatedType();			Type *Ty = I.getAllocatedType();
	AllocatedSize = SaturatingMultiplyAdd(			AllocatedSize = SaturatingMultiplyAdd(
	AllocSize->getLimitedValue(), DL.getTypeAllocSize(Ty), AllocatedSize);			AllocSize->getLimitedValue(), DL.getTypeAllocSize(Ty).getFixedSize(),
				AllocatedSize);
	return Base::visitAlloca(I);			return Base::visitAlloca(I);
	}			}
	}			}

	// Accumulate the allocated size.			// Accumulate the allocated size.
	if (I.isStaticAlloca()) {			if (I.isStaticAlloca()) {
	Type *Ty = I.getAllocatedType();			Type *Ty = I.getAllocatedType();
	AllocatedSize = SaturatingAdd(DL.getTypeAllocSize(Ty), AllocatedSize);			AllocatedSize = SaturatingAdd(DL.getTypeAllocSize(Ty).getFixedSize(),
				AllocatedSize);
	}			}

	// We will happily inline static alloca instructions.			// We will happily inline static alloca instructions.
	if (I.isStaticAlloca())			if (I.isStaticAlloca())
	return Base::visitAlloca(I);			return Base::visitAlloca(I);

	// FIXME: This is overly conservative. Dynamic allocas are inefficient for			// FIXME: This is overly conservative. Dynamic allocas are inefficient for
	// a variety of reasons, and so we would like to not inline them into			// a variety of reasons, and so we would like to not inline them into
	▲ Show 20 Lines • Show All 1,766 Lines • Show Last 20 Lines

llvm/lib/CodeGen/Analysis.cpp

Show First 20 Lines • Show All 303 Lines • ▼ Show 20 Lines	if (isa<BitCastInst>(I)) {
// Make sure this isn't a truncating or extending cast. We could		// Make sure this isn't a truncating or extending cast. We could
// support this eventually, but don't bother for now.		// support this eventually, but don't bother for now.
if (!isa<VectorType>(I->getType()) &&		if (!isa<VectorType>(I->getType()) &&
DL.getPointerSizeInBits() ==		DL.getPointerSizeInBits() ==
cast<IntegerType>(I->getType())->getBitWidth())		cast<IntegerType>(I->getType())->getBitWidth())
NoopInput = Op;		NoopInput = Op;
} else if (isa<TruncInst>(I) &&		} else if (isa<TruncInst>(I) &&
TLI.allowTruncateForTailCall(Op->getType(), I->getType())) {		TLI.allowTruncateForTailCall(Op->getType(), I->getType())) {
DataBits = std::min(DataBits, I->getType()->getPrimitiveSizeInBits());		DataBits = std::min((uint64_t)DataBits,
		I->getType()->getPrimitiveSizeInBits().getFixedSize());
NoopInput = Op;		NoopInput = Op;
} else if (auto CS = ImmutableCallSite(I)) {		} else if (auto CS = ImmutableCallSite(I)) {
const Value *ReturnedOp = CS.getReturnedArgOperand();		const Value *ReturnedOp = CS.getReturnedArgOperand();
if (ReturnedOp && isNoopBitcast(ReturnedOp->getType(), I->getType(), TLI))		if (ReturnedOp && isNoopBitcast(ReturnedOp->getType(), I->getType(), TLI))
NoopInput = ReturnedOp;		NoopInput = ReturnedOp;
} else if (const InsertValueInst *IVI = dyn_cast<InsertValueInst>(V)) {		} else if (const InsertValueInst *IVI = dyn_cast<InsertValueInst>(V)) {
// Value may come from either the aggregate or the scalar		// Value may come from either the aggregate or the scalar
ArrayRef<unsigned> InsertLoc = IVI->getIndices();		ArrayRef<unsigned> InsertLoc = IVI->getIndices();
▲ Show 20 Lines • Show All 467 Lines • Show Last 20 Lines

llvm/lib/IR/DataLayout.cpp

Show All 23 Lines
#include "llvm/IR/GetElementPtrTypeIterator.h"		#include "llvm/IR/GetElementPtrTypeIterator.h"
#include "llvm/IR/GlobalVariable.h"		#include "llvm/IR/GlobalVariable.h"
#include "llvm/IR/Module.h"		#include "llvm/IR/Module.h"
#include "llvm/IR/Type.h"		#include "llvm/IR/Type.h"
#include "llvm/IR/Value.h"		#include "llvm/IR/Value.h"
#include "llvm/Support/Casting.h"		#include "llvm/Support/Casting.h"
#include "llvm/Support/ErrorHandling.h"		#include "llvm/Support/ErrorHandling.h"
#include "llvm/Support/MathExtras.h"		#include "llvm/Support/MathExtras.h"
		#include "llvm/Support/TypeSize.h"
#include <algorithm>		#include <algorithm>
#include <cassert>		#include <cassert>
#include <cstdint>		#include <cstdint>
#include <cstdlib>		#include <cstdlib>
#include <tuple>		#include <tuple>
#include <utility>		#include <utility>

using namespace llvm;		using namespace llvm;
▲ Show 20 Lines • Show All 700 Lines • ▼ Show 20 Lines	Align DataLayout::getAlignment(Type *Ty, bool abi_or_pref) const {
case Type::X86_MMXTyID:		case Type::X86_MMXTyID:
case Type::VectorTyID:		case Type::VectorTyID:
AlignType = VECTOR_ALIGN;		AlignType = VECTOR_ALIGN;
break;		break;
default:		default:
llvm_unreachable("Bad type for getAlignment!!!");		llvm_unreachable("Bad type for getAlignment!!!");
}		}

return getAlignmentInfo(AlignType, getTypeSizeInBits(Ty), abi_or_pref, Ty);		// If we're dealing with a scalable vector, we just need the known minimum
		// size for determining alignment. If not, we'll get the exact size.
		return getAlignmentInfo(AlignType, getTypeSizeInBits(Ty).getKnownMinSize(),
		abi_or_pref, Ty);
		greenedUnsubmitted Done Reply Inline Actions I might help to clarify that this comment only applies to scalable types, at least as far as I understand this changes here. greened: I might help to clarify that this comment only applies to scalable types, at least as far as I…
}		}

unsigned DataLayout::getABITypeAlignment(Type *Ty) const {		unsigned DataLayout::getABITypeAlignment(Type *Ty) const {
return getAlignment(Ty, true).value();		return getAlignment(Ty, true).value();
}		}

/// getABIIntegerTypeAlignment - Return the minimum ABI-required alignment for		/// getABIIntegerTypeAlignment - Return the minimum ABI-required alignment for
/// an integer type of the specified bitwidth.		/// an integer type of the specified bitwidth.
▲ Show 20 Lines • Show All 117 Lines • Show Last 20 Lines

llvm/lib/IR/Instructions.cpp

Show All 32 Lines
#include "llvm/IR/Module.h"		#include "llvm/IR/Module.h"
#include "llvm/IR/Operator.h"		#include "llvm/IR/Operator.h"
#include "llvm/IR/Type.h"		#include "llvm/IR/Type.h"
#include "llvm/IR/Value.h"		#include "llvm/IR/Value.h"
#include "llvm/Support/AtomicOrdering.h"		#include "llvm/Support/AtomicOrdering.h"
#include "llvm/Support/Casting.h"		#include "llvm/Support/Casting.h"
#include "llvm/Support/ErrorHandling.h"		#include "llvm/Support/ErrorHandling.h"
#include "llvm/Support/MathExtras.h"		#include "llvm/Support/MathExtras.h"
		#include "llvm/Support/TypeSize.h"
#include <algorithm>		#include <algorithm>
#include <cassert>		#include <cassert>
#include <cstdint>		#include <cstdint>
#include <vector>		#include <vector>

using namespace llvm;		using namespace llvm;

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
▲ Show 20 Lines • Show All 1,738 Lines • ▼ Show 20 Lines
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// ShuffleVectorInst Implementation		// ShuffleVectorInst Implementation
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

ShuffleVectorInst::ShuffleVectorInst(Value V1, Value V2, Value *Mask,		ShuffleVectorInst::ShuffleVectorInst(Value V1, Value V2, Value *Mask,
const Twine &Name,		const Twine &Name,
Instruction *InsertBefore)		Instruction *InsertBefore)
: Instruction(VectorType::get(cast<VectorType>(V1->getType())->getElementType(),		: Instruction(VectorType::get(cast<VectorType>(V1->getType())->getElementType(),
cast<VectorType>(Mask->getType())->getNumElements()),		cast<VectorType>(Mask->getType())->getElementCount()),
ShuffleVector,		ShuffleVector,
OperandTraits<ShuffleVectorInst>::op_begin(this),		OperandTraits<ShuffleVectorInst>::op_begin(this),
OperandTraits<ShuffleVectorInst>::operands(this),		OperandTraits<ShuffleVectorInst>::operands(this),
InsertBefore) {		InsertBefore) {
assert(isValidOperands(V1, V2, Mask) &&		assert(isValidOperands(V1, V2, Mask) &&
"Invalid shuffle vector instruction operands!");		"Invalid shuffle vector instruction operands!");
Op<0>() = V1;		Op<0>() = V1;
Op<1>() = V2;		Op<1>() = V2;
Op<2>() = Mask;		Op<2>() = Mask;
setName(Name);		setName(Name);
}		}

ShuffleVectorInst::ShuffleVectorInst(Value V1, Value V2, Value *Mask,		ShuffleVectorInst::ShuffleVectorInst(Value V1, Value V2, Value *Mask,
const Twine &Name,		const Twine &Name,
BasicBlock *InsertAtEnd)		BasicBlock *InsertAtEnd)
: Instruction(VectorType::get(cast<VectorType>(V1->getType())->getElementType(),		: Instruction(VectorType::get(cast<VectorType>(V1->getType())->getElementType(),
cast<VectorType>(Mask->getType())->getNumElements()),		cast<VectorType>(Mask->getType())->getElementCount()),
ShuffleVector,		ShuffleVector,
OperandTraits<ShuffleVectorInst>::op_begin(this),		OperandTraits<ShuffleVectorInst>::op_begin(this),
OperandTraits<ShuffleVectorInst>::operands(this),		OperandTraits<ShuffleVectorInst>::operands(this),
InsertAtEnd) {		InsertAtEnd) {
assert(isValidOperands(V1, V2, Mask) &&		assert(isValidOperands(V1, V2, Mask) &&
"Invalid shuffle vector instruction operands!");		"Invalid shuffle vector instruction operands!");

Op<0>() = V1;		Op<0>() = V1;
▲ Show 20 Lines • Show All 1,156 Lines • ▼ Show 20 Lines	if (VectorType *SrcVecTy = dyn_cast<VectorType>(SrcTy))
if (VectorType *DestVecTy = dyn_cast<VectorType>(DestTy))		if (VectorType *DestVecTy = dyn_cast<VectorType>(DestTy))
if (SrcVecTy->getNumElements() == DestVecTy->getNumElements()) {		if (SrcVecTy->getNumElements() == DestVecTy->getNumElements()) {
// An element by element cast. Valid if casting the elements is valid.		// An element by element cast. Valid if casting the elements is valid.
SrcTy = SrcVecTy->getElementType();		SrcTy = SrcVecTy->getElementType();
DestTy = DestVecTy->getElementType();		DestTy = DestVecTy->getElementType();
}		}

// Get the bit sizes, we'll need these		// Get the bit sizes, we'll need these
unsigned SrcBits = SrcTy->getPrimitiveSizeInBits(); // 0 for ptr		auto SrcBits = SrcTy->getPrimitiveSizeInBits(); // 0 for ptr
unsigned DestBits = DestTy->getPrimitiveSizeInBits(); // 0 for ptr		auto DestBits = DestTy->getPrimitiveSizeInBits(); // 0 for ptr
		sdesmalenUnsubmitted Not Done Reply Inline Actions nit: auto -> TypeSize ? sdesmalen: nit: auto -> TypeSize ?

// Run through the possibilities ...		// Run through the possibilities ...
if (DestTy->isIntegerTy()) { // Casting to integral		if (DestTy->isIntegerTy()) { // Casting to integral
if (SrcTy->isIntegerTy()) // Casting from integral		if (SrcTy->isIntegerTy()) // Casting from integral
return true;		return true;
if (SrcTy->isFloatingPointTy()) // Casting from floating pt		if (SrcTy->isFloatingPointTy()) // Casting from floating pt
return true;		return true;
if (SrcTy->isVectorTy()) // Casting from vector		if (SrcTy->isVectorTy()) // Casting from vector
Show All 30 Lines	bool CastInst::isBitCastable(Type SrcTy, Type DestTy) {
if (!SrcTy->isFirstClassType() \|\| !DestTy->isFirstClassType())		if (!SrcTy->isFirstClassType() \|\| !DestTy->isFirstClassType())
return false;		return false;

if (SrcTy == DestTy)		if (SrcTy == DestTy)
return true;		return true;

if (VectorType *SrcVecTy = dyn_cast<VectorType>(SrcTy)) {		if (VectorType *SrcVecTy = dyn_cast<VectorType>(SrcTy)) {
if (VectorType *DestVecTy = dyn_cast<VectorType>(DestTy)) {		if (VectorType *DestVecTy = dyn_cast<VectorType>(DestTy)) {
if (SrcVecTy->getNumElements() == DestVecTy->getNumElements()) {		if (SrcVecTy->getElementCount() == DestVecTy->getElementCount()) {
// An element by element cast. Valid if casting the elements is valid.		// An element by element cast. Valid if casting the elements is valid.
SrcTy = SrcVecTy->getElementType();		SrcTy = SrcVecTy->getElementType();
DestTy = DestVecTy->getElementType();		DestTy = DestVecTy->getElementType();
}		}
}		}
}		}

if (PointerType *DestPtrTy = dyn_cast<PointerType>(DestTy)) {		if (PointerType *DestPtrTy = dyn_cast<PointerType>(DestTy)) {
if (PointerType *SrcPtrTy = dyn_cast<PointerType>(SrcTy)) {		if (PointerType *SrcPtrTy = dyn_cast<PointerType>(SrcTy)) {
return SrcPtrTy->getAddressSpace() == DestPtrTy->getAddressSpace();		return SrcPtrTy->getAddressSpace() == DestPtrTy->getAddressSpace();
}		}
}		}

unsigned SrcBits = SrcTy->getPrimitiveSizeInBits(); // 0 for ptr		auto SrcBits = SrcTy->getPrimitiveSizeInBits(); // 0 for ptr
unsigned DestBits = DestTy->getPrimitiveSizeInBits(); // 0 for ptr		auto DestBits = DestTy->getPrimitiveSizeInBits(); // 0 for ptr
		sdesmalenUnsubmitted Not Done Reply Inline Actions nit: auto -> TypeSize ? sdesmalen: nit: auto -> TypeSize ?

// Could still have vectors of pointers if the number of elements doesn't		// Could still have vectors of pointers if the number of elements doesn't
// match		// match
if (SrcBits == 0 \|\| DestBits == 0)		if (SrcBits.getKnownMinSize() == 0 \|\| DestBits.getKnownMinSize() == 0)
		sdesmalenUnsubmitted Not Done Reply Inline Actions MinSize suggests that it could be larger at runtime, so `SrcBits.getMinSize() == 0` would always be true. If you instead overload `operator bool()` which checks for Fixed size and Scalable size to be 0, you can rewrite this as `(!SrcBits \|\| !DestBits)` . sdesmalen: MinSize suggests that it could be larger at runtime, so `SrcBits.getMinSize() == 0` would…
		huntergrAuthorUnsubmitted Not Done Reply Inline Actions No. `0 * vscale` is still 0 for any value of vscale. The ScalableSize class says the Scalable flag indicates the total size is an integer multiple of the known minimum size. I suspect a better way of doing this test would be to explicitly check for the elements of one being pointer type and not the other, instead of relying on a hack with the size. huntergr: No. `0 * vscale` is still 0 for any value of vscale. The ScalableSize class says the Scalable…
		sdesmalenUnsubmitted Not Done Reply Inline Actions Perhaps I am a little pedantic with how I read this, because I expect the minimum size of every object to be 0, always :) So perhaps instead of using the name `MinSize`, a name like `KnownSize` would be better suited. (which would also nicely match with the `isKnownGreaterOrEqual()` suggestion for DAGCombiner.cpp). sdesmalen: Perhaps I am a little pedantic with how I read this, because I expect the minimum size of every…
return false;		return false;

if (SrcBits != DestBits)		if (SrcBits != DestBits)
return false;		return false;

if (DestTy->isX86_MMXTy() \|\| SrcTy->isX86_MMXTy())		if (DestTy->isX86_MMXTy() \|\| SrcTy->isX86_MMXTy())
return false;		return false;

▲ Show 20 Lines • Show All 1,250 Lines • Show Last 20 Lines

llvm/lib/IR/Type.cpp

Show All 20 Lines
#include "llvm/IR/Constants.h"		#include "llvm/IR/Constants.h"
#include "llvm/IR/DerivedTypes.h"		#include "llvm/IR/DerivedTypes.h"
#include "llvm/IR/LLVMContext.h"		#include "llvm/IR/LLVMContext.h"
#include "llvm/IR/Module.h"		#include "llvm/IR/Module.h"
#include "llvm/IR/Value.h"		#include "llvm/IR/Value.h"
#include "llvm/Support/Casting.h"		#include "llvm/Support/Casting.h"
#include "llvm/Support/MathExtras.h"		#include "llvm/Support/MathExtras.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"
		#include "llvm/Support/TypeSize.h"
#include <cassert>		#include <cassert>
#include <utility>		#include <utility>

using namespace llvm;		using namespace llvm;

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Type Class Implementation		// Type Class Implementation
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
▲ Show 20 Lines • Show All 69 Lines • ▼ Show 20 Lines	for (unsigned i = 0; i < NumElements; ++i)
if (!STy->getElementType(i)->isEmptyTy())		if (!STy->getElementType(i)->isEmptyTy())
return false;		return false;
return true;		return true;
}		}

return false;		return false;
}		}

unsigned Type::getPrimitiveSizeInBits() const {		TypeSize Type::getPrimitiveSizeInBits() const {
switch (getTypeID()) {		switch (getTypeID()) {
case Type::HalfTyID: return 16;		case Type::HalfTyID: return TypeSize::Fixed(16);
case Type::FloatTyID: return 32;		case Type::FloatTyID: return TypeSize::Fixed(32);
case Type::DoubleTyID: return 64;		case Type::DoubleTyID: return TypeSize::Fixed(64);
case Type::X86_FP80TyID: return 80;		case Type::X86_FP80TyID: return TypeSize::Fixed(80);
case Type::FP128TyID: return 128;		case Type::FP128TyID: return TypeSize::Fixed(128);
case Type::PPC_FP128TyID: return 128;		case Type::PPC_FP128TyID: return TypeSize::Fixed(128);
case Type::X86_MMXTyID: return 64;		case Type::X86_MMXTyID: return TypeSize::Fixed(64);
case Type::IntegerTyID: return cast<IntegerType>(this)->getBitWidth();		case Type::IntegerTyID:
case Type::VectorTyID: return cast<VectorType>(this)->getBitWidth();		return TypeSize::Fixed(cast<IntegerType>(this)->getBitWidth());
default: return 0;		case Type::VectorTyID: {
		const VectorType *VTy = cast<VectorType>(this);
		return TypeSize(VTy->getBitWidth(), VTy->isScalable());
		}
		default: return TypeSize::Fixed(0);
}		}
}		}

unsigned Type::getScalarSizeInBits() const {		unsigned Type::getScalarSizeInBits() const {
return getScalarType()->getPrimitiveSizeInBits();		return getScalarType()->getPrimitiveSizeInBits();
}		}

int Type::getFPMantissaWidth() const {		int Type::getFPMantissaWidth() const {
▲ Show 20 Lines • Show All 532 Lines • Show Last 20 Lines

llvm/lib/Target/AArch64/AArch64ISelLowering.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 8,520 Lines • ▼ Show 20 Lines	case Instruction::GetElementPtr: {
auto &DL = Ext->getModule()->getDataLayout();		auto &DL = Ext->getModule()->getDataLayout();
std::advance(GTI, U.getOperandNo()-1);		std::advance(GTI, U.getOperandNo()-1);
Type *IdxTy = GTI.getIndexedType();		Type *IdxTy = GTI.getIndexedType();
// This extension will end up with a shift because of the scaling factor.		// This extension will end up with a shift because of the scaling factor.
// 8-bit sized types have a scaling factor of 1, thus a shift amount of 0.		// 8-bit sized types have a scaling factor of 1, thus a shift amount of 0.
// Get the shift amount based on the scaling factor:		// Get the shift amount based on the scaling factor:
// log2(sizeof(IdxTy)) - log2(8).		// log2(sizeof(IdxTy)) - log2(8).
uint64_t ShiftAmt =		uint64_t ShiftAmt =
countTrailingZeros(DL.getTypeStoreSizeInBits(IdxTy)) - 3;		countTrailingZeros(DL.getTypeStoreSizeInBits(IdxTy).getFixedSize()) - 3;
// Is the constant foldable in the shift of the addressing mode?		// Is the constant foldable in the shift of the addressing mode?
// I.e., shift amount is between 1 and 4 inclusive.		// I.e., shift amount is between 1 and 4 inclusive.
if (ShiftAmt == 0 \|\| ShiftAmt > 4)		if (ShiftAmt == 0 \|\| ShiftAmt > 4)
return false;		return false;
break;		break;
}		}
case Instruction::Trunc:		case Instruction::Trunc:
// Check if this is a noop.		// Check if this is a noop.
▲ Show 20 Lines • Show All 3,846 Lines • Show Last 20 Lines

llvm/lib/Transforms/Scalar/SROA.cpp

Show First 20 Lines • Show All 953 Lines • ▼ Show 20 Lines	Instruction hasUnsafePHIOrSelectUse(Instruction Root, uint64_t &Size) {
// If there are no loads or stores, the access is dead. We mark that as		// If there are no loads or stores, the access is dead. We mark that as
// a size zero access.		// a size zero access.
Size = 0;		Size = 0;
do {		do {
Instruction I, UsedI;		Instruction I, UsedI;
std::tie(UsedI, I) = Uses.pop_back_val();		std::tie(UsedI, I) = Uses.pop_back_val();

if (LoadInst *LI = dyn_cast<LoadInst>(I)) {		if (LoadInst *LI = dyn_cast<LoadInst>(I)) {
Size = std::max(Size, DL.getTypeStoreSize(LI->getType()));		Size = std::max(Size,
		DL.getTypeStoreSize(LI->getType()).getFixedSize());
continue;		continue;
}		}
if (StoreInst *SI = dyn_cast<StoreInst>(I)) {		if (StoreInst *SI = dyn_cast<StoreInst>(I)) {
Value *Op = SI->getOperand(0);		Value *Op = SI->getOperand(0);
if (Op == UsedI)		if (Op == UsedI)
return SI;		return SI;
Size = std::max(Size, DL.getTypeStoreSize(Op->getType()));		Size = std::max(Size,
		DL.getTypeStoreSize(Op->getType()).getFixedSize());
continue;		continue;
}		}

if (GetElementPtrInst *GEP = dyn_cast<GetElementPtrInst>(I)) {		if (GetElementPtrInst *GEP = dyn_cast<GetElementPtrInst>(I)) {
if (!GEP->hasAllZeroIndices())		if (!GEP->hasAllZeroIndices())
return GEP;		return GEP;
} else if (!isa<BitCastInst>(I) && !isa<PHINode>(I) &&		} else if (!isa<BitCastInst>(I) && !isa<PHINode>(I) &&
!isa<SelectInst>(I) && !isa<AddrSpaceCastInst>(I)) {		!isa<SelectInst>(I) && !isa<AddrSpaceCastInst>(I)) {
▲ Show 20 Lines • Show All 3,679 Lines • Show Last 20 Lines

llvm/test/Other/scalable-vectors-core-ir.ll

This file was added.

				; RUN: opt -S -verify < %s \| FileCheck %s
				target datalayout = "e-m:e-i8:8:32-i16:16:32-i64:64-i128:128-n32:64-S128"
				target triple = "aarch64--linux-gnu"

				;; Check supported instructions are accepted without dropping 'vscale'.
				;; Same order as the LangRef

				;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;
				;; Unary Operations
				;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;


				define <vscale x 2 x double> @fneg(<vscale x 2 x double> %val) {
				; CHECK-LABEL: @fneg
				; CHECK: %r = fneg <vscale x 2 x double> %val
				; CHECK-NEXT: ret <vscale x 2 x double> %r
				%r = fneg <vscale x 2 x double> %val
				ret <vscale x 2 x double> %r
				}

				;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;
				;; Binary Operations
				;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;

				define <vscale x 8 x i16> @add(<vscale x 8 x i16> %a, <vscale x 8 x i16> %b) {
				; CHECK-LABEL: @add
				; CHECK: %r = add <vscale x 8 x i16> %a, %b
				; CHECK-NEXT: ret <vscale x 8 x i16> %r
				%r = add <vscale x 8 x i16> %a, %b
				ret <vscale x 8 x i16> %r
				}

				define <vscale x 4 x float> @fadd(<vscale x 4 x float> %a, <vscale x 4 x float> %b) {
				; CHECK-LABEL: @fadd
				; CHECK: %r = fadd <vscale x 4 x float> %a, %b
				; CHECK-NEXT: ret <vscale x 4 x float> %r
				%r = fadd <vscale x 4 x float> %a, %b
				ret <vscale x 4 x float> %r
				}

				define <vscale x 4 x i32> @sub(<vscale x 4 x i32> %a, <vscale x 4 x i32> %b) {
				; CHECK-LABEL: @sub
				; CHECK: %r = sub <vscale x 4 x i32> %a, %b
				; CHECK-NEXT: ret <vscale x 4 x i32> %r
				%r = sub <vscale x 4 x i32> %a, %b
				ret <vscale x 4 x i32> %r
				}

				define <vscale x 4 x float> @fsub(<vscale x 4 x float> %a, <vscale x 4 x float> %b) {
				; CHECK-LABEL: @fsub
				; CHECK: %r = fsub <vscale x 4 x float> %a, %b
				; CHECK-NEXT: ret <vscale x 4 x float> %r
				%r = fsub <vscale x 4 x float> %a, %b
				ret <vscale x 4 x float> %r
				}

				define <vscale x 4 x i32> @mul(<vscale x 4 x i32> %a, <vscale x 4 x i32> %b) {
				; CHECK-LABEL: @mul
				; CHECK: %r = mul <vscale x 4 x i32> %a, %b
				; CHECK-NEXT: ret <vscale x 4 x i32> %r
				%r = mul <vscale x 4 x i32> %a, %b
				ret <vscale x 4 x i32> %r
				}

				define <vscale x 4 x float> @fmul(<vscale x 4 x float> %a, <vscale x 4 x float> %b) {
				; CHECK-LABEL: @fmul
				; CHECK: %r = fmul <vscale x 4 x float> %a, %b
				; CHECK-NEXT: ret <vscale x 4 x float> %r
				%r = fmul <vscale x 4 x float> %a, %b
				ret <vscale x 4 x float> %r
				}

				define <vscale x 4 x i32> @udiv(<vscale x 4 x i32> %a, <vscale x 4 x i32> %b) {
				; CHECK-LABEL: @udiv
				; CHECK: %r = udiv <vscale x 4 x i32> %a, %b
				; CHECK-NEXT: ret <vscale x 4 x i32> %r
				%r = udiv <vscale x 4 x i32> %a, %b
				ret <vscale x 4 x i32> %r
				}

				define <vscale x 4 x i32> @sdiv(<vscale x 4 x i32> %a, <vscale x 4 x i32> %b) {
				; CHECK-LABEL: @sdiv
				; CHECK: %r = sdiv <vscale x 4 x i32> %a, %b
				; CHECK-NEXT: ret <vscale x 4 x i32> %r
				%r = sdiv <vscale x 4 x i32> %a, %b
				ret <vscale x 4 x i32> %r
				}

				define <vscale x 4 x float> @fdiv(<vscale x 4 x float> %a, <vscale x 4 x float> %b) {
				; CHECK-LABEL: @fdiv
				; CHECK: %r = fdiv <vscale x 4 x float> %a, %b
				; CHECK-NEXT: ret <vscale x 4 x float> %r
				%r = fdiv <vscale x 4 x float> %a, %b
				ret <vscale x 4 x float> %r
				}

				define <vscale x 4 x i32> @urem(<vscale x 4 x i32> %a, <vscale x 4 x i32> %b) {
				; CHECK-LABEL: @urem
				; CHECK: %r = urem <vscale x 4 x i32> %a, %b
				; CHECK-NEXT: ret <vscale x 4 x i32> %r
				%r = urem <vscale x 4 x i32> %a, %b
				ret <vscale x 4 x i32> %r
				}

				define <vscale x 4 x i32> @srem(<vscale x 4 x i32> %a, <vscale x 4 x i32> %b) {
				; CHECK-LABEL: @srem
				; CHECK: %r = srem <vscale x 4 x i32> %a, %b
				; CHECK-NEXT: ret <vscale x 4 x i32> %r
				%r = srem <vscale x 4 x i32> %a, %b
				ret <vscale x 4 x i32> %r
				}

				define <vscale x 4 x float> @frem(<vscale x 4 x float> %a, <vscale x 4 x float> %b) {
				; CHECK-LABEL: @frem
				; CHECK: %r = frem <vscale x 4 x float> %a, %b
				; CHECK-NEXT: ret <vscale x 4 x float> %r
				%r = frem <vscale x 4 x float> %a, %b
				ret <vscale x 4 x float> %r
				}

				;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;
				;; Bitwise Binary Operations
				;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;

				define <vscale x 4 x i32> @shl(<vscale x 4 x i32> %a, <vscale x 4 x i32> %b) {
				; CHECK-LABEL: @shl
				; CHECK: %r = shl <vscale x 4 x i32> %a, %b
				; CHECK-NEXT: ret <vscale x 4 x i32> %r
				%r = shl <vscale x 4 x i32> %a, %b
				ret <vscale x 4 x i32> %r
				}

				define <vscale x 4 x i32> @lshr(<vscale x 4 x i32> %a, <vscale x 4 x i32> %b) {
				; CHECK-LABEL: @lshr
				; CHECK: %r = lshr <vscale x 4 x i32> %a, %b
				; CHECK-NEXT: ret <vscale x 4 x i32> %r
				%r = lshr <vscale x 4 x i32> %a, %b
				ret <vscale x 4 x i32> %r
				}

				define <vscale x 4 x i32> @ashr(<vscale x 4 x i32> %a, <vscale x 4 x i32> %b) {
				; CHECK-LABEL: @ashr
				; CHECK: %r = ashr <vscale x 4 x i32> %a, %b
				; CHECK-NEXT: ret <vscale x 4 x i32> %r
				%r = ashr <vscale x 4 x i32> %a, %b
				ret <vscale x 4 x i32> %r
				}

				define <vscale x 4 x i32> @and(<vscale x 4 x i32> %a, <vscale x 4 x i32> %b) {
				; CHECK-LABEL: @and
				; CHECK: %r = and <vscale x 4 x i32> %a, %b
				; CHECK-NEXT: ret <vscale x 4 x i32> %r
				%r = and <vscale x 4 x i32> %a, %b
				ret <vscale x 4 x i32> %r
				}

				define <vscale x 4 x i32> @or(<vscale x 4 x i32> %a, <vscale x 4 x i32> %b) {
				; CHECK-LABEL: @or
				; CHECK: %r = or <vscale x 4 x i32> %a, %b
				; CHECK-NEXT: ret <vscale x 4 x i32> %r
				%r = or <vscale x 4 x i32> %a, %b
				ret <vscale x 4 x i32> %r
				}

				define <vscale x 4 x i32> @xor(<vscale x 4 x i32> %a, <vscale x 4 x i32> %b) {
				; CHECK-LABEL: @xor
				; CHECK: %r = xor <vscale x 4 x i32> %a, %b
				; CHECK-NEXT: ret <vscale x 4 x i32> %r
				%r = xor <vscale x 4 x i32> %a, %b
				ret <vscale x 4 x i32> %r
				}

				;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;
				;; Vector Operations
				;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;

				define i64 @extractelement(<vscale x 2 x i64> %val) {
				; CHECK-LABEL: @extractelement
				; CHECK: %r = extractelement <vscale x 2 x i64> %val, i32 0
				; CHECK-NEXT: ret i64 %r
				%r = extractelement <vscale x 2 x i64> %val, i32 0
				ret i64 %r
				}

				define <vscale x 16 x i8> @insertelement(<vscale x 16 x i8> %vec, i8 %ins) {
				; CHECK-LABEL: @insertelement
				; CHECK: %r = insertelement <vscale x 16 x i8> %vec, i8 %ins, i32 0
				; CHECK-NEXT: ret <vscale x 16 x i8> %r
				%r = insertelement <vscale x 16 x i8> %vec, i8 %ins, i32 0
				ret <vscale x 16 x i8> %r
				}

				define <vscale x 8 x half> @shufflevector(half %val) {
				; CHECK-LABEL: @shufflevector
				; CHECK: %insvec = insertelement <vscale x 8 x half> undef, half %val, i32 0
				; CHECK-NEXT: %r = shufflevector <vscale x 8 x half> %insvec, <vscale x 8 x half> undef, <vscale x 8 x i32> zeroinitializer
				; CHECK-NEXT: ret <vscale x 8 x half> %r
				%insvec = insertelement <vscale x 8 x half> undef, half %val, i32 0
				%r = shufflevector <vscale x 8 x half> %insvec, <vscale x 8 x half> undef, <vscale x 8 x i32> zeroinitializer
				ret <vscale x 8 x half> %r
				}

				;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;
				;; Memory Access and Addressing Operations
				;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;

				define void @alloca() {
				; CHECK-LABEL: @alloca
				; CHECK: %vec = alloca <vscale x 4 x i32>
				; CHECK-NEXT: ret void
				%vec = alloca <vscale x 4 x i32>
				ret void
				}

				define <vscale x 2 x double> @load(<vscale x 2 x double>* %ptr) {
				; CHECK-LABEL: @load
				; CHECK: %r = load <vscale x 2 x double>, <vscale x 2 x double>* %ptr
				; CHECK-NEXT: ret <vscale x 2 x double> %r
				%r = load <vscale x 2 x double>, <vscale x 2 x double>* %ptr
				ret <vscale x 2 x double> %r
				}

				define void @store(<vscale x 4 x i32> %data, <vscale x 4 x i32>* %ptr) {
				; CHECK-LABEL: @store
				; CHECK: store <vscale x 4 x i32> %data, <vscale x 4 x i32>* %ptr
				; CHECK-NEXT: ret void
				store <vscale x 4 x i32> %data, <vscale x 4 x i32>* %ptr
				ret void
				}

				define <vscale x 4 x float>* @getelementptr(<vscale x 4 x float>* %base) {
				; CHECK-LABEL: @getelementptr
				; CHECK: %r = getelementptr <vscale x 4 x float>, <vscale x 4 x float>* %base, i64 0
				; CHECK-NEXT: ret <vscale x 4 x float>* %r
				%r = getelementptr <vscale x 4 x float>, <vscale x 4 x float>* %base, i64 0
				ret <vscale x 4 x float>* %r
				}

				;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;
				;; Conversion Operations
				;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;

				define <vscale x 4 x i32> @truncto(<vscale x 4 x i64> %val) {
				; CHECK-LABEL: @truncto
				; CHECK: %r = trunc <vscale x 4 x i64> %val to <vscale x 4 x i32>
				; CHECK-NEXT: ret <vscale x 4 x i32> %r
				%r = trunc <vscale x 4 x i64> %val to <vscale x 4 x i32>
				ret <vscale x 4 x i32> %r
				}

				define <vscale x 2 x i64> @zextto(<vscale x 2 x i16> %val) {
				; CHECK-LABEL: @zextto
				; CHECK: %r = zext <vscale x 2 x i16> %val to <vscale x 2 x i64>
				; CHECK-NEXT: ret <vscale x 2 x i64> %r
				%r = zext <vscale x 2 x i16> %val to <vscale x 2 x i64>
				ret <vscale x 2 x i64> %r
				}

				define <vscale x 4 x i32> @sextto(<vscale x 4 x i8> %val) {
				; CHECK-LABEL: @sextto
				; CHECK: %r = sext <vscale x 4 x i8> %val to <vscale x 4 x i32>
				; CHECK-NEXT: ret <vscale x 4 x i32> %r
				%r = sext <vscale x 4 x i8> %val to <vscale x 4 x i32>
				ret <vscale x 4 x i32> %r
				}

				define <vscale x 4 x half> @fptruncto(<vscale x 4 x float> %val) {
				; CHECK-LABEL: @fptruncto
				; CHECK: %r = fptrunc <vscale x 4 x float> %val to <vscale x 4 x half>
				; CHECK-NEXT: ret <vscale x 4 x half> %r
				%r = fptrunc <vscale x 4 x float> %val to <vscale x 4 x half>
				ret <vscale x 4 x half> %r
				}

				define <vscale x 2 x double> @fpextto(<vscale x 2 x half> %val) {
				; CHECK-LABEL: @fpextto
				; CHECK: %r = fpext <vscale x 2 x half> %val to <vscale x 2 x double>
				; CHECK-NEXT: ret <vscale x 2 x double> %r
				%r = fpext <vscale x 2 x half> %val to <vscale x 2 x double>
				ret <vscale x 2 x double> %r
				}

				define <vscale x 4 x i32> @fptouito(<vscale x 4 x float> %val) {
				; CHECK-LABEL: @fptoui
				; CHECK: %r = fptoui <vscale x 4 x float> %val to <vscale x 4 x i32>
				; CHECK-NEXT: ret <vscale x 4 x i32> %r
				%r = fptoui <vscale x 4 x float> %val to <vscale x 4 x i32>
				ret <vscale x 4 x i32> %r
				}

				define <vscale x 4 x i32> @fptosito(<vscale x 4 x float> %val) {
				; CHECK-LABEL: @fptosi
				; CHECK: %r = fptosi <vscale x 4 x float> %val to <vscale x 4 x i32>
				; CHECK-NEXT: ret <vscale x 4 x i32> %r
				%r = fptosi <vscale x 4 x float> %val to <vscale x 4 x i32>
				ret <vscale x 4 x i32> %r
				}

				define <vscale x 4 x float> @uitofpto(<vscale x 4 x i32> %val) {
				; CHECK-LABEL: @uitofp
				; CHECK: %r = uitofp <vscale x 4 x i32> %val to <vscale x 4 x float>
				; CHECK-NEXT: ret <vscale x 4 x float> %r
				%r = uitofp <vscale x 4 x i32> %val to <vscale x 4 x float>
				ret <vscale x 4 x float> %r
				}

				define <vscale x 4 x float> @sitofpto(<vscale x 4 x i32> %val) {
				; CHECK-LABEL: @sitofp
				; CHECK: %r = sitofp <vscale x 4 x i32> %val to <vscale x 4 x float>
				; CHECK-NEXT: ret <vscale x 4 x float> %r
				%r = sitofp <vscale x 4 x i32> %val to <vscale x 4 x float>
				ret <vscale x 4 x float> %r
				}

				define <vscale x 2 x i64> @ptrtointto(<vscale x 2 x i32*> %val) {
				; CHECK-LABEL: @ptrtointto
				; CHECK: %r = ptrtoint <vscale x 2 x i32*> %val to <vscale x 2 x i64>
				; CHECK-NEXT: ret <vscale x 2 x i64> %r
				%r = ptrtoint <vscale x 2 x i32*> %val to <vscale x 2 x i64>
				ret <vscale x 2 x i64> %r
				}

				define <vscale x 2 x i32*> @inttoptrto(<vscale x 2 x i64> %val) {
				; CHECK-LABEL: @inttoptrto
				; CHECK: %r = inttoptr <vscale x 2 x i64> %val to <vscale x 2 x i32*>
				; CHECK-NEXT: ret <vscale x 2 x i32*> %r
				%r = inttoptr <vscale x 2 x i64> %val to <vscale x 2 x i32*>
				ret <vscale x 2 x i32*> %r
				}

				define <vscale x 2 x i64> @bitcastto(<vscale x 2 x double> %a) {
				; CHECK-LABEL: @bitcast
				; CHECK: %r = bitcast <vscale x 2 x double> %a to <vscale x 2 x i64>
				; CHECK-NEXT: ret <vscale x 2 x i64> %r
				%r = bitcast <vscale x 2 x double> %a to <vscale x 2 x i64>
				ret <vscale x 2 x i64> %r
				}

				;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;
				;; Other Operations
				;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;

				define <vscale x 4 x i1> @icmp(<vscale x 4 x i32> %a, <vscale x 4 x i32> %b) {
				; CHECK-LABEL: @icmp
				; CHECK: %r = icmp eq <vscale x 4 x i32> %a, %b
				; CHECK-NEXT: ret <vscale x 4 x i1> %r
				%r = icmp eq <vscale x 4 x i32> %a, %b
				ret <vscale x 4 x i1> %r
				}

				define <vscale x 4 x i1> @fcmp(<vscale x 4 x float> %a, <vscale x 4 x float> %b) {
				; CHECK-LABEL: @fcmp
				; CHECK: %r = fcmp une <vscale x 4 x float> %a, %b
				; CHECK-NEXT: ret <vscale x 4 x i1> %r
				%r = fcmp une <vscale x 4 x float> %a, %b
				ret <vscale x 4 x i1> %r
				}

				define <vscale x 16 x i8> @phi(<vscale x 16 x i8> %a, i32 %val) {
				; CHECK-LABEL: @phi
				; CHECK: %r = phi <vscale x 16 x i8> [ %a, %entry ], [ %added, %iszero ]
				; CHECK-NEXT: ret <vscale x 16 x i8> %r
				entry:
				%cmp = icmp eq i32 %val, 0
				br i1 %cmp, label %iszero, label %end

				iszero:
				%ins = insertelement <vscale x 16 x i8> undef, i8 1, i32 0
				%splatone = shufflevector <vscale x 16 x i8> %ins, <vscale x 16 x i8> undef, <vscale x 16 x i32> zeroinitializer
				%added = add <vscale x 16 x i8> %a, %splatone
				br label %end

				end:
				%r = phi <vscale x 16 x i8> [ %a, %entry ], [ %added, %iszero ]
				ret <vscale x 16 x i8> %r
				}

				define <vscale x 8 x half> @select(<vscale x 8 x half> %a, <vscale x 8 x half> %b, <vscale x 8 x i1> %sval) {
				; CHECK-LABEL: @select
				; CHECK: %r = select <vscale x 8 x i1> %sval, <vscale x 8 x half> %a, <vscale x 8 x half> %b
				; CHECK-NEXT: ret <vscale x 8 x half> %r
				%r = select <vscale x 8 x i1> %sval, <vscale x 8 x half> %a, <vscale x 8 x half> %b
				ret <vscale x 8 x half> %r
				}

				declare <vscale x 4 x i32> @callee(<vscale x 4 x i32>)
				define <vscale x 4 x i32> @call(<vscale x 4 x i32> %val) {
				; CHECK-LABEL: @call
				; CHECK: %r = call <vscale x 4 x i32> @callee(<vscale x 4 x i32> %val)
				; CHECK-NEXT: ret <vscale x 4 x i32> %r
				%r = call <vscale x 4 x i32> @callee(<vscale x 4 x i32> %val)
				ret <vscale x 4 x i32> %r
				}
				No newline at end of file

llvm/unittests/CodeGen/ScalableVectorMVTsTest.cpp

	//===-------- llvm/unittest/CodeGen/ScalableVectorMVTsTest.cpp ------------===//			//===-------- llvm/unittest/CodeGen/ScalableVectorMVTsTest.cpp ------------===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "llvm/CodeGen/ValueTypes.h"			#include "llvm/CodeGen/ValueTypes.h"
	#include "llvm/IR/DerivedTypes.h"			#include "llvm/IR/DerivedTypes.h"
	#include "llvm/IR/LLVMContext.h"			#include "llvm/IR/LLVMContext.h"
	#include "llvm/Support/MachineValueType.h"			#include "llvm/Support/MachineValueType.h"
	#include "llvm/Support/ScalableSize.h"			#include "llvm/Support/TypeSize.h"
	#include "gtest/gtest.h"			#include "gtest/gtest.h"

	using namespace llvm;			using namespace llvm;

	namespace {			namespace {

	TEST(ScalableVectorMVTsTest, IntegerMVTs) {			TEST(ScalableVectorMVTsTest, IntegerMVTs) {
	for (auto VecTy : MVT::integer_scalable_vector_valuetypes()) {			for (auto VecTy : MVT::integer_scalable_vector_valuetypes()) {
	▲ Show 20 Lines • Show All 102 Lines • Show Last 20 Lines

llvm/unittests/IR/VectorTypesTest.cpp

//===--- llvm/unittest/IR/VectorTypesTest.cpp - vector types unit tests ---===//		//===--- llvm/unittest/IR/VectorTypesTest.cpp - vector types unit tests ---===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

		#include "llvm/IR/DataLayout.h"
#include "llvm/IR/DerivedTypes.h"		#include "llvm/IR/DerivedTypes.h"
#include "llvm/IR/LLVMContext.h"		#include "llvm/IR/LLVMContext.h"
#include "llvm/Support/ScalableSize.h"		#include "llvm/Support/TypeSize.h"
#include "gtest/gtest.h"		#include "gtest/gtest.h"
using namespace llvm;		using namespace llvm;

namespace {		namespace {
TEST(VectorTypesTest, FixedLength) {		TEST(VectorTypesTest, FixedLength) {
LLVMContext Ctx;		LLVMContext Ctx;

Type *Int16Ty = Type::getInt16Ty(Ctx);		Type *Int16Ty = Type::getInt16Ty(Ctx);
▲ Show 20 Lines • Show All 136 Lines • ▼ Show 20 Lines	TEST(VectorTypesTest, Scalable) {
EXPECT_EQ(ConvTy->getNumElements(), 4U);		EXPECT_EQ(ConvTy->getNumElements(), 4U);
EXPECT_EQ(ConvTy->getElementType()->getScalarSizeInBits(), 64U);		EXPECT_EQ(ConvTy->getElementType()->getScalarSizeInBits(), 64U);

EltCnt = ScV8Int64Ty->getElementCount();		EltCnt = ScV8Int64Ty->getElementCount();
EXPECT_EQ(EltCnt.Min, 8U);		EXPECT_EQ(EltCnt.Min, 8U);
ASSERT_TRUE(EltCnt.Scalable);		ASSERT_TRUE(EltCnt.Scalable);
}		}

		TEST(VectorTypesTest, FixedLenComparisons) {
		LLVMContext Ctx;
		DataLayout DL("");

		Type *Int32Ty = Type::getInt32Ty(Ctx);
		Type *Int64Ty = Type::getInt64Ty(Ctx);

		VectorType *V2Int32Ty = VectorType::get(Int32Ty, 2);
		VectorType *V4Int32Ty = VectorType::get(Int32Ty, 4);

		VectorType *V2Int64Ty = VectorType::get(Int64Ty, 2);

		TypeSize V2I32Len = V2Int32Ty->getPrimitiveSizeInBits();
		EXPECT_EQ(V2I32Len.getKnownMinSize(), 64U);
		EXPECT_FALSE(V2I32Len.isScalable());

		EXPECT_LT(V2Int32Ty->getPrimitiveSizeInBits(),
		V4Int32Ty->getPrimitiveSizeInBits());
		EXPECT_GT(V2Int64Ty->getPrimitiveSizeInBits(),
		V2Int32Ty->getPrimitiveSizeInBits());
		EXPECT_EQ(V4Int32Ty->getPrimitiveSizeInBits(),
		V2Int64Ty->getPrimitiveSizeInBits());
		EXPECT_NE(V2Int32Ty->getPrimitiveSizeInBits(),
		V2Int64Ty->getPrimitiveSizeInBits());

		// Check that a fixed-only comparison works for fixed size vectors.
		EXPECT_EQ(V2Int64Ty->getPrimitiveSizeInBits().getFixedSize(),
		V4Int32Ty->getPrimitiveSizeInBits().getFixedSize());

		// Check the DataLayout interfaces.
		EXPECT_EQ(DL.getTypeSizeInBits(V2Int64Ty),
		DL.getTypeSizeInBits(V4Int32Ty));
		EXPECT_EQ(DL.getTypeSizeInBits(V2Int32Ty), 64U);
		EXPECT_EQ(DL.getTypeSizeInBits(V2Int64Ty), 128U);
		EXPECT_EQ(DL.getTypeStoreSize(V2Int64Ty),
		DL.getTypeStoreSize(V4Int32Ty));
		EXPECT_NE(DL.getTypeStoreSizeInBits(V2Int32Ty),
		DL.getTypeStoreSizeInBits(V2Int64Ty));
		EXPECT_EQ(DL.getTypeStoreSizeInBits(V2Int32Ty), 64U);
		EXPECT_EQ(DL.getTypeStoreSize(V2Int64Ty), 16U);
		EXPECT_EQ(DL.getTypeAllocSize(V4Int32Ty),
		DL.getTypeAllocSize(V2Int64Ty));
		EXPECT_NE(DL.getTypeAllocSizeInBits(V2Int32Ty),
		DL.getTypeAllocSizeInBits(V2Int64Ty));
		EXPECT_EQ(DL.getTypeAllocSizeInBits(V4Int32Ty), 128U);
		EXPECT_EQ(DL.getTypeAllocSize(V2Int32Ty), 8U);
		ASSERT_TRUE(DL.typeSizeEqualsStoreSize(V4Int32Ty));
		}

		TEST(VectorTypesTest, ScalableComparisons) {
		LLVMContext Ctx;
		DataLayout DL("");

		Type *Int32Ty = Type::getInt32Ty(Ctx);
		Type *Int64Ty = Type::getInt64Ty(Ctx);

		VectorType *ScV2Int32Ty = VectorType::get(Int32Ty, {2, true});
		VectorType *ScV4Int32Ty = VectorType::get(Int32Ty, {4, true});

		VectorType *ScV2Int64Ty = VectorType::get(Int64Ty, {2, true});

		TypeSize ScV2I32Len = ScV2Int32Ty->getPrimitiveSizeInBits();
		EXPECT_EQ(ScV2I32Len.getKnownMinSize(), 64U);
		EXPECT_TRUE(ScV2I32Len.isScalable());

		EXPECT_LT(ScV2Int32Ty->getPrimitiveSizeInBits(),
		ScV4Int32Ty->getPrimitiveSizeInBits());
		EXPECT_GT(ScV2Int64Ty->getPrimitiveSizeInBits(),
		ScV2Int32Ty->getPrimitiveSizeInBits());
		EXPECT_EQ(ScV4Int32Ty->getPrimitiveSizeInBits(),
		ScV2Int64Ty->getPrimitiveSizeInBits());
		EXPECT_NE(ScV2Int32Ty->getPrimitiveSizeInBits(),
		ScV2Int64Ty->getPrimitiveSizeInBits());

		// Check the DataLayout interfaces.
		EXPECT_EQ(DL.getTypeSizeInBits(ScV2Int64Ty),
		DL.getTypeSizeInBits(ScV4Int32Ty));
		EXPECT_EQ(DL.getTypeSizeInBits(ScV2Int32Ty).getKnownMinSize(), 64U);
		EXPECT_EQ(DL.getTypeStoreSize(ScV2Int64Ty),
		DL.getTypeStoreSize(ScV4Int32Ty));
		EXPECT_NE(DL.getTypeStoreSizeInBits(ScV2Int32Ty),
		DL.getTypeStoreSizeInBits(ScV2Int64Ty));
		EXPECT_EQ(DL.getTypeStoreSizeInBits(ScV2Int32Ty).getKnownMinSize(), 64U);
		EXPECT_EQ(DL.getTypeStoreSize(ScV2Int64Ty).getKnownMinSize(), 16U);
		EXPECT_EQ(DL.getTypeAllocSize(ScV4Int32Ty),
		DL.getTypeAllocSize(ScV2Int64Ty));
		EXPECT_NE(DL.getTypeAllocSizeInBits(ScV2Int32Ty),
		DL.getTypeAllocSizeInBits(ScV2Int64Ty));
		EXPECT_EQ(DL.getTypeAllocSizeInBits(ScV4Int32Ty).getKnownMinSize(), 128U);
		EXPECT_EQ(DL.getTypeAllocSize(ScV2Int32Ty).getKnownMinSize(), 8U);
		ASSERT_TRUE(DL.typeSizeEqualsStoreSize(ScV4Int32Ty));
		}

		TEST(VectorTypesTest, CrossComparisons) {
		LLVMContext Ctx;

		Type *Int32Ty = Type::getInt32Ty(Ctx);

		VectorType *V4Int32Ty = VectorType::get(Int32Ty, {4, false});
		VectorType *ScV4Int32Ty = VectorType::get(Int32Ty, {4, true});

		// Even though the minimum size is the same, a scalable vector could be
		// larger so we don't consider them to be the same size.
		EXPECT_NE(V4Int32Ty->getPrimitiveSizeInBits(),
		ScV4Int32Ty->getPrimitiveSizeInBits());
		// If we are only checking the minimum, then they are the same size.
		EXPECT_EQ(V4Int32Ty->getPrimitiveSizeInBits().getKnownMinSize(),
		ScV4Int32Ty->getPrimitiveSizeInBits().getKnownMinSize());

		// We can't use ordering comparisons (<,<=,>,>=) between scalable and
		// non-scalable vector sizes.
		}

} // end anonymous namespace		} // end anonymous namespace

This is an archive of the discontinued LLVM Phabricator instance.

Scalable vector core instruction support + size queriesClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 223835

clang/lib/CodeGen/CGCall.cpp

clang/lib/CodeGen/CGStmt.cpp

clang/lib/CodeGen/CodeGenFunction.cpp

llvm/include/llvm/ADT/DenseMapInfo.h

llvm/include/llvm/IR/DataLayout.h

llvm/include/llvm/IR/DerivedTypes.h

llvm/include/llvm/IR/InstrTypes.h

llvm/include/llvm/IR/Type.h

llvm/include/llvm/Support/MachineValueType.h

llvm/include/llvm/Support/ScalableSize.h

llvm/include/llvm/Support/TypeSize.h

llvm/lib/Analysis/InlineCost.cpp

llvm/lib/CodeGen/Analysis.cpp

llvm/lib/IR/DataLayout.cpp

llvm/lib/IR/Instructions.cpp

llvm/lib/IR/Type.cpp

llvm/lib/Target/AArch64/AArch64ISelLowering.cpp

llvm/lib/Transforms/Scalar/SROA.cpp

llvm/test/Other/scalable-vectors-core-ir.ll

llvm/unittests/CodeGen/ScalableVectorMVTsTest.cpp

llvm/unittests/IR/VectorTypesTest.cpp

Scalable vector core instruction support + size queries
ClosedPublic