This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/
-
llvm/
-
CodeGen/
-
Analysis.h
-
IR/
2/4
DataLayout.h
-
Instructions.h
-
Type.h
-
lib/
-
Analysis/
1
MemoryBuiltins.cpp
1/1
ScalarEvolution.cpp
-
CodeGen/
2/3
Analysis.cpp
-
GlobalISel/
1
CallLowering.cpp
-
SelectionDAG/
-
FunctionLoweringInfo.cpp
-
SelectionDAG.cpp
1/1
SelectionDAGBuilder.cpp
-
IR/
-
DataLayout.cpp
2/3
Type.cpp
-
Target/
-
AMDGPU/
-
AMDGPUISelLowering.cpp
-
NVPTX/
-
NVPTXISelLowering.cpp
-
X86/
-
X86CallLowering.cpp
-
Transforms/
-
IPO/
-
ArgumentPromotion.cpp
-
AttributorAttributes.cpp
-
InstCombine/
-
InstCombineLoadStoreAlloca.cpp
-
InstructionCombining.cpp
-
Scalar/
-
SROA.cpp
-
Utils/
-
ScalarEvolutionExpander.cpp
-
test/
-
CodeGen/RISCV/rvv/
-
RISCV/
-
rvv/
-
load-store-scalable-struct.ll
-
Other/
-
load-scalable-vector-struct.ll
-
store-scalable-vector-struct.ll

Differential D98169

[IR] Permit load/store/alloca for struct with the same scalable vectors.
AbandonedPublic

Authored by HsiangKai on Mar 8 2021, 4:23 AM.

Download Raw Diff

Details

Reviewers

craig.topper
efriedma
c-rhodes
sdesmalen
kmclaughlin
rogfer01
frasercrmck
lebedev.ri
david-arm
ctetreau
jdoerfert
sstefan1
baziotis

Summary

In this patch, we try to support load/store/alloca for scalable structures. We have posted a RFC for the proposal in the mailing list. https://groups.google.com/g/llvm-dev/c/6ZK2eS4-8t0/m/PG6H1NNDBAAJ

In RISC-V vector intrinsics, we have defined types containing multiple scalable vectors. Due to the flexible configuration of vector types with LMUL, the struct is a good fit to represent these types containing multiple scalable vectors. We permit users to define auto variables using these types. That is why we need to support load, store, and alloca for the scalable structure.

To support load, store and alloca for the scalable structure, we have to achieve the following two points.

In StructLayout, we use uint64_t for the StructSize. Currently, we have TypeSize to represent the size of data. We should modify the StructLayout using the TypeSize for data size regardless to support scalable struct or not.
In order to handle load and store scalable struct correctly, we need to modify ComputeValueVTs() to return the correct information. The returned Offsets should be represented as TypeSize. In order to collect the correct information, we need to represent the StructSize in StructLayout using TypeSize. (refer to the first point.)

We have some limitation for the scalable structure.

Due to the TypeSize could only represent either scalable size or fixed size, we could not mix the fixed objects and scalable objects in a struct. We only support struct with all scalable objects.
We only need load/store/alloca scalable struct. We have no need to support GEP for scalable struct. It could alleviate the effort to expand the capabilities of IR.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

HsiangKai created this revision.Mar 8 2021, 4:23 AM

Herald added subscribers: StephenFan, frasercrmck, dexonsmith and 21 others. · View Herald TranscriptMar 8 2021, 4:23 AM

HsiangKai requested review of this revision.Mar 8 2021, 4:23 AM

Herald added a project: Restricted Project. · View Herald TranscriptMar 8 2021, 4:23 AM

Herald added subscribers: llvm-commits, MaskRay. · View Herald Transcript

HsiangKai added reviewers: efriedma, c-rhodes, sdesmalen, kmclaughlin, rogfer01, frasercrmck.Mar 8 2021, 4:24 AM

Harbormaster completed remote builds in B92629: Diff 328971.Mar 8 2021, 5:59 AM

HsiangKai added reviewers: lebedev.ri, david-arm.Mar 8 2021, 1:55 PM

david-arm added inline comments.Mar 9 2021, 2:47 AM

llvm/include/llvm/IR/DataLayout.h
631	This should also return a TypeSize to be consistent with getSizeInBits.
llvm/include/llvm/Support/TypeSize.h
148 ↗	(On Diff #328971)	I'm not sure we want to use operators here - see the TypeSize class for how we did this using isKnownXY functions. There was a long history of discussion about the use of operators, which you can see on some of the earlier patches. The problem with operators is that there are cases where you simply do not know the answer. For example, there is no compile-time answer for Fixed(8) < Scalable(4) since we don't know the value of vscale.

Add comments before getSizeInBytes() and remove compare operators in the StackOffset.

HsiangKai added inline comments.Mar 9 2021, 10:59 PM

llvm/include/llvm/IR/DataLayout.h
631	How about keep the interface as so. In current implementation, the elements will be scalar or scalable. The uses of StructLayout::getSizeInBytes() will be compared with StructLayout::getElementOffset() usually. Keep the interface to return uint64_t should be fine. I added a FIXME comment before the function.

david-arm added inline comments.Mar 10 2021, 12:08 AM

llvm/include/llvm/IR/DataLayout.h
631	But isn't the problem here that the result is now wrong? We're now returning the minimum value, rather than a TypeSize that tries to express the complete value. Also, it seems quite confusing that returning a size in bits gives you TypeSize and returning a size in bytes gives you uint64_t. This is inconsistent with how sizes are returned elsewhere, for example see: include/llvm/CodeGen/ValueTypes.h where we have TypeSize getSizeInBits() const TypeSize getStoreSize() const where the latter returns bytes. If a function must return the minimum value then I think at least the function name should reflect that with a rename, i.e. getMinSizeInBytes() so that the caller understands this is not the actual size.

david-arm added a reviewer: ctetreau.Mar 10 2021, 2:51 AM

Harbormaster completed remote builds in B93017: Diff 329543.Mar 10 2021, 10:03 AM

I'm curious - the restriction that it must be all scalable or all fixed is because we're using TypeSize with only one dimension? It sounds like we'd ideally want to be able to represent the size of structs with something more like StackOffset with two dimensions?

Hi @HsiangKai, this is not a supported use-case for scalable vectors and it is explicitly called out in the LangRef:

Scalable vectors cannot be global variables or members of arrays because their size is unknown at compile time. They are allowed in structs to facilitate intrinsics returning multiple values. Structs containing scalable vectors cannot be used in loads, stores, allocas, or GEPs.

There have been a few conversations about this before. I previously left a comment on D94142 and there was also some discussion on @craig.topper's RFC to add limited support for intrinsics, which can be found here: https://lists.llvm.org/pipermail/llvm-dev/2021-January/147639.html

Supporting scalable vector types as exclusive members of structs that can be used in loads/stores/allocas is a bit of a sliding scale, because the next question will be the one @frasercrmck is asking: "Why can't you mix fixed/scalable types?".

It's been still relatively recent that we've settled on a definition for TypeSize which still needs to be supported in many places in the code-base. Mixing fixed/scalable adds another dimension of complexity, because then all interfaces that take an offset also need to consider the fact that it may be comprised of a fixed-width and scalable-width component. Also, at the moment there isn't really a use-case, there are no languages that allow or support structs/aggregates with scalable members. The original advice given for "What if C/C++ allowed such a use case?" was to solve this in Clang, so as to remove the need for such support in LLVM IR.

The only reason we wanted to allow the use-case for return-values/operands of intrinsics is so that we can model operations that return multiple values which can be scalable, such as first-faulting loads where it returns a data vector (the loaded data) and a predicate mask for the lanes that faulted.

@HsiangKai was there a specific use-case that you had in mind for this?

In D98169#2619303, @sdesmalen wrote:

Hi @HsiangKai, this is not a supported use-case for scalable vectors and it is explicitly called out in the LangRef:

Scalable vectors cannot be global variables or members of arrays because their size is unknown at compile time. They are allowed in structs to facilitate intrinsics returning multiple values. Structs containing scalable vectors cannot be used in loads, stores, allocas, or GEPs.

There have been a few conversations about this before. I previously left a comment on D94142 and there was also some discussion on @craig.topper's RFC to add limited support for intrinsics, which can be found here: https://lists.llvm.org/pipermail/llvm-dev/2021-January/147639.html

Supporting scalable vector types as exclusive members of structs that can be used in loads/stores/allocas is a bit of a sliding scale, because the next question will be the one @frasercrmck is asking: "Why can't you mix fixed/scalable types?".

It's been still relatively recent that we've settled on a definition for TypeSize which still needs to be supported in many places in the code-base. Mixing fixed/scalable adds another dimension of complexity, because then all interfaces that take an offset also need to consider the fact that it may be comprised of a fixed-width and scalable-width component. Also, at the moment there isn't really a use-case, there are no languages that allow or support structs/aggregates with scalable members. The original advice given for "What if C/C++ allowed such a use case?" was to solve this in Clang, so as to remove the need for such support in LLVM IR.

The only reason we wanted to allow the use-case for return-values/operands of intrinsics is so that we can model operations that return multiple values which can be scalable, such as first-faulting loads where it returns a data vector (the loaded data) and a predicate mask for the lanes that faulted.

@HsiangKai was there a specific use-case that you had in mind for this?

We want this to support the segment load/store intrinsics defined here https://github.com/riscv/rvv-intrinsic-doc/blob/master/intrinsic_funcs/03_vector_load_store_segment_instructions_zvlsseg.md These return 2 to 8 vectors that have been loaded into consecutive registers. I believe SVE has similar instructions. I believe SVE represents these using types wider than their normal scalable vector types and relies on the type legalizer to split them up in the backend. This works for SVE because there is only one known minimum size for all scalable vector types so the type legalizer will always split down to that minimum type.

For RISC-V vectors we already use 7 different sizes of scalable vectors to represent the ability of our instructions to operate on 2, 4, or 8 registers simultaneously. And for 1/2, 1/4, and 1/8 fractional registers. The segment load/store instructions add an extra dimension where they can produce/consume 2, 3, or 4 pairs of registers or 2 quadruples, for examples. Following the SVE strategy would give us ambiguous types for the type legalizer.

To solve this we would like to use a struct for the segment load/stores to separate them in IR. Since clang needs an address for every variable and needs to be able to load/store them we need to support load/store/alloca.

In D98169#2619874, @craig.topper wrote:

We want this to support the segment load/store intrinsics defined here https://github.com/riscv/rvv-intrinsic-doc/blob/master/intrinsic_funcs/03_vector_load_store_segment_instructions_zvlsseg.md These return 2 to 8 vectors that have been loaded into consecutive registers. I believe SVE has similar instructions. I believe SVE represents these using types wider than their normal scalable vector types and relies on the type legalizer to split them up in the backend. This works for SVE because there is only one known minimum size for all scalable vector types so the type legalizer will always split down to that minimum type.

Thanks for providing the context!

For RISC-V vectors we already use 7 different sizes of scalable vectors to represent the ability of our instructions to operate on 2, 4, or 8 registers simultaneously. And for 1/2, 1/4, and 1/8 fractional registers. The segment load/store instructions add an extra dimension where they can produce/consume 2, 3, or 4 pairs of registers or 2 quadruples, for examples. Following the SVE strategy would give us ambiguous types for the type legalizer.

How does that look in terms of IR? Is the number of registers somehow represented in the (LLVM IR) vector type? Or are the types the same, but the compiler generates different code depending on what mode is set? For SVE we know we can split the vector because <vscale x 8 x i32> is twice the size of <vscale x 4 x i32>, regardless of the value for vscale. Indeed we know SVE vectors area multiple of 128bits, and therefore that <vscale x 4 x i32> is legal. In order to make any assumptions about splitting/legalization, the compiler will need to know which types are legal, and so would expect the compiler to know the mode (2, 4 ,8) for RVV when generating the code, and therefore have similar knowledge about which types are legal and how the vectors are represented/split into registers. How does that lead to ambiguous types?

To solve this we would like to use a struct for the segment load/stores to separate them in IR. Since clang needs an address for every variable and needs to be able to load/store them we need to support load/store/alloca.

These (C/C++-level) intrinsics are probably implemented using target-specific intrinsics or perhaps a common LLVM IR intrinsic like masked.load, which should be able to take/return a struct with scalable members after D94142. If so, it should be possible to handle this in Clang by emitting extractvalue instructions and storing each member individually. That would avoid any changes to LLVM IR. Is that something you've considered?

If we do need to make this work for scalable vectors, I think it needs a message to the mailing list because it's a change to the LangRef and capabilities of scalable vectors, given previous discussions on this topic. I'd like to avoid giving the impression that we're quietly moving the goalpost on what scalable vectors can do in IR.

HsiangKai retitled this revision from [IR] Permit load/store/alloca for struct with the same scalable vectors. to [PoC][IR] Permit load/store/alloca for struct with the same scalable vectors..Mar 13 2021, 5:17 AM

This patch is a proof of concept patch. We should have a formal discussion about the idea before reviewing this patch.

Some related patches:
We model segment load/store types in D97264.
We intend to separate the uses of scalable types and scalable vector types in D98161.

Change the return type of getSizeInBytes() to TypeSize.
Mark the places we need to take care.

Herald added a reviewer: jdoerfert. · View Herald TranscriptMar 18 2021, 8:07 PM

Herald added a reviewer: sstefan1. · View Herald Transcript

Herald added a reviewer: baziotis. · View Herald Transcript

Herald added subscribers: okura, bbn, jdoerfert, kuter. · View Herald Transcript

Harbormaster completed remote builds in B94602: Diff 331747.Mar 18 2021, 9:20 PM

Use TypeSize for offsets instead of StackOffset.

The memebers are all scalable or all fixed objects. We could use TypeSize for offsets. In other places in the current implementation, it uses TypeSize as the offset type. Use TypeSize for member offsets is more consistent with the current implementation.

Herald added subscribers: kerbowa, pengfei, nhaehnle and 3 others. · View Herald TranscriptMar 24 2021, 12:10 AM

Fix test failed.

Harbormaster completed remote builds in B95414: Diff 332876.Mar 24 2021, 3:06 AM

Harbormaster completed remote builds in B95438: Diff 332911.Mar 24 2021, 7:25 AM

dexonsmith removed a subscriber: dexonsmith.Mar 24 2021, 12:14 PM

Consider scalable struct types in passes processing StructType and remove TODO comments.

Harbormaster completed remote builds in B95954: Diff 333660.Mar 27 2021, 2:28 AM

craig.topper added inline comments.Mar 29 2021, 1:27 PM

llvm/include/llvm/IR/DataLayout.h
628–629	The is special code in DataLayout::getStructLayout that allocates the size of StructLayout to account for the needed size of MemberOffsets. That code does not appear to be updated in this patch. I'm not sure we want to use SmallVector here. The minimum size is increasing the size of every StructLayout object. SmallVector contains the number of elements which is redundant with the NumElements in the StructLayout itself. SmallVector also contains an additional field for the capacity of the allocated memory. Can we go back to the variable sized array and update DataLayout::getStructLayout to use TypeSize instead of uint64_t to calculate the needed space?
llvm/lib/Analysis/MemoryBuiltins.cpp
354	Was it already a bug that scalable vectors weren't checked for here?
llvm/lib/Analysis/ScalarEvolution.cpp
3447	"could not" -> "cannot"
llvm/lib/CodeGen/Analysis.cpp
92	Can we remove the conditional here and always get the the struct layout? Then we don't need to check for null StructLayout in the loop. This was one of the change we did previously when we allowed scalable vectors in structs for intrinsic returns.
112	Is the operand reordering required? I would hope that TypeSize multiplied by unsigned can be in either order.
llvm/lib/CodeGen/GlobalISel/CallLowering.cpp
773	Does this need to be addressed as part of this review?
llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
1891	Maybe we need a new entrypoint to ComputeValueVTs that does this automatically based on the type passed to argument 2?
llvm/lib/IR/Type.cpp
187	Please address this lint warnig
188	I'm slightly concerned that containsScalableVectorType() is recursive and we're now calling isScalableType in a bunch of places. This could be bad for deeply nested structs. On that subject, are we allowing or preventing struct of scalable vectors to be part of other structs?
543	One -> Element.

Add an assertion to check if it is a scalable struct type before creating GEP instructions.
Address comments.

HsiangKai marked 7 inline comments as done.Mar 29 2021, 3:09 PM

Harbormaster completed remote builds in B96204: Diff 333993.Mar 29 2021, 3:13 PM

Fix build fail.

Harbormaster completed remote builds in B96231: Diff 334029.Mar 29 2021, 7:13 PM

craig.topper added inline comments.Mar 29 2021, 7:29 PM

llvm/lib/CodeGen/Analysis.cpp
92	Nevermind. We need this to support a mix of scalable and scalars for intrinsic returns.

Use TrailingObjects for MemberOffsets.

Harbormaster completed remote builds in B96310: Diff 334143.Mar 30 2021, 6:58 AM

HsiangKai added a parent revision: D98161: [NFC][IR] Replace isa<ScalableVectorType> with a predicator function..Mar 30 2021, 7:01 AM

HsiangKai added a child revision: D97264: [RISCV] Define types for Zvlsseg..

craig.topper mentioned this in D99608: [StructLayout] Use TrailingObjects to allocate space for MemberOffsets..Mar 30 2021, 10:10 AM

craig.topper mentioned this in rGf59ba0849f7a: [StructLayout] Use TrailingObjects to allocate space for MemberOffsets..Mar 30 2021, 5:38 PM

VincentWu added a subscriber: VincentWu.Apr 5 2021, 12:46 AM

@HsiangKai can you rebase this on trunk. Some of the TrailingObjects changes would go away.

Rebase.

Harbormaster completed remote builds in B97906: Diff 336347.Apr 9 2021, 1:51 AM

In D98169#2622089, @sdesmalen wrote:

In D98169#2619874, @craig.topper wrote:

We want this to support the segment load/store intrinsics defined here https://github.com/riscv/rvv-intrinsic-doc/blob/master/intrinsic_funcs/03_vector_load_store_segment_instructions_zvlsseg.md These return 2 to 8 vectors that have been loaded into consecutive registers. I believe SVE has similar instructions. I believe SVE represents these using types wider than their normal scalable vector types and relies on the type legalizer to split them up in the backend. This works for SVE because there is only one known minimum size for all scalable vector types so the type legalizer will always split down to that minimum type.

Thanks for providing the context!

For RISC-V vectors we already use 7 different sizes of scalable vectors to represent the ability of our instructions to operate on 2, 4, or 8 registers simultaneously. And for 1/2, 1/4, and 1/8 fractional registers. The segment load/store instructions add an extra dimension where they can produce/consume 2, 3, or 4 pairs of registers or 2 quadruples, for examples. Following the SVE strategy would give us ambiguous types for the type legalizer.

How does that look in terms of IR? Is the number of registers somehow represented in the (LLVM IR) vector type? Or are the types the same, but the compiler generates different code depending on what mode is set? For SVE we know we can split the vector because <vscale x 8 x i32> is twice the size of <vscale x 4 x i32>, regardless of the value for vscale. Indeed we know SVE vectors area multiple of 128bits, and therefore that <vscale x 4 x i32> is legal. In order to make any assumptions about splitting/legalization, the compiler will need to know which types are legal, and so would expect the compiler to know the mode (2, 4 ,8) for RVV when generating the code, and therefore have similar knowledge about which types are legal and how the vectors are represented/split into registers. How does that lead to ambiguous types?

To solve this we would like to use a struct for the segment load/stores to separate them in IR. Since clang needs an address for every variable and needs to be able to load/store them we need to support load/store/alloca.

These (C/C++-level) intrinsics are probably implemented using target-specific intrinsics or perhaps a common LLVM IR intrinsic like masked.load, which should be able to take/return a struct with scalable members after D94142. If so, it should be possible to handle this in Clang by emitting extractvalue instructions and storing each member individually. That would avoid any changes to LLVM IR. Is that something you've considered?

We have defined types containing multiple scalable vectors and we permit users to use these types to define auto variables. That is why we need load, store and alloca capabilities for scalable structure.

If we do need to make this work for scalable vectors, I think it needs a message to the mailing list because it's a change to the LangRef and capabilities of scalable vectors, given previous discussions on this topic. I'd like to avoid giving the impression that we're quietly moving the goalpost on what scalable vectors can do in IR.

I have posted a RFC for the proposal in the mailing list.
https://groups.google.com/g/llvm-dev/c/6ZK2eS4-8t0/m/PG6H1NNDBAAJ

HsiangKai retitled this revision from [PoC][IR] Permit load/store/alloca for struct with the same scalable vectors. to [IR] Permit load/store/alloca for struct with the same scalable vectors..Apr 20 2021, 8:53 AM

HsiangKai edited the summary of this revision. (Show Details)

HsiangKai abandoned this revision.Jun 2 2021, 7:45 AM

Herald added a subscriber: foad. · View Herald TranscriptJun 2 2021, 7:45 AM

eopXD mentioned this in D146872: [1/11][IR] Permit load/store/alloca for struct of the same scalable vector type.Mar 25 2023, 7:40 AM

eopXD mentioned this in rGc8eb535aed03: [1/11][IR] Permit load/store/alloca for struct of the same scalable vector type.May 19 2023, 9:39 AM

Revision Contents

Path

Size

llvm/

include/

llvm/

CodeGen/

Analysis.h

17 lines

IR/

DataLayout.h

26 lines

Instructions.h

6 lines

Type.h

2 lines

lib/

Analysis/

MemoryBuiltins.cpp

2 lines

ScalarEvolution.cpp

2 lines

CodeGen/

Analysis.cpp

39 lines

GlobalISel/

CallLowering.cpp

19 lines

SelectionDAG/

FunctionLoweringInfo.cpp

7 lines

SelectionDAG.cpp

3 lines

SelectionDAGBuilder.cpp

64 lines

IR/

DataLayout.cpp

46 lines

Type.cpp

30 lines

Target/

AMDGPU/

AMDGPUISelLowering.cpp

5 lines

NVPTX/

NVPTXISelLowering.cpp

5 lines

X86/

X86CallLowering.cpp

4 lines

Transforms/

IPO/

ArgumentPromotion.cpp

4 lines

AttributorAttributes.cpp

2 lines

InstCombine/

InstCombineLoadStoreAlloca.cpp

6 lines

InstructionCombining.cpp

5 lines

Scalar/

SROA.cpp

12 lines

Utils/

ScalarEvolutionExpander.cpp

3 lines

test/

CodeGen/

RISCV/

rvv/

load-store-scalable-struct.ll

51 lines

Other/

load-scalable-vector-struct.ll

12 lines

store-scalable-vector-struct.ll

14 lines

Diff 336347

llvm/include/llvm/CodeGen/Analysis.h

	Show First 20 Lines • Show All 61 Lines • ▼ Show 20 Lines
	/// EVTs that represent all the individual underlying			/// EVTs that represent all the individual underlying
	/// non-aggregate types that comprise it.			/// non-aggregate types that comprise it.
	///			///
	/// If Offsets is non-null, it points to a vector to be filled in			/// If Offsets is non-null, it points to a vector to be filled in
	/// with the in-memory offsets of each of the individual values.			/// with the in-memory offsets of each of the individual values.
	///			///
	void ComputeValueVTs(const TargetLowering &TLI, const DataLayout &DL, Type *Ty,			void ComputeValueVTs(const TargetLowering &TLI, const DataLayout &DL, Type *Ty,
	SmallVectorImpl<EVT> &ValueVTs,			SmallVectorImpl<EVT> &ValueVTs,
	SmallVectorImpl<uint64_t> *Offsets = nullptr,			SmallVectorImpl<TypeSize> *Offsets,
	uint64_t StartingOffset = 0);			TypeSize StartingOffset);
				void ComputeValueVTs(const TargetLowering &TLI, const DataLayout &DL, Type *Ty,
				SmallVectorImpl<EVT> &ValueVTs,
				SmallVectorImpl<TypeSize> *Offsets = nullptr,
				uint64_t Offset = 0);

	/// Variant of ComputeValueVTs that also produces the memory VTs.			/// Variant of ComputeValueVTs that also produces the memory VTs.
	void ComputeValueVTs(const TargetLowering &TLI, const DataLayout &DL, Type *Ty,			void ComputeValueVTs(const TargetLowering &TLI, const DataLayout &DL, Type *Ty,
	SmallVectorImpl<EVT> &ValueVTs,			SmallVectorImpl<EVT> &ValueVTs,
	SmallVectorImpl<EVT> *MemVTs,			SmallVectorImpl<EVT> *MemVTs,
	SmallVectorImpl<uint64_t> *Offsets = nullptr,			SmallVectorImpl<TypeSize> *Offsets,
	uint64_t StartingOffset = 0);			TypeSize StartingOffset);
				void ComputeValueVTs(const TargetLowering &TLI, const DataLayout &DL, Type *Ty,
				SmallVectorImpl<EVT> &ValueVTs,
				SmallVectorImpl<EVT> *MemVTs,
				SmallVectorImpl<TypeSize> *Offsets = nullptr,
				uint64_t Offset = 0);

	/// computeValueLLTs - Given an LLVM IR type, compute a sequence of			/// computeValueLLTs - Given an LLVM IR type, compute a sequence of
	/// LLTs that represent all the individual underlying			/// LLTs that represent all the individual underlying
	/// non-aggregate types that comprise it.			/// non-aggregate types that comprise it.
	///			///
	/// If Offsets is non-null, it points to a vector to be filled in			/// If Offsets is non-null, it points to a vector to be filled in
	/// with the in-memory offsets of each of the individual values.			/// with the in-memory offsets of each of the individual values.
	///			///
	▲ Show 20 Lines • Show All 55 Lines • Show Last 20 Lines

llvm/include/llvm/IR/DataLayout.h

	Show All 19 Lines
	#define LLVM_IR_DATALAYOUT_H			#define LLVM_IR_DATALAYOUT_H

	#include "llvm/ADT/ArrayRef.h"			#include "llvm/ADT/ArrayRef.h"
	#include "llvm/ADT/STLExtras.h"			#include "llvm/ADT/STLExtras.h"
	#include "llvm/ADT/SmallVector.h"			#include "llvm/ADT/SmallVector.h"
	#include "llvm/ADT/StringRef.h"			#include "llvm/ADT/StringRef.h"
	#include "llvm/IR/DerivedTypes.h"			#include "llvm/IR/DerivedTypes.h"
	#include "llvm/IR/Type.h"			#include "llvm/IR/Type.h"
				#include "llvm/Support/Alignment.h"
	#include "llvm/Support/Casting.h"			#include "llvm/Support/Casting.h"
	#include "llvm/Support/ErrorHandling.h"			#include "llvm/Support/ErrorHandling.h"
	#include "llvm/Support/MathExtras.h"			#include "llvm/Support/MathExtras.h"
	#include "llvm/Support/Alignment.h"			#include "llvm/Support/Alignment.h"
	#include "llvm/Support/TrailingObjects.h"			#include "llvm/Support/TrailingObjects.h"
	#include "llvm/Support/TypeSize.h"			#include "llvm/Support/TypeSize.h"
	#include <cassert>			#include <cassert>
	#include <cstdint>			#include <cstdint>
	▲ Show 20 Lines • Show All 579 Lines • ▼ Show 20 Lines
	}			}

	inline LLVMTargetDataRef wrap(const DataLayout *P) {			inline LLVMTargetDataRef wrap(const DataLayout *P) {
	return reinterpret_cast<LLVMTargetDataRef>(const_cast<DataLayout *>(P));			return reinterpret_cast<LLVMTargetDataRef>(const_cast<DataLayout *>(P));
	}			}

	/// Used to lazily calculate structure layout information for a target machine,			/// Used to lazily calculate structure layout information for a target machine,
	/// based on the DataLayout structure.			/// based on the DataLayout structure.
	class StructLayout final : public TrailingObjects<StructLayout, uint64_t> {			class StructLayout final : public TrailingObjects<StructLayout, TypeSize> {
	uint64_t StructSize;			TypeSize StructSize;
	Align StructAlignment;			Align StructAlignment;
	unsigned IsPadded : 1;			unsigned IsPadded : 1;
	unsigned NumElements : 31;			unsigned NumElements : 31;

				craig.topperUnsubmitted Done Reply Inline Actions The is special code in DataLayout::getStructLayout that allocates the size of StructLayout to account for the needed size of MemberOffsets. That code does not appear to be updated in this patch. I'm not sure we want to use SmallVector here. The minimum size is increasing the size of every StructLayout object. SmallVector contains the number of elements which is redundant with the NumElements in the StructLayout itself. SmallVector also contains an additional field for the capacity of the allocated memory. Can we go back to the variable sized array and update DataLayout::getStructLayout to use TypeSize instead of uint64_t to calculate the needed space? craig.topper: The is special code in DataLayout::getStructLayout that allocates the size of StructLayout to…
	public:			public:
	uint64_t getSizeInBytes() const { return StructSize; }			TypeSize getSizeInBytes() const { return StructSize; }
				david-armUnsubmitted Not Done Reply Inline Actions This should also return a TypeSize to be consistent with getSizeInBits. david-arm: This should also return a TypeSize to be consistent with getSizeInBits.
				HsiangKaiAuthorUnsubmitted Done Reply Inline Actions How about keep the interface as so. In current implementation, the elements will be scalar or scalable. The uses of StructLayout::getSizeInBytes() will be compared with StructLayout::getElementOffset() usually. Keep the interface to return uint64_t should be fine. I added a FIXME comment before the function. HsiangKai: How about keep the interface as so. In current implementation, the elements will be scalar or…
				david-armUnsubmitted Not Done Reply Inline Actions But isn't the problem here that the result is now wrong? We're now returning the minimum value, rather than a TypeSize that tries to express the complete value. Also, it seems quite confusing that returning a size in bits gives you TypeSize and returning a size in bytes gives you uint64_t. This is inconsistent with how sizes are returned elsewhere, for example see: include/llvm/CodeGen/ValueTypes.h where we have TypeSize getSizeInBits() const TypeSize getStoreSize() const where the latter returns bytes. If a function must return the minimum value then I think at least the function name should reflect that with a rename, i.e. getMinSizeInBytes() so that the caller understands this is not the actual size. david-arm: But isn't the problem here that the result is now wrong? We're now returning the minimum value…

	uint64_t getSizeInBits() const { return 8 * StructSize; }			TypeSize getSizeInBits() const { return 8 * StructSize; }

	Align getAlignment() const { return StructAlignment; }			Align getAlignment() const { return StructAlignment; }

	/// Returns whether the struct has padding or not between its fields.			/// Returns whether the struct has padding or not between its fields.
	/// NB: Padding in nested element is not taken into account.			/// NB: Padding in nested element is not taken into account.
	bool hasPadding() const { return IsPadded; }			bool hasPadding() const { return IsPadded; }

	/// Given a valid byte offset into the structure, returns the structure			/// Given a valid byte offset into the structure, returns the structure
	/// index that contains it.			/// index that contains it.
	unsigned getElementContainingOffset(uint64_t Offset) const;			unsigned getElementContainingOffset(uint64_t Offset) const;

	MutableArrayRef<uint64_t> getMemberOffsets() {			MutableArrayRef<TypeSize> getMemberOffsets() {
	return llvm::makeMutableArrayRef(getTrailingObjects<uint64_t>(),			return llvm::makeMutableArrayRef(getTrailingObjects<TypeSize>(),
	NumElements);			NumElements);
	}			}

	ArrayRef<uint64_t> getMemberOffsets() const {			ArrayRef<TypeSize> getMemberOffsets() const {
	return llvm::makeArrayRef(getTrailingObjects<uint64_t>(), NumElements);			return llvm::makeArrayRef(getTrailingObjects<TypeSize>(), NumElements);
	}			}

	uint64_t getElementOffset(unsigned Idx) const {			TypeSize getElementOffset(unsigned Idx) const {
	assert(Idx < NumElements && "Invalid element idx!");			assert(Idx < NumElements && "Invalid element idx!");
	return getMemberOffsets()[Idx];			return getMemberOffsets()[Idx];
	}			}

	uint64_t getElementOffsetInBits(unsigned Idx) const {			TypeSize getElementOffsetInBits(unsigned Idx) const {
	return getElementOffset(Idx) * 8;			return getElementOffset(Idx) * 8;
	}			}

	private:			private:
	friend class DataLayout; // Only DataLayout can create this class			friend class DataLayout; // Only DataLayout can create this class

	StructLayout(StructType *ST, const DataLayout &DL);			StructLayout(StructType *ST, const DataLayout &DL);

	size_t numTrailingObjects(OverloadToken<uint64_t>) const {			size_t numTrailingObjects(OverloadToken<TypeSize>) const {
	return NumElements;			return NumElements;
	}			}
	};			};

	// The implementation of this method is provided inline as it is particularly			// The implementation of this method is provided inline as it is particularly
	// well suited to constant folding when called on a specific Type subclass.			// well suited to constant folding when called on a specific Type subclass.
	inline TypeSize DataLayout::getTypeSizeInBits(Type *Ty) const {			inline TypeSize DataLayout::getTypeSizeInBits(Type *Ty) const {
	assert(Ty->isSized() && "Cannot getTypeInfo() on a type that is unsized!");			assert(Ty->isSized() && "Cannot getTypeInfo() on a type that is unsized!");
	switch (Ty->getTypeID()) {			switch (Ty->getTypeID()) {
	case Type::LabelTyID:			case Type::LabelTyID:
	return TypeSize::Fixed(getPointerSizeInBits(0));			return TypeSize::Fixed(getPointerSizeInBits(0));
	case Type::PointerTyID:			case Type::PointerTyID:
	return TypeSize::Fixed(getPointerSizeInBits(Ty->getPointerAddressSpace()));			return TypeSize::Fixed(getPointerSizeInBits(Ty->getPointerAddressSpace()));
	case Type::ArrayTyID: {			case Type::ArrayTyID: {
	ArrayType *ATy = cast<ArrayType>(Ty);			ArrayType *ATy = cast<ArrayType>(Ty);
	return ATy->getNumElements() *			return ATy->getNumElements() *
	getTypeAllocSizeInBits(ATy->getElementType());			getTypeAllocSizeInBits(ATy->getElementType());
	}			}
	case Type::StructTyID:			case Type::StructTyID:
	// Get the layout annotation... which is lazily created on demand.			// Get the layout annotation... which is lazily created on demand.
	return TypeSize::Fixed(			return getStructLayout(cast<StructType>(Ty))->getSizeInBits();
	getStructLayout(cast<StructType>(Ty))->getSizeInBits());
	case Type::IntegerTyID:			case Type::IntegerTyID:
	return TypeSize::Fixed(Ty->getIntegerBitWidth());			return TypeSize::Fixed(Ty->getIntegerBitWidth());
	case Type::HalfTyID:			case Type::HalfTyID:
	case Type::BFloatTyID:			case Type::BFloatTyID:
	return TypeSize::Fixed(16);			return TypeSize::Fixed(16);
	case Type::FloatTyID:			case Type::FloatTyID:
	return TypeSize::Fixed(32);			return TypeSize::Fixed(32);
	case Type::DoubleTyID:			case Type::DoubleTyID:
	Show All 27 Lines

llvm/include/llvm/IR/Instructions.h

Show First 20 Lines • Show All 1,142 Lines • ▼ Show 20 Lines	GetElementPtrInst::GetElementPtrInst(Type PointeeType, Value Ptr,
Instruction *InsertBefore)		Instruction *InsertBefore)
: Instruction(getGEPReturnType(PointeeType, Ptr, IdxList), GetElementPtr,		: Instruction(getGEPReturnType(PointeeType, Ptr, IdxList), GetElementPtr,
OperandTraits<GetElementPtrInst>::op_end(this) - Values,		OperandTraits<GetElementPtrInst>::op_end(this) - Values,
Values, InsertBefore),		Values, InsertBefore),
SourceElementType(PointeeType),		SourceElementType(PointeeType),
ResultElementType(getIndexedType(PointeeType, IdxList)) {		ResultElementType(getIndexedType(PointeeType, IdxList)) {
assert(ResultElementType ==		assert(ResultElementType ==
cast<PointerType>(getType()->getScalarType())->getElementType());		cast<PointerType>(getType()->getScalarType())->getElementType());
		assert(!(SourceElementType->isStructTy() &&
		SourceElementType->isScalableType()) &&
		"No support GEP for scalable struct types.");
init(Ptr, IdxList, NameStr);		init(Ptr, IdxList, NameStr);
}		}

GetElementPtrInst::GetElementPtrInst(Type PointeeType, Value Ptr,		GetElementPtrInst::GetElementPtrInst(Type PointeeType, Value Ptr,
ArrayRef<Value *> IdxList, unsigned Values,		ArrayRef<Value *> IdxList, unsigned Values,
const Twine &NameStr,		const Twine &NameStr,
BasicBlock *InsertAtEnd)		BasicBlock *InsertAtEnd)
: Instruction(getGEPReturnType(PointeeType, Ptr, IdxList), GetElementPtr,		: Instruction(getGEPReturnType(PointeeType, Ptr, IdxList), GetElementPtr,
OperandTraits<GetElementPtrInst>::op_end(this) - Values,		OperandTraits<GetElementPtrInst>::op_end(this) - Values,
Values, InsertAtEnd),		Values, InsertAtEnd),
SourceElementType(PointeeType),		SourceElementType(PointeeType),
ResultElementType(getIndexedType(PointeeType, IdxList)) {		ResultElementType(getIndexedType(PointeeType, IdxList)) {
assert(ResultElementType ==		assert(ResultElementType ==
cast<PointerType>(getType()->getScalarType())->getElementType());		cast<PointerType>(getType()->getScalarType())->getElementType());
		assert(!(SourceElementType->isStructTy() &&
		SourceElementType->isScalableType()) &&
		"No support GEP for scalable struct types.");
init(Ptr, IdxList, NameStr);		init(Ptr, IdxList, NameStr);
}		}

DEFINE_TRANSPARENT_OPERAND_ACCESSORS(GetElementPtrInst, Value)		DEFINE_TRANSPARENT_OPERAND_ACCESSORS(GetElementPtrInst, Value)

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// ICmpInst Class		// ICmpInst Class
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
▲ Show 20 Lines • Show All 4,140 Lines • Show Last 20 Lines

llvm/include/llvm/IR/Type.h

Show First 20 Lines • Show All 231 Lines • ▼ Show 20 Lines	public:
bool isPtrOrPtrVectorTy() const { return getScalarType()->isPointerTy(); }		bool isPtrOrPtrVectorTy() const { return getScalarType()->isPointerTy(); }

/// True if this is an instance of VectorType.		/// True if this is an instance of VectorType.
inline bool isVectorTy() const {		inline bool isVectorTy() const {
return getTypeID() == ScalableVectorTyID \|\| getTypeID() == FixedVectorTyID;		return getTypeID() == ScalableVectorTyID \|\| getTypeID() == FixedVectorTyID;
}		}

/// True if this is an instance of scalable types.		/// True if this is an instance of scalable types.
bool isScalableType() const { return getTypeID() == ScalableVectorTyID; }		bool isScalableType() const;

/// Return true if this type could be converted with a lossless BitCast to		/// Return true if this type could be converted with a lossless BitCast to
/// type 'Ty'. For example, i8* to i32*. BitCasts are valid for types of the		/// type 'Ty'. For example, i8* to i32*. BitCasts are valid for types of the
/// same size only where no re-interpretation of the bits is done.		/// same size only where no re-interpretation of the bits is done.
/// Determine if this type could be losslessly bitcast to Ty		/// Determine if this type could be losslessly bitcast to Ty
bool canLosslesslyBitCastTo(Type *Ty) const;		bool canLosslesslyBitCastTo(Type *Ty) const;

/// Return true if this type is empty, that is, it has no elements or all of		/// Return true if this type is empty, that is, it has no elements or all of
▲ Show 20 Lines • Show All 279 Lines • Show Last 20 Lines

llvm/lib/Analysis/MemoryBuiltins.cpp

	Show First 20 Lines • Show All 345 Lines • ▼ Show 20 Lines
	static Value computeArraySize(const CallInst CI, const DataLayout &DL,			static Value computeArraySize(const CallInst CI, const DataLayout &DL,
	const TargetLibraryInfo *TLI,			const TargetLibraryInfo *TLI,
	bool LookThroughSExt = false) {			bool LookThroughSExt = false) {
	if (!CI)			if (!CI)
	return nullptr;			return nullptr;

	// The size of the malloc's result type must be known to determine array size.			// The size of the malloc's result type must be known to determine array size.
	Type *T = getMallocAllocatedType(CI, TLI);			Type *T = getMallocAllocatedType(CI, TLI);
	if (!T \|\| !T->isSized())			if (!T \|\| !T->isSized() \|\| T->isScalableType())
				craig.topperUnsubmitted Not Done Reply Inline Actions Was it already a bug that scalable vectors weren't checked for here? craig.topper: Was it already a bug that scalable vectors weren't checked for here?
	return nullptr;			return nullptr;

	unsigned ElementSize = DL.getTypeAllocSize(T);			unsigned ElementSize = DL.getTypeAllocSize(T);
	if (StructType *ST = dyn_cast<StructType>(T))			if (StructType *ST = dyn_cast<StructType>(T))
	ElementSize = DL.getStructLayout(ST)->getSizeInBytes();			ElementSize = DL.getStructLayout(ST)->getSizeInBytes();

	// If malloc call's arg can be determined to be a multiple of ElementSize,			// If malloc call's arg can be determined to be a multiple of ElementSize,
	// return the multiple. Otherwise, return NULL.			// return the multiple. Otherwise, return NULL.
	▲ Show 20 Lines • Show All 725 Lines • Show Last 20 Lines

llvm/lib/Analysis/ScalarEvolution.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 3,437 Lines • ▼ Show 20 Lines	SCEV::NoWrapFlags OffsetWrap =
GEP->isInBounds() ? SCEV::FlagNSW : SCEV::FlagAnyWrap;		GEP->isInBounds() ? SCEV::FlagNSW : SCEV::FlagAnyWrap;

Type *CurTy = GEP->getType();		Type *CurTy = GEP->getType();
bool FirstIter = true;		bool FirstIter = true;
SmallVector<const SCEV *, 4> Offsets;		SmallVector<const SCEV *, 4> Offsets;
for (const SCEV *IndexExpr : IndexExprs) {		for (const SCEV *IndexExpr : IndexExprs) {
// Compute the (potentially symbolic) offset in bytes for this index.		// Compute the (potentially symbolic) offset in bytes for this index.
if (StructType *STy = dyn_cast<StructType>(CurTy)) {		if (StructType *STy = dyn_cast<StructType>(CurTy)) {
		assert(!STy->isScalableType() &&
		"Scalable struct cannot be used in GEP.");
		craig.topperUnsubmitted Done Reply Inline Actions "could not" -> "cannot" craig.topper: "could not" -> "cannot"
// For a struct, add the member offset.		// For a struct, add the member offset.
ConstantInt *Index = cast<SCEVConstant>(IndexExpr)->getValue();		ConstantInt *Index = cast<SCEVConstant>(IndexExpr)->getValue();
unsigned FieldNo = Index->getZExtValue();		unsigned FieldNo = Index->getZExtValue();
const SCEV *FieldOffset = getOffsetOfExpr(IntIdxTy, STy, FieldNo);		const SCEV *FieldOffset = getOffsetOfExpr(IntIdxTy, STy, FieldNo);
Offsets.push_back(FieldOffset);		Offsets.push_back(FieldOffset);

// Update CurTy to the type of the field at Index.		// Update CurTy to the type of the field at Index.
CurTy = STy->getTypeAtIndex(Index);		CurTy = STy->getTypeAtIndex(Index);
▲ Show 20 Lines • Show All 10,046 Lines • Show Last 20 Lines

llvm/lib/CodeGen/Analysis.cpp

	Show First 20 Lines • Show All 76 Lines • ▼ Show 20 Lines
	/// non-aggregate types that comprise it.			/// non-aggregate types that comprise it.
	///			///
	/// If Offsets is non-null, it points to a vector to be filled in			/// If Offsets is non-null, it points to a vector to be filled in
	/// with the in-memory offsets of each of the individual values.			/// with the in-memory offsets of each of the individual values.
	///			///
	void llvm::ComputeValueVTs(const TargetLowering &TLI, const DataLayout &DL,			void llvm::ComputeValueVTs(const TargetLowering &TLI, const DataLayout &DL,
	Type *Ty, SmallVectorImpl<EVT> &ValueVTs,			Type *Ty, SmallVectorImpl<EVT> &ValueVTs,
	SmallVectorImpl<EVT> *MemVTs,			SmallVectorImpl<EVT> *MemVTs,
	SmallVectorImpl<uint64_t> *Offsets,			SmallVectorImpl<TypeSize> *Offsets,
	uint64_t StartingOffset) {			TypeSize StartingOffset) {
	// Given a struct type, recursively traverse the elements.			// Given a struct type, recursively traverse the elements.
	if (StructType *STy = dyn_cast<StructType>(Ty)) {			if (StructType *STy = dyn_cast<StructType>(Ty)) {
	// If the Offsets aren't needed, don't query the struct layout. This allows			// If the Offsets aren't needed, don't query the struct layout. This allows
	// us to support structs with scalable vectors for operations that don't			// us to support structs with scalable vectors for operations that don't
	// need offsets.			// need offsets.
	const StructLayout *SL = Offsets ? DL.getStructLayout(STy) : nullptr;			const StructLayout *SL = Offsets ? DL.getStructLayout(STy) : nullptr;
				craig.topperUnsubmitted Done Reply Inline Actions Can we remove the conditional here and always get the the struct layout? Then we don't need to check for null StructLayout in the loop. This was one of the change we did previously when we allowed scalable vectors in structs for intrinsic returns. craig.topper: Can we remove the conditional here and always get the the struct layout? Then we don't need to…
				craig.topperUnsubmitted Not Done Reply Inline Actions Nevermind. We need this to support a mix of scalable and scalars for intrinsic returns. craig.topper: Nevermind. We need this to support a mix of scalable and scalars for intrinsic returns.
	for (StructType::element_iterator EB = STy->element_begin(),			for (StructType::element_iterator EB = STy->element_begin(), EI = EB,
	EI = EB,
	EE = STy->element_end();			EE = STy->element_end();
	EI != EE; ++EI) {			EI != EE; ++EI) {
	// Don't compute the element offset if we didn't get a StructLayout above.			// Don't compute the element offset if we didn't get a StructLayout above.
	uint64_t EltOffset = SL ? SL->getElementOffset(EI - EB) : 0;			TypeSize EltOffset =
				SL ? SL->getElementOffset(EI - EB)
				: (StartingOffset.isScalable() ? TypeSize::Scalable(0)
				: TypeSize::Fixed(0));
	ComputeValueVTs(TLI, DL, *EI, ValueVTs, MemVTs, Offsets,			ComputeValueVTs(TLI, DL, *EI, ValueVTs, MemVTs, Offsets,
	StartingOffset + EltOffset);			StartingOffset + EltOffset);
	}			}
	return;			return;
	}			}
	// Given an array type, recursively traverse the elements.			// Given an array type, recursively traverse the elements.
	if (ArrayType *ATy = dyn_cast<ArrayType>(Ty)) {			if (ArrayType *ATy = dyn_cast<ArrayType>(Ty)) {
	Type *EltTy = ATy->getElementType();			Type *EltTy = ATy->getElementType();
	uint64_t EltSize = DL.getTypeAllocSize(EltTy).getFixedValue();			TypeSize EltSize = DL.getTypeAllocSize(EltTy);
	for (unsigned i = 0, e = ATy->getNumElements(); i != e; ++i)			for (unsigned i = 0, e = ATy->getNumElements(); i != e; ++i)
	ComputeValueVTs(TLI, DL, EltTy, ValueVTs, MemVTs, Offsets,			ComputeValueVTs(TLI, DL, EltTy, ValueVTs, MemVTs, Offsets,
	StartingOffset + i * EltSize);			StartingOffset + i * EltSize);
				craig.topperUnsubmitted Done Reply Inline Actions Is the operand reordering required? I would hope that TypeSize multiplied by unsigned can be in either order. craig.topper: Is the operand reordering required? I would hope that TypeSize multiplied by unsigned can be in…
	return;			return;
	}			}
	// Interpret void as zero return values.			// Interpret void as zero return values.
	if (Ty->isVoidTy())			if (Ty->isVoidTy())
	return;			return;
	// Base case: we can get an EVT for this LLVM IR type.			// Base case: we can get an EVT for this LLVM IR type.
	ValueVTs.push_back(TLI.getValueType(DL, Ty));			ValueVTs.push_back(TLI.getValueType(DL, Ty));
	if (MemVTs)			if (MemVTs)
	MemVTs->push_back(TLI.getMemValueType(DL, Ty));			MemVTs->push_back(TLI.getMemValueType(DL, Ty));
	if (Offsets)			if (Offsets)
	Offsets->push_back(StartingOffset);			Offsets->push_back(StartingOffset);
	}			}

	void llvm::ComputeValueVTs(const TargetLowering &TLI, const DataLayout &DL,			void llvm::ComputeValueVTs(const TargetLowering &TLI, const DataLayout &DL,
	Type *Ty, SmallVectorImpl<EVT> &ValueVTs,			Type *Ty, SmallVectorImpl<EVT> &ValueVTs,
	SmallVectorImpl<uint64_t> *Offsets,			SmallVectorImpl<TypeSize> *Offsets,
	uint64_t StartingOffset) {			TypeSize StartingOffset) {
	return ComputeValueVTs(TLI, DL, Ty, ValueVTs, /MemVTs=/nullptr, Offsets,			return ComputeValueVTs(TLI, DL, Ty, ValueVTs, /MemVTs=/nullptr, Offsets,
	StartingOffset);			StartingOffset);
	}			}

				void llvm::ComputeValueVTs(const TargetLowering &TLI, const DataLayout &DL,
				Type *Ty, SmallVectorImpl<EVT> &ValueVTs,
				SmallVectorImpl<TypeSize> *Offsets,
				uint64_t Offset) {
				TypeSize StartingOffset = Ty->isScalableType() ? TypeSize::Scalable(Offset)
				: TypeSize::Fixed(Offset);
				return ComputeValueVTs(TLI, DL, Ty, ValueVTs, Offsets, StartingOffset);
				}

				void llvm::ComputeValueVTs(const TargetLowering &TLI, const DataLayout &DL,
				Type *Ty, SmallVectorImpl<EVT> &ValueVTs,
				SmallVectorImpl<EVT> *MemVTs,
				SmallVectorImpl<TypeSize> *Offsets,
				uint64_t Offset) {
				TypeSize StartingOffset = Ty->isScalableType() ? TypeSize::Scalable(Offset)
				: TypeSize::Fixed(Offset);
				return ComputeValueVTs(TLI, DL, Ty, ValueVTs, MemVTs, Offsets,
				StartingOffset);
				}

	void llvm::computeValueLLTs(const DataLayout &DL, Type &Ty,			void llvm::computeValueLLTs(const DataLayout &DL, Type &Ty,
	SmallVectorImpl<LLT> &ValueTys,			SmallVectorImpl<LLT> &ValueTys,
	SmallVectorImpl<uint64_t> *Offsets,			SmallVectorImpl<uint64_t> *Offsets,
	uint64_t StartingOffset) {			uint64_t StartingOffset) {
	// Given a struct type, recursively traverse the elements.			// Given a struct type, recursively traverse the elements.
	if (StructType *STy = dyn_cast<StructType>(&Ty)) {			if (StructType *STy = dyn_cast<StructType>(&Ty)) {
				assert(!STy->isScalableType() && "Unexpected scalable struct type.");
	// If the Offsets aren't needed, don't query the struct layout. This allows			// If the Offsets aren't needed, don't query the struct layout. This allows
	// us to support structs with scalable vectors for operations that don't			// us to support structs with scalable vectors for operations that don't
	// need offsets.			// need offsets.
	const StructLayout *SL = Offsets ? DL.getStructLayout(STy) : nullptr;			const StructLayout *SL = Offsets ? DL.getStructLayout(STy) : nullptr;
	for (unsigned I = 0, E = STy->getNumElements(); I != E; ++I) {			for (unsigned I = 0, E = STy->getNumElements(); I != E; ++I) {
	uint64_t EltOffset = SL ? SL->getElementOffset(I) : 0;			uint64_t EltOffset = SL ? SL->getElementOffset(I) : 0;
	computeValueLLTs(DL, *STy->getElementType(I), ValueTys, Offsets,			computeValueLLTs(DL, *STy->getElementType(I), ValueTys, Offsets,
	StartingOffset + EltOffset);			StartingOffset + EltOffset);
	▲ Show 20 Lines • Show All 655 Lines • Show Last 20 Lines

llvm/lib/CodeGen/GlobalISel/CallLowering.cpp

	Show First 20 Lines • Show All 188 Lines • ▼ Show 20 Lines

	void CallLowering::splitToValueTypes(const ArgInfo &OrigArg,			void CallLowering::splitToValueTypes(const ArgInfo &OrigArg,
	SmallVectorImpl<ArgInfo> &SplitArgs,			SmallVectorImpl<ArgInfo> &SplitArgs,
	const DataLayout &DL,			const DataLayout &DL,
	CallingConv::ID CallConv) const {			CallingConv::ID CallConv) const {
	LLVMContext &Ctx = OrigArg.Ty->getContext();			LLVMContext &Ctx = OrigArg.Ty->getContext();

	SmallVector<EVT, 4> SplitVTs;			SmallVector<EVT, 4> SplitVTs;
	SmallVector<uint64_t, 4> Offsets;			ComputeValueVTs(*TLI, DL, OrigArg.Ty, SplitVTs);
	ComputeValueVTs(*TLI, DL, OrigArg.Ty, SplitVTs, &Offsets, 0);

	if (SplitVTs.size() == 0)			if (SplitVTs.size() == 0)
	return;			return;

	if (SplitVTs.size() == 1) {			if (SplitVTs.size() == 1) {
	// No splitting to do, but we want to replace the original type (e.g. [1 x			// No splitting to do, but we want to replace the original type (e.g. [1 x
	// double] -> double).			// double] -> double).
	SplitArgs.emplace_back(OrigArg.Regs[0], SplitVTs[0].getTypeForEVT(Ctx),			SplitArgs.emplace_back(OrigArg.Regs[0], SplitVTs[0].getTypeForEVT(Ctx),
	▲ Show 20 Lines • Show All 512 Lines • ▼ Show 20 Lines
	void CallLowering::insertSRetLoads(MachineIRBuilder &MIRBuilder, Type *RetTy,			void CallLowering::insertSRetLoads(MachineIRBuilder &MIRBuilder, Type *RetTy,
	ArrayRef<Register> VRegs, Register DemoteReg,			ArrayRef<Register> VRegs, Register DemoteReg,
	int FI) const {			int FI) const {
	MachineFunction &MF = MIRBuilder.getMF();			MachineFunction &MF = MIRBuilder.getMF();
	MachineRegisterInfo &MRI = MF.getRegInfo();			MachineRegisterInfo &MRI = MF.getRegInfo();
	const DataLayout &DL = MF.getDataLayout();			const DataLayout &DL = MF.getDataLayout();

	SmallVector<EVT, 4> SplitVTs;			SmallVector<EVT, 4> SplitVTs;
	SmallVector<uint64_t, 4> Offsets;			SmallVector<TypeSize, 4> Offsets;
	ComputeValueVTs(*TLI, DL, RetTy, SplitVTs, &Offsets, 0);			ComputeValueVTs(*TLI, DL, RetTy, SplitVTs, &Offsets);

	assert(VRegs.size() == SplitVTs.size());			assert(VRegs.size() == SplitVTs.size());

	unsigned NumValues = SplitVTs.size();			unsigned NumValues = SplitVTs.size();
	Align BaseAlign = DL.getPrefTypeAlign(RetTy);			Align BaseAlign = DL.getPrefTypeAlign(RetTy);
	Type *RetPtrTy = RetTy->getPointerTo(DL.getAllocaAddrSpace());			Type *RetPtrTy = RetTy->getPointerTo(DL.getAllocaAddrSpace());
	LLT OffsetLLTy = getLLTForType(*DL.getIntPtrType(RetPtrTy), DL);			LLT OffsetLLTy = getLLTForType(*DL.getIntPtrType(RetPtrTy), DL);

	MachinePointerInfo PtrInfo = MachinePointerInfo::getFixedStack(MF, FI);			MachinePointerInfo PtrInfo = MachinePointerInfo::getFixedStack(MF, FI);

	for (unsigned I = 0; I < NumValues; ++I) {			for (unsigned I = 0; I < NumValues; ++I) {
	Register Addr;			Register Addr;
	MIRBuilder.materializePtrAdd(Addr, DemoteReg, OffsetLLTy, Offsets[I]);			// FIXME: Considering scalable struct types in GlobalISel.
				MIRBuilder.materializePtrAdd(Addr, DemoteReg, OffsetLLTy,
				Offsets[I].getKnownMinSize());
	auto *MMO = MF.getMachineMemOperand(PtrInfo, MachineMemOperand::MOLoad,			auto *MMO = MF.getMachineMemOperand(PtrInfo, MachineMemOperand::MOLoad,
	MRI.getType(VRegs[I]).getSizeInBytes(),			MRI.getType(VRegs[I]).getSizeInBytes(),
	commonAlignment(BaseAlign, Offsets[I]));			commonAlignment(BaseAlign, Offsets[I]));
	MIRBuilder.buildLoad(VRegs[I], Addr, *MMO);			MIRBuilder.buildLoad(VRegs[I], Addr, *MMO);
	}			}
	}			}

	void CallLowering::insertSRetStores(MachineIRBuilder &MIRBuilder, Type *RetTy,			void CallLowering::insertSRetStores(MachineIRBuilder &MIRBuilder, Type *RetTy,
	ArrayRef<Register> VRegs,			ArrayRef<Register> VRegs,
	Register DemoteReg) const {			Register DemoteReg) const {
	MachineFunction &MF = MIRBuilder.getMF();			MachineFunction &MF = MIRBuilder.getMF();
	MachineRegisterInfo &MRI = MF.getRegInfo();			MachineRegisterInfo &MRI = MF.getRegInfo();
	const DataLayout &DL = MF.getDataLayout();			const DataLayout &DL = MF.getDataLayout();

	SmallVector<EVT, 4> SplitVTs;			SmallVector<EVT, 4> SplitVTs;
	SmallVector<uint64_t, 4> Offsets;			SmallVector<TypeSize, 4> Offsets;
	ComputeValueVTs(*TLI, DL, RetTy, SplitVTs, &Offsets, 0);			ComputeValueVTs(*TLI, DL, RetTy, SplitVTs, &Offsets);

	assert(VRegs.size() == SplitVTs.size());			assert(VRegs.size() == SplitVTs.size());

	unsigned NumValues = SplitVTs.size();			unsigned NumValues = SplitVTs.size();
	Align BaseAlign = DL.getPrefTypeAlign(RetTy);			Align BaseAlign = DL.getPrefTypeAlign(RetTy);
	unsigned AS = DL.getAllocaAddrSpace();			unsigned AS = DL.getAllocaAddrSpace();
	LLT OffsetLLTy =			LLT OffsetLLTy =
	getLLTForType(*DL.getIntPtrType(RetTy->getPointerTo(AS)), DL);			getLLTForType(*DL.getIntPtrType(RetTy->getPointerTo(AS)), DL);

	MachinePointerInfo PtrInfo(AS);			MachinePointerInfo PtrInfo(AS);

	for (unsigned I = 0; I < NumValues; ++I) {			for (unsigned I = 0; I < NumValues; ++I) {
	Register Addr;			Register Addr;
	MIRBuilder.materializePtrAdd(Addr, DemoteReg, OffsetLLTy, Offsets[I]);			// FIXME: Considering scalable struct types in GlobalISel.
				craig.topperUnsubmitted Not Done Reply Inline Actions Does this need to be addressed as part of this review? craig.topper: Does this need to be addressed as part of this review?
				MIRBuilder.materializePtrAdd(Addr, DemoteReg, OffsetLLTy,
				Offsets[I].getKnownMinSize());
	auto *MMO = MF.getMachineMemOperand(PtrInfo, MachineMemOperand::MOStore,			auto *MMO = MF.getMachineMemOperand(PtrInfo, MachineMemOperand::MOStore,
	MRI.getType(VRegs[I]).getSizeInBytes(),			MRI.getType(VRegs[I]).getSizeInBytes(),
	commonAlignment(BaseAlign, Offsets[I]));			commonAlignment(BaseAlign, Offsets[I]));
	MIRBuilder.buildStore(VRegs[I], Addr, *MMO);			MIRBuilder.buildStore(VRegs[I], Addr, *MMO);
	}			}
	}			}

	void CallLowering::insertSRetIncomingArgument(			void CallLowering::insertSRetIncomingArgument(
	▲ Show 20 Lines • Show All 314 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/FunctionLoweringInfo.cpp

Show First 20 Lines • Show All 166 Lines • ▼ Show 20 Lines	for (const Instruction &I : BB) {
FrameIndex = MF->getFrameInfo().CreateFixedObject(		FrameIndex = MF->getFrameInfo().CreateFixedObject(
TySize, 0, /IsImmutable=/false, /isAliased=/true);		TySize, 0, /IsImmutable=/false, /isAliased=/true);
MF->getFrameInfo().setObjectAlignment(FrameIndex, Alignment);		MF->getFrameInfo().setObjectAlignment(FrameIndex, Alignment);
} else {		} else {
FrameIndex = MF->getFrameInfo().CreateStackObject(TySize, Alignment,		FrameIndex = MF->getFrameInfo().CreateStackObject(TySize, Alignment,
false, AI);		false, AI);
}		}

// Scalable vectors may need a special StackID to distinguish		// Scalable types may need a special StackID to distinguish
// them from other (fixed size) stack objects.		// them from other (fixed size) stack objects.
if (isa<ScalableVectorType>(Ty))		if (Ty->isScalableType()) {
		assert(Ty->isSized() && "It is only permitted to alloca struct "
		"with the same scalable vector types.");
MF->getFrameInfo().setStackID(FrameIndex,		MF->getFrameInfo().setStackID(FrameIndex,
TFI->getStackIDForScalableVectors());		TFI->getStackIDForScalableVectors());
		}

StaticAllocaMap[AI] = FrameIndex;		StaticAllocaMap[AI] = FrameIndex;
// Update the catch handler information.		// Update the catch handler information.
if (Iter != CatchObjects.end()) {		if (Iter != CatchObjects.end()) {
for (int *CatchObjPtr : Iter->second)		for (int *CatchObjPtr : Iter->second)
*CatchObjPtr = FrameIndex;		*CatchObjPtr = FrameIndex;
}		}
} else {		} else {
▲ Show 20 Lines • Show All 381 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

	Show First 20 Lines • Show All 6,148 Lines • ▼ Show 20 Lines
	}			}

	SDValue SelectionDAG::getMemBasePlusOffset(SDValue Base, TypeSize Offset,			SDValue SelectionDAG::getMemBasePlusOffset(SDValue Base, TypeSize Offset,
	const SDLoc &DL,			const SDLoc &DL,
	const SDNodeFlags Flags) {			const SDNodeFlags Flags) {
	EVT VT = Base.getValueType();			EVT VT = Base.getValueType();
	SDValue Index;			SDValue Index;

				if (Offset.getKnownMinSize() == 0)
				return Base;

	if (Offset.isScalable())			if (Offset.isScalable())
	Index = getVScale(DL, Base.getValueType(),			Index = getVScale(DL, Base.getValueType(),
	APInt(Base.getValueSizeInBits().getFixedSize(),			APInt(Base.getValueSizeInBits().getFixedSize(),
	Offset.getKnownMinSize()));			Offset.getKnownMinSize()));
	else			else
	Index = getConstant(Offset.getFixedSize(), DL, VT);			Index = getConstant(Offset.getFixedSize(), DL, VT);

	return getMemBasePlusOffset(Base, Index, DL, Flags);			return getMemBasePlusOffset(Base, Index, DL, Flags);
	▲ Show 20 Lines • Show All 4,321 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 1,881 Lines • ▼ Show 20 Lines	ComputeValueVTs(TLI, DL,
DAG.getDataLayout().getAllocaAddrSpace()),		DAG.getDataLayout().getAllocaAddrSpace()),
PtrValueVTs);		PtrValueVTs);

SDValue RetPtr = DAG.getCopyFromReg(DAG.getEntryNode(), getCurSDLoc(),		SDValue RetPtr = DAG.getCopyFromReg(DAG.getEntryNode(), getCurSDLoc(),
DemoteReg, PtrValueVTs[0]);		DemoteReg, PtrValueVTs[0]);
SDValue RetOp = getValue(I.getOperand(0));		SDValue RetOp = getValue(I.getOperand(0));

SmallVector<EVT, 4> ValueVTs, MemVTs;		SmallVector<EVT, 4> ValueVTs, MemVTs;
SmallVector<uint64_t, 4> Offsets;		SmallVector<TypeSize, 4> Offsets;
ComputeValueVTs(TLI, DL, I.getOperand(0)->getType(), ValueVTs, &MemVTs,		ComputeValueVTs(TLI, DL, I.getOperand(0)->getType(), ValueVTs, &MemVTs,
		craig.topperUnsubmitted Done Reply Inline Actions Maybe we need a new entrypoint to ComputeValueVTs that does this automatically based on the type passed to argument 2? craig.topper: Maybe we need a new entrypoint to ComputeValueVTs that does this automatically based on the…
&Offsets);		&Offsets);
unsigned NumValues = ValueVTs.size();		unsigned NumValues = ValueVTs.size();

SmallVector<SDValue, 4> Chains(NumValues);		SmallVector<SDValue, 4> Chains(NumValues);
Align BaseAlign = DL.getPrefTypeAlign(I.getOperand(0)->getType());		Align BaseAlign = DL.getPrefTypeAlign(I.getOperand(0)->getType());
for (unsigned i = 0; i != NumValues; ++i) {		for (unsigned i = 0; i != NumValues; ++i) {
// An aggregate return value cannot wrap around the address space, so		// An aggregate return value cannot wrap around the address space, so
// offsets to its parts don't wrap either.		// offsets to its parts don't wrap either.
SDValue Ptr = DAG.getObjectPtrOffset(getCurSDLoc(), RetPtr,		SDValue Ptr = DAG.getObjectPtrOffset(getCurSDLoc(), RetPtr, Offsets[i]);
TypeSize::Fixed(Offsets[i]));

SDValue Val = RetOp.getValue(RetOp.getResNo() + i);		SDValue Val = RetOp.getValue(RetOp.getResNo() + i);
if (MemVTs[i] != ValueVTs[i])		if (MemVTs[i] != ValueVTs[i])
Val = DAG.getPtrExtOrTrunc(Val, getCurSDLoc(), MemVTs[i]);		Val = DAG.getPtrExtOrTrunc(Val, getCurSDLoc(), MemVTs[i]);
Chains[i] = DAG.getStore(		Chains[i] = DAG.getStore(
Chain, getCurSDLoc(), Val,		Chain, getCurSDLoc(), Val,
// FIXME: better loc info would be nice.		// FIXME: better loc info would be nice.
Ptr, MachinePointerInfo::getUnknownStack(DAG.getMachineFunction()),		Ptr, MachinePointerInfo::getUnknownStack(DAG.getMachineFunction()),
commonAlignment(BaseAlign, Offsets[i]));		commonAlignment(BaseAlign, Offsets[i].getKnownMinSize()));
}		}

Chain = DAG.getNode(ISD::TokenFactor, getCurSDLoc(),		Chain = DAG.getNode(ISD::TokenFactor, getCurSDLoc(),
MVT::Other, Chains);		MVT::Other, Chains);
} else if (I.getNumOperands() != 0) {		} else if (I.getNumOperands() != 0) {
SmallVector<EVT, 4> ValueVTs;		SmallVector<EVT, 4> ValueVTs;
ComputeValueVTs(TLI, DL, I.getOperand(0)->getType(), ValueVTs);		ComputeValueVTs(TLI, DL, I.getOperand(0)->getType(), ValueVTs);
unsigned NumValues = ValueVTs.size();		unsigned NumValues = ValueVTs.size();
▲ Show 20 Lines • Show All 2,115 Lines • ▼ Show 20 Lines	void SelectionDAGBuilder::visitLoad(const LoadInst &I) {
Type *Ty = I.getType();		Type *Ty = I.getType();
Align Alignment = I.getAlign();		Align Alignment = I.getAlign();

AAMDNodes AAInfo;		AAMDNodes AAInfo;
I.getAAMetadata(AAInfo);		I.getAAMetadata(AAInfo);
const MDNode *Ranges = I.getMetadata(LLVMContext::MD_range);		const MDNode *Ranges = I.getMetadata(LLVMContext::MD_range);

SmallVector<EVT, 4> ValueVTs, MemVTs;		SmallVector<EVT, 4> ValueVTs, MemVTs;
SmallVector<uint64_t, 4> Offsets;		SmallVector<TypeSize, 4> Offsets;
ComputeValueVTs(TLI, DAG.getDataLayout(), Ty, ValueVTs, &MemVTs, &Offsets);		ComputeValueVTs(TLI, DAG.getDataLayout(), Ty, ValueVTs, &MemVTs, &Offsets);
unsigned NumValues = ValueVTs.size();		unsigned NumValues = ValueVTs.size();
if (NumValues == 0)		if (NumValues == 0)
return;		return;

bool isVolatile = I.isVolatile();		bool isVolatile = I.isVolatile();

SDValue Root;		SDValue Root;
Show All 23 Lines	void SelectionDAGBuilder::visitLoad(const LoadInst &I) {

// An aggregate load cannot wrap around the address space, so offsets to its		// An aggregate load cannot wrap around the address space, so offsets to its
// parts don't wrap either.		// parts don't wrap either.
SDNodeFlags Flags;		SDNodeFlags Flags;
Flags.setNoUnsignedWrap(true);		Flags.setNoUnsignedWrap(true);

SmallVector<SDValue, 4> Values(NumValues);		SmallVector<SDValue, 4> Values(NumValues);
SmallVector<SDValue, 4> Chains(std::min(MaxParallelChains, NumValues));		SmallVector<SDValue, 4> Chains(std::min(MaxParallelChains, NumValues));
EVT PtrVT = Ptr.getValueType();

MachineMemOperand::Flags MMOFlags		MachineMemOperand::Flags MMOFlags
= TLI.getLoadMemOperandFlags(I, DAG.getDataLayout());		= TLI.getLoadMemOperandFlags(I, DAG.getDataLayout());

unsigned ChainI = 0;		unsigned ChainI = 0;
for (unsigned i = 0; i != NumValues; ++i, ++ChainI) {		for (unsigned i = 0; i != NumValues; ++i, ++ChainI) {
// Serializing loads here may result in excessive register pressure, and		// Serializing loads here may result in excessive register pressure, and
// TokenFactor places arbitrary choke points on the scheduler. SD scheduling		// TokenFactor places arbitrary choke points on the scheduler. SD scheduling
// could recover a bit by hoisting nodes upward in the chain by recognizing		// could recover a bit by hoisting nodes upward in the chain by recognizing
// they are side-effect free or do not alias. The optimizer should really		// they are side-effect free or do not alias. The optimizer should really
// avoid this case by converting large object/array copies to llvm.memcpy		// avoid this case by converting large object/array copies to llvm.memcpy
// (MaxParallelChains should always remain as failsafe).		// (MaxParallelChains should always remain as failsafe).
if (ChainI == MaxParallelChains) {		if (ChainI == MaxParallelChains) {
assert(PendingLoads.empty() && "PendingLoads must be serialized first");		assert(PendingLoads.empty() && "PendingLoads must be serialized first");
SDValue Chain = DAG.getNode(ISD::TokenFactor, dl, MVT::Other,		SDValue Chain = DAG.getNode(ISD::TokenFactor, dl, MVT::Other,
makeArrayRef(Chains.data(), ChainI));		makeArrayRef(Chains.data(), ChainI));
Root = Chain;		Root = Chain;
ChainI = 0;		ChainI = 0;
}		}
SDValue A = DAG.getNode(ISD::ADD, dl,		SDValue A = DAG.getMemBasePlusOffset(Ptr, Offsets[i], dl, Flags);
PtrVT, Ptr,
DAG.getConstant(Offsets[i], dl, PtrVT),
Flags);

SDValue L = DAG.getLoad(MemVTs[i], dl, Root, A,		SDValue L =
MachinePointerInfo(SV, Offsets[i]), Alignment,		DAG.getLoad(MemVTs[i], dl, Root, A,
MMOFlags, AAInfo, Ranges);		MachinePointerInfo(SV, Offsets[i].getKnownMinSize()),
		Alignment, MMOFlags, AAInfo, Ranges);
Chains[ChainI] = L.getValue(1);		Chains[ChainI] = L.getValue(1);

if (MemVTs[i] != ValueVTs[i])		if (MemVTs[i] != ValueVTs[i])
L = DAG.getZExtOrTrunc(L, dl, ValueVTs[i]);		L = DAG.getZExtOrTrunc(L, dl, ValueVTs[i]);

Values[i] = L;		Values[i] = L;
}		}

Show All 10 Lines	setValue(&I, DAG.getNode(ISD::MERGE_VALUES, dl,
DAG.getVTList(ValueVTs), Values));		DAG.getVTList(ValueVTs), Values));
}		}

void SelectionDAGBuilder::visitStoreToSwiftError(const StoreInst &I) {		void SelectionDAGBuilder::visitStoreToSwiftError(const StoreInst &I) {
assert(DAG.getTargetLoweringInfo().supportSwiftError() &&		assert(DAG.getTargetLoweringInfo().supportSwiftError() &&
"call visitStoreToSwiftError when backend supports swifterror");		"call visitStoreToSwiftError when backend supports swifterror");

SmallVector<EVT, 4> ValueVTs;		SmallVector<EVT, 4> ValueVTs;
SmallVector<uint64_t, 4> Offsets;		SmallVector<TypeSize, 4> Offsets;
const Value *SrcV = I.getOperand(0);		const Value *SrcV = I.getOperand(0);
ComputeValueVTs(DAG.getTargetLoweringInfo(), DAG.getDataLayout(),		ComputeValueVTs(DAG.getTargetLoweringInfo(), DAG.getDataLayout(),
SrcV->getType(), ValueVTs, &Offsets);		SrcV->getType(), ValueVTs, &Offsets);
assert(ValueVTs.size() == 1 && Offsets[0] == 0 &&		assert(ValueVTs.size() == 1 && Offsets[0].getKnownMinSize() == 0 &&
"expect a single EVT for swifterror");		"expect a single EVT for swifterror");

SDValue Src = getValue(SrcV);		SDValue Src = getValue(SrcV);
// Create a virtual register, then update the virtual register.		// Create a virtual register, then update the virtual register.
Register VReg =		Register VReg =
SwiftError.getOrCreateVRegDefAt(&I, FuncInfo.MBB, I.getPointerOperand());		SwiftError.getOrCreateVRegDefAt(&I, FuncInfo.MBB, I.getPointerOperand());
// Chain, DL, Reg, N or Chain, DL, Reg, N, Glue		// Chain, DL, Reg, N or Chain, DL, Reg, N, Glue
// Chain can be getRoot or getControlRoot.		// Chain can be getRoot or getControlRoot.
Show All 18 Lines	void SelectionDAGBuilder::visitLoadFromSwiftError(const LoadInst &I) {
assert(		assert(
(!AA \|\|		(!AA \|\|
!AA->pointsToConstantMemory(MemoryLocation(		!AA->pointsToConstantMemory(MemoryLocation(
SV, LocationSize::precise(DAG.getDataLayout().getTypeStoreSize(Ty)),		SV, LocationSize::precise(DAG.getDataLayout().getTypeStoreSize(Ty)),
AAInfo))) &&		AAInfo))) &&
"load_from_swift_error should not be constant memory");		"load_from_swift_error should not be constant memory");

SmallVector<EVT, 4> ValueVTs;		SmallVector<EVT, 4> ValueVTs;
SmallVector<uint64_t, 4> Offsets;		SmallVector<TypeSize, 4> Offsets;
ComputeValueVTs(DAG.getTargetLoweringInfo(), DAG.getDataLayout(), Ty,		ComputeValueVTs(DAG.getTargetLoweringInfo(), DAG.getDataLayout(), Ty,
ValueVTs, &Offsets);		ValueVTs, &Offsets);
assert(ValueVTs.size() == 1 && Offsets[0] == 0 &&		assert(ValueVTs.size() == 1 && Offsets[0].getKnownMinSize() == 0 &&
"expect a single EVT for swifterror");		"expect a single EVT for swifterror");

// Chain, DL, Reg, VT, Glue or Chain, DL, Reg, VT		// Chain, DL, Reg, VT, Glue or Chain, DL, Reg, VT
SDValue L = DAG.getCopyFromReg(		SDValue L = DAG.getCopyFromReg(
getRoot(), getCurSDLoc(),		getRoot(), getCurSDLoc(),
SwiftError.getOrCreateVRegUseAt(&I, FuncInfo.MBB, SV), ValueVTs[0]);		SwiftError.getOrCreateVRegUseAt(&I, FuncInfo.MBB, SV), ValueVTs[0]);

setValue(&I, L);		setValue(&I, L);
Show All 17 Lines	if (TLI.supportSwiftError()) {

if (const AllocaInst *Alloca = dyn_cast<AllocaInst>(PtrV)) {		if (const AllocaInst *Alloca = dyn_cast<AllocaInst>(PtrV)) {
if (Alloca->isSwiftError())		if (Alloca->isSwiftError())
return visitStoreToSwiftError(I);		return visitStoreToSwiftError(I);
}		}
}		}

SmallVector<EVT, 4> ValueVTs, MemVTs;		SmallVector<EVT, 4> ValueVTs, MemVTs;
SmallVector<uint64_t, 4> Offsets;		SmallVector<TypeSize, 4> Offsets;
ComputeValueVTs(DAG.getTargetLoweringInfo(), DAG.getDataLayout(),		ComputeValueVTs(DAG.getTargetLoweringInfo(), DAG.getDataLayout(),
SrcV->getType(), ValueVTs, &MemVTs, &Offsets);		SrcV->getType(), ValueVTs, &MemVTs, &Offsets);
unsigned NumValues = ValueVTs.size();		unsigned NumValues = ValueVTs.size();
if (NumValues == 0)		if (NumValues == 0)
return;		return;

// Get the lowered operands. Note that we do this after		// Get the lowered operands. Note that we do this after
// checking if NumResults is zero, because with zero results		// checking if NumResults is zero, because with zero results
Show All 19 Lines	void SelectionDAGBuilder::visitStore(const StoreInst &I) {
for (unsigned i = 0; i != NumValues; ++i, ++ChainI) {		for (unsigned i = 0; i != NumValues; ++i, ++ChainI) {
// See visitLoad comments.		// See visitLoad comments.
if (ChainI == MaxParallelChains) {		if (ChainI == MaxParallelChains) {
SDValue Chain = DAG.getNode(ISD::TokenFactor, dl, MVT::Other,		SDValue Chain = DAG.getNode(ISD::TokenFactor, dl, MVT::Other,
makeArrayRef(Chains.data(), ChainI));		makeArrayRef(Chains.data(), ChainI));
Root = Chain;		Root = Chain;
ChainI = 0;		ChainI = 0;
}		}
SDValue Add =		SDValue Add = DAG.getMemBasePlusOffset(Ptr, Offsets[i], dl, Flags);
DAG.getMemBasePlusOffset(Ptr, TypeSize::Fixed(Offsets[i]), dl, Flags);
SDValue Val = SDValue(Src.getNode(), Src.getResNo() + i);		SDValue Val = SDValue(Src.getNode(), Src.getResNo() + i);
if (MemVTs[i] != ValueVTs[i])		if (MemVTs[i] != ValueVTs[i])
Val = DAG.getPtrExtOrTrunc(Val, dl, MemVTs[i]);		Val = DAG.getPtrExtOrTrunc(Val, dl, MemVTs[i]);
SDValue St =		SDValue St =
DAG.getStore(Root, dl, Val, Add, MachinePointerInfo(PtrV, Offsets[i]),		DAG.getStore(Root, dl, Val, Add,
		MachinePointerInfo(PtrV, Offsets[i].getKnownMinSize()),
Alignment, MMOFlags, AAInfo);		Alignment, MMOFlags, AAInfo);
Chains[ChainI] = St;		Chains[ChainI] = St;
}		}

SDValue StoreNode = DAG.getNode(ISD::TokenFactor, dl, MVT::Other,		SDValue StoreNode = DAG.getNode(ISD::TokenFactor, dl, MVT::Other,
makeArrayRef(Chains.data(), ChainI));		makeArrayRef(Chains.data(), ChainI));
DAG.setRoot(StoreNode);		DAG.setRoot(StoreNode);
}		}
▲ Show 20 Lines • Show All 5,015 Lines • ▼ Show 20 Lines
/// FIXME: When all targets are		/// FIXME: When all targets are
/// migrated to using LowerCall, this hook should be integrated into SDISel.		/// migrated to using LowerCall, this hook should be integrated into SDISel.
std::pair<SDValue, SDValue>		std::pair<SDValue, SDValue>
TargetLowering::LowerCallTo(TargetLowering::CallLoweringInfo &CLI) const {		TargetLowering::LowerCallTo(TargetLowering::CallLoweringInfo &CLI) const {
// Handle the incoming return values from the call.		// Handle the incoming return values from the call.
CLI.Ins.clear();		CLI.Ins.clear();
Type *OrigRetTy = CLI.RetTy;		Type *OrigRetTy = CLI.RetTy;
SmallVector<EVT, 4> RetTys;		SmallVector<EVT, 4> RetTys;
SmallVector<uint64_t, 4> Offsets;		SmallVector<TypeSize, 4> Offsets;
auto &DL = CLI.DAG.getDataLayout();		auto &DL = CLI.DAG.getDataLayout();
ComputeValueVTs(*this, DL, CLI.RetTy, RetTys, &Offsets);		ComputeValueVTs(*this, DL, CLI.RetTy, RetTys, &Offsets);

if (CLI.IsPostTypeLegalization) {		if (CLI.IsPostTypeLegalization) {
// If we are lowering a libcall after legalization, split the return type.		// If we are lowering a libcall after legalization, split the return type.
SmallVector<EVT, 4> OldRetTys;		SmallVector<EVT, 4> OldRetTys;
SmallVector<uint64_t, 4> OldOffsets;		SmallVector<TypeSize, 4> OldOffsets;
RetTys.swap(OldRetTys);		RetTys.swap(OldRetTys);
Offsets.swap(OldOffsets);		Offsets.swap(OldOffsets);

for (size_t i = 0, e = OldRetTys.size(); i != e; ++i) {		for (size_t i = 0, e = OldRetTys.size(); i != e; ++i) {
EVT RetVT = OldRetTys[i];		EVT RetVT = OldRetTys[i];
uint64_t Offset = OldOffsets[i];		TypeSize Offset = OldOffsets[i];
MVT RegisterVT = getRegisterType(CLI.RetTy->getContext(), RetVT);		MVT RegisterVT = getRegisterType(CLI.RetTy->getContext(), RetVT);
unsigned NumRegs = getNumRegisters(CLI.RetTy->getContext(), RetVT);		unsigned NumRegs = getNumRegisters(CLI.RetTy->getContext(), RetVT);
unsigned RegisterVTByteSZ = RegisterVT.getSizeInBits() / 8;		unsigned RegisterVTByteSZ = RegisterVT.getSizeInBits() / 8;
RetTys.append(NumRegs, RegisterVT);		RetTys.append(NumRegs, RegisterVT);
for (unsigned j = 0; j != NumRegs; ++j)		for (unsigned j = 0; j != NumRegs; ++j)
Offsets.push_back(Offset + j * RegisterVTByteSZ);		Offsets.push_back(Offset + TypeSize::Fixed(j * RegisterVTByteSZ));
}		}
}		}

SmallVector<ISD::OutputArg, 4> Outs;		SmallVector<ISD::OutputArg, 4> Outs;
GetReturnInfo(CLI.CallConv, CLI.RetTy, getReturnAttrs(CLI), Outs, *this, DL);		GetReturnInfo(CLI.CallConv, CLI.RetTy, getReturnAttrs(CLI), Outs, *this, DL);

bool CanLowerReturn =		bool CanLowerReturn =
this->CanLowerReturn(CLI.CallConv, CLI.DAG.getMachineFunction(),		this->CanLowerReturn(CLI.CallConv, CLI.DAG.getMachineFunction(),
▲ Show 20 Lines • Show All 285 Lines • ▼ Show 20 Lines	#endif
if (!CanLowerReturn) {		if (!CanLowerReturn) {
// The instruction result is the result of loading from the		// The instruction result is the result of loading from the
// hidden sret parameter.		// hidden sret parameter.
SmallVector<EVT, 1> PVTs;		SmallVector<EVT, 1> PVTs;
Type *PtrRetTy = OrigRetTy->getPointerTo(DL.getAllocaAddrSpace());		Type *PtrRetTy = OrigRetTy->getPointerTo(DL.getAllocaAddrSpace());

ComputeValueVTs(*this, DL, PtrRetTy, PVTs);		ComputeValueVTs(*this, DL, PtrRetTy, PVTs);
assert(PVTs.size() == 1 && "Pointers should fit in one register");		assert(PVTs.size() == 1 && "Pointers should fit in one register");
EVT PtrVT = PVTs[0];

unsigned NumValues = RetTys.size();		unsigned NumValues = RetTys.size();
ReturnValues.resize(NumValues);		ReturnValues.resize(NumValues);
SmallVector<SDValue, 4> Chains(NumValues);		SmallVector<SDValue, 4> Chains(NumValues);

// An aggregate return value cannot wrap around the address space, so		// An aggregate return value cannot wrap around the address space, so
// offsets to its parts don't wrap either.		// offsets to its parts don't wrap either.
SDNodeFlags Flags;		SDNodeFlags Flags;
Flags.setNoUnsignedWrap(true);		Flags.setNoUnsignedWrap(true);

MachineFunction &MF = CLI.DAG.getMachineFunction();		MachineFunction &MF = CLI.DAG.getMachineFunction();
Align HiddenSRetAlign = MF.getFrameInfo().getObjectAlign(DemoteStackIdx);		Align HiddenSRetAlign = MF.getFrameInfo().getObjectAlign(DemoteStackIdx);
for (unsigned i = 0; i < NumValues; ++i) {		for (unsigned i = 0; i < NumValues; ++i) {
SDValue Add = CLI.DAG.getNode(ISD::ADD, CLI.DL, PtrVT, DemoteStackSlot,		SDValue Add = CLI.DAG.getMemBasePlusOffset(DemoteStackSlot, Offsets[i],
CLI.DAG.getConstant(Offsets[i], CLI.DL,		CLI.DL, Flags);
PtrVT), Flags);
SDValue L = CLI.DAG.getLoad(		SDValue L =
RetTys[i], CLI.DL, CLI.Chain, Add,		CLI.DAG.getLoad(RetTys[i], CLI.DL, CLI.Chain, Add,
MachinePointerInfo::getFixedStack(CLI.DAG.getMachineFunction(),		MachinePointerInfo::getFixedStack(
DemoteStackIdx, Offsets[i]),		CLI.DAG.getMachineFunction(), DemoteStackIdx,
		Offsets[i].getKnownMinSize()),
HiddenSRetAlign);		HiddenSRetAlign);
ReturnValues[i] = L;		ReturnValues[i] = L;
Chains[i] = L.getValue(1);		Chains[i] = L.getValue(1);
}		}

CLI.Chain = CLI.DAG.getNode(ISD::TokenFactor, CLI.DL, MVT::Other, Chains);		CLI.Chain = CLI.DAG.getNode(ISD::TokenFactor, CLI.DL, MVT::Other, Chains);
} else {		} else {
// Collect the legal value parts into potentially illegal values		// Collect the legal value parts into potentially illegal values
// that correspond to the original function's return values.		// that correspond to the original function's return values.
▲ Show 20 Lines • Show All 1,381 Lines • Show Last 20 Lines

llvm/lib/IR/DataLayout.cpp

Show All 38 Lines
#include <utility>		#include <utility>

using namespace llvm;		using namespace llvm;

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Support for StructLayout		// Support for StructLayout
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

StructLayout::StructLayout(StructType *ST, const DataLayout &DL) {		StructLayout::StructLayout(StructType *ST, const DataLayout &DL)
		: StructSize(0, false) {
assert(!ST->isOpaque() && "Cannot get layout of opaque structs");		assert(!ST->isOpaque() && "Cannot get layout of opaque structs");
StructSize = 0;		// We permit scalable vector struct as "sized" only when all the elements
		// are the same scalable vector types.
		if (ST->isScalableType())
		StructSize = TypeSize::Scalable(0);
IsPadded = false;		IsPadded = false;
NumElements = ST->getNumElements();		NumElements = ST->getNumElements();

// Loop over each of the elements, placing them in memory.		// Loop over each of the elements, placing them in memory.
for (unsigned i = 0, e = NumElements; i != e; ++i) {		for (unsigned i = 0, e = NumElements; i != e; ++i) {
Type *Ty = ST->getElementType(i);		Type *Ty = ST->getElementType(i);
const Align TyAlign = ST->isPacked() ? Align(1) : DL.getABITypeAlign(Ty);		const Align TyAlign = ST->isPacked() ? Align(1) : DL.getABITypeAlign(Ty);

// Add padding if necessary to align the data element properly.		// Add padding if necessary to align the data element properly.
if (!isAligned(TyAlign, StructSize)) {		// Scalable vector struct must be the same scalable vector types. They have
		// no alignment issues.
		if (!StructSize.isScalable() &&
		!isAligned(TyAlign, StructSize.getFixedSize())) {
IsPadded = true;		IsPadded = true;
StructSize = alignTo(StructSize, TyAlign);		StructSize = TypeSize::Fixed(alignTo(StructSize.getFixedSize(), TyAlign));
}		}

// Keep track of maximum alignment constraint.		// Keep track of maximum alignment constraint.
StructAlignment = std::max(TyAlign, StructAlignment);		StructAlignment = std::max(TyAlign, StructAlignment);

getMemberOffsets()[i] = StructSize;		getMemberOffsets()[i] = StructSize;
// Consume space for this data item		// Consume space for this data item
StructSize += DL.getTypeAllocSize(Ty).getFixedValue();		StructSize += DL.getTypeAllocSize(Ty);
}		}

// Add padding to the end of the struct so that it could be put in an array		// Add padding to the end of the struct so that it could be put in an array
// and all array elements would be aligned correctly.		// and all array elements would be aligned correctly.
if (!isAligned(StructAlignment, StructSize)) {		if (!StructSize.isScalable() &&
		!isAligned(StructAlignment, StructSize.getFixedSize())) {
IsPadded = true;		IsPadded = true;
StructSize = alignTo(StructSize, StructAlignment);		StructSize =
		TypeSize::Fixed(alignTo(StructSize.getFixedSize(), StructAlignment));
}		}
}		}

/// getElementContainingOffset - Given a valid offset into the structure,		/// getElementContainingOffset - Given a valid offset into the structure,
/// return the structure index that contains it.		/// return the structure index that contains it.
unsigned StructLayout::getElementContainingOffset(uint64_t Offset) const {		unsigned StructLayout::getElementContainingOffset(uint64_t O) const {
ArrayRef<uint64_t> MemberOffsets = getMemberOffsets();		ArrayRef<TypeSize> MemberOffsets = getMemberOffsets();
auto SI = llvm::upper_bound(MemberOffsets, Offset);		TypeSize Offset(O, StructSize.isScalable());
		auto SI =
		std::upper_bound(MemberOffsets.begin(), MemberOffsets.end(), Offset,
		[](TypeSize lhs, TypeSize rhs) -> bool {
		return TypeSize::isKnownLT(lhs, rhs);
		});
assert(SI != MemberOffsets.begin() && "Offset not in structure type!");		assert(SI != MemberOffsets.begin() && "Offset not in structure type!");
--SI;		--SI;
assert(*SI <= Offset && "upper_bound didn't work");		assert(TypeSize::isKnownLE(*SI, Offset) && "upper_bound didn't work");
assert((SI == MemberOffsets.begin() \|\| *(SI - 1) <= Offset) &&		assert(
(SI + 1 == MemberOffsets.end() \|\| *(SI + 1) > Offset) &&		(SI == MemberOffsets.begin() \|\| TypeSize::isKnownLE(*(SI - 1), Offset)) &&
		(SI + 1 == MemberOffsets.end() \|\|
		TypeSize::isKnownLT(Offset, *(SI + 1))) &&
"Upper bound didn't work!");		"Upper bound didn't work!");

// Multiple fields can have the same offset if any of them are zero sized.		// Multiple fields can have the same offset if any of them are zero sized.
// For example, in { i32, [0 x i32], i32 }, searching for offset 4 will stop		// For example, in { i32, [0 x i32], i32 }, searching for offset 4 will stop
// at the i32 element, because it is the last element at that offset. This is		// at the i32 element, because it is the last element at that offset. This is
// the right one to return, because anything after it will have a higher		// the right one to return, because anything after it will have a higher
// offset, implying that this element is non-empty.		// offset, implying that this element is non-empty.
return SI - MemberOffsets.begin();		return SI - MemberOffsets.begin();
}		}
▲ Show 20 Lines • Show All 575 Lines • ▼ Show 20 Lines	const StructLayout DataLayout::getStructLayout(StructType Ty) const {

StructLayoutMap STM = static_cast<StructLayoutMap>(LayoutMap);		StructLayoutMap STM = static_cast<StructLayoutMap>(LayoutMap);
StructLayout &SL = (STM)[Ty];		StructLayout &SL = (STM)[Ty];
if (SL) return SL;		if (SL) return SL;

// Otherwise, create the struct layout. Because it is variable length, we		// Otherwise, create the struct layout. Because it is variable length, we
// malloc it, then use placement new.		// malloc it, then use placement new.
StructLayout L = (StructLayout )safe_malloc(		StructLayout L = (StructLayout )safe_malloc(
StructLayout::totalSizeToAlloc<uint64_t>(Ty->getNumElements()));		StructLayout::totalSizeToAlloc<TypeSize>(Ty->getNumElements()));

// Set SL before calling StructLayout's ctor. The ctor could cause other		// Set SL before calling StructLayout's ctor. The ctor could cause other
// entries to be added to TheMap, invalidating our reference.		// entries to be added to TheMap, invalidating our reference.
SL = L;		SL = L;

new (L) StructLayout(Ty, *this);		new (L) StructLayout(Ty, *this);

return L;		return L;
▲ Show 20 Lines • Show All 249 Lines • Show Last 20 Lines

llvm/lib/IR/Type.cpp

Show First 20 Lines • Show All 174 Lines • ▼ Show 20 Lines	if (auto *ATy = dyn_cast<ArrayType>(this))
return ATy->getElementType()->isSized(Visited);		return ATy->getElementType()->isSized(Visited);

if (auto *VTy = dyn_cast<VectorType>(this))		if (auto *VTy = dyn_cast<VectorType>(this))
return VTy->getElementType()->isSized(Visited);		return VTy->getElementType()->isSized(Visited);

return cast<StructType>(this)->isSized(Visited);		return cast<StructType>(this)->isSized(Visited);
}		}

		bool Type::isScalableType() const {
		if (getTypeID() == ScalableVectorTyID)
		return true;

		if (const auto *STy = dyn_cast<StructType>(this)) {
		craig.topperUnsubmitted Done Reply Inline Actions Please address this lint warnig craig.topper: Please address this lint warnig
		if (STy->containsScalableVectorType())
		craig.topperUnsubmitted Not Done Reply Inline Actions I'm slightly concerned that containsScalableVectorType() is recursive and we're now calling isScalableType in a bunch of places. This could be bad for deeply nested structs. On that subject, are we allowing or preventing struct of scalable vectors to be part of other structs? craig.topper: I'm slightly concerned that containsScalableVectorType() is recursive and we're now calling…
		return true;
		}
		return false;
		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Primitive 'Type' data		// Primitive 'Type' data
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

Type *Type::getVoidTy(LLVMContext &C) { return &C.pImpl->VoidTy; }		Type *Type::getVoidTy(LLVMContext &C) { return &C.pImpl->VoidTy; }
Type *Type::getLabelTy(LLVMContext &C) { return &C.pImpl->LabelTy; }		Type *Type::getLabelTy(LLVMContext &C) { return &C.pImpl->LabelTy; }
Type *Type::getHalfTy(LLVMContext &C) { return &C.pImpl->HalfTy; }		Type *Type::getHalfTy(LLVMContext &C) { return &C.pImpl->HalfTy; }
Type *Type::getBFloatTy(LLVMContext &C) { return &C.pImpl->BFloatTy; }		Type *Type::getBFloatTy(LLVMContext &C) { return &C.pImpl->BFloatTy; }
▲ Show 20 Lines • Show All 332 Lines • ▼ Show 20 Lines	if (isOpaque())
return false;		return false;

if (Visited && !Visited->insert(const_cast<StructType*>(this)).second)		if (Visited && !Visited->insert(const_cast<StructType*>(this)).second)
return false;		return false;

// Okay, our struct is sized if all of the elements are, but if one of the		// Okay, our struct is sized if all of the elements are, but if one of the
// elements is opaque, the struct isn't sized yet, but may become sized in		// elements is opaque, the struct isn't sized yet, but may become sized in
// the future, so just bail out without caching.		// the future, so just bail out without caching.
		Type *FirstTy = getNumElements() > 0 ? elements()[0] : nullptr;
		bool IsFirstElementScalable = false;
		craig.topperUnsubmitted Done Reply Inline Actions One -> Element. craig.topper: One -> Element.
		if (FirstTy)
		IsFirstElementScalable = isa<ScalableVectorType>(FirstTy);
for (Type *Ty : elements()) {		for (Type *Ty : elements()) {
// If the struct contains a scalable vector type, don't consider it sized.		if (IsFirstElementScalable) {
// This prevents it from being used in loads/stores/allocas/GEPs.		// We do not permit mix scalar types with scalable types within struct.
		if (!isa<ScalableVectorType>(Ty))
		return false;
		// All the scalable types within struct should be the same.
		if (FirstTy != Ty)
		return false;
		} else {
if (isa<ScalableVectorType>(Ty))		if (isa<ScalableVectorType>(Ty))
return false;		return false;
		}
if (!Ty->isSized(Visited))		if (!Ty->isSized(Visited))
return false;		return false;
}		}

// Here we cheat a bit and cast away const-ness. The goal is to memoize when		// Here we cheat a bit and cast away const-ness. The goal is to memoize when
// we find a sized type, as types can only move from opaque to sized, not the		// we find a sized type, as types can only move from opaque to sized, not the
// other way.		// other way.
const_cast<StructType*>(this)->setSubclassData(		const_cast<StructType*>(this)->setSubclassData(
▲ Show 20 Lines • Show All 183 Lines • Show Last 20 Lines

llvm/lib/Target/AMDGPU/AMDGPUISelLowering.cpp

Show First 20 Lines • Show All 1,036 Lines • ▼ Show 20 Lines	for (const Argument &Arg : Fn.args()) {
// We're basically throwing away everything passed into us and starting over		// We're basically throwing away everything passed into us and starting over
// to get accurate in-memory offsets. The "PartOffset" is completely useless		// to get accurate in-memory offsets. The "PartOffset" is completely useless
// to us as computed in Ins.		// to us as computed in Ins.
//		//
// We also need to figure out what type legalization is trying to do to get		// We also need to figure out what type legalization is trying to do to get
// the correct memory offsets.		// the correct memory offsets.

SmallVector<EVT, 16> ValueVTs;		SmallVector<EVT, 16> ValueVTs;
SmallVector<uint64_t, 16> Offsets;		SmallVector<TypeSize, 16> Offsets;
ComputeValueVTs(*this, DL, BaseArgTy, ValueVTs, &Offsets, ArgOffset);		ComputeValueVTs(*this, DL, BaseArgTy, ValueVTs, &Offsets,
		TypeSize::Fixed(ArgOffset));

for (unsigned Value = 0, NumValues = ValueVTs.size();		for (unsigned Value = 0, NumValues = ValueVTs.size();
Value != NumValues; ++Value) {		Value != NumValues; ++Value) {
uint64_t BasePartOffset = Offsets[Value];		uint64_t BasePartOffset = Offsets[Value];

EVT ArgVT = ValueVTs[Value];		EVT ArgVT = ValueVTs[Value];
EVT MemVT = ArgVT;		EVT MemVT = ArgVT;
MVT RegisterVT = getRegisterTypeForCallingConv(Ctx, CC, ArgVT);		MVT RegisterVT = getRegisterTypeForCallingConv(Ctx, CC, ArgVT);
▲ Show 20 Lines • Show All 3,679 Lines • Show Last 20 Lines

llvm/lib/Target/NVPTX/NVPTXISelLowering.cpp

Show First 20 Lines • Show All 144 Lines • ▼ Show 20 Lines
/// NOTE: This is a band-aid for code that expects ComputeValueVTs to return the		/// NOTE: This is a band-aid for code that expects ComputeValueVTs to return the
/// same number of types as the Ins/Outs arrays in LowerFormalArguments,		/// same number of types as the Ins/Outs arrays in LowerFormalArguments,
/// LowerCall, and LowerReturn.		/// LowerCall, and LowerReturn.
static void ComputePTXValueVTs(const TargetLowering &TLI, const DataLayout &DL,		static void ComputePTXValueVTs(const TargetLowering &TLI, const DataLayout &DL,
Type *Ty, SmallVectorImpl<EVT> &ValueVTs,		Type *Ty, SmallVectorImpl<EVT> &ValueVTs,
SmallVectorImpl<uint64_t> *Offsets = nullptr,		SmallVectorImpl<uint64_t> *Offsets = nullptr,
uint64_t StartingOffset = 0) {		uint64_t StartingOffset = 0) {
SmallVector<EVT, 16> TempVTs;		SmallVector<EVT, 16> TempVTs;
SmallVector<uint64_t, 16> TempOffsets;		SmallVector<TypeSize, 16> TempOffsets;

// Special case for i128 - decompose to (i64, i64)		// Special case for i128 - decompose to (i64, i64)
if (Ty->isIntegerTy(128)) {		if (Ty->isIntegerTy(128)) {
ValueVTs.push_back(EVT(MVT::i64));		ValueVTs.push_back(EVT(MVT::i64));
ValueVTs.push_back(EVT(MVT::i64));		ValueVTs.push_back(EVT(MVT::i64));

if (Offsets) {		if (Offsets) {
Offsets->push_back(StartingOffset + 0);		Offsets->push_back(StartingOffset + 0);
Show All 10 Lines	if (StructType *STy = dyn_cast<StructType>(Ty)) {
for(auto *EI : STy->elements()) {		for(auto *EI : STy->elements()) {
ComputePTXValueVTs(TLI, DL, EI, ValueVTs, Offsets,		ComputePTXValueVTs(TLI, DL, EI, ValueVTs, Offsets,
StartingOffset + SL->getElementOffset(ElementNum));		StartingOffset + SL->getElementOffset(ElementNum));
++ElementNum;		++ElementNum;
}		}
return;		return;
}		}

ComputeValueVTs(TLI, DL, Ty, TempVTs, &TempOffsets, StartingOffset);		ComputeValueVTs(TLI, DL, Ty, TempVTs, &TempOffsets,
		TypeSize::Fixed(StartingOffset));
for (unsigned i = 0, e = TempVTs.size(); i != e; ++i) {		for (unsigned i = 0, e = TempVTs.size(); i != e; ++i) {
EVT VT = TempVTs[i];		EVT VT = TempVTs[i];
uint64_t Off = TempOffsets[i];		uint64_t Off = TempOffsets[i];
// Split vectors into individual elements, except for v2f16, which		// Split vectors into individual elements, except for v2f16, which
// we will pass as a single scalar.		// we will pass as a single scalar.
if (VT.isVector()) {		if (VT.isVector()) {
unsigned NumElts = VT.getVectorNumElements();		unsigned NumElts = VT.getVectorNumElements();
EVT EltVT = VT.getVectorElementType();		EVT EltVT = VT.getVectorElementType();
▲ Show 20 Lines • Show All 4,857 Lines • Show Last 20 Lines

llvm/lib/Target/X86/X86CallLowering.cpp

Show First 20 Lines • Show All 54 Lines • ▼ Show 20 Lines	bool X86CallLowering::splitToValueTypes(const ArgInfo &OrigArg,
SmallVectorImpl<ArgInfo> &SplitArgs,		SmallVectorImpl<ArgInfo> &SplitArgs,
const DataLayout &DL,		const DataLayout &DL,
MachineRegisterInfo &MRI,		MachineRegisterInfo &MRI,
SplitArgTy PerformArgSplit) const {		SplitArgTy PerformArgSplit) const {
const X86TargetLowering &TLI = *getTLI<X86TargetLowering>();		const X86TargetLowering &TLI = *getTLI<X86TargetLowering>();
LLVMContext &Context = OrigArg.Ty->getContext();		LLVMContext &Context = OrigArg.Ty->getContext();

SmallVector<EVT, 4> SplitVTs;		SmallVector<EVT, 4> SplitVTs;
SmallVector<uint64_t, 4> Offsets;		SmallVector<TypeSize, 4> Offsets;
ComputeValueVTs(TLI, DL, OrigArg.Ty, SplitVTs, &Offsets, 0);		ComputeValueVTs(TLI, DL, OrigArg.Ty, SplitVTs, &Offsets, TypeSize::Fixed(0));
assert(OrigArg.Regs.size() == 1 && "Can't handle multple regs yet");		assert(OrigArg.Regs.size() == 1 && "Can't handle multple regs yet");

if (OrigArg.Ty->isVoidTy())		if (OrigArg.Ty->isVoidTy())
return true;		return true;

EVT VT = SplitVTs[0];		EVT VT = SplitVTs[0];
unsigned NumParts = TLI.getNumRegisters(Context, VT);		unsigned NumParts = TLI.getNumRegisters(Context, VT);

▲ Show 20 Lines • Show All 429 Lines • Show Last 20 Lines

llvm/lib/Transforms/IPO/ArgumentPromotion.cpp

Show First 20 Lines • Show All 784 Lines • ▼ Show 20 Lines	bool ArgumentPromotionPass::isDenselyPacked(Type *type, const DataLayout &DL) {

// For array types, check for padding within members.		// For array types, check for padding within members.
if (ArrayType *seqTy = dyn_cast<ArrayType>(type))		if (ArrayType *seqTy = dyn_cast<ArrayType>(type))
return isDenselyPacked(seqTy->getElementType(), DL);		return isDenselyPacked(seqTy->getElementType(), DL);

if (!isa<StructType>(type))		if (!isa<StructType>(type))
return true;		return true;

		// Be conservative for scalable struct.
		if (type->isScalableType())
		return false;

// Check for padding within and between elements of a struct.		// Check for padding within and between elements of a struct.
StructType *StructTy = cast<StructType>(type);		StructType *StructTy = cast<StructType>(type);
const StructLayout *Layout = DL.getStructLayout(StructTy);		const StructLayout *Layout = DL.getStructLayout(StructTy);
uint64_t StartPos = 0;		uint64_t StartPos = 0;
for (unsigned i = 0, E = StructTy->getNumElements(); i < E; ++i) {		for (unsigned i = 0, E = StructTy->getNumElements(); i < E; ++i) {
Type *ElTy = StructTy->getElementType(i);		Type *ElTy = StructTy->getElementType(i);
if (!isDenselyPacked(ElTy, DL))		if (!isDenselyPacked(ElTy, DL))
return false;		return false;
▲ Show 20 Lines • Show All 368 Lines • Show Last 20 Lines

llvm/lib/Transforms/IPO/AttributorAttributes.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 198 Lines • ▼ Show 20 Lines	if (Offset) {
// Add 0 index to look through the pointer.		// Add 0 index to look through the pointer.
assert((uint64_t)Offset < DL.getTypeAllocSize(PtrElemTy) &&		assert((uint64_t)Offset < DL.getTypeAllocSize(PtrElemTy) &&
"Offset out of bounds");		"Offset out of bounds");
Indices.push_back(Constant::getNullValue(IRB.getInt32Ty()));		Indices.push_back(Constant::getNullValue(IRB.getInt32Ty()));

Type *Ty = PtrElemTy;		Type *Ty = PtrElemTy;
do {		do {
auto *STy = dyn_cast<StructType>(Ty);		auto *STy = dyn_cast<StructType>(Ty);
if (!STy)		if (!STy \|\| STy->isScalableType())
// Non-aggregate type, we cast and make byte-wise progress now.		// Non-aggregate type, we cast and make byte-wise progress now.
break;		break;

const StructLayout *SL = DL.getStructLayout(STy);		const StructLayout *SL = DL.getStructLayout(STy);
if (int64_t(SL->getSizeInBytes()) < Offset)		if (int64_t(SL->getSizeInBytes()) < Offset)
break;		break;

uint64_t Idx = SL->getElementContainingOffset(Offset);		uint64_t Idx = SL->getElementContainingOffset(Offset);
▲ Show 20 Lines • Show All 8,059 Lines • Show Last 20 Lines

llvm/lib/Transforms/InstCombine/InstCombineLoadStoreAlloca.cpp

Show First 20 Lines • Show All 635 Lines • ▼ Show 20 Lines	if (NumElements == 1) {
".unpack");		".unpack");
AAMDNodes AAMD;		AAMDNodes AAMD;
LI.getAAMetadata(AAMD);		LI.getAAMetadata(AAMD);
NewLoad->setAAMetadata(AAMD);		NewLoad->setAAMetadata(AAMD);
return IC.replaceInstUsesWith(LI, IC.Builder.CreateInsertValue(		return IC.replaceInstUsesWith(LI, IC.Builder.CreateInsertValue(
UndefValue::get(T), NewLoad, 0, Name));		UndefValue::get(T), NewLoad, 0, Name));
}		}

		if (ST->isScalableType())
		return nullptr;

// We don't want to break loads with padding here as we'd loose		// We don't want to break loads with padding here as we'd loose
// the knowledge that padding exists for the rest of the pipeline.		// the knowledge that padding exists for the rest of the pipeline.
const DataLayout &DL = IC.getDataLayout();		const DataLayout &DL = IC.getDataLayout();
auto *SL = DL.getStructLayout(ST);		auto *SL = DL.getStructLayout(ST);
if (SL->hasPadding())		if (SL->hasPadding())
return nullptr;		return nullptr;

const auto Align = LI.getAlign();		const auto Align = LI.getAlign();
▲ Show 20 Lines • Show All 509 Lines • ▼ Show 20 Lines	if (auto *ST = dyn_cast<StructType>(T)) {
// If the struct only have one element, we unpack.		// If the struct only have one element, we unpack.
unsigned Count = ST->getNumElements();		unsigned Count = ST->getNumElements();
if (Count == 1) {		if (Count == 1) {
V = IC.Builder.CreateExtractValue(V, 0);		V = IC.Builder.CreateExtractValue(V, 0);
combineStoreToNewValue(IC, SI, V);		combineStoreToNewValue(IC, SI, V);
return true;		return true;
}		}

		if (ST->isScalableType())
		return false;

// We don't want to break loads with padding here as we'd loose		// We don't want to break loads with padding here as we'd loose
// the knowledge that padding exists for the rest of the pipeline.		// the knowledge that padding exists for the rest of the pipeline.
const DataLayout &DL = IC.getDataLayout();		const DataLayout &DL = IC.getDataLayout();
auto *SL = DL.getStructLayout(ST);		auto *SL = DL.getStructLayout(ST);
if (SL->hasPadding())		if (SL->hasPadding())
return false;		return false;

const auto Align = SI.getAlign();		const auto Align = SI.getAlign();
▲ Show 20 Lines • Show All 392 Lines • Show Last 20 Lines

llvm/lib/Transforms/InstCombine/InstructionCombining.cpp

Show First 20 Lines • Show All 1,224 Lines • ▼ Show 20 Lines
/// Given a pointer type and a constant offset, determine whether or not there		/// Given a pointer type and a constant offset, determine whether or not there
/// is a sequence of GEP indices into the pointed type that will land us at the		/// is a sequence of GEP indices into the pointed type that will land us at the
/// specified offset. If so, fill them into NewIndices and return the resultant		/// specified offset. If so, fill them into NewIndices and return the resultant
/// element type, otherwise return null.		/// element type, otherwise return null.
Type *		Type *
InstCombinerImpl::FindElementAtOffset(PointerType *PtrTy, int64_t Offset,		InstCombinerImpl::FindElementAtOffset(PointerType *PtrTy, int64_t Offset,
SmallVectorImpl<Value *> &NewIndices) {		SmallVectorImpl<Value *> &NewIndices) {
Type *Ty = PtrTy->getElementType();		Type *Ty = PtrTy->getElementType();
if (!Ty->isSized())		if (!Ty->isSized() \|\| Ty->isScalableType())
return nullptr;		return nullptr;

// Start with the index over the outer type. Note that the type size		// Start with the index over the outer type. Note that the type size
// might be zero (even if the offset isn't zero) if the indexed type		// might be zero (even if the offset isn't zero) if the indexed type
// is something like [0 x {int, int}]		// is something like [0 x {int, int}]
Type *IndexTy = DL.getIndexType(PtrTy);		Type *IndexTy = DL.getIndexType(PtrTy);
int64_t FirstIdx = 0;		int64_t FirstIdx = 0;
if (int64_t TySize = DL.getTypeAllocSize(Ty)) {		if (int64_t TySize = DL.getTypeAllocSize(Ty)) {
▲ Show 20 Lines • Show All 1,038 Lines • ▼ Show 20 Lines	if (HasZeroPointerIndex) {
// V and GEP are both pointer types --> BitCast		// V and GEP are both pointer types --> BitCast
return CastInst::CreatePointerBitCastOrAddrSpaceCast(NewGEP, GEPType);		return CastInst::CreatePointerBitCastOrAddrSpaceCast(NewGEP, GEPType);
}		}

// Transform things like:		// Transform things like:
// %V = mul i64 %N, 4		// %V = mul i64 %N, 4
// %t = getelementptr i8* bitcast (i32* %arr to i8*), i32 %V		// %t = getelementptr i8* bitcast (i32* %arr to i8*), i32 %V
// into: %t1 = getelementptr i32* %arr, i32 %N; bitcast		// into: %t1 = getelementptr i32* %arr, i32 %N; bitcast
if (GEPEltType->isSized() && StrippedPtrEltTy->isSized()) {		if (GEPEltType->isSized() && StrippedPtrEltTy->isSized() &&
		!StrippedPtrEltTy->isScalableType()) {
// Check that changing the type amounts to dividing the index by a scale		// Check that changing the type amounts to dividing the index by a scale
// factor.		// factor.
uint64_t ResSize = DL.getTypeAllocSize(GEPEltType).getFixedSize();		uint64_t ResSize = DL.getTypeAllocSize(GEPEltType).getFixedSize();
uint64_t SrcSize = DL.getTypeAllocSize(StrippedPtrEltTy).getFixedSize();		uint64_t SrcSize = DL.getTypeAllocSize(StrippedPtrEltTy).getFixedSize();
if (ResSize && SrcSize % ResSize == 0) {		if (ResSize && SrcSize % ResSize == 0) {
Value *Idx = GEP.getOperand(1);		Value *Idx = GEP.getOperand(1);
unsigned BitWidth = Idx->getType()->getPrimitiveSizeInBits();		unsigned BitWidth = Idx->getType()->getPrimitiveSizeInBits();
uint64_t Scale = SrcSize / ResSize;		uint64_t Scale = SrcSize / ResSize;
▲ Show 20 Lines • Show All 1,859 Lines • Show Last 20 Lines

llvm/lib/Transforms/Scalar/SROA.cpp

Show First 20 Lines • Show All 1,507 Lines • ▼ Show 20 Lines	if (ArrayType *ArrTy = dyn_cast<ArrayType>(Ty)) {

Offset -= NumSkippedElements * ElementSize;		Offset -= NumSkippedElements * ElementSize;
Indices.push_back(IRB.getInt(NumSkippedElements));		Indices.push_back(IRB.getInt(NumSkippedElements));
return getNaturalGEPRecursively(IRB, DL, Ptr, ElementTy, Offset, TargetTy,		return getNaturalGEPRecursively(IRB, DL, Ptr, ElementTy, Offset, TargetTy,
Indices, NamePrefix);		Indices, NamePrefix);
}		}

StructType *STy = dyn_cast<StructType>(Ty);		StructType *STy = dyn_cast<StructType>(Ty);
if (!STy)		if (!STy \|\| STy->isScalableType())
return nullptr;		return nullptr;

const StructLayout *SL = DL.getStructLayout(STy);		const StructLayout *SL = DL.getStructLayout(STy);
uint64_t StructOffset = Offset.getZExtValue();		uint64_t StructOffset = Offset.getZExtValue();
if (StructOffset >= SL->getSizeInBytes())		if (StructOffset >= SL->getSizeInBytes())
return nullptr;		return nullptr;
unsigned Index = SL->getElementContainingOffset(StructOffset);		unsigned Index = SL->getElementContainingOffset(StructOffset);
Offset -= APInt(Offset.getBitWidth(), SL->getElementOffset(Index));		Offset -= APInt(Offset.getBitWidth(), SL->getElementOffset(Index));
▲ Show 20 Lines • Show All 2,089 Lines • ▼ Show 20 Lines
///		///
/// This removes no-op aggregate types wrapping an underlying type. It will		/// This removes no-op aggregate types wrapping an underlying type. It will
/// strip as many layers of types as it can without changing either the type		/// strip as many layers of types as it can without changing either the type
/// size or the allocated size.		/// size or the allocated size.
static Type stripAggregateTypeWrapping(const DataLayout &DL, Type Ty) {		static Type stripAggregateTypeWrapping(const DataLayout &DL, Type Ty) {
if (Ty->isSingleValueType())		if (Ty->isSingleValueType())
return Ty;		return Ty;

uint64_t AllocSize = DL.getTypeAllocSize(Ty).getFixedSize();		TypeSize AllocSize = DL.getTypeAllocSize(Ty);
uint64_t TypeSize = DL.getTypeSizeInBits(Ty).getFixedSize();		TypeSize TypeSize = DL.getTypeSizeInBits(Ty);

Type *InnerTy;		Type *InnerTy;
if (ArrayType *ArrTy = dyn_cast<ArrayType>(Ty)) {		if (ArrayType *ArrTy = dyn_cast<ArrayType>(Ty)) {
InnerTy = ArrTy->getElementType();		InnerTy = ArrTy->getElementType();
} else if (StructType *STy = dyn_cast<StructType>(Ty)) {		} else if (StructType *STy = dyn_cast<StructType>(Ty)) {
const StructLayout *SL = DL.getStructLayout(STy);		const StructLayout *SL = DL.getStructLayout(STy);
unsigned Index = SL->getElementContainingOffset(0);		unsigned Index = SL->getElementContainingOffset(0);
InnerTy = STy->getElementType(Index);		InnerTy = STy->getElementType(Index);
} else {		} else {
return Ty;		return Ty;
}		}

if (AllocSize > DL.getTypeAllocSize(InnerTy).getFixedSize() \|\|		if (TypeSize::isKnownGT(AllocSize, DL.getTypeAllocSize(InnerTy)) \|\|
TypeSize > DL.getTypeSizeInBits(InnerTy).getFixedSize())		TypeSize::isKnownGT(TypeSize, DL.getTypeSizeInBits(InnerTy)))
return Ty;		return Ty;

return stripAggregateTypeWrapping(DL, InnerTy);		return stripAggregateTypeWrapping(DL, InnerTy);
}		}

/// Try to find a partition of the aggregate type passed in for a given		/// Try to find a partition of the aggregate type passed in for a given
/// offset and size.		/// offset and size.
///		///
▲ Show 20 Lines • Show All 49 Lines • ▼ Show 20 Lines	if (isa<ArrayType>(Ty) \|\| isa<VectorType>(Ty)) {
assert(Size > ElementSize);		assert(Size > ElementSize);
uint64_t NumElements = Size / ElementSize;		uint64_t NumElements = Size / ElementSize;
if (NumElements * ElementSize != Size)		if (NumElements * ElementSize != Size)
return nullptr;		return nullptr;
return ArrayType::get(ElementTy, NumElements);		return ArrayType::get(ElementTy, NumElements);
}		}

StructType *STy = dyn_cast<StructType>(Ty);		StructType *STy = dyn_cast<StructType>(Ty);
if (!STy)		if (!STy \|\| STy->isScalableType())
return nullptr;		return nullptr;

const StructLayout *SL = DL.getStructLayout(STy);		const StructLayout *SL = DL.getStructLayout(STy);
if (Offset >= SL->getSizeInBytes())		if (Offset >= SL->getSizeInBytes())
return nullptr;		return nullptr;
uint64_t EndOffset = Offset + Size;		uint64_t EndOffset = Offset + Size;
if (EndOffset > SL->getSizeInBytes())		if (EndOffset > SL->getSizeInBytes())
return nullptr;		return nullptr;
▲ Show 20 Lines • Show All 1,125 Lines • Show Last 20 Lines

llvm/lib/Transforms/Utils/ScalarEvolutionExpander.cpp

Show First 20 Lines • Show All 490 Lines • ▼ Show 20 Lines	for (;;) {
while (StructType *STy = dyn_cast<StructType>(ElTy)) {		while (StructType *STy = dyn_cast<StructType>(ElTy)) {
bool FoundFieldNo = false;		bool FoundFieldNo = false;
// An empty struct has no fields.		// An empty struct has no fields.
if (STy->getNumElements() == 0) break;		if (STy->getNumElements() == 0) break;
// Field offsets are known. See if a constant offset falls within any of		// Field offsets are known. See if a constant offset falls within any of
// the struct fields.		// the struct fields.
if (Ops.empty())		if (Ops.empty())
break;		break;
		// Be conservative for scalable struct.
		if (STy->isScalableType())
		break;
if (const SCEVConstant *C = dyn_cast<SCEVConstant>(Ops[0]))		if (const SCEVConstant *C = dyn_cast<SCEVConstant>(Ops[0]))
if (SE.getTypeSizeInBits(C->getType()) <= 64) {		if (SE.getTypeSizeInBits(C->getType()) <= 64) {
const StructLayout &SL = *DL.getStructLayout(STy);		const StructLayout &SL = *DL.getStructLayout(STy);
uint64_t FullOffset = C->getValue()->getZExtValue();		uint64_t FullOffset = C->getValue()->getZExtValue();
if (FullOffset < SL.getSizeInBytes()) {		if (FullOffset < SL.getSizeInBytes()) {
unsigned ElIdx = SL.getElementContainingOffset(FullOffset);		unsigned ElIdx = SL.getElementContainingOffset(FullOffset);
GepIndices.push_back(		GepIndices.push_back(
ConstantInt::get(Type::getInt32Ty(Ty->getContext()), ElIdx));		ConstantInt::get(Type::getInt32Ty(Ty->getContext()), ElIdx));
▲ Show 20 Lines • Show All 2,235 Lines • Show Last 20 Lines

llvm/test/CodeGen/RISCV/rvv/load-store-scalable-struct.ll

This file was added.

				; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
				; RUN: llc -mtriple=riscv64 -mattr=+m,+d,+experimental-v -verify-machineinstrs \
				; RUN: --riscv-no-aliases < %s \| FileCheck %s

				target triple = "riscv64-unknown-unknown-elf"

				%struct.test = type { <vscale x 1 x double>, <vscale x 1 x double> }

				define <vscale x 1 x double> @test(%struct.test* %addr, i64 %vl) {
				; CHECK-LABEL: test:
				; CHECK: # %bb.0: # %entry
				; CHECK-NEXT: addi sp, sp, -16
				; CHECK-NEXT: .cfi_def_cfa_offset 16
				; CHECK-NEXT: csrrs a2, vlenb, zero
				; CHECK-NEXT: slli a2, a2, 1
				; CHECK-NEXT: sub sp, sp, a2
				; CHECK-NEXT: csrrs a2, vlenb, zero
				; CHECK-NEXT: add a3, a0, a2
				; CHECK-NEXT: vl1re64.v v25, (a3)
				; CHECK-NEXT: vl1re64.v v26, (a0)
				; CHECK-NEXT: addi a0, sp, 16
				; CHECK-NEXT: add a0, a0, a2
				; CHECK-NEXT: vs1r.v v25, (a0)
				; CHECK-NEXT: addi a2, sp, 16
				; CHECK-NEXT: vs1r.v v26, (a2)
				; CHECK-NEXT: vl1re64.v v25, (a0)
				; CHECK-NEXT: addi a0, sp, 16
				; CHECK-NEXT: vl1re64.v v26, (a0)
				; CHECK-NEXT: vsetvli a0, a1, e64,m1,ta,mu
				; CHECK-NEXT: vfadd.vv v8, v26, v25
				; CHECK-NEXT: csrrs a0, vlenb, zero
				; CHECK-NEXT: slli a0, a0, 1
				; CHECK-NEXT: add sp, sp, a0
				; CHECK-NEXT: addi sp, sp, 16
				; CHECK-NEXT: jalr zero, 0(ra)
				entry:
				%ret = alloca %struct.test, align 8
				%val = load %struct.test, %struct.test* %addr
				store %struct.test %val, %struct.test* %ret, align 8
				%0 = load %struct.test, %struct.test* %ret, align 8
				%1 = extractvalue %struct.test %0, 0
				%2 = extractvalue %struct.test %0, 1
				%3 = call <vscale x 1 x double> @llvm.riscv.vfadd.nxv1f64.nxv1f64(
				<vscale x 1 x double> %1, <vscale x 1 x double> %2, i64 %vl)
				ret <vscale x 1 x double> %3
				}

				declare <vscale x 1 x double> @llvm.riscv.vfadd.nxv1f64.nxv1f64(
				<vscale x 1 x double>,
				<vscale x 1 x double>,
				i64);

llvm/test/Other/load-scalable-vector-struct.ll

This file was added.

				; RUN: opt -S -verify < %s 2>&1 \| FileCheck %s

				%struct.test = type { <vscale x 1 x i32>, <vscale x 1 x i32> }

				define <vscale x 1 x i32> @load(%struct.test* %x) {
				; CHECK: %a = load %struct.test, %struct.test* %x, align 4
				; CHECK: %b = extractvalue %struct.test %a, 1
				; CHECK: ret <vscale x 1 x i32> %b
				%a = load %struct.test, %struct.test* %x
				%b = extractvalue %struct.test %a, 1
				ret <vscale x 1 x i32> %b
				}

llvm/test/Other/store-scalable-vector-struct.ll

This file was added.

				; RUN: opt -S -verify < %s 2>&1 \| FileCheck %s

				%struct.test = type { <vscale x 1 x i32>, <vscale x 1 x i32> }

				define void @store(%struct.test* %x, <vscale x 1 x i32> %y, <vscale x 1 x i32> %z) {
				; CHECK: %a = insertvalue %struct.test undef, <vscale x 1 x i32> %y, 0
				; CHECK: %b = insertvalue %struct.test %a, <vscale x 1 x i32> %z, 1
				; CHECK: store %struct.test %b, %struct.test* %x
				; CHECK: ret void
				%a = insertvalue %struct.test undef, <vscale x 1 x i32> %y, 0
				%b = insertvalue %struct.test %a, <vscale x 1 x i32> %z, 1
				store %struct.test %b, %struct.test* %x
				ret void
				}

This is an archive of the discontinued LLVM Phabricator instance.

[IR] Permit load/store/alloca for struct with the same scalable vectors.AbandonedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 336347

llvm/include/llvm/CodeGen/Analysis.h

llvm/include/llvm/IR/DataLayout.h

llvm/include/llvm/IR/Instructions.h

llvm/include/llvm/IR/Type.h

llvm/lib/Analysis/MemoryBuiltins.cpp

llvm/lib/Analysis/ScalarEvolution.cpp

llvm/lib/CodeGen/Analysis.cpp

llvm/lib/CodeGen/GlobalISel/CallLowering.cpp

llvm/lib/CodeGen/SelectionDAG/FunctionLoweringInfo.cpp

llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

llvm/lib/IR/DataLayout.cpp

llvm/lib/IR/Type.cpp

llvm/lib/Target/AMDGPU/AMDGPUISelLowering.cpp

llvm/lib/Target/NVPTX/NVPTXISelLowering.cpp

llvm/lib/Target/X86/X86CallLowering.cpp

llvm/lib/Transforms/IPO/ArgumentPromotion.cpp

llvm/lib/Transforms/IPO/AttributorAttributes.cpp

llvm/lib/Transforms/InstCombine/InstCombineLoadStoreAlloca.cpp

llvm/lib/Transforms/InstCombine/InstructionCombining.cpp

llvm/lib/Transforms/Scalar/SROA.cpp

llvm/lib/Transforms/Utils/ScalarEvolutionExpander.cpp

llvm/test/CodeGen/RISCV/rvv/load-store-scalable-struct.ll

llvm/test/Other/load-scalable-vector-struct.ll

llvm/test/Other/store-scalable-vector-struct.ll

[IR] Permit load/store/alloca for struct with the same scalable vectors.
AbandonedPublic