This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/IR/
-
llvm/
-
IR/
2/2
DataLayout.h
-
lib/
-
Analysis/
2/4
ConstantFolding.cpp
-
IR/
1/1
DataLayout.cpp
-
Transforms/
-
InstCombine/
-
InstructionCombining.cpp
-
Scalar/
-
SROA.cpp
-
test/Transforms/
-
Transforms/
-
InstCombine/
-
getelementptr.ll
-
SROA/
-
scalable-vectors.ll

Differential D110043

[IR] Add helper to convert offset to GEP indices
ClosedPublic

Authored by nikic on Sep 19 2021, 1:01 PM.

Download Raw Diff

Details

Reviewers

aeubanks

Commits

rGdd0226561e86: [IR] Add helper to convert offset to GEP indices

Summary

We implement logic to convert a byte offset into a sequence of GEP indices for that offset in a number of places. This patch adds a DataLayout::getGEPIndicesForOffset() method, which implements the core logic. I've updated SROA, ConstantFolding and InstCombine to use it, and there's a few more places where it looks relevant.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

nikic created this revision.Sep 19 2021, 1:01 PM

Herald added subscribers: dexonsmith, hiraditya. · View Herald TranscriptSep 19 2021, 1:01 PM

nikic requested review of this revision.Sep 19 2021, 1:01 PM

Herald added a project: Restricted Project. · View Herald TranscriptSep 19 2021, 1:01 PM

Herald added a subscriber: llvm-commits. · View Herald Transcript

Harbormaster completed remote builds in B124587: Diff 373474.Sep 19 2021, 1:54 PM

aeubanks added inline comments.Sep 19 2021, 10:56 PM

llvm/include/llvm/IR/DataLayout.h
583	residual
584	`getGEPIndicesForOffset`
llvm/lib/Analysis/ConstantFolding.cpp
1008	what do you mean by this?
1021	clang-format?
llvm/lib/IR/DataLayout.cpp
913	`assert(Offset.isPositive())`? or can this be negative?

Address comments.

nikic added inline comments.Sep 20 2021, 9:59 AM

llvm/lib/Analysis/ConstantFolding.cpp
1008	We add extra zero indices in the hope that can return `ResType` without inserting a bitcast. But if that's not actually possible, we'll just add the maximum number of zeros and insert the bitcast anyway.

aeubanks accepted this revision.Sep 20 2021, 10:33 AM

aeubanks added inline comments.

llvm/lib/Analysis/ConstantFolding.cpp
1008	ah I didn't see the bitcast below

This revision is now accepted and ready to land.Sep 20 2021, 10:33 AM

Harbormaster completed remote builds in B124694: Diff 373632.Sep 20 2021, 10:53 AM

This revision was landed with ongoing or failed builds.Sep 20 2021, 11:20 AM

Closed by commit rGdd0226561e86: [IR] Add helper to convert offset to GEP indices (authored by nikic). · Explain Why

This revision was automatically updated to reflect the committed changes.

nikic added a commit: rGdd0226561e86: [IR] Add helper to convert offset to GEP indices.

Hi @nikic,

This (admittedly silly) example starts hitting an assertion with this patch:

opt -passes='globalopt' -S -o - very_large_offset.ll

It fails with:

opt: ../lib/IR/DataLayout.cpp:913: void addElementIndex(SmallVectorImpl<llvm::APInt> &, llvm::TypeSize, llvm::APInt &): Assertion `Offset.isNonNegative() && "Remaining offset shouldn't be negative"' failed.

The input contains a huge (larger than half the address space) array @s and also a Gep with a huge offset into that object so possibly this is all UB but I suppose in a very nice world the compiler shouldn't crash with an assertion anyway since I guess the input is legal?

very_large_offset.ll470 BDownload

In D110043#3019942, @uabelho wrote:
Hi @nikic,

This (admittedly silly) example starts hitting an assertion with this patch:
opt -passes='globalopt' -S -o - very_large_offset.ll
It fails with:
opt: ../lib/IR/DataLayout.cpp:913: void addElementIndex(SmallVectorImpl<llvm::APInt> &, llvm::TypeSize, llvm::APInt &): Assertion `Offset.isNonNegative() && "Remaining offset shouldn't be negative"' failed.
The input contains a huge (larger than half the address space) array @s and also a Gep with a huge offset into that object so possibly this is all UB but I suppose in a very nice world the compiler shouldn't crash with an assertion anyway since I guess the input is legal?

very_large_offset.ll470 BDownload

Seems like ElemSize is greater than 2^32 and we're getting some weird math in addElementIndex().

nikic mentioned this in D110437: [IR] Handle large element size when calculating GEP indices.Sep 24 2021, 11:48 AM

nikic mentioned this in rG5969e5743ae9: [IR] Handle large element size when calculating GEP indices.Sep 24 2021, 1:20 PM

Revision Contents

Path

Size

llvm/

include/

llvm/

IR/

DataLayout.h

4 lines

lib/

Analysis/

ConstantFolding.cpp

86 lines

IR/

DataLayout.cpp

62 lines

Transforms/

InstCombine/

InstructionCombining.cpp

56 lines

Scalar/

SROA.cpp

89 lines

test/

Transforms/

InstCombine/

getelementptr.ll

2 lines

SROA/

scalable-vectors.ll

7 lines

Diff 373664

llvm/include/llvm/IR/DataLayout.h

Show First 20 Lines • Show All 573 Lines • ▼ Show 20 Lines	public:

/// Returns the offset from the beginning of the type for the specified		/// Returns the offset from the beginning of the type for the specified
/// indices.		/// indices.
///		///
/// Note that this takes the element type, not the pointer type.		/// Note that this takes the element type, not the pointer type.
/// This is used to implement getelementptr.		/// This is used to implement getelementptr.
int64_t getIndexedOffsetInType(Type ElemTy, ArrayRef<Value > Indices) const;		int64_t getIndexedOffsetInType(Type ElemTy, ArrayRef<Value > Indices) const;

		/// Get GEP indices to access Offset inside ElemTy. ElemTy is updated to be
		/// the result element type and Offset to be the residual offset.
		aeubanksUnsubmitted Done Reply Inline Actions residual aeubanks: residual
		SmallVector<APInt> getGEPIndicesForOffset(Type *&ElemTy, APInt &Offset) const;
		aeubanksUnsubmitted Done Reply Inline Actions `getGEPIndicesForOffset` aeubanks: `getGEPIndicesForOffset`

/// Returns a StructLayout object, indicating the alignment of the		/// Returns a StructLayout object, indicating the alignment of the
/// struct, its size, and the offsets of its fields.		/// struct, its size, and the offsets of its fields.
///		///
/// Note that this information is lazily cached.		/// Note that this information is lazily cached.
const StructLayout getStructLayout(StructType Ty) const;		const StructLayout getStructLayout(StructType Ty) const;

/// Returns the preferred alignment of the specified global.		/// Returns the preferred alignment of the specified global.
///		///
▲ Show 20 Lines • Show All 116 Lines • Show Last 20 Lines

llvm/lib/Analysis/ConstantFolding.cpp

Show First 20 Lines • Show All 979 Lines • ▼ Show 20 Lines	if ((Ptr->isNullValue() \|\| BasePtr != 0) &&
Constant *C = ConstantInt::get(Ptr->getContext(), Offset + BasePtr);		Constant *C = ConstantInt::get(Ptr->getContext(), Offset + BasePtr);
return ConstantExpr::getIntToPtr(C, ResTy);		return ConstantExpr::getIntToPtr(C, ResTy);
}		}

// Otherwise form a regular getelementptr. Recompute the indices so that		// Otherwise form a regular getelementptr. Recompute the indices so that
// we eliminate over-indexing of the notional static type array bounds.		// we eliminate over-indexing of the notional static type array bounds.
// This makes it easy to determine if the getelementptr is "inbounds".		// This makes it easy to determine if the getelementptr is "inbounds".
// Also, this helps GlobalOpt do SROA on GlobalVariables.		// Also, this helps GlobalOpt do SROA on GlobalVariables.
SmallVector<Constant *, 32> NewIdxs;
Type *Ty = PTy;

// For GEPs of GlobalValues, use the value type even for opaque pointers.		// For GEPs of GlobalValues, use the value type even for opaque pointers.
// Otherwise use an i8 GEP.		// Otherwise use an i8 GEP.
if (auto *GV = dyn_cast<GlobalValue>(Ptr))		if (auto *GV = dyn_cast<GlobalValue>(Ptr))
SrcElemTy = GV->getValueType();		SrcElemTy = GV->getValueType();
else if (!PTy->isOpaque())		else if (!PTy->isOpaque())
SrcElemTy = PTy->getElementType();		SrcElemTy = PTy->getElementType();
else		else
SrcElemTy = Type::getInt8Ty(Ptr->getContext());		SrcElemTy = Type::getInt8Ty(Ptr->getContext());

do {		if (!SrcElemTy->isSized())
if (!Ty->isStructTy()) {		return nullptr;
if (Ty->isPointerTy()) {
// The only pointer indexing we'll do is on the first index of the GEP.
if (!NewIdxs.empty())
break;

Ty = SrcElemTy;

// Only handle pointers to sized types, not pointers to functions.		Type *ElemTy = SrcElemTy;
if (!Ty->isSized())		SmallVector<APInt> Indices = DL.getGEPIndicesForOffset(ElemTy, Offset);
		if (Offset != 0)
return nullptr;		return nullptr;
} else {
Type *NextTy = GetElementPtrInst::getTypeAtIndex(Ty, (uint64_t)0);		// Try to add additional zero indices to reach the desired result element
		// type.
		// TODO: Should we avoid extra zero indices if ResElemTy can't be reached and
		aeubanksUnsubmitted Not Done Reply Inline Actions what do you mean by this? aeubanks: what do you mean by this?
		nikicAuthorUnsubmitted Done Reply Inline Actions We add extra zero indices in the hope that can return `ResType` without inserting a bitcast. But if that's not actually possible, we'll just add the maximum number of zeros and insert the bitcast anyway. nikic: We add extra zero indices in the hope that can return `ResType` without inserting a bitcast.
		aeubanksUnsubmitted Not Done Reply Inline Actions ah I didn't see the bitcast below aeubanks: ah I didn't see the bitcast below
		// we'll have to insert a bitcast anyway?
		while (ElemTy != ResElemTy) {
		Type *NextTy = GetElementPtrInst::getTypeAtIndex(ElemTy, (uint64_t)0);
if (!NextTy)		if (!NextTy)
break;		break;
Ty = NextTy;
}

// Determine which element of the array the offset points into.		Indices.push_back(APInt::getZero(isa<StructType>(ElemTy) ? 32 : BitWidth));
APInt ElemSize(BitWidth, DL.getTypeAllocSize(Ty));		ElemTy = NextTy;
if (ElemSize == 0) {
// The element size is 0. This may be [0 x Ty]*, so just use a zero
// index for this level and proceed to the next level to see if it can
// accommodate the offset.
NewIdxs.push_back(ConstantInt::get(IntIdxTy, 0));
} else {
// The element size is non-zero divide the offset by the element
// size (rounding down), to compute the index at this level.
bool Overflow;
APInt NewIdx = Offset.sdiv_ov(ElemSize, Overflow);
if (Overflow)
break;
Offset -= NewIdx * ElemSize;
NewIdxs.push_back(ConstantInt::get(IntIdxTy, NewIdx));
}		}
} else {
auto *STy = cast<StructType>(Ty);		SmallVector<Constant *, 32> NewIdxs;
// If we end up with an offset that isn't valid for this struct type, we		for (const APInt &Index : Indices)
// can't re-form this GEP in a regular form, so bail out. The pointer		NewIdxs.push_back(ConstantInt::get(
		aeubanksUnsubmitted Done Reply Inline Actions clang-format? aeubanks: clang-format?
// operand likely went through casts that are necessary to make the GEP		Type::getIntNTy(Ptr->getContext(), Index.getBitWidth()), Index));
// sensible.
const StructLayout &SL = *DL.getStructLayout(STy);
if (Offset.isNegative() \|\| Offset.uge(SL.getSizeInBytes()))
break;

// Determine which field of the struct the offset points into. The
// getZExtValue is fine as we've already ensured that the offset is
// within the range representable by the StructLayout API.
unsigned ElIdx = SL.getElementContainingOffset(Offset.getZExtValue());
NewIdxs.push_back(ConstantInt::get(Type::getInt32Ty(Ty->getContext()),
ElIdx));
Offset -= APInt(BitWidth, SL.getElementOffset(ElIdx));
Ty = STy->getTypeAtIndex(ElIdx);
}
} while (Ty != ResElemTy);

// If we haven't used up the entire offset by descending the static
// type, then the offset is pointing into the middle of an indivisible
// member, so we can't simplify it.
if (Offset != 0)
return nullptr;

// Preserve the inrange index from the innermost GEP if possible. We must		// Preserve the inrange index from the innermost GEP if possible. We must
// have calculated the same indices up to and including the inrange index.		// have calculated the same indices up to and including the inrange index.
Optional<unsigned> InRangeIndex;		Optional<unsigned> InRangeIndex;
if (Optional<unsigned> LastIRIndex = InnermostGEP->getInRangeIndex())		if (Optional<unsigned> LastIRIndex = InnermostGEP->getInRangeIndex())
if (SrcElemTy == InnermostGEP->getSourceElementType() &&		if (SrcElemTy == InnermostGEP->getSourceElementType() &&
NewIdxs.size() > *LastIRIndex) {		NewIdxs.size() > *LastIRIndex) {
InRangeIndex = LastIRIndex;		InRangeIndex = LastIRIndex;
for (unsigned I = 0; I <= *LastIRIndex; ++I)		for (unsigned I = 0; I <= *LastIRIndex; ++I)
if (NewIdxs[I] != InnermostGEP->getOperand(I + 1))		if (NewIdxs[I] != InnermostGEP->getOperand(I + 1))
return nullptr;		return nullptr;
}		}

// Create a GEP.		// Create a GEP.
Constant *C = ConstantExpr::getGetElementPtr(SrcElemTy, Ptr, NewIdxs,		Constant *C = ConstantExpr::getGetElementPtr(SrcElemTy, Ptr, NewIdxs,
InBounds, InRangeIndex);		InBounds, InRangeIndex);
assert(cast<PointerType>(C->getType())->isOpaqueOrPointeeTypeMatches(Ty) &&		assert(
		cast<PointerType>(C->getType())->isOpaqueOrPointeeTypeMatches(ElemTy) &&
"Computed GetElementPtr has unexpected type!");		"Computed GetElementPtr has unexpected type!");

// If we ended up indexing a member with a type that doesn't match		// If we ended up indexing a member with a type that doesn't match
// the type of what the original indices indexed, add a cast.		// the type of what the original indices indexed, add a cast.
if (C->getType() != ResTy)		if (C->getType() != ResTy)
C = FoldBitCast(C, ResTy, DL);		C = FoldBitCast(C, ResTy, DL);

return C;		return C;
}		}
▲ Show 20 Lines • Show All 2,243 Lines • Show Last 20 Lines

llvm/lib/IR/DataLayout.cpp

Show First 20 Lines • Show All 890 Lines • ▼ Show 20 Lines	if (StructType *STy = GTI.getStructTypeOrNull()) {
if (int64_t arrayIdx = cast<ConstantInt>(Idx)->getSExtValue())		if (int64_t arrayIdx = cast<ConstantInt>(Idx)->getSExtValue())
Result += arrayIdx * getTypeAllocSize(GTI.getIndexedType());		Result += arrayIdx * getTypeAllocSize(GTI.getIndexedType());
}		}
}		}

return Result;		return Result;
}		}

		static void addElementIndex(SmallVectorImpl<APInt> &Indices, TypeSize ElemSize,
		APInt &Offset) {
		// Skip over scalable or zero size elements.
		if (ElemSize.isScalable() \|\| ElemSize == 0) {
		Indices.push_back(APInt::getZero(Offset.getBitWidth()));
		return;
		}

		APInt Index = Offset.sdiv(ElemSize);
		Offset -= Index * ElemSize;
		if (Offset.isNegative()) {
		// Prefer a positive remaining offset to allow struct indexing.
		--Index;
		Offset += ElemSize;
		assert(Offset.isNonNegative() && "Remaining offset shouldn't be negative");
		aeubanksUnsubmitted Done Reply Inline Actions `assert(Offset.isPositive())`? or can this be negative? aeubanks: `assert(Offset.isPositive())`? or can this be negative?
		}
		Indices.push_back(Index);
		}

		SmallVector<APInt> DataLayout::getGEPIndicesForOffset(Type *&ElemTy,
		APInt &Offset) const {
		assert(ElemTy->isSized() && "Element type must be sized");
		SmallVector<APInt> Indices;
		addElementIndex(Indices, getTypeAllocSize(ElemTy), Offset);
		while (Offset != 0) {
		if (auto *ArrTy = dyn_cast<ArrayType>(ElemTy)) {
		ElemTy = ArrTy->getElementType();
		addElementIndex(Indices, getTypeAllocSize(ElemTy), Offset);
		continue;
		}

		if (auto *VecTy = dyn_cast<VectorType>(ElemTy)) {
		ElemTy = VecTy->getElementType();
		unsigned ElemSizeInBits = getTypeSizeInBits(ElemTy).getFixedSize();
		// GEPs over non-multiple of 8 size vector elements are invalid.
		if (ElemSizeInBits % 8 != 0)
		break;

		addElementIndex(Indices, TypeSize::Fixed(ElemSizeInBits / 8), Offset);
		continue;
		}

		if (auto *STy = dyn_cast<StructType>(ElemTy)) {
		const StructLayout *SL = getStructLayout(STy);
		uint64_t IntOffset = Offset.getZExtValue();
		if (IntOffset >= SL->getSizeInBytes())
		break;

		unsigned Index = SL->getElementContainingOffset(IntOffset);
		Offset -= SL->getElementOffset(Index);
		ElemTy = STy->getElementType(Index);
		Indices.push_back(APInt(32, Index));
		continue;
		}

		// Can't index into non-aggregate type.
		break;
		}

		return Indices;
		}

/// getPreferredAlign - Return the preferred alignment of the specified global.		/// getPreferredAlign - Return the preferred alignment of the specified global.
/// This includes an explicitly requested alignment (if the global has one).		/// This includes an explicitly requested alignment (if the global has one).
Align DataLayout::getPreferredAlign(const GlobalVariable *GV) const {		Align DataLayout::getPreferredAlign(const GlobalVariable *GV) const {
MaybeAlign GVAlignment = GV->getAlign();		MaybeAlign GVAlignment = GV->getAlign();
// If a section is specified, always precisely honor explicit alignment,		// If a section is specified, always precisely honor explicit alignment,
// so we don't insert padding into a section we don't control.		// so we don't insert padding into a section we don't control.
if (GVAlignment && GV->hasSection())		if (GVAlignment && GV->hasSection())
return *GVAlignment;		return *GVAlignment;
Show All 29 Lines

llvm/lib/Transforms/InstCombine/InstructionCombining.cpp

Show First 20 Lines • Show All 1,263 Lines • ▼ Show 20 Lines	Instruction *InstCombinerImpl::foldBinOpIntoSelectOrPhi(BinaryOperator &I) {
return nullptr;		return nullptr;
}		}

/// Given a pointer type and a constant offset, determine whether or not there		/// Given a pointer type and a constant offset, determine whether or not there
/// is a sequence of GEP indices into the pointed type that will land us at the		/// is a sequence of GEP indices into the pointed type that will land us at the
/// specified offset. If so, fill them into NewIndices and return the resultant		/// specified offset. If so, fill them into NewIndices and return the resultant
/// element type, otherwise return null.		/// element type, otherwise return null.
Type *		Type *
InstCombinerImpl::FindElementAtOffset(PointerType *PtrTy, int64_t Offset,		InstCombinerImpl::FindElementAtOffset(PointerType *PtrTy, int64_t IntOffset,
SmallVectorImpl<Value *> &NewIndices) {		SmallVectorImpl<Value *> &NewIndices) {
Type *Ty = PtrTy->getElementType();		Type *Ty = PtrTy->getElementType();
if (!Ty->isSized())		if (!Ty->isSized())
return nullptr;		return nullptr;

// Start with the index over the outer type. Note that the type size		APInt Offset(DL.getIndexTypeSizeInBits(PtrTy), IntOffset);
// might be zero (even if the offset isn't zero) if the indexed type		SmallVector<APInt> Indices = DL.getGEPIndicesForOffset(Ty, Offset);
// is something like [0 x {int, int}]		if (!Offset.isZero())
Type *IndexTy = DL.getIndexType(PtrTy);
int64_t FirstIdx = 0;
if (int64_t TySize = DL.getTypeAllocSize(Ty)) {
FirstIdx = Offset/TySize;
Offset -= FirstIdx*TySize;

// Handle hosts where % returns negative instead of values [0..TySize).
if (Offset < 0) {
--FirstIdx;
Offset += TySize;
assert(Offset >= 0);
}
assert((uint64_t)Offset < (uint64_t)TySize && "Out of range offset");
}

NewIndices.push_back(ConstantInt::get(IndexTy, FirstIdx));

// Index into the types. If we fail, set OrigBase to null.
while (Offset) {
// Indexing into tail padding between struct/array elements.
if (uint64_t(Offset * 8) >= DL.getTypeSizeInBits(Ty))
return nullptr;

if (StructType *STy = dyn_cast<StructType>(Ty)) {
const StructLayout *SL = DL.getStructLayout(STy);
assert(Offset < (int64_t)SL->getSizeInBytes() &&
"Offset must stay within the indexed type");

unsigned Elt = SL->getElementContainingOffset(Offset);
NewIndices.push_back(ConstantInt::get(Type::getInt32Ty(Ty->getContext()),
Elt));

Offset -= SL->getElementOffset(Elt);
Ty = STy->getElementType(Elt);
} else if (ArrayType *AT = dyn_cast<ArrayType>(Ty)) {
uint64_t EltSize = DL.getTypeAllocSize(AT->getElementType());
assert(EltSize && "Cannot index into a zero-sized array");
NewIndices.push_back(ConstantInt::get(IndexTy,Offset/EltSize));
Offset %= EltSize;
Ty = AT->getElementType();
} else {
// Otherwise, we can't index into the middle of this atomic type, bail.
return nullptr;		return nullptr;
}
}

		for (const APInt &Index : Indices)
		NewIndices.push_back(Builder.getInt(Index));
return Ty;		return Ty;
}		}

static bool shouldMergeGEPs(GEPOperator &GEP, GEPOperator &Src) {		static bool shouldMergeGEPs(GEPOperator &GEP, GEPOperator &Src) {
// If this GEP has only 0 indices, it is the same pointer as		// If this GEP has only 0 indices, it is the same pointer as
// Src. If Src is not a trivial GEP too, don't combine		// Src. If Src is not a trivial GEP too, don't combine
// the indices.		// the indices.
if (GEP.hasAllZeroIndices() && !Src.hasAllZeroIndices() &&		if (GEP.hasAllZeroIndices() && !Src.hasAllZeroIndices() &&
▲ Show 20 Lines • Show All 2,963 Lines • Show Last 20 Lines

llvm/lib/Transforms/Scalar/SROA.cpp

Show First 20 Lines • Show All 1,477 Lines • ▼ Show 20 Lines	do {
++NumLayers;		++NumLayers;
} while (ElementTy != TargetTy);		} while (ElementTy != TargetTy);
if (ElementTy != TargetTy)		if (ElementTy != TargetTy)
Indices.erase(Indices.end() - NumLayers, Indices.end());		Indices.erase(Indices.end() - NumLayers, Indices.end());

return buildGEP(IRB, BasePtr, Indices, NamePrefix);		return buildGEP(IRB, BasePtr, Indices, NamePrefix);
}		}

/// Recursively compute indices for a natural GEP.
///
/// This is the recursive step for getNaturalGEPWithOffset that walks down the
/// element types adding appropriate indices for the GEP.
static Value *getNaturalGEPRecursively(IRBuilderTy &IRB, const DataLayout &DL,
Value Ptr, Type Ty, APInt &Offset,
Type *TargetTy,
SmallVectorImpl<Value *> &Indices,
const Twine &NamePrefix) {
if (Offset == 0)
return getNaturalGEPWithType(IRB, DL, Ptr, Ty, TargetTy, Indices,
NamePrefix);

// We can't recurse through pointer types.
if (Ty->isPointerTy())
return nullptr;

// We try to analyze GEPs over vectors here, but note that these GEPs are
// extremely poorly defined currently. The long-term goal is to remove GEPing
// over a vector from the IR completely.
if (VectorType *VecTy = dyn_cast<VectorType>(Ty)) {
unsigned ElementSizeInBits =
DL.getTypeSizeInBits(VecTy->getScalarType()).getFixedSize();
if (ElementSizeInBits % 8 != 0) {
// GEPs over non-multiple of 8 size vector elements are invalid.
return nullptr;
}
APInt ElementSize(Offset.getBitWidth(), ElementSizeInBits / 8);
APInt NumSkippedElements = Offset.sdiv(ElementSize);
if (NumSkippedElements.ugt(cast<FixedVectorType>(VecTy)->getNumElements()))
return nullptr;
Offset -= NumSkippedElements * ElementSize;
Indices.push_back(IRB.getInt(NumSkippedElements));
return getNaturalGEPRecursively(IRB, DL, Ptr, VecTy->getElementType(),
Offset, TargetTy, Indices, NamePrefix);
}

if (ArrayType *ArrTy = dyn_cast<ArrayType>(Ty)) {
Type *ElementTy = ArrTy->getElementType();
APInt ElementSize(Offset.getBitWidth(),
DL.getTypeAllocSize(ElementTy).getFixedSize());
APInt NumSkippedElements = Offset.sdiv(ElementSize);
if (NumSkippedElements.ugt(ArrTy->getNumElements()))
return nullptr;

Offset -= NumSkippedElements * ElementSize;
Indices.push_back(IRB.getInt(NumSkippedElements));
return getNaturalGEPRecursively(IRB, DL, Ptr, ElementTy, Offset, TargetTy,
Indices, NamePrefix);
}

StructType *STy = dyn_cast<StructType>(Ty);
if (!STy)
return nullptr;

const StructLayout *SL = DL.getStructLayout(STy);
uint64_t StructOffset = Offset.getZExtValue();
if (StructOffset >= SL->getSizeInBytes())
return nullptr;
unsigned Index = SL->getElementContainingOffset(StructOffset);
Offset -= APInt(Offset.getBitWidth(), SL->getElementOffset(Index));
Type *ElementTy = STy->getElementType(Index);
if (Offset.uge(DL.getTypeAllocSize(ElementTy).getFixedSize()))
return nullptr; // The offset points into alignment padding.

Indices.push_back(IRB.getInt32(Index));
return getNaturalGEPRecursively(IRB, DL, Ptr, ElementTy, Offset, TargetTy,
Indices, NamePrefix);
}

/// Get a natural GEP from a base pointer to a particular offset and		/// Get a natural GEP from a base pointer to a particular offset and
/// resulting in a particular type.		/// resulting in a particular type.
///		///
/// The goal is to produce a "natural" looking GEP that works with the existing		/// The goal is to produce a "natural" looking GEP that works with the existing
/// composite types to arrive at the appropriate offset and element type for		/// composite types to arrive at the appropriate offset and element type for
/// a pointer. TargetTy is the element type the returned GEP should point-to if		/// a pointer. TargetTy is the element type the returned GEP should point-to if
/// possible. We recurse by decreasing Offset, adding the appropriate index to		/// possible. We recurse by decreasing Offset, adding the appropriate index to
/// Indices, and setting Ty to the result subtype.		/// Indices, and setting Ty to the result subtype.
///		///
/// If no natural GEP can be constructed, this function returns null.		/// If no natural GEP can be constructed, this function returns null.
static Value *getNaturalGEPWithOffset(IRBuilderTy &IRB, const DataLayout &DL,		static Value *getNaturalGEPWithOffset(IRBuilderTy &IRB, const DataLayout &DL,
Value Ptr, APInt Offset, Type TargetTy,		Value Ptr, APInt Offset, Type TargetTy,
SmallVectorImpl<Value *> &Indices,		SmallVectorImpl<Value *> &Indices,
const Twine &NamePrefix) {		const Twine &NamePrefix) {
PointerType *Ty = cast<PointerType>(Ptr->getType());		PointerType *Ty = cast<PointerType>(Ptr->getType());

// Don't consider any GEPs through an i8* as natural unless the TargetTy is		// Don't consider any GEPs through an i8* as natural unless the TargetTy is
// an i8.		// an i8.
if (Ty == IRB.getInt8PtrTy(Ty->getAddressSpace()) && TargetTy->isIntegerTy(8))		if (Ty == IRB.getInt8PtrTy(Ty->getAddressSpace()) && TargetTy->isIntegerTy(8))
return nullptr;		return nullptr;

Type *ElementTy = Ty->getElementType();		Type *ElementTy = Ty->getElementType();
if (!ElementTy->isSized())		if (!ElementTy->isSized())
return nullptr; // We can't GEP through an unsized element.		return nullptr; // We can't GEP through an unsized element.
if (isa<ScalableVectorType>(ElementTy))
		SmallVector<APInt> IntIndices = DL.getGEPIndicesForOffset(ElementTy, Offset);
		if (Offset != 0)
return nullptr;		return nullptr;
APInt ElementSize(Offset.getBitWidth(),
DL.getTypeAllocSize(ElementTy).getFixedSize());
if (ElementSize == 0)
return nullptr; // Zero-length arrays can't help us build a natural GEP.
APInt NumSkippedElements = Offset.sdiv(ElementSize);

Offset -= NumSkippedElements * ElementSize;		for (const APInt &Index : IntIndices)
Indices.push_back(IRB.getInt(NumSkippedElements));		Indices.push_back(IRB.getInt(Index));
return getNaturalGEPRecursively(IRB, DL, Ptr, ElementTy, Offset, TargetTy,		return getNaturalGEPWithType(IRB, DL, Ptr, ElementTy, TargetTy, Indices,
Indices, NamePrefix);		NamePrefix);
}		}

/// Compute an adjusted pointer from Ptr by Offset bytes where the		/// Compute an adjusted pointer from Ptr by Offset bytes where the
/// resulting pointer has PointerTy.		/// resulting pointer has PointerTy.
///		///
/// This tries very hard to compute a "natural" GEP which arrives at the offset		/// This tries very hard to compute a "natural" GEP which arrives at the offset
/// and produces the pointer type desired. Where it cannot, it will try to use		/// and produces the pointer type desired. Where it cannot, it will try to use
/// the natural GEP to arrive at the offset and bitcast to the type. Where that		/// the natural GEP to arrive at the offset and bitcast to the type. Where that
▲ Show 20 Lines • Show All 3,283 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/getelementptr.ll

Show First 20 Lines • Show All 788 Lines • ▼ Show 20 Lines	;
call i32 (i8, ...) @printf(i8 getelementptr ([17 x i8], [17 x i8]* @"\01LC8", i32 0, i32 0),		call i32 (i8, ...) @printf(i8 getelementptr ([17 x i8], [17 x i8]* @"\01LC8", i32 0, i32 0),
i8* getelementptr (%t1, %t1* bitcast (%t0* @s to %t1*), i32 0, i32 1, i32 0)) nounwind		i8* getelementptr (%t1, %t1* bitcast (%t0* @s to %t1*), i32 0, i32 1, i32 0)) nounwind
ret i32 0		ret i32 0
}		}

; Don't treat signed offsets as unsigned.		; Don't treat signed offsets as unsigned.
define i8* @test36() nounwind {		define i8* @test36() nounwind {
; CHECK-LABEL: @test36(		; CHECK-LABEL: @test36(
; CHECK-NEXT: ret i8* getelementptr ([11 x i8], [11 x i8]* @array, i64 0, i64 -1)		; CHECK-NEXT: ret i8* getelementptr ([11 x i8], [11 x i8]* @array, i64 -1, i64 10)
;		;
ret i8* getelementptr ([11 x i8], [11 x i8]* @array, i32 0, i64 -1)		ret i8* getelementptr ([11 x i8], [11 x i8]* @array, i32 0, i64 -1)
}		}

; Instcombine shouldn't assume that gep(A,0,1) != gep(A,1,0).		; Instcombine shouldn't assume that gep(A,0,1) != gep(A,1,0).
@A37 = external constant [1 x i8]		@A37 = external constant [1 x i8]
define i1 @test37() nounwind {		define i1 @test37() nounwind {
; CHECK-LABEL: @test37(		; CHECK-LABEL: @test37(
▲ Show 20 Lines • Show All 518 Lines • Show Last 20 Lines

llvm/test/Transforms/SROA/scalable-vectors.ll

Show First 20 Lines • Show All 65 Lines • ▼ Show 20 Lines	;
ret <vscale x 4 x i32> %4		ret <vscale x 4 x i32> %4
}		}

; When casting from VLA to VLS via memory check we bail out when producing a		; When casting from VLA to VLS via memory check we bail out when producing a
; GEP where the element type is a scalable vector.		; GEP where the element type is a scalable vector.
define <vscale x 4 x i32> @cast_alloca_from_svint32_t() {		define <vscale x 4 x i32> @cast_alloca_from_svint32_t() {
; CHECK-LABEL: @cast_alloca_from_svint32_t(		; CHECK-LABEL: @cast_alloca_from_svint32_t(
; CHECK-NEXT: [[RETVAL_COERCE:%.*]] = alloca <vscale x 4 x i32>, align 16		; CHECK-NEXT: [[RETVAL_COERCE:%.*]] = alloca <vscale x 4 x i32>, align 16
; CHECK-NEXT: [[TMP1:%.]] = bitcast <vscale x 4 x i32> [[RETVAL_COERCE]] to i8*		; CHECK-NEXT: [[RETVAL_0__SROA_CAST:%.]] = bitcast <vscale x 4 x i32> [[RETVAL_COERCE]] to <16 x i32>*
; CHECK-NEXT: [[RETVAL_0__SROA_CAST:%.]] = bitcast i8 [[TMP1]] to <16 x i32>*
; CHECK-NEXT: store <16 x i32> undef, <16 x i32>* [[RETVAL_0__SROA_CAST]], align 16		; CHECK-NEXT: store <16 x i32> undef, <16 x i32>* [[RETVAL_0__SROA_CAST]], align 16
; CHECK-NEXT: [[TMP2:%.]] = load <vscale x 4 x i32>, <vscale x 4 x i32> [[RETVAL_COERCE]], align 16		; CHECK-NEXT: [[TMP1:%.]] = load <vscale x 4 x i32>, <vscale x 4 x i32> [[RETVAL_COERCE]], align 16
; CHECK-NEXT: ret <vscale x 4 x i32> [[TMP2]]		; CHECK-NEXT: ret <vscale x 4 x i32> [[TMP1]]
;		;
%retval = alloca <16 x i32>		%retval = alloca <16 x i32>
%retval.coerce = alloca <vscale x 4 x i32>		%retval.coerce = alloca <vscale x 4 x i32>
%1 = bitcast <vscale x 4 x i32>* %retval.coerce to i8*		%1 = bitcast <vscale x 4 x i32>* %retval.coerce to i8*
%2 = bitcast <16 x i32>* %retval to i8*		%2 = bitcast <16 x i32>* %retval to i8*
call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %1, i8* align 16 %2, i64 64, i1 false)		call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %1, i8* align 16 %2, i64 64, i1 false)
%3 = load <vscale x 4 x i32>, <vscale x 4 x i32>* %retval.coerce		%3 = load <vscale x 4 x i32>, <vscale x 4 x i32>* %retval.coerce
ret <vscale x 4 x i32> %3		ret <vscale x 4 x i32> %3
}		}

declare void @llvm.memcpy.p0i8.p0i8.i64(i8* nocapture, i8* nocapture, i64, i1) nounwind		declare void @llvm.memcpy.p0i8.p0i8.i64(i8* nocapture, i8* nocapture, i64, i1) nounwind