This is an archive of the discontinued LLVM Phabricator instance.

Derive GEP index type from Data Layout
ClosedPublic

Authored by delena on Jan 16 2018, 12:06 PM.

Details

Summary

In the current version, InstCombiner "normalizes" GEPs and extends the index operand to the pointer width.

This works fine as long as a pointer can be converted to an integer for address calculation, and all registered targets do this.
The target I'm working on has a very restricted ISA for pointer calculation. Hal suggested retrieving the GEP index width from the Data Layout:
http://lists.llvm.org/pipermail/llvm-dev/2018-January/120416.html

I added the interface to the Data Layout and changed InstCombiner.
I know I didn't touch all GEP creation points, but all the changes you see in this review are covered by our internal test system.
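For context, the pointer specification this patch extends has the form p[n]:&lt;size&gt;:&lt;abi&gt;[:&lt;pref&gt;][:&lt;idx&gt;], where the new fourth numeric field is the index width. A rough illustrative sketch of deriving the index width with a default of the pointer size (plain Python, not LLVM's actual parser; the function name is made up):

```python
def parse_pointer_spec(spec):
    """Parse a data layout pointer spec like 'p:64:64:64:32'.

    Fields after 'p': size, ABI alignment, optional preferred
    alignment, optional index width. The index width defaults to
    the pointer size when not specified. Illustrative sketch only.
    """
    parts = spec.split(":")
    assert parts[0].startswith("p")
    size = int(parts[1])
    abi_align = int(parts[2])
    pref_align = int(parts[3]) if len(parts) > 3 else abi_align
    index_width = int(parts[4]) if len(parts) > 4 else size
    return {"size": size, "abi": abi_align,
            "pref": pref_align, "index": index_width}
```

For example, "p:64:64:64:32" describes 64-bit pointers addressed with 32-bit indices, while "p:64:64" keeps the index width equal to the pointer size.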

Diff Detail

Repository
rL LLVM

Event Timeline

delena created this revision.Jan 16 2018, 12:06 PM
craig.topper added inline comments.Jan 16 2018, 5:15 PM
lib/Transforms/InstCombine/InstructionCombining.cpp
1883–1887

Should this assert message be updated, since it's not guaranteed to be pointer width now?

1923–1927

Same with this assert.

delena updated this revision to Diff 130096.Jan 16 2018, 10:41 PM
delena marked 2 inline comments as done.

Fixed 2 "assert" messages.

delena updated this revision to Diff 130138.Jan 17 2018, 5:10 AM
delena edited the summary of this revision. (Show Details)

Added tests for a data layout where the pointer is wider than the largest supported integer type.

craig.topper added inline comments.Jan 18 2018, 5:05 PM
lib/Analysis/ConstantFolding.cpp
812 ↗(On Diff #130138)

Can you put curly braces after this if and the for loop below it to help with readability? I know the for loop didn't have them before, but I feel like it should have. I tend to think that if the inner scope is curly braced, the outer scope should be too.

theraven requested changes to this revision.Jan 19 2018, 4:19 AM

I don't like this patch as is, for several reasons.

  1. It's adding a hack that assumes that the offset should be the width of the widest integer operation. This is probably true in most cases (it is for us), but if we're going to introduce the idea that an address offset is distinct from the size of the pointer then we should do it properly and add that to the TargetInfo string explicitly (defaulting to the same size, if not specified).
  2. We're computing the correct width every time it's requested, which looks expensive. TargetInfo should store the width for each address space and, for non-vector types, not have to do any calculation to determine the kind of integer to return.
  3. It fixes only around 20% of the places that we've found that assume that the size and range of the pointer are the same.
This revision now requires changes to proceed.Jan 19 2018, 4:19 AM
delena updated this revision to Diff 130600.Jan 19 2018, 6:17 AM
delena marked an inline comment as done.

Updated, following Craig's comments.

I don't like this patch as is, for several reasons.

  1. It's adding a hack that assumes that the offset should be the width of the widest integer operation. This is probably true in most cases (it is for us), but if we're going to introduce the idea that an address offset is distinct from the size of the pointer then we should do it properly and add that to the TargetInfo string explicitly (defaulting to the same size, if not specified).

So you propose to extend the Data Layout string and add the index size to it, right? It was one of the options that Hal suggested. Ok.

  2. We're computing the correct width every time it's requested, which looks expensive. TargetInfo should store the width for each address space and, for non-vector types, not have to do any calculation to determine the kind of integer to return.

We calculate getIntPtrType() anyway; getIndexType() is no more expensive. If I extend TargetInfo, the extension will be optional and all other targets will still calculate getIntPtrType().

  3. It fixes only around 20% of the places that we've found that assume that the size and range of the pointer are the same.

I can't derive all the places from your code. You can show them all to me, one by one, or we can fix more places gradually on top of this patch.

I don't like this patch as is, for several reasons.

  1. It's adding a hack that assumes that the offset should be the width of the widest integer operation. This is probably true in most cases (it is for us), but if we're going to introduce the idea that an address offset is distinct from the size of the pointer then we should do it properly and add that to the TargetInfo string explicitly (defaulting to the same size, if not specified).

So you propose to extend the Data Layout string and add the index size to it, right? It was one of the options that Hal suggested. Ok.

Yes, if we're going to fix this upstream, let's fix it properly.

  2. We're computing the correct width every time it's requested, which looks expensive. TargetInfo should store the width for each address space and, for non-vector types, not have to do any calculation to determine the kind of integer to return.

We calculate getIntPtrType() anyway; getIndexType() is no more expensive. If I extend TargetInfo, the extension will be optional and all other targets will still calculate getIntPtrType().

If you read it from the DataLayout string, you'll either construct it at that parsing time from the specified version or from the default version.

  3. It fixes only around 20% of the places that we've found that assume that the size and range of the pointer are the same.

I can't derive all the places from your code. You can show them all to me, one by one, or we can fix more places gradually on top of this patch.

Grepping the code for all uses of getPointerBaseSize should show them all, but I can send you a list.

I want to deprecate SCEVs for pointers if the index size is not equal to pointer size.
What do you think?

bool ScalarEvolution::isSCEVable(Type *Ty) const {
  if (Ty->isIntegerTy())
    return true;
  if (Ty->isPointerTy()) {
    // A pointer is not SCEVable if its index type and pointer type
    // have different widths.
    const DataLayout &DL = getDataLayout();
    if (DL.getIndexTypeSizeInBits(Ty) == DL.getPointerTypeSizeInBits(Ty))
      return true;
  }
  return false;
}

I want to deprecate SCEVs for pointers if the index size is not equal to pointer size.
What do you think?

This will mean that you don't get a load of loop optimisations. I think that's a pretty big hammer. There's no reason why SCEV can't work here - we use it and have a bunch of patches against it to make it work in this context. Please take a look at our code and see how much of it is applicable to you.

I looked at your code:

/// Return the size in bits of the specified type, for which isSCEVable must
/// return true.
uint64_t ScalarEvolution::getTypeSizeInBits(Type *Ty) const {
  assert(isSCEVable(Ty) && "Type is not SCEVable!");
  const DataLayout &DL = getDataLayout();
  if (PointerType *PT = dyn_cast<PointerType>(Ty))
    return DL.getPointerBaseSizeInBits(PT->getPointerAddressSpace());
  return DL.getTypeSizeInBits(Ty);
}
I can't say that the size of a pointer is smaller than it is, and I can't truncate a pointer to an integer in order to expand all SCEV expressions.

Ayal added a subscriber: Ayal.Jan 29 2018, 9:31 AM
delena updated this revision to Diff 131955.Jan 30 2018, 6:07 AM

Added the index width specification to the DataLayout. Updated the LangRef.
Fixed pointer vs. index sizes in the code.
Added more tests.

sanjoy added inline comments.Jan 30 2018, 10:25 AM
../lib/Analysis/ScalarEvolution.cpp
3667 ↗(On Diff #131955)

Generally speaking, the SCEV changes need to be tested.

3675 ↗(On Diff #131955)

I don't think this is a correct place to make this change -- the size of a pointer is the size of a pointer. I think you need to change the SCEV corresponding to GEP(Ptr, Idx) to be "sext(Ptr) + Idx" or "Ptr + sext(Idx)" depending on their relative sizes.
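Sanjoy's suggestion amounts to widening the narrower addend before the add. A small model of "Ptr + sext(Idx)" with hypothetical widths (plain Python for illustration, not SCEV code):

```python
def sext(value, from_bits, to_bits):
    # Sign-extend a two's-complement value from from_bits to to_bits.
    value &= (1 << from_bits) - 1
    if value >> (from_bits - 1):        # sign bit set -> negative
        value -= 1 << from_bits
    return value & ((1 << to_bits) - 1)

def gep(ptr, idx, ptr_bits=64, idx_bits=32):
    # Model GEP(Ptr, Idx) as "Ptr + sext(Idx)": the 32-bit index is
    # widened first so both addends have the pointer's 64-bit type.
    return (ptr + sext(idx, idx_bits, ptr_bits)) & ((1 << ptr_bits) - 1)
```

A negative index then wraps correctly: gep(0x1000, 0xFFFFFFFF) treats the index as -1 and yields 0xFFF.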

delena added inline comments.Jan 31 2018, 1:46 AM
../lib/Analysis/ScalarEvolution.cpp
3667 ↗(On Diff #131955)

I added several tests that go through the SCEV. They look OK right now. I can't say that I covered all corner cases, but we can make further changes gradually; there is no impact on in-tree targets. If you see something specific that requires more testing now, please let me know.

3675 ↗(On Diff #131955)

I can't create SCEV expressions with ptr + idx; they fail an assertion on mismatched types.

Ayal added inline comments.Feb 1 2018, 2:47 AM
../lib/Analysis/ScalarEvolution.cpp
3675 ↗(On Diff #131955)

Elena, Sanjoy's suggestion above to change the SCEV to "sext(Ptr) + Idx" or "Ptr + sext(Idx)" will bring the two addends to the same type, i.e., the larger type. The challenge in your case is the lack of target support for integer addition of pointer-sized integers, which seems similar to CHERI's case. Except CHERI pointers (or capabilities) hold, in addition to a standard-sized address, extra information, such that the latter can be stripped out for SCEV purposes (IIUC - @theraven please correct if needed); whereas in your case the address itself is larger than a standard-sized integer. Perhaps in your case too the pointer can be stripped down to standard-sized integers to leverage SCEV's capabilities on "legal" types, which seems to be what your patch is doing, coupled with separate logic that deals with the stripped-out bits(?).

theraven added inline comments.Feb 1 2018, 3:49 AM
../lib/Analysis/ScalarEvolution.cpp
3675 ↗(On Diff #131955)

That's pretty much the case for us: our pointers are 128 bits, but have a 64-bit range (64 bits of metadata). We have modified DataLayout to explicitly understand that pointer size and range are different (in a slightly hacky way, which we should improve before we think about upstreaming). In scalar evolution, we always use the pointer's range as the type.

We don't support arbitrary integer operations on pointers and in our back end we have added some new MVTs to represent non-integer pointer types. Our architecture provides pointer + integer addressing modes.

I believe that, in the motivating example for this change, the existing ScalarEvolution code is correct: it should use pointer-sized integers, because otherwise the analyses are likely to be wrong in some exciting corner cases.

We have addressed this by adding explicit PTRADD SelectionDAG nodes, which perform pointer + integer addition. For complex addressing modes, we end up with (ptradd base (some complex integer ops)). This works well as long as the underlying hardware supports address register + integer register addressing, which I presume is the case for Intel (it is for all Harvard architectures that I've come across).

If you are targeting an architecture for which pointer operations and integer operations are not the same, then you should follow the same approach: in the back end, lower pointers to some non-integer type and match pointer operations with different patterns to integer ones. We have a bunch of SelectionDAG and TableGen patches that make this work well, which we'd be happy to upstream.
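The lowering described above can be pictured as keeping all index arithmetic at the integer (index) width and performing a single pointer-plus-integer addition at the end. An illustrative model (hypothetical helper in plain Python, not the actual SelectionDAG code):

```python
def lower_gep(base, indices, elt_sizes, idx_bits=32):
    # Fold all index arithmetic into one integer offset computed at the
    # index width, then emit a single (ptradd base offset) node, mirroring
    # "(ptradd base (some complex integer ops))" from the discussion.
    mask = (1 << idx_bits) - 1
    offset = 0
    for idx, size in zip(indices, elt_sizes):
        offset = (offset + idx * size) & mask
    return ("ptradd", base, offset)
```

This works as long as the hardware supports address register + integer register addressing, which is the premise of the PTRADD approach.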

delena added a comment.Feb 1 2018, 4:09 AM
We have addressed this by adding explicit PTRADD SelectionDAG nodes, which perform pointer + integer addition. For complex addressing modes, we end up with (ptradd base (some complex integer ops)). This works well as long as the underlying hardware supports address register + integer register addressing, which I presume is the case for Intel (it is for all Harvard architectures that I've come across).

Yes, we also added an ADDPTR node for SelectionDAG, and we have more changes related to the special pointer type. Apparently, the codegen does not work with MVT::Ptr.
We can try to upstream the part of the DAG builder that makes ADDPTR from GEP.

@theraven, the latest uploaded version is aligned with what you implemented out of tree. Could you please take a look?

theraven accepted this revision.Feb 6 2018, 2:29 AM

Two very small nits (which I'd be happy to see fixed after commit, but might be easier to fix first), but otherwise it looks like a significantly cleaned-up version of what we have.

Thank you very much for working on this! Our next merge will be a little bit painful, but subsequent ones should be a lot easier.

Are you planning on upstreaming your ADDPTR SelectionDAG stuff? We have added PTRADD, INTTOPTR and PTRTOINT nodes and if they're useful to someone apart from us then we can upstream them.

../include/llvm/IR/DataLayout.h
357 ↗(On Diff #131955)

Please can we not have a default for AS? We've added defaults for other things like this because they were existing APIs and we didn't want to have to update all of the callers at once, but all of the callers of this are already being updated and so should specify the correct AS.

../lib/Analysis/ScalarEvolution.cpp
3675 ↗(On Diff #131955)

I agree with @sanjoy that this isn't the correct place for this change, but it does happen to be the least disruptive place for the change. The correct solution is probably to rename this method to something like getTypeArithmeticSizeInBits so that it's clear that it's returning a size as a proxy for a range and not a storage size.

This revision is now accepted and ready to land.Feb 6 2018, 2:29 AM
sanjoy added inline comments.Feb 6 2018, 1:06 PM
../lib/Analysis/ScalarEvolution.cpp
3667 ↗(On Diff #131955)

I don't see these tests in this current version of the patch.

3675 ↗(On Diff #131955)

Given what you said, the right fix seems to be to truncate 128-bit pointers to 64 bits in getSCEV instead of lying about the pointer's size. SCEV calls into other parts of LLVM like ValueTracking, and other parts of LLVM call into SCEV (indvars, lsr, scev-aa, etc.), and I'm worried that a discrepancy like this (pointer size = 64 in SCEV but 128 elsewhere) will cause bugs.

delena added a comment.Feb 6 2018, 9:36 PM

I don't see these tests in this current version of the patch.

All the tests named *-custom.ll go through the SCEV calculations.

theraven added inline comments.Feb 7 2018, 2:54 AM
../lib/Analysis/ScalarEvolution.cpp
3675 ↗(On Diff #131955)

Truncating for us would be absolutely the wrong thing, because it changes the semantics (throws away all of the bounds metadata, falls back to some per-environment bounds which may not even allow access to this address). In our case, we have a difference between the size and the range.

In all of the places where we've seen this used in ScalarEvolution, it cares about the range, not the storage size; it just happens that on most architectures these are the same (because a 32-bit pointer has a range of 2^32 bytes, and a 64-bit pointer has a range of 2^64 bytes).

The correct solution is to either rename this method something like getTypeArithmeticSizeInBits and update all of the callers, or add a new getTypeArithmeticSizeInBits method and update all of the callers (I don't believe that there are any that care about the storage size, but I might have missed one).

We've been running with a change that's semantically identical to @delena's for a few years and have not encountered any miscompilations as a result (building the whole of the FreeBSD base system, a bunch of benchmarks, and a few other large programs), so I'm fairly confident that it's safe.
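The size/range distinction above can be stated numerically: a capability that stores metadata alongside the address has a smaller range than its storage size suggests. A tiny sketch (function name hypothetical):

```python
def addressable_range(pointer_bits, metadata_bits=0):
    # Addressable range in bytes: 2^(address bits), where the address
    # bits are the storage size minus any metadata bits.
    return 2 ** (pointer_bits - metadata_bits)
```

A conventional 64-bit pointer addresses 2^64 bytes; a 128-bit CHERI-style capability with 64 bits of metadata still addresses only 2^64 bytes.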

Thank you very much for working on this. It will make our future upstream merges much easier.

../docs/LangRef.rst
1913 ↗(On Diff #131955)

fourth parameter

../include/llvm/IR/DataLayout.h
357 ↗(On Diff #131955)

Yes, please remove the default value here. We have run into lots of issues due to using the size of AS0 instead of the correct one.

../lib/Transforms/InstCombine/InstructionCombining.cpp
1511 ↗(On Diff #131955)

The index width may not be the same as the pointer width.

delena updated this revision to Diff 133387.Feb 8 2018, 2:21 AM
delena marked 3 inline comments as done.

Updated according to the latest comments.

delena updated this revision to Diff 133840.Feb 12 2018, 5:35 AM

Added more tests with custom data layout.

This revision was automatically updated to reflect the committed changes.
bjope added a subscriber: bjope.Jun 20 2018, 1:05 AM