This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/trunk/
-
trunk/
-
lib/Analysis/
-
Analysis/
3
BasicAliasAnalysis.cpp
-
test/
-
Analysis/BasicAA/
-
BasicAA/
-
noalias-structure.ll
-
Transforms/LoopVectorize/
-
LoopVectorize/
-
global_alias.ll

Differential D20665

Claim NoAlias if two GEPs index different fields of the same struct
ClosedPublic

Authored by twoh on May 25 2016, 11:54 PM.

Download Raw Diff

Details

Reviewers

• dberlin
hfinkel

Commits

rGe846c9dc5250: Claim NoAlias if two GEPs index different fields of the same struct
rL271415: Claim NoAlias if two GEPs index different fields of the same struct

Summary

Patch for Bug 27478. Make BasicAliasAnalysis claims NoAlias if two GEPs index different fields of the same structure.

Diff Detail

Repository: rL LLVM

Event Timeline

twoh updated this revision to Diff 58572.May 25 2016, 11:54 PM

twoh retitled this revision from to Claim NoAlias if two GEPs index different fields of the same struct.

twoh updated this object.

twoh added a reviewer: hfinkel.

twoh added a subscriber: llvm-commits.

mcrosier added a subscriber: mcrosier.May 26 2016, 6:47 AM

mcrosier added inline comments.

lib/Analysis/BasicAliasAnalysis.cpp
841 ↗	(On Diff #58572)	Please add a period.
858 ↗	(On Diff #58572)	Pre-increment is preferred.

• dberlin added a subscriber: • dberlin.May 26 2016, 7:29 AM

• dberlin added inline comments.

lib/Analysis/BasicAliasAnalysis.cpp
841 ↗	(On Diff #58572)	This loop never verifies they are the same struct, just A struct. They have to be the same struct, otherwise it is entirely possible for them to be different structs, and be at (offset,size) that conflict. In general, your entire calculation is essentially trying to figure out (offset, size) of the entire set of accesses. It happens that, for the same struct, you can guarantee that if they go to different (offset, size) they can't alias (because you can't really overlay the structs in a different way). But this is not, in general, true because you don't know where they are really starting from unless you see the allocation site (IE this could be a gep of a gep of a gep of a ...)
867 ↗	(On Diff #58572)	This is not right. Just because what is being loaded is a struct type, means nothing. As above, it's possible to overlay the structs in memory. (Because LLVM is not C, things that are not legal in C are legal in LLVM). You need to verify that what they are accessing through is the same struct type, as the top of your loop says. Not that the loaded element is a struct type.
868 ↗	(On Diff #58572)	This loop is really hard to follow the logic of because of where it breaks and continues.

twoh added inline comments.May 26 2016, 9:25 AM

lib/Analysis/BasicAliasAnalysis.cpp
841 ↗	(On Diff #58572)	@dberlin Thank you for your comment. This loop is invoked only when the pointer operand of GEP1 and GEP2 are same (there is an assertion above). Doesn't this guarantee that they are starting from the same point regardless of their allocation site? The logic is that if two GEPs sharing the same base pointer access through the same index until index 'i-1' but have a different value at index 'i', and if the index typed at index 'i-1' is a struct, then at index 'i' they access two different fields of the same struct (I assumed 'same struct' because two GEPs share the same pointer operand and same index until 'i-1').

Doesn't this guarantee that they are starting from the same point

regardless of their allocation site?

Yes, I missed that in the context :)

The logic is that if two GEPs sharing the same base pointer access through
the same index until index 'i-1' but have a different value at index 'i',
and if the index typed at index 'i-1' is a struct, then at index 'i' they
access two different fields of the same struct (I assumed 'same struct'
because two GEPs share the same pointer operand and same index until 'i-1').

Okay. Because of the way the loop is structured, this isn't really clear
that this is what is happening, despite the comment
:)

http://reviews.llvm.org/D20665

@dberlin Thank you for clarifying. I'll fix the code to make it clear, and address @mcrosier's comments as well.

Addressing comments

Ping

Looks good to me at this point, thank you for addressing comments.

This revision is now accepted and ready to land.Jun 1 2016, 8:57 AM

@dberlin, thank you for accepting the diff. It would be great if someone can commit this diff for me as I don't have the privilege yet.

If you can please remove the whitespace changes, and update the diff, i'll
commit it.

I get apply failures due to them.

@@ -904,7 +938,7 @@

// Because they cannot partially overlap and because fields in an array
// cannot overlap, if we can prove the final indices are different between
// GEP1 and GEP2, we can conclude GEP1 and GEP2 don't alias.

// If the last indices are constants, we've already checked they don't
// equal each other so we can exit early.
if (C1 && C2)

@@ -977,7 +1011,7 @@
the highest %f1 can be is (%alloca + 3). This means %random can
not be higher
than (%alloca - 1), and so is not inbounds, a contradiction.
bool BasicAAResult::isGEPBaseAtNegativeOffset(const GEPOperator *GEPOp,

const DecomposedGEP &DecompGEP, const DecomposedGEP &DecompAlloca,

+ const DecomposedGEP &DecompGEP, const DecomposedGEP &DecompAlloca,

    uint64_t AllocaAccessSize) {
// If the alloca access size is unknown, or the GEP isn't inbounds, bail.
if (AllocaAccessSize == MemoryLocation::UnknownSize || !GEPOp->isInBounds())

Rebased. @dberlin, I'm sorry. I should have rebased to the most recent version. It seems that the change in the variable name caused the conflict. I removed the whitespace changes from the diff as well.

Closed by commit rL271415: Claim NoAlias if two GEPs index different fields of the same struct (authored by dannyb). · Explain WhyJun 1 2016, 11:18 AM

This revision was automatically updated to reflect the committed changes.

Hi Taewook,
I have reverted this, as it seems to cause failures in selfhosting and sanitizer builds.

See
http://lab.llvm.org:8011/builders/clang-x86_64-linux-selfhost-modules/builds/16280

http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-autoconf/builds/20377
etc

I don't know if it is due to undenied behavior in the original code, or your patch missing semantics, but you should investigate.

Happy to recommit it once we've got it working.

@dberlin, Thank you for the note. I'll take a look.

eli.friedman added a subscriber: eli.friedman.Jun 1 2016, 5:08 PM

eli.friedman added inline comments.

llvm/trunk/lib/Analysis/BasicAliasAnalysis.cpp
842	You aren't allowed to make aliasing assumptions based on the type of a GEP; it's just pointer math. "inbounds" just means that the result has to be in bounds relative to the allocation as a whole. See http://llvm.org/docs/LangRef.html#getelementptr-instruction . Even if a frontend generates "nice" code, LLVM itself performs transformations which will break any assumptions based on the type of a GEP (for example, LLVM will transform a bitcast into a GEP).

twoh added inline comments.Jun 1 2016, 5:29 PM

llvm/trunk/lib/Analysis/BasicAliasAnalysis.cpp
842	@eli.friedman Thanks for the comment. I'm afraid I didn't understand the point of your comments. This patch is for the case where two GEPs have the same pointer operand and access through the same indices until a certain point. For such case, not only types, but also indexed values should be identical at that point. If so, when the next index values are different, it means that two GEPs will point different fields of the same struct. It would be great if you could suggest an example that this patch generates wrong result.

His point is that types in GEP's are just suggestions (see the GEP faq).

The type itself is meaningless. You can take the result, bitcast it to a
int*, whatever.

You can also take an int *, bitcast it to a struct, gep it, and use the
result.

Example:

struct A {
  unsigned char a[10];
  unsigned char b;
};
void f(A* x) {
  for (unsigned i = 0; i < 11; ++i)
    x->b += ((unsigned char*)x)[i];
}

Errr, this is not two GEPS of a struct type base pointer with the same
type, with a minimum of two operands, etc.

It's

%6 = getelementptr inbounds i8, i8* %5, i64 %4
 %9 = getelementptr inbounds %struct.A, %struct.A* %x, i32 0, i32 1

This code will do nothing with this.

Want to try again?
:)

and the obvious modification still gives right answers (ie using x->a):

%5 = getelementptr inbounds %struct.A, %struct.A* %x, i32 0, i32 0
%6 = getelementptr inbounds [10 x i8], [10 x i8]* %5, i32 0, i32 0
%7 = getelementptr inbounds i8, i8* %6, i64 %4
%8 = load i8, i8* %7, align 1
%9 = zext i8 %8 to i32
%10 = getelementptr inbounds %struct.A, %struct.A* %x, i32 0, i32 1

%5 and %10 do *not* alias.
The code says nothing about %10 and %6 or %7

My example run through "clang -S -emit-llvm -O2 -fno-unroll-loops":

define void @_Z1fP1A(%struct.A* nocapture %x) #0 {
entry:
  %b = getelementptr inbounds %struct.A, %struct.A* %x, i64 0, i32 1
  %.pre = load i8, i8* %b, align 1, !tbaa !1
  br label %for.body

for.cond.cleanup:                                 ; preds = %for.body
  ret void

for.body:                                         ; preds = %for.body, %entry
  %0 = phi i8 [ %.pre, %entry ], [ %add, %for.body ]
  %indvars.iv = phi i64 [ 0, %entry ], [ %indvars.iv.next, %for.body ]
  %arrayidx = getelementptr inbounds %struct.A, %struct.A* %x, i64 0, i32 0, i64 %indvars.iv
  %1 = load i8, i8* %arrayidx, align 1, !tbaa !5
  %add = add i8 %0, %1
  store i8 %add, i8* %b, align 1, !tbaa !1
  %indvars.iv.next = add nuw nsw i64 %indvars.iv, 1
  %exitcond = icmp eq i64 %indvars.iv.next, 11
  br i1 %exitcond, label %for.cond.cleanup, label %for.body
}

Note in particular %b = getelementptr inbounds %struct.A, %struct.A* %x, i64 0, i32 1 vs. %arrayidx = getelementptr inbounds %struct.A, %struct.A* %x, i64 0, i32 0, i64 %indvars.iv.

sanjoy added a subscriber: sanjoy.Jun 1 2016, 7:44 PM

sanjoy added inline comments.

llvm/trunk/lib/Analysis/BasicAliasAnalysis.cpp
842	Yes I think Eli is right here, if you have struct { int arr_a[2]; int arr_b[2]; }; then in LLVM IR the `-1` th element of `arr_b` is the `1` st element of `arr_a`. Generally I don't think you can do much better than using `GEP::accumulateConstantOffset` if there is a trailing non-constant index. But maybe in cases like `GEP(PTR, 1, unknown, 4)` you can play some tricks with divisibility?

You generally aren't going to be able to do anything without frontend info.
The one trick you can play is actually to use pointer alignment (if it's
there).

@eli.friedman Thank you for bring up this example :) Actually your example is very close with the code snippet that motivates me to work on this patch. The only difference is that in my code index variable i iterates up to 10, not 11.

One thing I'd like to make sure is that the unit of "allocated object" in http://llvm.org/docs/LangRef.html#getelementptr-instruction is the entire struct, not an each field of the struct. I realized that if the value of %indvars.iv is 10 then %arrayidx and %b have the same valid value, but I assumed that it is still fine to aggressively claim NoAlias for such case because accessing %arrayidx will result poison value. However, if the entire struct is the unit of allocated object, then this patch is invalid.

@sanjoy Yes I missed the case of the index value being negative as well (Ahmed pointed out the same point as well in the mailing list). Although it may not be hard to fix the patch to check the sign of indexes, I think it is not worth because this patch has other problems (what Eli pointed) as well.

@dberlin Thank you for your time in reviewing this patch! I'll try to figure out if it is possible to exploit clang annotated metadata to enable this optimization, because this is why gcc outperforms llvm for some of our internal benchmarks. It would be great if you can give me suggestions as well.

You definitely should produce metadata to do this optimization .

(the other alternative, adding some attribute to GEP, tends to be vastly
non-preferred)

Clang already has this: -fstrict-aliasing, which enables struct-path
aware TBAA, via !tbaa metadata.

That TBAA isn't sufficient is intentional though: you're probably
hitting the return PartialAlias at the end of BasicAAResult::aliasGEP
(PR9971).

-Ahmed

"allocated object" in this context means the entire object returned by something like malloc() or the alloca instruction.

If you have an idea for how to improve the wording, please propose a patch for LangRef (llvm/docs/LangRef.rst).

So I'm thinking of adding metadata that indicates the original type of the pointer operand to GEP instructions. With metadata, an analysis can tell if GEP's pointer operand is transformed from bitcast operation or not. What do you think?

So, i'm honestly a bit confused.

Do you mean "what the original language pointer type" was?
Because LLVM does not know enough to know what the language rules are, and
so you'd generally want something lower level, like we have now with !tbaa,
etc.

Do you mean something else?

If your goal is to distinguish whether the original accesses were to
different field offsets, and the language says that means they cna't alias,
you should produce that as metadata.
If your goal is something else, we should start with the goal and try to
design something for it, and hopefully nearby use cases (if any) :)

@dberlin I meant the same level of type information as tbaa for GEP's pointer operand.

For eli's example above, if the original access were to the array field of the struct, clang generates GEP like

%20 = load %struct.A*, %struct.A** %3, align 8
%21 = getelementptr inbounds %struct.A, %struct.A* %20, i32 0, i32 0
%22 = getelementptr inbounds [10 x i8], [10 x i8]* %21, i64 0, i64 %19

But if the access were to char* casted pointer, then it is like

%20 = load %struct.A*, %struct.A** %3, align 8
%21 = bitcast %struct.A* %20 to i8*
%22 = getelementptr inbounds i8, i8* %21, i64 %19

and these instructions are eventually combined to

%10 = getelementptr inbounds %struct.A, %struct.A* %0, i64 0, i32 0, i64 %9

So by knowing the original pointer operand type of GEP, it is possible to tell if the original accesses were to the field of the struct itself or to the casted pointer, assuming that clang generates bitscast operation for every type casting.

Hey taewook,

@dberlin Thank you for your comments. My original plan was making each GEP to understand its source-level type with metadata, but now I also agree on you that the approach will not add much value. Instead, I'm working on adding metadata when clang observes struct field access in the source. The metadata will tell the name of the struct and the index of the accessed field. Does that sound more reasonable?

Yes, but i would go with "offset and size". name and index does not tell
you whether they overlap.

Think of unions :)

@dberlin I was planning to add metadata only for structures (not unions), but even in that case "offset and size" seems like a better idea. Thanks!

Just FYI;
If you want to get this metadata into trunk, you are going to have to have
a real design RFC for it, and, as part of that, explain why you can't or
shouldn't generalize something existing (like the struct path metadata
etc), to this case.

Not suggesting you can't make that argument, of course, just making sure
you realize you'll have to actually be able to defend new metadata, we
don't just add everything :)

@dberlin Thank you for letting me know :) Actually one of my concern is that the scope of the problem that can be solved by adding this metadata might be too limited. Before writing RFC, I'll think a bit more about if there is a way to improve the design of this metadata so it can be useful to other alias analysis problems as well.

a.elovikov added a subscriber: a.elovikov.Apr 9 2018, 1:49 AM

Revision Contents

Path

Size

llvm/

trunk/

lib/

Analysis/

BasicAliasAnalysis.cpp

38 lines

test/

Analysis/

BasicAA/

noalias-structure.ll

36 lines

Transforms/

LoopVectorize/

global_alias.ll

158 lines

Diff 59256

llvm/trunk/lib/Analysis/BasicAliasAnalysis.cpp

Show First 20 Lines • Show All 828 Lines • ▼ Show 20 Lines	static AliasResult aliasSameBasePointerGEPs(const GEPOperator *GEP1,
const DataLayout &DL) {		const DataLayout &DL) {

assert(GEP1->getPointerOperand() == GEP2->getPointerOperand() &&		assert(GEP1->getPointerOperand() == GEP2->getPointerOperand() &&
"Expected GEPs with the same pointer operand");		"Expected GEPs with the same pointer operand");

// Try to determine whether GEP1 and GEP2 index through arrays, into structs,		// Try to determine whether GEP1 and GEP2 index through arrays, into structs,
// such that the struct field accesses provably cannot alias.		// such that the struct field accesses provably cannot alias.
// We also need at least two indices (the pointer, and the struct field).		// We also need at least two indices (the pointer, and the struct field).
if (GEP1->getNumIndices() != GEP2->getNumIndices() \|\|		if (GEP1->getNumIndices() < 2)
GEP1->getNumIndices() < 2)		return MayAlias;

		// If both GEP1 and GEP2 have the inbounds keyword but index different fields
		// of the same struct, they do not alias.
		if (GEP1->isInBounds() && GEP2->isInBounds()) {
		eli.friedmanUnsubmitted Not Done Reply Inline Actions You aren't allowed to make aliasing assumptions based on the type of a GEP; it's just pointer math. "inbounds" just means that the result has to be in bounds relative to the allocation as a whole. See http://llvm.org/docs/LangRef.html#getelementptr-instruction . Even if a frontend generates "nice" code, LLVM itself performs transformations which will break any assumptions based on the type of a GEP (for example, LLVM will transform a bitcast into a GEP). eli.friedman: You aren't allowed to make aliasing assumptions based on the type of a GEP; it's just pointer…
		twohAuthorUnsubmitted Not Done Reply Inline Actions @eli.friedman Thanks for the comment. I'm afraid I didn't understand the point of your comments. This patch is for the case where two GEPs have the same pointer operand and access through the same indices until a certain point. For such case, not only types, but also indexed values should be identical at that point. If so, when the next index values are different, it means that two GEPs will point different fields of the same struct. It would be great if you could suggest an example that this patch generates wrong result. twoh: @eli.friedman Thanks for the comment. I'm afraid I didn't understand the point of your comments.
		sanjoyUnsubmitted Not Done Reply Inline Actions Yes I think Eli is right here, if you have struct { int arr_a[2]; int arr_b[2]; }; then in LLVM IR the `-1` th element of `arr_b` is the `1` st element of `arr_a`. Generally I don't think you can do much better than using `GEP::accumulateConstantOffset` if there is a trailing non-constant index. But maybe in cases like `GEP(PTR, 1, unknown, 4)` you can play some tricks with divisibility? sanjoy: Yes I think Eli is right here, if you have ``` struct { int arr_a[2]; int arr_b[2]; }; ```…
		auto Opi1 = GEP1->op_begin() + 1;
		auto Opi2 = GEP2->op_begin() + 1;
		auto Ope1 = GEP1->op_end();
		auto Ope2 = GEP2->op_end();

		SmallVector<Value *, 8> IntermediateIndices;
		ConstantInt *C1 = nullptr;
		ConstantInt *C2 = nullptr;
		while (Opi1 != Ope1 && Opi2 != Ope2 &&
		(C1 = dyn_cast<ConstantInt>(*Opi1)) &&
		(C2 = dyn_cast<ConstantInt>(*Opi2))) {
		if (C1 == C2) {
		IntermediateIndices.push_back(C1);
		++Opi1;
		++Opi2;
		} else {
		// Both GEPs share the same pointer operand and access through the same
		// indices up to this point, but now they are having different index
		// values. At this point, if the indexed type is a StructType, this means
		// that two GEPs are for two different fields in the same structure.
		auto *Ty = GetElementPtrInst::getIndexedType(
		GEP1->getSourceElementType(), IntermediateIndices);
		if (isa<StructType>(Ty))
		return NoAlias;
		break;
		}
		}
		}

		if (GEP1->getNumIndices() != GEP2->getNumIndices())
return MayAlias;		return MayAlias;

// If we don't know the size of the accesses through both GEPs, we can't		// If we don't know the size of the accesses through both GEPs, we can't
// determine whether the struct fields accessed can't alias.		// determine whether the struct fields accessed can't alias.
if (V1Size == MemoryLocation::UnknownSize \|\|		if (V1Size == MemoryLocation::UnknownSize \|\|
V2Size == MemoryLocation::UnknownSize)		V2Size == MemoryLocation::UnknownSize)
return MayAlias;		return MayAlias;

▲ Show 20 Lines • Show All 877 Lines • Show Last 20 Lines

llvm/trunk/test/Analysis/BasicAA/noalias-structure.ll

				; RUN: opt < %s -basicaa -aa-eval -print-all-alias-modref-info -disable-output 2>&1 \| FileCheck %s

				target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
				target triple = "x86_64-unknown-linux-gnu"

				%struct.Type = type { [10 x i32], i32 }
				@Foo = external global %struct.Type, align 4

				; /// Check that BasicAA claims no alias between different fileds of a structure
				; void test() {
				; for (unsigned i = 0 ; i < 10 ; i++)
				; Foo.arr[i] += Foo.i;
				; }

				define void @test() {
				; CHECK-LABEL: Function: test:
				entry:
				%0 = load i32, i32* getelementptr inbounds (%struct.Type, %struct.Type* @Foo, i64 0, i32 1), align 4
				br label %for.body

				for.cond.cleanup: ; preds = %for.body
				ret void

				for.body: ; preds = %for.body, %entry
				%indvars.iv = phi i64 [ 0, %entry ], [ %indvars.iv.next, %for.body ]

				%arrayidx = getelementptr inbounds %struct.Type, %struct.Type* @Foo, i64 0, i32 0, i64 %indvars.iv
				%1 = load i32, i32* %arrayidx, align 4
				%add = add nsw i32 %1, %0
				store i32 %add, i32* %arrayidx, align 4
				; CHECK: NoAlias: i32* %arrayidx, i32* getelementptr inbounds (%struct.Type, %struct.Type* @Foo, i64 0, i32 1)

				%indvars.iv.next = add nuw nsw i64 %indvars.iv, 1
				%exitcond = icmp eq i64 %indvars.iv.next, 10
				br i1 %exitcond, label %for.cond.cleanup, label %for.body
				}

llvm/trunk/test/Transforms/LoopVectorize/global_alias.ll

Show First 20 Lines • Show All 759 Lines • ▼ Show 20 Lines

for.end: ; preds = %for.cond		for.end: ; preds = %for.cond
%6 = load i32, i32* %a.addr, align 4		%6 = load i32, i32* %a.addr, align 4
%arrayidx5 = getelementptr inbounds [100 x i32], [100 x i32]* getelementptr inbounds (%struct.anon, %struct.anon* @Foo, i32 0, i32 0), i32 0, i32 %6		%arrayidx5 = getelementptr inbounds [100 x i32], [100 x i32]* getelementptr inbounds (%struct.anon, %struct.anon* @Foo, i32 0, i32 0), i32 0, i32 %6
%7 = load i32, i32* %arrayidx5, align 4		%7 = load i32, i32* %arrayidx5, align 4
ret i32 %7		ret i32 %7
}		}

		; /// Different objects, swapped induction
;; === Now, the tests that we could vectorize with induction changes or run-time checks ===		; int noAlias15(int a) {


; /// Different objects, swapped induction, alias at the end
; int mayAlias01 (int a) {
; int i;		; int i;
; for (i=0; i<SIZE; i++)		; for (i=0; i<SIZE; i++)
; Foo.A[i] = Foo.B[SIZE-i-1] + a;		; Foo.A[i] = Foo.B[SIZE-i-1] + a;
; return Foo.A[a];		; return Foo.A[a];
; }		; }
; CHECK-LABEL: define i32 @mayAlias01(		; CHECK-LABEL: define i32 @noAlias15(
; CHECK-NOT: add nsw <4 x i32>		; CHECK: add nsw <4 x i32>
; CHECK: ret		; CHECK: ret

define i32 @mayAlias01(i32 %a) nounwind {		define i32 @noAlias15(i32 %a) nounwind {
entry:		entry:
%a.addr = alloca i32, align 4		%a.addr = alloca i32, align 4
%i = alloca i32, align 4		%i = alloca i32, align 4
store i32 %a, i32* %a.addr, align 4		store i32 %a, i32* %a.addr, align 4
store i32 0, i32* %i, align 4		store i32 0, i32* %i, align 4
br label %for.cond		br label %for.cond

for.cond: ; preds = %for.inc, %entry		for.cond: ; preds = %for.inc, %entry
Show All 22 Lines

for.end: ; preds = %for.cond		for.end: ; preds = %for.cond
%6 = load i32, i32* %a.addr, align 4		%6 = load i32, i32* %a.addr, align 4
%arrayidx3 = getelementptr inbounds [100 x i32], [100 x i32]* getelementptr inbounds (%struct.anon, %struct.anon* @Foo, i32 0, i32 0), i32 0, i32 %6		%arrayidx3 = getelementptr inbounds [100 x i32], [100 x i32]* getelementptr inbounds (%struct.anon, %struct.anon* @Foo, i32 0, i32 0), i32 0, i32 %6
%7 = load i32, i32* %arrayidx3, align 4		%7 = load i32, i32* %arrayidx3, align 4
ret i32 %7		ret i32 %7
}		}

; /// Different objects, swapped induction, alias at the beginning		; /// Different objects, swapped induction
; int mayAlias02 (int a) {		; int noAlias16 (int a) {
; int i;		; int i;
; for (i=0; i<SIZE; i++)		; for (i=0; i<SIZE; i++)
; Foo.A[SIZE-i-1] = Foo.B[i] + a;		; Foo.A[SIZE-i-1] = Foo.B[i] + a;
; return Foo.A[a];		; return Foo.A[a];
; }		; }
; CHECK-LABEL: define i32 @mayAlias02(		; CHECK-LABEL: define i32 @noAlias16(
; CHECK-NOT: add nsw <4 x i32>		; CHECK: add nsw <4 x i32>
; CHECK: ret		; CHECK: ret

define i32 @mayAlias02(i32 %a) nounwind {		define i32 @noAlias16(i32 %a) nounwind {
entry:		entry:
%a.addr = alloca i32, align 4		%a.addr = alloca i32, align 4
%i = alloca i32, align 4		%i = alloca i32, align 4
store i32 %a, i32* %a.addr, align 4		store i32 %a, i32* %a.addr, align 4
store i32 0, i32* %i, align 4		store i32 0, i32* %i, align 4
br label %for.cond		br label %for.cond

for.cond: ; preds = %for.inc, %entry		for.cond: ; preds = %for.inc, %entry
Show All 22 Lines

for.end: ; preds = %for.cond		for.end: ; preds = %for.cond
%6 = load i32, i32* %a.addr, align 4		%6 = load i32, i32* %a.addr, align 4
%arrayidx3 = getelementptr inbounds [100 x i32], [100 x i32]* getelementptr inbounds (%struct.anon, %struct.anon* @Foo, i32 0, i32 0), i32 0, i32 %6		%arrayidx3 = getelementptr inbounds [100 x i32], [100 x i32]* getelementptr inbounds (%struct.anon, %struct.anon* @Foo, i32 0, i32 0), i32 0, i32 %6
%7 = load i32, i32* %arrayidx3, align 4		%7 = load i32, i32* %arrayidx3, align 4
ret i32 %7		ret i32 %7
}		}

; /// Pointer access, run-time check added
; int mayAlias03 (int a) {
; int i;
; for (i=0; i<SIZE; i++)
; (PA+i) = (PB+SIZE-i-1) + a;
; return *(PA+a);
; }
; CHECK-LABEL: define i32 @mayAlias03(
; CHECK-NOT: add nsw <4 x i32>
; CHECK: ret

define i32 @mayAlias03(i32 %a) nounwind {
entry:
%a.addr = alloca i32, align 4
%i = alloca i32, align 4
store i32 %a, i32* %a.addr, align 4
store i32 0, i32* %i, align 4
br label %for.cond

for.cond: ; preds = %for.inc, %entry
%0 = load i32, i32* %i, align 4
%cmp = icmp slt i32 %0, 100
br i1 %cmp, label %for.body, label %for.end

for.body: ; preds = %for.cond
%1 = load i32, i32* @PB, align 4
%add.ptr = getelementptr inbounds i32, i32* %1, i32 100
%2 = load i32, i32* %i, align 4
%idx.neg = sub i32 0, %2
%add.ptr1 = getelementptr inbounds i32, i32* %add.ptr, i32 %idx.neg
%add.ptr2 = getelementptr inbounds i32, i32* %add.ptr1, i32 -1
%3 = load i32, i32* %add.ptr2, align 4
%4 = load i32, i32* %a.addr, align 4
%add = add nsw i32 %3, %4
%5 = load i32, i32* @PA, align 4
%6 = load i32, i32* %i, align 4
%add.ptr3 = getelementptr inbounds i32, i32* %5, i32 %6
store i32 %add, i32* %add.ptr3, align 4
br label %for.inc

for.inc: ; preds = %for.body
%7 = load i32, i32* %i, align 4
%inc = add nsw i32 %7, 1
store i32 %inc, i32* %i, align 4
br label %for.cond

for.end: ; preds = %for.cond
%8 = load i32, i32* @PA, align 4
%9 = load i32, i32* %a.addr, align 4
%add.ptr4 = getelementptr inbounds i32, i32* %8, i32 %9
%10 = load i32, i32* %add.ptr4, align 4
ret i32 %10
}


;; === Finally, the tests that should only vectorize with care (or if we ignore undefined behaviour at all) ===		;; === Ignore undefined behavior ===


; int mustAlias01 (int a) {		; int noAlias17(int a) {
; int i;		; int i;
; for (i=0; i<SIZE; i++)		; for (i=0; i<SIZE; i++)
; Foo.A[i+10] = Foo.B[SIZE-i-1] + a;		; Foo.A[i+10] = Foo.B[SIZE-i-1] + a;
; return Foo.A[a];		; return Foo.A[a];
; }		; }
; CHECK-LABEL: define i32 @mustAlias01(		; CHECK-LABEL: define i32 @noAlias17(
; CHECK-NOT: add nsw <4 x i32>		; CHECK: add nsw <4 x i32>
; CHECK: ret		; CHECK: ret

define i32 @mustAlias01(i32 %a) nounwind {		define i32 @noAlias17(i32 %a) nounwind {
entry:		entry:
%a.addr = alloca i32, align 4		%a.addr = alloca i32, align 4
%i = alloca i32, align 4		%i = alloca i32, align 4
store i32 %a, i32* %a.addr, align 4		store i32 %a, i32* %a.addr, align 4
store i32 0, i32* %i, align 4		store i32 0, i32* %i, align 4
br label %for.cond		br label %for.cond

for.cond: ; preds = %for.inc, %entry		for.cond: ; preds = %for.inc, %entry
Show All 23 Lines

for.end: ; preds = %for.cond		for.end: ; preds = %for.cond
%6 = load i32, i32* %a.addr, align 4		%6 = load i32, i32* %a.addr, align 4
%arrayidx4 = getelementptr inbounds [100 x i32], [100 x i32]* getelementptr inbounds (%struct.anon, %struct.anon* @Foo, i32 0, i32 0), i32 0, i32 %6		%arrayidx4 = getelementptr inbounds [100 x i32], [100 x i32]* getelementptr inbounds (%struct.anon, %struct.anon* @Foo, i32 0, i32 0), i32 0, i32 %6
%7 = load i32, i32* %arrayidx4, align 4		%7 = load i32, i32* %arrayidx4, align 4
ret i32 %7		ret i32 %7
}		}

; int mustAlias02 (int a) {		; int noAlias18(int a) {
; int i;		; int i;
; for (i=0; i<SIZE; i++)		; for (i=0; i<SIZE; i++)
; Foo.A[i] = Foo.B[SIZE-i-10] + a;		; Foo.A[i] = Foo.B[SIZE-i-10] + a;
; return Foo.A[a];		; return Foo.A[a];
; }		; }
; CHECK-LABEL: define i32 @mustAlias02(		; CHECK-LABEL: define i32 @noAlias18(
; CHECK-NOT: add nsw <4 x i32>		; CHECK: add nsw <4 x i32>
; CHECK: ret		; CHECK: ret

define i32 @mustAlias02(i32 %a) nounwind {		define i32 @noAlias18(i32 %a) nounwind {
entry:		entry:
%a.addr = alloca i32, align 4		%a.addr = alloca i32, align 4
%i = alloca i32, align 4		%i = alloca i32, align 4
store i32 %a, i32* %a.addr, align 4		store i32 %a, i32* %a.addr, align 4
store i32 0, i32* %i, align 4		store i32 0, i32* %i, align 4
br label %for.cond		br label %for.cond

for.cond: ; preds = %for.inc, %entry		for.cond: ; preds = %for.inc, %entry
Show All 22 Lines

for.end: ; preds = %for.cond		for.end: ; preds = %for.cond
%6 = load i32, i32* %a.addr, align 4		%6 = load i32, i32* %a.addr, align 4
%arrayidx3 = getelementptr inbounds [100 x i32], [100 x i32]* getelementptr inbounds (%struct.anon, %struct.anon* @Foo, i32 0, i32 0), i32 0, i32 %6		%arrayidx3 = getelementptr inbounds [100 x i32], [100 x i32]* getelementptr inbounds (%struct.anon, %struct.anon* @Foo, i32 0, i32 0), i32 0, i32 %6
%7 = load i32, i32* %arrayidx3, align 4		%7 = load i32, i32* %arrayidx3, align 4
ret i32 %7		ret i32 %7
}		}

; int mustAlias03 (int a) {		; int noAlias19(int a) {
; int i;		; int i;
; for (i=0; i<SIZE; i++)		; for (i=0; i<SIZE; i++)
; Foo.A[i+10] = Foo.B[SIZE-i-10] + a;		; Foo.A[i+10] = Foo.B[SIZE-i-10] + a;
; return Foo.A[a];		; return Foo.A[a];
; }		; }
; CHECK-LABEL: define i32 @mustAlias03(		; CHECK-LABEL: define i32 @noAlias19(
; CHECK-NOT: add nsw <4 x i32>		; CHECK: add nsw <4 x i32>
; CHECK: ret		; CHECK: ret

define i32 @mustAlias03(i32 %a) nounwind {		define i32 @noAlias19(i32 %a) nounwind {
entry:		entry:
%a.addr = alloca i32, align 4		%a.addr = alloca i32, align 4
%i = alloca i32, align 4		%i = alloca i32, align 4
store i32 %a, i32* %a.addr, align 4		store i32 %a, i32* %a.addr, align 4
store i32 0, i32* %i, align 4		store i32 0, i32* %i, align 4
br label %for.cond		br label %for.cond

for.cond: ; preds = %for.inc, %entry		for.cond: ; preds = %for.inc, %entry
Show All 22 Lines	for.inc: ; preds = %for.body
br label %for.cond		br label %for.cond

for.end: ; preds = %for.cond		for.end: ; preds = %for.cond
%6 = load i32, i32* %a.addr, align 4		%6 = load i32, i32* %a.addr, align 4
%arrayidx4 = getelementptr inbounds [100 x i32], [100 x i32]* getelementptr inbounds (%struct.anon, %struct.anon* @Foo, i32 0, i32 0), i32 0, i32 %6		%arrayidx4 = getelementptr inbounds [100 x i32], [100 x i32]* getelementptr inbounds (%struct.anon, %struct.anon* @Foo, i32 0, i32 0), i32 0, i32 %6
%7 = load i32, i32* %arrayidx4, align 4		%7 = load i32, i32* %arrayidx4, align 4
ret i32 %7		ret i32 %7
}		}

		; /// Pointer access, run-time check added
		; int mayAlias01 (int a) {
		; int i;
		; for (i=0; i<SIZE; i++)
		; (PA+i) = (PB+SIZE-i-1) + a;
		; return *(PA+a);
		; }
		; CHECK-LABEL: define i32 @mayAlias01(
		; CHECK-NOT: add nsw <4 x i32>
		; CHECK: ret

		define i32 @mayAlias01(i32 %a) nounwind {
		entry:
		%a.addr = alloca i32, align 4
		%i = alloca i32, align 4
		store i32 %a, i32* %a.addr, align 4
		store i32 0, i32* %i, align 4
		br label %for.cond

		for.cond: ; preds = %for.inc, %entry
		%0 = load i32, i32* %i, align 4
		%cmp = icmp slt i32 %0, 100
		br i1 %cmp, label %for.body, label %for.end

		for.body: ; preds = %for.cond
		%1 = load i32, i32* @PB, align 4
		%add.ptr = getelementptr inbounds i32, i32* %1, i32 100
		%2 = load i32, i32* %i, align 4
		%idx.neg = sub i32 0, %2
		%add.ptr1 = getelementptr inbounds i32, i32* %add.ptr, i32 %idx.neg
		%add.ptr2 = getelementptr inbounds i32, i32* %add.ptr1, i32 -1
		%3 = load i32, i32* %add.ptr2, align 4
		%4 = load i32, i32* %a.addr, align 4
		%add = add nsw i32 %3, %4
		%5 = load i32, i32* @PA, align 4
		%6 = load i32, i32* %i, align 4
		%add.ptr3 = getelementptr inbounds i32, i32* %5, i32 %6
		store i32 %add, i32* %add.ptr3, align 4
		br label %for.inc

		for.inc: ; preds = %for.body
		%7 = load i32, i32* %i, align 4
		%inc = add nsw i32 %7, 1
		store i32 %inc, i32* %i, align 4
		br label %for.cond

		for.end: ; preds = %for.cond
		%8 = load i32, i32* @PA, align 4
		%9 = load i32, i32* %a.addr, align 4
		%add.ptr4 = getelementptr inbounds i32, i32* %8, i32 %9
		%10 = load i32, i32* %add.ptr4, align 4
		ret i32 %10
		}