This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/Analysis/
-
llvm/
-
Analysis/
-
ScopedNoAliasAA.h
-
lib/
-
Analysis/
-
ScopedNoAliasAA.cpp
-
IR/
1/2
Metadata.cpp
-
test/
-
Analysis/ScopedNoAliasAA/
-
ScopedNoAliasAA/
1/1
alias-scope-merging.ll
-
Transforms/
-
GVN/
-
noalias.ll
-
InstCombine/
-
fold-phi-load-metadata.ll
-
MemCpyOpt/
4/6
callslot_badaa.ll
-
NewGVN/
-
noalias.ll

Differential D91576

[MemCpyOpt] Correctly merge alias scopes during call slot optimization
ClosedPublic

Authored by modimo on Nov 16 2020, 3:44 PM.

Download Raw Diff

Details

Reviewers

hfinkel
hoyFB
nikic
jdoerfert
jeroen.dobbelaere

Commits

rG18603319321a: [MemCpyOpt] Correctly merge alias scopes during call slot optimization

Summary

When MemCpyOpt performs call slot optimization it will concatenate the alias.scope metadata between the function call and the memcpy. However, scoped AA relies on the domains in metadata to be maintained in a caller-callee relationship. Naive concatenation breaks this assumption leading to bad AA results.

The fix is to take the intersection of domains then union the scopes within those domains.

The original bug came from a case of rust bad codegen which uses this bad aliasing to perform additional memcpy optimizations. As show in the added test case %src got forwarded past its lifetime leading to a dereference of garbage data.

Testing
ninja check-llvm

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

modimo created this revision.Nov 16 2020, 3:44 PM

Herald added a project: Restricted Project. · View Herald TranscriptNov 16 2020, 3:44 PM

Herald added subscribers: llvm-commits, lxfind, JDevlieghere, hiraditya. · View Herald Transcript

modimo requested review of this revision.Nov 16 2020, 3:44 PM

modimo edited the summary of this revision. (Show Details)

modimo added a subscriber: • test.

Herald added subscribers: pengfei, jfb, kosarev. · View Herald TranscriptNov 16 2020, 3:46 PM

Add more comments in test case

modimo edited the summary of this revision. (Show Details)Nov 16 2020, 3:46 PM

nikic added a reviewer: nikic.Nov 17 2020, 12:33 AM

I don't get this fix. Adding extra scopes to alias.scope should always be conservatively correct. I think there may be some confusion here due to the malformed noalias metadata you're using.

llvm/test/Transforms/MemCpyOpt/callslot_badaa.ll
35	This alias.scope metadata looks corrupt. alias.scope accepts a list of scopes. I don't see any scopes or domains here.

nikic added reviewers: jdoerfert, jeroen.dobbelaere.Nov 17 2020, 11:44 AM

In D91576#2400585, @nikic wrote:

Adding extra scopes to alias.scope should always be conservatively correct

That is also my understanding, but per https://llvm.org/docs/LangRef.html#noalias-and-alias-scope-metadata, in particular this statement:

"When evaluating an aliasing query, if for some domain, the set of scopes with that domain in one instruction’s alias.scope list is a subset of (or equal to) the set of scopes for that domain in another instruction’s noalias list, then the two memory accesses are assumed not to alias.

, if the extra scopes being added meet the condition above, a no-alias answer could be given. This doesn't sound conservative to me. Please correct me if I miss anything.

In D91576#2401162, @hoy wrote:

Adding extra scopes to alias.scope should always be conservatively correct

That is also my understanding, but per https://llvm.org/docs/LangRef.html#noalias-and-alias-scope-metadata, in particular this statement:
"When evaluating an aliasing query, if for some domain, the set of scopes with that domain in one instruction’s alias.scope list is a subset of (or equal to) the set of scopes for that domain in another instruction’s noalias list, then the two memory accesses are assumed not to alias.
, if the extra scopes being added meet the condition above, a no-alias answer could be given. This doesn't sound conservative to me. Please correct me if I miss anything.

Note the "is a subset" requirement. If you add more scopes to the list, then the list is a subset of strictly less noalias scope lists. {!0} is a subset of {!0}, but {!0, !1} is not a subset of {!0}.

In the description, you say:

This causes future optimizations to get incorrect AA results leading to bad code.

Do you have an example/bugreport that shows this ? Based on this testcase, I cannot conclude that what is produced today is wrong. The only thing that might happen is that you get more 'aliasing', as the !alias.scopes have been merged.

In D91576#2401926, @nikic wrote:
In D91576#2401162, @hoy wrote:
Adding extra scopes to alias.scope should always be conservatively correct

That is also my understanding, but per https://llvm.org/docs/LangRef.html#noalias-and-alias-scope-metadata, in particular this statement:
"When evaluating an aliasing query, if for some domain, the set of scopes with that domain in one instruction’s alias.scope list is a subset of (or equal to) the set of scopes for that domain in another instruction’s noalias list, then the two memory accesses are assumed not to alias.
, if the extra scopes being added meet the condition above, a no-alias answer could be given. This doesn't sound conservative to me. Please correct me if I miss anything.
Note the "is a subset" requirement. If you add more scopes to the list, then the list is a subset of strictly less noalias scope lists. {!0} is a subset of {!0}, but {!0, !1} is not a subset of {!0}.

If !0 and !1 don't share the same domain, alias.scope {!0, !1} and noalias {!0} can still be considered non-aliased, since the statement says "if for some domain, ..." ?

BTW, what does a domain stand? Does it represent a set of abstract memory locations? Thanks.

I've looked more into what alias.scopes are and I think I've got a better handle on what's actually wrong. In the meantime let me try to describe the original problem better. The first section is checking my understanding and making sure we're on the same page. Skip to the second part for the bug details.

My understanding of domains

Let's say we have the following (I slightly modified test/Transforms/Inline/noalias.ll for this):

define void @hello(float* noalias nocapture %a, float* nocapture readonly %c) #0 {
entry:
  %0 = load float, float* %c, align 4
  %arrayidx = getelementptr inbounds float, float* %a, i64 5
  store float %0, float* %arrayidx, align 4
  ret void
}

define void @hello2(float* noalias nocapture %a, float* nocapture readonly %c) #0 {
entry:
  %0 = load float, float* %c, align 4
  %arrayidx = getelementptr inbounds float, float* %a, i64 5
  store float %0, float* %arrayidx, align 4
  ret void
}

define void @foo(float* nocapture %a, float* nocapture readonly %c) #0 {
entry:
  tail call void @hello(float* %a, float* %c)
  tail call void @hello2(float* %a, float* %c)
  %0 = load float, float* %c, align 4
  %arrayidx = getelementptr inbounds float, float* %a, i64 7
  store float %0, float* %arrayidx, align 4
  ret void
}

After -inline -enable-noalias-to-md-conversion

; Function Attrs: nounwind uwtable
define void @foo(float* nocapture %a, float* nocapture readonly %c) #0 {
entry:
  %0 = load float, float* %c, align 4, !noalias !0
  %arrayidx.i = getelementptr inbounds float, float* %a, i64 5
  store float %0, float* %arrayidx.i, align 4, !alias.scope !0
  %1 = load float, float* %c, align 4, !noalias !3
  %arrayidx.i1 = getelementptr inbounds float, float* %a, i64 5
  store float %1, float* %arrayidx.i1, align 4, !alias.scope !3
  %2 = load float, float* %c, align 4
  %arrayidx = getelementptr inbounds float, float* %a, i64 7
  store float %2, float* %arrayidx, align 4
  ret void
}

attributes #0 = { nounwind uwtable }

!0 = !{!1}
!1 = distinct !{!1, !2, !"hello: %a"}
!2 = distinct !{!2, !"hello"}
!3 = !{!4}
!4 = distinct !{!4, !5, !"hello2: %a"}
!5 = distinct !{!5, !"hello2"}

Looking at the code in AddAliasScopeMetadata I believe what happens is that:

We create a new domain (!2 and !5) for each inline site. Doesn't matter if the callee is the same (say if I called hello twice) two still get created
Scopes are created around arguments that are marked noalias, so in this case !4 and !1
Alias sets are the last to be created, these are what's tagged on the actual instructions themselves (!0 and !3)

When querying AA results by adding -basic-aa -scoped-noalias-aa -aa-eval -evaluate-aa-metadata -print-all-alias-modref-info to opt on this pair:

MayAlias:   %0 = load float, float* %c, align 4, !noalias !0 <->   store float %1, float* %arrayidx.i1, align 4, !alias.scope !3

The following steps are taken:

Find the underlying domains left to right behind !noalias !0, in this case !2 but it could be more
Find all scopes in in these domains that belong to !0 (noalias_set) and !3 (aliasscope_set)
1. In this case, noalias_set={!1} and aliasscope_set={}
Since aliasscope_set is empty here that's a bail condition and we conclude MayAlias

In a more interesting case such as:

NoAlias:   %0 = load float, float* %c, align 4, !noalias !0 <->   store float %0, float* %arrayidx.i, align 4, !alias.scope !0

Here noalias_set={!1} and aliasscope_set={!1} and we check to see if noalias_set is a superset of aliasscope_set. Since it is, we can concluded this is noalias

In the step (2) above though this is isolated to a single domain at a time. When for any single domain the superset relationship holds, then we immediately conclude noalias.
I think this is because domains are nested based from inlining. If I have domains {!1, !2} in my metadata, that indicates the I'm currently in callee2 of this call chain caller->callee1->callee2. Thus, as long as this property holds the first domain that matches with a superset is enough to conclude noalias.

How the bug I'm looking at manifests

In memcpyopt we can fold 2 memcpys into a single one. This causes the alias.scope metadata to be merged in a union. Suppose I'm merging the following 2 instances (I'm reusing the metadata from the above examples):

define i8 @test(i8 %input) {
  %tmp = alloca i8
  %dst = alloca i8
  %src = alloca i8
  call void @llvm.lifetime.start.p0i8(i64 8, i8* nonnull %src), !noalias !3
  store i8 %input, i8* %src
  call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 8 %tmp, i8* align 8 %src, i64 1, i1 false), !alias.scope !1
  call void @llvm.lifetime.end.p0i8(i64 8, i8* nonnull %src), !noalias !3
  call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 8 %dst, i8* align 8 %tmp, i64 1, i1 false), !alias.scope !3
  %ret_value = load i8, i8* %dst
  ret i8 %ret_value
}

declare void @llvm.lifetime.start.p0i8(i64, i8* nocapture)
declare void @llvm.lifetime.end.p0i8(i64, i8* nocapture)
declare void @llvm.memcpy.p0i8.p0i8.i64(i8*, i8*, i64, i1)

!0 = !{!1}
!1 = distinct !{!1, !2, !"hello: %a"}
!2 = distinct !{!2, !"hello"}
!3 = !{!4}
!4 = distinct !{!4, !5, !"hello2: %a"}
!5 = distinct !{!5, !"hello2"}

The merged instance then will look like (-memcpyopt -enable-noalias-to-md-conversion -basic-aa -scoped-noalias-aa -aa-eval -evaluate-aa-metadata -print-all-alias-modref-info -S)

define i8 @test(i8 %input) {
  %tmp = alloca i8, align 1
  %dst = alloca i8, align 1
  %src = alloca i8, align 1
  call void @llvm.lifetime.start.p0i8(i64 8, i8* nonnull %src), !noalias !0
  store i8 %input, i8* %src, align 1
  call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 8 %dst, i8* align 8 %src, i64 1, i1 false), !alias.scope !3
  call void @llvm.lifetime.end.p0i8(i64 8, i8* nonnull %src), !noalias !0
  %ret_value = load i8, i8* %dst, align 1
  ret i8 %ret_value
}
...
!0 = !{!1}
!1 = distinct !{!1, !2, !"hello2: %a"}
!2 = distinct !{!2, !"hello2"}
!3 = !{!1, !4, !5, !"hello: %a"}
!4 = distinct !{!4, !5, !"hello: %a"}
!5 = distinct !{!5, !"hello"}

!3 now represents the union of the previous scopes. Now, if I ask AA what the relationship between:

NoModRef:   call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 8 %dst, i8* align 8 %src, i64 1, i1 false), !alias.scope !3 <->   call void @llvm.lifetime.end.p0i8(i64 8, i8* nonnull %src), !noalias !0

I get a result which seems incorrect. lifetime.end definitely does ref %src from the previous memcpy. When querying scoped-AA we look at the first domain inside !noalias !0, which is !2 and its scopes which is {!1}. The scopes in that domain for !alias.scope !3 is however also only {!1} as well since the added metadata is in a separate domain. Thus because {!1} is a superset of {!1} scoped-AA concludes its nomodref.

I think the underlying problem is that scoped-AA relies on any appended domains to reside in a caller-callee dependence. If that holds, in this example you wouldn't care that you have another domain in !alias.scope !3 since it would be a callee within a scope that we know doesn't alias. However, because memcpyopt is not inlining and doesn't preserve this rule the merged metadata doesn't express the new dependence correctly and we lose aliasing information.

Hi Modi,

Following line in your input example is wrong and explains why the resulting alias info is corrupt:

call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 8 %tmp, i8* align 8 %src, i64 1, i1 false), !alias.scope !1

It should be !alias.scope !0

Given that, it does not change the result of the alias analysis which indeed seems to be wrong. It seems the 'union' should only be allowed if the domains match (and maybe not just for this case ?)

I would propose as solution to not 'merge' the !alias.scope in this case, but to:

keep it if it is identical on both memcpy
throw it away if it is not identical

Note to self: check why this should not be a problem with the full restrict patches (which does not use !alias.scope).

In D91576#2404054, @jeroen.dobbelaere wrote:

Hi Modi,

Following line in your input example is wrong and explains why the resulting alias info is corrupt:

call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 8 %tmp, i8* align 8 %src, i64 1, i1 false), !alias.scope !1

It should be !alias.scope !0

Thanks for verifying! Good catch, I'll update the patch with this fixed up example.

Given that, it does not change the result of the alias analysis which indeed seems to be wrong. It seems the 'union' should only be allowed if the domains match (and maybe not just for this case ?)

Given how the problem manifests I agree that this is general to any merging of alias.scope. This is a pretty complicated area and I think I've drilled through to the ground truth.

I would propose as solution to not 'merge' the !alias.scope in this case, but to:

keep it if it is identical on both memcpy

throw it away if it is not identical

That's a reasonable approach as well. I mentioned this in the summary as "We could also strip away or 'intersect' alias.scope on the final merge to still maintain correctness". I found in my specific case final codegen was not impacted by hindering this optimization and the trade-off would be less powerful AA for later phases.

Update unit test with correct set of metadata

modimo edited the summary of this revision. (Show Details)Nov 18 2020, 6:16 PM

modimo edited the summary of this revision. (Show Details)Nov 18 2020, 6:19 PM

modimo added inline comments.Nov 18 2020, 6:23 PM

llvm/test/Transforms/MemCpyOpt/callslot_badaa.ll
35	Yeah the metadata in my first example was wrong. I've dug through how scopes and domains work and the current example should be representative of legal metadata.

I would not block the optimization, but just get the !alias.scope metadata correct

llvm/lib/Transforms/Scalar/MemCpyOptimizer.cpp
988 ↗	(On Diff #306275)	Omit MD_alias_scope, as the 'default merge' mechanism is definitely invalid for this case. Replace it (after the combineMetadata) with something like: if (C->getMetadata(LLVMContext::MD_alias_scope) != cpyLoad->getMetadata(LLVMContext::MD_alias_scope) C->setMetadata(LLVMContext::MD_alias_scope, nullptr); (and add a decent comment ;) )

Changing to conservative merge of metadata rather than bailing on optimization. Updated test case due to that.

modimo retitled this revision from [MemCpyOpt] Bail on call slot optimization if it merges alias scopes to [MemCpyOpt] Correctly merge alias scopes during call slot optimization.Nov 19 2020, 3:23 PM

modimo edited the summary of this revision. (Show Details)

@jeroen.dobbelaere I think the correct merging in all cases requires this strategy (or one which checks that all the domains match). If true (@hfinkel?) that'll be a larger change that needs more testing.

smith added a subscriber: smith.Nov 20 2020, 3:28 AM

Unless there's something that makes this issue specific to the call slot optimization, wouldn't it be better to adjust the logic in MDNode::getMostGenericAliasScope() to ensure that any code performing alias scope merging is covered?

In D91576#2411078, @nikic wrote:

Unless there's something that makes this issue specific to the call slot optimization, wouldn't it be better to adjust the logic in MDNode::getMostGenericAliasScope() to ensure that any code performing alias scope merging is covered?

Yes, after thinking some more about it, that is probably the best solution. This also corresponds to what the Full Restrict patches are doing when merging ptr_provenance operands. The merging/handling of different domains should only be done during inlining and that seems to be done explicitly in InlineFunction.cpp. It is for this use case that the langref explains the behavior of alias.scope/noalias wrt handling of aliasing.

llvm/test/Transforms/MemCpyOpt/callslot_badaa.ll
21	It would be nice if we could check that the !alias.scope really has been omitted.

Move the fix to getMostGenericAliasScope. Renamed metadata in test case.

Herald added a subscriber: dexonsmith. · View Herald TranscriptNov 30 2020, 1:01 PM

modimo added inline comments.Nov 30 2020, 1:03 PM

llvm/test/Transforms/MemCpyOpt/callslot_badaa.ll
21	added --match-full-lines in FileCheck to do just that :D

About !alias.scope and !noalias (also see https://llvm.org/docs/LangRef.html#noalias-and-alias-scope-metadata):

'Scopes in one domain don't affect scopes in other domains' -> this is used during inlining
The 'alias.scope' indicates for an instruction A will not alias with another instruction B IF, for a domain, the alias.scopes for A belonging to that domain are a subset of the 'noalias' scopes of instruction B.

Because of that: when merging metadata at the same level (merging two store instructions for example),
you can only keep the domains that are in both instructions valid. For those, you must take the union of the scopes.

Examples: (vXdY : variable X in Domain Y)

a) same domain(s):

store1: alias.scope v1d1
store2: alias.scope v2d1
=>
store_merged: alias.scope  v1d1, v2d2

b) Disjunct domains

store1 : alias.scope  v1d1
store2 : alias.scope  v2d2
=>
store_merged:  NO alias.scope

c) partially overlapping (subset):

store1: alias.scope v1d1
store2: alias.scope v2d1, v3d2
=>
store_merged: alias.scope v1d1, v2d1

d) partially overlapping:

store1: alias.scope v1d1, v2d2
store2: alias.scope v3d2, v4d3
=>
store_merged: alias.scope v2d2, v3d2

NOTE: why is concatenate not valid ?

Take example (b). Suppose you have a:

store3 : noalias  v1d1

now, store1 and store3 do not alias. store2 and store3 might alias.
The merger store_merged might alias with store3. So, it is not valid to just take the union of the scopes of store1 and store2.

llvm/lib/IR/Metadata.cpp
948	Two remarks:http://www.cplusplus.com/reference/algorithm/includes/ ) SmallPtrSet is not sorted, so it does not fulfill the preconditions for std::includes (See http://www.cplusplus.com/reference/algorithm/includes/ ) imho, this is still not a valid merging for !alias.scope (See external comment) You probably want to do: if (ADomains != BDomains) return nullptr; or take a real intersection of the domains, and then the union of the scopes belonging to the intersection of the domains.

tislam added a subscriber: tislam.Dec 1 2020, 11:19 AM

tislam added inline comments.

llvm/test/Transforms/MemCpyOpt/callslot_badaa.ll
12	Regarding the alias.scope metadata `!3 = !{!1, !4, !5, !"callee0: %a"}`. Should this be `!3 = !{!1, !4}`? An alias.scope metadata should be a list of scope metadata and should not include any domain metadata (!5, for example).

Go with intersection of domains then union the scopes within those domains as discussed. Updating tests to match latest behavior.

modimo added inline comments.Dec 1 2020, 3:27 PM

llvm/lib/IR/Metadata.cpp
948	That's good to know about SmallPtrSet, I'll keep that in mind for the future. For (2) your approach is the correct one and I've updated the patch to match.
llvm/test/Transforms/MemCpyOpt/callslot_badaa.ll
12	Yes good catch. I regenerated the output with trunk which matches your statement. I suspect the top half and bottom half got out of sync at some point in the revisions.

kkwli0 added a subscriber: kkwli0.Dec 1 2020, 6:27 PM

This looks good to me. I would improve the alias-scope-merging.ll test to explicitly check the merged scopes.

llvm/test/Analysis/ScopedNoAliasAA/alias-scope-merging.ll

-; CHECK:   ![[SCOPE]] = !{!{{[0-9]+}}, !{{[0-9]+}}}
+; CHECK-DAG: ![[CALLEE0_A:[0-9]+]] = distinct !{!{{[0-9]+}}, !{{[0-9]+}}, !"callee0: %a"}
+; CHECK-DAG: ![[CALLEE0_B:[0-9]+]] = distinct !{!{{[0-9]+}}, !{{[0-9]+}}, !"callee0: %b"}
+; CHECK-DAG: ![[SCOPE]] = !{![[CALLEE0_A]], ![[CALLEE0_B]]}

Updating new merge test to explicitly look for the correct merged scopes. Thanks @jeroen.dobbelaere!

modimo marked an inline comment as done.Dec 2 2020, 1:48 PM

Looks good to me. Thanks for driving this !

This revision is now accepted and ready to land.Dec 2 2020, 2:30 PM

Closed by commit rG18603319321a: [MemCpyOpt] Correctly merge alias scopes during call slot optimization (authored by modimo). · Explain WhyDec 3 2020, 9:24 AM

This revision was automatically updated to reflect the committed changes.

modimo added a commit: rG18603319321a: [MemCpyOpt] Correctly merge alias scopes during call slot optimization.

Certainly, I appreciate the thorough and quick review! Best of luck with your restrict patches!

There is a library layering issue. LLVMAnalysis provides llvm/Analysis/ScopedNoAliasAA.h and depends on LLVMCore.

LLVMCore provides llvm/IR/Metadata.cpp and it should not include a header file in LLVMAnalysis

llvm/Analysis/ScopedNoAliasAA.h will include several other files (AliasAnalysis.h and MemoryLocation.h)

Good catch @MaskRay. If I'm understanding correctly I think the correct approach is to move the class AliasScopeNode I need to metadata.h to fix this?

In D91576#2431659, @modimo wrote:

Good catch @MaskRay. If I'm understanding correctly I think the correct approach is to move the class AliasScopeNode I need to metadata.h to fix this?

LG. And delete #include "llvm/Analysis/ScopedNoAliasAA.h"

MaskRay mentioned this in rG756fa8b9be0c: [Metadata] Fix layer violation in D91576.Dec 3 2020, 10:58 AM

Appreciate the quick commit! Guess I gotta be faster with D92592 :D.

Revision Contents

Path

Size

llvm/

include/

llvm/

Analysis/

ScopedNoAliasAA.h

21 lines

lib/

Analysis/

ScopedNoAliasAA.cpp

25 lines

IR/

Metadata.cpp

28 lines

test/

Analysis/

ScopedNoAliasAA/

alias-scope-merging.ll

37 lines

Transforms/

GVN/

noalias.ll

29 lines

InstCombine/

fold-phi-load-metadata.ll

4 lines

MemCpyOpt/

callslot_badaa.ll

39 lines

NewGVN/

noalias.ll

29 lines

Diff 309283

llvm/include/llvm/Analysis/ScopedNoAliasAA.h

	Show All 19 Lines
	#include <memory>			#include <memory>

	namespace llvm {			namespace llvm {

	class Function;			class Function;
	class MDNode;			class MDNode;
	class MemoryLocation;			class MemoryLocation;

				/// This is a simple wrapper around an MDNode which provides a higher-level
				/// interface by hiding the details of how alias analysis information is encoded
				/// in its operands.
				class AliasScopeNode {
				const MDNode *Node = nullptr;

				public:
				AliasScopeNode() = default;
				explicit AliasScopeNode(const MDNode *N) : Node(N) {}

				/// Get the MDNode for this AliasScopeNode.
				const MDNode *getNode() const { return Node; }

				/// Get the MDNode for this AliasScopeNode's domain.
				const MDNode *getDomain() const {
				if (Node->getNumOperands() < 2)
				return nullptr;
				return dyn_cast_or_null<MDNode>(Node->getOperand(1));
				}
				};

	/// A simple AA result which uses scoped-noalias metadata to answer queries.			/// A simple AA result which uses scoped-noalias metadata to answer queries.
	class ScopedNoAliasAAResult : public AAResultBase<ScopedNoAliasAAResult> {			class ScopedNoAliasAAResult : public AAResultBase<ScopedNoAliasAAResult> {
	friend AAResultBase<ScopedNoAliasAAResult>;			friend AAResultBase<ScopedNoAliasAAResult>;

	public:			public:
	/// Handle invalidation events from the new pass manager.			/// Handle invalidation events from the new pass manager.
	///			///
	/// By definition, this result is stateless and so remains valid.			/// By definition, this result is stateless and so remains valid.
	▲ Show 20 Lines • Show All 55 Lines • Show Last 20 Lines

llvm/lib/Analysis/ScopedNoAliasAA.cpp

	Show First 20 Lines • Show All 44 Lines • ▼ Show 20 Lines
	using namespace llvm;			using namespace llvm;

	// A handy option for disabling scoped no-alias functionality. The same effect			// A handy option for disabling scoped no-alias functionality. The same effect
	// can also be achieved by stripping the associated metadata tags from IR, but			// can also be achieved by stripping the associated metadata tags from IR, but
	// this option is sometimes more convenient.			// this option is sometimes more convenient.
	static cl::opt<bool> EnableScopedNoAlias("enable-scoped-noalias",			static cl::opt<bool> EnableScopedNoAlias("enable-scoped-noalias",
	cl::init(true), cl::Hidden);			cl::init(true), cl::Hidden);

	namespace {

	/// This is a simple wrapper around an MDNode which provides a higher-level
	/// interface by hiding the details of how alias analysis information is encoded
	/// in its operands.
	class AliasScopeNode {
	const MDNode *Node = nullptr;

	public:
	AliasScopeNode() = default;
	explicit AliasScopeNode(const MDNode *N) : Node(N) {}

	/// Get the MDNode for this AliasScopeNode.
	const MDNode *getNode() const { return Node; }

	/// Get the MDNode for this AliasScopeNode's domain.
	const MDNode *getDomain() const {
	if (Node->getNumOperands() < 2)
	return nullptr;
	return dyn_cast_or_null<MDNode>(Node->getOperand(1));
	}
	};

	} // end anonymous namespace

	AliasResult ScopedNoAliasAAResult::alias(const MemoryLocation &LocA,			AliasResult ScopedNoAliasAAResult::alias(const MemoryLocation &LocA,
	const MemoryLocation &LocB,			const MemoryLocation &LocB,
	AAQueryInfo &AAQI) {			AAQueryInfo &AAQI) {
	if (!EnableScopedNoAlias)			if (!EnableScopedNoAlias)
	return AAResultBase::alias(LocA, LocB, AAQI);			return AAResultBase::alias(LocA, LocB, AAQI);

	// Get the attached MDNodes.			// Get the attached MDNodes.
	const MDNode AScopes = LocA.AATags.Scope, BScopes = LocB.AATags.Scope;			const MDNode AScopes = LocA.AATags.Scope, BScopes = LocB.AATags.Scope;
	▲ Show 20 Lines • Show All 126 Lines • Show Last 20 Lines

llvm/lib/IR/Metadata.cpp

Show All 21 Lines
#include "llvm/ADT/STLExtras.h"		#include "llvm/ADT/STLExtras.h"
#include "llvm/ADT/SetVector.h"		#include "llvm/ADT/SetVector.h"
#include "llvm/ADT/SmallPtrSet.h"		#include "llvm/ADT/SmallPtrSet.h"
#include "llvm/ADT/SmallSet.h"		#include "llvm/ADT/SmallSet.h"
#include "llvm/ADT/SmallVector.h"		#include "llvm/ADT/SmallVector.h"
#include "llvm/ADT/StringMap.h"		#include "llvm/ADT/StringMap.h"
#include "llvm/ADT/StringRef.h"		#include "llvm/ADT/StringRef.h"
#include "llvm/ADT/Twine.h"		#include "llvm/ADT/Twine.h"
		#include "llvm/Analysis/ScopedNoAliasAA.h"
#include "llvm/IR/Argument.h"		#include "llvm/IR/Argument.h"
#include "llvm/IR/BasicBlock.h"		#include "llvm/IR/BasicBlock.h"
#include "llvm/IR/Constant.h"		#include "llvm/IR/Constant.h"
#include "llvm/IR/ConstantRange.h"		#include "llvm/IR/ConstantRange.h"
#include "llvm/IR/Constants.h"		#include "llvm/IR/Constants.h"
#include "llvm/IR/DebugInfoMetadata.h"		#include "llvm/IR/DebugInfoMetadata.h"
#include "llvm/IR/DebugLoc.h"		#include "llvm/IR/DebugLoc.h"
#include "llvm/IR/Function.h"		#include "llvm/IR/Function.h"
▲ Show 20 Lines • Show All 883 Lines • ▼ Show 20 Lines	MDNode MDNode::intersect(MDNode A, MDNode *B) {
// behaviour? Or was that an unintended side-effect of node uniquing?		// behaviour? Or was that an unintended side-effect of node uniquing?
return getOrSelfReference(A->getContext(), MDs.getArrayRef());		return getOrSelfReference(A->getContext(), MDs.getArrayRef());
}		}

MDNode MDNode::getMostGenericAliasScope(MDNode A, MDNode *B) {		MDNode MDNode::getMostGenericAliasScope(MDNode A, MDNode *B) {
if (!A \|\| !B)		if (!A \|\| !B)
return nullptr;		return nullptr;

return concatenate(A, B);		// Take the intersection of domains then union the scopes
		// within those domains
		SmallPtrSet<const MDNode *, 16> ADomains;
		SmallPtrSet<const MDNode *, 16> IntersectDomains;
		SmallSetVector<Metadata *, 4> MDs;
		for (const MDOperand &MDOp : A->operands())
		if (const MDNode *NAMD = dyn_cast<MDNode>(MDOp))
		if (const MDNode *Domain = AliasScopeNode(NAMD).getDomain())
		ADomains.insert(Domain);

		for (const MDOperand &MDOp : B->operands())
		if (const MDNode *NAMD = dyn_cast<MDNode>(MDOp))
		if (const MDNode *Domain = AliasScopeNode(NAMD).getDomain())
		if (ADomains.contains(Domain)) {
		IntersectDomains.insert(Domain);
		MDs.insert(MDOp);
		}

		for (const MDOperand &MDOp : A->operands())
		jeroen.dobbelaereUnsubmitted Not Done Reply Inline Actions Two remarks:http://www.cplusplus.com/reference/algorithm/includes/ ) SmallPtrSet is not sorted, so it does not fulfill the preconditions for std::includes (See http://www.cplusplus.com/reference/algorithm/includes/ ) imho, this is still not a valid merging for !alias.scope (See external comment) You probably want to do: if (ADomains != BDomains) return nullptr; or take a real intersection of the domains, and then the union of the scopes belonging to the intersection of the domains. jeroen.dobbelaere: Two remarks:http://www.cplusplus.com/reference/algorithm/includes/ ) 1. SmallPtrSet is not…
		modimoAuthorUnsubmitted Done Reply Inline Actions That's good to know about SmallPtrSet, I'll keep that in mind for the future. For (2) your approach is the correct one and I've updated the patch to match. modimo: That's good to know about SmallPtrSet, I'll keep that in mind for the future. For (2) your…
		if (const MDNode *NAMD = dyn_cast<MDNode>(MDOp))
		if (const MDNode *Domain = AliasScopeNode(NAMD).getDomain())
		if (IntersectDomains.contains(Domain))
		MDs.insert(MDOp);

		return MDs.empty() ? nullptr
		: getOrSelfReference(A->getContext(), MDs.getArrayRef());
}		}

MDNode MDNode::getMostGenericFPMath(MDNode A, MDNode *B) {		MDNode MDNode::getMostGenericFPMath(MDNode A, MDNode *B) {
if (!A \|\| !B)		if (!A \|\| !B)
return nullptr;		return nullptr;

APFloat AVal = mdconst::extract<ConstantFP>(A->getOperand(0))->getValueAPF();		APFloat AVal = mdconst::extract<ConstantFP>(A->getOperand(0))->getValueAPF();
APFloat BVal = mdconst::extract<ConstantFP>(B->getOperand(0))->getValueAPF();		APFloat BVal = mdconst::extract<ConstantFP>(B->getOperand(0))->getValueAPF();
▲ Show 20 Lines • Show All 578 Lines • Show Last 20 Lines

llvm/test/Analysis/ScopedNoAliasAA/alias-scope-merging.ll

This file was added.

				; RUN: opt < %s -S -memcpyopt \| FileCheck --match-full-lines %s

				; Alias scopes are merged by taking the intersection of domains, then the union of the scopes within those domains
				define i8 @test(i8 %input) {
				%tmp = alloca i8
				%dst = alloca i8
				%src = alloca i8
				; CHECK: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 8 %dst, i8* align 8 %src, i64 1, i1 false), !alias.scope ![[SCOPE:[0-9]+]]
				call void @llvm.lifetime.start.p0i8(i64 8, i8* nonnull %src), !noalias !4
				store i8 %input, i8* %src
				call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 8 %tmp, i8* align 8 %src, i64 1, i1 false), !alias.scope !0
				call void @llvm.lifetime.end.p0i8(i64 8, i8* nonnull %src), !noalias !4
				call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 8 %dst, i8* align 8 %tmp, i64 1, i1 false), !alias.scope !4
				%ret_value = load i8, i8* %dst
				ret i8 %ret_value
				}

				; Merged scope contains "callee0: %a" and "callee0 : %b"
				; CHECK-DAG: ![[CALLEE0_A:[0-9]+]] = distinct !{!{{[0-9]+}}, !{{[0-9]+}}, !"callee0: %a"}
				; CHECK-DAG: ![[CALLEE0_B:[0-9]+]] = distinct !{!{{[0-9]+}}, !{{[0-9]+}}, !"callee0: %b"}
				jeroen.dobbelaereUnsubmitted Done Reply Inline Actions -; CHECK: ![[SCOPE]] = !{!{{[0-9]+}}, !{{[0-9]+}}} +; CHECK-DAG: ![[CALLEE0_A:[0-9]+]] = distinct !{!{{[0-9]+}}, !{{[0-9]+}}, !"callee0: %a"} +; CHECK-DAG: ![[CALLEE0_B:[0-9]+]] = distinct !{!{{[0-9]+}}, !{{[0-9]+}}, !"callee0: %b"} +; CHECK-DAG: ![[SCOPE]] = !{![[CALLEE0_A]], ![[CALLEE0_B]]} jeroen.dobbelaere: -; CHECK: ![[SCOPE]] = !{!{{[0-9]+}}, !{{[0-9]+}}} +; CHECK-DAG: ![[CALLEE0_A:[0-9]+]] =…
				; CHECK-DAG: ![[SCOPE]] = !{![[CALLEE0_A]], ![[CALLEE0_B]]}

				declare void @llvm.lifetime.start.p0i8(i64, i8* nocapture)
				declare void @llvm.lifetime.end.p0i8(i64, i8* nocapture)
				declare void @llvm.memcpy.p0i8.p0i8.i64(i8, i8, i64, i1)

				!0 = !{!1, !7}
				!1 = distinct !{!1, !3, !"callee0: %a"}
				!2 = distinct !{!2, !3, !"callee0: %b"}
				!3 = distinct !{!3, !"callee0"}

				!4 = !{!2, !5}
				!5 = distinct !{!5, !6, !"callee1: %a"}
				!6 = distinct !{!6, !"callee1"}

				!7 = distinct !{!7, !8, !"callee2: %a"}
				!8 = distinct !{!8, !"callee2"}

llvm/test/Transforms/GVN/noalias.ll

	; RUN: opt -scoped-noalias-aa -basic-aa -gvn -S < %s \| FileCheck %s			; RUN: opt -scoped-noalias-aa -basic-aa -gvn -S < %s \| FileCheck %s

	define i32 @test1(i32* %p, i32* %q) {			define i32 @test1(i32* %p, i32* %q) {
	; CHECK-LABEL: @test1(i32* %p, i32* %q)			; CHECK-LABEL: @test1(i32* %p, i32* %q)
	; CHECK: load i32, i32* %p			; CHECK: load i32, i32* %p
	; CHECK-NOT: noalias			; CHECK-NOT: noalias
	; CHECK: %c = add i32 %a, %a			; CHECK: %c = add i32 %a, %a
	%a = load i32, i32* %p, !noalias !0			%a = load i32, i32* %p, !noalias !3
	%b = load i32, i32* %p			%b = load i32, i32* %p
	%c = add i32 %a, %b			%c = add i32 %a, %b
	ret i32 %c			ret i32 %c
	}			}

	define i32 @test2(i32* %p, i32* %q) {			define i32 @test2(i32* %p, i32* %q) {
	; CHECK-LABEL: @test2(i32* %p, i32* %q)			; CHECK-LABEL: @test2(i32* %p, i32* %q)
	; CHECK: load i32, i32* %p, align 4, !alias.scope !0			; CHECK: load i32, i32* %p, align 4, !alias.scope ![[SCOPE1:[0-9]+]]
	; CHECK: %c = add i32 %a, %a			; CHECK: %c = add i32 %a, %a
	%a = load i32, i32* %p, !alias.scope !0			%a = load i32, i32* %p, !alias.scope !3
	%b = load i32, i32* %p, !alias.scope !0			%b = load i32, i32* %p, !alias.scope !3
	%c = add i32 %a, %b			%c = add i32 %a, %b
	ret i32 %c			ret i32 %c
	}			}

	; FIXME: In this case we can do better than intersecting the scopes, and can
	; concatenate them instead. Both loads are in the same basic block, the first
	; makes the second safe to speculatively execute, and there are no calls that may
	; throw in between.
	define i32 @test3(i32* %p, i32* %q) {			define i32 @test3(i32* %p, i32* %q) {
	; CHECK-LABEL: @test3(i32* %p, i32* %q)			; CHECK-LABEL: @test3(i32* %p, i32* %q)
	; CHECK: load i32, i32* %p, align 4, !alias.scope !1			; CHECK: load i32, i32* %p, align 4, !alias.scope ![[SCOPE2:[0-9]+]]
	; CHECK: %c = add i32 %a, %a			; CHECK: %c = add i32 %a, %a
	%a = load i32, i32* %p, !alias.scope !1			%a = load i32, i32* %p, !alias.scope !4
	%b = load i32, i32* %p, !alias.scope !2			%b = load i32, i32* %p, !alias.scope !5
	%c = add i32 %a, %b			%c = add i32 %a, %b
	ret i32 %c			ret i32 %c
	}			}

				; CHECK: ![[SCOPE1]] = !{!{{[0-9]+}}}
				; CHECK: ![[SCOPE2]] = !{!{{[0-9]+}}, !{{[0-9]+}}}
	declare i32 @foo(i32*) readonly			declare i32 @foo(i32*) readonly

	!0 = !{!0}			!0 = distinct !{!0, !2, !"callee0: %a"}
	!1 = !{!1}			!1 = distinct !{!1, !2, !"callee0: %b"}
	!2 = !{!0, !1}			!2 = distinct !{!2, !"callee0"}

				!3 = !{!0}
				!4 = !{!1}
				!5 = !{!0, !1}

llvm/test/Transforms/InstCombine/fold-phi-load-metadata.ll

Show All 34 Lines	return: ; preds = %if.end, %if.then
store i32* %pval, i32** @g1, align 8		store i32* %pval, i32** @g1, align 8
ret i32 %retval		ret i32 %retval
}		}

; CHECK: ![[EMPTYNODE]] = !{}		; CHECK: ![[EMPTYNODE]] = !{}
; CHECK: ![[TBAA]] = !{![[TAG1:[0-9]+]], ![[TAG1]], i64 0}		; CHECK: ![[TBAA]] = !{![[TAG1:[0-9]+]], ![[TAG1]], i64 0}
; CHECK: ![[TAG1]] = !{!"int", !{{[0-9]+}}, i64 0}		; CHECK: ![[TAG1]] = !{!"int", !{{[0-9]+}}, i64 0}
; CHECK: ![[RANGE]] = !{i32 10, i32 25}		; CHECK: ![[RANGE]] = !{i32 10, i32 25}
; CHECK: ![[ALIAS_SCOPE]] = !{![[SCOPE0:[0-9]+]], ![[SCOPE2:[0-9]+]], ![[SCOPE1:[0-9]+]]}		; CHECK: ![[ALIAS_SCOPE]] = !{![[SCOPE0:[0-9]+]], ![[SCOPE1:[0-9]+]], ![[SCOPE2:[0-9]+]]}
; CHECK: ![[SCOPE0]] = distinct !{![[SCOPE0]], !{{[0-9]+}}, !"scope0"}		; CHECK: ![[SCOPE0]] = distinct !{![[SCOPE0]], !{{[0-9]+}}, !"scope0"}
; CHECK: ![[SCOPE2]] = distinct !{![[SCOPE2]], !{{[0-9]+}}, !"scope2"}
; CHECK: ![[SCOPE1]] = distinct !{![[SCOPE1]], !{{[0-9]+}}, !"scope1"}		; CHECK: ![[SCOPE1]] = distinct !{![[SCOPE1]], !{{[0-9]+}}, !"scope1"}
		; CHECK: ![[SCOPE2]] = distinct !{![[SCOPE2]], !{{[0-9]+}}, !"scope2"}
; CHECK: ![[NOALIAS]] = !{![[SCOPE3:[0-9]+]]}		; CHECK: ![[NOALIAS]] = !{![[SCOPE3:[0-9]+]]}
; CHECK: ![[SCOPE3]] = distinct !{![[SCOPE3]], !{{[0-9]+}}, !"scope3"}		; CHECK: ![[SCOPE3]] = distinct !{![[SCOPE3]], !{{[0-9]+}}, !"scope3"}

!0 = !{!1, !4, i64 4}		!0 = !{!1, !4, i64 4}
!1 = !{!"", !7, i64 0, !4, i64 4}		!1 = !{!"", !7, i64 0, !4, i64 4}
!2 = !{!3, !4, i64 0}		!2 = !{!3, !4, i64 0}
!3 = !{!"", !4, i64 0, !7, i64 4}		!3 = !{!"", !4, i64 0, !7, i64 4}
!4 = !{!"int", !5, i64 0}		!4 = !{!"int", !5, i64 0}
Show All 15 Lines

llvm/test/Transforms/MemCpyOpt/callslot_badaa.ll

This file was added.

				; RUN: opt < %s -S -memcpyopt \| FileCheck --match-full-lines %s

				; Make sure callslot optimization merges alias.scope metadata correctly when it merges instructions.
				; Merging here naively generates:
				; call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 8 %dst, i8* align 8 %src, i64 1, i1 false), !alias.scope !3
				; call void @llvm.lifetime.end.p0i8(i64 8, i8* nonnull %src), !noalias !0
				; ...
				; !0 = !{!1}
				; !1 = distinct !{!1, !2, !"callee1: %a"}
				; !2 = distinct !{!2, !"callee1"}
				; !3 = !{!1, !4}
				; !4 = distinct !{!4, !5, !"callee0: %a"}
				tislamUnsubmitted Done Reply Inline Actions Regarding the alias.scope metadata `!3 = !{!1, !4, !5, !"callee0: %a"}`. Should this be `!3 = !{!1, !4}`? An alias.scope metadata should be a list of scope metadata and should not include any domain metadata (!5, for example). tislam: Regarding the alias.scope metadata `!3 = !{!1, !4, !5, !"callee0: %a"}`. Should this be `!3 = !
				modimoAuthorUnsubmitted Done Reply Inline Actions Yes good catch. I regenerated the output with trunk which matches your statement. I suspect the top half and bottom half got out of sync at some point in the revisions. modimo: Yes good catch. I regenerated the output with trunk which matches your statement. I suspect the…
				; !5 = distinct !{!5, !"callee0"}
				; Which is incorrect because the lifetime.end of %src will now "noalias" the above memcpy.
				define i8 @test(i8 %input) {
				%tmp = alloca i8
				%dst = alloca i8
				%src = alloca i8
				; NOTE: we're matching the full line and looking for the lack of !alias.scope here
				; CHECK: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 8 %dst, i8* align 8 %src, i64 1, i1 false)
				call void @llvm.lifetime.start.p0i8(i64 8, i8* nonnull %src), !noalias !3
				jeroen.dobbelaereUnsubmitted Not Done Reply Inline Actions It would be nice if we could check that the !alias.scope really has been omitted. jeroen.dobbelaere: It would be nice if we could check that the !alias.scope really has been omitted.
				modimoAuthorUnsubmitted Done Reply Inline Actions added --match-full-lines in FileCheck to do just that :D modimo: added --match-full-lines in FileCheck to do just that :D
				store i8 %input, i8* %src
				call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 8 %tmp, i8* align 8 %src, i64 1, i1 false), !alias.scope !0
				call void @llvm.lifetime.end.p0i8(i64 8, i8* nonnull %src), !noalias !3
				call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 8 %dst, i8* align 8 %tmp, i64 1, i1 false), !alias.scope !3
				%ret_value = load i8, i8* %dst
				ret i8 %ret_value
				}

				declare void @llvm.lifetime.start.p0i8(i64, i8* nocapture)
				declare void @llvm.lifetime.end.p0i8(i64, i8* nocapture)
				declare void @llvm.memcpy.p0i8.p0i8.i64(i8, i8, i64, i1)

				!0 = !{!1}
				!1 = distinct !{!1, !2, !"callee0: %a"}
				nikicUnsubmitted Not Done Reply Inline Actions This alias.scope metadata looks corrupt. alias.scope accepts a list of scopes. I don't see any scopes or domains here. nikic: This alias.scope metadata looks corrupt. alias.scope accepts a list of scopes. I don't see any…
				modimoAuthorUnsubmitted Done Reply Inline Actions Yeah the metadata in my first example was wrong. I've dug through how scopes and domains work and the current example should be representative of legal metadata. modimo: Yeah the metadata in my first example was wrong. I've dug through how scopes and domains work…
				!2 = distinct !{!2, !"callee0"}
				!3 = !{!4}
				!4 = distinct !{!4, !5, !"callee1: %a"}
				!5 = distinct !{!5, !"callee1"}

llvm/test/Transforms/NewGVN/noalias.ll

	; RUN: opt -scoped-noalias-aa -basic-aa -newgvn -S < %s \| FileCheck %s			; RUN: opt -scoped-noalias-aa -basic-aa -newgvn -S < %s \| FileCheck %s

	define i32 @test1(i32* %p, i32* %q) {			define i32 @test1(i32* %p, i32* %q) {
	; CHECK-LABEL: @test1(i32* %p, i32* %q)			; CHECK-LABEL: @test1(i32* %p, i32* %q)
	; CHECK: load i32, i32* %p			; CHECK: load i32, i32* %p
	; CHECK-NOT: noalias			; CHECK-NOT: noalias
	; CHECK: %c = add i32 %a, %a			; CHECK: %c = add i32 %a, %a
	%a = load i32, i32* %p, !noalias !0			%a = load i32, i32* %p, !noalias !3
	%b = load i32, i32* %p			%b = load i32, i32* %p
	%c = add i32 %a, %b			%c = add i32 %a, %b
	ret i32 %c			ret i32 %c
	}			}

	define i32 @test2(i32* %p, i32* %q) {			define i32 @test2(i32* %p, i32* %q) {
	; CHECK-LABEL: @test2(i32* %p, i32* %q)			; CHECK-LABEL: @test2(i32* %p, i32* %q)
	; CHECK: load i32, i32* %p, align 4, !alias.scope !0			; CHECK: load i32, i32* %p, align 4, !alias.scope ![[SCOPE1:[0-9]+]]
	; CHECK: %c = add i32 %a, %a			; CHECK: %c = add i32 %a, %a
	%a = load i32, i32* %p, !alias.scope !0			%a = load i32, i32* %p, !alias.scope !3
	%b = load i32, i32* %p, !alias.scope !0			%b = load i32, i32* %p, !alias.scope !3
	%c = add i32 %a, %b			%c = add i32 %a, %b
	ret i32 %c			ret i32 %c
	}			}

	; FIXME: In this case we can do better than intersecting the scopes, and can
	; concatenate them instead. Both loads are in the same basic block, the first
	; makes the second safe to speculatively execute, and there are no calls that may
	; throw in between.
	define i32 @test3(i32* %p, i32* %q) {			define i32 @test3(i32* %p, i32* %q) {
	; CHECK-LABEL: @test3(i32* %p, i32* %q)			; CHECK-LABEL: @test3(i32* %p, i32* %q)
	; CHECK: load i32, i32* %p, align 4, !alias.scope !1			; CHECK: load i32, i32* %p, align 4, !alias.scope ![[SCOPE2:[0-9]+]]
	; CHECK: %c = add i32 %a, %a			; CHECK: %c = add i32 %a, %a
	%a = load i32, i32* %p, !alias.scope !1			%a = load i32, i32* %p, !alias.scope !4
	%b = load i32, i32* %p, !alias.scope !2			%b = load i32, i32* %p, !alias.scope !5
	%c = add i32 %a, %b			%c = add i32 %a, %b
	ret i32 %c			ret i32 %c
	}			}

				; CHECK: ![[SCOPE1]] = !{!{{[0-9]+}}}
				; CHECK: ![[SCOPE2]] = !{!{{[0-9]+}}, !{{[0-9]+}}}
	declare i32 @foo(i32*) readonly			declare i32 @foo(i32*) readonly

	!0 = !{!0}			!0 = distinct !{!0, !2, !"callee0: %a"}
	!1 = !{!1}			!1 = distinct !{!1, !2, !"callee0: %b"}
	!2 = !{!0, !1}			!2 = distinct !{!2, !"callee0"}

				!3 = !{!0}
				!4 = !{!1}
				!5 = !{!0, !1}

This is an archive of the discontinued LLVM Phabricator instance.

[MemCpyOpt] Correctly merge alias scopes during call slot optimizationClosedPublic

Details

Diff Detail

Event Timeline

My understanding of domains

How the bug I'm looking at manifests

Revision Contents

Diff 309283

llvm/include/llvm/Analysis/ScopedNoAliasAA.h

llvm/lib/Analysis/ScopedNoAliasAA.cpp

llvm/lib/IR/Metadata.cpp

llvm/test/Analysis/ScopedNoAliasAA/alias-scope-merging.ll

llvm/test/Transforms/GVN/noalias.ll

llvm/test/Transforms/InstCombine/fold-phi-load-metadata.ll

llvm/test/Transforms/MemCpyOpt/callslot_badaa.ll

llvm/test/Transforms/NewGVN/noalias.ll

[MemCpyOpt] Correctly merge alias scopes during call slot optimization
ClosedPublic