This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/Transforms/IPO/
-
llvm/
-
Transforms/
-
IPO/
6
Attributor.h
-
lib/Transforms/IPO/
-
Transforms/
-
IPO/
-
Attributor.cpp
1/11
AttributorAttributes.cpp
-
test/Transforms/Attributor/
-
Transforms/
-
Attributor/
-
call-simplify-pointer-info.ll
2/3
value-simplify-pointer-info.ll

Differential D136526

[AAPointerInfo] refactor how offsets and Access objects are tracked
ClosedPublic

Authored by sameerds on Oct 22 2022, 6:15 AM.

Download Raw Diff

Details

Reviewers

jdoerfert
sstefan1
ye-luo

Commits

rG376d0469b917: [AAPointerInfo] refactor how offsets and Access objects are tracked
rGb756096b0cbe: [AAPointerInfo] refactor how offsets and Access objects are tracked

Summary

AAPointerInfo now maintains a list of all Access objects that it owns, along
with the following maps:

OffsetBins: OffsetAndSize -> { Access }
InstTupleMap: RemoteI x LocalI -> Access

A RemoteI is any instruction that accesses memory. RemoteI is different from
LocalI if and only if LocalI is a call; then RemoteI is some instruction in the
callgraph starting from LocalI.

Motivation: When AAPointerInfo recomputes the offset for an instruction, it sets
the value to Unknown if the new offset is not the same as the old offset. The
instruction must now be moved from its current bin to the bin corresponding to
the new offset. This happens for example, when:

A PHINode has operands that result in different offsets.
The same remote inst is reachable from the same local inst via different paths in the callgraph:

  A (local inst)
  |
  B
 / \
C1  C2
 \ /
  D (remote inst)

This fixes a bug where a store is incorrectly eliminated in a lit test.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

sameerds created this revision.Oct 22 2022, 6:15 AM

Herald added a project: Restricted Project. · View Herald TranscriptOct 22 2022, 6:15 AM

Herald added subscribers: ormris, okura, jdoerfert and 2 others. · View Herald Transcript

sameerds requested review of this revision.Oct 22 2022, 6:15 AM

Herald added a reviewer: jdoerfert. · View Herald TranscriptOct 22 2022, 6:15 AM

Herald added a reviewer: sstefan1. · View Herald Transcript

Herald added a project: Restricted Project. · View Herald Transcript

Herald added a subscriber: llvm-commits. · View Herald Transcript

Harbormaster completed remote builds in B193734: Diff 469888.Oct 22 2022, 6:53 AM

Can you point out the test that this fixes? Maybe a standalone reproducer would be good too. I read only the commit message and I'm not sure I understand the problem.

In D136526#3877140, @jdoerfert wrote:

Can you point out the test that this fixes? Maybe a standalone reproducer would be good too. I read only the commit message and I'm not sure I understand the problem.

I did leave comments on the Phab page for that other commit ... https://reviews.llvm.org/rGad98ef8be409 . But I guess it's incorrect to say that the other commit introduced the bug. It's a latent bug that was exposed by tests like phi_no_store_2.

Essentially, the PHI %p in phi_no_store_2 is visited twice. The first time, it is assigned the offset 0, and the second time it is set to unknown. This results in the following bins being printed in the debug log:

Accesses by bin after update:
[0-1] : 1

10 - store i8 1, i8* %p, align 1
- c: i8 1

[-2--1] : 1

10 - store i8 1, i8* %p, align 1
- c: i8 1

Then forallInterferingAccesses comes along and sees only the first bin with the assumption that one access can only be in one bin. Hence it concludes that the store does not have a potential copy in load %l21, and hence eliminates the store. This is incorrect, since the load is still there and it should see the value written by that store. The store can be eliminated only by recognizing the potential copy and first propagating the value to the load.

The root cause is that if an access is already in a bin, and its offset is updated to unknown, then the access needs to first be removed from its current bin. This change ensures this behaviour by improving the plumbing of how accesses are tracked in bin, so that clients don't have to worry about these things.

In D136526#3877410, @sameerds wrote:

The root cause is that if an access is already in a bin, and its offset is updated to unknown, then the access needs to first be removed from its current bin. This change ensures this behaviour by improving the plumbing of how accesses are tracked in bin, so that clients don't have to worry about these things.

I see. The error description makes sense. Can we split this into a fix and the rewrite though? The fix would be to remove the entry from the bin as soon as the offset changes (which can only become invalid I assume). The rewrite gets rid of the bins and keeps lists of instructions per access, right? There is a change in one of the tests that I think is caused by the rewrite and should not be. We also need tests to capture the benefits of the rewrite then. Also, are the bins itself bad or simply bad that we have on offset per access?

llvm/test/Transforms/Attributor/value-simplify-pointer-info.ll
746	Why do we miss out on these propagations? There are no PHIs involved, right? Something is amiss.

The refactor now accounts for remote/local instructions. This restores all the other affected lit tests.

sameerds edited the summary of this revision. (Show Details)Oct 25 2022, 2:37 AM

sameerds edited the summary of this revision. (Show Details)

In D136526#3880473, @jdoerfert wrote:

In D136526#3877410, @sameerds wrote:

The root cause is that if an access is already in a bin, and its offset is updated to unknown, then the access needs to first be removed from its current bin. This change ensures this behaviour by improving the plumbing of how accesses are tracked in bin, so that clients don't have to worry about these things.

I see. The error description makes sense. Can we split this into a fix and the rewrite though? The fix would be to remove the entry from the bin as soon as the offset changes (which can only become invalid I assume). The rewrite gets rid of the bins and keeps lists of instructions per access, right? There is a change in one of the tests that I think is caused by the rewrite and should not be. We also need tests to capture the benefits of the rewrite then. Also, are the bins itself bad or simply bad that we have on offset per access?

It's the one-offset-per-access that causes loss of information. I've updated the change description to talk more about this. Hope that clarifies the situation! Also fixed the side effect on other places. I had failed to distinguish between multiple local insts pointing to the same remote inst.

sameerds added inline comments.Oct 25 2022, 2:43 AM

llvm/test/Transforms/Attributor/value-simplify-pointer-info.ll
746	When I don't distinguish between calls that reach the same remote inst, that remote inst is conservatively treated as unknown. I thought it would be okay to "improve" this later, but then realized that the solution is very relevant to the refactoring. The new change does it right, and this side-effect disappears.

Harbormaster completed remote builds in B194131: Diff 470424.Oct 25 2022, 4:47 AM

Test impact looks good now. I have two issues with the patch but we can resolve both. One is the update of the OAS which needs to also trigger an update of the access kind and value, e.g., if we loose precise information we cannot keep the MUST kind and value around. Second, we should probably avoid nested DenseMaps, I tried to propose some alternatives.

llvm/include/llvm/Transforms/IPO/Attributor.h
265	At this point we need to signal the access it is a May not Must access anymore if the offset or size changed from something that was not unassigned, right? We also need to give up on the value I suppose.
267	Not super happy about first and second. Should we wrap it in `setOffset` `setSize` ?
5122	Documentation, for all of these. See the surrounding code.
llvm/lib/Transforms/IPO/AttributorAttributes.cpp
849	We could use Access* here to get stable addresses.
852	I'm not sure nested DenseMaps are a good idea. I think it usually ends up using way too much memory. Let's try to use an encoding without it.
891	This looks like we could map instructions to a list of OAS instead. Or directly the Access*?
909	Also here, if we have `RemoteI -> list<Access*>` we should be fine, no?

sameerds added inline comments.Oct 26 2022, 9:57 AM

llvm/lib/Transforms/IPO/AttributorAttributes.cpp
849	We could, but then we have to worry about memory allocation on the stack. In fact, I am still wondering about the copy constructor for PointerInfo::State. It seems to make a shallow copy of AccessBins. Doesn't that result in a double free? If we keep a list of Access, instead of a list of Access, it looks like more code with no clear benefit. TBH, I am a big fan of "never use new". For now, the unsigned index into the list of Access is sufficient for its very local use. If it needs to be exposed to clients, maybe could wrap it in a custom iterator instead of exposing stable pointers? Of course I am open to using a list of Access if you think that is beneficial.

sameerds added inline comments.Oct 26 2022, 9:59 AM

llvm/lib/Transforms/IPO/AttributorAttributes.cpp
849	s/stack/heap

jdoerfert added inline comments.Oct 26 2022, 10:56 AM

llvm/lib/Transforms/IPO/AttributorAttributes.cpp
849	"never use new" is not a good argument. For one, we have a bump allocator for the Attributor so the "cost" argument doesn't apply at all. Second, even if we don't use the bump allocator, new is not inherently costly. DenseMaps of DenseMaps (or similar nested structures) can hover grow fast. They also will inherently use new, so all you do here is to hide the heap allocations and cause more of them. We could, but then we have to worry about memory allocation on the stack. In fact, I am still wondering about the copy constructor for PointerInfo::State. It seems to make a shallow copy of AccessBins. Doesn't that result in a double free? As far as I can tell, PpinterInfo::State did not have a copy constructor. The move constructor did not cause double free problems as it cleared the other access bin after moving the content. If the new version requires adequate handling for copy/move constructor, that should be easy to add, no? Of course I am open to using a list of Access* if you think that is beneficial. As I said, I had mixed experience with nested maps. This is not exactly the same case but the first one I could find: https://reviews.llvm.org/rG14cb0bdf2b6ca0b7befbb07fe9f73dad5786f59b

sameerds added inline comments.Oct 26 2022, 6:51 PM

llvm/lib/Transforms/IPO/AttributorAttributes.cpp
849	Right, there is no copy constructor in state, but there is a copy assignment operator which does a shallow copy. I guess it didn't matter because it only got used in places where the state is empty. We don't seem to be talking about the same thing here. I totally agree with not using DenseMap, and I am working on using a list instead. My comment was a reply to this granparent comment: We could use Access* here to get stable addresses. Where in this specific case do you see the need for stable addresses? To me, stable addresses means something like this: SmallVector<Access> AccessList auto Acc = new Access; AccessList.push_back(Acc) Which means now I have to remember to destroy those Access objects later and also worry about deep copies. If the new version requires adequate handling for copy/move constructor, that should be easy to add, no? Memory management may be easy, but it is also easily missed. I am quite the fan of coding practices that simply avoid the hassle. For example, I see a commit in git history where the destruction on AccessBins was missed. I would much prefer the following: SmallVector<Access> AccessList; AccessList.emplace_back({....}); Here, AccessList manages the allocation of the Access object and I don't have to worry about destruction and copying. This is what "never use new" means. To repeat, it has nothing to do with the cost of DenseMap. It also has nothing to do with custom Allocators. The point is that application programmers should refrain from using the new operator, and instead rely on containers to take care of the allocation. The simplest such container is unique_ptr, for example. All containers take Allocators as template arguments, so the question of which Allocator to use is completely orthogonal. "never use new" is a code hygiene concept, not a performance concept. So do you really see a reason to have stable pointers to Access objects? Otherwise I would much prefer keeping them directly in a SmallVector.

jdoerfert added inline comments.Oct 26 2022, 10:06 PM

llvm/include/llvm/Transforms/IPO/Attributor.h
265	^ this one we need to handle as well.
llvm/lib/Transforms/IPO/AttributorAttributes.cpp
849	I'm happy with alternatives. Stable addresses would have allowed to do avoid all the indices and double lookups, though, that is not by itself a problem. I thought below it would be easier if we could store Access*, if you think we don't need to, that's generally OK.

Addressed most of the review comments. Replaced the nested DenseMap with a list.

This diff includes https://reviews.llvm.org/D136745 (OffsetAndSize without the
std::pair) merged into it, so that I don't have to rebase the current diff.
Those changes will get filtered out when submitting to tip of main.

sameerds added inline comments.Oct 27 2022, 8:47 AM

llvm/lib/Transforms/IPO/AttributorAttributes.cpp
891	Should we do this in a separate change? Latest diff addresses pretty much all the other comments.

tschuett added a subscriber: tschuett.Oct 27 2022, 8:54 AM

tschuett added inline comments.

llvm/include/llvm/Transforms/IPO/Attributor.h
214–220	Would a class with public and private make this safer to use?

Harbormaster completed remote builds in B194667: Diff 471175.Oct 27 2022, 9:53 AM

I think this looks good. Fixes an error and I hope doesn't introduce new ones.
Could you, in a follow up, add some tests for the multi access per instruction support you introduced now?

llvm/include/llvm/Transforms/IPO/Attributor.h
214–220	We use it as a POD, basically a pair with helper functions and nicer names. I doubt there is much value in private members and public setters.
llvm/lib/Transforms/IPO/AttributorAttributes.cpp
891	This doesn't need addressing per se. Let's keep it as is for now.

This revision is now accepted and ready to land.Oct 27 2022, 11:25 AM

Rebased to tip of main.
Added test for callgraph from the description: a leaf instruction may have Unknown offset if different offsets reach along different paths in the callgraph.

NOTE: I intend to commit this on Sunday night, IST.

sameerds added inline comments.Oct 28 2022, 4:21 AM

llvm/test/Transforms/Attributor/value-simplify-pointer-info.ll
1707	Oops! This was not supposed to happen. Checking.

Harbormaster completed remote builds in B194897: Diff 471491.Oct 28 2022, 4:56 AM

rebased to include a recent fix; added lit tests

Harbormaster completed remote builds in B195304: Diff 472069.Oct 31 2022, 11:31 AM

Closed by commit rGb756096b0cbe: [AAPointerInfo] refactor how offsets and Access objects are tracked (authored by sameerds). · Explain WhyOct 31 2022, 9:29 PM

This revision was automatically updated to reflect the committed changes.

sameerds added a commit: rGb756096b0cbe: [AAPointerInfo] refactor how offsets and Access objects are tracked.

ye-luo added a reverting change: rG00b09a7b18ab: Revert "[AAPointerInfo] refactor how offsets and Access objects are tracked".Nov 2 2022, 10:03 PM

sameerds reopened this revision.Nov 14 2022, 2:48 PM

This revision is now accepted and ready to land.Nov 14 2022, 2:48 PM

Fixes https://github.com/llvm/llvm-project/issues/58774

When merging to Access objects, don't call reset() on the content, but it needs
to be set to nullptr instead. The latter means that the content was determined
to be unknown. Added an assertion to getWrittenValue() to which would have
caught this more quickly.

sameerds added a reviewer: ye-luo.Nov 14 2022, 2:49 PM

Harbormaster completed remote builds in B197630: Diff 475283.Nov 14 2022, 3:38 PM

Pass tests on my side. Thanks

This revision was landed with ongoing or failed builds.Nov 15 2022, 5:26 AM

Closed by commit rG376d0469b917: [AAPointerInfo] refactor how offsets and Access objects are tracked (authored by sameerds). · Explain Why

This revision was automatically updated to reflect the committed changes.

sameerds added a commit: rG376d0469b917: [AAPointerInfo] refactor how offsets and Access objects are tracked.

Revision Contents

Path

Size

llvm/

include/

llvm/

Transforms/

IPO/

Attributor.h

74 lines

lib/

Transforms/

IPO/

Attributor.cpp

2 lines

AttributorAttributes.cpp

244 lines

test/

Transforms/

Attributor/

call-simplify-pointer-info.ll

155 lines

value-simplify-pointer-info.ll

41 lines

Diff 475436

llvm/include/llvm/Transforms/IPO/Attributor.h

Show First 20 Lines • Show All 205 Lines • ▼ Show 20 Lines
/// not_none + undef => not_none		/// not_none + undef => not_none
/// V1 + V2 => nullptr		/// V1 + V2 => nullptr
Optional<Value *>		Optional<Value *>
combineOptionalValuesInAAValueLatice(const Optional<Value *> &A,		combineOptionalValuesInAAValueLatice(const Optional<Value *> &A,
const Optional<Value > &B, Type Ty);		const Optional<Value > &B, Type Ty);

/// Helper to represent an access offset and size, with logic to deal with		/// Helper to represent an access offset and size, with logic to deal with
/// uncertainty and check for overlapping accesses.		/// uncertainty and check for overlapping accesses.
struct OffsetAndSize {		struct OffsetAndSize {
int64_t Offset = Unassigned;		int64_t Offset = Unassigned;
int64_t Size = Unassigned;		int64_t Size = Unassigned;

OffsetAndSize(int64_t Offset, int64_t Size) : Offset(Offset), Size(Size) {}		OffsetAndSize(int64_t Offset, int64_t Size) : Offset(Offset), Size(Size) {}
OffsetAndSize() = default;		OffsetAndSize() = default;
static OffsetAndSize getUnknown() { return OffsetAndSize{Unknown, Unknown}; }		static OffsetAndSize getUnknown() { return OffsetAndSize{Unknown, Unknown}; }
		tschuettUnsubmitted Not Done Reply Inline Actions Would a class with public and private make this safer to use? tschuett: Would a class with public and private make this safer to use?
		jdoerfertUnsubmitted Not Done Reply Inline Actions We use it as a POD, basically a pair with helper functions and nicer names. I doubt there is much value in private members and public setters. jdoerfert: We use it as a POD, basically a pair with helper functions and nicer names. I doubt there is…

/// Return true if offset or size are unknown.		/// Return true if offset or size are unknown.
bool offsetOrSizeAreUnknown() const {		bool offsetOrSizeAreUnknown() const {
return Offset == OffsetAndSize::Unknown \|\| Size == OffsetAndSize::Unknown;		return Offset == OffsetAndSize::Unknown \|\| Size == OffsetAndSize::Unknown;
}		}

/// Return true if offset and size are unknown, thus this is the default		/// Return true if offset and size are unknown, thus this is the default
/// unknown object.		/// unknown object.
Show All 16 Lines	bool mayOverlap(const OffsetAndSize &OAS) const {
if (offsetOrSizeAreUnknown() \|\| OAS.offsetOrSizeAreUnknown())		if (offsetOrSizeAreUnknown() \|\| OAS.offsetOrSizeAreUnknown())
return true;		return true;

// Check if one offset point is in the other interval [offset,		// Check if one offset point is in the other interval [offset,
// offset+size].		// offset+size].
return OAS.Offset + OAS.Size > Offset && OAS.Offset < Offset + Size;		return OAS.Offset + OAS.Size > Offset && OAS.Offset < Offset + Size;
}		}

		OffsetAndSize &operator&=(const OffsetAndSize &R) {
		if (Offset == Unassigned)
		Offset = R.Offset;
		else if (R.Offset != Unassigned && R.Offset != Offset)
		Offset = Unknown;

		if (Size == Unassigned)
		Size = R.Size;
		else if (Size == Unknown \|\| R.Size == Unknown)
		Size = Unknown;
		else if (R.Size != Unassigned)
		Size = std::max(Size, R.Size);

		jdoerfertUnsubmitted Not Done Reply Inline Actions At this point we need to signal the access it is a May not Must access anymore if the offset or size changed from something that was not unassigned, right? We also need to give up on the value I suppose. jdoerfert: At this point we need to signal the access it is a May not Must access anymore if the offset or…
		jdoerfertUnsubmitted Not Done Reply Inline Actions ^ this one we need to handle as well. jdoerfert: ^ this one we need to handle as well.
		return *this;
		}
		jdoerfertUnsubmitted Not Done Reply Inline Actions Not super happy about first and second. Should we wrap it in `setOffset` `setSize` ? jdoerfert: Not super happy about first and second. Should we wrap it in `setOffset` `setSize` ?

/// Constants used to represent special offsets or sizes.		/// Constants used to represent special offsets or sizes.
/// - This assumes that Offset and Size are non-negative.		/// - This assumes that Offset and Size are non-negative.
/// - The constants should not clash with DenseMapInfo, such as EmptyKey		/// - The constants should not clash with DenseMapInfo, such as EmptyKey
/// (INT64_MAX) and TombstoneKey (INT64_MIN).		/// (INT64_MAX) and TombstoneKey (INT64_MIN).
static constexpr int64_t Unassigned = -1;		static constexpr int64_t Unassigned = -1;
static constexpr int64_t Unknown = -2;		static constexpr int64_t Unknown = -2;
};		};

▲ Show 20 Lines • Show All 4,726 Lines • ▼ Show 20 Lines	enum AccessKind {
AK_MAY_READ_WRITE = AK_MAY \| AK_R \| AK_W,		AK_MAY_READ_WRITE = AK_MAY \| AK_R \| AK_W,
AK_MUST_READ = AK_MUST \| AK_R,		AK_MUST_READ = AK_MUST \| AK_R,
AK_MUST_WRITE = AK_MUST \| AK_W,		AK_MUST_WRITE = AK_MUST \| AK_W,
AK_MUST_READ_WRITE = AK_MUST \| AK_R \| AK_W,		AK_MUST_READ_WRITE = AK_MUST \| AK_R \| AK_W,
};		};

/// An access description.		/// An access description.
struct Access {		struct Access {
Access(Instruction I, Optional<Value > Content, AccessKind Kind, Type *Ty)		Access(Instruction *I, int64_t Offset, int64_t Size,
: LocalI(I), RemoteI(I), Content(Content), Kind(Kind), Ty(Ty) {		Optional<Value > Content, AccessKind Kind, Type Ty)
		: LocalI(I), RemoteI(I), Content(Content), OAS(Offset, Size),
		Kind(Kind), Ty(Ty) {
verify();		verify();
}		}
Access(Instruction LocalI, Instruction RemoteI, Optional<Value *> Content,		Access(Instruction LocalI, Instruction RemoteI, int64_t Offset,
AccessKind Kind, Type *Ty)		int64_t Size, Optional<Value > Content, AccessKind Kind, Type Ty)
: LocalI(LocalI), RemoteI(RemoteI), Content(Content), Kind(Kind),		: LocalI(LocalI), RemoteI(RemoteI), Content(Content), OAS(Offset, Size),
Ty(Ty) {		Kind(Kind), Ty(Ty) {
verify();		verify();
}		}
Access(const Access &Other) = default;		Access(const Access &Other) = default;
Access(const Access &&Other)		Access(const Access &&Other)
: LocalI(Other.LocalI), RemoteI(Other.RemoteI), Content(Other.Content),		: LocalI(Other.LocalI), RemoteI(Other.RemoteI), Content(Other.Content),
Kind(Other.Kind), Ty(Other.Ty) {}		OAS(Other.OAS), Kind(Other.Kind), Ty(Other.Ty) {}

Access &operator=(const Access &Other) = default;		Access &operator=(const Access &Other) = default;
bool operator==(const Access &R) const {		bool operator==(const Access &R) const {
return LocalI == R.LocalI && RemoteI == R.RemoteI &&		return LocalI == R.LocalI && RemoteI == R.RemoteI && OAS == R.OAS &&
Content == R.Content && Kind == R.Kind;		Content == R.Content && Kind == R.Kind;
}		}
bool operator!=(const Access &R) const { return !(*this == R); }		bool operator!=(const Access &R) const { return !(*this == R); }

Access &operator&=(const Access &R) {		Access &operator&=(const Access &R) {
assert(RemoteI == R.RemoteI && "Expected same instruction!");		assert(RemoteI == R.RemoteI && "Expected same instruction!");
		assert(LocalI == R.LocalI && "Expected same instruction!");
		Kind = AccessKind(Kind \| R.Kind);
		auto Before = OAS;
		OAS &= R.OAS;
		if (Before.isUnassigned() \|\| Before == OAS) {
Content =		Content =
AA::combineOptionalValuesInAAValueLatice(Content, R.Content, Ty);		AA::combineOptionalValuesInAAValueLatice(Content, R.Content, Ty);
Kind = AccessKind(Kind \| R.Kind);		} else {
		// Since the OAS information changed, set a conservative state -- drop
		// the contents, and assume MayAccess rather than MustAccess.
		setWrittenValueUnknown();
		Kind = AccessKind(Kind \| AK_MAY);
		Kind = AccessKind(Kind & ~AK_MUST);
		}
		verify();
return *this;		return *this;
}		}

void verify() {		void verify() {
assert(isMustAccess() + isMayAccess() == 1 &&		assert(isMustAccess() + isMayAccess() == 1 &&
"Expect must or may access, not both.");		"Expect must or may access, not both.");
}		}

Show All 19 Lines	struct Access {
/// Return true if the value written is not known yet.		/// Return true if the value written is not known yet.
bool isWrittenValueYetUndetermined() const { return !Content; }		bool isWrittenValueYetUndetermined() const { return !Content; }

/// Return true if the value written cannot be determined at all.		/// Return true if the value written cannot be determined at all.
bool isWrittenValueUnknown() const {		bool isWrittenValueUnknown() const {
return Content.has_value() && !*Content;		return Content.has_value() && !*Content;
}		}

		/// Set the value written to nullptr, i.e., unknown.
		void setWrittenValueUnknown() { Content = nullptr; }

/// Return the type associated with the access, if known.		/// Return the type associated with the access, if known.
Type *getType() const { return Ty; }		Type *getType() const { return Ty; }

/// Return the value writen, if any. As long as		/// Return the value writen, if any.
/// isWrittenValueYetUndetermined return true this function shall not be		Value *getWrittenValue() const {
/// called.		assert(!isWrittenValueYetUndetermined() &&
Value getWrittenValue() const { return Content; }		"Value needs to be determined before accessing it.");
		return *Content;
		}

/// Return the written value which can be `llvm::null` if it is not yet		/// Return the written value which can be `llvm::null` if it is not yet
/// determined.		/// determined.
Optional<Value *> getContent() const { return Content; }		Optional<Value *> getContent() const { return Content; }

		/// Return the offset for this access.
		int64_t getOffset() const { return OAS.Offset; }

		/// Return the size for this access.
		int64_t getSize() const { return OAS.Size; }

private:		private:
/// The instruction responsible for the access with respect to the local		/// The instruction responsible for the access with respect to the local
/// scope of the associated attribute.		/// scope of the associated attribute.
Instruction *LocalI;		Instruction *LocalI;

/// The instruction responsible for the access.		/// The instruction responsible for the access.
Instruction *RemoteI;		Instruction *RemoteI;

/// The value written, if any. `llvm::none` means "not known yet", `nullptr`		/// The value written, if any. `llvm::none` means "not known yet", `nullptr`
/// cannot be determined.		/// cannot be determined.
Optional<Value *> Content;		Optional<Value *> Content;

		/// The object accessed, in terms of an offset and size in bytes.
		jdoerfertUnsubmitted Not Done Reply Inline Actions Documentation, for all of these. See the surrounding code. jdoerfert: Documentation, for all of these. See the surrounding code.
		AA::OffsetAndSize OAS;

/// The access kind, e.g., READ, as bitset (could be more than one).		/// The access kind, e.g., READ, as bitset (could be more than one).
AccessKind Kind;		AccessKind Kind;

/// The type of the content, thus the type read/written, can be null if not		/// The type of the content, thus the type read/written, can be null if not
/// available.		/// available.
Type *Ty;		Type *Ty;
};		};

Show All 19 Lines	struct AAPointerInfo : public AbstractAttribute {
/// for all of them, false otherwise. In contrast to forallInterferingAccesses		/// for all of them, false otherwise. In contrast to forallInterferingAccesses
/// this function will perform reasoning to exclude write accesses that cannot		/// this function will perform reasoning to exclude write accesses that cannot
/// affect the load even if they on the surface look as if they would. The		/// affect the load even if they on the surface look as if they would. The
/// flag \p HasBeenWrittenTo will be set to true if we know that \p I does not		/// flag \p HasBeenWrittenTo will be set to true if we know that \p I does not
/// read the intial value of the underlying memory.		/// read the intial value of the underlying memory.
virtual bool forallInterferingAccesses(		virtual bool forallInterferingAccesses(
Attributor &A, const AbstractAttribute &QueryingAA, Instruction &I,		Attributor &A, const AbstractAttribute &QueryingAA, Instruction &I,
function_ref<bool(const Access &, bool)> CB, bool &HasBeenWrittenTo,		function_ref<bool(const Access &, bool)> CB, bool &HasBeenWrittenTo,
AA::OffsetAndSize *OASPtr = nullptr) const = 0;		AA::OffsetAndSize &OAS) const = 0;

/// This function should return true if the type of the \p AA is AAPointerInfo		/// This function should return true if the type of the \p AA is AAPointerInfo
static bool classof(const AbstractAttribute *AA) {		static bool classof(const AbstractAttribute *AA) {
return (AA->getIdAddr() == &ID);		return (AA->getIdAddr() == &ID);
}		}

/// Unique ID (due to the unique address)		/// Unique ID (due to the unique address)
static const char ID;		static const char ID;
▲ Show 20 Lines • Show All 49 Lines • Show Last 20 Lines

llvm/lib/Transforms/IPO/Attributor.cpp

Show First 20 Lines • Show All 451 Lines • ▼ Show 20 Lines	for (Value *Obj : Objects) {
// If the value has been written to we don't need the initial value of the		// If the value has been written to we don't need the initial value of the
// object.		// object.
bool HasBeenWrittenTo = false;		bool HasBeenWrittenTo = false;

AA::OffsetAndSize OAS;		AA::OffsetAndSize OAS;
auto &PI = A.getAAFor<AAPointerInfo>(QueryingAA, IRPosition::value(*Obj),		auto &PI = A.getAAFor<AAPointerInfo>(QueryingAA, IRPosition::value(*Obj),
DepClassTy::NONE);		DepClassTy::NONE);
if (!PI.forallInterferingAccesses(A, QueryingAA, I, CheckAccess,		if (!PI.forallInterferingAccesses(A, QueryingAA, I, CheckAccess,
HasBeenWrittenTo, &OAS)) {		HasBeenWrittenTo, OAS)) {
LLVM_DEBUG(		LLVM_DEBUG(
dbgs()		dbgs()
<< "Failed to verify all interfering accesses for underlying object: "		<< "Failed to verify all interfering accesses for underlying object: "
<< *Obj << "\n");		<< *Obj << "\n");
return false;		return false;
}		}

if (IsLoad && !HasBeenWrittenTo && !OAS.isUnassigned()) {		if (IsLoad && !HasBeenWrittenTo && !OAS.isUnassigned()) {
▲ Show 20 Lines • Show All 3,129 Lines • Show Last 20 Lines

llvm/lib/Transforms/IPO/AttributorAttributes.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 751 Lines • ▼ Show 20 Lines	struct AccessAsInstructionInfo : DenseMapInfo<Instruction *> {
static unsigned getHashValue(const Access &A);		static unsigned getHashValue(const Access &A);
static bool isEqual(const Access &LHS, const Access &RHS);		static bool isEqual(const Access &LHS, const Access &RHS);
};		};

} // namespace llvm		} // namespace llvm

/// A type to track pointer/struct usage and accesses for AAPointerInfo.		/// A type to track pointer/struct usage and accesses for AAPointerInfo.
struct AA::PointerInfo::State : public AbstractState {		struct AA::PointerInfo::State : public AbstractState {

~State() {
// We do not delete the Accesses objects but need to destroy them still.
for (auto &It : AccessBins)
It.second->~Accesses();
}

/// Return the best possible representable state.		/// Return the best possible representable state.
static State getBestState(const State &SIS) { return State(); }		static State getBestState(const State &SIS) { return State(); }

/// Return the worst possible representable state.		/// Return the worst possible representable state.
static State getWorstState(const State &SIS) {		static State getWorstState(const State &SIS) {
State R;		State R;
R.indicatePessimisticFixpoint();		R.indicatePessimisticFixpoint();
return R;		return R;
}		}

State() = default;		State() = default;
State(State &&SIS) : AccessBins(std::move(SIS.AccessBins)) {		State(State &&SIS) = default;
SIS.AccessBins.clear();
}

const State &getAssumed() const { return *this; }		const State &getAssumed() const { return *this; }

/// See AbstractState::isValidState().		/// See AbstractState::isValidState().
bool isValidState() const override { return BS.isValidState(); }		bool isValidState() const override { return BS.isValidState(); }

/// See AbstractState::isAtFixpoint().		/// See AbstractState::isAtFixpoint().
bool isAtFixpoint() const override { return BS.isAtFixpoint(); }		bool isAtFixpoint() const override { return BS.isAtFixpoint(); }
Show All 9 Lines	ChangeStatus indicatePessimisticFixpoint() override {
BS.indicatePessimisticFixpoint();		BS.indicatePessimisticFixpoint();
return ChangeStatus::CHANGED;		return ChangeStatus::CHANGED;
}		}

State &operator=(const State &R) {		State &operator=(const State &R) {
if (this == &R)		if (this == &R)
return *this;		return *this;
BS = R.BS;		BS = R.BS;
AccessBins = R.AccessBins;		AccessList = R.AccessList;
		OffsetBins = R.OffsetBins;
		RemoteIMap = R.RemoteIMap;
return *this;		return *this;
}		}

State &operator=(State &&R) {		State &operator=(State &&R) {
if (this == &R)		if (this == &R)
return *this;		return *this;
std::swap(BS, R.BS);		std::swap(BS, R.BS);
std::swap(AccessBins, R.AccessBins);		std::swap(AccessList, R.AccessList);
		std::swap(OffsetBins, R.OffsetBins);
		std::swap(RemoteIMap, R.RemoteIMap);
return *this;		return *this;
}		}

bool operator==(const State &R) const {		/// Add a new Access to the state at offset \p Offset and with size \p Size.
if (BS != R.BS)
return false;
if (AccessBins.size() != R.AccessBins.size())
return false;
auto It = begin(), RIt = R.begin(), E = end();
while (It != E) {
if (It->getFirst() != RIt->getFirst())
return false;
auto &Accs = It->getSecond();
auto &RAccs = RIt->getSecond();
if (Accs->size() != RAccs->size())
return false;
for (const auto &ZipIt : llvm::zip(Accs, RAccs))
if (std::get<0>(ZipIt) != std::get<1>(ZipIt))
return false;
++It;
++RIt;
}
return true;
}
bool operator!=(const State &R) const { return !(*this == R); }

/// We store accesses in a set with the instruction as key.
struct Accesses {
SmallVector<AAPointerInfo::Access, 4> Accesses;
DenseMap<const Instruction *, unsigned> Map;

unsigned size() const { return Accesses.size(); }

using vec_iterator = decltype(Accesses)::iterator;
vec_iterator begin() { return Accesses.begin(); }
vec_iterator end() { return Accesses.end(); }

using iterator = decltype(Map)::const_iterator;
iterator find(AAPointerInfo::Access &Acc) {
return Map.find(Acc.getRemoteInst());
}
iterator find_end() { return Map.end(); }

AAPointerInfo::Access &get(iterator &It) {
return Accesses[It->getSecond()];
}

void insert(AAPointerInfo::Access &Acc) {
Map[Acc.getRemoteInst()] = Accesses.size();
Accesses.push_back(Acc);
}
};

/// We store all accesses in bins denoted by their offset and size.
using AccessBinsTy = DenseMap<AA::OffsetAndSize, Accesses *>;

AccessBinsTy::const_iterator begin() const { return AccessBins.begin(); }
AccessBinsTy::const_iterator end() const { return AccessBins.end(); }

protected:
/// The bins with all the accesses for the associated pointer.
AccessBinsTy AccessBins;

/// Add a new access to the state at offset \p Offset and with size \p Size.
/// The access is associated with \p I, writes \p Content (if anything), and		/// The access is associated with \p I, writes \p Content (if anything), and
/// is of kind \p Kind.		/// is of kind \p Kind. If an Access already exists for the same \p I and same
		/// \p RemoteI, the two are combined, potentially losing information about
		/// offset and size. The resulting access must now be moved from its original
		/// OffsetBin to the bin for its new offset.
		///
/// \Returns CHANGED, if the state changed, UNCHANGED otherwise.		/// \Returns CHANGED, if the state changed, UNCHANGED otherwise.
ChangeStatus addAccess(Attributor &A, int64_t Offset, int64_t Size,		ChangeStatus addAccess(Attributor &A, int64_t Offset, int64_t Size,
Instruction &I, Optional<Value *> Content,		Instruction &I, Optional<Value *> Content,
AAPointerInfo::AccessKind Kind, Type *Ty,		AAPointerInfo::AccessKind Kind, Type *Ty,
Instruction *RemoteI = nullptr,		Instruction *RemoteI = nullptr);
Accesses *BinPtr = nullptr) {
AA::OffsetAndSize Key{Offset, Size};		using OffsetBinsTy = DenseMap<OffsetAndSize, SmallSet<unsigned, 4>>;
Accesses *&Bin = BinPtr ? BinPtr : AccessBins[Key];
if (!Bin)		using const_bin_iterator = OffsetBinsTy::const_iterator;
Bin = new (A.Allocator) Accesses;		const_bin_iterator begin() const { return OffsetBins.begin(); }
AAPointerInfo::Access Acc(&I, RemoteI ? RemoteI : &I, Content, Kind, Ty);		const_bin_iterator end() const { return OffsetBins.end(); }
// Check if we have an access for this instruction in this bin, if not,
// simply add it.		const AAPointerInfo::Access &getAccess(unsigned Index) const {
auto It = Bin->find(Acc);		return AccessList[Index];
if (It == Bin->find_end()) {
Bin->insert(Acc);
return ChangeStatus::CHANGED;
}
// If the existing access is the same as then new one, nothing changed.
AAPointerInfo::Access &Current = Bin->get(It);
AAPointerInfo::Access Before = Current;
// The new one will be combined with the existing one.
Current &= Acc;
return Current == Before ? ChangeStatus::UNCHANGED : ChangeStatus::CHANGED;
}		}

		protected:
		// Every memory instruction results in an Access object. We maintain a list of
		// all Access objects that we own, along with the following maps:
		//
		// - OffsetBins: OffsetAndSize -> { Access }
		// - RemoteIMap: RemoteI x LocalI -> Access
		//
		// A RemoteI is any instruction that accesses memory. RemoteI is different
		// from LocalI if and only if LocalI is a call; then RemoteI is some
		// instruction in the callgraph starting from LocalI. Multiple paths in the
		// callgraph from LocalI to RemoteI may produce multiple accesses, but these
		// are all combined into a single Access object. This may result in loss of
		// information in OffsetAndSize in the Access object.
		SmallVector<AAPointerInfo::Access> AccessList;
		jdoerfertUnsubmitted Not Done Reply Inline Actions We could use Access* here to get stable addresses. jdoerfert: We could use Access* here to get stable addresses.
		sameerdsAuthorUnsubmitted Not Done Reply Inline Actions We could, but then we have to worry about memory allocation on the stack. In fact, I am still wondering about the copy constructor for PointerInfo::State. It seems to make a shallow copy of AccessBins. Doesn't that result in a double free? If we keep a list of Access, instead of a list of Access, it looks like more code with no clear benefit. TBH, I am a big fan of "never use new". For now, the unsigned index into the list of Access is sufficient for its very local use. If it needs to be exposed to clients, maybe could wrap it in a custom iterator instead of exposing stable pointers? Of course I am open to using a list of Access if you think that is beneficial. sameerds: We could, but then we have to worry about memory allocation on the stack. In fact, I am still…
		sameerdsAuthorUnsubmitted Not Done Reply Inline Actions s/stack/heap sameerds: s/stack/heap
		jdoerfertUnsubmitted Not Done Reply Inline Actions "never use new" is not a good argument. For one, we have a bump allocator for the Attributor so the "cost" argument doesn't apply at all. Second, even if we don't use the bump allocator, new is not inherently costly. DenseMaps of DenseMaps (or similar nested structures) can hover grow fast. They also will inherently use new, so all you do here is to hide the heap allocations and cause more of them. We could, but then we have to worry about memory allocation on the stack. In fact, I am still wondering about the copy constructor for PointerInfo::State. It seems to make a shallow copy of AccessBins. Doesn't that result in a double free? As far as I can tell, PpinterInfo::State did not have a copy constructor. The move constructor did not cause double free problems as it cleared the other access bin after moving the content. If the new version requires adequate handling for copy/move constructor, that should be easy to add, no? Of course I am open to using a list of Access* if you think that is beneficial. As I said, I had mixed experience with nested maps. This is not exactly the same case but the first one I could find: https://reviews.llvm.org/rG14cb0bdf2b6ca0b7befbb07fe9f73dad5786f59b jdoerfert: "never use new" is not a good argument. For one, we have a bump allocator for the Attributor so…
		sameerdsAuthorUnsubmitted Not Done Reply Inline Actions Right, there is no copy constructor in state, but there is a copy assignment operator which does a shallow copy. I guess it didn't matter because it only got used in places where the state is empty. We don't seem to be talking about the same thing here. I totally agree with not using DenseMap, and I am working on using a list instead. My comment was a reply to this granparent comment: We could use Access* here to get stable addresses. Where in this specific case do you see the need for stable addresses? To me, stable addresses means something like this: SmallVector<Access> AccessList auto Acc = new Access; AccessList.push_back(Acc) Which means now I have to remember to destroy those Access objects later and also worry about deep copies. If the new version requires adequate handling for copy/move constructor, that should be easy to add, no? Memory management may be easy, but it is also easily missed. I am quite the fan of coding practices that simply avoid the hassle. For example, I see a commit in git history where the destruction on AccessBins was missed. I would much prefer the following: SmallVector<Access> AccessList; AccessList.emplace_back({....}); Here, AccessList manages the allocation of the Access object and I don't have to worry about destruction and copying. This is what "never use new" means. To repeat, it has nothing to do with the cost of DenseMap. It also has nothing to do with custom Allocators. The point is that application programmers should refrain from using the new operator, and instead rely on containers to take care of the allocation. The simplest such container is unique_ptr, for example. All containers take Allocators as template arguments, so the question of which Allocator to use is completely orthogonal. "never use new" is a code hygiene concept, not a performance concept. So do you really see a reason to have stable pointers to Access objects? Otherwise I would much prefer keeping them directly in a SmallVector. sameerds: Right, there is no copy constructor in state, but there is a copy assignment operator which…
		jdoerfertUnsubmitted Not Done Reply Inline Actions I'm happy with alternatives. Stable addresses would have allowed to do avoid all the indices and double lookups, though, that is not by itself a problem. I thought below it would be easier if we could store Access, if you think we don't need to, that's generally OK. jdoerfert:* I'm happy with alternatives. Stable addresses would have allowed to do avoid all the indices…
		OffsetBinsTy OffsetBins;
		DenseMap<const Instruction *, SmallVector<unsigned>> RemoteIMap;

		jdoerfertUnsubmitted Not Done Reply Inline Actions I'm not sure nested DenseMaps are a good idea. I think it usually ends up using way too much memory. Let's try to use an encoding without it. jdoerfert: I'm not sure nested DenseMaps are a good idea. I think it usually ends up using way too much…
/// See AAPointerInfo::forallInterferingAccesses.		/// See AAPointerInfo::forallInterferingAccesses.
bool forallInterferingAccesses(		bool forallInterferingAccesses(
AA::OffsetAndSize OAS,		AA::OffsetAndSize OAS,
function_ref<bool(const AAPointerInfo::Access &, bool)> CB) const {		function_ref<bool(const AAPointerInfo::Access &, bool)> CB) const {
if (!isValidState())		if (!isValidState())
return false;		return false;

for (const auto &It : AccessBins) {		for (const auto &It : OffsetBins) {
AA::OffsetAndSize ItOAS = It.getFirst();		AA::OffsetAndSize ItOAS = It.getFirst();
if (!OAS.mayOverlap(ItOAS))		if (!OAS.mayOverlap(ItOAS))
continue;		continue;
bool IsExact = OAS == ItOAS && !OAS.offsetOrSizeAreUnknown();		bool IsExact = OAS == ItOAS && !OAS.offsetOrSizeAreUnknown();
for (auto &Access : *It.getSecond())		for (auto Index : It.getSecond()) {
		auto &Access = AccessList[Index];
if (!CB(Access, IsExact))		if (!CB(Access, IsExact))
return false;		return false;
}		}
		}
return true;		return true;
}		}

/// See AAPointerInfo::forallInterferingAccesses.		/// See AAPointerInfo::forallInterferingAccesses.
bool forallInterferingAccesses(		bool forallInterferingAccesses(
Instruction &I,		Instruction &I,
function_ref<bool(const AAPointerInfo::Access &, bool)> CB,		function_ref<bool(const AAPointerInfo::Access &, bool)> CB,
AA::OffsetAndSize *OASPtr) const {		AA::OffsetAndSize &OAS) const {
if (!isValidState())		if (!isValidState())
return false;		return false;

// First find the offset and size of I.		auto LocalList = RemoteIMap.find(&I);
AA::OffsetAndSize OAS;		if (LocalList == RemoteIMap.end()) {
for (const auto &It : AccessBins) {
for (auto &Access : *It.getSecond()) {
if (Access.getRemoteInst() == &I) {
OAS = It.getFirst();
break;
}
}
if (OAS.Size != AA::OffsetAndSize::Unassigned)
break;
}

if (OASPtr)
*OASPtr = OAS;

// No access for I was found, we are done.
if (OAS.Size == AA::OffsetAndSize::Unassigned)
return true;		return true;
		}

// Now that we have an offset and size, find all overlapping ones and use		for (auto LI : LocalList->getSecond()) {
// the callback on the accesses.		auto &Access = AccessList[LI];
		OAS &= {Access.getOffset(), Access.getSize()};
		}
return forallInterferingAccesses(OAS, CB);		return forallInterferingAccesses(OAS, CB);
		jdoerfertUnsubmitted Not Done Reply Inline Actions This looks like we could map instructions to a list of OAS instead. Or directly the Access? jdoerfert:* This looks like we could map instructions to a list of OAS instead. Or directly the Access*?
		sameerdsAuthorUnsubmitted Done Reply Inline Actions Should we do this in a separate change? Latest diff addresses pretty much all the other comments. sameerds: Should we do this in a separate change? Latest diff addresses pretty much all the other…
		jdoerfertUnsubmitted Not Done Reply Inline Actions This doesn't need addressing per se. Let's keep it as is for now. jdoerfert: This doesn't need addressing per se. Let's keep it as is for now.
}		}

private:		private:
/// State to track fixpoint and validity.		/// State to track fixpoint and validity.
BooleanState BS;		BooleanState BS;
};		};

		ChangeStatus AA::PointerInfo::State::addAccess(Attributor &A, int64_t Offset,
		int64_t Size, Instruction &I,
		Optional<Value *> Content,
		AAPointerInfo::AccessKind Kind,
		Type Ty, Instruction RemoteI) {
		RemoteI = RemoteI ? RemoteI : &I;
		AAPointerInfo::Access Acc(&I, RemoteI, Offset, Size, Content, Kind, Ty);

		// Check if we have an access for this instruction, if not, simply add it.
		auto &LocalList = RemoteIMap[RemoteI];
		bool AccExists = false;
		jdoerfertUnsubmitted Not Done Reply Inline Actions Also here, if we have `RemoteI -> list<Access>` we should be fine, no? jdoerfert:* Also here, if we have `RemoteI -> list<Access*>` we should be fine, no?
		unsigned AccIndex = AccessList.size();
		for (auto Index : LocalList) {
		auto &A = AccessList[Index];
		if (A.getLocalInst() == &I) {
		AccExists = true;
		AccIndex = Index;
		break;
		}
		}
		if (!AccExists) {
		AccessList.push_back(Acc);
		LocalList.push_back(AccIndex);
		} else {
		// The new one will be combined with the existing one.
		auto &Current = AccessList[AccIndex];
		auto Before = Current;
		Current &= Acc;
		if (Current == Before)
		return ChangeStatus::UNCHANGED;

		Acc = Current;
		AA::OffsetAndSize Key{Before.getOffset(), Before.getSize()};
		assert(OffsetBins.count(Key) && "Existing Access must be in some bin.");
		auto &Bin = OffsetBins[Key];
		assert(Bin.count(AccIndex) &&
		"Expected bin to actually contain the Access.");
		LLVM_DEBUG(dbgs() << "[AAPointerInfo] Removing Access "
		<< AccessList[AccIndex] << " with key {" << Key.Offset
		<< ',' << Key.Size << "}\n");
		Bin.erase(AccIndex);
		}

		AA::OffsetAndSize Key{Acc.getOffset(), Acc.getSize()};
		LLVM_DEBUG(dbgs() << "[AAPointerInfo] Inserting Access " << Acc
		<< " with key {" << Key.Offset << ',' << Key.Size << "}\n");
		OffsetBins[Key].insert(AccIndex);
		return ChangeStatus::CHANGED;
		}

namespace {		namespace {
struct AAPointerInfoImpl		struct AAPointerInfoImpl
: public StateWrapper<AA::PointerInfo::State, AAPointerInfo> {		: public StateWrapper<AA::PointerInfo::State, AAPointerInfo> {
using BaseTy = StateWrapper<AA::PointerInfo::State, AAPointerInfo>;		using BaseTy = StateWrapper<AA::PointerInfo::State, AAPointerInfo>;
AAPointerInfoImpl(const IRPosition &IRP, Attributor &A) : BaseTy(IRP) {}		AAPointerInfoImpl(const IRPosition &IRP, Attributor &A) : BaseTy(IRP) {}

/// See AbstractAttribute::getAsStr().		/// See AbstractAttribute::getAsStr().
const std::string getAsStr() const override {		const std::string getAsStr() const override {
return std::string("PointerInfo ") +		return std::string("PointerInfo ") +
(isValidState() ? (std::string("#") +		(isValidState() ? (std::string("#") +
std::to_string(AccessBins.size()) + " bins")		std::to_string(OffsetBins.size()) + " bins")
: "<invalid>");		: "<invalid>");
}		}

/// See AbstractAttribute::manifest(...).		/// See AbstractAttribute::manifest(...).
ChangeStatus manifest(Attributor &A) override {		ChangeStatus manifest(Attributor &A) override {
return AAPointerInfo::manifest(A);		return AAPointerInfo::manifest(A);
}		}

bool forallInterferingAccesses(		bool forallInterferingAccesses(
AA::OffsetAndSize OAS,		AA::OffsetAndSize OAS,
function_ref<bool(const AAPointerInfo::Access &, bool)> CB)		function_ref<bool(const AAPointerInfo::Access &, bool)> CB)
const override {		const override {
return State::forallInterferingAccesses(OAS, CB);		return State::forallInterferingAccesses(OAS, CB);
}		}

bool forallInterferingAccesses(		bool forallInterferingAccesses(
Attributor &A, const AbstractAttribute &QueryingAA, Instruction &I,		Attributor &A, const AbstractAttribute &QueryingAA, Instruction &I,
function_ref<bool(const Access &, bool)> UserCB, bool &HasBeenWrittenTo,		function_ref<bool(const Access &, bool)> UserCB, bool &HasBeenWrittenTo,
AA::OffsetAndSize *OASPtr = nullptr) const override {		AA::OffsetAndSize &OAS) const override {
HasBeenWrittenTo = false;		HasBeenWrittenTo = false;

SmallPtrSet<const Access *, 8> DominatingWrites;		SmallPtrSet<const Access *, 8> DominatingWrites;
SmallVector<std::pair<const Access *, bool>, 8> InterferingAccesses;		SmallVector<std::pair<const Access *, bool>, 8> InterferingAccesses;

Function &Scope = *I.getFunction();		Function &Scope = *I.getFunction();
const auto &NoSyncAA = A.getAAFor<AANoSync>(		const auto &NoSyncAA = A.getAAFor<AANoSync>(
QueryingAA, IRPosition::function(Scope), DepClassTy::OPTIONAL);		QueryingAA, IRPosition::function(Scope), DepClassTy::OPTIONAL);
▲ Show 20 Lines • Show All 98 Lines • ▼ Show 20 Lines	auto AccessCB = [&](const Access &Acc, bool Exact) {
// work yet if we have threading effects, or the access is complicated.		// work yet if we have threading effects, or the access is complicated.
if (CanUseCFGResoning && Dominates && UseDominanceReasoning &&		if (CanUseCFGResoning && Dominates && UseDominanceReasoning &&
IsSameThreadAsLoad(Acc))		IsSameThreadAsLoad(Acc))
DominatingWrites.insert(&Acc);		DominatingWrites.insert(&Acc);

InterferingAccesses.push_back({&Acc, Exact});		InterferingAccesses.push_back({&Acc, Exact});
return true;		return true;
};		};
if (!State::forallInterferingAccesses(I, AccessCB, OASPtr))		if (!State::forallInterferingAccesses(I, AccessCB, OAS))
return false;		return false;

if (HasBeenWrittenTo) {		if (HasBeenWrittenTo) {
const Function *ScopePtr = &Scope;		const Function *ScopePtr = &Scope;
IsLiveInCalleeCB = [ScopePtr](const Function &Fn) {		IsLiveInCalleeCB = [ScopePtr](const Function &Fn) {
return ScopePtr != &Fn;		return ScopePtr != &Fn;
};		};
}		}
▲ Show 20 Lines • Show All 50 Lines • ▼ Show 20 Lines	if (!OtherAA.getState().isValidState() \|\| !isValidState())
return indicatePessimisticFixpoint();		return indicatePessimisticFixpoint();

const auto &OtherAAImpl = static_cast<const AAPointerInfoImpl &>(OtherAA);		const auto &OtherAAImpl = static_cast<const AAPointerInfoImpl &>(OtherAA);
bool IsByval =		bool IsByval =
FromCallee && OtherAAImpl.getAssociatedArgument()->hasByValAttr();		FromCallee && OtherAAImpl.getAssociatedArgument()->hasByValAttr();

// Combine the accesses bin by bin.		// Combine the accesses bin by bin.
ChangeStatus Changed = ChangeStatus::UNCHANGED;		ChangeStatus Changed = ChangeStatus::UNCHANGED;
for (const auto &It : OtherAAImpl.getState()) {		const auto &State = OtherAAImpl.getState();
		for (const auto &It : State) {
AA::OffsetAndSize OAS = AA::OffsetAndSize::getUnknown();		AA::OffsetAndSize OAS = AA::OffsetAndSize::getUnknown();
if (Offset != AA::OffsetAndSize::Unknown &&		if (Offset != AA::OffsetAndSize::Unknown &&
!It.first.offsetOrSizeAreUnknown()) {		!It.first.offsetOrSizeAreUnknown()) {
OAS = AA::OffsetAndSize(It.first.Offset + Offset, It.first.Size);		OAS = AA::OffsetAndSize(It.first.Offset + Offset, It.first.Size);
}		}
Accesses *Bin = AccessBins.lookup(OAS);		for (auto Index : It.getSecond()) {
for (const AAPointerInfo::Access &RAcc : *It.second) {		const auto &RAcc = State.getAccess(Index);
if (IsByval && !RAcc.isRead())		if (IsByval && !RAcc.isRead())
continue;		continue;
bool UsedAssumedInformation = false;		bool UsedAssumedInformation = false;
AccessKind AK = RAcc.getKind();		AccessKind AK = RAcc.getKind();
Optional<Value *> Content = RAcc.getContent();		Optional<Value *> Content = RAcc.getContent();
if (FromCallee) {		if (FromCallee) {
Content = A.translateArgumentToCallSiteContent(		Content = A.translateArgumentToCallSiteContent(
RAcc.getContent(), CB, *this, UsedAssumedInformation);		RAcc.getContent(), CB, *this, UsedAssumedInformation);
AK =		AK =
AccessKind(AK & (IsByval ? AccessKind::AK_R : AccessKind::AK_RW));		AccessKind(AK & (IsByval ? AccessKind::AK_R : AccessKind::AK_RW));
AK = AccessKind(AK \| (RAcc.isMayAccess() ? AK_MAY : AK_MUST));		AK = AccessKind(AK \| (RAcc.isMayAccess() ? AK_MAY : AK_MUST));
}		}
Changed =		Changed = Changed \| addAccess(A, OAS.Offset, OAS.Size, CB, Content, AK,
Changed \| addAccess(A, OAS.Offset, OAS.Size, CB, Content, AK,		RAcc.getType(), RAcc.getRemoteInst());
RAcc.getType(), RAcc.getRemoteInst(), Bin);
}		}
}		}
return Changed;		return Changed;
}		}

/// Statistic tracking for all AAPointerInfo implementations.		/// Statistic tracking for all AAPointerInfo implementations.
/// See AbstractAttribute::trackStatistics().		/// See AbstractAttribute::trackStatistics().
void trackPointerInfoStatistics(const IRPosition &IRP) const {}		void trackPointerInfoStatistics(const IRPosition &IRP) const {}

/// Dump the state into \p O.		/// Dump the state into \p O.
void dumpState(raw_ostream &O) {		void dumpState(raw_ostream &O) {
for (auto &It : AccessBins) {		for (auto &It : OffsetBins) {
O << "[" << It.first.Offset << "-" << It.first.Offset + It.first.Size		O << "[" << It.first.Offset << "-" << It.first.Offset + It.first.Size
<< "] : " << It.getSecond()->size() << "\n";		<< "] : " << It.getSecond().size() << "\n";
for (auto &Acc : *It.getSecond()) {		for (auto AccIndex : It.getSecond()) {
		auto &Acc = AccessList[AccIndex];
O << " - " << Acc.getKind() << " - " << *Acc.getLocalInst() << "\n";		O << " - " << Acc.getKind() << " - " << *Acc.getLocalInst() << "\n";
if (Acc.getLocalInst() != Acc.getRemoteInst())		if (Acc.getLocalInst() != Acc.getRemoteInst())
O << " --> " << *Acc.getRemoteInst()		O << " --> " << *Acc.getRemoteInst()
<< "\n";		<< "\n";
if (!Acc.isWrittenValueYetUndetermined()) {		if (!Acc.isWrittenValueYetUndetermined()) {
if (Acc.getWrittenValue())		if (Acc.getWrittenValue())
O << " - c: " << *Acc.getWrittenValue() << "\n";		O << " - c: " << *Acc.getWrittenValue() << "\n";
else		else
▲ Show 20 Lines • Show All 9,821 Lines • Show Last 20 Lines

llvm/test/Transforms/Attributor/call-simplify-pointer-info.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py UTC_ARGS: --function-signature --check-attributes --check-globals			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py UTC_ARGS: --function-signature --check-attributes --check-globals
	; RUN: opt -aa-pipeline=basic-aa -passes=attributor -attributor-manifest-internal -attributor-max-iterations-verify -attributor-annotate-decl-cs -attributor-max-iterations=4 -S < %s \| FileCheck %s --check-prefixes=TUNIT			; RUN: opt -aa-pipeline=basic-aa -passes=attributor -attributor-manifest-internal -attributor-max-iterations-verify -attributor-annotate-decl-cs -attributor-max-iterations=7 -S < %s \| FileCheck %s --check-prefixes=TUNIT
	; RUN: opt -aa-pipeline=basic-aa -passes=attributor-cgscc -attributor-manifest-internal -attributor-annotate-decl-cs -S < %s \| FileCheck %s --check-prefixes=CGSCC			; RUN: opt -aa-pipeline=basic-aa -passes=attributor-cgscc -attributor-manifest-internal -attributor-annotate-decl-cs -S < %s \| FileCheck %s --check-prefixes=CGSCC
	;			;

	define internal i8 @read_arg(i8* %p) {			define internal i8 @read_arg(i8* %p) {
	; CGSCC: Function Attrs: nofree norecurse nosync nounwind willreturn memory(argmem: read)			; CGSCC: Function Attrs: nofree norecurse nosync nounwind willreturn memory(argmem: read)
	; CGSCC-LABEL: define {{[^@]+}}@read_arg			; CGSCC-LABEL: define {{[^@]+}}@read_arg
	; CGSCC-SAME: (i8* nocapture nofree noundef nonnull readonly dereferenceable(1022) [[P:%.*]]) #[[ATTR0:[0-9]+]] {			; CGSCC-SAME: (i8* nocapture nofree noundef nonnull readonly dereferenceable(1022) [[P:%.*]]) #[[ATTR0:[0-9]+]] {
	; CGSCC-NEXT: entry:			; CGSCC-NEXT: entry:
	Show All 37 Lines
	;			;
	; CGSCC: Function Attrs: nofree nosync nounwind willreturn memory(none)			; CGSCC: Function Attrs: nofree nosync nounwind willreturn memory(none)
	; CGSCC-LABEL: define {{[^@]+}}@call_simplifiable_1			; CGSCC-LABEL: define {{[^@]+}}@call_simplifiable_1
	; CGSCC-SAME: () #[[ATTR1:[0-9]+]] {			; CGSCC-SAME: () #[[ATTR1:[0-9]+]] {
	; CGSCC-NEXT: entry:			; CGSCC-NEXT: entry:
	; CGSCC-NEXT: [[BYTES:%.*]] = alloca [1024 x i8], align 16			; CGSCC-NEXT: [[BYTES:%.*]] = alloca [1024 x i8], align 16
	; CGSCC-NEXT: [[I0:%.]] = getelementptr inbounds [1024 x i8], [1024 x i8] [[BYTES]], i64 0, i64 2			; CGSCC-NEXT: [[I0:%.]] = getelementptr inbounds [1024 x i8], [1024 x i8] [[BYTES]], i64 0, i64 2
	; CGSCC-NEXT: store i8 2, i8* [[I0]], align 2			; CGSCC-NEXT: store i8 2, i8* [[I0]], align 2
	; CGSCC-NEXT: [[R:%.]] = call i8 @read_arg(i8 nocapture nofree noundef nonnull readonly align 2 dereferenceable(1022) [[I0]]) #[[ATTR2:[0-9]+]]			; CGSCC-NEXT: [[R:%.]] = call i8 @read_arg(i8 nocapture nofree noundef nonnull readonly align 2 dereferenceable(1022) [[I0]]) #[[ATTR3:[0-9]+]]
	; CGSCC-NEXT: ret i8 [[R]]			; CGSCC-NEXT: ret i8 [[R]]
	;			;
	entry:			entry:
	%Bytes = alloca [1024 x i8], align 16			%Bytes = alloca [1024 x i8], align 16
	%i0 = getelementptr inbounds [1024 x i8], [1024 x i8]* %Bytes, i64 0, i64 2			%i0 = getelementptr inbounds [1024 x i8], [1024 x i8]* %Bytes, i64 0, i64 2
	store i8 2, i8* %i0, align 1			store i8 2, i8* %i0, align 1
	%r = call i8 @read_arg(i8* %i0)			%r = call i8 @read_arg(i8* %i0)
	ret i8 %r			ret i8 %r
	}			}

				;;; Same as read_arg, but we need a copy to form distinct leaves in the callgraph.

				define internal i8 @read_arg_1(i8* %p) {
				; CGSCC: Function Attrs: nofree norecurse nosync nounwind willreturn memory(argmem: read)
				; CGSCC-LABEL: define {{[^@]+}}@read_arg_1
				; CGSCC-SAME: (i8* nocapture nofree noundef nonnull readonly dereferenceable(1) [[P:%.*]]) #[[ATTR0]] {
				; CGSCC-NEXT: entry:
				; CGSCC-NEXT: [[L:%.]] = load i8, i8 [[P]], align 1
				; CGSCC-NEXT: ret i8 [[L]]
				;
				entry:
				%l = load i8, i8* %p, align 1
				ret i8 %l
				}

				define internal i8 @sum_two_same_loads(i8* %p) {
				; CGSCC: Function Attrs: nofree nosync nounwind willreturn memory(argmem: read)
				; CGSCC-LABEL: define {{[^@]+}}@sum_two_same_loads
				; CGSCC-SAME: (i8* nocapture nofree noundef nonnull readonly dereferenceable(1022) [[P:%.*]]) #[[ATTR2:[0-9]+]] {
				; CGSCC-NEXT: [[X:%.]] = call i8 @read_arg_1(i8 nocapture nofree noundef nonnull readonly dereferenceable(1022) [[P]]) #[[ATTR4:[0-9]+]]
				; CGSCC-NEXT: [[Y:%.]] = call i8 @read_arg_1(i8 nocapture nofree noundef nonnull readonly dereferenceable(1022) [[P]]) #[[ATTR4]]
				; CGSCC-NEXT: [[Z:%.*]] = add nsw i8 [[X]], [[Y]]
				; CGSCC-NEXT: ret i8 [[Z]]
				;
				%x = call i8 @read_arg_1(i8* %p)
				%y = call i8 @read_arg_1(i8* %p)
				%z = add nsw i8 %x, %y
				ret i8 %z
				}

				define i8 @call_simplifiable_2() {
				; TUNIT: Function Attrs: nofree norecurse nosync nounwind willreturn memory(none)
				; TUNIT-LABEL: define {{[^@]+}}@call_simplifiable_2
				; TUNIT-SAME: () #[[ATTR1]] {
				; TUNIT-NEXT: entry:
				; TUNIT-NEXT: [[BYTES:%.*]] = alloca [1024 x i8], align 16
				; TUNIT-NEXT: [[I0:%.]] = getelementptr inbounds [1024 x i8], [1024 x i8] [[BYTES]], i64 0, i64 2
				; TUNIT-NEXT: [[I1:%.]] = getelementptr inbounds [1024 x i8], [1024 x i8] [[BYTES]], i64 0, i64 3
				; TUNIT-NEXT: ret i8 4
				;
				; CGSCC: Function Attrs: nofree nosync nounwind willreturn memory(none)
				; CGSCC-LABEL: define {{[^@]+}}@call_simplifiable_2
				; CGSCC-SAME: () #[[ATTR1]] {
				; CGSCC-NEXT: entry:
				; CGSCC-NEXT: [[BYTES:%.*]] = alloca [1024 x i8], align 16
				; CGSCC-NEXT: [[I0:%.]] = getelementptr inbounds [1024 x i8], [1024 x i8] [[BYTES]], i64 0, i64 2
				; CGSCC-NEXT: store i8 2, i8* [[I0]], align 2
				; CGSCC-NEXT: [[I1:%.]] = getelementptr inbounds [1024 x i8], [1024 x i8] [[BYTES]], i64 0, i64 3
				; CGSCC-NEXT: store i8 3, i8* [[I1]], align 1
				; CGSCC-NEXT: [[R:%.]] = call i8 @sum_two_same_loads(i8 nocapture nofree noundef nonnull readonly align 2 dereferenceable(1022) [[I0]]) #[[ATTR3]]
				; CGSCC-NEXT: ret i8 [[R]]
				;
				entry:
				%Bytes = alloca [1024 x i8], align 16
				%i0 = getelementptr inbounds [1024 x i8], [1024 x i8]* %Bytes, i64 0, i64 2
				store i8 2, i8* %i0
				%i1 = getelementptr inbounds [1024 x i8], [1024 x i8]* %Bytes, i64 0, i64 3
				store i8 3, i8* %i1
				%r = call i8 @sum_two_same_loads(i8* %i0)
				ret i8 %r
				}

	define i8 @call_not_simplifiable_1() {			define i8 @call_not_simplifiable_1() {
	; TUNIT: Function Attrs: nofree norecurse nosync nounwind willreturn memory(none)			; TUNIT: Function Attrs: nofree norecurse nosync nounwind willreturn memory(none)
	; TUNIT-LABEL: define {{[^@]+}}@call_not_simplifiable_1			; TUNIT-LABEL: define {{[^@]+}}@call_not_simplifiable_1
	; TUNIT-SAME: () #[[ATTR1]] {			; TUNIT-SAME: () #[[ATTR1]] {
	; TUNIT-NEXT: entry:			; TUNIT-NEXT: entry:
	; TUNIT-NEXT: [[BYTES:%.*]] = alloca [1024 x i8], align 16			; TUNIT-NEXT: [[BYTES:%.*]] = alloca [1024 x i8], align 16
	; TUNIT-NEXT: [[I0:%.]] = getelementptr inbounds [1024 x i8], [1024 x i8] [[BYTES]], i64 0, i64 2			; TUNIT-NEXT: [[I0:%.]] = getelementptr inbounds [1024 x i8], [1024 x i8] [[BYTES]], i64 0, i64 2
	; TUNIT-NEXT: store i8 2, i8* [[I0]], align 2			; TUNIT-NEXT: store i8 2, i8* [[I0]], align 2
	; TUNIT-NEXT: [[R:%.]] = call i8 @read_arg_index(i8 nocapture nofree noundef nonnull readonly align 2 dereferenceable(1022) [[I0]]) #[[ATTR2:[0-9]+]]			; TUNIT-NEXT: [[R:%.]] = call i8 @read_arg_index(i8 nocapture nofree noundef nonnull readonly align 2 dereferenceable(1022) [[I0]]) #[[ATTR2:[0-9]+]]
	; TUNIT-NEXT: ret i8 [[R]]			; TUNIT-NEXT: ret i8 [[R]]
	;			;
	; CGSCC: Function Attrs: nofree nosync nounwind willreturn memory(none)			; CGSCC: Function Attrs: nofree nosync nounwind willreturn memory(none)
	; CGSCC-LABEL: define {{[^@]+}}@call_not_simplifiable_1			; CGSCC-LABEL: define {{[^@]+}}@call_not_simplifiable_1
	; CGSCC-SAME: () #[[ATTR1]] {			; CGSCC-SAME: () #[[ATTR1]] {
	; CGSCC-NEXT: entry:			; CGSCC-NEXT: entry:
	; CGSCC-NEXT: [[BYTES:%.*]] = alloca [1024 x i8], align 16			; CGSCC-NEXT: [[BYTES:%.*]] = alloca [1024 x i8], align 16
	; CGSCC-NEXT: [[I0:%.]] = getelementptr inbounds [1024 x i8], [1024 x i8] [[BYTES]], i64 0, i64 2			; CGSCC-NEXT: [[I0:%.]] = getelementptr inbounds [1024 x i8], [1024 x i8] [[BYTES]], i64 0, i64 2
	; CGSCC-NEXT: store i8 2, i8* [[I0]], align 2			; CGSCC-NEXT: store i8 2, i8* [[I0]], align 2
	; CGSCC-NEXT: [[R:%.]] = call i8 @read_arg_index(i8 nocapture nofree noundef nonnull readonly align 2 dereferenceable(1022) [[I0]]) #[[ATTR2]]			; CGSCC-NEXT: [[R:%.]] = call i8 @read_arg_index(i8 nocapture nofree noundef nonnull readonly align 2 dereferenceable(1022) [[I0]]) #[[ATTR3]]
	; CGSCC-NEXT: ret i8 [[R]]			; CGSCC-NEXT: ret i8 [[R]]
	;			;
	entry:			entry:
	%Bytes = alloca [1024 x i8], align 16			%Bytes = alloca [1024 x i8], align 16
	%i0 = getelementptr inbounds [1024 x i8], [1024 x i8]* %Bytes, i64 0, i64 2			%i0 = getelementptr inbounds [1024 x i8], [1024 x i8]* %Bytes, i64 0, i64 2
	store i8 2, i8* %i0, align 1			store i8 2, i8* %i0, align 1
	%r = call i8 @read_arg_index(i8* %i0, i64 0)			%r = call i8 @read_arg_index(i8* %i0, i64 0)
	ret i8 %r			ret i8 %r
	}			}

				;;; Same as read_arg, but we need a copy to form distinct leaves in the callgraph.

				define internal i8 @read_arg_2(i8* %p) {
				; TUNIT: Function Attrs: nofree norecurse nosync nounwind willreturn memory(argmem: read)
				; TUNIT-LABEL: define {{[^@]+}}@read_arg_2
				; TUNIT-SAME: (i8* nocapture nofree noundef nonnull readonly dereferenceable(1021) [[P:%.*]]) #[[ATTR0]] {
				; TUNIT-NEXT: entry:
				; TUNIT-NEXT: [[L:%.]] = load i8, i8 [[P]], align 1
				; TUNIT-NEXT: ret i8 [[L]]
				;
				; CGSCC: Function Attrs: nofree norecurse nosync nounwind willreturn memory(argmem: read)
				; CGSCC-LABEL: define {{[^@]+}}@read_arg_2
				; CGSCC-SAME: (i8* nocapture nofree noundef nonnull readonly dereferenceable(1) [[P:%.*]]) #[[ATTR0]] {
				; CGSCC-NEXT: entry:
				; CGSCC-NEXT: [[L:%.]] = load i8, i8 [[P]], align 1
				; CGSCC-NEXT: ret i8 [[L]]
				;
				entry:
				%l = load i8, i8* %p, align 1
				ret i8 %l
				}

				define internal i8 @sum_two_different_loads(i8* %p, i8* %q) {
				; TUNIT: Function Attrs: nofree norecurse nosync nounwind willreturn memory(argmem: read)
				; TUNIT-LABEL: define {{[^@]+}}@sum_two_different_loads
				; TUNIT-SAME: (i8* nocapture nofree noundef nonnull readonly align 2 dereferenceable(1022) [[P:%.]], i8 nocapture nofree noundef nonnull readonly dereferenceable(1021) [[Q:%.*]]) #[[ATTR0]] {
				; TUNIT-NEXT: [[X:%.]] = call i8 @read_arg_2(i8 nocapture nofree noundef nonnull readonly align 2 dereferenceable(1022) [[P]]) #[[ATTR2]]
				; TUNIT-NEXT: [[Y:%.]] = call i8 @read_arg_2(i8 nocapture nofree noundef nonnull readonly dereferenceable(1021) [[Q]]) #[[ATTR2]]
				; TUNIT-NEXT: [[Z:%.*]] = add nsw i8 [[X]], [[Y]]
				; TUNIT-NEXT: ret i8 [[Z]]
				;
				; CGSCC: Function Attrs: nofree nosync nounwind willreturn memory(argmem: read)
				; CGSCC-LABEL: define {{[^@]+}}@sum_two_different_loads
				; CGSCC-SAME: (i8* nocapture nofree noundef nonnull readonly dereferenceable(1022) [[P:%.]], i8 nocapture nofree noundef nonnull readonly dereferenceable(1021) [[Q:%.*]]) #[[ATTR2]] {
				; CGSCC-NEXT: [[X:%.]] = call i8 @read_arg_2(i8 nocapture nofree noundef nonnull readonly dereferenceable(1022) [[P]]) #[[ATTR4]]
				; CGSCC-NEXT: [[Y:%.]] = call i8 @read_arg_2(i8 nocapture nofree noundef nonnull readonly dereferenceable(1021) [[Q]]) #[[ATTR4]]
				; CGSCC-NEXT: [[Z:%.*]] = add nsw i8 [[X]], [[Y]]
				; CGSCC-NEXT: ret i8 [[Z]]
				;
				%x = call i8 @read_arg_2(i8* %p)
				%y = call i8 @read_arg_2(i8* %q)
				%z = add nsw i8 %x, %y
				ret i8 %z
				}

				define i8 @call_not_simplifiable_2() {
				; TUNIT: Function Attrs: nofree norecurse nosync nounwind willreturn memory(none)
				; TUNIT-LABEL: define {{[^@]+}}@call_not_simplifiable_2
				; TUNIT-SAME: () #[[ATTR1]] {
				; TUNIT-NEXT: entry:
				; TUNIT-NEXT: [[BYTES:%.*]] = alloca [1024 x i8], align 16
				; TUNIT-NEXT: [[I0:%.]] = getelementptr inbounds [1024 x i8], [1024 x i8] [[BYTES]], i64 0, i64 2
				; TUNIT-NEXT: store i8 2, i8* [[I0]], align 2
				; TUNIT-NEXT: [[I1:%.]] = getelementptr inbounds [1024 x i8], [1024 x i8] [[BYTES]], i64 0, i64 3
				; TUNIT-NEXT: store i8 3, i8* [[I1]], align 1
				; TUNIT-NEXT: [[BASE:%.]] = getelementptr inbounds [1024 x i8], [1024 x i8] [[BYTES]], i64 0, i64 0
				; TUNIT-NEXT: [[R:%.]] = call i8 @sum_two_different_loads(i8 nocapture nofree noundef nonnull readonly align 2 dereferenceable(1022) [[I0]], i8* nocapture nofree noundef nonnull readonly dereferenceable(1021) [[I1]]) #[[ATTR2]]
				; TUNIT-NEXT: ret i8 [[R]]
				;
				; CGSCC: Function Attrs: nofree nosync nounwind willreturn memory(none)
				; CGSCC-LABEL: define {{[^@]+}}@call_not_simplifiable_2
				; CGSCC-SAME: () #[[ATTR1]] {
				; CGSCC-NEXT: entry:
				; CGSCC-NEXT: [[BYTES:%.*]] = alloca [1024 x i8], align 16
				; CGSCC-NEXT: [[I0:%.]] = getelementptr inbounds [1024 x i8], [1024 x i8] [[BYTES]], i64 0, i64 2
				; CGSCC-NEXT: store i8 2, i8* [[I0]], align 2
				; CGSCC-NEXT: [[I1:%.]] = getelementptr inbounds [1024 x i8], [1024 x i8] [[BYTES]], i64 0, i64 3
				; CGSCC-NEXT: store i8 3, i8* [[I1]], align 1
				; CGSCC-NEXT: [[BASE:%.]] = getelementptr inbounds [1024 x i8], [1024 x i8] [[BYTES]], i64 0, i64 0
				; CGSCC-NEXT: [[R:%.]] = call i8 @sum_two_different_loads(i8 nocapture nofree noundef nonnull readonly align 2 dereferenceable(1022) [[I0]], i8* nocapture nofree noundef nonnull readonly dereferenceable(1021) [[I1]]) #[[ATTR3]]
				; CGSCC-NEXT: ret i8 [[R]]
				;
				entry:
				%Bytes = alloca [1024 x i8], align 16
				%i0 = getelementptr inbounds [1024 x i8], [1024 x i8]* %Bytes, i64 0, i64 2
				store i8 2, i8* %i0
				%i1 = getelementptr inbounds [1024 x i8], [1024 x i8]* %Bytes, i64 0, i64 3
				store i8 3, i8* %i1
				%base = getelementptr inbounds [1024 x i8], [1024 x i8]* %Bytes, i64 0, i64 0
				%r = call i8 @sum_two_different_loads(i8* %i0, i8* %i1)
				ret i8 %r
				}

	;.			;.
	; TUNIT: attributes #[[ATTR0]] = { nofree norecurse nosync nounwind willreturn memory(argmem: read) }			; TUNIT: attributes #[[ATTR0]] = { nofree norecurse nosync nounwind willreturn memory(argmem: read) }
	; TUNIT: attributes #[[ATTR1]] = { nofree norecurse nosync nounwind willreturn memory(none) }			; TUNIT: attributes #[[ATTR1]] = { nofree norecurse nosync nounwind willreturn memory(none) }
	; TUNIT: attributes #[[ATTR2]] = { nofree nosync nounwind willreturn }			; TUNIT: attributes #[[ATTR2]] = { nofree nosync nounwind willreturn }
	;.			;.
	; CGSCC: attributes #[[ATTR0]] = { nofree norecurse nosync nounwind willreturn memory(argmem: read) }			; CGSCC: attributes #[[ATTR0]] = { nofree norecurse nosync nounwind willreturn memory(argmem: read) }
	; CGSCC: attributes #[[ATTR1]] = { nofree nosync nounwind willreturn memory(none) }			; CGSCC: attributes #[[ATTR1]] = { nofree nosync nounwind willreturn memory(none) }
	; CGSCC: attributes #[[ATTR2]] = { willreturn }			; CGSCC: attributes #[[ATTR2]] = { nofree nosync nounwind willreturn memory(argmem: read) }
				; CGSCC: attributes #[[ATTR3]] = { willreturn }
				; CGSCC: attributes #[[ATTR4]] = { willreturn memory(read) }
	;.			;.

llvm/test/Transforms/Attributor/value-simplify-pointer-info.ll

	Show First 20 Lines • Show All 737 Lines • ▼ Show 20 Lines
	; TUNIT-NEXT: store float 0x40119999A0000000, float* [[F2]], align 4, !tbaa [[TBAA10]]			; TUNIT-NEXT: store float 0x40119999A0000000, float* [[F2]], align 4, !tbaa [[TBAA10]]
	; TUNIT-NEXT: [[F3:%.]] = getelementptr inbounds [[STRUCT_S]], %struct.S [[AGG_RESULT]], i64 0, i32 5			; TUNIT-NEXT: [[F3:%.]] = getelementptr inbounds [[STRUCT_S]], %struct.S [[AGG_RESULT]], i64 0, i32 5
	; TUNIT-NEXT: store float 0x40119999A0000000, float* [[F3]], align 4, !tbaa [[TBAA11]]			; TUNIT-NEXT: store float 0x40119999A0000000, float* [[F3]], align 4, !tbaa [[TBAA11]]
	; TUNIT-NEXT: [[I1:%.]] = getelementptr inbounds [[STRUCT_S]], %struct.S [[AGG_RESULT]], i64 0, i32 0			; TUNIT-NEXT: [[I1:%.]] = getelementptr inbounds [[STRUCT_S]], %struct.S [[AGG_RESULT]], i64 0, i32 0
	; TUNIT-NEXT: store i32 1, i32* [[I1]], align 4, !tbaa [[TBAA12]]			; TUNIT-NEXT: store i32 1, i32* [[I1]], align 4, !tbaa [[TBAA12]]
	; TUNIT-NEXT: [[I2:%.]] = getelementptr inbounds [[STRUCT_S]], %struct.S [[AGG_RESULT]], i64 0, i32 1			; TUNIT-NEXT: [[I2:%.]] = getelementptr inbounds [[STRUCT_S]], %struct.S [[AGG_RESULT]], i64 0, i32 1
	; TUNIT-NEXT: store i32 4, i32* [[I2]], align 4, !tbaa [[TBAA13]]			; TUNIT-NEXT: store i32 4, i32* [[I2]], align 4, !tbaa [[TBAA13]]
	; TUNIT-NEXT: [[I3:%.]] = getelementptr inbounds [[STRUCT_S]], %struct.S [[AGG_RESULT]], i64 0, i32 2			; TUNIT-NEXT: [[I3:%.]] = getelementptr inbounds [[STRUCT_S]], %struct.S [[AGG_RESULT]], i64 0, i32 2
	; TUNIT-NEXT: store i32 4, i32* [[I3]], align 4, !tbaa [[TBAA14]]			; TUNIT-NEXT: store i32 4, i32* [[I3]], align 4, !tbaa [[TBAA14]]
				jdoerfertUnsubmitted Not Done Reply Inline Actions Why do we miss out on these propagations? There are no PHIs involved, right? Something is amiss. jdoerfert: Why do we miss out on these propagations? There are no PHIs involved, right? Something is amiss.
				sameerdsAuthorUnsubmitted Done Reply Inline Actions When I don't distinguish between calls that reach the same remote inst, that remote inst is conservatively treated as unknown. I thought it would be okay to "improve" this later, but then realized that the solution is very relevant to the refactoring. The new change does it right, and this side-effect disappears. sameerds: When I don't distinguish between calls that reach the same remote inst, that remote inst is…
	; TUNIT-NEXT: ret void			; TUNIT-NEXT: ret void
	;			;
	; CGSCC: Function Attrs: nofree nosync nounwind willreturn			; CGSCC: Function Attrs: nofree nosync nounwind willreturn
	; CGSCC-LABEL: define {{[^@]+}}@static_global_simplifiable_1			; CGSCC-LABEL: define {{[^@]+}}@static_global_simplifiable_1
	; CGSCC-SAME: (%struct.S* noalias nocapture nofree nonnull writeonly sret([[STRUCT_S:%.]]) align 4 dereferenceable(24) [[AGG_RESULT:%.]]) #[[ATTR3]] {			; CGSCC-SAME: (%struct.S* noalias nocapture nofree nonnull writeonly sret([[STRUCT_S:%.]]) align 4 dereferenceable(24) [[AGG_RESULT:%.]]) #[[ATTR3]] {
	; CGSCC-NEXT: entry:			; CGSCC-NEXT: entry:
	; CGSCC-NEXT: store float 0x3FF19999A0000000, float* getelementptr inbounds ([[STRUCT_S]], %struct.S* @Gs1, i64 0, i32 3), align 4, !tbaa [[TBAA7]]			; CGSCC-NEXT: store float 0x3FF19999A0000000, float* getelementptr inbounds ([[STRUCT_S]], %struct.S* @Gs1, i64 0, i32 3), align 4, !tbaa [[TBAA7]]
	; CGSCC-NEXT: store float 0x40019999A0000000, float* getelementptr inbounds ([[STRUCT_S]], %struct.S* @Gs1, i64 0, i32 4), align 4, !tbaa [[TBAA10]]			; CGSCC-NEXT: store float 0x40019999A0000000, float* getelementptr inbounds ([[STRUCT_S]], %struct.S* @Gs1, i64 0, i32 4), align 4, !tbaa [[TBAA10]]
	▲ Show 20 Lines • Show All 944 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: [[L2:%.]] = load i32, i32 [[GEP2]], align 4			; CHECK-NEXT: [[L2:%.]] = load i32, i32 [[GEP2]], align 4
	; CHECK-NEXT: [[L3:%.]] = load i32, i32 [[GEP3]], align 4			; CHECK-NEXT: [[L3:%.]] = load i32, i32 [[GEP3]], align 4
	; CHECK-NEXT: [[ADD1:%.*]] = add i32 [[L1]], [[L2]]			; CHECK-NEXT: [[ADD1:%.*]] = add i32 [[L1]], [[L2]]
	; CHECK-NEXT: [[ADD2:%.*]] = add i32 [[ADD1]], [[L3]]			; CHECK-NEXT: [[ADD2:%.*]] = add i32 [[ADD1]], [[L3]]
	; CHECK-NEXT: ret i32 [[ADD2]]			; CHECK-NEXT: ret i32 [[ADD2]]
	;			;
	entry:			entry:
	%s = alloca %struct.S, align 4			%s = alloca %struct.S, align 4
	%bc = bitcast %struct.S* %s to i32*			%bc = bitcast %struct.S* %s to i32*
				sameerdsAuthorUnsubmitted Done Reply Inline Actions Oops! This was not supposed to happen. Checking. sameerds: Oops! This was not supposed to happen. Checking.
	%gep1 = getelementptr inbounds %struct.S, %struct.S* %s, i64 0, i32 2			%gep1 = getelementptr inbounds %struct.S, %struct.S* %s, i64 0, i32 2
	%gep2 = getelementptr inbounds i32, i32* %bc, i32 %arg1			%gep2 = getelementptr inbounds i32, i32* %bc, i32 %arg1
	%gep3 = getelementptr inbounds i32, i32* %bc, i32 %arg2			%gep3 = getelementptr inbounds i32, i32* %bc, i32 %arg2
	store i32 7, i32* %gep1			store i32 7, i32* %gep1
	store i32 7, i32* %gep2			store i32 7, i32* %gep2
	store i32 7, i32* %gep3			store i32 7, i32* %gep3
	%l1 = load i32, i32* %gep1			%l1 = load i32, i32* %gep1
	%l2 = load i32, i32* %gep2			%l2 = load i32, i32* %gep2
	▲ Show 20 Lines • Show All 502 Lines • ▼ Show 20 Lines
	; TUNIT: Function Attrs: nofree norecurse nosync nounwind willreturn			; TUNIT: Function Attrs: nofree norecurse nosync nounwind willreturn
	; TUNIT-LABEL: define {{[^@]+}}@phi_no_store_2			; TUNIT-LABEL: define {{[^@]+}}@phi_no_store_2
	; TUNIT-SAME: () #[[ATTR3]] {			; TUNIT-SAME: () #[[ATTR3]] {
	; TUNIT-NEXT: entry:			; TUNIT-NEXT: entry:
	; TUNIT-NEXT: br label [[LOOP:%.*]]			; TUNIT-NEXT: br label [[LOOP:%.*]]
	; TUNIT: loop:			; TUNIT: loop:
	; TUNIT-NEXT: [[P:%.]] = phi i8 [ bitcast (i32* @a2 to i8), [[ENTRY:%.]] ], [ [[G:%.*]], [[LOOP]] ]			; TUNIT-NEXT: [[P:%.]] = phi i8 [ bitcast (i32* @a2 to i8), [[ENTRY:%.]] ], [ [[G:%.*]], [[LOOP]] ]
	; TUNIT-NEXT: [[I:%.]] = phi i8 [ 0, [[ENTRY]] ], [ [[O:%.]], [[LOOP]] ]			; TUNIT-NEXT: [[I:%.]] = phi i8 [ 0, [[ENTRY]] ], [ [[O:%.]], [[LOOP]] ]
				; TUNIT-NEXT: store i8 1, i8* [[P]], align 2
	; TUNIT-NEXT: [[G]] = getelementptr i8, i8* bitcast (i32* @a2 to i8*), i64 2			; TUNIT-NEXT: [[G]] = getelementptr i8, i8* bitcast (i32* @a2 to i8*), i64 2
	; TUNIT-NEXT: [[O]] = add nsw i8 [[I]], 1			; TUNIT-NEXT: [[O]] = add nsw i8 [[I]], 1
	; TUNIT-NEXT: [[C:%.*]] = icmp eq i8 [[O]], 7			; TUNIT-NEXT: [[C:%.*]] = icmp eq i8 [[O]], 7
	; TUNIT-NEXT: br i1 [[C]], label [[END:%.*]], label [[LOOP]]			; TUNIT-NEXT: br i1 [[C]], label [[END:%.*]], label [[LOOP]]
	; TUNIT: end:			; TUNIT: end:
	; TUNIT-NEXT: [[L21:%.]] = load i8, i8 getelementptr (i8, i8* bitcast (i32* @a2 to i8*), i64 2), align 2			; TUNIT-NEXT: [[L21:%.]] = load i8, i8 getelementptr (i8, i8* bitcast (i32* @a2 to i8*), i64 2), align 2
	; TUNIT-NEXT: [[L22:%.]] = load i8, i8 getelementptr (i8, i8* bitcast (i32* @a2 to i8*), i64 3), align 1			; TUNIT-NEXT: [[L22:%.]] = load i8, i8 getelementptr (i8, i8* bitcast (i32* @a2 to i8*), i64 3), align 1
	; TUNIT-NEXT: [[ADD:%.*]] = add i8 [[L21]], [[L22]]			; TUNIT-NEXT: [[ADD:%.*]] = add i8 [[L21]], [[L22]]
	; TUNIT-NEXT: ret i8 [[ADD]]			; TUNIT-NEXT: ret i8 [[ADD]]
	;			;
	; CGSCC: Function Attrs: nofree norecurse nosync nounwind willreturn			; CGSCC: Function Attrs: nofree norecurse nosync nounwind willreturn
	; CGSCC-LABEL: define {{[^@]+}}@phi_no_store_2			; CGSCC-LABEL: define {{[^@]+}}@phi_no_store_2
	; CGSCC-SAME: () #[[ATTR5]] {			; CGSCC-SAME: () #[[ATTR5]] {
	; CGSCC-NEXT: entry:			; CGSCC-NEXT: entry:
	; CGSCC-NEXT: br label [[LOOP:%.*]]			; CGSCC-NEXT: br label [[LOOP:%.*]]
	; CGSCC: loop:			; CGSCC: loop:
	; CGSCC-NEXT: [[P:%.]] = phi i8 [ bitcast (i32* @a2 to i8), [[ENTRY:%.]] ], [ [[G:%.*]], [[LOOP]] ]			; CGSCC-NEXT: [[P:%.]] = phi i8 [ bitcast (i32* @a2 to i8), [[ENTRY:%.]] ], [ [[G:%.*]], [[LOOP]] ]
	; CGSCC-NEXT: [[I:%.]] = phi i8 [ 0, [[ENTRY]] ], [ [[O:%.]], [[LOOP]] ]			; CGSCC-NEXT: [[I:%.]] = phi i8 [ 0, [[ENTRY]] ], [ [[O:%.]], [[LOOP]] ]
				; CGSCC-NEXT: store i8 1, i8* [[P]], align 2
	; CGSCC-NEXT: [[G]] = getelementptr i8, i8* bitcast (i32* @a2 to i8*), i64 2			; CGSCC-NEXT: [[G]] = getelementptr i8, i8* bitcast (i32* @a2 to i8*), i64 2
	; CGSCC-NEXT: [[O]] = add nsw i8 [[I]], 1			; CGSCC-NEXT: [[O]] = add nsw i8 [[I]], 1
	; CGSCC-NEXT: [[C:%.*]] = icmp eq i8 [[O]], 7			; CGSCC-NEXT: [[C:%.*]] = icmp eq i8 [[O]], 7
	; CGSCC-NEXT: br i1 [[C]], label [[END:%.*]], label [[LOOP]]			; CGSCC-NEXT: br i1 [[C]], label [[END:%.*]], label [[LOOP]]
	; CGSCC: end:			; CGSCC: end:
	; CGSCC-NEXT: [[L21:%.]] = load i8, i8 getelementptr (i8, i8* bitcast (i32* @a2 to i8*), i64 2), align 2			; CGSCC-NEXT: [[L21:%.]] = load i8, i8 getelementptr (i8, i8* bitcast (i32* @a2 to i8*), i64 2), align 2
	; CGSCC-NEXT: [[L22:%.]] = load i8, i8 getelementptr (i8, i8* bitcast (i32* @a2 to i8*), i64 3), align 1			; CGSCC-NEXT: [[L22:%.]] = load i8, i8 getelementptr (i8, i8* bitcast (i32* @a2 to i8*), i64 3), align 1
	; CGSCC-NEXT: [[ADD:%.*]] = add i8 [[L21]], [[L22]]			; CGSCC-NEXT: [[ADD:%.*]] = add i8 [[L21]], [[L22]]
	Show All 24 Lines
	; TUNIT-LABEL: define {{[^@]+}}@phi_no_store_3			; TUNIT-LABEL: define {{[^@]+}}@phi_no_store_3
	; TUNIT-SAME: () #[[ATTR3]] {			; TUNIT-SAME: () #[[ATTR3]] {
	; TUNIT-NEXT: entry:			; TUNIT-NEXT: entry:
	; TUNIT-NEXT: store i8 0, i8* getelementptr (i8, i8* bitcast (i32* @a3 to i8*), i64 3), align 1			; TUNIT-NEXT: store i8 0, i8* getelementptr (i8, i8* bitcast (i32* @a3 to i8*), i64 3), align 1
	; TUNIT-NEXT: br label [[LOOP:%.*]]			; TUNIT-NEXT: br label [[LOOP:%.*]]
	; TUNIT: loop:			; TUNIT: loop:
	; TUNIT-NEXT: [[P:%.]] = phi i8 [ bitcast (i32* @a3 to i8), [[ENTRY:%.]] ], [ [[G:%.*]], [[LOOP]] ]			; TUNIT-NEXT: [[P:%.]] = phi i8 [ bitcast (i32* @a3 to i8), [[ENTRY:%.]] ], [ [[G:%.*]], [[LOOP]] ]
	; TUNIT-NEXT: [[I:%.]] = phi i8 [ 0, [[ENTRY]] ], [ [[O:%.]], [[LOOP]] ]			; TUNIT-NEXT: [[I:%.]] = phi i8 [ 0, [[ENTRY]] ], [ [[O:%.]], [[LOOP]] ]
				; TUNIT-NEXT: store i8 1, i8* [[P]], align 2
	; TUNIT-NEXT: [[G]] = getelementptr i8, i8* bitcast (i32* @a3 to i8*), i64 2			; TUNIT-NEXT: [[G]] = getelementptr i8, i8* bitcast (i32* @a3 to i8*), i64 2
	; TUNIT-NEXT: [[O]] = add nsw i8 [[I]], 1			; TUNIT-NEXT: [[O]] = add nsw i8 [[I]], 1
	; TUNIT-NEXT: [[C:%.*]] = icmp eq i8 [[O]], 7			; TUNIT-NEXT: [[C:%.*]] = icmp eq i8 [[O]], 7
	; TUNIT-NEXT: br i1 [[C]], label [[END:%.*]], label [[LOOP]]			; TUNIT-NEXT: br i1 [[C]], label [[END:%.*]], label [[LOOP]]
	; TUNIT: end:			; TUNIT: end:
	; TUNIT-NEXT: [[L31:%.]] = load i8, i8 getelementptr (i8, i8* bitcast (i32* @a3 to i8*), i64 2), align 2			; TUNIT-NEXT: [[L31:%.]] = load i8, i8 getelementptr (i8, i8* bitcast (i32* @a3 to i8*), i64 2), align 2
	; TUNIT-NEXT: [[L32:%.]] = load i8, i8 getelementptr (i8, i8* bitcast (i32* @a3 to i8*), i64 3), align 1			; TUNIT-NEXT: [[L32:%.]] = load i8, i8 getelementptr (i8, i8* bitcast (i32* @a3 to i8*), i64 3), align 1
	; TUNIT-NEXT: [[ADD:%.*]] = add i8 [[L31]], [[L32]]			; TUNIT-NEXT: [[ADD:%.*]] = add i8 [[L31]], [[L32]]
	; TUNIT-NEXT: [[L34:%.]] = load i8, i8 bitcast (i32* getelementptr inbounds (i32, i32* @a3, i64 1) to i8*), align 4			; TUNIT-NEXT: [[L34:%.]] = load i8, i8 bitcast (i32* getelementptr inbounds (i32, i32* @a3, i64 1) to i8*), align 4
	; TUNIT-NEXT: [[ADD2:%.*]] = add i8 [[ADD]], [[L34]]			; TUNIT-NEXT: [[ADD2:%.*]] = add i8 [[ADD]], [[L34]]
	; TUNIT-NEXT: ret i8 [[ADD2]]			; TUNIT-NEXT: ret i8 [[ADD2]]
	;			;
	; CGSCC: Function Attrs: nofree norecurse nosync nounwind willreturn			; CGSCC: Function Attrs: nofree norecurse nosync nounwind willreturn
	; CGSCC-LABEL: define {{[^@]+}}@phi_no_store_3			; CGSCC-LABEL: define {{[^@]+}}@phi_no_store_3
	; CGSCC-SAME: () #[[ATTR5]] {			; CGSCC-SAME: () #[[ATTR5]] {
	; CGSCC-NEXT: entry:			; CGSCC-NEXT: entry:
	; CGSCC-NEXT: store i8 0, i8* getelementptr (i8, i8* bitcast (i32* @a3 to i8*), i64 3), align 1			; CGSCC-NEXT: store i8 0, i8* getelementptr (i8, i8* bitcast (i32* @a3 to i8*), i64 3), align 1
	; CGSCC-NEXT: br label [[LOOP:%.*]]			; CGSCC-NEXT: br label [[LOOP:%.*]]
	; CGSCC: loop:			; CGSCC: loop:
	; CGSCC-NEXT: [[P:%.]] = phi i8 [ bitcast (i32* @a3 to i8), [[ENTRY:%.]] ], [ [[G:%.*]], [[LOOP]] ]			; CGSCC-NEXT: [[P:%.]] = phi i8 [ bitcast (i32* @a3 to i8), [[ENTRY:%.]] ], [ [[G:%.*]], [[LOOP]] ]
	; CGSCC-NEXT: [[I:%.]] = phi i8 [ 0, [[ENTRY]] ], [ [[O:%.]], [[LOOP]] ]			; CGSCC-NEXT: [[I:%.]] = phi i8 [ 0, [[ENTRY]] ], [ [[O:%.]], [[LOOP]] ]
				; CGSCC-NEXT: store i8 1, i8* [[P]], align 2
	; CGSCC-NEXT: [[G]] = getelementptr i8, i8* bitcast (i32* @a3 to i8*), i64 2			; CGSCC-NEXT: [[G]] = getelementptr i8, i8* bitcast (i32* @a3 to i8*), i64 2
	; CGSCC-NEXT: [[O]] = add nsw i8 [[I]], 1			; CGSCC-NEXT: [[O]] = add nsw i8 [[I]], 1
	; CGSCC-NEXT: [[C:%.*]] = icmp eq i8 [[O]], 7			; CGSCC-NEXT: [[C:%.*]] = icmp eq i8 [[O]], 7
	; CGSCC-NEXT: br i1 [[C]], label [[END:%.*]], label [[LOOP]]			; CGSCC-NEXT: br i1 [[C]], label [[END:%.*]], label [[LOOP]]
	; CGSCC: end:			; CGSCC: end:
	; CGSCC-NEXT: [[L31:%.]] = load i8, i8 getelementptr (i8, i8* bitcast (i32* @a3 to i8*), i64 2), align 2			; CGSCC-NEXT: [[L31:%.]] = load i8, i8 getelementptr (i8, i8* bitcast (i32* @a3 to i8*), i64 2), align 2
	; CGSCC-NEXT: [[L32:%.]] = load i8, i8 getelementptr (i8, i8* bitcast (i32* @a3 to i8*), i64 3), align 1			; CGSCC-NEXT: [[L32:%.]] = load i8, i8 getelementptr (i8, i8* bitcast (i32* @a3 to i8*), i64 3), align 1
	; CGSCC-NEXT: [[ADD:%.*]] = add i8 [[L31]], [[L32]]			; CGSCC-NEXT: [[ADD:%.*]] = add i8 [[L31]], [[L32]]
	▲ Show 20 Lines • Show All 785 Lines • ▼ Show 20 Lines
	; CGSCC-NEXT: ret void			; CGSCC-NEXT: ret void
	;			;
	%l = load i32, i32* %a			%l = load i32, i32* %a
	%sel = select i1 %c, i32 %l, i32 42			%sel = select i1 %c, i32 %l, i32 42
	store i32 %sel, i32* %a			store i32 %sel, i32* %a
	ret void			ret void
	}			}

				define i8 @multiple_offsets_not_simplifiable_1(i1 %cnd1, i1 %cnd2) {
				; TUNIT: Function Attrs: nofree norecurse nosync nounwind willreturn
				; TUNIT-LABEL: define {{[^@]+}}@multiple_offsets_not_simplifiable_1
				; TUNIT-SAME: (i1 [[CND1:%.]], i1 [[CND2:%.]]) #[[ATTR3]] {
				; TUNIT-NEXT: entry:
				; TUNIT-NEXT: [[BYTES:%.*]] = alloca [1024 x i8], align 16
				; TUNIT-NEXT: [[GEP7:%.]] = getelementptr inbounds [1024 x i8], [1024 x i8] [[BYTES]], i64 0, i64 7
				; TUNIT-NEXT: [[GEP23:%.]] = getelementptr inbounds [1024 x i8], [1024 x i8] [[BYTES]], i64 0, i64 23
				; TUNIT-NEXT: [[SEL_PTR:%.]] = select i1 [[CND1]], i8 [[GEP7]], i8* [[GEP23]]
				; TUNIT-NEXT: store i8 42, i8* [[SEL_PTR]], align 4
				; TUNIT-NEXT: [[I:%.]] = load i8, i8 [[GEP7]], align 4
				; TUNIT-NEXT: ret i8 [[I]]
				;
				; CGSCC: Function Attrs: nofree norecurse nosync nounwind willreturn
				; CGSCC-LABEL: define {{[^@]+}}@multiple_offsets_not_simplifiable_1
				; CGSCC-SAME: (i1 [[CND1:%.]], i1 [[CND2:%.]]) #[[ATTR5]] {
				; CGSCC-NEXT: entry:
				; CGSCC-NEXT: [[BYTES:%.*]] = alloca [1024 x i8], align 16
				; CGSCC-NEXT: [[GEP7:%.]] = getelementptr inbounds [1024 x i8], [1024 x i8] [[BYTES]], i64 0, i64 7
				; CGSCC-NEXT: [[GEP23:%.]] = getelementptr inbounds [1024 x i8], [1024 x i8] [[BYTES]], i64 0, i64 23
				; CGSCC-NEXT: [[SEL_PTR:%.]] = select i1 [[CND1]], i8 [[GEP7]], i8* [[GEP23]]
				; CGSCC-NEXT: store i8 42, i8* [[SEL_PTR]], align 4
				; CGSCC-NEXT: [[I:%.]] = load i8, i8 [[GEP7]], align 4
				; CGSCC-NEXT: ret i8 [[I]]
				;
				entry:
				%Bytes = alloca [1024 x i8], align 16
				%gep7 = getelementptr inbounds [1024 x i8], [1024 x i8]* %Bytes, i64 0, i64 7
				%gep23 = getelementptr inbounds [1024 x i8], [1024 x i8]* %Bytes, i64 0, i64 23
				; %phi.ptr = phi i8* [ %gep7, %then ], [ %gep23, %else ]
				%sel.ptr = select i1 %cnd1, i8* %gep7, i8* %gep23
				store i8 42, i8* %sel.ptr, align 4
				%i = load i8, i8* %gep7, align 4
				ret i8 %i
				}


	!llvm.module.flags = !{!0, !1}			!llvm.module.flags = !{!0, !1}
	!llvm.ident = !{!2}			!llvm.ident = !{!2}

	!0 = !{i32 1, !"wchar_size", i32 4}			!0 = !{i32 1, !"wchar_size", i32 4}
	!1 = !{i32 7, !"uwtable", i32 1}			!1 = !{i32 7, !"uwtable", i32 1}
	!2 = !{!"clang version 13.0.0"}			!2 = !{!"clang version 13.0.0"}
	!3 = !{!4, !4, i64 0}			!3 = !{!4, !4, i64 0}
	!4 = !{!"int", !5, i64 0}			!4 = !{!"int", !5, i64 0}
	▲ Show 20 Lines • Show All 141 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[AAPointerInfo] refactor how offsets and Access objects are trackedClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 475436

llvm/include/llvm/Transforms/IPO/Attributor.h

llvm/lib/Transforms/IPO/Attributor.cpp

llvm/lib/Transforms/IPO/AttributorAttributes.cpp

llvm/test/Transforms/Attributor/call-simplify-pointer-info.ll

llvm/test/Transforms/Attributor/value-simplify-pointer-info.ll

[AAPointerInfo] refactor how offsets and Access objects are tracked
ClosedPublic