This is an archive of the discontinued LLVM Phabricator instance.

llvm/include/llvm/IR/Metadata.h
1217	This should probably be part of D92887 rather than this one.
llvm/include/llvm/Transforms/Utils/Cloning.h
294	Is this needed? I originally introduced this class in order to reuse it between inlining and unrolling, but as far as I can tell, you do not make use of it later in the patch series (D92887 implements a separate set of helper functions). If that's right, then I would simply drop these changes and keep the original code.

nikic mentioned this in D92887: [LoopUnroll] Use llvm.experimental.noalias.scope.decl for duplicating noalias metadata as needed.Jan 8 2021, 12:12 PM

jeroen.dobbelaere added inline comments.Jan 9 2021, 7:30 AM

llvm/include/llvm/Transforms/Utils/Cloning.h
294	I like the refactoring you did. It also makes it easier to put part of the mechanism early, which is needed (at least in the full restrict case, probably also here) when inlining recursive functions. In that case the caller and callee are the same and gathering things to clone before any change is done was necessary. I'll move the infrastructure back to InlineFunction as it will indeed only be used there.

Update according to comments
Keep scope cloning refactoring local

jeroen.dobbelaere removed a child revision: D93042: [noalias.decl] Look through llvm.experimental.noalias.scope.decl.Jan 9 2021, 9:54 AM

jeroen.dobbelaere added a parent revision: D93042: [noalias.decl] Look through llvm.experimental.noalias.scope.decl.

jeroen.dobbelaere added a child revision: D92887: [LoopUnroll] Use llvm.experimental.noalias.scope.decl for duplicating noalias metadata as needed.

jeroen.dobbelaere removed a parent revision: D93039: Introduce llvm.noalias.decl intrinsic.

nikic added inline comments.Jan 17 2021, 9:35 AM

llvm/lib/Transforms/Utils/InlineFunction.cpp
831	The mention of unrolling here no longer makes sense.
835	While this collects the metadata on the experimental.noalias.scope.decl, I'm not seeing any code that would actually update it on the cloned instructions. remap() only checks the noalias and alias.scope metadata, no? Looking through the tests, I think we're missing a test case that inlines two levels, so that the first creates the intrinsic, and the second has to properly rename its operand.
llvm/test/Transforms/Coroutines/coro-retcon-resume-values.ll
2	You need to add `-aa-pipeline=default` to avoid the NPM regression.

Thanks for the review ! I'll try to come with an update later today.

llvm/lib/Transforms/Utils/InlineFunction.cpp
835	Good catch, it indeed seems that I missed the remapping of the 'llvm.experimental.noalias.scope.decl' argument in this version :( More tests is always better. I'll add one for this, and also one for the caller == callee case.
llvm/test/Transforms/Coroutines/coro-retcon-resume-values.ll
2	That seems to do the trick.

When working on a test with recursion and inlining, I was thinking about the differences between duplicating scopes in the unrolling (D92887) and duplicating the scopes during the inlining. The refactored code for the inlining keeps the same behavior as before and does a 'deep clone': all the MDNodes and there dependencies are cloned. So the scopes themselves become unique, but also the (function) domains to which they belong are duplicated.

In the utilities added for unrolling (D92887), only the scopes are duplicated. Their domains are kept. And that makes sense as the domain represents the function to which the scope belongs.
The side effect of this, is that the scope layout can be different with different pass orders: loop-unroll after inlining or inlining after loop-unroll will make a difference on how the domains are represented.

IMHO, it would make sense to not do the deep copy for inlining. The scope domain of the inlined function can be kept and shared by multiple functions and multiple inlines, as long as the scopes themselves are unique.
On the other hand, a change in the domain handling can have an effect on the speed of analysis, as an initial filtering is done based on the domains. I don't think we should do this kind of change right now, but it is something we can keep in mind.

Adapted to comments.
Added new testcase for recursive inlining.

jeroen.dobbelaere marked 3 inline comments as done.Jan 19 2021, 9:13 AM

jeroen.dobbelaere added a parent revision: D94978: [NFC] cleanup noalias2.ll test.Jan 19 2021, 9:20 AM

LGTM. I would simplify the MetadataAsValue cloning a bit, but it's not particularly important either.

llvm/lib/Transforms/Utils/InlineFunction.cpp
832	I don't think it makes sense to separately track MDV and MDVMap. I would create the new MetadataAsValue directly in remap: if (auto II = dyn_cast<IntrinsicInst>(I)) { if (II->getIntrinsicID() == Intrinsic::experimental_noalias_scope_decl) { auto MV = cast<MetadataAsValue>( II->getOperand(Intrinsic::NoAliasScopeDeclScopeArg))); auto *NewMV = MetadataAsValue::get( I->getContext(), MDMap[cast<MDNode>(MV->getMetadata())]); II->setOperand(Intrinsic::NoAliasScopeDeclScopeArg, NewMV); } } Or so.
834	These can be dropped (inlining performs only one clone).
1697	The above part of the comment is repeated below. You might want to drop it in one place.
llvm/test/Transforms/Coroutines/coro-retcon-resume-values.ll
2	You can drop the `--check-prefixes` now.

This revision is now accepted and ready to land.Jan 19 2021, 12:51 PM

Adapted to comments. Simplified the cloning of the MetadataAsValue.

Thanks @nikic for the fast and helpful reviews !

@jdoerfert, could you have an extra look at this one (and the follow ups) ? Thanks !

Ping @jdoerfert Do you want to check over this patch and the LoopUnroll/LoopRotate ones before they go in?

FWIW, this also lgtm.

I am going to commit this patch today, so that we have a view on possible fallout.

Closed by commit rG2b9a834c43cb: [InlineFunction] Use llvm.experimental.noalias.scope.decl for noalias arguments. (authored by jeroen.dobbelaere). · Explain WhyJan 23 2021, 3:12 AM

This revision was automatically updated to reflect the committed changes.

jeroen.dobbelaere added a commit: rG2b9a834c43cb: [InlineFunction] Use llvm.experimental.noalias.scope.decl for noalias arguments..

Herald added a project: Restricted Project. · View Herald TranscriptJan 23 2021, 3:12 AM

Herald added a subscriber: cfe-commits. · View Herald Transcript

jeroen.dobbelaere mentioned this in D95141: [InstCombine] Remove unused llvm.experimental.noalias.scope.decl.Jan 23 2021, 5:54 AM

http://llvm-compile-time-tracker.com/compare.php?from=344afa853fcfcc085cb5c957b4a07c7ea013bb1b&to=2b9a834c43cb1f93d33958c14b695896bb4e9c1e&stat=size-text

Codesize regression 1% for tramp3d. Can you check it?

In D93040#2517758, @xbolva00 wrote:

http://llvm-compile-time-tracker.com/compare.php?from=344afa853fcfcc085cb5c957b4a07c7ea013bb1b&to=2b9a834c43cb1f93d33958c14b695896bb4e9c1e&stat=size-text

Codesize regression 1% for tramp3d. Can you check it?

I assume this to be a secondary effect of having the instructions in the first place. Maybe some unroll or inline size threshold needs to be thought about them. At the end of the day, we might not be able to avoid something like this as we make !noalias correct, though, I imagine the threshold theory which can be resolved.

In D93040#2517769, @jdoerfert wrote:

In D93040#2517758, @xbolva00 wrote:

http://llvm-compile-time-tracker.com/compare.php?from=344afa853fcfcc085cb5c957b4a07c7ea013bb1b&to=2b9a834c43cb1f93d33958c14b695896bb4e9c1e&stat=size-text

Codesize regression 1% for tramp3d. Can you check it?

I assume this to be a secondary effect of having the instructions in the first place. Maybe some unroll or inline size threshold needs to be thought about them. At the end of the day, we might not be able to avoid something like this as we make !noalias correct, though, I imagine the threshold theory which can be resolved.

The loop unrolling/rotating and the cleanup patches are not yet committed. You can see the effect of those here:

http://llvm-compile-time-tracker.com/compare.php?from=344afa853fcfcc085cb5c957b4a07c7ea013bb1b&to=eaf871f4e7fde26cd755cc4c2d67f2c244c66f18&stat=size-text

Also see D95141 for links to more results.

jeroen.dobbelaere mentioned this in rG659c7bcde62e: [LoopRotate] Use llvm.experimental.noalias.scope.decl for duplicating noalias….Jan 24 2021, 4:55 AM

jeroen.dobbelaere mentioned this in D68484: [PATCH 01/27] [noalias] LangRef: noalias intrinsics and ptr_provenance documentation..Jan 29 2021, 12:31 AM

jeroen.dobbelaere added inline comments.Feb 1 2021, 2:09 AM

llvm/lib/Transforms/Utils/InlineFunction.cpp
828–829	@nikic In the full restrict patches, we also check if the instruction was already handled. I was able to trigger this with an assertion and I have a more or less reduced testcase. Either we keep a SmallPtrSet and check if the instruction was already handled (this is what the full restrict version does; See D68509 InlineFunction.cpp#969). Or we only replace the metadata if it is in the MDMap (by using MDMap.lookup(M). Any preference ?

nikic added inline comments.Feb 1 2021, 3:27 AM

llvm/lib/Transforms/Utils/InlineFunction.cpp
828–829	I don't understand under what circumstances we'd handle an instruction twice.

Revision Contents

Path

Size

llvm/

include/

llvm/

IR/

Metadata.h

6 lines

Transforms/

Utils/

Cloning.h

24 lines

lib/

Transforms/

Utils/

CloneFunction.cpp

101 lines

InlineFunction.cpp

127 lines

test/

Transforms/

Coroutines/

ArgAddr.ll

2 lines

coro-retcon-resume-values.ll

28 lines

3 lines

3 lines

2 lines

2 lines

2 lines

Inline/

launder.invariant.group.ll

2 lines

noalias-calls-always.ll

82 lines

noalias-calls.ll

84 lines

noalias.ll

34 lines

noalias2.ll

79 lines

PhaseOrdering/

inlining-alignment-assumptions.ll

1 line

instcombine-sroa-inttoptr.ll

2 lines

pr39282.ll

16 lines

Diff 310905

llvm/include/llvm/IR/Metadata.h

Show First 20 Lines • Show All 1,203 Lines • ▼ Show 20 Lines	public:
const MDNode *getNode() const { return Node; }		const MDNode *getNode() const { return Node; }

/// Get the MDNode for this AliasScopeNode's domain.		/// Get the MDNode for this AliasScopeNode's domain.
const MDNode *getDomain() const {		const MDNode *getDomain() const {
if (Node->getNumOperands() < 2)		if (Node->getNumOperands() < 2)
return nullptr;		return nullptr;
return dyn_cast_or_null<MDNode>(Node->getOperand(1));		return dyn_cast_or_null<MDNode>(Node->getOperand(1));
}		}
		StringRef getName() const {
		if (Node->getNumOperands() > 2)
		if (MDString *N = dyn_cast_or_null<MDString>(Node->getOperand(2)))
		return N->getString();
		return StringRef();
		}
		nikicUnsubmitted Not Done Reply Inline Actions This should probably be part of D92887 rather than this one. nikic: This should probably be part of D92887 rather than this one.
};		};

/// Typed iterator through MDNode operands.		/// Typed iterator through MDNode operands.
///		///
/// An iterator that transforms an \a MDNode::iterator into an iterator over a		/// An iterator that transforms an \a MDNode::iterator into an iterator over a
/// particular Metadata subclass.		/// particular Metadata subclass.
template <class T>		template <class T>
class TypedMDOperandIterator		class TypedMDOperandIterator
▲ Show 20 Lines • Show All 237 Lines • Show Last 20 Lines

llvm/include/llvm/Transforms/Utils/Cloning.h

	Show All 11 Lines
	// functions, to copying basic blocks to support loop unrolling or superblock			// functions, to copying basic blocks to support loop unrolling or superblock
	// formation, etc.			// formation, etc.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef LLVM_TRANSFORMS_UTILS_CLONING_H			#ifndef LLVM_TRANSFORMS_UTILS_CLONING_H
	#define LLVM_TRANSFORMS_UTILS_CLONING_H			#define LLVM_TRANSFORMS_UTILS_CLONING_H

				#include "llvm/ADT/SetVector.h"
	#include "llvm/ADT/SmallVector.h"			#include "llvm/ADT/SmallVector.h"
	#include "llvm/ADT/Twine.h"			#include "llvm/ADT/Twine.h"
	#include "llvm/Analysis/AssumptionCache.h"			#include "llvm/Analysis/AssumptionCache.h"
	#include "llvm/Analysis/InlineCost.h"			#include "llvm/Analysis/InlineCost.h"
	#include "llvm/IR/ValueHandle.h"			#include "llvm/IR/ValueHandle.h"
	#include "llvm/Transforms/Utils/ValueMapper.h"			#include "llvm/Transforms/Utils/ValueMapper.h"
	#include <functional>			#include <functional>
	#include <memory>			#include <memory>
	▲ Show 20 Lines • Show All 235 Lines • ▼ Show 20 Lines

	/// Updates profile information by adjusting the entry count by adding			/// Updates profile information by adjusting the entry count by adding
	/// entryDelta then scaling callsite information by the new count divided by the			/// entryDelta then scaling callsite information by the new count divided by the
	/// old count. VMap is used during inlinng to also update the new clone			/// old count. VMap is used during inlinng to also update the new clone
	void updateProfileCallee(			void updateProfileCallee(
	Function *Callee, int64_t entryDelta,			Function *Callee, int64_t entryDelta,
	const ValueMap<const Value , WeakTrackingVH> VMap = nullptr);			const ValueMap<const Value , WeakTrackingVH> VMap = nullptr);

	} // end namespace llvm			/// Utility for cloning !noalias and !alias.scope metadata. When a code region
				/// using scoped alias metadata is cloned, the aliasing relationships may not
				/// hold between the two clones, in which case it is necessary to clone the
				/// metadata using this utility. This comes up with inlining and unrolling.
				class ScopedAliasMetadataCloner {
				using MetadataMap = DenseMap<const MDNode *, TrackingMDNodeRef>;
				SetVector<const MDNode *> MD;
				MetadataMap Map;
				void addRecursiveMetadataUses();

				public:
				ScopedAliasMetadataCloner(ArrayRef<BasicBlock *> Blocks);
				ScopedAliasMetadataCloner(const Function *F);

				/// Create a new clone of the scoped alias metadata, which will be used by
				/// subsequent remap() calls.
				void clone();

				/// Remap instructions in the given VMap from the original to the cloned
				/// metadata.
				void remap(ValueToValueMapTy &VMap);
				};
				} // end namespace llvm
				nikicUnsubmitted Not Done Reply Inline Actions Is this needed? I originally introduced this class in order to reuse it between inlining and unrolling, but as far as I can tell, you do not make use of it later in the patch series (D92887 implements a separate set of helper functions). If that's right, then I would simply drop these changes and keep the original code. nikic: Is this needed? I originally introduced this class in order to reuse it between inlining and…
				jeroen.dobbelaereAuthorUnsubmitted Done Reply Inline Actions I like the refactoring you did. It also makes it easier to put part of the mechanism early, which is needed (at least in the full restrict case, probably also here) when inlining recursive functions. In that case the caller and callee are the same and gathering things to clone before any change is done was necessary. I'll move the infrastructure back to InlineFunction as it will indeed only be used there. jeroen.dobbelaere: I like the refactoring you did. It also makes it easier to put part of the mechanism early…
	#endif // LLVM_TRANSFORMS_UTILS_CLONING_H			#endif // LLVM_TRANSFORMS_UTILS_CLONING_H

llvm/lib/Transforms/Utils/CloneFunction.cpp

Show All 21 Lines
#include "llvm/IR/Constants.h"		#include "llvm/IR/Constants.h"
#include "llvm/IR/DebugInfo.h"		#include "llvm/IR/DebugInfo.h"
#include "llvm/IR/DerivedTypes.h"		#include "llvm/IR/DerivedTypes.h"
#include "llvm/IR/Function.h"		#include "llvm/IR/Function.h"
#include "llvm/IR/GlobalVariable.h"		#include "llvm/IR/GlobalVariable.h"
#include "llvm/IR/Instructions.h"		#include "llvm/IR/Instructions.h"
#include "llvm/IR/IntrinsicInst.h"		#include "llvm/IR/IntrinsicInst.h"
#include "llvm/IR/LLVMContext.h"		#include "llvm/IR/LLVMContext.h"
		#include "llvm/IR/MDBuilder.h"
#include "llvm/IR/Metadata.h"		#include "llvm/IR/Metadata.h"
#include "llvm/IR/Module.h"		#include "llvm/IR/Module.h"
#include "llvm/Transforms/Utils/BasicBlockUtils.h"		#include "llvm/Transforms/Utils/BasicBlockUtils.h"
#include "llvm/Transforms/Utils/Cloning.h"		#include "llvm/Transforms/Utils/Cloning.h"
#include "llvm/Transforms/Utils/Local.h"		#include "llvm/Transforms/Utils/Local.h"
#include "llvm/Transforms/Utils/ValueMapper.h"		#include "llvm/Transforms/Utils/ValueMapper.h"
#include <map>		#include <map>
using namespace llvm;		using namespace llvm;

		#define DEBUG_TYPE "clone-function"

/// See comments in Cloning.h.		/// See comments in Cloning.h.
BasicBlock llvm::CloneBasicBlock(const BasicBlock BB, ValueToValueMapTy &VMap,		BasicBlock llvm::CloneBasicBlock(const BasicBlock BB, ValueToValueMapTy &VMap,
const Twine &NameSuffix, Function *F,		const Twine &NameSuffix, Function *F,
ClonedCodeInfo *CodeInfo,		ClonedCodeInfo *CodeInfo,
DebugInfoFinder *DIFinder) {		DebugInfoFinder *DIFinder) {
DenseMap<const MDNode , MDNode > Cache;		DenseMap<const MDNode , MDNode > Cache;
BasicBlock *NewBB = BasicBlock::Create(BB->getContext(), "", F);		BasicBlock *NewBB = BasicBlock::Create(BB->getContext(), "", F);
if (BB->hasName())		if (BB->hasName())
▲ Show 20 Lines • Show All 831 Lines • ▼ Show 20 Lines	for (unsigned i = 0, e = New->getNumOperands(); i != e; ++i)
auto I = ValueMapping.find(Inst);		auto I = ValueMapping.find(Inst);
if (I != ValueMapping.end())		if (I != ValueMapping.end())
New->setOperand(i, I->second);		New->setOperand(i, I->second);
}		}
}		}

return NewBB;		return NewBB;
}		}

		ScopedAliasMetadataCloner::ScopedAliasMetadataCloner(
		ArrayRef<BasicBlock *> Blocks) {
		for (BasicBlock *BB : Blocks) {
		for (const Instruction &I : *BB) {
		if (const MDNode *M = I.getMetadata(LLVMContext::MD_alias_scope))
		MD.insert(M);
		if (const MDNode *M = I.getMetadata(LLVMContext::MD_noalias))
		MD.insert(M);
		}
		}
		addRecursiveMetadataUses();
		}

		ScopedAliasMetadataCloner::ScopedAliasMetadataCloner(const Function *F) {
		for (const BasicBlock &BB : *F) {
		for (const Instruction &I : BB) {
		if (const MDNode *M = I.getMetadata(LLVMContext::MD_alias_scope))
		MD.insert(M);
		if (const MDNode *M = I.getMetadata(LLVMContext::MD_noalias))
		MD.insert(M);

		// We also need to clone the metadata in noalias intrinsics.
		if (const auto *II = dyn_cast<IntrinsicInst>(&I))
		if (II->getIntrinsicID() == Intrinsic::noalias_decl)
		if (const auto *M = dyn_cast<MDNode>(
		cast<MetadataAsValue>(
		II->getOperand(Intrinsic::NoAliasDeclScopeArg))
		->getMetadata()))
		MD.insert(M);
		}
		}
		addRecursiveMetadataUses();
		}

		void ScopedAliasMetadataCloner::addRecursiveMetadataUses() {
		SmallVector<const Metadata *, 16> Queue(MD.begin(), MD.end());
		while (!Queue.empty()) {
		const MDNode *M = cast<MDNode>(Queue.pop_back_val());
		for (const Metadata *Op : M->operands())
		if (const MDNode *OpMD = dyn_cast<MDNode>(Op))
		if (MD.insert(OpMD))
		Queue.push_back(OpMD);
		}
		}

		void ScopedAliasMetadataCloner::clone() {
		// Discard a previous clone that may exist.
		Map.clear();

		SmallVector<TempMDTuple, 16> DummyNodes;
		for (const MDNode *I : MD) {
		DummyNodes.push_back(MDTuple::getTemporary(I->getContext(), None));
		Map[I].reset(DummyNodes.back().get());
		}

		// Create new metadata nodes to replace the dummy nodes, replacing old
		// metadata references with either a dummy node or an already-created new
		// node.
		SmallVector<Metadata *, 4> NewOps;
		for (const MDNode *I : MD) {
		for (const Metadata *Op : I->operands()) {
		if (const MDNode *M = dyn_cast<MDNode>(Op))
		NewOps.push_back(Map[M]);
		else
		NewOps.push_back(const_cast<Metadata *>(Op));
		}

		MDNode *NewM = MDNode::get(I->getContext(), NewOps);
		MDTuple *TempM = cast<MDTuple>(Map[I]);
		assert(TempM->isTemporary() && "Expected temporary node");

		TempM->replaceAllUsesWith(NewM);
		NewOps.clear();
		}
		}

		void ScopedAliasMetadataCloner::remap(ValueToValueMapTy &VMap) {
		if (Map.empty())
		return; // Nothing to do.

		for (auto Entry : VMap) {
		// Check that key is an instruction, to skip the Argument mapping, which
		// points to an instruction in the original function, not the inlined one.
		if (!Entry->second \|\| !isa<Instruction>(Entry->first))
		continue;

		Instruction *I = dyn_cast<Instruction>(Entry->second);
		if (!I)
		continue;

		if (MDNode *M = I->getMetadata(LLVMContext::MD_alias_scope))
		I->setMetadata(LLVMContext::MD_alias_scope, Map[M]);

		if (MDNode *M = I->getMetadata(LLVMContext::MD_noalias))
		I->setMetadata(LLVMContext::MD_noalias, Map[M]);
		}
		}

llvm/lib/Transforms/Utils/InlineFunction.cpp

Show First 20 Lines • Show All 73 Lines • ▼ Show 20 Lines
using namespace llvm;		using namespace llvm;
using ProfileCount = Function::ProfileCount;		using ProfileCount = Function::ProfileCount;

static cl::opt<bool>		static cl::opt<bool>
EnableNoAliasConversion("enable-noalias-to-md-conversion", cl::init(true),		EnableNoAliasConversion("enable-noalias-to-md-conversion", cl::init(true),
cl::Hidden,		cl::Hidden,
cl::desc("Convert noalias attributes to metadata during inlining."));		cl::desc("Convert noalias attributes to metadata during inlining."));

		static cl::opt<bool> UseNoAliasIntrinsic(
		"use-noalias-intrinsic-during-inlining", cl::Hidden, cl::ZeroOrMore,
		cl::init(true),
		cl::desc("Use the llvm.noalias.decl intrinsic during inlining."));

// Disabled by default, because the added alignment assumptions may increase		// Disabled by default, because the added alignment assumptions may increase
// compile-time and block optimizations. This option is not suitable for use		// compile-time and block optimizations. This option is not suitable for use
// with frontends that emit comprehensive parameter alignment annotations.		// with frontends that emit comprehensive parameter alignment annotations.
static cl::opt<bool>		static cl::opt<bool>
PreserveAlignmentAssumptions("preserve-alignment-assumptions-during-inlining",		PreserveAlignmentAssumptions("preserve-alignment-assumptions-during-inlining",
cl::init(false), cl::Hidden,		cl::init(false), cl::Hidden,
cl::desc("Convert align attributes to assumptions during inlining."));		cl::desc("Convert align attributes to assumptions during inlining."));

▲ Show 20 Lines • Show All 725 Lines • ▼ Show 20 Lines	if (AliasScope)
NI->setMetadata(LLVMContext::MD_alias_scope, MDNode::concatenate(		NI->setMetadata(LLVMContext::MD_alias_scope, MDNode::concatenate(
NI->getMetadata(LLVMContext::MD_alias_scope), AliasScope));		NI->getMetadata(LLVMContext::MD_alias_scope), AliasScope));

if (NoAlias)		if (NoAlias)
NI->setMetadata(LLVMContext::MD_noalias, MDNode::concatenate(		NI->setMetadata(LLVMContext::MD_noalias, MDNode::concatenate(
NI->getMetadata(LLVMContext::MD_noalias), NoAlias));		NI->getMetadata(LLVMContext::MD_noalias), NoAlias));
}		}
}		}

/// When inlining a function that contains noalias scope metadata,
/// this metadata needs to be cloned so that the inlined blocks
/// have different "unique scopes" at every call site. Were this not done, then
/// aliasing scopes from a function inlined into a caller multiple times could
/// not be differentiated (and this would lead to miscompiles because the
/// non-aliasing property communicated by the metadata could have
/// call-site-specific control dependencies).
static void CloneAliasScopeMetadata(CallBase &CB, ValueToValueMapTy &VMap) {
const Function *CalledFunc = CB.getCalledFunction();
SetVector<const MDNode *> MD;

// Note: We could only clone the metadata if it is already used in the
// caller. I'm omitting that check here because it might confuse
// inter-procedural alias analysis passes. We can revisit this if it becomes
// an efficiency or overhead problem.

for (const BasicBlock &I : *CalledFunc)
for (const Instruction &J : I) {
if (const MDNode *M = J.getMetadata(LLVMContext::MD_alias_scope))
MD.insert(M);
if (const MDNode *M = J.getMetadata(LLVMContext::MD_noalias))
MD.insert(M);
}

if (MD.empty())
return;

// Walk the existing metadata, adding the complete (perhaps cyclic) chain to
// the set.
SmallVector<const Metadata *, 16> Queue(MD.begin(), MD.end());
while (!Queue.empty()) {
const MDNode *M = cast<MDNode>(Queue.pop_back_val());
for (unsigned i = 0, ie = M->getNumOperands(); i != ie; ++i)
if (const MDNode *M1 = dyn_cast<MDNode>(M->getOperand(i)))
if (MD.insert(M1))
Queue.push_back(M1);
}

// Now we have a complete set of all metadata in the chains used to specify
// the noalias scopes and the lists of those scopes.
SmallVector<TempMDTuple, 16> DummyNodes;
DenseMap<const MDNode *, TrackingMDNodeRef> MDMap;
for (const MDNode *I : MD) {
DummyNodes.push_back(MDTuple::getTemporary(CalledFunc->getContext(), None));
MDMap[I].reset(DummyNodes.back().get());
}

// Create new metadata nodes to replace the dummy nodes, replacing old
// metadata references with either a dummy node or an already-created new
// node.
for (const MDNode *I : MD) {
SmallVector<Metadata *, 4> NewOps;
for (unsigned i = 0, ie = I->getNumOperands(); i != ie; ++i) {
const Metadata *V = I->getOperand(i);
if (const MDNode *M = dyn_cast<MDNode>(V))
NewOps.push_back(MDMap[M]);
else
NewOps.push_back(const_cast<Metadata *>(V));
}

MDNode *NewM = MDNode::get(CalledFunc->getContext(), NewOps);
MDTuple *TempM = cast<MDTuple>(MDMap[I]);
assert(TempM->isTemporary() && "Expected temporary node");

TempM->replaceAllUsesWith(NewM);
}

// Now replace the metadata in the new inlined instructions with the
// repacements from the map.
for (ValueToValueMapTy::iterator VMI = VMap.begin(), VMIE = VMap.end();
VMI != VMIE; ++VMI) {
// Check that key is an instruction, to skip the Argument mapping, which
// points to an instruction in the original function, not the inlined one.
if (!VMI->second \|\| !isa<Instruction>(VMI->first))
continue;

Instruction *NI = dyn_cast<Instruction>(VMI->second);
if (!NI)
continue;

if (MDNode *M = NI->getMetadata(LLVMContext::MD_alias_scope))
NI->setMetadata(LLVMContext::MD_alias_scope, MDMap[M]);

if (MDNode *M = NI->getMetadata(LLVMContext::MD_noalias))
NI->setMetadata(LLVMContext::MD_noalias, MDMap[M]);
}
}

/// If the inlined function has noalias arguments,		/// If the inlined function has noalias arguments,
		jeroen.dobbelaereAuthorUnsubmitted Done Reply Inline Actions @nikic In the full restrict patches, we also check if the instruction was already handled. I was able to trigger this with an assertion and I have a more or less reduced testcase. Either we keep a SmallPtrSet and check if the instruction was already handled (this is what the full restrict version does; See D68509 InlineFunction.cpp#969). Or we only replace the metadata if it is in the MDMap (by using MDMap.lookup(M). Any preference ? jeroen.dobbelaere: @nikic In the full restrict patches, we also check if the instruction was already handled. I…
		nikicUnsubmitted Not Done Reply Inline Actions I don't understand under what circumstances we'd handle an instruction twice. nikic: I don't understand under what circumstances we'd handle an instruction twice.
/// then add new alias scopes for each noalias argument, tag the mapped noalias		/// then add new alias scopes for each noalias argument, tag the mapped noalias
/// parameters with noalias metadata specifying the new scope, and tag all		/// parameters with noalias metadata specifying the new scope, and tag all
		nikicUnsubmitted Done Reply Inline Actions The mention of unrolling here no longer makes sense. nikic: The mention of unrolling here no longer makes sense.
/// non-derived loads, stores and memory intrinsics with the new alias scopes.		/// non-derived loads, stores and memory intrinsics with the new alias scopes.
		nikicUnsubmitted Not Done Reply Inline Actions I don't think it makes sense to separately track MDV and MDVMap. I would create the new MetadataAsValue directly in remap: if (auto II = dyn_cast<IntrinsicInst>(I)) { if (II->getIntrinsicID() == Intrinsic::experimental_noalias_scope_decl) { auto MV = cast<MetadataAsValue>( II->getOperand(Intrinsic::NoAliasScopeDeclScopeArg))); auto NewMV = MetadataAsValue::get( I->getContext(), MDMap[cast<MDNode>(MV->getMetadata())]); II->setOperand(Intrinsic::NoAliasScopeDeclScopeArg, NewMV); } } Or so. nikic:* I don't think it makes sense to separately track MDV and MDVMap. I would create the new…
static void AddAliasScopeMetadata(CallBase &CB, ValueToValueMapTy &VMap,		static void AddAliasScopeMetadata(CallBase &CB, ValueToValueMapTy &VMap,
const DataLayout &DL, AAResults *CalleeAAR) {		const DataLayout &DL, AAResults *CalleeAAR) {
		nikicUnsubmitted Not Done Reply Inline Actions These can be dropped (inlining performs only one clone). nikic: These can be dropped (inlining performs only one clone).
if (!EnableNoAliasConversion)		if (!EnableNoAliasConversion)
		nikicUnsubmitted Done Reply Inline Actions While this collects the metadata on the experimental.noalias.scope.decl, I'm not seeing any code that would actually update it on the cloned instructions. remap() only checks the noalias and alias.scope metadata, no? Looking through the tests, I think we're missing a test case that inlines two levels, so that the first creates the intrinsic, and the second has to properly rename its operand. nikic: While this collects the metadata on the experimental.noalias.scope.decl, I'm not seeing any…
		jeroen.dobbelaereAuthorUnsubmitted Done Reply Inline Actions Good catch, it indeed seems that I missed the remapping of the 'llvm.experimental.noalias.scope.decl' argument in this version :( More tests is always better. I'll add one for this, and also one for the caller == callee case. jeroen.dobbelaere: Good catch, it indeed seems that I missed the remapping of the 'llvm.experimental.noalias.scope.
return;		return;

const Function *CalledFunc = CB.getCalledFunction();		const Function *CalledFunc = CB.getCalledFunction();
SmallVector<const Argument *, 4> NoAliasArgs;		SmallVector<const Argument *, 4> NoAliasArgs;

for (const Argument &Arg : CalledFunc->args())		for (const Argument &Arg : CalledFunc->args())
if (CB.paramHasAttr(Arg.getArgNo(), Attribute::NoAlias) && !Arg.use_empty())		if (CB.paramHasAttr(Arg.getArgNo(), Attribute::NoAlias) && !Arg.use_empty())
NoAliasArgs.push_back(&Arg);		NoAliasArgs.push_back(&Arg);
Show All 30 Lines	if (A->hasName()) {
Name += utostr(i);		Name += utostr(i);
}		}

// Note: We always create a new anonymous root here. This is true regardless		// Note: We always create a new anonymous root here. This is true regardless
// of the linkage of the callee because the aliasing "scope" is not just a		// of the linkage of the callee because the aliasing "scope" is not just a
// property of the callee, but also all control dependencies in the caller.		// property of the callee, but also all control dependencies in the caller.
MDNode *NewScope = MDB.createAnonymousAliasScope(NewDomain, Name);		MDNode *NewScope = MDB.createAnonymousAliasScope(NewDomain, Name);
NewScopes.insert(std::make_pair(A, NewScope));		NewScopes.insert(std::make_pair(A, NewScope));

		if (UseNoAliasIntrinsic) {
		// Introduce a llvm.noalias.decl for the noalias argument.
		MDNode *AScopeList = MDNode::get(CalledFunc->getContext(), NewScope);

		// The alloca was optimized away -> use a nullptr
		Value *MappedA = VMap[A];
		auto *IdentifyPAlloca =
		ConstantPointerNull::get(MappedA->getType()->getPointerTo());
		auto *NoAliasDecl = IRBuilder<>(&CB).CreateNoAliasDeclaration(
		IdentifyPAlloca, AScopeList);
		// Ignore the result for now. The result will be used when the
		// llvm.noalias intrinsic is introduced.
		(void)NoAliasDecl;
		}
}		}

// Iterate over all new instructions in the map; for all memory-access		// Iterate over all new instructions in the map; for all memory-access
// instructions, add the alias scope metadata.		// instructions, add the alias scope metadata.
for (ValueToValueMapTy::iterator VMI = VMap.begin(), VMIE = VMap.end();		for (ValueToValueMapTy::iterator VMI = VMap.begin(), VMIE = VMap.end();
VMI != VMIE; ++VMI) {		VMI != VMIE; ++VMI) {
if (const Instruction *I = dyn_cast<Instruction>(VMI->first)) {		if (const Instruction *I = dyn_cast<Instruction>(VMI->first)) {
if (!VMI->second)		if (!VMI->second)
▲ Show 20 Lines • Show All 782 Lines • ▼ Show 20 Lines	llvm::InlineResult llvm::InlineFunction(CallBase &CB, InlineFunctionInfo &IFI,
ClonedCodeInfo InlinedFunctionInfo;		ClonedCodeInfo InlinedFunctionInfo;
Function::iterator FirstNewBlock;		Function::iterator FirstNewBlock;

{ // Scope to destroy VMap after cloning.		{ // Scope to destroy VMap after cloning.
ValueToValueMapTy VMap;		ValueToValueMapTy VMap;
// Keep a list of pair (dst, src) to emit byval initializations.		// Keep a list of pair (dst, src) to emit byval initializations.
SmallVector<std::pair<Value, Value>, 4> ByValInit;		SmallVector<std::pair<Value, Value>, 4> ByValInit;

		// When inlining a function that contains noalias scope metadata,
		// this metadata needs to be cloned so that the inlined blocks
		// have different "unique scopes" at every call site.
		nikicUnsubmitted Not Done Reply Inline Actions The above part of the comment is repeated below. You might want to drop it in one place. nikic: The above part of the comment is repeated below. You might want to drop it in one place.
		// Track the metadata that must be cloned. Do this before other changes to
		// the function, so that we do not get in trouble when inlining caller ==
		// callee.
		ScopedAliasMetadataCloner SAMetadataCloner(CB.getCalledFunction());

auto &DL = Caller->getParent()->getDataLayout();		auto &DL = Caller->getParent()->getDataLayout();

// Calculate the vector of arguments to pass into the function cloner, which		// Calculate the vector of arguments to pass into the function cloner, which
// matches up the formal to the actual argument values.		// matches up the formal to the actual argument values.
auto AI = CB.arg_begin();		auto AI = CB.arg_begin();
unsigned ArgNo = 0;		unsigned ArgNo = 0;
for (Function::arg_iterator I = CalledFunc->arg_begin(),		for (Function::arg_iterator I = CalledFunc->arg_begin(),
E = CalledFunc->arg_end(); I != E; ++I, ++AI, ++ArgNo) {		E = CalledFunc->arg_end(); I != E; ++I, ++AI, ++ArgNo) {
▲ Show 20 Lines • Show All 104 Lines • ▼ Show 20 Lines	if (IFI.CG)
UpdateCallGraphAfterInlining(CB, FirstNewBlock, VMap, IFI);		UpdateCallGraphAfterInlining(CB, FirstNewBlock, VMap, IFI);

// For 'nodebug' functions, the associated DISubprogram is always null.		// For 'nodebug' functions, the associated DISubprogram is always null.
// Conservatively avoid propagating the callsite debug location to		// Conservatively avoid propagating the callsite debug location to
// instructions inlined from a function whose DISubprogram is not null.		// instructions inlined from a function whose DISubprogram is not null.
fixupLineNumbers(Caller, FirstNewBlock, &CB,		fixupLineNumbers(Caller, FirstNewBlock, &CB,
CalledFunc->getSubprogram() != nullptr);		CalledFunc->getSubprogram() != nullptr);

// Clone existing noalias metadata if necessary.		// When inlining a function that contains noalias scope metadata,
CloneAliasScopeMetadata(CB, VMap);		// this metadata needs to be cloned so that the inlined blocks
		// have different "unique scopes" at every call site. Were this not done,
		// then aliasing scopes from a function inlined into a caller multiple times
		// could not be differentiated (and this would lead to miscompiles because
		// the non-aliasing property communicated by the metadata could have
		// call-site-specific control dependencies).
		SAMetadataCloner.clone();
		SAMetadataCloner.remap(VMap);

// Add noalias metadata if necessary.		// Add noalias metadata if necessary.
AddAliasScopeMetadata(CB, VMap, DL, CalleeAAR);		AddAliasScopeMetadata(CB, VMap, DL, CalleeAAR);

// Clone return attributes on the callsite into the calls within the inlined		// Clone return attributes on the callsite into the calls within the inlined
// function which feed into its return value.		// function which feed into its return value.
AddReturnAttributes(CB, VMap);		AddReturnAttributes(CB, VMap);

▲ Show 20 Lines • Show All 603 Lines • Show Last 20 Lines

llvm/test/Transforms/Coroutines/ArgAddr.ll

	Show First 20 Lines • Show All 48 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: %dec1.spill.addr.i = getelementptr inbounds i8, i8* %call.i, i64 20			; CHECK-NEXT: %dec1.spill.addr.i = getelementptr inbounds i8, i8* %call.i, i64 20
	; CHECK-NEXT: bitcast i8* %dec1.spill.addr.i to i32*			; CHECK-NEXT: bitcast i8* %dec1.spill.addr.i to i32*
	; CHECK-NEXT: store i32 4			; CHECK-NEXT: store i32 4
	; CHECK-NEXT: call void @print(i32 4)			; CHECK-NEXT: call void @print(i32 4)
	; CHECK-NEXT: %index.addr12.i = getelementptr inbounds i8, i8* %call.i, i64 24			; CHECK-NEXT: %index.addr12.i = getelementptr inbounds i8, i8* %call.i, i64 24
	; CHECK-NEXT: bitcast i8* %index.addr12.i to i1*			; CHECK-NEXT: bitcast i8* %index.addr12.i to i1*
	; CHECK-NEXT: store i1 false			; CHECK-NEXT: store i1 false
	; CHECK-NEXT: store i32 3			; CHECK-NEXT: store i32 3
				; CHECK-NEXT: call i8* @llvm.noalias.decl
	; CHECK-NEXT: store i32 3			; CHECK-NEXT: store i32 3
	; CHECK-NEXT: call void @print(i32 3)			; CHECK-NEXT: call void @print(i32 3)
	; CHECK-NEXT: store i1 false			; CHECK-NEXT: store i1 false
	; CHECK-NEXT: store i32 2			; CHECK-NEXT: store i32 2
				; CHECK-NEXT: call i8* @llvm.noalias.decl
	; CHECK-NEXT: store i32 2			; CHECK-NEXT: store i32 2
	; CHECK-NEXT: call void @print(i32 2)			; CHECK-NEXT: call void @print(i32 2)
	; CHECK: ret i32 0			; CHECK: ret i32 0
	}			}

	declare i8* @malloc(i32)			declare i8* @malloc(i32)
	declare void @free(i8*)			declare void @free(i8*)
	declare void @print(i32)			declare void @print(i32)
	Show All 11 Lines

llvm/test/Transforms/Coroutines/coro-retcon-resume-values.ll

	; RUN: opt < %s -enable-coroutines -O2 -S \| FileCheck %s			; RUN: opt < %s -enable-coroutines -O2 -S \| FileCheck %s --check-prefixes=CHECK,OPM
	; RUN: opt < %s -enable-coroutines -passes='default<O2>' -S \| FileCheck %s			; RUN: opt < %s -enable-coroutines -passes='default<O2>' -S \| FileCheck %s --check-prefixes=CHECK,NPM
				nikicUnsubmitted Done Reply Inline Actions You need to add `-aa-pipeline=default` to avoid the NPM regression. nikic: You need to add `-aa-pipeline=default` to avoid the NPM regression.
				jeroen.dobbelaereAuthorUnsubmitted Done Reply Inline Actions That seems to do the trick. jeroen.dobbelaere: That seems to do the trick.
				nikicUnsubmitted Not Done Reply Inline Actions You can drop the `--check-prefixes` now. nikic: You can drop the `--check-prefixes` now.

	define i8* @f(i8* %buffer, i32 %n) {			define i8* @f(i8* %buffer, i32 %n) {
	entry:			entry:
	%id = call token @llvm.coro.id.retcon(i32 8, i32 4, i8* %buffer, i8* bitcast (i8* (i8, i32, i1) @prototype to i8), i8 bitcast (i8* (i32)* @allocate to i8), i8 bitcast (void (i8) @deallocate to i8*))			%id = call token @llvm.coro.id.retcon(i32 8, i32 4, i8* %buffer, i8* bitcast (i8* (i8, i32, i1) @prototype to i8), i8 bitcast (i8* (i32)* @allocate to i8), i8 bitcast (void (i8) @deallocate to i8*))
	%hdl = call i8* @llvm.coro.begin(token %id, i8* null)			%hdl = call i8* @llvm.coro.begin(token %id, i8* null)
	br label %loop			br label %loop

	loop:			loop:
	▲ Show 20 Lines • Show All 51 Lines • ▼ Show 20 Lines
	}			}

	; Unfortunately, we don't seem to fully optimize this right now due			; Unfortunately, we don't seem to fully optimize this right now due
	; to some sort of phase-ordering thing.			; to some sort of phase-ordering thing.
	; CHECK-LABEL: define i32 @main			; CHECK-LABEL: define i32 @main
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK: [[BUFFER:%.*]] = alloca [8 x i8], align 4			; CHECK: [[BUFFER:%.*]] = alloca [8 x i8], align 4
	; CHECK: [[SLOT:%.]] = bitcast [8 x i8] [[BUFFER]] to i32*			; CHECK: [[SLOT:%.]] = bitcast [8 x i8] [[BUFFER]] to i32*
	; CHECK-NEXT: store i32 7, i32* [[SLOT]], align 4
	; CHECK-NEXT: call void @print(i32 7)			; OPM-NEXT: call i8* @llvm.noalias.decl
				; OPM-NEXT: call i8* @llvm.noalias.decl
				; OPM-NEXT: store i32 7, i32* [[SLOT]], align 4
				; OPM-NEXT: call i8* @llvm.noalias.decl
				; OPM-NEXT: call void @print(i32 7)

				; FIXME: After introduction of llvm.noalias.decl, this is not fully optimzed
				; with the new pass manager.
				; NPM-NEXT: store i32 1, i32* [[SLOT]], align 4
				; NPM-NEXT: call i8* @llvm.noalias.decl.p0i8.p0p0i8.i64
				; NPM-NEXT: [[RELOAD0:%.+]] = load i32, i32* [[SLOT]], align 4
				; NPM-NEXT: [[SUM0:%.+]] = add i32 [[RELOAD0]], 2
				; NPM-NEXT: store i32 [[SUM0]], i32* [[SLOT]], align 4
				; NPM-NEXT: call i8* @llvm.noalias.decl.p0i8.p0p0i8.i64
				; NPM-NEXT: [[RELOAD1:%.+]] = load i32, i32* [[SLOT]], align 4
				; NPM-NEXT: [[SUM2:%.+]] = add i32 [[RELOAD1]], 4
				; NPM-NEXT: store i32 [[SUM2]], i32* [[SLOT]], align 4
				; NPM-NEXT: call i8* @llvm.noalias.decl.p0i8.p0p0i8.i64
				; NPM-NEXT: [[RELOAD2:%.+]] = load i32, i32* [[SLOT]], align 4
				; NPM-NEXT: call void @print(i32 [[RELOAD2]])

	; CHECK-NEXT: ret i32 0			; CHECK-NEXT: ret i32 0

	declare token @llvm.coro.id.retcon(i32, i32, i8, i8, i8, i8)			declare token @llvm.coro.id.retcon(i32, i32, i8, i8, i8, i8)
	declare i8* @llvm.coro.begin(token, i8*)			declare i8* @llvm.coro.begin(token, i8*)
	declare { i32, i1 } @llvm.coro.suspend.retcon.sl_i32i1s(...)			declare { i32, i1 } @llvm.coro.suspend.retcon.sl_i32i1s(...)
	declare i1 @llvm.coro.end(i8*, i1)			declare i1 @llvm.coro.end(i8*, i1)
	declare i8* @llvm.coro.prepare.retcon(i8*)			declare i8* @llvm.coro.prepare.retcon(i8*)

	declare i8* @prototype(i8*, i32, i1 zeroext)			declare i8* @prototype(i8*, i32, i1 zeroext)

	declare noalias i8* @allocate(i32 %size)			declare noalias i8* @allocate(i32 %size)
	declare void @deallocate(i8* %ptr)			declare void @deallocate(i8* %ptr)

	declare void @print(i32)			declare void @print(i32)

llvm/test/Transforms/Coroutines/coro-retcon-value.ll

	Show First 20 Lines • Show All 73 Lines • ▼ Show 20 Lines
	; Unfortunately, we don't seem to fully optimize this right now due			; Unfortunately, we don't seem to fully optimize this right now due
	; to some sort of phase-ordering thing.			; to some sort of phase-ordering thing.
	; CHECK-LABEL: define i32 @main			; CHECK-LABEL: define i32 @main
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK: [[BUFFER:%.*]] = alloca [8 x i8], align 4			; CHECK: [[BUFFER:%.*]] = alloca [8 x i8], align 4
	; CHECK: [[SLOT:%.]] = bitcast [8 x i8] [[BUFFER]] to i32*			; CHECK: [[SLOT:%.]] = bitcast [8 x i8] [[BUFFER]] to i32*
	; CHECK-NEXT: store i32 4, i32* [[SLOT]], align 4			; CHECK-NEXT: store i32 4, i32* [[SLOT]], align 4
	; CHECK-NEXT: call void @print(i32 4)			; CHECK-NEXT: call void @print(i32 4)
				; CHECK-NEXT: call i8* @llvm.noalias.decl
	; CHECK-NEXT: [[LOAD:%.]] = load i32, i32 [[SLOT]], align 4			; CHECK-NEXT: [[LOAD:%.]] = load i32, i32 [[SLOT]], align 4
	; CHECK-NEXT: [[INC:%.*]] = add i32 [[LOAD]], 1			; CHECK-NEXT: [[INC:%.*]] = add i32 [[LOAD]], 1
	; CHECK-NEXT: store i32 [[INC]], i32* [[SLOT]], align 4			; CHECK-NEXT: store i32 [[INC]], i32* [[SLOT]], align 4
	; CHECK-NEXT: call void @print(i32 [[INC]])			; CHECK-NEXT: call void @print(i32 [[INC]])
				; CHECK-NEXT: call i8* @llvm.noalias.decl
	; CHECK-NEXT: [[LOAD:%.]] = load i32, i32 [[SLOT]], align 4			; CHECK-NEXT: [[LOAD:%.]] = load i32, i32 [[SLOT]], align 4
	; CHECK-NEXT: [[INC:%.*]] = add i32 [[LOAD]], 1			; CHECK-NEXT: [[INC:%.*]] = add i32 [[LOAD]], 1
	; CHECK-NEXT: store i32 [[INC]], i32* [[SLOT]], align 4			; CHECK-NEXT: store i32 [[INC]], i32* [[SLOT]], align 4
	; CHECK-NEXT: call void @print(i32 [[INC]])			; CHECK-NEXT: call void @print(i32 [[INC]])
				; CHECK-NEXT: call i8* @llvm.noalias.decl
	; CHECK-NEXT: ret i32 0			; CHECK-NEXT: ret i32 0

	declare token @llvm.coro.id.retcon(i32, i32, i8, i8, i8, i8)			declare token @llvm.coro.id.retcon(i32, i32, i8, i8, i8, i8)
	declare i8* @llvm.coro.begin(token, i8*)			declare i8* @llvm.coro.begin(token, i8*)
	declare i8 @llvm.coro.suspend.retcon.i8(...)			declare i8 @llvm.coro.suspend.retcon.i8(...)
	declare i1 @llvm.coro.end(i8*, i1)			declare i1 @llvm.coro.end(i8*, i1)
	declare i8* @llvm.coro.prepare.retcon(i8*)			declare i8* @llvm.coro.prepare.retcon(i8*)

	declare {i8, i32} @prototype(i8, i8 zeroext)			declare {i8, i32} @prototype(i8, i8 zeroext)

	declare noalias i8* @allocate(i32 %size)			declare noalias i8* @allocate(i32 %size)
	declare void @deallocate(i8* %ptr)			declare void @deallocate(i8* %ptr)

	declare void @print(i32)			declare void @print(i32)

llvm/test/Transforms/Coroutines/coro-retcon.ll

	Show First 20 Lines • Show All 63 Lines • ▼ Show 20 Lines
	; Unfortunately, we don't seem to fully optimize this right now due			; Unfortunately, we don't seem to fully optimize this right now due
	; to some sort of phase-ordering thing.			; to some sort of phase-ordering thing.
	; CHECK-LABEL: define i32 @main			; CHECK-LABEL: define i32 @main
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK: [[BUFFER:%.*]] = alloca [8 x i8], align 4			; CHECK: [[BUFFER:%.*]] = alloca [8 x i8], align 4
	; CHECK: [[SLOT:%.]] = bitcast [8 x i8] [[BUFFER]] to i32*			; CHECK: [[SLOT:%.]] = bitcast [8 x i8] [[BUFFER]] to i32*
	; CHECK-NEXT: store i32 4, i32* [[SLOT]], align 4			; CHECK-NEXT: store i32 4, i32* [[SLOT]], align 4
	; CHECK-NEXT: call void @print(i32 4)			; CHECK-NEXT: call void @print(i32 4)
				; CHECK-NEXT: call i8* @llvm.noalias.decl
	; CHECK-NEXT: [[LOAD:%.]] = load i32, i32 [[SLOT]], align 4			; CHECK-NEXT: [[LOAD:%.]] = load i32, i32 [[SLOT]], align 4
	; CHECK-NEXT: [[INC:%.*]] = add i32 [[LOAD]], 1			; CHECK-NEXT: [[INC:%.*]] = add i32 [[LOAD]], 1
	; CHECK-NEXT: store i32 [[INC]], i32* [[SLOT]], align 4			; CHECK-NEXT: store i32 [[INC]], i32* [[SLOT]], align 4
	; CHECK-NEXT: call void @print(i32 [[INC]])			; CHECK-NEXT: call void @print(i32 [[INC]])
				; CHECK-NEXT: call i8* @llvm.noalias.decl
	; CHECK-NEXT: [[LOAD:%.]] = load i32, i32 [[SLOT]], align 4			; CHECK-NEXT: [[LOAD:%.]] = load i32, i32 [[SLOT]], align 4
	; CHECK-NEXT: [[INC:%.*]] = add i32 [[LOAD]], 1			; CHECK-NEXT: [[INC:%.*]] = add i32 [[LOAD]], 1
	; CHECK-NEXT: call void @print(i32 [[INC]])			; CHECK-NEXT: call void @print(i32 [[INC]])
				; CHECK-NEXT: call i8* @llvm.noalias.decl
	; CHECK-NEXT: ret i32 0			; CHECK-NEXT: ret i32 0

	define hidden { i8, i8 } @g(i8* %buffer, i16* %ptr) {			define hidden { i8, i8 } @g(i8* %buffer, i16* %ptr) {
	entry:			entry:
	%id = call token @llvm.coro.id.retcon(i32 8, i32 4, i8* %buffer, i8* bitcast ({ i8, i8 } (i8, i1) @g_prototype to i8), i8 bitcast (i8* (i32)* @allocate to i8), i8 bitcast (void (i8) @deallocate to i8*))			%id = call token @llvm.coro.id.retcon(i32 8, i32 4, i8* %buffer, i8* bitcast ({ i8, i8 } (i8, i1) @g_prototype to i8), i8 bitcast (i8* (i32)* @allocate to i8), i8 bitcast (void (i8) @deallocate to i8*))
	%hdl = call i8* @llvm.coro.begin(token %id, i8* null)			%hdl = call i8* @llvm.coro.begin(token %id, i8* null)
	br label %loop			br label %loop

	Show All 27 Lines

llvm/test/Transforms/Coroutines/ex2.ll

Show First 20 Lines • Show All 43 Lines • ▼ Show 20 Lines	entry:
br i1 %to, label %return, label %destroy		br i1 %to, label %return, label %destroy
destroy:		destroy:
call void @llvm.coro.destroy(i8* %hdl)		call void @llvm.coro.destroy(i8* %hdl)
br label %return		br label %return
return:		return:
ret i32 0		ret i32 0
; CHECK-NOT: call i8* @CustomAlloc		; CHECK-NOT: call i8* @CustomAlloc
; CHECK: call void @print(i32 4)		; CHECK: call void @print(i32 4)
		; CHECK-NEXT: call i8* @llvm.noalias.decl
; CHECK-NEXT: call void @print(i32 5)		; CHECK-NEXT: call void @print(i32 5)
		; CHECK-NEXT: call i8* @llvm.noalias.decl
; CHECK-NEXT: call void @print(i32 6)		; CHECK-NEXT: call void @print(i32 6)
; CHECK-NEXT: ret i32 0		; CHECK-NEXT: ret i32 0
}		}

declare i8* @CustomAlloc(i32)		declare i8* @CustomAlloc(i32)
declare void @CustomFree(i8*)		declare void @CustomFree(i8*)
declare void @print(i32)		declare void @print(i32)

Show All 10 Lines

llvm/test/Transforms/Coroutines/ex3.ll

Show First 20 Lines • Show All 47 Lines • ▼ Show 20 Lines	entry:
br i1 %to, label %return, label %destroy		br i1 %to, label %return, label %destroy
destroy:		destroy:
call void @llvm.coro.destroy(i8* %hdl)		call void @llvm.coro.destroy(i8* %hdl)
br label %return		br label %return
return:		return:
ret i32 0		ret i32 0
; CHECK-NOT: i8* @malloc		; CHECK-NOT: i8* @malloc
; CHECK: call void @print(i32 4)		; CHECK: call void @print(i32 4)
		; CHECK-NEXT: call i8* @llvm.noalias.decl
; CHECK-NEXT: call void @print(i32 -5)		; CHECK-NEXT: call void @print(i32 -5)
		; CHECK-NEXT: call i8* @llvm.noalias.decl
; CHECK-NEXT: call void @print(i32 5)		; CHECK-NEXT: call void @print(i32 5)
; CHECK: ret i32 0		; CHECK: ret i32 0
}		}

declare i8* @malloc(i32)		declare i8* @malloc(i32)
declare void @free(i8*)		declare void @free(i8*)
declare void @print(i32)		declare void @print(i32)

Show All 10 Lines

llvm/test/Transforms/Coroutines/ex4.ll

Show First 20 Lines • Show All 44 Lines • ▼ Show 20 Lines	entry:
%val1 = load i32, i32* %promise.addr		%val1 = load i32, i32* %promise.addr
call void @print(i32 %val1)		call void @print(i32 %val1)
call void @llvm.coro.resume(i8* %hdl)		call void @llvm.coro.resume(i8* %hdl)
%val2 = load i32, i32* %promise.addr		%val2 = load i32, i32* %promise.addr
call void @print(i32 %val2)		call void @print(i32 %val2)
call void @llvm.coro.destroy(i8* %hdl)		call void @llvm.coro.destroy(i8* %hdl)
ret i32 0		ret i32 0
; CHECK: call void @print(i32 4)		; CHECK: call void @print(i32 4)
		; CHECK-NEXT: call i8* @llvm.noalias.decl
; CHECK-NEXT: call void @print(i32 5)		; CHECK-NEXT: call void @print(i32 5)
		; CHECK-NEXT: call i8* @llvm.noalias.decl
; CHECK-NEXT: call void @print(i32 6)		; CHECK-NEXT: call void @print(i32 6)
; CHECK: ret i32 0		; CHECK: ret i32 0
}		}

declare i8* @llvm.coro.promise(i8*, i32, i1)		declare i8* @llvm.coro.promise(i8*, i32, i1)
declare i8* @malloc(i32)		declare i8* @malloc(i32)
declare void @free(i8*)		declare void @free(i8*)
declare void @print(i32)		declare void @print(i32)
Show All 11 Lines

llvm/test/Transforms/Inline/launder.invariant.group.ll

Show All 17 Lines	; CHECK-NOT: noalias
%6 = getelementptr inbounds %struct.A, %struct.A* %0, i64 0, i32 1		%6 = getelementptr inbounds %struct.A, %struct.A* %0, i64 0, i32 1
%7 = load i32, i32* %6, align 8		%7 = load i32, i32* %6, align 8
ret i32 %7		ret i32 %7
}		}

; CHECK-LABEL: define i32 @foo(%struct.A* noalias		; CHECK-LABEL: define i32 @foo(%struct.A* noalias
define i32 @foo(%struct.A* noalias) {		define i32 @foo(%struct.A* noalias) {
; CHECK-NOT: call i32 @bar(		; CHECK-NOT: call i32 @bar(
; CHECK-NOT: noalias		; CHECK-NOT: !noalias
%2 = tail call i32 @bar(%struct.A* %0)		%2 = tail call i32 @bar(%struct.A* %0)
ret i32 %2		ret i32 %2
}		}


; This test checks if invariant group intrinsics have zero cost for inlining.		; This test checks if invariant group intrinsics have zero cost for inlining.
; CHECK-LABEL: define i8* @caller(i8*		; CHECK-LABEL: define i8* @caller(i8*
define i8* @caller(i8* %p) {		define i8* @caller(i8* %p) {
Show All 25 Lines

llvm/test/Transforms/Inline/noalias-calls-always.ll

				; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt -aa-pipeline=basic-aa -passes=always-inline -enable-noalias-to-md-conversion -S < %s \| FileCheck %s			; RUN: opt -aa-pipeline=basic-aa -passes=always-inline -enable-noalias-to-md-conversion -S < %s \| FileCheck %s
	target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"			target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	declare void @llvm.memcpy.p0i8.p0i8.i64(i8* nocapture, i8* nocapture readonly, i64, i1) #0			declare void @llvm.memcpy.p0i8.p0i8.i64(i8* nocapture, i8* nocapture readonly, i64, i1) #0
	declare void @hey() #0			declare void @hey() #0

	define void @hello(i8* noalias nocapture %a, i8* noalias nocapture readonly %c, i8* nocapture %b) #1 {			define void @hello(i8* noalias nocapture %a, i8* noalias nocapture readonly %c, i8* nocapture %b) #1 {
				; CHECK-LABEL: @hello(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: [[L:%.*]] = alloca i8, i32 512, align 1
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[A:%.]], i8 align 16 [[B:%.*]], i64 16, i1 false)
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[B]], i8* align 16 [[C:%.*]], i64 16, i1 false)
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[A]], i8* align 16 [[C]], i64 16, i1 false)
				; CHECK-NEXT: call void @hey()
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[L]], i8* align 16 [[C]], i64 16, i1 false)
				; CHECK-NEXT: ret void
				;
	entry:			entry:
	%l = alloca i8, i32 512, align 1			%l = alloca i8, i32 512, align 1
	call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %a, i8* align 16 %b, i64 16, i1 0)			call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %a, i8* align 16 %b, i64 16, i1 0)
	call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %b, i8* align 16 %c, i64 16, i1 0)			call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %b, i8* align 16 %c, i64 16, i1 0)
	call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %a, i8* align 16 %c, i64 16, i1 0)			call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %a, i8* align 16 %c, i64 16, i1 0)
	call void @hey()			call void @hey()
	call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %l, i8* align 16 %c, i64 16, i1 0)			call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %l, i8* align 16 %c, i64 16, i1 0)
	ret void			ret void
	}			}

	define void @foo(i8* nocapture %a, i8* nocapture readonly %c, i8* nocapture %b) #2 {			define void @foo(i8* nocapture %a, i8* nocapture readonly %c, i8* nocapture %b) #2 {
				; CHECK-LABEL: @foo(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: [[L_I:%.*]] = alloca i8, i32 512, align 1
				; CHECK-NEXT: [[TMP0:%.]] = call i8 @llvm.noalias.decl.p0i8.p0p0i8.i64(i8** null, i64 0, [[META0:metadata !.*]])
				; CHECK-NEXT: [[TMP1:%.]] = call i8 @llvm.noalias.decl.p0i8.p0p0i8.i64(i8** null, i64 0, [[META3:metadata !.*]])
				; CHECK-NEXT: call void @llvm.lifetime.start.p0i8(i64 512, i8* [[L_I]])
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[A:%.]], i8 align 16 [[B:%.]], i64 16, i1 false) [[ATTR4:#.]], !noalias !3
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[B]], i8* align 16 [[C:%.*]], i64 16, i1 false) [[ATTR4]], !noalias !0
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[A]], i8* align 16 [[C]], i64 16, i1 false) [[ATTR4]], !alias.scope !5
				; CHECK-NEXT: call void @hey() [[ATTR4]], !noalias !5
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[L_I]], i8* align 16 [[C]], i64 16, i1 false) [[ATTR4]], !noalias !0
				; CHECK-NEXT: call void @llvm.lifetime.end.p0i8(i64 512, i8* [[L_I]])
				; CHECK-NEXT: ret void
				;
	entry:			entry:
	tail call void @hello(i8* %a, i8* %c, i8* %b)			tail call void @hello(i8* %a, i8* %c, i8* %b)
	ret void			ret void
	}			}

	define void @hello_cs(i8* nocapture %a, i8* nocapture readonly %c, i8* nocapture %b) #1 {			define void @hello_cs(i8* nocapture %a, i8* nocapture readonly %c, i8* nocapture %b) #1 {
				; CHECK-LABEL: @hello_cs(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: [[L:%.*]] = alloca i8, i32 512, align 1
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[A:%.]], i8 align 16 [[B:%.*]], i64 16, i1 false)
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[B]], i8* align 16 [[C:%.*]], i64 16, i1 false)
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[A]], i8* align 16 [[C]], i64 16, i1 false)
				; CHECK-NEXT: call void @hey()
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[L]], i8* align 16 [[C]], i64 16, i1 false)
				; CHECK-NEXT: ret void
				;
	entry:			entry:
	%l = alloca i8, i32 512, align 1			%l = alloca i8, i32 512, align 1
	call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %a, i8* align 16 %b, i64 16, i1 0)			call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %a, i8* align 16 %b, i64 16, i1 0)
	call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %b, i8* align 16 %c, i64 16, i1 0)			call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %b, i8* align 16 %c, i64 16, i1 0)
	call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %a, i8* align 16 %c, i64 16, i1 0)			call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %a, i8* align 16 %c, i64 16, i1 0)
	call void @hey()			call void @hey()
	call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %l, i8* align 16 %c, i64 16, i1 0)			call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %l, i8* align 16 %c, i64 16, i1 0)
	ret void			ret void
	}			}

	define void @foo_cs(i8* nocapture %a, i8* nocapture readonly %c, i8* nocapture %b) #2 {			define void @foo_cs(i8* nocapture %a, i8* nocapture readonly %c, i8* nocapture %b) #2 {
				; CHECK-LABEL: @foo_cs(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: [[L_I:%.*]] = alloca i8, i32 512, align 1
				; CHECK-NEXT: [[TMP0:%.]] = call i8 @llvm.noalias.decl.p0i8.p0p0i8.i64(i8** null, i64 0, [[META6:metadata !.*]])
				; CHECK-NEXT: [[TMP1:%.]] = call i8 @llvm.noalias.decl.p0i8.p0p0i8.i64(i8** null, i64 0, [[META9:metadata !.*]])
				; CHECK-NEXT: call void @llvm.lifetime.start.p0i8(i64 512, i8* [[L_I]])
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[A:%.]], i8 align 16 [[B:%.*]], i64 16, i1 false) [[ATTR4]], !noalias !9
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[B]], i8* align 16 [[C:%.*]], i64 16, i1 false) [[ATTR4]], !noalias !6
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[A]], i8* align 16 [[C]], i64 16, i1 false) [[ATTR4]], !alias.scope !11
				; CHECK-NEXT: call void @hey() [[ATTR4]], !noalias !11
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[L_I]], i8* align 16 [[C]], i64 16, i1 false) [[ATTR4]], !noalias !6
				; CHECK-NEXT: call void @llvm.lifetime.end.p0i8(i64 512, i8* [[L_I]])
				; CHECK-NEXT: ret void
				;
	entry:			entry:
	tail call void @hello_cs(i8* noalias %a, i8* noalias %c, i8* %b)			tail call void @hello_cs(i8* noalias %a, i8* noalias %c, i8* %b)
	ret void			ret void
	}			}

	; CHECK: define void @foo(i8* nocapture %a, i8* nocapture readonly %c, i8* nocapture %b)
	; CHECK: entry:
	; CHECK: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %a, i8* align 16 %b, i64 16, i1 false) #4, !noalias !0
	; CHECK: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %b, i8* align 16 %c, i64 16, i1 false) #4, !noalias !3
	; CHECK: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %a, i8* align 16 %c, i64 16, i1 false) #4, !alias.scope !5
	; CHECK: call void @hey() #4, !noalias !5
	; CHECK: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %{{.}}, i8 align 16 %c, i64 16, i1 false) #4, !noalias !3
	; CHECK: ret void
	; CHECK: }

	; CHECK: define void @foo_cs(i8* nocapture %a, i8* nocapture readonly %c, i8* nocapture %b)
	; CHECK: entry:
	; CHECK: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %a, i8* align 16 %b, i64 16, i1 false) #4, !noalias !6
	; CHECK: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %b, i8* align 16 %c, i64 16, i1 false) #4, !noalias !9
	; CHECK: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %a, i8* align 16 %c, i64 16, i1 false) #4, !alias.scope !11
	; CHECK: call void @hey() #4, !noalias !11
	; CHECK: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %{{.}}, i8 align 16 %c, i64 16, i1 false) #4, !noalias !9
	; CHECK: ret void
	; CHECK: }

	attributes #0 = { nounwind argmemonly willreturn }			attributes #0 = { nounwind argmemonly willreturn }
	attributes #1 = { nounwind alwaysinline }			attributes #1 = { nounwind alwaysinline }
	attributes #2 = { nounwind uwtable }			attributes #2 = { nounwind uwtable }

	; CHECK: !0 = !{!1}			; CHECK: !0 = !{!1}
	; CHECK: !1 = distinct !{!1, !2, !"hello: %c"}			; CHECK: !1 = distinct !{!1, !2, !"hello: %a"}
	; CHECK: !2 = distinct !{!2, !"hello"}			; CHECK: !2 = distinct !{!2, !"hello"}
	; CHECK: !3 = !{!4}			; CHECK: !3 = !{!4}
	; CHECK: !4 = distinct !{!4, !2, !"hello: %a"}			; CHECK: !4 = distinct !{!4, !2, !"hello: %c"}
	; CHECK: !5 = !{!4, !1}			; CHECK: !5 = !{!1, !4}

	; CHECK: !6 = !{!7}			; CHECK: !6 = !{!7}
	; CHECK: !7 = distinct !{!7, !8, !"hello_cs: %c"}			; CHECK: !7 = distinct !{!7, !8, !"hello_cs: %a"}
	; CHECK: !8 = distinct !{!8, !"hello_cs"}			; CHECK: !8 = distinct !{!8, !"hello_cs"}
	; CHECK: !9 = !{!10}			; CHECK: !9 = !{!10}
	; CHECK: !10 = distinct !{!10, !8, !"hello_cs: %a"}			; CHECK: !10 = distinct !{!10, !8, !"hello_cs: %c"}
	; CHECK: !11 = !{!10, !7}			; CHECK: !11 = !{!7, !10}

llvm/test/Transforms/Inline/noalias-calls.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py UTC_ARGS: --function-signature			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py UTC_ARGS: --function-signature
	; RUN: opt -basic-aa -inline -enable-noalias-to-md-conversion -S < %s \| FileCheck %s			; RUN: opt -basic-aa -inline -enable-noalias-to-md-conversion -S < %s \| FileCheck %s
	; RUN: opt -aa-pipeline=basic-aa -passes=inline -enable-noalias-to-md-conversion -S < %s \| FileCheck %s			; RUN: opt -aa-pipeline=basic-aa -passes=inline -enable-noalias-to-md-conversion -S < %s \| FileCheck %s
	target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"			target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	declare void @llvm.memcpy.p0i8.p0i8.i64(i8* nocapture, i8* nocapture readonly, i64, i1) #0			declare void @llvm.memcpy.p0i8.p0i8.i64(i8* nocapture, i8* nocapture readonly, i64, i1) #0
	declare void @hey() #0			declare void @hey() #0

	define void @hello(i8* noalias nocapture %a, i8* noalias nocapture readonly %c, i8* nocapture %b) #1 {			define void @hello(i8* noalias nocapture %a, i8* noalias nocapture readonly %c, i8* nocapture %b) #1 {
				; CHECK-LABEL: define {{[^@]+}}@hello
				; CHECK-SAME: (i8* noalias nocapture [[A:%.]], i8 noalias nocapture readonly [[C:%.]], i8 nocapture [[B:%.]]) [[ATTR1:#.]] {
				; CHECK-NEXT: entry:
				; CHECK-NEXT: [[L:%.*]] = alloca i8, i32 512, align 1
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[A]], i8* align 16 [[B]], i64 16, i1 false)
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[B]], i8* align 16 [[C]], i64 16, i1 false)
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[A]], i8* align 16 [[C]], i64 16, i1 false)
				; CHECK-NEXT: call void @hey()
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[L]], i8* align 16 [[C]], i64 16, i1 false)
				; CHECK-NEXT: ret void
				;
	entry:			entry:
	%l = alloca i8, i32 512, align 1			%l = alloca i8, i32 512, align 1
	call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %a, i8* align 16 %b, i64 16, i1 0)			call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %a, i8* align 16 %b, i64 16, i1 0)
	call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %b, i8* align 16 %c, i64 16, i1 0)			call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %b, i8* align 16 %c, i64 16, i1 0)
	call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %a, i8* align 16 %c, i64 16, i1 0)			call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %a, i8* align 16 %c, i64 16, i1 0)
	call void @hey()			call void @hey()
	call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %l, i8* align 16 %c, i64 16, i1 0)			call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %l, i8* align 16 %c, i64 16, i1 0)
	ret void			ret void
	}			}

	define void @foo(i8* nocapture %a, i8* nocapture readonly %c, i8* nocapture %b) #2 {			define void @foo(i8* nocapture %a, i8* nocapture readonly %c, i8* nocapture %b) #2 {
				; CHECK-LABEL: define {{[^@]+}}@foo
				; CHECK-SAME: (i8* nocapture [[A:%.]], i8 nocapture readonly [[C:%.]], i8 nocapture [[B:%.]]) [[ATTR2:#.]] {
				; CHECK-NEXT: entry:
				; CHECK-NEXT: [[L_I:%.*]] = alloca i8, i32 512, align 1
				; CHECK-NEXT: [[TMP0:%.]] = call i8 @llvm.noalias.decl.p0i8.p0p0i8.i64(i8** null, i64 0, [[META0:metadata !.*]])
				; CHECK-NEXT: [[TMP1:%.]] = call i8 @llvm.noalias.decl.p0i8.p0p0i8.i64(i8** null, i64 0, [[META3:metadata !.*]])
				; CHECK-NEXT: call void @llvm.lifetime.start.p0i8(i64 512, i8* [[L_I]])
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[A]], i8* align 16 [[B]], i64 16, i1 false) [[ATTR2]], !noalias !3
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[B]], i8* align 16 [[C]], i64 16, i1 false) [[ATTR2]], !noalias !0
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[A]], i8* align 16 [[C]], i64 16, i1 false) [[ATTR2]], !alias.scope !5
				; CHECK-NEXT: call void @hey() [[ATTR2]], !noalias !5
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[L_I]], i8* align 16 [[C]], i64 16, i1 false) [[ATTR2]], !noalias !0
				; CHECK-NEXT: call void @llvm.lifetime.end.p0i8(i64 512, i8* [[L_I]])
				; CHECK-NEXT: ret void
				;
	entry:			entry:
	tail call void @hello(i8* %a, i8* %c, i8* %b)			tail call void @hello(i8* %a, i8* %c, i8* %b)
	ret void			ret void
	}			}

	define void @hello_cs(i8* nocapture %a, i8* nocapture readonly %c, i8* nocapture %b) #1 {			define void @hello_cs(i8* nocapture %a, i8* nocapture readonly %c, i8* nocapture %b) #1 {
				; CHECK-LABEL: define {{[^@]+}}@hello_cs
				; CHECK-SAME: (i8* nocapture [[A:%.]], i8 nocapture readonly [[C:%.]], i8 nocapture [[B:%.*]]) [[ATTR1]] {
				; CHECK-NEXT: entry:
				; CHECK-NEXT: [[L:%.*]] = alloca i8, i32 512, align 1
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[A]], i8* align 16 [[B]], i64 16, i1 false)
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[B]], i8* align 16 [[C]], i64 16, i1 false)
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[A]], i8* align 16 [[C]], i64 16, i1 false)
				; CHECK-NEXT: call void @hey()
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[L]], i8* align 16 [[C]], i64 16, i1 false)
				; CHECK-NEXT: ret void
				;
	entry:			entry:
	%l = alloca i8, i32 512, align 1			%l = alloca i8, i32 512, align 1
	call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %a, i8* align 16 %b, i64 16, i1 0)			call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %a, i8* align 16 %b, i64 16, i1 0)
	call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %b, i8* align 16 %c, i64 16, i1 0)			call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %b, i8* align 16 %c, i64 16, i1 0)
	call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %a, i8* align 16 %c, i64 16, i1 0)			call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %a, i8* align 16 %c, i64 16, i1 0)
	call void @hey()			call void @hey()
	call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %l, i8* align 16 %c, i64 16, i1 0)			call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %l, i8* align 16 %c, i64 16, i1 0)
	ret void			ret void
	}			}

	define void @foo_cs(i8* nocapture %a, i8* nocapture readonly %c, i8* nocapture %b) #2 {			define void @foo_cs(i8* nocapture %a, i8* nocapture readonly %c, i8* nocapture %b) #2 {
				; CHECK-LABEL: define {{[^@]+}}@foo_cs
				; CHECK-SAME: (i8* nocapture [[A:%.]], i8 nocapture readonly [[C:%.]], i8 nocapture [[B:%.*]]) [[ATTR2]] {
				; CHECK-NEXT: entry:
				; CHECK-NEXT: [[L_I:%.*]] = alloca i8, i32 512, align 1
				; CHECK-NEXT: [[TMP0:%.]] = call i8 @llvm.noalias.decl.p0i8.p0p0i8.i64(i8** null, i64 0, [[META6:metadata !.*]])
				; CHECK-NEXT: [[TMP1:%.]] = call i8 @llvm.noalias.decl.p0i8.p0p0i8.i64(i8** null, i64 0, [[META9:metadata !.*]])
				; CHECK-NEXT: call void @llvm.lifetime.start.p0i8(i64 512, i8* [[L_I]])
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[A]], i8* align 16 [[B]], i64 16, i1 false) [[ATTR2]], !noalias !9
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[B]], i8* align 16 [[C]], i64 16, i1 false) [[ATTR2]], !noalias !6
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[A]], i8* align 16 [[C]], i64 16, i1 false) [[ATTR2]], !alias.scope !11
				; CHECK-NEXT: call void @hey() [[ATTR2]], !noalias !11
				; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[L_I]], i8* align 16 [[C]], i64 16, i1 false) [[ATTR2]], !noalias !6
				; CHECK-NEXT: call void @llvm.lifetime.end.p0i8(i64 512, i8* [[L_I]])
				; CHECK-NEXT: ret void
				;
	entry:			entry:
	tail call void @hello_cs(i8* noalias %a, i8* noalias %c, i8* %b)			tail call void @hello_cs(i8* noalias %a, i8* noalias %c, i8* %b)
	ret void			ret void
	}			}

	; CHECK: define void @foo(i8* nocapture %a, i8* nocapture readonly %c, i8* nocapture %b) #2 {
	; CHECK: entry:
	; CHECK: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %a, i8* align 16 %b, i64 16, i1 false) #2, !noalias !0
	; CHECK: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %b, i8* align 16 %c, i64 16, i1 false) #2, !noalias !3
	; CHECK: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %a, i8* align 16 %c, i64 16, i1 false) #2, !alias.scope !5
	; CHECK: call void @hey() #2, !noalias !5
	; CHECK: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %{{.}}, i8 align 16 %c, i64 16, i1 false) #2, !noalias !3
	; CHECK: ret void
	; CHECK: }

	; CHECK: define void @foo_cs(i8* nocapture %a, i8* nocapture readonly %c, i8* nocapture %b) #2 {
	; CHECK: entry:
	; CHECK: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %a, i8* align 16 %b, i64 16, i1 false) #2, !noalias !6
	; CHECK: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %b, i8* align 16 %c, i64 16, i1 false) #2, !noalias !9
	; CHECK: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %a, i8* align 16 %c, i64 16, i1 false) #2, !alias.scope !11
	; CHECK: call void @hey() #2, !noalias !11
	; CHECK: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 %{{.}}, i8 align 16 %c, i64 16, i1 false) #2, !noalias !9
	; CHECK: ret void
	; CHECK: }

	attributes #0 = { argmemonly nofree nosync nounwind willreturn }			attributes #0 = { argmemonly nofree nosync nounwind willreturn }
	attributes #1 = { argmemonly nounwind willreturn }			attributes #1 = { argmemonly nounwind willreturn }
	attributes #2 = { nounwind }			attributes #2 = { nounwind }
	attributes #3 = { nounwind uwtable }			attributes #3 = { nounwind uwtable }

	; CHECK: !0 = !{!1}			; CHECK: !0 = !{!1}
	; CHECK: !1 = distinct !{!1, !2, !"hello: %c"}			; CHECK: !1 = distinct !{!1, !2, !"hello: %a"}
	; CHECK: !2 = distinct !{!2, !"hello"}			; CHECK: !2 = distinct !{!2, !"hello"}
	; CHECK: !3 = !{!4}			; CHECK: !3 = !{!4}
	; CHECK: !4 = distinct !{!4, !2, !"hello: %a"}			; CHECK: !4 = distinct !{!4, !2, !"hello: %c"}
	; CHECK: !5 = !{!4, !1}			; CHECK: !5 = !{!1, !4}

	; CHECK: !6 = !{!7}			; CHECK: !6 = !{!7}
	; CHECK: !7 = distinct !{!7, !8, !"hello_cs: %c"}			; CHECK: !7 = distinct !{!7, !8, !"hello_cs: %a"}
	; CHECK: !8 = distinct !{!8, !"hello_cs"}			; CHECK: !8 = distinct !{!8, !"hello_cs"}
	; CHECK: !9 = !{!10}			; CHECK: !9 = !{!10}
	; CHECK: !10 = distinct !{!10, !8, !"hello_cs: %a"}			; CHECK: !10 = distinct !{!10, !8, !"hello_cs: %c"}
	; CHECK: !11 = !{!10, !7}			; CHECK: !11 = !{!7, !10}

llvm/test/Transforms/Inline/noalias.ll

	Show All 13 Lines
	entry:			entry:
	tail call void @hello(float* %a, float* %c)			tail call void @hello(float* %a, float* %c)
	%0 = load float, float* %c, align 4			%0 = load float, float* %c, align 4
	%arrayidx = getelementptr inbounds float, float* %a, i64 7			%arrayidx = getelementptr inbounds float, float* %a, i64 7
	store float %0, float* %arrayidx, align 4			store float %0, float* %arrayidx, align 4
	ret void			ret void
	}			}

	; CHECK: define void @foo(float* nocapture %a, float* nocapture readonly %c) #0 {			; CHECK-LABEL: define void @foo(float* nocapture %a, float* nocapture readonly %c) #0 {
	; CHECK: entry:			; CHECK: entry:
	; CHECK: %0 = load float, float* %c, align 4, !noalias !0			; CHECK: call i8* @llvm.noalias.decl
				; CHECK: [[TMP0:%.+]] = load float, float* %c, align 4, !noalias !0
	; CHECK: %arrayidx.i = getelementptr inbounds float, float* %a, i64 5			; CHECK: %arrayidx.i = getelementptr inbounds float, float* %a, i64 5
	; CHECK: store float %0, float* %arrayidx.i, align 4, !alias.scope !0			; CHECK: store float [[TMP0]], float* %arrayidx.i, align 4, !alias.scope !0
	; CHECK: %1 = load float, float* %c, align 4			; CHECK: [[TMP1:%.+]] = load float, float* %c, align 4
	; CHECK: %arrayidx = getelementptr inbounds float, float* %a, i64 7			; CHECK: %arrayidx = getelementptr inbounds float, float* %a, i64 7
	; CHECK: store float %1, float* %arrayidx, align 4			; CHECK: store float [[TMP1]], float* %arrayidx, align 4
	; CHECK: ret void			; CHECK: ret void
	; CHECK: }			; CHECK: }

	define void @hello2(float* noalias nocapture %a, float* noalias nocapture %b, float* nocapture readonly %c) #0 {			define void @hello2(float* noalias nocapture %a, float* noalias nocapture %b, float* nocapture readonly %c) #0 {
	entry:			entry:
	%0 = load float, float* %c, align 4			%0 = load float, float* %c, align 4
	%arrayidx = getelementptr inbounds float, float* %a, i64 5			%arrayidx = getelementptr inbounds float, float* %a, i64 5
	store float %0, float* %arrayidx, align 4			store float %0, float* %arrayidx, align 4
	%arrayidx1 = getelementptr inbounds float, float* %b, i64 8			%arrayidx1 = getelementptr inbounds float, float* %b, i64 8
	store float %0, float* %arrayidx1, align 4			store float %0, float* %arrayidx1, align 4
	ret void			ret void
	}			}

	define void @foo2(float* nocapture %a, float* nocapture %b, float* nocapture readonly %c) #0 {			define void @foo2(float* nocapture %a, float* nocapture %b, float* nocapture readonly %c) #0 {
	entry:			entry:
	tail call void @hello2(float* %a, float* %b, float* %c)			tail call void @hello2(float* %a, float* %b, float* %c)
	%0 = load float, float* %c, align 4			%0 = load float, float* %c, align 4
	%arrayidx = getelementptr inbounds float, float* %a, i64 7			%arrayidx = getelementptr inbounds float, float* %a, i64 7
	store float %0, float* %arrayidx, align 4			store float %0, float* %arrayidx, align 4
	ret void			ret void
	}			}

	; CHECK: define void @foo2(float* nocapture %a, float* nocapture %b, float* nocapture readonly %c) #0 {			; CHECK-LABEL: define void @foo2(float* nocapture %a, float* nocapture %b, float* nocapture readonly %c) #0 {
	; CHECK: entry:			; CHECK: entry:
	; CHECK: %0 = load float, float* %c, align 4, !noalias !3			; CHECK: call i8* @llvm.noalias.decl.p0i8.p0p0f32.i64(float** null, i64 0, metadata !3)
				; CHECK: call i8* @llvm.noalias.decl.p0i8.p0p0f32.i64(float** null, i64 0, metadata !6)
				; CHECK: [[TMP0:%.+]] = load float, float* %c, align 4, !noalias !8
	; CHECK: %arrayidx.i = getelementptr inbounds float, float* %a, i64 5			; CHECK: %arrayidx.i = getelementptr inbounds float, float* %a, i64 5
	; CHECK: store float %0, float* %arrayidx.i, align 4, !alias.scope !7, !noalias !8			; CHECK: store float [[TMP0]], float* %arrayidx.i, align 4, !alias.scope !3, !noalias !6
	; CHECK: %arrayidx1.i = getelementptr inbounds float, float* %b, i64 8			; CHECK: %arrayidx1.i = getelementptr inbounds float, float* %b, i64 8
	; CHECK: store float %0, float* %arrayidx1.i, align 4, !alias.scope !8, !noalias !7			; CHECK: store float [[TMP0]], float* %arrayidx1.i, align 4, !alias.scope !6, !noalias !3
	; CHECK: %1 = load float, float* %c, align 4			; CHECK: [[TMP1:%.+]] = load float, float* %c, align 4
	; CHECK: %arrayidx = getelementptr inbounds float, float* %a, i64 7			; CHECK: %arrayidx = getelementptr inbounds float, float* %a, i64 7
	; CHECK: store float %1, float* %arrayidx, align 4			; CHECK: store float [[TMP1]], float* %arrayidx, align 4
	; CHECK: ret void			; CHECK: ret void
	; CHECK: }			; CHECK: }

	attributes #0 = { nounwind uwtable }			attributes #0 = { nounwind uwtable }

	; CHECK: !0 = !{!1}			; CHECK: !0 = !{!1}
	; CHECK: !1 = distinct !{!1, !2, !"hello: %a"}			; CHECK: !1 = distinct !{!1, !2, !"hello: %a"}
	; CHECK: !2 = distinct !{!2, !"hello"}			; CHECK: !2 = distinct !{!2, !"hello"}
	; CHECK: !3 = !{!4, !6}			; CHECK: !3 = !{!4}
	; CHECK: !4 = distinct !{!4, !5, !"hello2: %a"}			; CHECK: !4 = distinct !{!4, !5, !"hello2: %a"}
	; CHECK: !5 = distinct !{!5, !"hello2"}			; CHECK: !5 = distinct !{!5, !"hello2"}
	; CHECK: !6 = distinct !{!6, !5, !"hello2: %b"}			; CHECK: !6 = !{!7}
	; CHECK: !7 = !{!4}			; CHECK: !7 = distinct !{!7, !5, !"hello2: %b"}
	; CHECK: !8 = !{!6}			; CHECK: !8 = !{!4, !7}

llvm/test/Transforms/Inline/noalias2.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py UTC_ARGS: --function-signature			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py UTC_ARGS: --function-signature
	; RUN: opt -inline -enable-noalias-to-md-conversion -S < %s \| FileCheck %s --check-prefixes=CHECK,NO_ASSUME			; RUN: opt -inline -enable-noalias-to-md-conversion -S < %s \| FileCheck %s --check-prefixes=CHECK,NO_ASSUME
	; RUN: opt -inline -enable-noalias-to-md-conversion --enable-knowledge-retention -S < %s \| FileCheck %s			; RUN: opt -inline -enable-noalias-to-md-conversion --enable-knowledge-retention -S < %s \| FileCheck %s

	target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"			target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	define void @hello(float* noalias nocapture %a, float* noalias nocapture readonly %c) #0 {			define void @hello(float* noalias nocapture %a, float* noalias nocapture readonly %c) #0 {
	; CHECK-LABEL: define {{[^@]+}}@hello			; CHECK-LABEL: define {{[^@]+}}@hello
	; CHECK-SAME: (float* noalias nocapture [[A:%.]], float noalias nocapture readonly [[C:%.*]]) #0			; CHECK-SAME: (float* noalias nocapture [[A:%.]], float noalias nocapture readonly [[C:%.]]) [[ATTR0:#.]] {
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[TMP0:%.]] = load float, float [[C]], align 4			; CHECK-NEXT: [[TMP0:%.]] = load float, float [[C]], align 4
	; CHECK-NEXT: [[ARRAYIDX:%.]] = getelementptr inbounds float, float [[A]], i64 5			; CHECK-NEXT: [[ARRAYIDX:%.]] = getelementptr inbounds float, float [[A]], i64 5
	; CHECK-NEXT: store float [[TMP0]], float* [[ARRAYIDX]], align 4			; CHECK-NEXT: store float [[TMP0]], float* [[ARRAYIDX]], align 4
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	; ASSUME-LABEL: @hello(			; ASSUME-LABEL: @hello(
	; ASSUME-NEXT: entry:			; ASSUME-NEXT: entry:
	; ASSUME-NEXT: [[TMP0:%.]] = load float, float [[C:%.*]], align 4			; ASSUME-NEXT: [[TMP0:%.]] = load float, float [[C:%.*]], align 4
	; ASSUME-NEXT: [[ARRAYIDX:%.]] = getelementptr inbounds float, float [[A:%.*]], i64 5			; ASSUME-NEXT: [[ARRAYIDX:%.]] = getelementptr inbounds float, float [[A:%.*]], i64 5
	; ASSUME-NEXT: store float [[TMP0]], float* [[ARRAYIDX]], align 4			; ASSUME-NEXT: store float [[TMP0]], float* [[ARRAYIDX]], align 4
	; ASSUME-NEXT: ret void			; ASSUME-NEXT: ret void
	entry:			entry:
	%0 = load float, float* %c, align 4			%0 = load float, float* %c, align 4
	%arrayidx = getelementptr inbounds float, float* %a, i64 5			%arrayidx = getelementptr inbounds float, float* %a, i64 5
	store float %0, float* %arrayidx, align 4			store float %0, float* %arrayidx, align 4
	ret void			ret void
	}			}

	define void @foo(float* noalias nocapture %a, float* noalias nocapture readonly %c) #0 {			define void @foo(float* noalias nocapture %a, float* noalias nocapture readonly %c) #0 {
	; CHECK-LABEL: define {{[^@]+}}@foo			; CHECK-LABEL: define {{[^@]+}}@foo
	; CHECK-SAME: (float* noalias nocapture [[A:%.]], float noalias nocapture readonly [[C:%.*]]) #0			; CHECK-SAME: (float* noalias nocapture [[A:%.]], float noalias nocapture readonly [[C:%.*]]) [[ATTR0]] {
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[TMP0:%.]] = load float, float [[C]], align 4, !alias.scope !0, !noalias !3			; CHECK-NEXT: [[TMP0:%.]] = call i8 @llvm.noalias.decl.p0i8.p0p0f32.i64(float** null, i64 0, [[META0:metadata !.*]])
				; CHECK-NEXT: [[TMP1:%.]] = call i8 @llvm.noalias.decl.p0i8.p0p0f32.i64(float** null, i64 0, [[META3:metadata !.*]])
				; CHECK-NEXT: [[TMP2:%.]] = load float, float [[C]], align 4, !alias.scope !3, !noalias !0
	; CHECK-NEXT: [[ARRAYIDX_I:%.]] = getelementptr inbounds float, float [[A]], i64 5			; CHECK-NEXT: [[ARRAYIDX_I:%.]] = getelementptr inbounds float, float [[A]], i64 5
	; CHECK-NEXT: store float [[TMP0]], float* [[ARRAYIDX_I]], align 4, !alias.scope !3, !noalias !0			; CHECK-NEXT: store float [[TMP2]], float* [[ARRAYIDX_I]], align 4, !alias.scope !0, !noalias !3
	; CHECK-NEXT: [[TMP1:%.]] = load float, float [[C]], align 4			; CHECK-NEXT: [[TMP3:%.]] = load float, float [[C]], align 4
	; CHECK-NEXT: [[ARRAYIDX:%.]] = getelementptr inbounds float, float [[A]], i64 7			; CHECK-NEXT: [[ARRAYIDX:%.]] = getelementptr inbounds float, float [[A]], i64 7
	; CHECK-NEXT: store float [[TMP1]], float* [[ARRAYIDX]], align 4			; CHECK-NEXT: store float [[TMP3]], float* [[ARRAYIDX]], align 4
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	; ASSUME-LABEL: @foo(			; ASSUME-LABEL: @foo(
	; ASSUME-NEXT: entry:			; ASSUME-NEXT: entry:
	; ASSUME-NEXT: call void @llvm.assume(i1 true) [ "noalias"(float* [[A:%.]]), "noalias"(float [[C:%.*]]) ]			; ASSUME-NEXT: call void @llvm.assume(i1 true) [ "noalias"(float* [[A:%.]]), "noalias"(float [[C:%.*]]) ]
	; ASSUME-NEXT: [[TMP0:%.]] = load float, float [[C]], align 4, !alias.scope !0, !noalias !3			; ASSUME-NEXT: [[TMP0:%.]] = load float, float [[C]], align 4, !alias.scope !0, !noalias !3
	; ASSUME-NEXT: [[ARRAYIDX_I:%.]] = getelementptr inbounds float, float [[A]], i64 5			; ASSUME-NEXT: [[ARRAYIDX_I:%.]] = getelementptr inbounds float, float [[A]], i64 5
	; ASSUME-NEXT: store float [[TMP0]], float* [[ARRAYIDX_I]], align 4, !alias.scope !3, !noalias !0			; ASSUME-NEXT: store float [[TMP0]], float* [[ARRAYIDX_I]], align 4, !alias.scope !3, !noalias !0
	; ASSUME-NEXT: [[TMP1:%.]] = load float, float [[C]], align 4			; ASSUME-NEXT: [[TMP1:%.]] = load float, float [[C]], align 4
	; ASSUME-NEXT: [[ARRAYIDX:%.]] = getelementptr inbounds float, float [[A]], i64 7			; ASSUME-NEXT: [[ARRAYIDX:%.]] = getelementptr inbounds float, float [[A]], i64 7
	; ASSUME-NEXT: store float [[TMP1]], float* [[ARRAYIDX]], align 4			; ASSUME-NEXT: store float [[TMP1]], float* [[ARRAYIDX]], align 4
	; ASSUME-NEXT: ret void			; ASSUME-NEXT: ret void
	entry:			entry:
	tail call void @hello(float* %a, float* %c)			tail call void @hello(float* %a, float* %c)
	%0 = load float, float* %c, align 4			%0 = load float, float* %c, align 4
	%arrayidx = getelementptr inbounds float, float* %a, i64 7			%arrayidx = getelementptr inbounds float, float* %a, i64 7
	store float %0, float* %arrayidx, align 4			store float %0, float* %arrayidx, align 4
	ret void			ret void
	}			}

	define void @hello2(float* noalias nocapture %a, float* noalias nocapture %b, float* nocapture readonly %c) #0 {			define void @hello2(float* noalias nocapture %a, float* noalias nocapture %b, float* nocapture readonly %c) #0 {
	; CHECK-LABEL: define {{[^@]+}}@hello2			; CHECK-LABEL: define {{[^@]+}}@hello2
	; CHECK-SAME: (float* noalias nocapture [[A:%.]], float noalias nocapture [[B:%.]], float nocapture readonly [[C:%.*]]) #0			; CHECK-SAME: (float* noalias nocapture [[A:%.]], float noalias nocapture [[B:%.]], float nocapture readonly [[C:%.*]]) [[ATTR0]] {
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[TMP0:%.]] = load float, float [[C]], align 4			; CHECK-NEXT: [[TMP0:%.]] = load float, float [[C]], align 4
	; CHECK-NEXT: [[ARRAYIDX:%.]] = getelementptr inbounds float, float [[A]], i64 6			; CHECK-NEXT: [[ARRAYIDX:%.]] = getelementptr inbounds float, float [[A]], i64 6
	; CHECK-NEXT: store float [[TMP0]], float* [[ARRAYIDX]], align 4			; CHECK-NEXT: store float [[TMP0]], float* [[ARRAYIDX]], align 4
	; CHECK-NEXT: [[ARRAYIDX1:%.]] = getelementptr inbounds float, float [[B]], i64 8			; CHECK-NEXT: [[ARRAYIDX1:%.]] = getelementptr inbounds float, float [[B]], i64 8
	; CHECK-NEXT: store float [[TMP0]], float* [[ARRAYIDX1]], align 4			; CHECK-NEXT: store float [[TMP0]], float* [[ARRAYIDX1]], align 4
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	%0 = load float, float* %c, align 4			%0 = load float, float* %c, align 4
	%arrayidx = getelementptr inbounds float, float* %a, i64 6			%arrayidx = getelementptr inbounds float, float* %a, i64 6
	store float %0, float* %arrayidx, align 4			store float %0, float* %arrayidx, align 4
	%arrayidx1 = getelementptr inbounds float, float* %b, i64 8			%arrayidx1 = getelementptr inbounds float, float* %b, i64 8
	store float %0, float* %arrayidx1, align 4			store float %0, float* %arrayidx1, align 4
	ret void			ret void
	}			}

	; Check that when hello() is inlined into foo(), and then foo() is inlined into			; Check that when hello() is inlined into foo(), and then foo() is inlined into
	; foo2(), the noalias scopes are properly concatenated.			; foo2(), the noalias scopes are properly concatenated.
	define void @foo2(float* nocapture %a, float* nocapture %b, float* nocapture readonly %c) #0 {			define void @foo2(float* nocapture %a, float* nocapture %b, float* nocapture readonly %c) #0 {
	; CHECK-LABEL: define {{[^@]+}}@foo2			; CHECK-LABEL: define {{[^@]+}}@foo2
	; CHECK-SAME: (float* nocapture [[A:%.]], float nocapture [[B:%.]], float nocapture readonly [[C:%.*]]) #0			; CHECK-SAME: (float* nocapture [[A:%.]], float nocapture [[B:%.]], float nocapture readonly [[C:%.*]]) [[ATTR0]] {
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[TMP0:%.]] = load float, float [[C]], align 4, !alias.scope !5, !noalias !10			; CHECK-NEXT: [[TMP0:%.]] = call i8 @llvm.noalias.decl.p0i8.p0p0f32.i64(float** null, i64 0, [[META5:metadata !.*]])
				; CHECK-NEXT: [[TMP1:%.]] = call i8 @llvm.noalias.decl.p0i8.p0p0f32.i64(float** null, i64 0, [[META8:metadata !.*]])
				; CHECK-NEXT: [[TMP2:%.]] = call i8 @llvm.noalias.decl.p0i8.p0p0f32.i64(float** null, i64 0, [[META0]]) [[ATTR2:#.*]], !noalias !10
				; CHECK-NEXT: [[TMP3:%.]] = call i8 @llvm.noalias.decl.p0i8.p0p0f32.i64(float** null, i64 0, [[META3]]) [[ATTR2]], !noalias !10
				; CHECK-NEXT: [[TMP4:%.]] = load float, float [[C]], align 4, !alias.scope !11, !noalias !14
	; CHECK-NEXT: [[ARRAYIDX_I_I:%.]] = getelementptr inbounds float, float [[A]], i64 5			; CHECK-NEXT: [[ARRAYIDX_I_I:%.]] = getelementptr inbounds float, float [[A]], i64 5
	; CHECK-NEXT: store float [[TMP0]], float* [[ARRAYIDX_I_I]], align 4, !alias.scope !10, !noalias !5			; CHECK-NEXT: store float [[TMP4]], float* [[ARRAYIDX_I_I]], align 4, !alias.scope !14, !noalias !11
	; CHECK-NEXT: [[TMP1:%.]] = load float, float [[C]], align 4, !alias.scope !13, !noalias !14			; CHECK-NEXT: [[TMP5:%.]] = load float, float [[C]], align 4, !alias.scope !8, !noalias !5
	; CHECK-NEXT: [[ARRAYIDX_I:%.]] = getelementptr inbounds float, float [[A]], i64 7			; CHECK-NEXT: [[ARRAYIDX_I:%.]] = getelementptr inbounds float, float [[A]], i64 7
	; CHECK-NEXT: store float [[TMP1]], float* [[ARRAYIDX_I]], align 4, !alias.scope !14, !noalias !13			; CHECK-NEXT: store float [[TMP5]], float* [[ARRAYIDX_I]], align 4, !alias.scope !5, !noalias !8
	; CHECK-NEXT: [[TMP2:%.]] = load float, float [[C]], align 4, !noalias !15			; CHECK-NEXT: [[TMP6:%.]] = call i8 @llvm.noalias.decl.p0i8.p0p0f32.i64(float** null, i64 0, [[META16:metadata !.*]])
				; CHECK-NEXT: [[TMP7:%.]] = call i8 @llvm.noalias.decl.p0i8.p0p0f32.i64(float** null, i64 0, [[META19:metadata !.*]])
				; CHECK-NEXT: [[TMP8:%.]] = load float, float [[C]], align 4, !noalias !21
	; CHECK-NEXT: [[ARRAYIDX_I1:%.]] = getelementptr inbounds float, float [[A]], i64 6			; CHECK-NEXT: [[ARRAYIDX_I1:%.]] = getelementptr inbounds float, float [[A]], i64 6
	; CHECK-NEXT: store float [[TMP2]], float* [[ARRAYIDX_I1]], align 4, !alias.scope !19, !noalias !20			; CHECK-NEXT: store float [[TMP8]], float* [[ARRAYIDX_I1]], align 4, !alias.scope !16, !noalias !19
	; CHECK-NEXT: [[ARRAYIDX1_I:%.]] = getelementptr inbounds float, float [[B]], i64 8			; CHECK-NEXT: [[ARRAYIDX1_I:%.]] = getelementptr inbounds float, float [[B]], i64 8
	; CHECK-NEXT: store float [[TMP2]], float* [[ARRAYIDX1_I]], align 4, !alias.scope !20, !noalias !19			; CHECK-NEXT: store float [[TMP8]], float* [[ARRAYIDX1_I]], align 4, !alias.scope !19, !noalias !16
	; CHECK-NEXT: [[TMP3:%.]] = load float, float [[C]], align 4			; CHECK-NEXT: [[TMP9:%.]] = load float, float [[C]], align 4
	; CHECK-NEXT: [[ARRAYIDX:%.]] = getelementptr inbounds float, float [[A]], i64 7			; CHECK-NEXT: [[ARRAYIDX:%.]] = getelementptr inbounds float, float [[A]], i64 7
	; CHECK-NEXT: store float [[TMP3]], float* [[ARRAYIDX]], align 4			; CHECK-NEXT: store float [[TMP9]], float* [[ARRAYIDX]], align 4
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	tail call void @foo(float* %a, float* %c)			tail call void @foo(float* %a, float* %c)
	tail call void @hello2(float* %a, float* %b, float* %c)			tail call void @hello2(float* %a, float* %b, float* %c)
	%0 = load float, float* %c, align 4			%0 = load float, float* %c, align 4
	%arrayidx = getelementptr inbounds float, float* %a, i64 7			%arrayidx = getelementptr inbounds float, float* %a, i64 7
	store float %0, float* %arrayidx, align 4			store float %0, float* %arrayidx, align 4
	ret void			ret void
	}			}

	; NO_ASSUME: !0 = !{!1}			; NO_ASSUME: !0 = !{!1}
	; NO_ASSUME: !1 = distinct !{!1, !2, !"hello: %c"}			; NO_ASSUME: !1 = distinct !{!1, !2, !"hello: %a"}
	; NO_ASSUME: !2 = distinct !{!2, !"hello"}			; NO_ASSUME: !2 = distinct !{!2, !"hello"}
	; NO_ASSUME: !3 = !{!4}			; NO_ASSUME: !3 = !{!4}
	; NO_ASSUME: !4 = distinct !{!4, !2, !"hello: %a"}			; NO_ASSUME: !4 = distinct !{!4, !2, !"hello: %c"}
	; NO_ASSUME: !5 = !{!6, !8}			; NO_ASSUME: !5 = !{!6}
	; NO_ASSUME: !6 = distinct !{!6, !7, !"hello: %c"}			; NO_ASSUME: !6 = distinct !{!6, !7, !"foo: %a"}
	; NO_ASSUME: !7 = distinct !{!7, !"hello"}			; NO_ASSUME: !7 = distinct !{!7, !"foo"}
	; NO_ASSUME: !8 = distinct !{!8, !9, !"foo: %c"}			; NO_ASSUME: !8 = !{!9}
	; NO_ASSUME: !9 = distinct !{!9, !"foo"}			; NO_ASSUME: !9 = distinct !{!9, !7, !"foo: %c"}
	; NO_ASSUME: !10 = !{!11, !12}			; NO_ASSUME: !10 = !{!6, !9}
	; NO_ASSUME: !11 = distinct !{!11, !7, !"hello: %a"}			; NO_ASSUME: !11 = !{!12, !9}
	; NO_ASSUME: !12 = distinct !{!12, !9, !"foo: %a"}			; NO_ASSUME: !12 = distinct !{!12, !13, !"hello: %c"}
	; NO_ASSUME: !13 = !{!8}			; NO_ASSUME: !13 = distinct !{!13, !"hello"}
	; NO_ASSUME: !14 = !{!12}			; NO_ASSUME: !14 = !{!15, !6}
	; NO_ASSUME: !15 = !{!16, !18}			; NO_ASSUME: !15 = distinct !{!15, !13, !"hello: %a"}
	; NO_ASSUME: !16 = distinct !{!16, !17, !"hello2: %a"}			; NO_ASSUME: !16 = !{!17}
	; NO_ASSUME: !17 = distinct !{!17, !"hello2"}			; NO_ASSUME: !17 = distinct !{!17, !18, !"hello2: %a"}
	; NO_ASSUME: !18 = distinct !{!18, !17, !"hello2: %b"}			; NO_ASSUME: !18 = distinct !{!18, !"hello2"}
	; NO_ASSUME: !19 = !{!16}			; NO_ASSUME: !19 = !{!20}
	; NO_ASSUME: !20 = !{!18}			; NO_ASSUME: !20 = distinct !{!20, !18, !"hello2: %b"}
				; NO_ASSUME: !21 = !{!17, !20}

	attributes #0 = { nounwind uwtable }			attributes #0 = { nounwind uwtable }

llvm/test/Transforms/PhaseOrdering/inlining-alignment-assumptions.ll

	Show First 20 Lines • Show All 89 Lines • ▼ Show 20 Lines

	define internal void @callee2(i64* noalias sret(i64) align 32 %arg) {			define internal void @callee2(i64* noalias sret(i64) align 32 %arg) {
	store i64 0, i64* %arg, align 8			store i64 0, i64* %arg, align 8
	ret void			ret void
	}			}

	define amdgpu_kernel void @caller2() {			define amdgpu_kernel void @caller2() {
	; CHECK-LABEL: @caller2(			; CHECK-LABEL: @caller2(
				; CHECK-NEXT: [[TMP1:%.]] = tail call i8 @llvm.noalias.decl.p0i8.p0p0i64.i64(i64** null, i64 0, [[META0:metadata !.*]])
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	%alloca = alloca i64, align 8, addrspace(5)			%alloca = alloca i64, align 8, addrspace(5)
	%cast = addrspacecast i64 addrspace(5)* %alloca to i64*			%cast = addrspacecast i64 addrspace(5)* %alloca to i64*
	call void @callee2(i64* sret(i64) align 32 %cast)			call void @callee2(i64* sret(i64) align 32 %cast)
	ret void			ret void
	}			}

llvm/test/Transforms/PhaseOrdering/instcombine-sroa-inttoptr.ll

	Show First 20 Lines • Show All 64 Lines • ▼ Show 20 Lines
	}			}

	define dso_local i32* @_Z3foo1S(%0* byval(%0) align 8 %arg) {			define dso_local i32* @_Z3foo1S(%0* byval(%0) align 8 %arg) {
	; CHECK-LABEL: @_Z3foo1S(			; CHECK-LABEL: @_Z3foo1S(
	; CHECK-NEXT: bb:			; CHECK-NEXT: bb:
	; CHECK-NEXT: [[I2:%.]] = alloca [[TMP0:%.]], align 8			; CHECK-NEXT: [[I2:%.]] = alloca [[TMP0:%.]], align 8
	; CHECK-NEXT: [[I1_SROA_0_0_I5_SROA_IDX:%.]] = getelementptr inbounds [[TMP0]], %0 [[ARG:%.*]], i64 0, i32 0			; CHECK-NEXT: [[I1_SROA_0_0_I5_SROA_IDX:%.]] = getelementptr inbounds [[TMP0]], %0 [[ARG:%.*]], i64 0, i32 0
	; CHECK-NEXT: [[I1_SROA_0_0_COPYLOAD:%.]] = load i32, i32** [[I1_SROA_0_0_I5_SROA_IDX]], align 8			; CHECK-NEXT: [[I1_SROA_0_0_COPYLOAD:%.]] = load i32, i32** [[I1_SROA_0_0_I5_SROA_IDX]], align 8
				; CHECK-NEXT: [[TMP0]] = tail call i8* @llvm.noalias.decl.p0i8.p0p0s_s.i64(%0** null, i64 0, [[META0:metadata !.*]])
	; CHECK-NEXT: [[I_SROA_0_0_I6_SROA_IDX:%.]] = getelementptr inbounds [[TMP0]], %0 [[I2]], i64 0, i32 0			; CHECK-NEXT: [[I_SROA_0_0_I6_SROA_IDX:%.]] = getelementptr inbounds [[TMP0]], %0 [[I2]], i64 0, i32 0
	; CHECK-NEXT: store i32* [[I1_SROA_0_0_COPYLOAD]], i32** [[I_SROA_0_0_I6_SROA_IDX]], align 8			; CHECK-NEXT: store i32* [[I1_SROA_0_0_COPYLOAD]], i32** [[I_SROA_0_0_I6_SROA_IDX]], align 8
	; CHECK-NEXT: tail call void @_Z7escape01S(%0* nonnull byval(%0) align 8 [[I2]])			; CHECK-NEXT: tail call void @_Z7escape01S(%0* nonnull byval(%0) align 8 [[I2]])
	; CHECK-NEXT: ret i32* [[I1_SROA_0_0_COPYLOAD]]			; CHECK-NEXT: ret i32* [[I1_SROA_0_0_COPYLOAD]]
	;			;
	bb:			bb:
	%i = alloca %0, align 8			%i = alloca %0, align 8
	%i1 = alloca %0, align 8			%i1 = alloca %0, align 8
	Show All 23 Lines

	declare void @llvm.lifetime.end.p0i8(i64 immarg, i8* nocapture)			declare void @llvm.lifetime.end.p0i8(i64 immarg, i8* nocapture)

	define dso_local i32* @_Z3bar1S(%0* byval(%0) align 8 %arg) {			define dso_local i32* @_Z3bar1S(%0* byval(%0) align 8 %arg) {
	; CHECK-LABEL: @_Z3bar1S(			; CHECK-LABEL: @_Z3bar1S(
	; CHECK-NEXT: bb:			; CHECK-NEXT: bb:
	; CHECK-NEXT: [[I1_SROA_0_0_I4_SROA_IDX:%.]] = getelementptr inbounds [[TMP0:%.]], %0* [[ARG:%.*]], i64 0, i32 0			; CHECK-NEXT: [[I1_SROA_0_0_I4_SROA_IDX:%.]] = getelementptr inbounds [[TMP0:%.]], %0* [[ARG:%.*]], i64 0, i32 0
	; CHECK-NEXT: [[I1_SROA_0_0_COPYLOAD:%.]] = load i32, i32** [[I1_SROA_0_0_I4_SROA_IDX]], align 8			; CHECK-NEXT: [[I1_SROA_0_0_COPYLOAD:%.]] = load i32, i32** [[I1_SROA_0_0_I4_SROA_IDX]], align 8
				; CHECK-NEXT: [[TMP0]] = tail call i8* @llvm.noalias.decl.p0i8.p0p0s_s.i64(%0** null, i64 0, [[META3:metadata !.*]])
	; CHECK-NEXT: [[I5:%.*]] = tail call i32 @_Z4condv()			; CHECK-NEXT: [[I5:%.*]] = tail call i32 @_Z4condv()
	; CHECK-NEXT: [[I6_NOT:%.*]] = icmp eq i32 [[I5]], 0			; CHECK-NEXT: [[I6_NOT:%.*]] = icmp eq i32 [[I5]], 0
	; CHECK-NEXT: br i1 [[I6_NOT]], label [[BB10:%.]], label [[BB7:%.]]			; CHECK-NEXT: br i1 [[I6_NOT]], label [[BB10:%.]], label [[BB7:%.]]
	; CHECK: bb7:			; CHECK: bb7:
	; CHECK-NEXT: tail call void @_Z5sync0v()			; CHECK-NEXT: tail call void @_Z5sync0v()
	; CHECK-NEXT: tail call void @_Z7escape0Pi(i32* [[I1_SROA_0_0_COPYLOAD]])			; CHECK-NEXT: tail call void @_Z7escape0Pi(i32* [[I1_SROA_0_0_COPYLOAD]])
	; CHECK-NEXT: br label [[BB13:%.*]]			; CHECK-NEXT: br label [[BB13:%.*]]
	; CHECK: bb10:			; CHECK: bb10:
	▲ Show 20 Lines • Show All 46 Lines • Show Last 20 Lines

llvm/test/Transforms/PhaseOrdering/pr39282.ll

	Show All 13 Lines
	}			}

	; Consider that %addr1 = %addr2 + 1, in which case %addr2i and %addr1i are			; Consider that %addr1 = %addr2 + 1, in which case %addr2i and %addr1i are
	; noalias within one iteration, but may alias across iterations.			; noalias within one iteration, but may alias across iterations.
	; TODO: This is a micompile.			; TODO: This is a micompile.
	define void @pr39282(i32* %addr1, i32* %addr2) {			define void @pr39282(i32* %addr1, i32* %addr2) {
	; CHECK-LABEL: @pr39282(			; CHECK-LABEL: @pr39282(
	; CHECK-NEXT: start:			; CHECK-NEXT: start:
	; CHECK-NEXT: [[X_I:%.]] = load i32, i32 [[ADDR1:%.*]], align 4, !alias.scope !0, !noalias !3			; CHECK-NEXT: [[TMP0:%.]] = tail call i8 @llvm.noalias.decl.p0i8.p0p0i32.i64(i32** null, i64 0, [[META0:metadata !.*]])
				; CHECK-NEXT: [[TMP1:%.]] = tail call i8 @llvm.noalias.decl.p0i8.p0p0i32.i64(i32** null, i64 0, [[META3:metadata !.*]])
				; CHECK-NEXT: [[X_I:%.]] = load i32, i32 [[ADDR1:%.*]], align 4, !alias.scope !3, !noalias !0
	; CHECK-NEXT: [[ADDR1I_1:%.]] = getelementptr inbounds i32, i32 [[ADDR1]], i64 1			; CHECK-NEXT: [[ADDR1I_1:%.]] = getelementptr inbounds i32, i32 [[ADDR1]], i64 1
	; CHECK-NEXT: [[ADDR2I_1:%.]] = getelementptr inbounds i32, i32 [[ADDR2:%.*]], i64 1			; CHECK-NEXT: [[ADDR2I_1:%.]] = getelementptr inbounds i32, i32 [[ADDR2:%.*]], i64 1
	; CHECK-NEXT: [[X_I_1:%.]] = load i32, i32 [[ADDR1I_1]], align 4, !alias.scope !0, !noalias !3			; CHECK-NEXT: [[TMP2:%.]] = tail call i8 @llvm.noalias.decl.p0i8.p0p0i32.i64(i32** null, i64 0, [[META0]])
	; CHECK-NEXT: store i32 [[X_I]], i32* [[ADDR2]], align 4, !alias.scope !3, !noalias !0			; CHECK-NEXT: [[TMP3:%.]] = tail call i8 @llvm.noalias.decl.p0i8.p0p0i32.i64(i32** null, i64 0, [[META3]])
	; CHECK-NEXT: store i32 [[X_I_1]], i32* [[ADDR2I_1]], align 4, !alias.scope !3, !noalias !0			; CHECK-NEXT: [[X_I_1:%.]] = load i32, i32 [[ADDR1I_1]], align 4, !alias.scope !3, !noalias !0
				; CHECK-NEXT: [[TMP4:%.]] = tail call i8 @llvm.noalias.decl.p0i8.p0p0i32.i64(i32** null, i64 0, [[META0]])
				; CHECK-NEXT: [[TMP5:%.]] = tail call i8 @llvm.noalias.decl.p0i8.p0p0i32.i64(i32** null, i64 0, [[META3]])
				; CHECK-NEXT: store i32 [[X_I]], i32* [[ADDR2]], align 4, !alias.scope !0, !noalias !3
				; CHECK-NEXT: [[TMP6:%.]] = tail call i8 @llvm.noalias.decl.p0i8.p0p0i32.i64(i32** null, i64 0, [[META0]])
				; CHECK-NEXT: [[TMP7:%.]] = tail call i8 @llvm.noalias.decl.p0i8.p0p0i32.i64(i32** null, i64 0, [[META3]])
				; CHECK-NEXT: store i32 [[X_I_1]], i32* [[ADDR2I_1]], align 4, !alias.scope !0, !noalias !3
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	start:			start:
	br label %body			br label %body

	body:			body:
	%i = phi i32 [ 0, %start ], [ %i.next, %body ]			%i = phi i32 [ 0, %start ], [ %i.next, %body ]
	%j = and i32 %i, 1			%j = and i32 %i, 1
	Show All 10 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[InlineFunction] Use llvm.experimental.noalias.scope.decl for noalias arguments.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 310905

llvm/include/llvm/IR/Metadata.h

llvm/include/llvm/Transforms/Utils/Cloning.h

llvm/lib/Transforms/Utils/CloneFunction.cpp

llvm/lib/Transforms/Utils/InlineFunction.cpp

llvm/test/Transforms/Coroutines/ArgAddr.ll

llvm/test/Transforms/Coroutines/coro-retcon-resume-values.ll

llvm/test/Transforms/Coroutines/coro-retcon-value.ll

llvm/test/Transforms/Coroutines/coro-retcon.ll

llvm/test/Transforms/Coroutines/ex2.ll

llvm/test/Transforms/Coroutines/ex3.ll

llvm/test/Transforms/Coroutines/ex4.ll

llvm/test/Transforms/Inline/launder.invariant.group.ll

llvm/test/Transforms/Inline/noalias-calls-always.ll

llvm/test/Transforms/Inline/noalias-calls.ll

llvm/test/Transforms/Inline/noalias.ll

llvm/test/Transforms/Inline/noalias2.ll

llvm/test/Transforms/PhaseOrdering/inlining-alignment-assumptions.ll

llvm/test/Transforms/PhaseOrdering/instcombine-sroa-inttoptr.ll

llvm/test/Transforms/PhaseOrdering/pr39282.ll

[InlineFunction] Use llvm.experimental.noalias.scope.decl for noalias arguments.
ClosedPublic