This is an archive of the discontinued LLVM Phabricator instance.

[Inliner] Use whitelist instead of blacklist when checking function attribute compatibility and make the check stricter
ClosedPublic

Authored by ahatanak on Feb 20 2015, 3:24 PM.

Download Raw Diff

Details

Reviewers

chandlerc
echristo
dexonsmith

Commits

rGb45b1ea86f8f: [Inliner] Merge the attributes of the caller and callee functions
rL257575: [Inliner] Merge the attributes of the caller and callee functions

Summary

This patch changes functionsHaveCompatibleAttribute to list the attributes that can be ignored rather than listing the attributes that should be checked (whitelist, instead of blacklist). I think this is better as we won't unintentionally inline a function that is incompatible with the caller when a new function attribute is added.

In my patch, I've whitelisted all function attributes listed in http://llvm.org/docs/LangRef.html#function-attributes except for these two:
alignstack(<n>)
noimplicitfloat

None of the string attributes have been added to the whitelist yet, so functions that have incompatible string attributes (for example, functions that have incompatible fast-math attributes) will not get inlined.

Diff Detail

Event Timeline

ahatanak updated this revision to Diff 20434.Feb 20 2015, 3:24 PM

ahatanak retitled this revision from to [Inliner] Use whitelist instead of blacklist when checking function attribute compatibility and make the check stricter.

ahatanak updated this object.

ahatanak edited the test plan for this revision. (Show Details)

ahatanak added reviewers: dexonsmith, chandlerc, echristo.

ahatanak added a subscriber: Unknown Object (MLST).

ping

I really don't like this. I really don't like the prior solution either.

Today, we have subtle, hard to diagnose correctness bugs. Bad bad bad.

With this we have subtle, hard to diagnose performance bugs combined with a bit of a maintenance nightmare to keep this up to date. Bad bad bad.

I also fear the performance bugs will in practice happen much more often. I'm seriously worried about the string attributes for example.

I think we need to back up and think of a better design for this. A *much* better design. Do you have any thoughts on how to do this in a more maintainable way? If not I'll try to write something else.

Naturally, if there are specific attributes that are casuing problems, let's get a targeted fix in place for those attributes.

This revision now requires changes to proceed.Mar 3 2015, 5:48 PM

I came up with a patch that addresses the concerns Chandler brought up. This is still a WIP patch, so I would like to hear feedback before I go too far down this route.

The updated patch takes a completely different approach to determining a function's inlinability. It uses table-gen to specify the attributes' enums along with their inlinability/compatibility information. With this new approach, whenever someone defines and uses a new attribute in the IR, the compatibility information has to be added too. Currently, there is only one field in class Attr in the .td file, which indicates whether the inliner should check the attribute's compatibility, but it should be easy to add other fields or rules if other passes need more information about the attribute. For example, I think most of the code in the verifier that looks at attributes in the IR can be auto-generated by table-gen.

The patch doesn't add any checks for the target-specific function attribute strings, but it can be done in a similar way to target-independent enum attributes. Functions are auto-generated by table-gen from .td files and are passed to the constructor of TargetIRAnalysis to enable checking compatibility of target-specific attributes. Also, we should be able to generate code that checks the validity of a string attribute added to the IR, which is something we currently lack. It would issue a warning or error if a string not recognized by the target was used to get an attribute from the attribute set.

Herald added a subscriber: jholewinski. · View Herald TranscriptMar 10 2015, 2:59 PM

Ok, finally back to this. Generally I do like the direction. Some more specific guidelines below.

include/llvm/Analysis/TargetTransformInfo.h
804–805 ↗	(On Diff #21626)	I would start off without trying to solve the problem for textual target attributes, and just take a conservative approach for them until we have a good use case, and can layer it on top.
include/llvm/IR/EnumAttributes.td
1 ↗	(On Diff #21626)	I really like this general approach. Could you start off the .td file with a comment explaining the syntax that should be used?
3 ↗	(On Diff #21626)	Rather than a single bit for compatibility checking, I'd like this to be a list of attributes to fall back on when merging. I'm thinking of a structure like this. def X : Attr<..., CompatSequence<[A, B, C]>>; def Y : Attr<..., CompatSequence<[B, C]>>; This would say that X is compatible with A, B, or C. So if we're inlining a Callee with X into A, B, or C, its fine. It would also mean that if we are inlining a callee with X into a caller Y, we could switch to attribute B instead as the first common attribute between them. Thoughts?

Thank you for the review. A few comments inline.

include/llvm/Analysis/TargetTransformInfo.h
804–805 ↗	(On Diff #21626)	Yes, I think we can look at the target independent attributes first.
include/llvm/IR/EnumAttributes.td
3 ↗	(On Diff #21626)	I think we should use a list of attributes instead of a single bit if we want to check the compatibility between two different kinds of attributes. Do we need to do that for any of the target dependent or independent attributes we are currently using? I know we have to check attributes of the same kind in some cases (e.g., SanitizeAddress), but I'm not aware of any pair of different attribute kinds that can block inlining. Also, if we want to make the list of attributes short, we should probably use lists of incompatible attributes instead of lists of compatible ones (or define table-gen constructs for both), as I think most attributes are always compatible with each other regardless of their values.

Hi Akira,

I think for right now since it's fairly trivial to come up with a testcase that will run into problems we should do a quick conservative patch to the inliner first that will enable some progress - i.e. we should check the target-cpu and target-features strings and if they don't match exactly go ahead and reject in the inliner. This is, obviously, much less than optimal, but will get us to an iterative solution.

Thoughts?

-eric

Yes, I that works for me. I've been working on rewriting the .td file, but
I can do that later.

There are other attributes that we need to check besides target-cpu and
target-features:

Fast FP math attributes: I think we should reject inlining if the caller

has a fast fp-math attribute (e.g., unsafe-fp-math = true) but the callee
doesn't. Alternatively, we can change the caller's fp-math attribute (to
unsafe-fp-math = false). I know the long term solution would be to model
these attributes at the instruction-level, but until we make that change,
we should treat them as function attributes.

NoImplicitFloat.
Probably we should check some of the target-specific attributes too.

Sent a new patch here which changes inliner to check target-cpu and target-features:

http://reviews.llvm.org/D8984

As a side note, I just ran into the - more or less - mirror image of this.

CodeExtractor currently doesn't copy any function attributes into the function it creates (except, for some reason, nounwind).
Copying all attributes, however, doesn't seem safe. For example, a readnone function can have a stack allocation and the pointer can be passed into the extracted function. On the other hand, we definitely want to copy target-cpu/target-features.

Update patch attached.

The following are the changes I made to the table-gen file:

Added attributes classes which derive from the base class. This enables distinguishing enum attributes from string attributes and key-value attributes from attributes that don't have values.
Added target-independent string attributes. Inliner uses these attributes to check caller-callee compatibility.
Added two classes to describe function inlining rules.

I'm mainly interested in whether the table-gen syntax is easy to understand and is expressive enough to describe most of the inlining rules that are not so complex. Also, as I mentioned before, I think it's possible to make some improvements to it later so that it can be used to auto-generate code for other passes (e.g., IR verifier) or for documentation.

In D7802#155864, @mkuper wrote:

As a side note, I just ran into the - more or less - mirror image of this.

CodeExtractor currently doesn't copy any function attributes into the function it creates (except, for some reason, nounwind).
Copying all attributes, however, doesn't seem safe. For example, a readnone function can have a stack allocation and the pointer can be passed into the extracted function. On the other hand, we definitely want to copy target-cpu/target-features.

I can confirm CodeExtractor doesn't copy target-cpu and target-features. I haven't thought through the solution, but probably we can either fix CodeExtractor to recompute the attributes like readnone or remove them and let FunctionAttrs deduce them later.

Resurrecting the discussion on inliner's function attribute compatibility checking.

I rebased the patch and made the following changes:

Changed the syntax of class "Attr" in Attributes.td. The new syntax allows specifying the c++ function that is called to check the compatibility between the attributes of the caller and callee.
Updated cmake files.
Add comments and renamed functions.

Rebase and add rules to merge caller's and callee's attributes to Attributes.td.

Merge rules enable modifying the caller's attribute set if the caller and callee have incompatible attribute sets, rather than blocking inlining. For example, if the callee has attribute "noimplicitfloat", but the caller doesn't, the merge rule defined in Attributes.td attaches "noimplicitfloat" to the caller.

I've got a few things here that are related and would like to take a look
if you wouldn't mind waiting while I take a look after Duncan's LGTM.

Thanks!

-eric

LGTM.

Thanks and sorry for the delay.

-eric

include/llvm/IR/Attributes.h
69	Can you separate this out into a separate patch? (Feel free to commit it as long as it's a nop change).

Closed by commit rL257575: [Inliner] Merge the attributes of the caller and callee functions (authored by ahatanak). · Explain WhyJan 12 2016, 10:06 PM

This revision was automatically updated to reflect the committed changes.

Hahnfeld mentioned this in D47070: [CUDA] Upgrade linked bitcode to enable inlining.May 19 2018, 1:27 AM

Revision Contents

Path

Size

include/

llvm/

IR/

Attributes.h

64 lines

Attributes.td

187 lines

CMakeLists.txt

5 lines

lib/

Analysis/

InlineCost.cpp

4 lines

IR/

Attributes.cpp

78 lines

AttributesCompatFunc.td

1 line

CMakeLists.txt

4 lines

Makefile

34 lines

Transforms/

IPO/

Inliner.cpp

35 lines

test/

Analysis/

BasicAA/

intrinsics.ll

4 lines

TypeBasedAliasAnalysis/

intrinsics.ll

4 lines

Bitcode/

compatibility-3.6.ll

4 lines

compatibility-3.7.ll

4 lines

compatibility.ll

4 lines

Transforms/

Inline/

attributes.ll

84 lines

inline_invoke.ll

2 lines

MemCpyOpt/

memcpy.ll

2 lines

ObjCARC/

nested.ll

2 lines

utils/

TableGen/

188 lines

1 line

8 lines

1 line

Diff 37320

include/llvm/IR/Attributes.h

Show All 27 Lines
namespace llvm {		namespace llvm {

class AttrBuilder;		class AttrBuilder;
class AttributeImpl;		class AttributeImpl;
class AttributeSetImpl;		class AttributeSetImpl;
class AttributeSetNode;		class AttributeSetNode;
class Constant;		class Constant;
template<typename T> struct DenseMapInfo;		template<typename T> struct DenseMapInfo;
		class Function;
class LLVMContext;		class LLVMContext;
class Type;		class Type;

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
/// \class		/// \class
/// \brief Functions, function parameters, and return types can have attributes		/// \brief Functions, function parameters, and return types can have attributes
/// to indicate how they should be treated by optimizations and code		/// to indicate how they should be treated by optimizations and code
/// generation. This class represents one of those attributes. It's light-weight		/// generation. This class represents one of those attributes. It's light-weight
Show All 15 Lines	public:
/// nounwind = No need for an entry		/// nounwind = No need for an entry
/// uwtable = Needs an entry because the ABI says so and because		/// uwtable = Needs an entry because the ABI says so and because
/// an exception might pass by.		/// an exception might pass by.
/// uwtable + nounwind = Needs an entry because the ABI says so.		/// uwtable + nounwind = Needs an entry because the ABI says so.

enum AttrKind {		enum AttrKind {
// IR-Level Attributes		// IR-Level Attributes
None, ///< No attributes have been set		None, ///< No attributes have been set
Alignment, ///< Alignment of parameter (5 bits)		#define GET_ATTR_ENUM
///< stored as log2 of alignment with +1 bias		#include "llvm/IR/Attributes.inc"
		echristoUnsubmitted Not Done Reply Inline Actions Can you separate this out into a separate patch? (Feel free to commit it as long as it's a nop change). echristo: Can you separate this out into a separate patch? (Feel free to commit it as long as it's a nop…
///< 0 means unaligned (different from align(1))
AlwaysInline, ///< inline=always
Builtin, ///< Callee is recognized as a builtin, despite
///< nobuiltin attribute on its declaration.
ByVal, ///< Pass structure by value
InAlloca, ///< Pass structure in an alloca
Cold, ///< Marks function as being in a cold path.
Convergent, ///< Can only be moved to control-equivalent blocks
InlineHint, ///< Source said inlining was desirable
InReg, ///< Force argument to be passed in register
JumpTable, ///< Build jump-instruction tables and replace refs.
MinSize, ///< Function must be optimized for size first
Naked, ///< Naked function
Nest, ///< Nested function static chain
NoAlias, ///< Considered to not alias after call
NoBuiltin, ///< Callee isn't recognized as a builtin
NoCapture, ///< Function creates no aliases of pointer
NoDuplicate, ///< Call cannot be duplicated
NoImplicitFloat, ///< Disable implicit floating point insts
NoInline, ///< inline=never
NonLazyBind, ///< Function is called early and/or
///< often, so lazy binding isn't worthwhile
NonNull, ///< Pointer is known to be not null
Dereferenceable, ///< Pointer is known to be dereferenceable
DereferenceableOrNull, ///< Pointer is either null or dereferenceable
NoRedZone, ///< Disable redzone
NoReturn, ///< Mark the function as not returning
NoUnwind, ///< Function doesn't unwind stack
OptimizeForSize, ///< opt_size
OptimizeNone, ///< Function must not be optimized.
ReadNone, ///< Function does not access memory
ReadOnly, ///< Function only reads from memory
ArgMemOnly, ///< Funciton can access memory only using pointers
///< based on its arguments.
Returned, ///< Return value is always equal to this argument
ReturnsTwice, ///< Function can return twice
SExt, ///< Sign extended before/after call
StackAlignment, ///< Alignment of stack for function (3 bits)
///< stored as log2 of alignment with +1 bias 0
///< means unaligned (different from
///< alignstack=(1))
StackProtect, ///< Stack protection.
StackProtectReq, ///< Stack protection required.
StackProtectStrong, ///< Strong Stack protection.
SafeStack, ///< Safe Stack protection.
StructRet, ///< Hidden pointer to structure to return
SanitizeAddress, ///< AddressSanitizer is on.
SanitizeThread, ///< ThreadSanitizer is on.
SanitizeMemory, ///< MemorySanitizer is on.
UWTable, ///< Function must be in a unwind table
ZExt, ///< Zero extended before/after call

EndAttrKinds ///< Sentinal value useful for loops		EndAttrKinds ///< Sentinal value useful for loops
};		};

private:		private:
AttributeImpl *pImpl;		AttributeImpl *pImpl;
Attribute(AttributeImpl *A) : pImpl(A) {}		Attribute(AttributeImpl *A) : pImpl(A) {}

public:		public:
▲ Show 20 Lines • Show All 445 Lines • ▼ Show 20 Lines	public:
AttrBuilder &addRawValue(uint64_t Val);		AttrBuilder &addRawValue(uint64_t Val);
};		};

namespace AttributeFuncs {		namespace AttributeFuncs {

/// \brief Which attributes cannot be applied to a type.		/// \brief Which attributes cannot be applied to a type.
AttrBuilder typeIncompatible(Type *Ty);		AttrBuilder typeIncompatible(Type *Ty);

		/// \returns Return true if the two functions have compatible target-independent
		/// attributes for inlining purposes.
		bool areInlineCompatible(const Function &Caller, const Function &Callee);

		/// \brief Merge caller's and callee's attributes.
		void mergeAttributesForInlining(Function &Caller, const Function &Callee);

} // end AttributeFuncs namespace		} // end AttributeFuncs namespace

} // end llvm namespace		} // end llvm namespace

#endif		#endif

include/llvm/IR/Attributes.td

This file was added.

				/// Attribute base class.
				class Attr<string S> {
				// String representation of this attribute in the IR.
				string AttrString = S;
				}

				/// Enum attribute.
				class EnumAttr<string S> : Attr<S>;

				/// StringBool attribute.
				class StrBoolAttr<string S> : Attr<S>;

				/// Target-independent enum attributes.

				/// Alignment of parameter (5 bits) stored as log2 of alignment with +1 bias.
				/// 0 means unaligned (different from align(1)).
				def Alignment : EnumAttr<"align">;

				/// inline=always.
				def AlwaysInline : EnumAttr<"alwaysinline">;

				/// Funciton can access memory only using pointers based on its arguments.
				def ArgMemOnly : EnumAttr<"argmemonly">;

				/// Callee is recognized as a builtin, despite nobuiltin attribute on its
				/// declaration.
				def Builtin : EnumAttr<"builtin">;

				/// Pass structure by value.
				def ByVal : EnumAttr<"byval">;

				/// Marks function as being in a cold path.
				def Cold : EnumAttr<"cold">;

				/// Can only be moved to control-equivalent blocks.
				def Convergent : EnumAttr<"convergent">;

				/// Pointer is known to be dereferenceable.
				def Dereferenceable : EnumAttr<"dereferenceable">;

				/// Pointer is either null or dereferenceable.
				def DereferenceableOrNull : EnumAttr<"dereferenceable_or_null">;

				/// Pass structure in an alloca.
				def InAlloca : EnumAttr<"inalloca">;

				/// Source said inlining was desirable.
				def InlineHint : EnumAttr<"inlinehint">;

				/// Force argument to be passed in register.
				def InReg : EnumAttr<"inreg">;

				/// Build jump-instruction tables and replace refs.
				def JumpTable : EnumAttr<"jumptable">;

				/// Function must be optimized for size first.
				def MinSize : EnumAttr<"minsize">;

				/// Naked function.
				def Naked : EnumAttr<"naked">;

				/// Nested function static chain.
				def Nest : EnumAttr<"nest">;

				/// Considered to not alias after call.
				def NoAlias : EnumAttr<"noalias">;

				/// Callee isn't recognized as a builtin.
				def NoBuiltin : EnumAttr<"nobuiltin">;

				/// Function creates no aliases of pointer.
				def NoCapture : EnumAttr<"nocapture">;

				/// Call cannot be duplicated.
				def NoDuplicate : EnumAttr<"noduplicate">;

				/// Disable implicit floating point insts.
				def NoImplicitFloat : EnumAttr<"noimplicitfloat">;

				/// inline=never.
				def NoInline : EnumAttr<"noinline">;

				/// Function is called early and/or often, so lazy binding isn't worthwhile.
				def NonLazyBind : EnumAttr<"nonlazybind">;

				/// Pointer is known to be not null.
				def NonNull : EnumAttr<"nonnull">;

				/// Disable redzone.
				def NoRedZone : EnumAttr<"noredzone">;

				/// Mark the function as not returning.
				def NoReturn : EnumAttr<"noreturn">;

				/// Function doesn't unwind stack.
				def NoUnwind : EnumAttr<"nounwind">;

				/// opt_size.
				def OptimizeForSize : EnumAttr<"optsize">;

				/// Function must not be optimized.
				def OptimizeNone : EnumAttr<"optnone">;

				/// Function does not access memory.
				def ReadNone : EnumAttr<"readnone">;

				/// Function only reads from memory.
				def ReadOnly : EnumAttr<"readonly">;

				/// Return value is always equal to this argument.
				def Returned : EnumAttr<"returned">;

				/// Function can return twice.
				def ReturnsTwice : EnumAttr<"returns_twice">;

				/// Safe Stack protection.
				def SafeStack : EnumAttr<"safestack">;

				/// Sign extended before/after call.
				def SExt : EnumAttr<"signext">;

				/// Alignment of stack for function (3 bits) stored as log2 of alignment with
				/// +1 bias 0 means unaligned (different from alignstack=(1)).
				def StackAlignment : EnumAttr<"alignstack">;

				/// Stack protection.
				def StackProtect : EnumAttr<"ssp">;

				/// Stack protection required.
				def StackProtectReq : EnumAttr<"sspreq">;

				/// Strong Stack protection.
				def StackProtectStrong : EnumAttr<"sspstrong">;

				/// Hidden pointer to structure to return.
				def StructRet : EnumAttr<"sret">;

				/// AddressSanitizer is on.
				def SanitizeAddress : EnumAttr<"sanitize_address">;

				/// ThreadSanitizer is on.
				def SanitizeThread : EnumAttr<"sanitize_thread">;

				/// MemorySanitizer is on.
				def SanitizeMemory : EnumAttr<"sanitize_memory">;

				/// Function must be in a unwind table.
				def UWTable : EnumAttr<"uwtable">;

				/// Zero extended before/after call.
				def ZExt : EnumAttr<"zeroext">;

				/// Target-independent string attributes.
				def LessPreciseFPMAD : StrBoolAttr<"less-precise-fpmad">;
				def NoInfsFPMath : StrBoolAttr<"no-infs-fp-math">;
				def NoNansFPMath : StrBoolAttr<"no-nans-fp-math">;
				def UnsafeFPMath : StrBoolAttr<"unsafe-fp-math">;

				class CompatRule<string F> {
				// The name of the function called to check the attribute of the caller and
				// callee and decide whether inlining should be allowed. The function's
				// signature must match "bool(const Function&, const Function &)", where the
				// first parameter is the reference to the caller and the second parameter is
				// the reference to the callee. It must return false if the attributes of the
				// caller and callee are incompatible, and true otherwise.
				string CompatFunc = F;
				}

				def : CompatRule<"isEqual<SanitizeAddressAttr>">;
				def : CompatRule<"isEqual<SanitizeThreadAttr>">;
				def : CompatRule<"isEqual<SanitizeMemoryAttr>">;

				class MergeRule<string F> {
				// The name of the function called to merge the attributes of the caller and
				// callee. The function's signature must match
				// "void(Function&, const Function &)", where the first parameter is the
				// reference to the caller and the second parameter is the reference to the
				// callee.
				string MergeFunc = F;
				}

				def : MergeRule<"setAND<LessPreciseFPMADAttr>">;
				def : MergeRule<"setAND<NoInfsFPMathAttr>">;
				def : MergeRule<"setAND<NoNansFPMathAttr>">;
				def : MergeRule<"setAND<UnsafeFPMathAttr>">;
				def : MergeRule<"setOR<NoImplicitFloatAttr>">;
				def : MergeRule<"adjustCallerSSPLevel">;

include/llvm/IR/CMakeLists.txt

	set(LLVM_TARGET_DEFINITIONS Intrinsics.td)			set(LLVM_TARGET_DEFINITIONS Attributes.td)
				tablegen(LLVM Attributes.inc -gen-attr)

				set(LLVM_TARGET_DEFINITIONS Intrinsics.td)
	tablegen(LLVM Intrinsics.gen -gen-intrinsic)			tablegen(LLVM Intrinsics.gen -gen-intrinsic)

	add_public_tablegen_target(intrinsics_gen)			add_public_tablegen_target(intrinsics_gen)

lib/Analysis/InlineCost.cpp

	Show First 20 Lines • Show All 1,356 Lines • ▼ Show 20 Lines
	}			}

	/// \brief Test that there are no attribute conflicts between Caller and Callee			/// \brief Test that there are no attribute conflicts between Caller and Callee
	/// that prevent inlining.			/// that prevent inlining.
	static bool functionsHaveCompatibleAttributes(Function *Caller,			static bool functionsHaveCompatibleAttributes(Function *Caller,
	Function *Callee,			Function *Callee,
	TargetTransformInfo &TTI) {			TargetTransformInfo &TTI) {
	return TTI.areInlineCompatible(Caller, Callee) &&			return TTI.areInlineCompatible(Caller, Callee) &&
	attributeMatches(Caller, Callee, Attribute::SanitizeAddress) &&			AttributeFuncs::areInlineCompatible(Caller, Callee);
	attributeMatches(Caller, Callee, Attribute::SanitizeMemory) &&
	attributeMatches(Caller, Callee, Attribute::SanitizeThread);
	}			}

	InlineCost InlineCostAnalysis::getInlineCost(CallSite CS, Function *Callee,			InlineCost InlineCostAnalysis::getInlineCost(CallSite CS, Function *Callee,
	int Threshold) {			int Threshold) {
	// Cannot inline indirect calls.			// Cannot inline indirect calls.
	if (!Callee)			if (!Callee)
	return llvm::InlineCost::getNever();			return llvm::InlineCost::getNever();

	▲ Show 20 Lines • Show All 76 Lines • Show Last 20 Lines

lib/IR/Attributes.cpp

//===-- Attributes.cpp - Implement AttributesList -------------------------===//		//===-- Attributes.cpp - Implement AttributesList -------------------------===//
//		//
// The LLVM Compiler Infrastructure		// The LLVM Compiler Infrastructure
//		//
// This file is distributed under the University of Illinois Open Source		// This file is distributed under the University of Illinois Open Source
// License. See LICENSE.TXT for details.		// License. See LICENSE.TXT for details.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
// \file		// \file
// \brief This file implements the Attribute, AttributeImpl, AttrBuilder,		// \brief This file implements the Attribute, AttributeImpl, AttrBuilder,
// AttributeSetImpl, and AttributeSet classes.		// AttributeSetImpl, and AttributeSet classes.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "llvm/IR/Attributes.h"		#include "llvm/IR/Attributes.h"
		#include "llvm/IR/Function.h"
#include "AttributeImpl.h"		#include "AttributeImpl.h"
#include "LLVMContextImpl.h"		#include "LLVMContextImpl.h"
#include "llvm/ADT/STLExtras.h"		#include "llvm/ADT/STLExtras.h"
#include "llvm/ADT/StringExtras.h"		#include "llvm/ADT/StringExtras.h"
#include "llvm/IR/Type.h"		#include "llvm/IR/Type.h"
#include "llvm/Support/Atomic.h"		#include "llvm/Support/Atomic.h"
#include "llvm/Support/Debug.h"		#include "llvm/Support/Debug.h"
#include "llvm/Support/ManagedStatic.h"		#include "llvm/Support/ManagedStatic.h"
▲ Show 20 Lines • Show All 1,374 Lines • ▼ Show 20 Lines	Incompatible.addAttribute(Attribute::ByVal)
.addDereferenceableOrNullAttr(1) // the int here is ignored		.addDereferenceableOrNullAttr(1) // the int here is ignored
.addAttribute(Attribute::ReadNone)		.addAttribute(Attribute::ReadNone)
.addAttribute(Attribute::ReadOnly)		.addAttribute(Attribute::ReadOnly)
.addAttribute(Attribute::StructRet)		.addAttribute(Attribute::StructRet)
.addAttribute(Attribute::InAlloca);		.addAttribute(Attribute::InAlloca);

return Incompatible;		return Incompatible;
}		}

		template<typename AttrClass>
		static bool isEqual(const Function &Caller, const Function &Callee) {
		return Caller.getFnAttribute(AttrClass::Kind) ==
		Callee.getFnAttribute(AttrClass::Kind);
		}

		/// \brief Compute the logical AND of the attributes of the caller and the
		/// callee.
		///
		/// This function sets the caller's attribute to false if the callee's attribute
		/// is false.
		template<typename AttrClass>
		static void setAND(Function &Caller, const Function &Callee) {
		if (AttrClass::isSet(Caller, AttrClass::Kind) &&
		!AttrClass::isSet(Callee, AttrClass::Kind))
		AttrClass::set(Caller, AttrClass::Kind, false);
		}

		/// \brief Compute the logical OR of the attributes of the caller and the
		/// callee.
		///
		/// This function sets the caller's attribute to true if the callee's attribute
		/// is true.
		template<typename AttrClass>
		static void setOR(Function &Caller, const Function &Callee) {
		if (!AttrClass::isSet(Caller, AttrClass::Kind) &&
		AttrClass::isSet(Callee, AttrClass::Kind))
		AttrClass::set(Caller, AttrClass::Kind, true);
		}

		/// \brief If the inlined function had a higher stack protection level than the
		/// calling function, then bump up the caller's stack protection level.
		static void adjustCallerSSPLevel(Function &Caller, const Function &Callee) {
		// If upgrading the SSP attribute, clear out the old SSP Attributes first.
		// Having multiple SSP attributes doesn't actually hurt, but it adds useless
		// clutter to the IR.
		AttrBuilder B;
		B.addAttribute(Attribute::StackProtect)
		.addAttribute(Attribute::StackProtectStrong)
		.addAttribute(Attribute::StackProtectReq);
		AttributeSet OldSSPAttr = AttributeSet::get(Caller.getContext(),
		AttributeSet::FunctionIndex,
		B);

		if (Callee.hasFnAttribute(Attribute::SafeStack)) {
		Caller.removeAttributes(AttributeSet::FunctionIndex, OldSSPAttr);
		Caller.addFnAttr(Attribute::SafeStack);
		} else if (Callee.hasFnAttribute(Attribute::StackProtectReq) &&
		!Caller.hasFnAttribute(Attribute::SafeStack)) {
		Caller.removeAttributes(AttributeSet::FunctionIndex, OldSSPAttr);
		Caller.addFnAttr(Attribute::StackProtectReq);
		} else if (Callee.hasFnAttribute(Attribute::StackProtectStrong) &&
		!Caller.hasFnAttribute(Attribute::SafeStack) &&
		!Caller.hasFnAttribute(Attribute::StackProtectReq)) {
		Caller.removeAttributes(AttributeSet::FunctionIndex, OldSSPAttr);
		Caller.addFnAttr(Attribute::StackProtectStrong);
		} else if (Callee.hasFnAttribute(Attribute::StackProtect) &&
		!Caller.hasFnAttribute(Attribute::SafeStack) &&
		!Caller.hasFnAttribute(Attribute::StackProtectReq) &&
		!Caller.hasFnAttribute(Attribute::StackProtectStrong))
		Caller.addFnAttr(Attribute::StackProtect);
		}

		#define GET_ATTR_COMPAT_FUNC
		#include "AttributesCompatFunc.inc"

		bool AttributeFuncs::areInlineCompatible(const Function &Caller,
		const Function &Callee) {
		return hasCompatibleFnAttrs(Caller, Callee);
		}


		void AttributeFuncs::mergeAttributesForInlining(Function &Caller,
		const Function &Callee) {
		mergeFnAttrs(Caller, Callee);
		}

lib/IR/AttributesCompatFunc.td

This file was added.

include "llvm/IR/Attributes.td"

lib/IR/CMakeLists.txt

				set(LLVM_TARGET_DEFINITIONS AttributesCompatFunc.td)
				tablegen(LLVM AttributesCompatFunc.inc -gen-attr)
				add_public_tablegen_target(AttributeCompatFuncTableGen)

	add_llvm_library(LLVMCore			add_llvm_library(LLVMCore
	AsmWriter.cpp			AsmWriter.cpp
	Attributes.cpp			Attributes.cpp
	AutoUpgrade.cpp			AutoUpgrade.cpp
	BasicBlock.cpp			BasicBlock.cpp
	Comdat.cpp			Comdat.cpp
	ConstantFold.cpp			ConstantFold.cpp
	ConstantRange.cpp			ConstantRange.cpp
	▲ Show 20 Lines • Show All 56 Lines • Show Last 20 Lines

lib/IR/Makefile

	##===- lib/IR/Makefile -------------------------------------- Makefile --===##			##===- lib/IR/Makefile -------------------------------------- Makefile --===##
	#			#
	# The LLVM Compiler Infrastructure			# The LLVM Compiler Infrastructure
	#			#
	# This file is distributed under the University of Illinois Open Source			# This file is distributed under the University of Illinois Open Source
	# License. See LICENSE.TXT for details.			# License. See LICENSE.TXT for details.
	#			#
	##===----------------------------------------------------------------------===##			##===----------------------------------------------------------------------===##
	LEVEL = ../..			LEVEL = ../..
	LIBRARYNAME = LLVMCore			LIBRARYNAME = LLVMCore
	BUILD_ARCHIVE = 1			BUILD_ARCHIVE = 1

	BUILT_SOURCES = $(PROJ_OBJ_ROOT)/include/llvm/IR/Intrinsics.gen			BUILT_SOURCES = $(PROJ_OBJ_ROOT)/include/llvm/IR/Intrinsics.gen \
				$(PROJ_OBJ_ROOT)/include/llvm/IR/Attributes.inc \
				$(PROJ_OBJ_ROOT)/lib/IR/AttributesCompatFunc.inc

	include $(LEVEL)/Makefile.common			include $(LEVEL)/Makefile.common

	GENFILE:=$(PROJ_OBJ_ROOT)/include/llvm/IR/Intrinsics.gen			GENFILE:=$(PROJ_OBJ_ROOT)/include/llvm/IR/Intrinsics.gen
				ATTRINCFILE:=$(PROJ_OBJ_ROOT)/include/llvm/IR/Attributes.inc
				ATTRCOMPATFUNCINCFILE:=$(PROJ_OBJ_ROOT)/lib/IR/AttributesCompatFunc.inc

	INTRINSICTD := $(PROJ_SRC_ROOT)/include/llvm/IR/Intrinsics.td			INTRINSICTD := $(PROJ_SRC_ROOT)/include/llvm/IR/Intrinsics.td
	INTRINSICTDS := $(wildcard $(PROJ_SRC_ROOT)/include/llvm/IR/Intrinsics*.td)			INTRINSICTDS := $(wildcard $(PROJ_SRC_ROOT)/include/llvm/IR/Intrinsics*.td)
				ATTRIBUTESTD := $(PROJ_SRC_ROOT)/include/llvm/IR/Attributes.td
				ATTRCOMPATFUNCTD := $(PROJ_SRC_ROOT)/lib/IR/AttributesCompatFunc.td

	$(ObjDir)/Intrinsics.gen.tmp: $(ObjDir)/.dir $(INTRINSICTDS) $(LLVM_TBLGEN)			$(ObjDir)/Intrinsics.gen.tmp: $(ObjDir)/.dir $(INTRINSICTDS) $(LLVM_TBLGEN)
	$(Echo) Building Intrinsics.gen.tmp from Intrinsics.td			$(Echo) Building Intrinsics.gen.tmp from Intrinsics.td
	$(Verb) $(LLVMTableGen) $(call SYSPATH, $(INTRINSICTD)) -o $(call SYSPATH, $@) -gen-intrinsic			$(Verb) $(LLVMTableGen) $(call SYSPATH, $(INTRINSICTD)) -o $(call SYSPATH, $@) -gen-intrinsic

	$(GENFILE): $(ObjDir)/Intrinsics.gen.tmp $(PROJ_OBJ_ROOT)/include/llvm/IR/.dir			$(GENFILE): $(ObjDir)/Intrinsics.gen.tmp $(PROJ_OBJ_ROOT)/include/llvm/IR/.dir
	$(Verb) $(CMP) -s $@ $< \|\| ( $(CP) $< $@ && \			$(Verb) $(CMP) -s $@ $< \|\| ( $(CP) $< $@ && \
	$(EchoCmd) Updated Intrinsics.gen because Intrinsics.gen.tmp \			$(EchoCmd) Updated Intrinsics.gen because Intrinsics.gen.tmp \
	changed significantly. )			changed significantly. )

				$(ObjDir)/Attributes.inc.tmp: $(ObjDir)/.dir $(ATTRIBUTESTD) $(LLVM_TBLGEN)
				$(Echo) Building Attributes.inc.tmp from $(ATTRIBUTESTD)
				$(Verb) $(LLVMTableGen) $(call SYSPATH, $(ATTRIBUTESTD)) -o $(call SYSPATH, $@) -gen-attr

				$(ATTRINCFILE): $(ObjDir)/Attributes.inc.tmp $(PROJ_OBJ_ROOT)/include/llvm/IR/.dir
				$(Verb) $(CMP) -s $@ $< \|\| ( $(CP) $< $@ && \
				$(EchoCmd) Updated Attributes.inc because Attributes.inc.tmp \
				changed significantly. )

				$(ObjDir)/AttributesCompatFunc.inc.tmp: $(ObjDir)/.dir $(ATTRCOMPATFUNCTD) $(LLVM_TBLGEN)
				$(Echo) Building AttributesCompatFunc.inc.tmp from $(ATTRCOMPATFUNCTD)
				$(Verb) $(LLVMTableGen) $(call SYSPATH, $(ATTRCOMPATFUNCTD)) -o $(call SYSPATH, $@) -gen-attr

				$(ATTRCOMPATFUNCINCFILE): $(ObjDir)/AttributesCompatFunc.inc.tmp $(PROJ_OBJ_ROOT)/include/llvm/IR/.dir
				$(Verb) $(CMP) -s $@ $< \|\| ( $(CP) $< $@ && \
				$(EchoCmd) Updated AttributesCompatFunc.inc because AttributesCompatFunc.inc.tmp \
				changed significantly. )

	install-local:: $(GENFILE)			install-local:: $(GENFILE)
	$(Echo) Installing $(DESTDIR)$(PROJ_includedir)/llvm/IR/Intrinsics.gen			$(Echo) Installing $(DESTDIR)$(PROJ_includedir)/llvm/IR/Intrinsics.gen
	$(Verb) $(DataInstall) $(GENFILE) $(DESTDIR)$(PROJ_includedir)/llvm/IR/Intrinsics.gen			$(Verb) $(DataInstall) $(GENFILE) $(DESTDIR)$(PROJ_includedir)/llvm/IR/Intrinsics.gen

				install-local:: $(ATTRINCFILE)
				$(Echo) Installing $(DESTDIR)$(PROJ_includedir)/llvm/IR/Attributes.inc
				$(Verb) $(DataInstall) $(ATTRINCFILE) $(DESTDIR)$(PROJ_includedir)/llvm/IR/Attributes.inc

				install-local:: $(ATTRCOMPATFUNCINCFILE)
				$(Echo) Installing $(DESTDIR)$(PROJ_libdir)/IR/AttributesCompatFunc.inc
				$(Verb) $(DataInstall) $(ATTRCOMPATFUNCINCFILE) $(DESTDIR)$(PROJ_libdir)/IR/AttributesCompatFunc.inc

lib/Transforms/IPO/Inliner.cpp

Show First 20 Lines • Show All 80 Lines • ▼ Show 20 Lines	void Inliner::getAnalysisUsage(AnalysisUsage &AU) const {
AU.addRequired<TargetLibraryInfoWrapperPass>();		AU.addRequired<TargetLibraryInfoWrapperPass>();
CallGraphSCCPass::getAnalysisUsage(AU);		CallGraphSCCPass::getAnalysisUsage(AU);
}		}


typedef DenseMap<ArrayType, std::vector<AllocaInst> >		typedef DenseMap<ArrayType, std::vector<AllocaInst> >
InlinedArrayAllocasTy;		InlinedArrayAllocasTy;

/// \brief If the inlined function had a higher stack protection level than the
/// calling function, then bump up the caller's stack protection level.
static void AdjustCallerSSPLevel(Function Caller, Function Callee) {
// If upgrading the SSP attribute, clear out the old SSP Attributes first.
// Having multiple SSP attributes doesn't actually hurt, but it adds useless
// clutter to the IR.
AttrBuilder B;
B.addAttribute(Attribute::StackProtect)
.addAttribute(Attribute::StackProtectStrong)
.addAttribute(Attribute::StackProtectReq);
AttributeSet OldSSPAttr = AttributeSet::get(Caller->getContext(),
AttributeSet::FunctionIndex,
B);

if (Callee->hasFnAttribute(Attribute::SafeStack)) {
Caller->removeAttributes(AttributeSet::FunctionIndex, OldSSPAttr);
Caller->addFnAttr(Attribute::SafeStack);
} else if (Callee->hasFnAttribute(Attribute::StackProtectReq) &&
!Caller->hasFnAttribute(Attribute::SafeStack)) {
Caller->removeAttributes(AttributeSet::FunctionIndex, OldSSPAttr);
Caller->addFnAttr(Attribute::StackProtectReq);
} else if (Callee->hasFnAttribute(Attribute::StackProtectStrong) &&
!Caller->hasFnAttribute(Attribute::SafeStack) &&
!Caller->hasFnAttribute(Attribute::StackProtectReq)) {
Caller->removeAttributes(AttributeSet::FunctionIndex, OldSSPAttr);
Caller->addFnAttr(Attribute::StackProtectStrong);
} else if (Callee->hasFnAttribute(Attribute::StackProtect) &&
!Caller->hasFnAttribute(Attribute::SafeStack) &&
!Caller->hasFnAttribute(Attribute::StackProtectReq) &&
!Caller->hasFnAttribute(Attribute::StackProtectStrong))
Caller->addFnAttr(Attribute::StackProtect);
}

/// If it is possible to inline the specified call site,		/// If it is possible to inline the specified call site,
/// do so and update the CallGraph for this operation.		/// do so and update the CallGraph for this operation.
///		///
/// This function also does some basic book-keeping to update the IR. The		/// This function also does some basic book-keeping to update the IR. The
/// InlinedArrayAllocas map keeps track of any allocas that are already		/// InlinedArrayAllocas map keeps track of any allocas that are already
/// available from other functions inlined into the caller. If we are able to		/// available from other functions inlined into the caller. If we are able to
/// inline this call site we attempt to reuse already available allocas or add		/// inline this call site we attempt to reuse already available allocas or add
/// any new allocas to the set if not possible.		/// any new allocas to the set if not possible.
Show All 11 Lines	static bool InlineCallIfPossible(Pass &P, CallSite CS, InlineFunctionInfo &IFI,
// work around the limitations of the legacy pass manager.		// work around the limitations of the legacy pass manager.
AAResults AAR(createLegacyPMAAResults(P, *Callee, BAR));		AAResults AAR(createLegacyPMAAResults(P, *Callee, BAR));

// Try to inline the function. Get the list of static allocas that were		// Try to inline the function. Get the list of static allocas that were
// inlined.		// inlined.
if (!InlineFunction(CS, IFI, &AAR, InsertLifetime))		if (!InlineFunction(CS, IFI, &AAR, InsertLifetime))
return false;		return false;

AdjustCallerSSPLevel(Caller, Callee);		AttributeFuncs::mergeAttributesForInlining(Caller, Callee);

// Look at all of the allocas that we inlined through this call site. If we		// Look at all of the allocas that we inlined through this call site. If we
// have already inlined other allocas through other calls into this function,		// have already inlined other allocas through other calls into this function,
// then we know that they have disjoint lifetimes and that we can merge them.		// then we know that they have disjoint lifetimes and that we can merge them.
//		//
// There are many heuristics possible for merging these allocas, and the		// There are many heuristics possible for merging these allocas, and the
// different options have different tradeoffs. One thing that we really		// different options have different tradeoffs. One thing that we really
// don't want to hurt is SRoA: once inlining happens, often allocas are no		// don't want to hurt is SRoA: once inlining happens, often allocas are no
▲ Show 20 Lines • Show All 591 Lines • Show Last 20 Lines

test/Analysis/BasicAA/intrinsics.ll

Show All 32 Lines	entry:
%b = call <8 x i16> @llvm.arm.neon.vld1.v8i16.p0i8(i8* %p, i32 16) nounwind		%b = call <8 x i16> @llvm.arm.neon.vld1.v8i16.p0i8(i8* %p, i32 16) nounwind
%c = add <8 x i16> %a, %b		%c = add <8 x i16> %a, %b
ret <8 x i16> %c		ret <8 x i16> %c
}		}

declare <8 x i16> @llvm.arm.neon.vld1.v8i16.p0i8(i8*, i32) nounwind readonly		declare <8 x i16> @llvm.arm.neon.vld1.v8i16.p0i8(i8*, i32) nounwind readonly
declare void @llvm.arm.neon.vst1.p0i8.v8i16(i8*, <8 x i16>, i32) nounwind		declare void @llvm.arm.neon.vst1.p0i8.v8i16(i8*, <8 x i16>, i32) nounwind

; CHECK: attributes #0 = { nounwind readonly argmemonly }		; CHECK: attributes #0 = { argmemonly nounwind readonly }
; CHECK: attributes #1 = { nounwind argmemonly }		; CHECK: attributes #1 = { argmemonly nounwind }
; CHECK: attributes [[ATTR]] = { nounwind }		; CHECK: attributes [[ATTR]] = { nounwind }

test/Analysis/TypeBasedAliasAnalysis/intrinsics.ll

Show All 16 Lines	entry:
%b = call <8 x i16> @llvm.arm.neon.vld1.v8i16.p0i8(i8* %p, i32 16) nounwind, !tbaa !2		%b = call <8 x i16> @llvm.arm.neon.vld1.v8i16.p0i8(i8* %p, i32 16) nounwind, !tbaa !2
%c = add <8 x i16> %a, %b		%c = add <8 x i16> %a, %b
ret <8 x i16> %c		ret <8 x i16> %c
}		}

declare <8 x i16> @llvm.arm.neon.vld1.v8i16.p0i8(i8*, i32) nounwind readonly		declare <8 x i16> @llvm.arm.neon.vld1.v8i16.p0i8(i8*, i32) nounwind readonly
declare void @llvm.arm.neon.vst1.p0i8.v8i16(i8*, <8 x i16>, i32) nounwind		declare void @llvm.arm.neon.vst1.p0i8.v8i16(i8*, <8 x i16>, i32) nounwind

; CHECK: attributes #0 = { nounwind readonly argmemonly }		; CHECK: attributes #0 = { argmemonly nounwind readonly }
; CHECK: attributes #1 = { nounwind argmemonly }		; CHECK: attributes #1 = { argmemonly nounwind }
; CHECK: attributes [[NUW]] = { nounwind }		; CHECK: attributes [[NUW]] = { nounwind }

!0 = !{!"tbaa root", null}		!0 = !{!"tbaa root", null}
!1 = !{!3, !3, i64 0}		!1 = !{!3, !3, i64 0}
!2 = !{!4, !4, i64 0}		!2 = !{!4, !4, i64 0}
!3 = !{!"A", !0}		!3 = !{!"A", !0}
!4 = !{!"B", !0}		!4 = !{!"B", !0}

test/Bitcode/compatibility-3.6.ll

	Show First 20 Lines • Show All 1,174 Lines • ▼ Show 20 Lines
	; CHECK: attributes #22 = { sanitize_memory }			; CHECK: attributes #22 = { sanitize_memory }
	; CHECK: attributes #23 = { sanitize_thread }			; CHECK: attributes #23 = { sanitize_thread }
	; CHECK: attributes #24 = { ssp }			; CHECK: attributes #24 = { ssp }
	; CHECK: attributes #25 = { sspreq }			; CHECK: attributes #25 = { sspreq }
	; CHECK: attributes #26 = { sspstrong }			; CHECK: attributes #26 = { sspstrong }
	; CHECK: attributes #27 = { uwtable }			; CHECK: attributes #27 = { uwtable }
	; CHECK: attributes #28 = { "cpu"="cortex-a8" }			; CHECK: attributes #28 = { "cpu"="cortex-a8" }
	; CHECK: attributes #29 = { nounwind readnone }			; CHECK: attributes #29 = { nounwind readnone }
	; CHECK: attributes #30 = { nounwind readonly argmemonly }			; CHECK: attributes #30 = { argmemonly nounwind readonly }
	; CHECK: attributes #31 = { nounwind argmemonly }			; CHECK: attributes #31 = { argmemonly nounwind }
	; CHECK: attributes #32 = { nounwind readonly }			; CHECK: attributes #32 = { nounwind readonly }
	; CHECK: attributes #33 = { builtin }			; CHECK: attributes #33 = { builtin }

	;; Metadata			;; Metadata

	; Metadata -- Module flags			; Metadata -- Module flags
	!llvm.module.flags = !{!0, !1, !2, !4, !5, !6}			!llvm.module.flags = !{!0, !1, !2, !4, !5, !6}
	; CHECK: !llvm.module.flags = !{!0, !1, !2, !4, !5, !6}			; CHECK: !llvm.module.flags = !{!0, !1, !2, !4, !5, !6}
	Show All 15 Lines

test/Bitcode/compatibility-3.7.ll

	Show First 20 Lines • Show All 1,237 Lines • ▼ Show 20 Lines
	; CHECK: attributes #25 = { sanitize_thread }			; CHECK: attributes #25 = { sanitize_thread }
	; CHECK: attributes #26 = { ssp }			; CHECK: attributes #26 = { ssp }
	; CHECK: attributes #27 = { sspreq }			; CHECK: attributes #27 = { sspreq }
	; CHECK: attributes #28 = { sspstrong }			; CHECK: attributes #28 = { sspstrong }
	; CHECK: attributes #29 = { "thunk" }			; CHECK: attributes #29 = { "thunk" }
	; CHECK: attributes #30 = { uwtable }			; CHECK: attributes #30 = { uwtable }
	; CHECK: attributes #31 = { "cpu"="cortex-a8" }			; CHECK: attributes #31 = { "cpu"="cortex-a8" }
	; CHECK: attributes #32 = { nounwind readnone }			; CHECK: attributes #32 = { nounwind readnone }
	; CHECK: attributes #33 = { nounwind readonly argmemonly }			; CHECK: attributes #33 = { argmemonly nounwind readonly }
	; CHECK: attributes #34 = { nounwind argmemonly }			; CHECK: attributes #34 = { argmemonly nounwind }
	; CHECK: attributes #35 = { nounwind readonly }			; CHECK: attributes #35 = { nounwind readonly }
	; CHECK: attributes #36 = { builtin }			; CHECK: attributes #36 = { builtin }

	;; Metadata			;; Metadata

	; Metadata -- Module flags			; Metadata -- Module flags
	!llvm.module.flags = !{!0, !1, !2, !4, !5, !6}			!llvm.module.flags = !{!0, !1, !2, !4, !5, !6}
	; CHECK: !llvm.module.flags = !{!0, !1, !2, !4, !5, !6}			; CHECK: !llvm.module.flags = !{!0, !1, !2, !4, !5, !6}
	Show All 25 Lines

test/Bitcode/compatibility.ll

	Show First 20 Lines • Show All 1,492 Lines • ▼ Show 20 Lines
	; CHECK: attributes #25 = { sanitize_thread }			; CHECK: attributes #25 = { sanitize_thread }
	; CHECK: attributes #26 = { ssp }			; CHECK: attributes #26 = { ssp }
	; CHECK: attributes #27 = { sspreq }			; CHECK: attributes #27 = { sspreq }
	; CHECK: attributes #28 = { sspstrong }			; CHECK: attributes #28 = { sspstrong }
	; CHECK: attributes #29 = { "thunk" }			; CHECK: attributes #29 = { "thunk" }
	; CHECK: attributes #30 = { uwtable }			; CHECK: attributes #30 = { uwtable }
	; CHECK: attributes #31 = { "cpu"="cortex-a8" }			; CHECK: attributes #31 = { "cpu"="cortex-a8" }
	; CHECK: attributes #32 = { nounwind readnone }			; CHECK: attributes #32 = { nounwind readnone }
	; CHECK: attributes #33 = { nounwind readonly argmemonly }			; CHECK: attributes #33 = { argmemonly nounwind readonly }
	; CHECK: attributes #34 = { nounwind argmemonly }			; CHECK: attributes #34 = { argmemonly nounwind }
	; CHECK: attributes #35 = { nounwind readonly }			; CHECK: attributes #35 = { nounwind readonly }
	; CHECK: attributes #36 = { builtin }			; CHECK: attributes #36 = { builtin }

	;; Metadata			;; Metadata

	; Metadata -- Module flags			; Metadata -- Module flags
	!llvm.module.flags = !{!0, !1, !2, !4, !5, !6}			!llvm.module.flags = !{!0, !1, !2, !4, !5, !6}
	; CHECK: !llvm.module.flags = !{!0, !1, !2, !4, !5, !6}			; CHECK: !llvm.module.flags = !{!0, !1, !2, !4, !5, !6}
	Show All 25 Lines

test/Transforms/Inline/attributes.ll

	Show First 20 Lines • Show All 154 Lines • ▼ Show 20 Lines

	define i32 @test_target_features1(i32 %i) "target-features"="+sse4.2" {			define i32 @test_target_features1(i32 %i) "target-features"="+sse4.2" {
	%1 = call i32 @test_target_features_callee1(i32 %i)			%1 = call i32 @test_target_features_callee1(i32 %i)
	ret i32 %1			ret i32 %1
	; CHECK-LABEL: @test_target_features1(			; CHECK-LABEL: @test_target_features1(
	; CHECK-NEXT: @test_target_features_callee1			; CHECK-NEXT: @test_target_features_callee1
	; CHECK-NEXT: ret i32			; CHECK-NEXT: ret i32
	}			}

				define i32 @less-precise-fpmad_callee0(i32 %i) "less-precise-fpmad"="false" {
				ret i32 %i
				; CHECK: @less-precise-fpmad_callee0(i32 %i) [[FPMAD_FALSE:#[0-9]+]] {
				; CHECK-NEXT: ret i32
				}

				define i32 @less-precise-fpmad_callee1(i32 %i) "less-precise-fpmad"="true" {
				ret i32 %i
				; CHECK: @less-precise-fpmad_callee1(i32 %i) [[FPMAD_TRUE:#[0-9]+]] {
				; CHECK-NEXT: ret i32
				}

				define i32 @test_less-precise-fpmad0(i32 %i) "less-precise-fpmad"="false" {
				%1 = call i32 @less-precise-fpmad_callee0(i32 %i)
				ret i32 %1
				; CHECK: @test_less-precise-fpmad0(i32 %i) [[FPMAD_FALSE]] {
				; CHECK-NEXT: ret i32
				}

				define i32 @test_less-precise-fpmad1(i32 %i) "less-precise-fpmad"="false" {
				%1 = call i32 @less-precise-fpmad_callee1(i32 %i)
				ret i32 %1
				; CHECK: @test_less-precise-fpmad1(i32 %i) [[FPMAD_FALSE]] {
				; CHECK-NEXT: ret i32
				}

				define i32 @test_less-precise-fpmad2(i32 %i) "less-precise-fpmad"="true" {
				%1 = call i32 @less-precise-fpmad_callee0(i32 %i)
				ret i32 %1
				; CHECK: @test_less-precise-fpmad2(i32 %i) [[FPMAD_FALSE]] {
				; CHECK-NEXT: ret i32
				}

				define i32 @test_less-precise-fpmad3(i32 %i) "less-precise-fpmad"="true" {
				%1 = call i32 @less-precise-fpmad_callee1(i32 %i)
				ret i32 %1
				; CHECK: @test_less-precise-fpmad3(i32 %i) [[FPMAD_TRUE]] {
				; CHECK-NEXT: ret i32
				}

				define i32 @no-implicit-float_callee0(i32 %i) {
				ret i32 %i
				; CHECK: @no-implicit-float_callee0(i32 %i) {
				; CHECK-NEXT: ret i32
				}

				define i32 @no-implicit-float_callee1(i32 %i) noimplicitfloat {
				ret i32 %i
				; CHECK: @no-implicit-float_callee1(i32 %i) [[NOIMPLICITFLOAT:#[0-9]+]] {
				; CHECK-NEXT: ret i32
				}

				define i32 @test_no-implicit-float0(i32 %i) {
				%1 = call i32 @no-implicit-float_callee0(i32 %i)
				ret i32 %1
				; CHECK: @test_no-implicit-float0(i32 %i) {
				; CHECK-NEXT: ret i32
				}

				define i32 @test_no-implicit-float1(i32 %i) {
				%1 = call i32 @no-implicit-float_callee1(i32 %i)
				ret i32 %1
				; CHECK: @test_no-implicit-float1(i32 %i) [[NOIMPLICITFLOAT]] {
				; CHECK-NEXT: ret i32
				}

				define i32 @test_no-implicit-float2(i32 %i) noimplicitfloat {
				%1 = call i32 @no-implicit-float_callee0(i32 %i)
				ret i32 %1
				; CHECK: @test_no-implicit-float2(i32 %i) [[NOIMPLICITFLOAT]] {
				; CHECK-NEXT: ret i32
				}

				define i32 @test_no-implicit-float3(i32 %i) noimplicitfloat {
				%1 = call i32 @no-implicit-float_callee1(i32 %i)
				ret i32 %1
				; CHECK: @test_no-implicit-float3(i32 %i) [[NOIMPLICITFLOAT]] {
				; CHECK-NEXT: ret i32
				}

				; CHECK: attributes [[FPMAD_FALSE]] = { "less-precise-fpmad"="false" }
				; CHECK: attributes [[FPMAD_TRUE]] = { "less-precise-fpmad"="true" }
				; CHECK: attributes [[NOIMPLICITFLOAT]] = { noimplicitfloat }

test/Transforms/Inline/inline_invoke.ll

	Show First 20 Lines • Show All 338 Lines • ▼ Show 20 Lines
	; CHECK: [[FIX]]:			; CHECK: [[FIX]]:
	; CHECK-NEXT: [[T1:%.*]] = phi i32 [ 0, %[[JOIN]] ], [ 1, %lpad ]			; CHECK-NEXT: [[T1:%.*]] = phi i32 [ 0, %[[JOIN]] ], [ 1, %lpad ]
	; CHECK-NEXT: call void @use(i32 [[T1]])			; CHECK-NEXT: call void @use(i32 [[T1]])
	; CHECK-NEXT: call void @_ZSt9terminatev()			; CHECK-NEXT: call void @_ZSt9terminatev()

	; CHECK: attributes [[NUW]] = { nounwind }			; CHECK: attributes [[NUW]] = { nounwind }
	; CHECK: attributes #1 = { nounwind readnone }			; CHECK: attributes #1 = { nounwind readnone }
	; CHECK: attributes #2 = { ssp uwtable }			; CHECK: attributes #2 = { ssp uwtable }
	; CHECK: attributes #3 = { nounwind argmemonly }			; CHECK: attributes #3 = { argmemonly nounwind }
	; CHECK: attributes #4 = { noreturn nounwind }			; CHECK: attributes #4 = { noreturn nounwind }

test/Transforms/MemCpyOpt/memcpy.ll

Show First 20 Lines • Show All 200 Lines • ▼ Show 20 Lines	define void @test10(%opaque* noalias nocapture sret %x, i32 %y) {
store i32 %c, i32* %d		store i32 %c, i32* %d
ret void		ret void
}		}

declare void @f1(%struct.big* nocapture sret)		declare void @f1(%struct.big* nocapture sret)
declare void @f2(%struct.big*)		declare void @f2(%struct.big*)

; CHECK: attributes [[NUW]] = { nounwind }		; CHECK: attributes [[NUW]] = { nounwind }
; CHECK: attributes #1 = { nounwind argmemonly }		; CHECK: attributes #1 = { argmemonly nounwind }
; CHECK: attributes #2 = { nounwind ssp }		; CHECK: attributes #2 = { nounwind ssp }
; CHECK: attributes #3 = { nounwind ssp uwtable }		; CHECK: attributes #3 = { nounwind ssp uwtable }

test/Transforms/ObjCARC/nested.ll

Show First 20 Lines • Show All 814 Lines • ▼ Show 20 Lines	entry:
call void @objc_release(i8* %foo21) nounwind		call void @objc_release(i8* %foo21) nounwind
%strongdestroy25 = load i8, i8* %foo10, align 8		%strongdestroy25 = load i8, i8* %foo10, align 8
call void @objc_release(i8* %strongdestroy25) nounwind, !clang.imprecise_release !0		call void @objc_release(i8* %strongdestroy25) nounwind, !clang.imprecise_release !0
call void @objc_release(i8* %call) nounwind, !clang.imprecise_release !0		call void @objc_release(i8* %call) nounwind, !clang.imprecise_release !0
ret void		ret void
}		}


; CHECK: attributes #0 = { nounwind argmemonly }		; CHECK: attributes #0 = { argmemonly nounwind }
; CHECK: attributes #1 = { nonlazybind }		; CHECK: attributes #1 = { nonlazybind }
; CHECK: attributes [[NUW]] = { nounwind }		; CHECK: attributes [[NUW]] = { nounwind }

utils/TableGen/Attribute.cpp

This file was added.

				//===- Attribute.cpp - Generate attributes --------------------------------===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//

				#include "llvm/Support/SourceMgr.h"
				#include "llvm/Support/MemoryBuffer.h"
				#include "llvm/TableGen/Error.h"
				#include "llvm/TableGen/Record.h"
				#include <algorithm>
				#include <string>
				#include <vector>
				using namespace llvm;

				#define DEBUG_TYPE "attr-enum"

				namespace {

				class Attribute {
				public:
				Attribute(RecordKeeper &R) : Records(R) {}
				void emit(raw_ostream &OS);

				private:
				void emitTargetIndependentEnums(raw_ostream &OS);
				void emitFnAttrCompatCheck(raw_ostream &OS, bool IsStringAttr);

				void printEnumAttrClasses(raw_ostream &OS,
				const std::vector<Record *> &Records);
				void printStrAttrClasses(raw_ostream &OS,
				const std::vector<Record *> &Records);
				void printStrBoolAttrClasses(raw_ostream &OS,
				const std::vector<Record *> &Records);

				RecordKeeper &Records;
				};

				} // End anonymous namespace.

				static bool isStringAttr(const Record &A) {
				return A.isSubClassOf("StrAttr");
				}

				void Attribute::emitTargetIndependentEnums(raw_ostream &OS) {
				OS << "#ifdef GET_ATTR_ENUM\n";
				OS << "#undef GET_ATTR_ENUM\n";

				const std::vector<Record*> &Attrs =
				Records.getAllDerivedDefinitions("EnumAttr");

				for (auto A : Attrs)
				if (!isStringAttr(*A))
				OS << A->getName() << ",\n";

				OS << "#endif\n";
				}

				void Attribute::emitFnAttrCompatCheck(raw_ostream &OS, bool IsStringAttr) {
				OS << "#ifdef GET_ATTR_COMPAT_FUNC\n";
				OS << "#undef GET_ATTR_COMPAT_FUNC\n";

				OS << "struct EnumAttr {\n";
				OS << " static bool isSet(const Function &Fn,\n";
				OS << " Attribute::AttrKind Kind) {\n";
				OS << " return Fn.hasFnAttribute(Kind);\n";
				OS << " }\n\n";
				OS << " static void set(Function &Fn,\n";
				OS << " Attribute::AttrKind Kind, bool Val) {\n";
				OS << " if (Val)\n";
				OS << " Fn.addFnAttr(Kind);\n";
				OS << " else\n";
				OS << " Fn.removeFnAttr(Kind);\n";
				OS << " }\n";
				OS << "};\n\n";

				OS << "struct StrAttr {\n";
				OS << " static bool isSet(const Function &Fn,\n";
				OS << " StringRef Kind) {\n";
				OS << " return Fn.hasFnAttribute(Kind);\n";
				OS << " }\n\n";
				OS << " static void set(Function &Fn,\n";
				OS << " StringRef Kind, bool Val) {\n";
				OS << " if (Val)\n";
				OS << " Fn.addFnAttr(Kind);\n";
				OS << " else {\n";
				OS << " auto &Ctx = Fn.getContext();\n";
				OS << " AttributeSet As;\n";
				OS << " As.addAttribute(Ctx, AttributeSet::FunctionIndex, Kind);\n";
				OS << " Fn.removeAttributes(AttributeSet::FunctionIndex, As);\n";
				OS << " }\n";
				OS << " }\n";
				OS << "};\n\n";

				OS << "struct StrBoolAttr {\n";
				OS << " static bool isSet(const Function &Fn,\n";
				OS << " StringRef Kind) {\n";
				OS << " auto A = Fn.getFnAttribute(Kind);\n";
				OS << " return A.getValueAsString().equals(\"true\");\n";
				OS << " }\n\n";
				OS << " static void set(Function &Fn,\n";
				OS << " StringRef Kind, bool Val) {\n";
				OS << " Fn.addFnAttr(Kind, Val ? \"true\" : \"false\");\n";
				OS << " }\n";
				OS << "};\n\n";

				printEnumAttrClasses(OS ,Records.getAllDerivedDefinitions("EnumAttr"));
				//printStrAttrClasses(OS, Records.getAllDerivedDefinitions("StrAttr"));
				printStrBoolAttrClasses(OS , Records.getAllDerivedDefinitions("StrBoolAttr"));

				OS << "static inline bool hasCompatibleFnAttrs(const Function &Caller,\n"
				<< " const Function &Callee) {\n";
				OS << " bool Ret = true;\n\n";

				const std::vector<Record *> &CompatRules =
				Records.getAllDerivedDefinitions("CompatRule");

				for (auto *Rule : CompatRules) {
				StringRef FuncName = Rule->getValueAsString("CompatFunc");
				OS << " Ret &= " << FuncName << "(Caller, Callee);\n";
				}

				OS << "\n";
				OS << " return Ret;\n";
				OS << "}\n\n";

				const std::vector<Record *> &MergeRules =
				Records.getAllDerivedDefinitions("MergeRule");
				OS << "static inline void mergeFnAttrs(Function &Caller,\n"
				<< " const Function &Callee) {\n";

				for (auto *Rule : MergeRules) {
				StringRef FuncName = Rule->getValueAsString("MergeFunc");
				OS << " " << FuncName << "(Caller, Callee);\n";
				}

				OS << "}\n\n";

				OS << "#endif\n";
				}

				void Attribute::printEnumAttrClasses(raw_ostream &OS,
				const std::vector<Record *> &Records) {
				OS << "// EnumAttr classes\n";
				for (const auto *R : Records) {
				OS << "struct " << R->getName() << "Attr : EnumAttr {\n";
				OS << " constexpr static const enum Attribute::AttrKind Kind = ";
				OS << "Attribute::" << R->getName() << ";\n";
				OS << "};\n";
				}
				OS << "\n";
				}

				void Attribute::printStrAttrClasses(raw_ostream &OS,
				const std::vector<Record *> &Records) {
				OS << "// StrAttr classes\n";
				for (const auto *R : Records)
				OS << "// class " << R->getName() << "\n";
				OS << "\n";
				}

				void Attribute::printStrBoolAttrClasses(raw_ostream &OS,
				const std::vector<Record *> &Records) {
				OS << "// StrBoolAttr classes\n";
				for (const auto *R : Records) {
				OS << "struct " << R->getName() << "Attr : StrBoolAttr {\n";
				OS << " constexpr static const char * const Kind = \"";
				OS << R->getValueAsString("AttrString") << "\";\n";
				OS << "};\n";
				}
				OS << "\n";
				}

				void Attribute::emit(raw_ostream &OS) {
				emitTargetIndependentEnums(OS);
				emitFnAttrCompatCheck(OS, false);
				}

				namespace llvm {

				void EmitAttribute(RecordKeeper &RK, raw_ostream &OS) {
				Attribute(RK).emit(OS);
				}

				} // End llvm namespace.

utils/TableGen/CMakeLists.txt

	set(LLVM_LINK_COMPONENTS Support)			set(LLVM_LINK_COMPONENTS Support)

	add_tablegen(llvm-tblgen LLVM			add_tablegen(llvm-tblgen LLVM
	AsmMatcherEmitter.cpp			AsmMatcherEmitter.cpp
	AsmWriterEmitter.cpp			AsmWriterEmitter.cpp
	AsmWriterInst.cpp			AsmWriterInst.cpp
				Attribute.cpp
	CallingConvEmitter.cpp			CallingConvEmitter.cpp
	CodeEmitterGen.cpp			CodeEmitterGen.cpp
	CodeGenDAGPatterns.cpp			CodeGenDAGPatterns.cpp
	CodeGenInstruction.cpp			CodeGenInstruction.cpp
	CodeGenMapTable.cpp			CodeGenMapTable.cpp
	CodeGenRegisters.cpp			CodeGenRegisters.cpp
	CodeGenSchedule.cpp			CodeGenSchedule.cpp
	CodeGenTarget.cpp			CodeGenTarget.cpp
	Show All 21 Lines

utils/TableGen/TableGen.cpp

Show All 35 Lines	enum ActionType {
GenDFAPacketizer,		GenDFAPacketizer,
GenFastISel,		GenFastISel,
GenSubtarget,		GenSubtarget,
GenIntrinsic,		GenIntrinsic,
GenTgtIntrinsic,		GenTgtIntrinsic,
PrintEnums,		PrintEnums,
PrintSets,		PrintSets,
GenOptParserDefs,		GenOptParserDefs,
GenCTags		GenCTags,
		GenAttribute
};		};

namespace {		namespace {
cl::opt<ActionType>		cl::opt<ActionType>
Action(cl::desc("Action to perform:"),		Action(cl::desc("Action to perform:"),
cl::values(clEnumValN(PrintRecords, "print-records",		cl::values(clEnumValN(PrintRecords, "print-records",
"Print all records to stdout (default)"),		"Print all records to stdout (default)"),
clEnumValN(GenEmitter, "gen-emitter",		clEnumValN(GenEmitter, "gen-emitter",
Show All 27 Lines	Action(cl::desc("Action to perform:"),
clEnumValN(PrintEnums, "print-enums",		clEnumValN(PrintEnums, "print-enums",
"Print enum values for a class"),		"Print enum values for a class"),
clEnumValN(PrintSets, "print-sets",		clEnumValN(PrintSets, "print-sets",
"Print expanded sets for testing DAG exprs"),		"Print expanded sets for testing DAG exprs"),
clEnumValN(GenOptParserDefs, "gen-opt-parser-defs",		clEnumValN(GenOptParserDefs, "gen-opt-parser-defs",
"Generate option definitions"),		"Generate option definitions"),
clEnumValN(GenCTags, "gen-ctags",		clEnumValN(GenCTags, "gen-ctags",
"Generate ctags-compatible index"),		"Generate ctags-compatible index"),
		clEnumValN(GenAttribute, "gen-attr",
		"Generate attribute"),
clEnumValEnd));		clEnumValEnd));

cl::opt<std::string>		cl::opt<std::string>
Class("class", cl::desc("Print Enum list for this class"),		Class("class", cl::desc("Print Enum list for this class"),
cl::value_desc("class name"));		cl::value_desc("class name"));

bool LLVMTableGenMain(raw_ostream &OS, RecordKeeper &Records) {		bool LLVMTableGenMain(raw_ostream &OS, RecordKeeper &Records) {
switch (Action) {		switch (Action) {
▲ Show 20 Lines • Show All 64 Lines • ▼ Show 20 Lines	for (Record *Rec : Records.getAllDerivedDefinitions("Set")) {
OS << ' ' << Elt->getName();		OS << ' ' << Elt->getName();
OS << " ]\n";		OS << " ]\n";
}		}
break;		break;
}		}
case GenCTags:		case GenCTags:
EmitCTags(Records, OS);		EmitCTags(Records, OS);
break;		break;
		case GenAttribute:
		EmitAttribute(Records, OS);
		break;
}		}

return false;		return false;
}		}
}		}

int main(int argc, char **argv) {		int main(int argc, char **argv) {
sys::PrintStackTraceOnErrorSignal();		sys::PrintStackTraceOnErrorSignal();
Show All 14 Lines

utils/TableGen/TableGenBackends.h

	Show First 20 Lines • Show All 72 Lines • ▼ Show 20 Lines
	void EmitFastISel(RecordKeeper &RK, raw_ostream &OS);			void EmitFastISel(RecordKeeper &RK, raw_ostream &OS);
	void EmitInstrInfo(RecordKeeper &RK, raw_ostream &OS);			void EmitInstrInfo(RecordKeeper &RK, raw_ostream &OS);
	void EmitPseudoLowering(RecordKeeper &RK, raw_ostream &OS);			void EmitPseudoLowering(RecordKeeper &RK, raw_ostream &OS);
	void EmitRegisterInfo(RecordKeeper &RK, raw_ostream &OS);			void EmitRegisterInfo(RecordKeeper &RK, raw_ostream &OS);
	void EmitSubtarget(RecordKeeper &RK, raw_ostream &OS);			void EmitSubtarget(RecordKeeper &RK, raw_ostream &OS);
	void EmitMapTable(RecordKeeper &RK, raw_ostream &OS);			void EmitMapTable(RecordKeeper &RK, raw_ostream &OS);
	void EmitOptParser(RecordKeeper &RK, raw_ostream &OS);			void EmitOptParser(RecordKeeper &RK, raw_ostream &OS);
	void EmitCTags(RecordKeeper &RK, raw_ostream &OS);			void EmitCTags(RecordKeeper &RK, raw_ostream &OS);
				void EmitAttribute(RecordKeeper &RK, raw_ostream &OS);

	} // End llvm namespace			} // End llvm namespace

	#endif			#endif

This is an archive of the discontinued LLVM Phabricator instance.

[Inliner] Use whitelist instead of blacklist when checking function attribute compatibility and make the check stricterClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 37320

include/llvm/IR/Attributes.h

include/llvm/IR/Attributes.td

include/llvm/IR/CMakeLists.txt

lib/Analysis/InlineCost.cpp

lib/IR/Attributes.cpp

lib/IR/AttributesCompatFunc.td

lib/IR/CMakeLists.txt

lib/IR/Makefile

lib/Transforms/IPO/Inliner.cpp

test/Analysis/BasicAA/intrinsics.ll

test/Analysis/TypeBasedAliasAnalysis/intrinsics.ll

test/Bitcode/compatibility-3.6.ll

test/Bitcode/compatibility-3.7.ll

test/Bitcode/compatibility.ll

test/Transforms/Inline/attributes.ll

test/Transforms/Inline/inline_invoke.ll

test/Transforms/MemCpyOpt/memcpy.ll

test/Transforms/ObjCARC/nested.ll

utils/TableGen/Attribute.cpp

utils/TableGen/CMakeLists.txt

utils/TableGen/TableGen.cpp

utils/TableGen/TableGenBackends.h

[Inliner] Use whitelist instead of blacklist when checking function attribute compatibility and make the check stricter
ClosedPublic