This is an archive of the discontinued LLVM Phabricator instance.

Reduce inlining that had larger binary size impact
Needs Revision · Public

Authored by jpienaar on Mar 9 2023, 8:37 PM.

Details

Summary

On a build with a large number of registered operations, this resulted in
a ~3.4 MB reduction in binary size in release mode.

These changes were found to reduce binary size, but this revision is an RFC
(it contains changes related to debugging and error paths, though not exclusively).
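
For readers skimming the archive, here is a minimal sketch of the kind of change involved, assuming a made-up helper and diagnostic text (nothing below is lifted from the actual diff): the error-reporting code moves behind a noinline helper so it is emitted once instead of being expanded into every caller.

  #include "mlir/IR/Operation.h"
  #include "mlir/Support/LogicalResult.h"
  #include "llvm/Support/Compiler.h" // LLVM_ATTRIBUTE_NOINLINE, LLVM_LIKELY

  // Hypothetical out-of-line error path: building the diagnostic is kept out
  // of the callers so the fast path below stays small when inlined.
  LLVM_ATTRIBUTE_NOINLINE
  static mlir::LogicalResult reportUnregisteredOp(mlir::Operation *op) {
    return op->emitOpError("operation is not registered");
  }

  static mlir::LogicalResult verifyIsRegistered(mlir::Operation *op) {
    if (LLVM_LIKELY(op->isRegistered()))
      return mlir::success();
    // Exceptional path: a plain call, trading a little speed for size.
    return reportUnregisteredOp(op);
  }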

Diff Detail

Event Timeline

jpienaar created this revision. · Mar 9 2023, 8:37 PM
Herald added a project: Restricted Project.
jpienaar requested review of this revision. · Mar 9 2023, 8:37 PM
Herald added projects: Restricted Project, Restricted Project. · Mar 9 2023, 8:37 PM

~3.4 MB reduction in binary size in release mode.

I don't know how to interpret this without the total binary size; what is the percentage?

mehdi_amini requested changes to this revision. · Mar 10 2023, 2:03 AM

In general I am quite concerned about messing with the optimizer this way; this should be extremely exceptional. This'll be ad hoc, forces a performance tradeoff specific to a given use case, and couples the "heuristic" to the exact compiler you're using (what does it do on Windows? On Mac?).
Have you tried building your project with -Os? -Oz? PGO? FullLTO?

mlir/include/mlir/Support/TypeID.h:192

Isn't this a potentially hot routine?
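
For context on this concern, a hedged sketch of the usual way to reconcile size and speed for a hot routine (the class and member names are made up, not the code at line 192): keep the trivial hot check inline and outline only the cold fallback.

  #include "llvm/Support/Compiler.h" // LLVM_ATTRIBUTE_NOINLINE, LLVM_LIKELY

  class Cache { // hypothetical
  public:
    // Hot path: a single pointer comparison, cheap enough to stay inline.
    void *lookup(const void *key) {
      if (LLVM_LIKELY(key == lastKey))
        return lastValue;
      return lookupSlow(key);
    }

  private:
    // Cold path: holds the heavyweight lookup/registration code; noinline
    // keeps it from being duplicated into every caller of lookup().
    // (Defined out of line in the .cpp file.)
    LLVM_ATTRIBUTE_NOINLINE void *lookupSlow(const void *key);

    const void *lastKey = nullptr;
    void *lastValue = nullptr;
  };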

This revision now requires changes to proceed. · Mar 10 2023, 2:03 AM

~3.4 MB reduction in binary size in release mode.

I don't know how to interpret this without the total binary size; what is the percentage?

The total binary size is relevant for a user deciding which part they want to optimize, and the biggest impact there comes from more application-specific removal. But here I'm more interested in the MLIR parts than in this specific binary: this reduction corresponds to ~30% of the total MLIR code size in the original binary (filtering on any file with "mlir" in its name).

In general I am quite concerned about messing with the optimizer this way; this should be extremely exceptional. This'll be ad hoc, forces a performance tradeoff specific to a given use case, and couples the "heuristic" to the exact compiler you're using (what does it do on Windows? On Mac?).
Have you tried building your project with -Os? -Oz? PGO? FullLTO?

The performance tradeoff w.r.t. the debugging and error cases is what I wanted to discuss (these sites were identified programmatically, but I don't think all of them make sense). I agree that if pure size were the goal it would be one thing, but these are intended to be performance-optimized binaries in general. The debugging and error cases don't seem use-case specific: inlining vs. a function call on these paths trades size for speed only on exceptional paths. To me this is similar to how we document expectations around verification in production runs, only here it's in code :-). PGO would be able to change the heuristic either way, wouldn't it? E.g., the default could be to mark the error/debugging cases as cold.
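
To illustrate that last point, a hedged sketch using the standard GCC/Clang attribute spellings and a made-up function (not code from this diff): marking the error path cold gives the inliner and block layout roughly the hint that PGO would otherwise derive from profile data.

  #include <cstdio>

  // Hypothetical error path. `cold` discourages inlining and tells the
  // compiler that calls to this function are unlikely, much as PGO-derived
  // branch weights would.
  [[gnu::cold, gnu::noinline]]
  static void reportNegativeValue(int v) {
    std::fprintf(stderr, "error: unexpected negative value %d\n", v);
  }

  static bool checkValue(int v) {
    if (__builtin_expect(v < 0, 0)) { // explicit hint, in lieu of profiles
      reportNegativeValue(v);
      return false;
    }
    return true;
  }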