This is an archive of the discontinued LLVM Phabricator instance.

[NFC][mlgo] Generalize model runner interface
ClosedPublic

Authored by mtrofin on Dec 7 2021, 4:29 PM.

Download Raw Diff

Details

Reviewers

yundiqian
davidxl

Commits

rG059e03476cbb: [NFC][mlgo] Generalize model runner interface

Summary

This prepares it for the regalloc work. Part of it is making model
evaluation accross 'development' and 'release' scenarios more reusable.
This patch:

extends support to tensors of any shape (not just scalars, like we had

in the inliner -Oz case). While the tensor shape can be anything, we
assume row-major layout and expose the tensor as a buffer.

exposes the NoInferenceModelRunner, which we use in the 'development'

mode to keep the evaluation code path consistent and simplify logging,
as we'll want to reuse it in the regalloc case.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

mtrofin created this revision.Dec 7 2021, 4:29 PM

Herald added subscribers: hiraditya, mgorny. · View Herald TranscriptDec 7 2021, 4:29 PM

mtrofin requested review of this revision.Dec 7 2021, 4:29 PM

Herald added a project: Restricted Project. · View Herald TranscriptDec 7 2021, 4:29 PM

Herald added a subscriber: llvm-commits. · View Herald Transcript

Harbormaster completed remote builds in B138038: Diff 392593.Dec 7 2021, 5:05 PM

yundiqian accepted this revision.Dec 8 2021, 4:07 PM

yundiqian added inline comments.

llvm/include/llvm/Analysis/MLModelRunner.h
41–45	do they need to be public?
llvm/lib/Analysis/DevelopmentModeInlineAdvisor.cpp
269	will part of it get re-factored out of inline to common lib?

This revision is now accepted and ready to land.Dec 8 2021, 4:07 PM

mtrofin added inline comments.Dec 8 2021, 6:15 PM

llvm/include/llvm/Analysis/MLModelRunner.h
41–45	I think I may need them in a subsequent patch, but I can move them to public then.
llvm/lib/Analysis/DevelopmentModeInlineAdvisor.cpp
269	I'd like to, yes. Same with the log population from the evaluator - it's pretty generic.

moved 2 APIs to protected

This revision was landed with ongoing or failed builds.Dec 8 2021, 8:11 PM

Closed by commit rG059e03476cbb: [NFC][mlgo] Generalize model runner interface (authored by mtrofin). · Explain Why

This revision was automatically updated to reflect the committed changes.

mtrofin added a commit: rG059e03476cbb: [NFC][mlgo] Generalize model runner interface.

Harbormaster completed remote builds in B138350: Diff 393017.Dec 8 2021, 11:36 PM

Revision Contents

Path

Size

llvm/

include/

llvm/

Analysis/

MLModelRunner.h

22 lines

NoInferenceModelRunner.h

39 lines

Utils/

TFUtils.h

4 lines

lib/

Analysis/

CMakeLists.txt

1 line

DevelopmentModeInlineAdvisor.cpp

79 lines

MLInlineAdvisor.cpp

47 lines

NoInferenceModelRunner.cpp

33 lines

ReleaseModeModelRunner.cpp

22 lines

unittests/

Analysis/

CMakeLists.txt

5 lines

MLModelRunnerTest.cpp

33 lines

Diff 393018

llvm/include/llvm/Analysis/MLModelRunner.h

	//===- MLModelRunner.h ---- ML model runner interface ------------ C++ --===//			//===- MLModelRunner.h ---- ML model runner interface ------------ C++ --===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//

	#ifndef LLVM_ANALYSIS_MLMODELRUNNER_H			#ifndef LLVM_ANALYSIS_MLMODELRUNNER_H
	#define LLVM_ANALYSIS_MLMODELRUNNER_H			#define LLVM_ANALYSIS_MLMODELRUNNER_H

	#include "llvm/Analysis/InlineModelFeatureMaps.h"
	#include "llvm/IR/LLVMContext.h"			#include "llvm/IR/LLVMContext.h"
	#include "llvm/IR/PassManager.h"			#include "llvm/IR/PassManager.h"

	namespace llvm {			namespace llvm {

	/// MLModelRunner interface: abstraction of a mechanism for evaluating a			/// MLModelRunner interface: abstraction of a mechanism for evaluating a
	/// tensorflow "saved model".			/// tensorflow "saved model".
	class MLModelRunner {			class MLModelRunner {
	public:			public:
	// Disallows copy and assign.			// Disallows copy and assign.
	MLModelRunner(const MLModelRunner &) = delete;			MLModelRunner(const MLModelRunner &) = delete;
	MLModelRunner &operator=(const MLModelRunner &) = delete;			MLModelRunner &operator=(const MLModelRunner &) = delete;
	virtual ~MLModelRunner() = default;			virtual ~MLModelRunner() = default;

	virtual bool run() = 0;			template <typename T> T evaluate() {
	virtual void setFeature(FeatureIndex Index, int64_t Value) = 0;			return reinterpret_cast<T >(evaluateUntyped());
	virtual int64_t getFeature(int Index) const = 0;			}

				template <typename T, typename I> T *getTensor(I FeatureID) {
				return reinterpret_cast<T *>(
				getTensorUntyped(static_cast<size_t>(FeatureID)));
				}

				template <typename T, typename I> const T *getTensor(I FeatureID) const {
				return reinterpret_cast<const T *>(
				getTensorUntyped(static_cast<size_t>(FeatureID)));
				}

	protected:			protected:
	MLModelRunner(LLVMContext &Ctx) : Ctx(Ctx) {}			MLModelRunner(LLVMContext &Ctx) : Ctx(Ctx) {}
				virtual void *evaluateUntyped() = 0;
				virtual void *getTensorUntyped(size_t Index) = 0;
				const void *getTensorUntyped(size_t Index) const {
				yundiqianUnsubmitted Not Done Reply Inline Actions do they need to be public? yundiqian: do they need to be public?
				mtrofinAuthorUnsubmitted Done Reply Inline Actions I think I may need them in a subsequent patch, but I can move them to public then. mtrofin: I think I may need them in a subsequent patch, but I can move them to public then.
				return (const_cast<MLModelRunner *>(this))->getTensorUntyped(Index);
				}

	LLVMContext &Ctx;			LLVMContext &Ctx;
	};			};
	} // namespace llvm			} // namespace llvm

	#endif // LLVM_ANALYSIS_MLMODELRUNNER_H			#endif // LLVM_ANALYSIS_MLMODELRUNNER_H

llvm/include/llvm/Analysis/NoInferenceModelRunner.h

This file was added.

				//===- NoInferenceModelRunner.h ---- noop ML model runner ------- C++ --===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//

				#ifndef LLVM_ANALYSIS_NOINFERENCEMODELRUNNER_H
				#define LLVM_ANALYSIS_NOINFERENCEMODELRUNNER_H

				#include "llvm/Config/llvm-config.h"

				/// While not strictly necessary to conditionally compile this, it really
				/// has no usecase outside the 'development' mode.
				#ifdef LLVM_HAVE_TF_API
				#include "llvm/Analysis/MLModelRunner.h"
				#include "llvm/Analysis/Utils/TFUtils.h"
				namespace llvm {
				/// A pseudo model runner. We use it to store feature values when collecting
				/// logs for the default policy, in 'development' mode, but never ask it to
				/// 'run'.
				class NoInferenceModelRunner : public MLModelRunner {
				public:
				NoInferenceModelRunner(LLVMContext &Ctx,
				const std::vector<TensorSpec> &Inputs);

				private:
				void *evaluateUntyped() override {
				llvm_unreachable("We shouldn't call run on this model runner.");
				}
				void *getTensorUntyped(size_t Index) override;

				std::vector<std::unique_ptr<char[]>> ValuesBuffer;
				};
				} // namespace llvm
				#endif // defined(LLVM_HAVE_TF_API)
				#endif // defined(LLVM_ANALYSIS_NOINFERENCEMODELRUNNER_H)
				No newline at end of file

llvm/include/llvm/Analysis/Utils/TFUtils.h

Show First 20 Lines • Show All 240 Lines • ▼ Show 20 Lines	public:
template <typename T> T *getInput(size_t Index) {		template <typename T> T *getInput(size_t Index) {
return static_cast<T *>(getUntypedInput(Index));		return static_cast<T *>(getUntypedInput(Index));
}		}

/// Returns true if the tensorflow model was loaded successfully, false		/// Returns true if the tensorflow model was loaded successfully, false
/// otherwise.		/// otherwise.
bool isValid() const { return !!Impl; }		bool isValid() const { return !!Impl; }

private:		/// Untyped access to input.
void *getUntypedInput(size_t Index);		void *getUntypedInput(size_t Index);

		private:
std::unique_ptr<TFModelEvaluatorImpl> Impl;		std::unique_ptr<TFModelEvaluatorImpl> Impl;
};		};

/// List of supported types, as a pair:		/// List of supported types, as a pair:
/// - C++ type		/// - C++ type
/// - enum name (implementation-specific)		/// - enum name (implementation-specific)
#define TFUTILS_SUPPORTED_TYPES(M) \		#define TFUTILS_SUPPORTED_TYPES(M) \
M(float, TF_FLOAT) \		M(float, TF_FLOAT) \
Show All 20 Lines

llvm/lib/Analysis/CMakeLists.txt

Show First 20 Lines • Show All 99 Lines • ▼ Show 20 Lines	add_llvm_component_library(LLVMAnalysis
MemoryBuiltins.cpp		MemoryBuiltins.cpp
MemoryDependenceAnalysis.cpp		MemoryDependenceAnalysis.cpp
MemoryLocation.cpp		MemoryLocation.cpp
MemorySSA.cpp		MemorySSA.cpp
MemorySSAUpdater.cpp		MemorySSAUpdater.cpp
ModuleDebugInfoPrinter.cpp		ModuleDebugInfoPrinter.cpp
ModuleSummaryAnalysis.cpp		ModuleSummaryAnalysis.cpp
MustExecute.cpp		MustExecute.cpp
		NoInferenceModelRunner.cpp
ObjCARCAliasAnalysis.cpp		ObjCARCAliasAnalysis.cpp
ObjCARCAnalysisUtils.cpp		ObjCARCAnalysisUtils.cpp
ObjCARCInstKind.cpp		ObjCARCInstKind.cpp
OptimizationRemarkEmitter.cpp		OptimizationRemarkEmitter.cpp
OverflowInstAnalysis.cpp		OverflowInstAnalysis.cpp
PHITransAddr.cpp		PHITransAddr.cpp
PhiValues.cpp		PhiValues.cpp
PostDominators.cpp		PostDominators.cpp
▲ Show 20 Lines • Show All 46 Lines • Show Last 20 Lines

llvm/lib/Analysis/DevelopmentModeInlineAdvisor.cpp

Show All 10 Lines
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
#include "llvm/Config/config.h"		#include "llvm/Config/config.h"
#if defined(LLVM_HAVE_TF_API)		#if defined(LLVM_HAVE_TF_API)

#include "llvm/Analysis/CallGraph.h"		#include "llvm/Analysis/CallGraph.h"
#include "llvm/Analysis/InlineSizeEstimatorAnalysis.h"		#include "llvm/Analysis/InlineSizeEstimatorAnalysis.h"
#include "llvm/Analysis/MLInlineAdvisor.h"		#include "llvm/Analysis/MLInlineAdvisor.h"
		#include "llvm/Analysis/NoInferenceModelRunner.h"
#include "llvm/Analysis/Utils/TFUtils.h"		#include "llvm/Analysis/Utils/TFUtils.h"
#include "llvm/IR/LLVMContext.h"		#include "llvm/IR/LLVMContext.h"
#include "llvm/Support/CommandLine.h"		#include "llvm/Support/CommandLine.h"
#include "llvm/Support/ManagedStatic.h"		#include "llvm/Support/ManagedStatic.h"

#include <vector>		#include <vector>

using namespace llvm;		using namespace llvm;
▲ Show 20 Lines • Show All 229 Lines • ▼ Show 20 Lines	private:
static const int64_t NoReward = 0;		static const int64_t NoReward = 0;
TrainingLogger &Logger;		TrainingLogger &Logger;
const Optional<size_t> CallerSizeEstimateBefore;		const Optional<size_t> CallerSizeEstimateBefore;
const Optional<size_t> CalleeSizeEstimateBefore;		const Optional<size_t> CalleeSizeEstimateBefore;
const int64_t DefaultDecision;		const int64_t DefaultDecision;
const int64_t Mandatory;		const int64_t Mandatory;
};		};

/// A pseudo model runner. We use it to store feature values when collecting
/// logs for the default policy, but never ask it to 'run'.
class NoInferenceModelRunner : public MLModelRunner {
public:
NoInferenceModelRunner(LLVMContext &Ctx)
: MLModelRunner(Ctx), Features(NumberOfFeatures) {}
void setFeature(FeatureIndex Index, int64_t Value) override {
Features[static_cast<int>(Index)] = Value;
}

int64_t getFeature(int Index) const override { return Features[Index]; }
bool run() override {
llvm_unreachable("We shouldn't call run on this model runner.");
}

private:
InlineFeatures Features;
};

/// ModelUnderTrainingRunner - training mode implementation. It uses TF C APIs		/// ModelUnderTrainingRunner - training mode implementation. It uses TF C APIs
/// to dynamically load and evaluate a TF SavedModel		/// to dynamically load and evaluate a TF SavedModel
/// (https://www.tensorflow.org/guide/saved_model). Runtime performance is		/// (https://www.tensorflow.org/guide/saved_model). Runtime performance is
/// sacrificed for ease of use while training.		/// sacrificed for ease of use while training.
class ModelUnderTrainingRunner final : public MLModelRunner {		class ModelUnderTrainingRunner final : public MLModelRunner {
		yundiqianUnsubmitted Not Done Reply Inline Actions will part of it get re-factored out of inline to common lib? yundiqian: will part of it get re-factored out of inline to common lib?
		mtrofinAuthorUnsubmitted Done Reply Inline Actions I'd like to, yes. Same with the log population from the evaluator - it's pretty generic. mtrofin: I'd like to, yes. Same with the log population from the evaluator - it's pretty generic.
public:		public:
ModelUnderTrainingRunner(LLVMContext &Ctx, const std::string &ModelPath);		ModelUnderTrainingRunner(LLVMContext &Ctx, const std::string &ModelPath);

bool run() override;

// Disallows copy and assign.		// Disallows copy and assign.
ModelUnderTrainingRunner(const ModelUnderTrainingRunner &) = delete;		ModelUnderTrainingRunner(const ModelUnderTrainingRunner &) = delete;
ModelUnderTrainingRunner &		ModelUnderTrainingRunner &
operator=(const ModelUnderTrainingRunner &) = delete;		operator=(const ModelUnderTrainingRunner &) = delete;

void setFeature(FeatureIndex Index, int64_t Value) override;
int64_t getFeature(int Index) const override;
bool isValid() const { return !!Evaluator; }		bool isValid() const { return !!Evaluator; }

const std::vector<LoggedFeatureSpec> &outputLoggedFeatureSpecs() const {		const std::vector<LoggedFeatureSpec> &outputLoggedFeatureSpecs() const {
return OutputSpecs;		return OutputSpecs;
}		}

const Optional<TFModelEvaluator::EvaluationResult> &		const Optional<TFModelEvaluator::EvaluationResult> &
lastEvaluationResult() const {		lastEvaluationResult() const {
return LastEvaluationResult;		return LastEvaluationResult;
}		}

		static const std::vector<TensorSpec> getInputFeatures() {
		std::vector<TensorSpec> InputSpecs;
		for (size_t I = 0; I < NumberOfFeatures; ++I)
		InputSpecs.push_back(TensorSpec::createSpec<int64_t>(
		TFFeedPrefix + FeatureNameMap[I], {1}));
		append_range(InputSpecs, TrainingOnlyFeatures);
		return InputSpecs;
		}

private:		private:
std::unique_ptr<TFModelEvaluator> Evaluator;		std::unique_ptr<TFModelEvaluator> Evaluator;
std::vector<LoggedFeatureSpec> OutputSpecs;		std::vector<LoggedFeatureSpec> OutputSpecs;
Optional<TFModelEvaluator::EvaluationResult> LastEvaluationResult;		Optional<TFModelEvaluator::EvaluationResult> LastEvaluationResult;
		void *evaluateUntyped() override;
		void *getTensorUntyped(size_t Index) override;

// The training framework needs some additional features.		// The training framework needs some additional features.
const std::vector<TensorSpec> TrainingOnlyFeatures{		const static std::vector<TensorSpec> TrainingOnlyFeatures;
		};

		const std::vector<TensorSpec> ModelUnderTrainingRunner::TrainingOnlyFeatures{
TensorSpec::createSpec<int64_t>(TFFeedPrefix + "inlining_default", {1}),		TensorSpec::createSpec<int64_t>(TFFeedPrefix + "inlining_default", {1}),
TensorSpec::createSpec<float>(TFFeedPrefix + "discount", {1}),		TensorSpec::createSpec<float>(TFFeedPrefix + "discount", {1}),
TensorSpec::createSpec<float>(TFFeedPrefix + "reward", {1}),		TensorSpec::createSpec<float>(TFFeedPrefix + "reward", {1}),
TensorSpec::createSpec<int32_t>(TFFeedPrefix + "step_type", {1})};		TensorSpec::createSpec<int32_t>(TFFeedPrefix + "step_type", {1})};
};
} // namespace		} // namespace

TrainingLogger::TrainingLogger(StringRef LogFileName,		TrainingLogger::TrainingLogger(StringRef LogFileName,
const ModelUnderTrainingRunner *MUTR)		const ModelUnderTrainingRunner *MUTR)
: LogFileName(LogFileName), MUTR(MUTR) {		: LogFileName(LogFileName), MUTR(MUTR) {
// The first output is the inlining decision.		// The first output is the inlining decision.
if (MUTR)		if (MUTR)
OutputCount = MUTR->outputLoggedFeatureSpecs().size();		OutputCount = MUTR->outputLoggedFeatureSpecs().size();
Show All 17 Lines	L = std::make_unique<Logger>(
InlineSizeEstimatorAnalysis::isEvaluatorRequested());		InlineSizeEstimatorAnalysis::isEvaluatorRequested());
}		}

/// Log one inlining event.		/// Log one inlining event.
void TrainingLogger::logInlineEvent(const InlineEvent &Event,		void TrainingLogger::logInlineEvent(const InlineEvent &Event,
const MLModelRunner &ModelRunner) {		const MLModelRunner &ModelRunner) {
size_t CurrentFeature = 0;		size_t CurrentFeature = 0;
for (; CurrentFeature < NumberOfFeatures; ++CurrentFeature) {		for (; CurrentFeature < NumberOfFeatures; ++CurrentFeature) {
int64_t F = ModelRunner.getFeature(CurrentFeature);		int64_t F = *ModelRunner.getTensor<int64_t>(CurrentFeature);
L->logInt64Value(CurrentFeature, &F);		L->logInt64Value(CurrentFeature, &F);
}		}

for (size_t I = 1; I < OutputCount; ++I) {		for (size_t I = 1; I < OutputCount; ++I) {
const auto &Result = *MUTR->lastEvaluationResult();		const auto &Result = *MUTR->lastEvaluationResult();
const char *RawData =		const char *RawData =
reinterpret_cast<const char *>(Result.getUntypedTensorValue(I));		reinterpret_cast<const char *>(Result.getUntypedTensorValue(I));
L->logSpecifiedTensorValue(CurrentFeature, RawData);		L->logSpecifiedTensorValue(CurrentFeature, RawData);
▲ Show 20 Lines • Show All 63 Lines • ▼ Show 20 Lines

std::unique_ptr<MLInlineAdvice>		std::unique_ptr<MLInlineAdvice>
DevelopmentModeMLInlineAdvisor::getAdviceFromModel(		DevelopmentModeMLInlineAdvisor::getAdviceFromModel(
CallBase &CB, OptimizationRemarkEmitter &ORE) {		CallBase &CB, OptimizationRemarkEmitter &ORE) {
if (IsDoingInference && !isLogging())		if (IsDoingInference && !isLogging())
return MLInlineAdvisor::getAdviceFromModel(CB, ORE);		return MLInlineAdvisor::getAdviceFromModel(CB, ORE);

bool DefaultAdvice = GetDefaultAdvice(CB);		bool DefaultAdvice = GetDefaultAdvice(CB);
auto Recommendation = IsDoingInference ? ModelRunner->run() : DefaultAdvice;		auto Recommendation =
		IsDoingInference ? static_cast<bool>(ModelRunner->evaluate<int64_t>())
		: DefaultAdvice;
return std::make_unique<LoggingMLInlineAdvice>(		return std::make_unique<LoggingMLInlineAdvice>(
/Advisor=/this,		/Advisor=/this,
/CB=/CB, /ORE=/ORE, /Recommendation=/Recommendation,		/CB=/CB, /ORE=/ORE, /Recommendation=/Recommendation,
/Logger=/*Logger,		/Logger=/*Logger,
/CallerSizeEstimateBefore=/getNativeSizeEstimate(*CB.getCaller()),		/CallerSizeEstimateBefore=/getNativeSizeEstimate(*CB.getCaller()),
/CalleeSizeEstimateBefore=/		/CalleeSizeEstimateBefore=/
getNativeSizeEstimate(*CB.getCalledFunction()),		getNativeSizeEstimate(*CB.getCalledFunction()),
/DefaultDecision=/DefaultAdvice);		/DefaultDecision=/DefaultAdvice);
Show All 11 Lines	for (auto &F : M) {
Ret += *getNativeSizeEstimate(F);		Ret += *getNativeSizeEstimate(F);
}		}
return Ret;		return Ret;
}		}

ModelUnderTrainingRunner::ModelUnderTrainingRunner(LLVMContext &Ctx,		ModelUnderTrainingRunner::ModelUnderTrainingRunner(LLVMContext &Ctx,
const std::string &ModelPath)		const std::string &ModelPath)
: MLModelRunner(Ctx) {		: MLModelRunner(Ctx) {
std::vector<TensorSpec> InputSpecs;		std::vector<TensorSpec> InputSpecs =
for (size_t I = 0; I < NumberOfFeatures; ++I)		ModelUnderTrainingRunner::getInputFeatures();
InputSpecs.push_back(
TensorSpec::createSpec<int64_t>(TFFeedPrefix + FeatureNameMap[I], {1}));
append_range(InputSpecs, TrainingOnlyFeatures);
if (auto MaybeOutSpecs =		if (auto MaybeOutSpecs =
loadOutputSpecs(Ctx, DecisionName, ModelPath, TFOutputSpecOverride))		loadOutputSpecs(Ctx, DecisionName, ModelPath, TFOutputSpecOverride))
OutputSpecs = std::move(*MaybeOutSpecs);		OutputSpecs = std::move(*MaybeOutSpecs);
else		else
return;		return;

Evaluator = std::make_unique<TFModelEvaluator>(		Evaluator = std::make_unique<TFModelEvaluator>(
ModelPath, InputSpecs, [&](size_t I) { return OutputSpecs[I].Spec; },		ModelPath, InputSpecs, [&](size_t I) { return OutputSpecs[I].Spec; },
OutputSpecs.size());		OutputSpecs.size());
if (!Evaluator \|\| !Evaluator->isValid()) {		if (!Evaluator \|\| !Evaluator->isValid()) {
Ctx.emitError("Failed to create inliner saved model evaluator");		Ctx.emitError("Failed to create inliner saved model evaluator");
Evaluator.reset();		Evaluator.reset();
return;		return;
}		}
}		}

bool ModelUnderTrainingRunner::run() {		void *ModelUnderTrainingRunner::evaluateUntyped() {
LastEvaluationResult = Evaluator->evaluate();		LastEvaluationResult = Evaluator->evaluate();
if (!LastEvaluationResult.hasValue()) {		if (!LastEvaluationResult.hasValue()) {
Ctx.emitError("Error evaluating model.");		Ctx.emitError("Error evaluating model.");
return false;		return nullptr;
}
int64_t Decision = *LastEvaluationResult->getTensorValue<int64_t>(0);
return static_cast<bool>(Decision);
}		}
		return LastEvaluationResult->getTensorValue<int64_t>(0);
int64_t ModelUnderTrainingRunner::getFeature(int Index) const {
return *Evaluator->getInput<int64_t>(Index);
}		}

void ModelUnderTrainingRunner::setFeature(FeatureIndex Index, int64_t Value) {		void *ModelUnderTrainingRunner::getTensorUntyped(size_t Index) {
size_t NumericIndex = static_cast<size_t>(Index);		return Evaluator->getUntypedInput(Index);
*(Evaluator->getInput<int64_t>(NumericIndex)) = Value;
}		}

std::unique_ptr<InlineAdvisor> llvm::getDevelopmentModeAdvisor(		std::unique_ptr<InlineAdvisor> llvm::getDevelopmentModeAdvisor(
Module &M, ModuleAnalysisManager &MAM,		Module &M, ModuleAnalysisManager &MAM,
std::function<bool(CallBase &)> GetDefaultAdvice) {		std::function<bool(CallBase &)> GetDefaultAdvice) {
auto &Ctx = M.getContext();		auto &Ctx = M.getContext();
std::unique_ptr<MLModelRunner> Runner;		std::unique_ptr<MLModelRunner> Runner;
ModelUnderTrainingRunner *MUTRPtr = nullptr;		ModelUnderTrainingRunner *MUTRPtr = nullptr;
bool IsDoingInference = false;		bool IsDoingInference = false;
if (TFModelUnderTrainingPath.empty())		if (TFModelUnderTrainingPath.empty())
Runner.reset(new NoInferenceModelRunner(Ctx));		Runner.reset(new NoInferenceModelRunner(
		Ctx, ModelUnderTrainingRunner::getInputFeatures()));
else {		else {
auto MUTR = std::make_unique<ModelUnderTrainingRunner>(		auto MUTR = std::make_unique<ModelUnderTrainingRunner>(
Ctx, TFModelUnderTrainingPath);		Ctx, TFModelUnderTrainingPath);
if (!MUTR \|\| !MUTR->isValid()) {		if (!MUTR \|\| !MUTR->isValid()) {
Ctx.emitError("Could not load the policy model from the provided path");		Ctx.emitError("Could not load the policy model from the provided path");
return nullptr;		return nullptr;
}		}
IsDoingInference = true;		IsDoingInference = true;
Show All 12 Lines

llvm/lib/Analysis/MLInlineAdvisor.cpp

Show First 20 Lines • Show All 239 Lines • ▼ Show 20 Lines	std::unique_ptr<InlineAdvice> MLInlineAdvisor::getAdviceImpl(CallBase &CB) {
auto NrCtantParams = 0;		auto NrCtantParams = 0;
for (auto I = CB.arg_begin(), E = CB.arg_end(); I != E; ++I) {		for (auto I = CB.arg_begin(), E = CB.arg_end(); I != E; ++I) {
NrCtantParams += (isa<Constant>(*I));		NrCtantParams += (isa<Constant>(*I));
}		}

auto &CallerBefore = FAM.getResult<FunctionPropertiesAnalysis>(Caller);		auto &CallerBefore = FAM.getResult<FunctionPropertiesAnalysis>(Caller);
auto &CalleeBefore = FAM.getResult<FunctionPropertiesAnalysis>(Callee);		auto &CalleeBefore = FAM.getResult<FunctionPropertiesAnalysis>(Callee);

ModelRunner->setFeature(FeatureIndex::CalleeBasicBlockCount,		*ModelRunner->getTensor<int64_t>(FeatureIndex::CalleeBasicBlockCount) =
CalleeBefore.BasicBlockCount);		CalleeBefore.BasicBlockCount;
ModelRunner->setFeature(FeatureIndex::CallSiteHeight,		*ModelRunner->getTensor<int64_t>(FeatureIndex::CallSiteHeight) =
FunctionLevels[&Caller]);		FunctionLevels[&Caller];
ModelRunner->setFeature(FeatureIndex::NodeCount, NodeCount);		*ModelRunner->getTensor<int64_t>(FeatureIndex::NodeCount) = NodeCount;
ModelRunner->setFeature(FeatureIndex::NrCtantParams, NrCtantParams);		*ModelRunner->getTensor<int64_t>(FeatureIndex::NrCtantParams) = NrCtantParams;
ModelRunner->setFeature(FeatureIndex::EdgeCount, EdgeCount);		*ModelRunner->getTensor<int64_t>(FeatureIndex::EdgeCount) = EdgeCount;
ModelRunner->setFeature(FeatureIndex::CallerUsers, CallerBefore.Uses);		*ModelRunner->getTensor<int64_t>(FeatureIndex::CallerUsers) =
ModelRunner->setFeature(FeatureIndex::CallerConditionallyExecutedBlocks,		CallerBefore.Uses;
CallerBefore.BlocksReachedFromConditionalInstruction);		*ModelRunner->getTensor<int64_t>(
ModelRunner->setFeature(FeatureIndex::CallerBasicBlockCount,		FeatureIndex::CallerConditionallyExecutedBlocks) =
CallerBefore.BasicBlockCount);		CallerBefore.BlocksReachedFromConditionalInstruction;
ModelRunner->setFeature(FeatureIndex::CalleeConditionallyExecutedBlocks,		*ModelRunner->getTensor<int64_t>(FeatureIndex::CallerBasicBlockCount) =
CalleeBefore.BlocksReachedFromConditionalInstruction);		CallerBefore.BasicBlockCount;
ModelRunner->setFeature(FeatureIndex::CalleeUsers, CalleeBefore.Uses);		*ModelRunner->getTensor<int64_t>(
ModelRunner->setFeature(FeatureIndex::CostEstimate, CostEstimate);		FeatureIndex::CalleeConditionallyExecutedBlocks) =
		CalleeBefore.BlocksReachedFromConditionalInstruction;
		*ModelRunner->getTensor<int64_t>(FeatureIndex::CalleeUsers) =
		CalleeBefore.Uses;
		*ModelRunner->getTensor<int64_t>(FeatureIndex::CostEstimate) = CostEstimate;

// Add the cost features		// Add the cost features
for (size_t I = 0;		for (size_t I = 0;
I < static_cast<size_t>(InlineCostFeatureIndex::NumberOfFeatures); ++I) {		I < static_cast<size_t>(InlineCostFeatureIndex::NumberOfFeatures); ++I) {
ModelRunner->setFeature(		*ModelRunner->getTensor<int64_t>(inlineCostFeatureToMlFeature(
inlineCostFeatureToMlFeature(static_cast<InlineCostFeatureIndex>(I)),		static_cast<InlineCostFeatureIndex>(I))) = CostFeatures->at(I);
CostFeatures->at(I));
}		}

return getAdviceFromModel(CB, ORE);		return getAdviceFromModel(CB, ORE);
}		}

std::unique_ptr<MLInlineAdvice>		std::unique_ptr<MLInlineAdvice>
MLInlineAdvisor::getAdviceFromModel(CallBase &CB,		MLInlineAdvisor::getAdviceFromModel(CallBase &CB,
OptimizationRemarkEmitter &ORE) {		OptimizationRemarkEmitter &ORE) {
return std::make_unique<MLInlineAdvice>(this, CB, ORE, ModelRunner->run());		return std::make_unique<MLInlineAdvice>(
		this, CB, ORE, static_cast<bool>(ModelRunner->evaluate<int64_t>()));
}		}

std::unique_ptr<InlineAdvice> MLInlineAdvisor::getMandatoryAdvice(CallBase &CB,		std::unique_ptr<InlineAdvice> MLInlineAdvisor::getMandatoryAdvice(CallBase &CB,
bool Advice) {		bool Advice) {
// Make sure we track inlinings in all cases - mandatory or not.		// Make sure we track inlinings in all cases - mandatory or not.
if (Advice && !ForceStop)		if (Advice && !ForceStop)
return getMandatoryAdviceImpl(CB);		return getMandatoryAdviceImpl(CB);

Show All 9 Lines	MLInlineAdvisor::getMandatoryAdviceImpl(CallBase &CB) {
return std::make_unique<MLInlineAdvice>(this, CB, getCallerORE(CB), true);		return std::make_unique<MLInlineAdvice>(this, CB, getCallerORE(CB), true);
}		}

void MLInlineAdvice::reportContextForRemark(		void MLInlineAdvice::reportContextForRemark(
DiagnosticInfoOptimizationBase &OR) {		DiagnosticInfoOptimizationBase &OR) {
using namespace ore;		using namespace ore;
OR << NV("Callee", Callee->getName());		OR << NV("Callee", Callee->getName());
for (size_t I = 0; I < NumberOfFeatures; ++I)		for (size_t I = 0; I < NumberOfFeatures; ++I)
OR << NV(FeatureNameMap[I], getAdvisor()->getModelRunner().getFeature(I));		OR << NV(FeatureNameMap[I],
		*getAdvisor()->getModelRunner().getTensor<int64_t>(I));
OR << NV("ShouldInline", isInliningRecommended());		OR << NV("ShouldInline", isInliningRecommended());
}		}

void MLInlineAdvice::recordInliningImpl() {		void MLInlineAdvice::recordInliningImpl() {
ORE.emit([&]() {		ORE.emit([&]() {
OptimizationRemark R(DEBUG_TYPE, "InliningSuccess", DLoc, Block);		OptimizationRemark R(DEBUG_TYPE, "InliningSuccess", DLoc, Block);
reportContextForRemark(R);		reportContextForRemark(R);
return R;		return R;
Show All 31 Lines

llvm/lib/Analysis/NoInferenceModelRunner.cpp

This file was added.

				//===- NoInferenceModelRunner.cpp - noop ML model runner ----------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// A pseudo model runner. We use it to store feature values when collecting
				// logs for the default policy, in 'development' mode, but never ask it to
				// 'run'.
				//===----------------------------------------------------------------------===//
				#include "llvm/Config/config.h"
				#if defined(LLVM_HAVE_TF_API)

				#include "llvm/Analysis/NoInferenceModelRunner.h"
				#include "llvm/Analysis/Utils/TFUtils.h"

				using namespace llvm;

				NoInferenceModelRunner::NoInferenceModelRunner(
				LLVMContext &Ctx, const std::vector<TensorSpec> &Inputs)
				: MLModelRunner(Ctx) {
				ValuesBuffer.reserve(Inputs.size());
				for (const auto &TS : Inputs)
				ValuesBuffer.push_back(std::make_unique<char[]>(TS.getElementCount() *
				TS.getElementByteSize()));
				}

				void *NoInferenceModelRunner::getTensorUntyped(size_t Index) {
				return ValuesBuffer[Index].get();
				}
				#endif // defined(LLVM_HAVE_TF_API)
				No newline at end of file

llvm/lib/Analysis/ReleaseModeModelRunner.cpp

Show All 29 Lines

/// MLModelRunner - production mode implementation. It uses a AOT-compiled		/// MLModelRunner - production mode implementation. It uses a AOT-compiled
/// SavedModel for efficient execution.		/// SavedModel for efficient execution.
class ReleaseModeModelRunner final : public MLModelRunner {		class ReleaseModeModelRunner final : public MLModelRunner {
public:		public:
ReleaseModeModelRunner(LLVMContext &Ctx);		ReleaseModeModelRunner(LLVMContext &Ctx);
virtual ~ReleaseModeModelRunner() = default;		virtual ~ReleaseModeModelRunner() = default;

bool run() override;

void setFeature(FeatureIndex Index, int64_t Value) override;
int64_t getFeature(int Index) const override;

private:		private:
		void *evaluateUntyped() override;
		void *getTensorUntyped(size_t Index) override;

std::vector<int32_t> FeatureIndices;		std::vector<int32_t> FeatureIndices;
int32_t ResultIndex = -1;		int32_t ResultIndex = -1;
std::unique_ptr<llvm::InlinerSizeModel> CompiledModel;		std::unique_ptr<llvm::InlinerSizeModel> CompiledModel;
};		};
} // namespace		} // namespace

ReleaseModeModelRunner::ReleaseModeModelRunner(LLVMContext &Ctx)		ReleaseModeModelRunner::ReleaseModeModelRunner(LLVMContext &Ctx)
: MLModelRunner(Ctx),		: MLModelRunner(Ctx),
Show All 9 Lines	for (size_t I = 0; I < NumberOfFeatures; ++I) {
FeatureIndices[I] = Index;		FeatureIndices[I] = Index;
}		}

ResultIndex =		ResultIndex =
CompiledModel->LookupResultIndex(std::string(FetchPrefix) + DecisionName);		CompiledModel->LookupResultIndex(std::string(FetchPrefix) + DecisionName);
assert(ResultIndex >= 0 && "Cannot find DecisionName in inlining model");		assert(ResultIndex >= 0 && "Cannot find DecisionName in inlining model");
}		}

int64_t ReleaseModeModelRunner::getFeature(int Index) const {		void *ReleaseModeModelRunner::getTensorUntyped(size_t Index) {
return static_cast<int64_t >(		return reinterpret_cast<char *>(
CompiledModel->arg_data(FeatureIndices[Index]));		CompiledModel->arg_data(FeatureIndices[Index]));
}		}

void ReleaseModeModelRunner::setFeature(FeatureIndex Index, int64_t Value) {		void *ReleaseModeModelRunner::evaluateUntyped() {
static_cast<int64_t >(CompiledModel->arg_data(
FeatureIndices[static_cast<size_t>(Index)])) = Value;
}

bool ReleaseModeModelRunner::run() {
CompiledModel->Run();		CompiledModel->Run();
return static_cast<bool>(		return CompiledModel->result_data(ResultIndex);
static_cast<int64_t >(CompiledModel->result_data(ResultIndex)));
}		}

std::unique_ptr<InlineAdvisor>		std::unique_ptr<InlineAdvisor>
llvm::getReleaseModeAdvisor(Module &M, ModuleAnalysisManager &MAM) {		llvm::getReleaseModeAdvisor(Module &M, ModuleAnalysisManager &MAM) {
auto AOTRunner = std::make_unique<ReleaseModeModelRunner>(M.getContext());		auto AOTRunner = std::make_unique<ReleaseModeModelRunner>(M.getContext());
return std::make_unique<MLInlineAdvisor>(M, MAM, std::move(AOTRunner));		return std::make_unique<MLInlineAdvisor>(M, MAM, std::move(AOTRunner));
}		}
#endif // defined(LLVM_HAVE_TF_AOT)		#endif // defined(LLVM_HAVE_TF_AOT)

llvm/unittests/Analysis/CMakeLists.txt

	set(LLVM_LINK_COMPONENTS			set(LLVM_LINK_COMPONENTS
	Analysis			Analysis
	AsmParser			AsmParser
	Core			Core
	Support			Support
	TransformUtils			TransformUtils
	)			)

				set(MLGO_TESTS TFUtilsTest.cpp MLModelRunnerTest.cpp)
	if (DEFINED LLVM_HAVE_TF_API)			if (DEFINED LLVM_HAVE_TF_API)
	LIST(APPEND EXTRA_TESTS TFUtilsTest.cpp)			LIST(APPEND EXTRA_TESTS ${MLGO_TESTS})
	else()			else()
	LIST(APPEND LLVM_OPTIONAL_SOURCES TFUtilsTest.cpp)			LIST(APPEND LLVM_OPTIONAL_SOURCES ${MLGO_TESTS})
	endif()			endif()

	add_llvm_unittest_with_input_files(AnalysisTests			add_llvm_unittest_with_input_files(AnalysisTests
	AliasAnalysisTest.cpp			AliasAnalysisTest.cpp
	AliasSetTrackerTest.cpp			AliasSetTrackerTest.cpp
	AssumeBundleQueriesTest.cpp			AssumeBundleQueriesTest.cpp
	BasicAliasAnalysisTest.cpp			BasicAliasAnalysisTest.cpp
	BlockFrequencyInfoTest.cpp			BlockFrequencyInfoTest.cpp
	Show All 35 Lines

llvm/unittests/Analysis/MLModelRunnerTest.cpp

This file was added.

				//===- MLModelRunnerTest.cpp - test for MLModelRunner ---------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#include "llvm/Analysis/MLModelRunner.h"
				#include "llvm/Analysis/NoInferenceModelRunner.h"
				#include "gtest/gtest.h"

				using namespace llvm;

				TEST(NoInferenceModelRunner, AccessTensors) {
				const std::vector<TensorSpec> Inputs{
				TensorSpec::createSpec<int64_t>("F1", {1}),
				TensorSpec::createSpec<int64_t>("F2", {10}),
				TensorSpec::createSpec<float>("F2", {5}),
				};
				LLVMContext Ctx;
				NoInferenceModelRunner NIMR(Ctx, Inputs);
				NIMR.getTensor<int64_t>(0)[0] = 1;
				std::memcpy(NIMR.getTensor<int64_t>(1),
				std::vector<int64_t>{1, 2, 3, 4, 5, 6, 7, 8, 9, 10}.data(),
				10 * sizeof(int64_t));
				std::memcpy(NIMR.getTensor<float>(2),
				std::vector<float>{0.1, 0.2, 0.3, 0.4, 0.5}.data(),
				5 * sizeof(float));
				ASSERT_EQ(NIMR.getTensor<int64_t>(0)[0], 1);
				ASSERT_EQ(NIMR.getTensor<int64_t>(1)[8], 9);
				ASSERT_EQ(NIMR.getTensor<float>(2)[1], 0.2f);
				}
				No newline at end of file

This is an archive of the discontinued LLVM Phabricator instance.

[NFC][mlgo] Generalize model runner interfaceClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 393018

llvm/include/llvm/Analysis/MLModelRunner.h

llvm/include/llvm/Analysis/NoInferenceModelRunner.h

llvm/include/llvm/Analysis/Utils/TFUtils.h

llvm/lib/Analysis/CMakeLists.txt

llvm/lib/Analysis/DevelopmentModeInlineAdvisor.cpp

llvm/lib/Analysis/MLInlineAdvisor.cpp

llvm/lib/Analysis/NoInferenceModelRunner.cpp

llvm/lib/Analysis/ReleaseModeModelRunner.cpp

llvm/unittests/Analysis/CMakeLists.txt

llvm/unittests/Analysis/MLModelRunnerTest.cpp

[NFC][mlgo] Generalize model runner interface
ClosedPublic