This is an archive of the discontinued LLVM Phabricator instance.

Including the models in the LLVM tree is problematic.

I'm not sure there's a formal policy on this, but generally part of being an open-source project is that the source is available in human-readable format. With the exception of a few regression tests for binary parsers, the entire LLVM tree is human-readable. A model clearly doesn't count as human-readable.

If it isn't practical to train the model as part of the LLVM build (because it would take too long), it might make sense to commit binary files. There's some precedent for this in-tree: lowering for shuffles on some targets is based on a precomputed table, built using a utility that isn't run as part of the normal build process. But I would expect reproducible instructions for how to generate the files.

Harbormaster failed remote builds in B59725: Diff 269718! · Jun 9 2020, 7:52 PM

In D81515#2083938, @efriedma wrote:

Including the models in the LLVM tree is problematic.

I'm not sure there's a formal policy on this, but generally part of being an open-source project is that the source is available in human-readable format. With the exception of a few regression tests for binary parsers, the entire LLVM tree is human-readable. A model clearly doesn't count as human-readable.

If it isn't practical to train the model as part of the LLVM build (because it would take too long), it might make sense to commit binary files. There's some precedent for this in-tree: lowering for shuffles on some targets is based on a precomputed table, built using a utility that isn't run as part of the normal build process. But I would expect reproducible instructions for how to generate the files.

Indeed, training as part of the build would be impractical. But that still doesn't mean we need binary files.

I believe there are 2 concerns:

Binary files: I agree with the sentiment about binaries. We want to explore a way to offer the model in a text format, which would require changes to the AOT compiler. We decided to start with what we had, believing that, since this part of the project is a build-time opt-in, it shouldn't cause much hindrance in the interim, until we develop a text format.

How to train a model: there is a high-level description of the means we used to train a model in the RFC and, as outlined there, we intend to open-source a reference training tool. Our plan is to do that in the next step.

If there's some standardized binary format for models, that might be okay? By analogy, there are some PNG files in the documentation; we don't insist people use XPM or something like that. There are some technical reasons to prefer text, though: it would allow someone to identify or diff the contents of the files without specialized tools.

I'm more concerned about adding an opaque matrix of coefficients nobody can reproduce into the codebase. I think before we commit a generated model, the training tool needs to be committed, and someone needs to verify they can independently reproduce the generated model using that tool. I think it's important we set the right precedent here.

In D81515#2084261, @efriedma wrote:

If there's some standardized binary format for models, that might be okay? By analogy, there are some PNG files in the documentation; we don't insist people use XPM or something like that. There are some technical reasons to prefer text, though: it would allow someone to identify or diff the contents of the files without specialized tools.

It's the tensorflow format for models - https://www.tensorflow.org/guide/saved_model

I'm more concerned about adding an opaque matrix of coefficients nobody can reproduce into the codebase. I think before we commit a generated model, the training tool needs to be committed, and someone needs to verify they can independently reproduce the generated model using that tool. I think it's important we set the right precedent here.

We are on the same page - we do plan to release the training tools for developers wishing to produce their own models. It may seem natural to do that step first, but in this case we believe the staging described in the RFC has some merit (we should have described our motivation in the RFC, come to think of it). The main motivation for starting with the LLVM components (both ‘release mode’ and ‘development mode’, which I plan to submit next), and then making the training tools available (in a separate repository), is that having the LLVM components available allows for quicker experimentation by our partner teams, thus allowing us to parallelize work on upstreaming the training components with more ML exploration with those teams.

IIUC, being an experimental feature that is conditionally-compiled in LLVM, this staging wouldn't have any material downside to anyone, while helping us maintain velocity. Importantly, because this is an optionally-built component, there should be no impact on "business as usual" LLVM developers, and, in particular, the build bots testing this feature are pointing to the silent master.

mtrofin mentioned this in D81507: [llvm][NFC] Factor some common data in InlineAdvice. · Jun 10 2020, 4:14 PM

Can you also rebase the patch?

llvm/CMakeLists.txt
965	Add a comment here describing briefly how to download TF packages and set AOT_PATH?
llvm/cmake/modules/TensorFlowCompile.cmake
2	document the function.
20	groupped -- grouped.
llvm/lib/Analysis/InlineAdvisor.cpp
160	add an assert in the #else branch?
llvm/lib/Analysis/ML/Common/MLInlineAdvisor.cpp
62 ↗	(On Diff #269718)	Add a comment here on what it does, e.g., feature extraction.
105 ↗	(On Diff #269718)	add top level comments documenting what it does.
206 ↗	(On Diff #269718)	extract the feature extraction code into a small helper?
llvm/lib/Analysis/ML/InlineModelFeatureMaps.h
18 ↗	(On Diff #269718)	add comment on each feature
38 ↗	(On Diff #269718)	It is hard to keep the index in sync with the names. How about something with a .def table:
	// ml_features.def
	DEFINE_FEATURE(CalleeBasicCount, "callee bb count")
	DEFINE_FEATURE(CallSiteHeight, "callsite_height")
	.....
	For the enum, define:
	#define DEFINE_FEATURE(en, name) en,
	#include "ml_features.def"
	#undef DEFINE_FEATURE

rebased + feedback

Herald added a subscriber: aaron.ballman. · Jun 12 2020, 9:34 AM

Harbormaster failed remote builds in B60132: Diff 270428! · Jun 12 2020, 9:46 AM

Moved everything to Analysis

mtrofin added inline comments. · Jun 15 2020, 5:40 PM

llvm/lib/Analysis/InlineAdvisor.cpp
160	Actually, we don't assert, rather the tryCreate caller checks the return of this function and emits an error if it didn't get an Advisor - this is the current behavior.
llvm/lib/Analysis/ML/Common/MLInlineAdvisor.cpp
206 ↗	(On Diff #269718)	There are a lot of small parameters to pass in that case. Groups of helpers like that will probably be more natural once the features become more involved (multi-dimensional).

Default TENSORFLOW_AOT_PATH should be "", not its description.

Harbormaster failed remote builds in B60404: Diff 270928! · Jun 15 2020, 7:19 PM

Harbormaster failed remote builds in B60408: Diff 270932! · Jun 15 2020, 8:23 PM

davidxl added inline comments. · Jun 17 2020, 12:57 PM

llvm/cmake/modules/TensorFlowCompile.cmake
27	there are lots of references to ${CMAKE_CURRENT_BINARY_DIR}/${fname} here, perhaps use a common variable for it?
llvm/include/llvm/Analysis/InlineModelFeatureMaps.h
58	put these static variables inside the .cpp file to avoid multiple copies if the header is included by different sources.
llvm/include/llvm/Analysis/InlineModelRunner.h
21 ↗	(On Diff #270932)	Is this interface just for the inliner, or more general? Perhaps just name it MLModelRunner?
llvm/lib/Analysis/MLInlineAdvisor.cpp
38	Is this for controlling training time overhead?
73	reverse the condition with continue to reduce nesting level
78	inlinable callee
llvm/lib/Analysis/ReleaseModeModelRunner.cpp
32	MLInferenceRunner?

Feedback

llvm/include/llvm/Analysis/InlineModelRunner.h
21 ↗	(On Diff #270932)	Renamed
llvm/lib/Analysis/MLInlineAdvisor.cpp
38	Not only; it also guards against misbehaving policies. The description was wrong: it's not native size increase, it's IR size.
llvm/lib/Analysis/ReleaseModeModelRunner.cpp
32	What would we call the Development mode one then?

asl added a subscriber: asl. · Jun 17 2020, 2:21 PM

asl added inline comments.

llvm/cmake/modules/TensorFlowCompile.cmake
25	This is missing the target triple. Without it, even on macOS it will generate a Linux object file.

Harbormaster failed remote builds in B60702: Diff 271474! · Jun 17 2020, 3:07 PM

target triple

mtrofin marked 2 inline comments as done. · Jun 17 2020, 4:58 PM

mtrofin added inline comments.

llvm/cmake/modules/TensorFlowCompile.cmake
25	Thanks! Fixed.

Harbormaster failed remote builds in B60738: Diff 271530! · Jun 17 2020, 6:22 PM

asl added inline comments. · Jun 18 2020, 4:34 AM

llvm/cmake/modules/TensorFlowCompile.cmake
25	Host triple should be used here, no?

correct triple

mtrofin marked 2 inline comments as done. · Jun 18 2020, 6:51 AM

mtrofin added inline comments.

llvm/cmake/modules/TensorFlowCompile.cmake
25	Done - thanks!

Harbormaster failed remote builds in B60823: Diff 271703! · Jun 18 2020, 8:08 AM

fix some formatting

Harbormaster completed remote builds in B60867: Diff 271788. · Jun 18 2020, 12:36 PM

davidxl added inline comments. · Jun 19 2020, 10:33 AM

llvm/test/Transforms/Inline/ML/bounds-checks.ll
38	can you explain about the expected output?
llvm/test/Transforms/Inline/ML/ml-test-release-mode.ll
2	Why can't the default inliner handle this case (the adder call can be folded)?

mtrofin marked 4 inline comments as done. · Jun 19 2020, 12:06 PM

mtrofin added inline comments.

llvm/test/Transforms/Inline/ML/bounds-checks.ll
38	Added more detail
llvm/test/Transforms/Inline/ML/ml-test-release-mode.ll
2	Cost evaluation - added explanation.

more details in test

Harbormaster failed remote builds in B61084: Diff 272156! · Jun 19 2020, 1:37 PM

clang-tidy

davidxl added inline comments. · Jun 22 2020, 11:01 AM

llvm/test/Transforms/Inline/ML/bounds-checks.ll
38	OK. I do hope the increase threshold can be learned as well in the future, making this option unnecessary.

LGTM. Wait a few days to see if other reviewers have more feedback.

This revision is now accepted and ready to land. · Jun 22 2020, 11:01 AM

Harbormaster failed remote builds in B61270: Diff 272493! · Jun 22 2020, 12:22 PM

raghesh added a subscriber: raghesh. · Jun 24 2020, 2:58 AM

mtrofin mentioned this in D77752: [llvm] Machine Learned policy for inlining -Oz. · Jun 24 2020, 8:03 AM

Hi, your git commit contains extra Phabricator tags. You can drop Reviewers: Subscribers: Tags: and the text Summary: from the git commit with the following script:

arcfilter () {
        arc amend
        git log -1 --pretty=%B | awk '/Reviewers:|Subscribers:/{p=1} /Reviewed By:|Differential Revision:/{p=0} !p && !/^Summary:$/ {sub(/^Summary: /,"");print}' | git commit --amend --date=now -F -
}

Reviewed By: is considered important by some people. Please keep the tag. (I have updated my script to use --date=now (setting author date to committer date))

https://reviews.llvm.org/D80978 contains a git pre-push hook to automate this.
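For readers who would rather not parse the awk one-liner, here is a minimal Python sketch of the same line-by-line filtering. The function name filter_commit_message and the sample message are illustrative assumptions, not part of arc or of the script above; the sketch only mirrors the awk rules as written (start skipping at Reviewers:/Subscribers:, stop at Reviewed By:/Differential Revision:, and strip the Summary: label).

```python
import re


def filter_commit_message(message: str) -> str:
    """Mirror the awk rules from arcfilter on a commit message string."""
    kept = []
    skipping = False
    for line in message.splitlines():
        # awk: /Reviewers:|Subscribers:/ {p=1}
        if re.search(r"Reviewers:|Subscribers:", line):
            skipping = True
        # awk: /Reviewed By:|Differential Revision:/ {p=0}
        if re.search(r"Reviewed By:|Differential Revision:", line):
            skipping = False
        # awk: !p && !/^Summary:$/ { sub(/^Summary: /, ""); print }
        if skipping or re.fullmatch(r"Summary:", line):
            continue
        kept.append(re.sub(r"^Summary: ", "", line, count=1))
    return "\n".join(kept)


# Hypothetical arc-style commit message, for illustration only.
sample = "\n".join([
    "[llvm] Title",
    "",
    "Summary: body",
    "",
    "Reviewers: a, b",
    "",
    "Reviewed By: a",
    "",
    "Subscribers: x",
    "",
    "Tags: #llvm",
    "",
    "Differential Revision: https://reviews.llvm.org/D81515",
])
print(filter_commit_message(sample))
```

Note that the Tags: line is dropped only because it falls between Subscribers: and Differential Revision:, which is the order arc normally emits; the filter has no rule for Tags: on its own.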

Closed by commit rGbdceefe95ba6: [llvm] Release-mode ML InlineAdvisor (authored by mtrofin). · Jun 24 2020, 8:37 AM

This revision was automatically updated to reflect the committed changes.

thakis added a subscriber: thakis. · Jun 24 2020, 9:58 AM

thakis added inline comments.

llvm/test/lit.site.cfg.py.in
51	Please use llvm_canonicalize_cmake_booleans for this.

mtrofin marked an inline comment as done. · Jun 24 2020, 11:18 AM

mtrofin added inline comments.

llvm/test/lit.site.cfg.py.in
51	Could you elaborate on how? I see it used in CMakeLists files, but I'm not sure how I'd use it here. Thanks!

mtrofin marked 2 inline comments as done. · Jun 29 2020, 8:46 AM

mtrofin added inline comments.

llvm/test/lit.site.cfg.py.in
51	Being addressed in D82776.

Hi Mircea, could you also tell us which specific tf-nightly and protobuf versions you used to save the two frozen models? Unfortunately, I can't seem to load the models using a number of tf-nightly versions, and I am receiving:

google.protobuf.message.DecodeError: Error parsing message

After further investigation, I noticed this was done using TF's new SavedModel method and Keras: https://tensorflow.google.cn/tutorials/keras/save_and_load?hl=en#save_checkpoints_during_training

Would you provide scripts to load the model and see the layers?

Thanks,

Amir

Hello Amir,

To answer the first question (though I think you figured that out already), the authoritative versions are captured in the bot script, available at https://github.com/google/ml-compiler-opt/blob/master/buildbot/buildbot_init.sh

Re. second question, visualization - this is a question for Yundi, Gaurav, or Eugene (they are the ML experts). I'll venture "tensorboard" as an answer, but I'll make sure they give the authoritative one in a moment.

gjain added a subscriber: gjain. · Oct 21 2020, 3:22 PM

In D81515#2344814, @mtrofin wrote:

In D81515#2344805, @AmirJamez wrote:

Would you provide scripts to load the model and see the layers?

Re. second question, visualization - this is a question for Yundi, Gaurav, or Eugene (they are the ML experts). I'll venture "tensorboard" as an answer, but I'll make sure they give the authoritative one in a moment.

You should be able to use tensorboard but you need to first import the model into tensorboard with https://github.com/tensorflow/tensorflow/blob/master/tensorflow/python/tools/import_pb_to_tensorboard.py. Something like python import_pb_to_tensorboard.py --model_dir=llvm/lib/Analysis/models/inliner/ --log_dir=/tmp/inliner should work. Then you'll be able to run tensorboard on the log_dir.

Here's a hosted visualization from tensorboard for your convenience: https://tensorboard.dev/experiment/C45o0HjZTPGRSqpOrdkbeg/#graphs
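The two steps described above (import the SavedModel into a log directory, then point tensorboard at it) can be sketched as a small helper that only builds the command lines. This is a hedged sketch: the function name tensorboard_commands is hypothetical, the --model_dir and --log_dir flags are taken from the comment above, and --logdir is the usual tensorboard flag. Nothing is executed, so TensorFlow does not need to be installed to read or run it.

```python
import shlex


def tensorboard_commands(model_dir, log_dir,
                         import_script="import_pb_to_tensorboard.py"):
    """Build (but do not run) the import and viewing commands.

    import_script is assumed to be a local copy of the TensorFlow tool
    linked above; adjust the path to wherever it lives on your machine.
    """
    import_cmd = [
        "python", import_script,
        f"--model_dir={model_dir}",
        f"--log_dir={log_dir}",
    ]
    view_cmd = ["tensorboard", f"--logdir={log_dir}"]
    return import_cmd, view_cmd


import_cmd, view_cmd = tensorboard_commands(
    "llvm/lib/Analysis/models/inliner/", "/tmp/inliner")
print(shlex.join(import_cmd))
print(shlex.join(view_cmd))
```

Either list can be passed to subprocess.run once the TensorFlow and tensorboard packages are installed.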

yundiqian added a subscriber: yundiqian. · Oct 21 2020, 5:41 PM

Thanks.

(1) May I ask what the reason was for using tf-nightly rather than a TensorFlow release?
(2) The tf-nightly version mentioned in https://github.com/google/ml-compiler-opt/blob/master/buildbot/buildbot_init.sh#L119 is no longer available in https://pypi.org/project/tf-nightly/#history :)
(3) I can confirm that I was able to generate logs and subsequently visualize the model with tensorboard 2.3.0 and the tensorflow 2.2.0 release instead. Also, while installing packages, I ran into:

tensorboard duplicate plugins for name projector

which turned out to be a common issue for tensorboard when multiple packages are installed, in this case as a result of trying tf-nightly alongside a release. Removing the duplicate tensorboard fixed the issue.

(4) Will you also release training scripts for brewing the ir2native model here: https://github.com/google/ml-compiler-opt ?

Thanks,

Amir

In D81515#2349037, @AmirJamez wrote:

(1) May I ask what the reason was for using tf-nightly rather than a TensorFlow release?

Historic reason - at the time we started upstreaming the work, the necessary changes to the pip package were not in the release package yet.

(2) The tf-nightly version mentioned in https://github.com/google/ml-compiler-opt/blob/master/buildbot/buildbot_init.sh#L119 is no longer available in https://pypi.org/project/tf-nightly/#history :)

Thanks for pointing it out - updated the script; one of the build bots was also having issues for this reason, so it must have been a recent change (or the bots hadn't been rebooted in a while).

(3) I can confirm that I was able to generate logs and subsequently visualize the model with tensorboard 2.3.0 and the tensorflow 2.2.0 release instead. Also, while installing packages, I ran into:
tensorboard duplicate plugins for name projector
which turned out to be a common issue for tensorboard when multiple packages are installed, in this case as a result of trying tf-nightly alongside a release. Removing the duplicate tensorboard fixed the issue.

To confirm, now that we're using the release 2.3.0 tensorflow pip package, this shouldn't be an issue anymore, correct?

(4) Will you also release training scripts for brewing the ir2native model here: https://github.com/google/ml-compiler-opt ?

IR2Native is used for RL training algorithms where we want partial rewards. That's what we initially did, but then we got better characteristics with training algorithms using just final reward (==the .text size in the native object). We abandoned for the short term the partial rewards training. We suspect it will start making sense again when we incorporate more global context than we currently do (currently, the global context is really thin - node/edge counts, uses, and a measure of the initial DAG position). So this is a long way of saying: we should probably yank out IR2Native right now, for code simplicity, but didn't get around to doing it.

In D81515#2357592, @mtrofin wrote:
To confirm, now that we're using the release 2.3.0 tensorflow pip package, this shouldn't be an issue anymore, correct?

Yes. I can confirm this with TF 2.3.0 and Tensorboard 2.3.0; pip3 install tensorflow==2.3 --user did the job.

(4) Will you also release training scripts for brewing ir2native model as well here: https://github.com/google/ml-compiler-opt

IR2Native is used for RL training algorithms where we want partial rewards. That's what we initially did, but then we got better characteristics with training algorithms using just final reward (==the .text size in the native object). We abandoned for the short term the partial rewards training. We suspect it will start making sense again when we incorporate more global context than we currently do (currently, the global context is really thin - node/edge counts, uses, and a measure of the initial DAG position). So this is a long way of saying: we should probably yank out IR2Native right now, for code simplicity, but didn't get around to doing it.

I see. So there are two questions:

(Q1) Could you provide a definition of IR2Native's final/optimal partial rewards? I'd assume it was the final iteration of model weights when the training was stopped; however, what was the stop condition here?

(Q2) To make sense of it, let's consider:
(2-1) Training Phase:

If the models are trained together in the same pipeline: that means you trained these two (IR2Native and RL) together in the same pipeline, so that when you feed IR2Native the training data, the partial rewards are fed into the RL model. If that's the case, it would be tricky, as the partial rewards change each iteration depending on the input data and gradually converge to more accurate values (lower loss), while in the meantime you keep feeding these inaccurate values to the RL model being trained. I guess as long as you had a unified strategy to deal with the loss functions, this method should be tricky.
If IR2Native was trained first: based on your reply, and since you mentioned you fixed the buckets with their final partial rewards, I assume this was the method you used, meaning that you trained IR2Native and stopped the training at a certain iteration, perhaps at a low loss value or by some other criterion. At that point, you used the final buckets of IR2Native to train RL. So, in a way, IR2Native's inference is used to train RL. Is that a correct assumption?

(2-2) Inference Phase:
So at deployment, when an LLVM user passes opt -passes=scc-oz-module-inliner -enable-ml-inliner=release -S, the caller's IR2Native features are collected and one bucket is chosen as the partial reward, which is then fed into RL to decide whether or not to inline a callee?

Thanks,

Amir

In D81515#2362125, @AmirJamez wrote:

I see. So there are two questions:

(Q1) Could you provide a definition for an IR2native final/optimal partial rewards ? I'd assume it was the final iteration of model weights when the training was stopped, however, what was the stop condition here?

IR2Native was trained through supervised learning: we captured features after the last inlining, and also captured the final native size of that function (when asm printing) as the label.

(Q2) To make sense of it, let's consider:
(2-1) Training Phase:

If the models are trained together in the same pipeline: that means you trained these two (IR2Native and RL) together in the same pipeline, so that when you feed IR2Native the training data, the partial rewards are fed into the RL model. If that's the case, it would be tricky, as the partial rewards change each iteration depending on the input data and gradually converge to more accurate values (lower loss), while in the meantime you keep feeding these inaccurate values to the RL model being trained. I guess as long as you had a unified strategy to deal with the loss functions, this method should be tricky.

If IR2Native was trained first: based on your reply, and since you mentioned you fixed the buckets with their final partial rewards, I assume this was the method you used, meaning that you trained IR2Native and stopped the training at a certain iteration, perhaps at a low loss value or by some other criterion. At that point, you used the final buckets of IR2Native to train RL. So, in a way, IR2Native's inference is used to train RL. Is that a correct assumption?

(2-2) Inference Phase:
So at deployment, when an LLVM user passes opt -passes=scc-oz-module-inliner -enable-ml-inliner=release -S, the caller's IR2Native features are collected and one bucket is chosen as the partial reward, which is then fed into RL to decide whether or not to inline a callee?

IR2Native was trained completely separately: at some point, we captured the feature|label tuples from a corpus. Then we did supervised learning on that dataset and obtained the IR2Native model.

After that, we only used the IR2Native model in inference mode any time we wanted to train the inliner model. The IR used for the training sessions was different (same overall codebase, but unrelated points in time). We didn't retrain IR2Native before training the inliner either.
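The "train once on feature|label tuples, then use only for inference" workflow described above can be illustrated with a deliberately tiny stand-in. This is a toy under stated assumptions: a one-feature least-squares line replaces the real IR2Native model (a TensorFlow model over many features), the names fit_size_model and estimate_native_size are hypothetical, and the corpus values are synthetic numbers chosen for illustration, not measurements.

```python
from statistics import mean


def fit_size_model(samples):
    """Fit native_size ~ slope * feature + intercept by least squares.

    samples: list of (feature, native_size) tuples, i.e. the
    feature|label pairs captured once from a corpus.
    """
    xs = [x for x, _ in samples]
    ys = [y for _, y in samples]
    x_bar, y_bar = mean(xs), mean(ys)
    slope = (sum((x - x_bar) * (y - y_bar) for x, y in samples)
             / sum((x - x_bar) ** 2 for x in xs))
    intercept = y_bar - slope * x_bar
    # The returned closure is the "frozen" model: it is never refit.
    return lambda feature: slope * feature + intercept


# Synthetic (feature, native size) pairs standing in for the corpus.
corpus = [(1, 3), (2, 5), (3, 7), (4, 9)]
estimate_native_size = fit_size_model(corpus)

# Later, while training a different model (the inliner policy), the size
# model is only queried, possibly on IR from an unrelated point in time.
print(estimate_native_size(10))
```

The point of the sketch is the separation of phases: the fit happens once, and every later use is a plain function call with no retraining.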

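Revision Contents

Path                                                           Size
llvm/CMakeLists.txt                                            19 lines
llvm/cmake/modules/TensorFlowCompile.cmake                     38 lines
llvm/include/llvm/Analysis/InlineAdvisor.h                     5 lines
llvm/include/llvm/Analysis/InlineModelFeatureMaps.h            70 lines
llvm/include/llvm/Analysis/MLInlineAdvisor.h                   107 lines
llvm/include/llvm/Analysis/MLModelRunner.h                     39 lines
llvm/lib/Analysis/CMakeLists.txt                               16 lines
llvm/lib/Analysis/InlineAdvisor.cpp                            4 lines
llvm/lib/Analysis/MLInlineAdvisor.cpp                          301 lines
llvm/lib/Analysis/ReleaseModeModelRunner.cpp                   87 lines
llvm/lib/Analysis/models/inliner/saved_model.pb
llvm/lib/Analysis/models/inliner/variables/variables.data-00000-of-00002
llvm/lib/Analysis/models/inliner/variables/variables.data-00001-of-00002
llvm/lib/Analysis/models/inliner/variables/variables.index
llvm/test/Bindings/Go/lit.local.cfg                            3 lines
llvm/test/Transforms/Inline/ML/Inputs/test-module.ll           64 lines
llvm/test/Transforms/Inline/ML/bounds-checks.ll                41 lines
llvm/test/Transforms/Inline/ML/ml-test-release-mode.ll         14 lines
llvm/test/Transforms/Inline/inlining-advisor-default.ll        2 lines
llvm/test/lit.cfg.py                                           3 lines
llvm/test/lit.site.cfg.py.in                                   1 line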

Diff 273041

llvm/CMakeLists.txt

	Show First 20 Lines • Show All 956 Lines • ▼ Show 20 Lines
	include(TableGen)			include(TableGen)

	if( MINGW AND NOT "${CMAKE_CXX_COMPILER_ID}" MATCHES "Clang" )			if( MINGW AND NOT "${CMAKE_CXX_COMPILER_ID}" MATCHES "Clang" )
	# People report that -O3 is unreliable on MinGW. The traditional			# People report that -O3 is unreliable on MinGW. The traditional
	# build also uses -O2 for that reason:			# build also uses -O2 for that reason:
	llvm_replace_compiler_option(CMAKE_CXX_FLAGS_RELEASE "-O3" "-O2")			llvm_replace_compiler_option(CMAKE_CXX_FLAGS_RELEASE "-O3" "-O2")
	endif()			endif()

				# For up-to-date instructions for installing the Tensorflow dependency, refer to
				davidxlUnsubmitted Done Reply Inline Actions Add a comment here describing briefly how to download TF packages and set AOT_PATH? davidxl: Add a comment here describing briefly how to download TF packages and set AOT_PATH?
				# the bot setup script: https://github.com/google/ml-compiler-opt/blob/master/buildbot/buildbot_init.sh
				# Specifically, assuming python3 is installed:
				# python3 -m pip install --upgrade pip && python3 -m pip install --user tf_nightly==2.3.0.dev20200528
				# Then set TENSORFLOW_AOT_PATH to the package install - usually it's ~/.local/lib/python3.7/site-packages/tensorflow
				#
				set(TENSORFLOW_AOT_PATH "" CACHE PATH "Path to TensorFlow pip install dir")

				if (NOT TENSORFLOW_AOT_PATH STREQUAL "")
				set(LLVM_HAVE_TF_AOT "ON" CACHE BOOL "Tensorflow AOT available")
				set(TENSORFLOW_AOT_COMPILER
				"${TENSORFLOW_AOT_PATH}/../../../../bin/saved_model_cli"
				CACHE PATH "Path to the Tensorflow AOT compiler")
				add_definitions("-DLLVM_HAVE_TF_AOT")
				include_directories(${TENSORFLOW_AOT_PATH}/include)
				add_subdirectory(${TENSORFLOW_AOT_PATH}/xla_aot_runtime_src
				${CMAKE_ARCHIVE_OUTPUT_DIRECTORY}/tf_runtime)
				endif()

	# Put this before tblgen. Else we have a circular dependence.			# Put this before tblgen. Else we have a circular dependence.
	add_subdirectory(lib/Demangle)			add_subdirectory(lib/Demangle)
	add_subdirectory(lib/Support)			add_subdirectory(lib/Support)
	add_subdirectory(lib/TableGen)			add_subdirectory(lib/TableGen)

	add_subdirectory(utils/TableGen)			add_subdirectory(utils/TableGen)

	add_subdirectory(include/llvm)			add_subdirectory(include/llvm)
	[203 lines not shown]

llvm/cmake/modules/TensorFlowCompile.cmake

This file was added.

				# Run the tensorflow compiler (saved_model_cli) on the saved model in the
				# ${model} directory, looking for the ${tag_set} tag set, and the SignatureDef
				davidxl: document the function.
				# ${signature_def_key}.
				# Produce a pair of files called ${fname}.h and ${fname}.o in the
				# ${CMAKE_CURRENT_BINARY_DIR}. The generated header will define a C++ class
				# called ${cpp_class} - which may be a namespace-qualified class name.
				function(tfcompile model tag_set signature_def_key fname cpp_class)
				if (IS_ABSOLUTE ${model})
				set(LLVM_ML_MODELS_ABSOLUTE ${model})
				else()
				set(LLVM_ML_MODELS_ABSOLUTE
				${CMAKE_CURRENT_SOURCE_DIR}/${model})
				endif()

				set(prefix ${CMAKE_CURRENT_BINARY_DIR}/${fname})
				set(obj_file ${prefix}.o)
				set(hdr_file ${prefix}.h)
				add_custom_command(OUTPUT ${obj_file} ${hdr_file}
				COMMAND "XLA_FLAGS=\"--xla_cpu_multi_thread_eigen=false\"" ${TENSORFLOW_AOT_COMPILER} aot_compile_cpu
				--dir ${LLVM_ML_MODELS_ABSOLUTE}
				davidxl: groupped -- grouped.
				--tag_set ${tag_set}
				--signature_def_key ${signature_def_key}
				--output_prefix ${prefix}
				--cpp_class ${cpp_class}
				--target_triple ${LLVM_HOST_TRIPLE}
				asl: This misses the target triple. Otherwise even on MacOS it will generate linux object file.
				mtrofin: Thanks! Fixed.
				asl: Host triple should be used here, no?
				mtrofin: Done - thanks!
				)

				davidxl: there are lots of references to ${CMAKE_CURRENT_BINARY_DIR}/${fname} here, perhaps use a common variable for it?
				# Aggregate the objects so that results of different tfcompile calls may be
				# grouped into one target.
				set(GENERATED_OBJS ${GENERATED_OBJS} ${obj_file} PARENT_SCOPE)
				set_source_files_properties(${obj_file} PROPERTIES
				GENERATED 1 EXTERNAL_OBJECT 1)

				set(GENERATED_HEADERS ${GENERATED_HEADERS} ${hdr_file} PARENT_SCOPE)
				set_source_files_properties(${hdr_file} PROPERTIES
				GENERATED 1)

				endfunction()

llvm/include/llvm/Analysis/InlineAdvisor.h

[197 lines not shown; enclosing context: private:]
Module &M;		Module &M;
ModuleAnalysisManager &MAM;		ModuleAnalysisManager &MAM;
std::unique_ptr<InlineAdvisor> Advisor;		std::unique_ptr<InlineAdvisor> Advisor;
};		};

Result run(Module &M, ModuleAnalysisManager &MAM) { return Result(M, MAM); }		Result run(Module &M, ModuleAnalysisManager &MAM) { return Result(M, MAM); }
};		};

		#ifdef LLVM_HAVE_TF_AOT
		std::unique_ptr<InlineAdvisor>
		getReleaseModeAdvisor(Module &M, ModuleAnalysisManager &MAM);
		#endif

// Default (manual policy) decision making helper APIs. Shared with the legacy		// Default (manual policy) decision making helper APIs. Shared with the legacy
// pass manager inliner.		// pass manager inliner.

/// Return the cost only if the inliner should attempt to inline at the given		/// Return the cost only if the inliner should attempt to inline at the given
/// CallSite. If we return the cost, we will emit an optimisation remark later		/// CallSite. If we return the cost, we will emit an optimisation remark later
/// using that cost, so we won't do so from this function. Return None if		/// using that cost, so we won't do so from this function. Return None if
/// inlining should not be attempted.		/// inlining should not be attempted.
Optional<InlineCost>		Optional<InlineCost>
[20 lines not shown]

llvm/include/llvm/Analysis/InlineModelFeatureMaps.h

This file was added.

				//===- InlineModelFeatureMaps.h - common model runner defs ------- C++ --===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//

				#ifndef LLVM_ANALYSIS_INLINEMODELFEATUREMAPS_H
				#define LLVM_ANALYSIS_INLINEMODELFEATUREMAPS_H

				#include <array>
				#include <string>
				#include <vector>

				namespace llvm {

				// List of features. Each feature is defined through a triple:
				// - the name of an enum member, which will be the feature index
				// - a textual name, used for Tensorflow model binding (so it needs to match the
				// names used by the Tensorflow model)
				// - a documentation description. Currently, that is not used anywhere
				// programmatically, and serves as workaround to inability of inserting comments
				// in macros.
				#define INLINE_FEATURE_ITERATOR(M) \
				M(CalleeBasicBlockCount, "callee_basic_block_count", \
				"number of basic blocks of the callee") \
				M(CallSiteHeight, "callsite_height", \
				"position of the call site in the original call graph - measured from " \
				"the farthest SCC") \
				M(NodeCount, "node_count", \
				"total current number of defined functions in the module") \
				M(NrCtantParams, "nr_ctant_params", \
				"number of parameters in the call site that are constants") \
				M(CostEstimate, "cost_estimate", "total cost estimate (threshold - free)") \
				M(EdgeCount, "edge_count", \
				"number of module-internal users of the caller, +1 if the caller is " \
				"exposed externally") \
				M(CallerUsers, "caller_users", \
				"number of blocks reached from a conditional instruction, in the caller") \
				M(CallerConditionallyExecutedBlocks, "caller_conditionally_executed_blocks", \
				"number of blocks reached from a conditional instruction, in the caller") \
				M(CallerBasicBlockCount, "caller_basic_block_count", \
				"number of basic blocks in the caller") \
				M(CalleeConditionallyExecutedBlocks, "callee_conditionally_executed_blocks", \
				"number of blocks reached from a conditional instruction, in the callee") \
				M(CalleeUsers, "callee_users", \
				"number of blocks reached from a conditional instruction, in the callee")

				enum class FeatureIndex : size_t {
				#define POPULATE_INDICES(INDEX_NAME, NAME, COMMENT) INDEX_NAME,
				INLINE_FEATURE_ITERATOR(POPULATE_INDICES)
				#undef POPULATE_INDICES
				NumberOfFeatures
				};

				constexpr size_t NumberOfFeatures =
				davidxl: put these static variables inside the .cpp file to avoid multiple copies if the header is included by different sources.
				static_cast<size_t>(FeatureIndex::NumberOfFeatures);

				extern const std::array<std::string, NumberOfFeatures> FeatureNameMap;

				extern const char *const DecisionName;
				extern const char *const DefaultDecisionName;
				extern const char *const RewardName;

				using InlineFeatures = std::vector<int64_t>;

				} // namespace llvm
				#endif // LLVM_ANALYSIS_INLINEMODELFEATUREMAPS_H
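
The feature list above follows the X-macro pattern: one macro enumerates the entries, and each expansion site passes its own per-entry macro to pick the field it needs. A minimal standalone sketch with a shortened feature list is shown here; `DEMO_FEATURE_ITERATOR`, `DemoFeatureIndex`, `DemoFeatureNames` and `demoFeatureName` are hypothetical names used only for illustration and are not part of the patch.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <string>

// Hypothetical, shortened feature list. The patch's INLINE_FEATURE_ITERATOR
// has the same shape (enum member, textual name, description) with more rows.
#define DEMO_FEATURE_ITERATOR(M)                                               \
  M(CalleeBasicBlockCount, "callee_basic_block_count",                         \
    "number of basic blocks of the callee")                                    \
  M(NodeCount, "node_count",                                                   \
    "total current number of defined functions in the module")                 \
  M(CallerBasicBlockCount, "caller_basic_block_count",                         \
    "number of basic blocks in the caller")

// First expansion: keep only the enum member name of each triple.
enum class DemoFeatureIndex : std::size_t {
#define POPULATE_INDICES(INDEX_NAME, NAME, COMMENT) INDEX_NAME,
  DEMO_FEATURE_ITERATOR(POPULATE_INDICES)
#undef POPULATE_INDICES
  NumberOfFeatures // sentinel: equals the number of entries
};

constexpr std::size_t DemoNumberOfFeatures =
    static_cast<std::size_t>(DemoFeatureIndex::NumberOfFeatures);

// Second expansion: keep only the textual name, in the same order as the enum.
static const std::array<std::string, DemoNumberOfFeatures> DemoFeatureNames{{
#define POPULATE_NAMES(INDEX_NAME, NAME, COMMENT) NAME,
  DEMO_FEATURE_ITERATOR(POPULATE_NAMES)
#undef POPULATE_NAMES
}};

// Look up the textual name bound to an enum member.
inline const std::string &demoFeatureName(DemoFeatureIndex I) {
  assert(I != DemoFeatureIndex::NumberOfFeatures && "sentinel is not a feature");
  return DemoFeatureNames[static_cast<std::size_t>(I)];
}
```

Because both expansions walk the same list, adding a row keeps the enum and the name table in sync without touching either definition.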

llvm/include/llvm/Analysis/MLInlineAdvisor.h

This file was added.

				//===- MLInlineAdvisor.h - ML - based InlineAdvisor factories ---- C++ --===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#ifndef LLVM_ANALYSIS_MLINLINEADVISOR_H
				#define LLVM_ANALYSIS_MLINLINEADVISOR_H

				#include "llvm/Analysis/CallGraph.h"
				#include "llvm/Analysis/InlineAdvisor.h"
				#include "llvm/Analysis/MLModelRunner.h"
				#include "llvm/IR/PassManager.h"

				#include <memory>
				#include <unordered_map>

				namespace llvm {
				class Module;
				class MLInlineAdvice;

				class MLInlineAdvisor : public InlineAdvisor {
				public:
				MLInlineAdvisor(Module &M, ModuleAnalysisManager &MAM,
				std::unique_ptr<MLModelRunner> ModelRunner);

				CallGraph *callGraph() const { return CG.get(); }
				virtual ~MLInlineAdvisor() = default;

				void onPassEntry() override;

				std::unique_ptr<InlineAdvice> getAdvice(CallBase &CB) override;

				int64_t getIRSize(const Function &F) const { return F.getInstructionCount(); }
				void onSuccessfulInlining(const MLInlineAdvice &Advice,
				bool CalleeWasDeleted);

				bool isForcedToStop() const { return ForceStop; }
				int64_t getLocalCalls(Function &F);
				const MLModelRunner &getModelRunner() const { return *ModelRunner.get(); }

				protected:
				virtual std::unique_ptr<MLInlineAdvice>
				getMandatoryAdvice(CallBase &CB, OptimizationRemarkEmitter &ORE);

				virtual std::unique_ptr<MLInlineAdvice>
				getAdviceFromModel(CallBase &CB, OptimizationRemarkEmitter &ORE);

				Module &M;
				std::unique_ptr<MLModelRunner> ModelRunner;

				private:
				int64_t getModuleIRSize() const;

				std::unique_ptr<CallGraph> CG;

				int64_t NodeCount = 0;
				int64_t EdgeCount = 0;
				std::map<const Function *, unsigned> FunctionLevels;
				const int32_t InitialIRSize = 0;
				int32_t CurrentIRSize = 0;

				bool ForceStop = false;
				};

				/// InlineAdvice that tracks changes post inlining. For that reason, it only
				/// overrides the "successful inlining" extension points.
				class MLInlineAdvice : public InlineAdvice {
				public:
				MLInlineAdvice(MLInlineAdvisor *Advisor, CallBase &CB,
				OptimizationRemarkEmitter &ORE, bool Recommendation)
				: InlineAdvice(Advisor, CB, ORE, Recommendation),
				CallerIRSize(Advisor->isForcedToStop() ? 0
				: Advisor->getIRSize(*Caller)),
				CalleeIRSize(Advisor->isForcedToStop() ? 0
				: Advisor->getIRSize(*Callee)),
				CallerAndCalleeEdges(Advisor->isForcedToStop()
				? 0
				: (Advisor->getLocalCalls(*Caller) +
				Advisor->getLocalCalls(*Callee))) {}
				virtual ~MLInlineAdvice() = default;

				void recordInliningImpl() override;
				void recordInliningWithCalleeDeletedImpl() override;
				void recordUnsuccessfulInliningImpl(const InlineResult &Result) override;
				void recordUnattemptedInliningImpl() override;

				Function *getCaller() const { return Caller; }
				Function *getCallee() const { return Callee; }

				const int64_t CallerIRSize;
				const int64_t CalleeIRSize;
				const int64_t CallerAndCalleeEdges;

				private:
				void reportContextForRemark(DiagnosticInfoOptimizationBase &OR);

				MLInlineAdvisor *getAdvisor() const {
				return static_cast<MLInlineAdvisor *>(Advisor);
				};
				};

				} // namespace llvm

				#endif // LLVM_ANALYSIS_MLINLINEADVISOR_H
				No newline at end of file
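
The advisor declared above carries `InitialIRSize`, `CurrentIRSize` and `ForceStop` so that it can stop a misbehaving policy once the module's IR has grown past a configurable factor. A minimal standalone sketch of that bookkeeping follows, assuming sizes are instruction counts; `SizeBudget` and its parameter names are hypothetical and only mirror the idea, not the patch's exact code.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for the advisor's size bookkeeping.
struct SizeBudget {
  int64_t InitialIRSize;
  int64_t CurrentIRSize;
  double Threshold; // plays the role of the size-increase threshold option
  bool ForceStop = false;

  SizeBudget(int64_t Initial, double Factor)
      : InitialIRSize(Initial), CurrentIRSize(Initial), Threshold(Factor) {}

  // Record one successful inlining. Sizes "before" were captured when the
  // advice was created; CallerAfter is measured once inlining has happened.
  void onSuccessfulInlining(int64_t CallerBefore, int64_t CalleeBefore,
                            int64_t CallerAfter, bool CalleeWasDeleted) {
    assert(!ForceStop && "no inlining should happen after a forced stop");
    // A deleted callee no longer contributes to the module size.
    int64_t After = CallerAfter + (CalleeWasDeleted ? 0 : CalleeBefore);
    CurrentIRSize += After - (CallerBefore + CalleeBefore);
    if (CurrentIRSize > Threshold * InitialIRSize)
      ForceStop = true;
  }
};
```

Only the delta for the affected caller and callee is applied, so the module total never has to be recomputed from scratch after each inlining.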

llvm/include/llvm/Analysis/MLModelRunner.h

This file was added.

				//===- MLModelRunner.h ---- ML model runner interface ------------ C++ --===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//

				#ifndef LLVM_ANALYSIS_MLMODELRUNNER_H
				#define LLVM_ANALYSIS_MLMODELRUNNER_H

				#include "llvm/Analysis/InlineModelFeatureMaps.h"
				#include "llvm/IR/LLVMContext.h"
				#include "llvm/IR/PassManager.h"

				namespace llvm {

				/// MLModelRunner interface: abstraction of a mechanism for evaluating a
				/// tensorflow "saved model".
				class MLModelRunner {
				public:
				// Disallows copy and assign.
				MLModelRunner(const MLModelRunner &) = delete;
				MLModelRunner &operator=(const MLModelRunner &) = delete;
				virtual ~MLModelRunner() = default;

				virtual bool run() = 0;
				virtual void setFeature(FeatureIndex Index, int64_t Value) = 0;
				virtual int64_t getFeature(int Index) const = 0;

				protected:
				MLModelRunner(LLVMContext &Ctx) : Ctx(Ctx) {}

				LLVMContext &Ctx;
				};
				} // namespace llvm

				#endif // LLVM_ANALYSIS_MLMODELRUNNER_H

llvm/lib/Analysis/CMakeLists.txt

		set(CommonMLSources MLInlineAdvisor.cpp)
		set(ReleaseModeMLSources ReleaseModeModelRunner.cpp)

		if (DEFINED LLVM_HAVE_TF_AOT)
		include(TensorFlowCompile)
		tfcompile(models/inliner serve action InlinerSizeModel llvm::InlinerSizeModel)
		list(APPEND ReleaseModeMLSources
		$<TARGET_OBJECTS:tf_xla_runtime_objects>
		${GENERATED_OBJS}
		)
		set(MLPolicySources ${CommonMLSources} ${ReleaseModeMLSources})
		else()
		set(LLVM_OPTIONAL_SOURCES ${CommonMLSources} ${ReleaseModeMLSources})
		endif()

add_llvm_component_library(LLVMAnalysis		add_llvm_component_library(LLVMAnalysis
AliasAnalysis.cpp		AliasAnalysis.cpp
AliasAnalysisEvaluator.cpp		AliasAnalysisEvaluator.cpp
AliasAnalysisSummary.cpp		AliasAnalysisSummary.cpp
AliasSetTracker.cpp		AliasSetTracker.cpp
Analysis.cpp		Analysis.cpp
AssumeBundleQueries.cpp		AssumeBundleQueries.cpp
AssumptionCache.cpp		AssumptionCache.cpp
[88 lines not shown; enclosing context: add_llvm_component_library(LLVMAnalysis]
TypeBasedAliasAnalysis.cpp		TypeBasedAliasAnalysis.cpp
TypeMetadataUtils.cpp		TypeMetadataUtils.cpp
ScopedNoAliasAA.cpp		ScopedNoAliasAA.cpp
ValueLattice.cpp		ValueLattice.cpp
ValueLatticeUtils.cpp		ValueLatticeUtils.cpp
ValueTracking.cpp		ValueTracking.cpp
VectorUtils.cpp		VectorUtils.cpp
VFABIDemangling.cpp		VFABIDemangling.cpp
		${MLPolicySources}

ADDITIONAL_HEADER_DIRS		ADDITIONAL_HEADER_DIRS
${LLVM_MAIN_INCLUDE_DIR}/llvm/Analysis		${LLVM_MAIN_INCLUDE_DIR}/llvm/Analysis

DEPENDS		DEPENDS
intrinsics_gen		intrinsics_gen
)		)

llvm/lib/Analysis/InlineAdvisor.cpp

[149 lines not shown; enclosing context: bool InlineAdvisorAnalysis::Result::tryCreate(InlineParams Params,]
switch (Mode) {		switch (Mode) {
case InliningAdvisorMode::Default:		case InliningAdvisorMode::Default:
Advisor.reset(new DefaultInlineAdvisor(FAM, Params));		Advisor.reset(new DefaultInlineAdvisor(FAM, Params));
break;		break;
case InliningAdvisorMode::Development:		case InliningAdvisorMode::Development:
// To be added subsequently under conditional compilation.		// To be added subsequently under conditional compilation.
break;		break;
case InliningAdvisorMode::Release:		case InliningAdvisorMode::Release:
// To be added subsequently under conditional compilation.		#ifdef LLVM_HAVE_TF_AOT
		Advisor = llvm::getReleaseModeAdvisor(M, MAM);
		#endif
		davidxl: add an assert in the #else branch?
		mtrofin: Actually, we don't assert, rather the tryCreate caller checks the return of this function and emits an error if it didn't get an Advisor - this is the current behavior.
break;		break;
}		}
return !!Advisor;		return !!Advisor;
}		}

/// Return true if inlining of CB can block the caller from being		/// Return true if inlining of CB can block the caller from being
/// inlined which is proved to be more beneficial. \p IC is the		/// inlined which is proved to be more beneficial. \p IC is the
/// estimated inline cost associated with callsite \p CB.		/// estimated inline cost associated with callsite \p CB.
[232 lines not shown]

llvm/lib/Analysis/MLInlineAdvisor.cpp

This file was added.

				//===- MLInlineAdvisor.cpp - machine learned InlineAdvisor ----------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// This file implements the interface between the inliner and a learned model.
				// It delegates model evaluation to either the AOT compiled model (the
				// 'release' mode) or a runtime-loaded model (the 'development' case).
				//
				//===----------------------------------------------------------------------===//
				#include <limits>
				#include <unordered_map>
				#include <unordered_set>

				#include "llvm/ADT/SCCIterator.h"
				#include "llvm/Analysis/CallGraph.h"
				#include "llvm/Analysis/InlineCost.h"
				#include "llvm/Analysis/InlineFeaturesAnalysis.h"
				#include "llvm/Analysis/MLInlineAdvisor.h"
				#include "llvm/Analysis/MLModelRunner.h"
				#include "llvm/Analysis/OptimizationRemarkEmitter.h"
				#include "llvm/Analysis/TargetLibraryInfo.h"
				#include "llvm/Analysis/TargetTransformInfo.h"
				#include "llvm/IR/InstIterator.h"
				#include "llvm/IR/Instructions.h"
				#include "llvm/IR/PassManager.h"
				#include "llvm/Support/CommandLine.h"
				#include "llvm/Support/Path.h"

				using namespace llvm;

				#define DEBUG_TYPE "inline-ml"

				static cl::opt<float> SizeIncreaseThreshold(
				"ml-advisor-size-increase-threshold", cl::Hidden,
				davidxl: Is this for controlling training time overhead?
				mtrofin: Not only, it's also for controlling against misbehaving policies. Description was wrong, it's not native size increase, it's IR size.
				cl::desc("Maximum factor by which expected native size may increase before "
				"blocking any further inlining."),
				cl::init(2.0));

				const std::array<std::string, NumberOfFeatures> llvm::FeatureNameMap{
				#define POPULATE_NAMES(INDEX_NAME, NAME, COMMENT) NAME,
				INLINE_FEATURE_ITERATOR(POPULATE_NAMES)
				#undef POPULATE_NAMES
				};

				const char *const llvm::DecisionName = "inlining_decision";
				const char *const llvm::DefaultDecisionName = "inlining_default";
				const char *const llvm::RewardName = "delta_size";

				CallBase *getInlinableCS(Instruction &I) {
				if (auto *CS = dyn_cast<CallBase>(&I))
				if (Function *Callee = CS->getCalledFunction()) {
				if (!Callee->isDeclaration()) {
				return CS;
				}
				}
				return nullptr;
				}

				MLInlineAdvisor::MLInlineAdvisor(Module &M, ModuleAnalysisManager &MAM,
				std::unique_ptr<MLModelRunner> Runner)
				: InlineAdvisor(
				MAM.getResult<FunctionAnalysisManagerModuleProxy>(M).getManager()),
				M(M), ModelRunner(std::move(Runner)), CG(new CallGraph(M)),
				InitialIRSize(getModuleIRSize()), CurrentIRSize(InitialIRSize) {
				assert(ModelRunner);

				// Extract the 'call site height' feature - the position of a call site
				// relative to the farthest statically reachable SCC node. We don't mutate
				// this value while inlining happens. Empirically, this feature proved
				davidxl: reverse the condition with continue to reduce nesting level
				// critical in behavioral cloning - i.e. training a model to mimic the manual
				// heuristic's decisions - and, thus, equally important for training for
				// improvement.
				for (auto I = scc_begin(CG.get()); !I.isAtEnd(); ++I) {
				const std::vector<CallGraphNode *> &CGNodes = *I;
				davidxl: inlinable callee
				unsigned Level = 0;
				for (auto *CGNode : CGNodes) {
				Function *F = CGNode->getFunction();
				if (!F || F->isDeclaration())
				continue;
				for (auto &I : instructions(F)) {
				if (auto *CS = getInlinableCS(I)) {
				auto *Called = CS->getCalledFunction();
				auto Pos = FunctionLevels.find(Called);
				// In bottom up traversal, an inlinable callee is either in the
				// same SCC, or to a function in a visited SCC. So not finding its
				// level means we haven't visited it yet, meaning it's in this SCC.
				if (Pos == FunctionLevels.end())
				continue;
				Level = std::max(Level, Pos->second + 1);
				}
				}
				}
				for (auto *CGNode : CGNodes) {
				Function *F = CGNode->getFunction();
				if (F && !F->isDeclaration())
				FunctionLevels[F] = Level;
				}
				}
				}

				void MLInlineAdvisor::onPassEntry() {
				// Function passes executed between InlinerPass runs may have changed the
				// module-wide features.
				NodeCount = 0;
				EdgeCount = 0;
				for (auto &F : M)
				if (!F.isDeclaration()) {
				++NodeCount;
				EdgeCount += getLocalCalls(F);
				}
				}

				int64_t MLInlineAdvisor::getLocalCalls(Function &F) {
				return FAM.getResult<InlineFeaturesAnalysis>(F).DirectCallsToDefinedFunctions;
				}

				// Update the internal state of the advisor, and force invalidate feature
				// analysis. Currently, we maintain minimal (and very simple) global state - the
				// number of functions and the number of static calls. We also keep track of the
				// total IR size in this module, to stop misbehaving policies at a certain bloat
				// factor (SizeIncreaseThreshold)
				void MLInlineAdvisor::onSuccessfulInlining(const MLInlineAdvice &Advice,
				bool CalleeWasDeleted) {
				assert(!ForceStop);
				Function *Caller = Advice.getCaller();
				Function *Callee = Advice.getCallee();

				// The caller features aren't valid anymore.
				FAM.invalidate<InlineFeaturesAnalysis>(*Caller);
				int64_t IRSizeAfter =
				getIRSize(*Caller) + (CalleeWasDeleted ? 0 : Advice.CalleeIRSize);
				CurrentIRSize += IRSizeAfter - (Advice.CallerIRSize + Advice.CalleeIRSize);
				if (CurrentIRSize > SizeIncreaseThreshold * InitialIRSize)
				ForceStop = true;

				// We can delta-update module-wide features. We know the inlining only changed
				// the caller, and maybe the callee (by deleting the latter).
				// Nodes are simple to update.
				// For edges, we 'forget' the edges that the caller and callee used to have
				// before inlining, and add back what they currently have together.
				int64_t NewCallerAndCalleeEdges =
				FAM.getResult<InlineFeaturesAnalysis>(*Caller)
				.DirectCallsToDefinedFunctions;

				if (CalleeWasDeleted)
				--NodeCount;
				else
				NewCallerAndCalleeEdges += FAM.getResult<InlineFeaturesAnalysis>(*Callee)
				.DirectCallsToDefinedFunctions;
				EdgeCount += (NewCallerAndCalleeEdges - Advice.CallerAndCalleeEdges);
				assert(CurrentIRSize >= 0 && EdgeCount >= 0 && NodeCount >= 0);
				}

				int64_t MLInlineAdvisor::getModuleIRSize() const {
				int64_t Ret = 0;
				for (auto &F : CG->getModule())
				if (!F.isDeclaration())
				Ret += getIRSize(F);
				return Ret;
				}

				std::unique_ptr<InlineAdvice> MLInlineAdvisor::getAdvice(CallBase &CB) {
				auto &Caller = *CB.getCaller();
				auto &Callee = *CB.getCalledFunction();

				auto GetAssumptionCache = [&](Function &F) -> AssumptionCache & {
				return FAM.getResult<AssumptionAnalysis>(F);
				};
				auto GetTLI = [&](Function &F) -> const TargetLibraryInfo & {
				return FAM.getResult<TargetLibraryAnalysis>(F);
				};

				auto &TIR = FAM.getResult<TargetIRAnalysis>(Callee);
				auto &ORE = FAM.getResult<OptimizationRemarkEmitterAnalysis>(Caller);

				auto TrivialDecision =
				llvm::getAttributeBasedInliningDecision(CB, &Callee, TIR, GetTLI);

				// If this is a "never inline" case, there won't be any changes to internal
				// state we need to track, so we can just return the base InlineAdvice, which
				// will do nothing interesting.
				// Same thing if this is a recursive case.
				if ((TrivialDecision.hasValue() && !TrivialDecision->isSuccess()) ||
				&Caller == &Callee)
				return std::make_unique<InlineAdvice>(this, CB, ORE, false);

				bool Mandatory = TrivialDecision.hasValue() && TrivialDecision->isSuccess();

				// If we need to stop, we won't want to track anymore any state changes, so
				// we just return the base InlineAdvice, which acts as a noop.
				if (ForceStop) {
				ORE.emit([&] {
				return OptimizationRemarkMissed(DEBUG_TYPE, "ForceStop", &CB)
				<< "Won't attempt inlining because module size grew too much.";
				});
				return std::make_unique<InlineAdvice>(this, CB, ORE, Mandatory);
				}

				int CostEstimate = 0;
				if (!Mandatory) {
				auto IsCallSiteInlinable =
				llvm::getInliningCostEstimate(CB, TIR, GetAssumptionCache);
				if (!IsCallSiteInlinable) {
				// We can't inline this for correctness reasons, so return the base
				// InlineAdvice, as we don't care about tracking any state changes (which
				// won't happen).
				return std::make_unique<InlineAdvice>(this, CB, ORE, false);
				}
				CostEstimate = *IsCallSiteInlinable;
				}

				if (Mandatory)
				return getMandatoryAdvice(CB, ORE);

				auto NrCtantParams = 0;
				for (auto I = CB.arg_begin(), E = CB.arg_end(); I != E; ++I) {
				NrCtantParams += (isa<Constant>(*I));
				}

				auto &CallerBefore = FAM.getResult<InlineFeaturesAnalysis>(Caller);
				auto &CalleeBefore = FAM.getResult<InlineFeaturesAnalysis>(Callee);

				ModelRunner->setFeature(FeatureIndex::CalleeBasicBlockCount,
				CalleeBefore.BasicBlockCount);
				ModelRunner->setFeature(FeatureIndex::CallSiteHeight,
				FunctionLevels[&Caller]);
				ModelRunner->setFeature(FeatureIndex::NodeCount, NodeCount);
				ModelRunner->setFeature(FeatureIndex::NrCtantParams, NrCtantParams);
				ModelRunner->setFeature(FeatureIndex::CostEstimate, CostEstimate);
				ModelRunner->setFeature(FeatureIndex::EdgeCount, EdgeCount);
				ModelRunner->setFeature(FeatureIndex::CallerUsers, CallerBefore.Uses);
				ModelRunner->setFeature(FeatureIndex::CallerConditionallyExecutedBlocks,
				CallerBefore.BlocksReachedFromConditionalInstruction);
				ModelRunner->setFeature(FeatureIndex::CallerBasicBlockCount,
				CallerBefore.BasicBlockCount);
				ModelRunner->setFeature(FeatureIndex::CalleeConditionallyExecutedBlocks,
				CalleeBefore.BlocksReachedFromConditionalInstruction);
				ModelRunner->setFeature(FeatureIndex::CalleeUsers, CalleeBefore.Uses);
				return getAdviceFromModel(CB, ORE);
				}

				std::unique_ptr<MLInlineAdvice>
				MLInlineAdvisor::getAdviceFromModel(CallBase &CB,
				OptimizationRemarkEmitter &ORE) {
				return std::make_unique<MLInlineAdvice>(this, CB, ORE, ModelRunner->run());
				}

				std::unique_ptr<MLInlineAdvice>
				MLInlineAdvisor::getMandatoryAdvice(CallBase &CB,
				OptimizationRemarkEmitter &ORE) {
				return std::make_unique<MLInlineAdvice>(this, CB, ORE, true);
				}

				void MLInlineAdvice::reportContextForRemark(
				DiagnosticInfoOptimizationBase &OR) {
				using namespace ore;
				OR << NV("Callee", Callee->getName());
				for (size_t I = 0; I < NumberOfFeatures; ++I)
				OR << NV(FeatureNameMap[I], getAdvisor()->getModelRunner().getFeature(I));
				OR << NV("ShouldInline", isInliningRecommended());
				}

				void MLInlineAdvice::recordInliningImpl() {
				ORE.emit([&]() {
				OptimizationRemark R(DEBUG_TYPE, "InliningSuccess", DLoc, Block);
				reportContextForRemark(R);
				return R;
				});
				getAdvisor()->onSuccessfulInlining(*this, /*CalleeWasDeleted*/ false);
				}

				void MLInlineAdvice::recordInliningWithCalleeDeletedImpl() {
				ORE.emit([&]() {
				OptimizationRemark R(DEBUG_TYPE, "InliningSuccessWithCalleeDeleted", DLoc,
				Block);
				reportContextForRemark(R);
				return R;
				});
				getAdvisor()->onSuccessfulInlining(*this, /*CalleeWasDeleted*/ true);
				}

				void MLInlineAdvice::recordUnsuccessfulInliningImpl(
				const InlineResult &Result) {
				ORE.emit([&]() {
				OptimizationRemarkMissed R(DEBUG_TYPE, "InliningAttemptedAndUnsuccessful",
				DLoc, Block);
				reportContextForRemark(R);
				return R;
				});
				}
				void MLInlineAdvice::recordUnattemptedInliningImpl() {
				ORE.emit([&]() {
				OptimizationRemarkMissed R(DEBUG_TYPE, "IniningNotAttempted", DLoc, Block);
				reportContextForRemark(R);
				return R;
				});
				}
				No newline at end of file
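
The constructor above derives the "call site height" by visiting SCCs bottom-up and giving every function in an SCC a level one higher than the highest already-visited callee. A minimal standalone sketch of that traversal over plain integer ids follows; `CallMap` and `computeLevels` are hypothetical names, and the caller is assumed to supply the SCCs already in bottom-up order.

```cpp
#include <algorithm>
#include <cassert>
#include <map>
#include <vector>

// Functions are plain ints here; Calls[F] lists the defined callees of F.
using CallMap = std::map<int, std::vector<int>>;

// SCCs must be in bottom-up order: an SCC appears after every SCC it calls.
inline std::map<int, unsigned>
computeLevels(const std::vector<std::vector<int>> &SCCs, const CallMap &Calls) {
  std::map<int, unsigned> Levels;
  for (const auto &SCC : SCCs) {
    unsigned Level = 0;
    for (int F : SCC) {
      auto It = Calls.find(F);
      if (It == Calls.end())
        continue;
      for (int Callee : It->second) {
        auto Pos = Levels.find(Callee);
        // Not visited yet means the callee is in the SCC being processed.
        if (Pos == Levels.end())
          continue;
        Level = std::max(Level, Pos->second + 1);
      }
    }
    // Every member of the SCC gets the same level.
    for (int F : SCC) {
      assert(!Levels.count(F) && "a function belongs to exactly one SCC");
      Levels[F] = Level;
    }
  }
  return Levels;
}
```

Levels are assigned only after the whole SCC has been scanned, which is why calls between members of the same SCC do not raise the level.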

llvm/lib/Analysis/ReleaseModeModelRunner.cpp

This file was added.

				//===- ReleaseModeModelRunner.cpp - Fast, precompiled model runner -------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// This file implements a model runner wrapping an AOT compiled ML model.
				// Only inference is supported.
				//
				//===----------------------------------------------------------------------===//

				#include "llvm/Analysis/InlineModelFeatureMaps.h"
				#include "llvm/Analysis/MLInlineAdvisor.h"

				// codegen-ed file
				#include "InlinerSizeModel.h" // NOLINT

				#include <memory>
				#include <vector>

				using namespace llvm;
				namespace {

				static const char *const FeedPrefix = "feed_";
				static const char *const FetchPrefix = "fetch_";

				/// MLModelRunner - production mode implementation. It uses an AOT-compiled
				/// SavedModel for efficient execution.
				class ReleaseModeModelRunner final : public MLModelRunner {
				public:
				davidxl: MLInferenceRunner?
				mtrofin: What would we call the Development mode one then?
				ReleaseModeModelRunner(LLVMContext &Ctx);
				virtual ~ReleaseModeModelRunner() = default;

				bool run() override;

				void setFeature(FeatureIndex Index, int64_t Value) override;
				int64_t getFeature(int Index) const override;

				private:
				std::vector<int32_t> FeatureIndices;
				int32_t ResultIndex = -1;
				std::unique_ptr<llvm::InlinerSizeModel> CompiledModel;
				};
				} // namespace

				ReleaseModeModelRunner::ReleaseModeModelRunner(LLVMContext &Ctx)
				: MLModelRunner(Ctx),
				CompiledModel(std::make_unique<llvm::InlinerSizeModel>()) {
				assert(CompiledModel && "The CompiledModel should be valid");

				FeatureIndices.resize(NumberOfFeatures);

				for (size_t I = 0; I < NumberOfFeatures; ++I) {
				const int Index =
				CompiledModel->LookupArgIndex(FeedPrefix + FeatureNameMap[I]);
				assert(Index >= 0 && "Cannot find Feature in inlining model");
				FeatureIndices[I] = Index;
				}

				ResultIndex =
				CompiledModel->LookupResultIndex(std::string(FetchPrefix) + DecisionName);
				assert(ResultIndex >= 0 && "Cannot find DecisionName in inlining model");
				}

				int64_t ReleaseModeModelRunner::getFeature(int Index) const {
				return *static_cast<int64_t *>(
				CompiledModel->arg_data(FeatureIndices[Index]));
				}

				void ReleaseModeModelRunner::setFeature(FeatureIndex Index, int64_t Value) {
				*static_cast<int64_t *>(CompiledModel->arg_data(
				FeatureIndices[static_cast<size_t>(Index)])) = Value;
				}

				bool ReleaseModeModelRunner::run() {
				CompiledModel->Run();
				return static_cast<bool>(
				*static_cast<int64_t *>(CompiledModel->result_data(ResultIndex)));
				}

				std::unique_ptr<InlineAdvisor>
				llvm::getReleaseModeAdvisor(Module &M, ModuleAnalysisManager &MAM) {
				auto AOTRunner = std::make_unique<ReleaseModeModelRunner>(M.getContext());
				return std::make_unique<MLInlineAdvisor>(M, MAM, std::move(AOTRunner));
				}
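
The lookup pattern in the runner above can be sketched outside of LLVM. This is a minimal Python illustration, not the actual implementation: `MockCompiledModel`, `ReleaseModeRunnerSketch`, and the feature and decision names used with them are hypothetical stand-ins for the generated `InlinerSizeModel` class and the real feature map. It shows how each feature name is resolved once, through the `feed_` prefix, to an argument slot of the compiled model, and how the decision is read back through the `fetch_` prefix.

```python
FEED_PREFIX = "feed_"
FETCH_PREFIX = "fetch_"


class MockCompiledModel:
    """Hypothetical stand-in for the AOT-generated model class."""

    def __init__(self, arg_names, result_name, policy):
        self._arg_names = list(arg_names)
        self._result_name = result_name
        self._policy = policy  # callable: list of ints -> int
        self.args = [0] * len(self._arg_names)
        self.results = [0]

    def lookup_arg_index(self, name):
        # Return -1 for unknown names, mirroring the "Index >= 0" checks.
        return self._arg_names.index(name) if name in self._arg_names else -1

    def lookup_result_index(self, name):
        return 0 if name == self._result_name else -1

    def run(self):
        self.results[0] = self._policy(self.args)


class ReleaseModeRunnerSketch:
    """Resolves feature names to model argument slots once, at construction."""

    def __init__(self, model, feature_names, decision_name):
        self._model = model
        # One slot per feature, sized up front so indexed writes are valid.
        self._feature_indices = [-1] * len(feature_names)
        for i, name in enumerate(feature_names):
            index = model.lookup_arg_index(FEED_PREFIX + name)
            if index < 0:
                raise KeyError("Cannot find feature in model: " + name)
            self._feature_indices[i] = index
        self._result_index = model.lookup_result_index(FETCH_PREFIX + decision_name)
        if self._result_index < 0:
            raise KeyError("Cannot find decision in model: " + decision_name)

    def set_feature(self, feature, value):
        self._model.args[self._feature_indices[feature]] = int(value)

    def get_feature(self, feature):
        return self._model.args[self._feature_indices[feature]]

    def run(self):
        self._model.run()
        return bool(self._model.results[self._result_index])
```

The point of the indirection is that the order of features in the feature map does not have to match the order of arguments in the compiled model; only the names have to agree.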

llvm/lib/Analysis/models/inliner/saved_model.pb

This binary file was added.

llvm/lib/Analysis/models/inliner/variables/variables.data-00000-of-00002

This binary file was added.

llvm/lib/Analysis/models/inliner/variables/variables.data-00001-of-00002

This binary file was added.

llvm/lib/Analysis/models/inliner/variables/variables.index

This binary file was added.

llvm/test/Bindings/Go/lit.local.cfg

	import os
	import pipes
	import shlex
	import sys

	if not 'go' in config.root.llvm_bindings:
	    config.unsupported = True

	if not config.root.include_go_tests:
	    config.unsupported = True

				if config.have_tf_aot:
				    config.unsupported = True

	def find_executable(executable, path=None):
	    if path is None:
	        path = os.environ['PATH']
	    paths = path.split(os.pathsep)
	    base, ext = os.path.splitext(executable)

	    if (sys.platform == 'win32' or os.name == 'os2') and (ext != '.exe'):
	        executable = executable + '.exe'
	[41 lines not shown]

llvm/test/Transforms/Inline/ML/Inputs/test-module.ll

This file was added.

				target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
				target triple = "x86_64-grtev4-linux-gnu"

				declare void @external_fct(i32)

				define dso_local i32 @top() {
				%a = call i32 @multiplier(i32 5)
				%b = call i32 @adder(i32 10)
				%ret = add nsw i32 %a, %b
				call void @external_fct(i32 %ret)
				ret i32 %ret
				}

				define internal dso_local i32 @adder(i32) {
				%2 = alloca i32, align 4
				store i32 %0, i32* %2, align 4
				%3 = load i32, i32* %2, align 4
				%4 = call i32 @multiplier(i32 %3)
				%5 = load i32, i32* %2, align 4
				%6 = call i32 @switcher(i32 1)
				%7 = add nsw i32 %4, %6
				ret i32 %7
				}

				define internal i32 @multiplier(i32) {
				%2 = alloca i32, align 4
				store i32 %0, i32* %2, align 4
				%3 = load i32, i32* %2, align 4
				%4 = load i32, i32* %2, align 4
				%5 = mul nsw i32 %3, %4
				ret i32 %5
				}

				define i32 @switcher(i32) {
				%2 = alloca i32, align 4
				%3 = alloca i32, align 4
				store i32 %0, i32* %3, align 4
				%4 = load i32, i32* %3, align 4
				switch i32 %4, label %11 [
				i32 1, label %5
				i32 2, label %6
				]

				; <label>:5: ; preds = %1
				store i32 2, i32* %2, align 4
				br label %12

				; <label>:6: ; preds = %1
				%7 = load i32, i32* %3, align 4
				%8 = load i32, i32* %3, align 4
				%9 = call i32 @multiplier(i32 %8)
				%10 = add nsw i32 %7, %9
				store i32 %10, i32* %2, align 4
				br label %12

				; <label>:11: ; preds = %1
				%adder.result = call i32 @adder(i32 2)
				store i32 %adder.result, i32* %2, align 4
				br label %12

				; <label>:12: ; preds = %11, %6, %5
				%13 = load i32, i32* %2, align 4
				ret i32 %13
				}
				No newline at end of file

llvm/test/Transforms/Inline/ML/bounds-checks.ll

This file was added.

				; Test behavior when inlining policy grows size out of control.
				; In all cases, the end result is the same: mandatory inlinings must happen.
				; However, when we discover we 'trip' over the artificially-low size increase
				; factor, we don't inline anymore.
				; REQUIRES: have_tf_aot
				; RUN: opt -passes=scc-oz-module-inliner -enable-ml-inliner=release -ml-advisor-size-increase-threshold=10.0 -S < %s 2>&1 | FileCheck %s --check-prefix=CHECK --check-prefix=NOBOUNDS
				; RUN: opt -passes=scc-oz-module-inliner -enable-ml-inliner=release -ml-advisor-size-increase-threshold=1.0 -S < %s 2>&1 | FileCheck %s --check-prefix=CHECK --check-prefix=BOUNDS

				target datalayout = "e-m:e-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128"
				target triple = "x86_64-grtev4-linux-gnu"

				declare i64 @f1()

				define i64 @f2() #0 {
				%r = call i64 @f1()
				%r2 = add i64 13, %r
				ret i64 %r2
				}

				define i64 @some_function() {
				%r = call i64 @f1()
				%r2 = add i64 13, %r
				ret i64 %r2
				}

				define i64 @top() {
				%r = call i64 @f2()
				%r2 = call i64 @some_function()
				%r3 = add i64 %r, %r2
				ret i64 %r3
				}

				attributes #0 = { alwaysinline }

				; CHECK-LABEL: @top
				; f2 must always be inlined, so we won't find a call to it in @top()
				; CHECK-NOT: call i64 @f2
				; @some_function isn't mandatory, and when we set the increase threshold too low,
				davidxl (Done): can you explain about the expected output?
				mtrofin (Author, Done): Added more detail
				davidxl (Not Done): ok. I do wish the increase threshold to be learned as well in the future and make this option unnecessary.
				; it won't be inlined.
				; NOBOUNDS-NOT: @some_function
				; BOUNDS: call i64 @some_function
				No newline at end of file
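
The behavior this test describes can be sketched as a small decision rule. The Python below is an illustration under assumptions, not the advisor's code: the class name, the size bookkeeping, and the numbers are hypothetical, and only the two properties stated in the test comments are modeled (mandatory inlinings always happen; once the size-increase factor is exceeded, no further optional inlining is advised).

```python
class SizeBoundedAdvisorSketch:
    """Hypothetical model of the size-increase bound described in the test."""

    def __init__(self, initial_size, size_increase_threshold):
        self.initial_size = initial_size
        self.current_size = initial_size
        self.threshold = size_increase_threshold
        self.force_stop = False

    def should_inline(self, mandatory, model_says_inline):
        # Mandatory (e.g. alwaysinline) call sites are inlined regardless.
        if mandatory:
            return True
        # Once the bound was tripped, stop all optional inlining.
        if self.force_stop:
            return False
        return model_says_inline

    def on_inlined(self, size_delta):
        # Track the size after each inlining and trip the bound if exceeded.
        self.current_size += size_delta
        if self.current_size > self.initial_size * self.threshold:
            self.force_stop = True
```

With a threshold of 1.0 any growth trips the bound, which corresponds to the BOUNDS run keeping the call to @some_function; with 10.0 the bound is not reached on a module this small, which corresponds to the NOBOUNDS run.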

llvm/test/Transforms/Inline/ML/ml-test-release-mode.ll

This file was added.

				; The default inliner doesn't elide @adder because it believes it's too costly
				; to inline adder into switcher. The ML inliner carries out that inlining, resulting in
				davidxl (Done): Why can't default inliner handle this case (adder call can be folded).
				mtrofin (Author, Done): Cost evaluation - added explanation.
				; a smaller result (part of it is that adder gets elided).
				;
				; This test uses Inputs/test-module.ll, as it will share it with a similar test
				; for the 'development' mode.
				;
				; REQUIRES: have_tf_aot
				; RUN: opt -passes=scc-oz-module-inliner -enable-ml-inliner=release -S < %S/Inputs/test-module.ll 2>&1 | FileCheck %s --check-prefix=CHECK
				; RUN: opt -passes=scc-oz-module-inliner -enable-ml-inliner=default -S < %S/Inputs/test-module.ll 2>&1 | FileCheck %s --check-prefix=DEFAULT

				; CHECK-NOT: @adder
				; DEFAULT-LABEL: @adder
				; DEFAULT-NEXT: %2 = mul
				No newline at end of file

llvm/test/Transforms/Inline/inlining-advisor-default.ll

	; Check that, in the absence of dependencies, we emit an error message when
	; trying to use ML-driven inlining.
	;			; REQUIRES: !have_tf_aot
	; RUN: not opt -passes=scc-oz-module-inliner -enable-ml-inliner=development -S < %s 2>&1 | FileCheck %s
	; RUN: not opt -passes=scc-oz-module-inliner -enable-ml-inliner=release -S < %s 2>&1 | FileCheck %s

	declare i64 @f1()

	; CHECK: Could not setup Inlining Advisor for the requested mode and/or options
	No newline at end of file

llvm/test/lit.cfg.py

[213 lines not shown]
config.substitutions.append(('%loadnewpmbye',
.format(config.llvm_shlib_dir,
config.llvm_shlib_ext)))


# Static libraries are not built if BUILD_SHARED_LIBS is ON.
if not config.build_shared_libs and not config.link_llvm_dylib:
    config.available_features.add('static-libs')

		if config.have_tf_aot:
		    config.available_features.add("have_tf_aot")

def have_cxx_shared_library():
    readobj_exe = lit.util.which('llvm-readobj', config.llvm_tools_dir)
    if not readobj_exe:
        print('llvm-readobj not found')
        return False

    try:
        readobj_cmd = subprocess.Popen(
[127 lines not shown]
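
How the `have_tf_aot` feature gates the tests in this change can be sketched in a few lines of Python. This is a simplified model and not lit's implementation: `available_features_for` and `is_supported` are hypothetical helpers, and each REQUIRES clause is assumed to be either a bare feature name or a negated one, whereas lit itself accepts full boolean expressions.

```python
def available_features_for(have_tf_aot):
    """Mimic the lit.cfg.py hunk: expose the feature only when AOT is enabled."""
    features = set()
    if have_tf_aot:
        features.add("have_tf_aot")
    return features


def is_supported(requires, available_features):
    """Return True when every REQUIRES clause holds.

    Simplification: a clause is `name` or `!name` only.
    """
    for clause in requires:
        clause = clause.strip()
        if clause.startswith("!"):
            # Negated clause: the feature must be absent.
            if clause[1:].strip() in available_features:
                return False
        elif clause not in available_features:
            # Plain clause: the feature must be present.
            return False
    return True
```

Under this model, the ML tests marked `REQUIRES: have_tf_aot` run only in builds configured with the AOT model, while inlining-advisor-default.ll, marked `REQUIRES: !have_tf_aot`, runs only in builds without it.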

llvm/test/lit.site.cfg.py.in

	[42 lines not shown]
	config.link_llvm_dylib = @LLVM_LINK_LLVM_DYLIB@
	config.llvm_libxml2_enabled = @LLVM_LIBXML2_ENABLED@
	config.llvm_host_triple = '@LLVM_HOST_TRIPLE@'
	config.host_arch = "@HOST_ARCH@"
	config.have_opt_viewer_modules = @LLVM_HAVE_OPT_VIEWER_MODULES@
	config.libcxx_used = @LLVM_LIBCXX_USED@
	config.has_plugins = @LLVM_ENABLE_PLUGINS@
	config.linked_bye_extension = @LLVM_BYE_LINK_INTO_TOOLS@
				config.have_tf_aot = ("@LLVM_HAVE_TF_AOT@" == "ON")
				thakis (Done): Please use llvm_canonicalize_cmake_booleans for this.
				mtrofin (Author, Done): could you elaborate how? I see it used in CMakeLists files - not super sure how I'd use it here. Thanks!
				mtrofin (Author, Done): Being addressed in D82776.

	# Support substitution of the tools_dir with user parameters. This is
	# used when we can't determine the tool dir at configuration time.
	try:
	    config.llvm_tools_dir = config.llvm_tools_dir % lit_config.params
	    config.llvm_shlib_dir = config.llvm_shlib_dir % lit_config.params
	except KeyError:
	    e = sys.exc_info()[1]
	[9 lines not shown]
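
The concern behind the llvm_canonicalize_cmake_booleans comment can be illustrated with a short Python sketch. It is an approximation under stated assumptions: `cmake_is_true` and `canonicalize` are hypothetical helpers that model CMake's documented truthy and falsy constants (treating any other non-numeric string as false is a simplification), and `canonicalize` mirrors the effect of rewriting a variable to 1 or 0 before the template is configured. It shows why comparing the substituted text against the literal "ON" is fragile.

```python
_TRUE_CONSTANTS = {"1", "ON", "YES", "TRUE", "Y"}
_FALSE_CONSTANTS = {"0", "OFF", "NO", "FALSE", "N", "IGNORE", "NOTFOUND", ""}


def cmake_is_true(value):
    """Approximate how CMake's if() treats a constant (case-insensitive)."""
    text = value.upper()
    if text in _TRUE_CONSTANTS:
        return True
    if text in _FALSE_CONSTANTS or text.endswith("-NOTFOUND"):
        return False
    try:
        # Any other non-zero number counts as true.
        return float(text) != 0
    except ValueError:
        # Simplification: unrecognized strings are treated as false here.
        return False


def canonicalize(value):
    """Reduce any CMake boolean spelling to 1 or 0."""
    return 1 if cmake_is_true(value) else 0
```

A user who configures with `-DLLVM_HAVE_TF_AOT=1` or `=TRUE` enables the feature in CMake, yet `"1" == "ON"` evaluates to False in the generated Python. Once the variable is canonicalized, the template can assign the substituted value directly, as the neighboring `config.has_plugins = @LLVM_ENABLE_PLUGINS@` line does.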

[llvm] Release-mode ML InlineAdvisor
Closed, Public

Diff 273041
