NOTE: Note that no attributes are derived yet. This patch will not go in alone but only with others that derive attributes. The framework is split for review purposes. This commit introduces the Attributor pass infrastructure and fixpoint iteration framework. Further patches will introduce abstract attributes into this framework. In a nutshell, the Attributor will update instances of abstract arguments until a fixpoint, or a "timeout", is reached. Communication between the Attributor and the abstract attributes that are derived is restricted to the AbstractState and AbstractAttribute interfaces. Please see the file comment in Attributor.h for detailed information including design decisions and typical use case. Also consider the class documentation for Attributor, AbstractState, and AbstractAttribute.
Details
Diff Detail
- Repository
- rG LLVM Github Monorepo
- Build Status
Buildable 31766 Build 31765: arc lint + arc unit
Event Timeline
llvm/lib/Transforms/IPO/Attributor.cpp | ||
---|---|---|
180 | That note, and the ability to annotate multiple call sites at once, will be removed in the next update. |
I haven't been through the details yet, but I have a few high-level thoughts.
This is a lot of code and I think a high-level description of the framework would be very helpful, like what are the abstract states/lattice values, how do we transition between them, what is the strategy for iterating over functions, how are we applying & materializing updates, how expensive is this analysis?
As this seems to be a generic framework it would be helpful to describe how to hook up a new attribute for inference.
And finally, what can we do to check correctness? As the current attribute definitions are a bit fuzzy, it seems this could lead to problems down the road, in case we infer invalid attributes, which only get used (and cause problems) a bit later. Assuming we have a clear definition of a set of attributes, how hard would it be to verify the attributes in a module (maybe we are doing it already)?
Thanks for starting the review process!
This is a lot of code and I think a high-level description of the framework would be very helpful,
Agreed. I will add a high-level description (as a comment) tomorrow to sum up how the framework is supposed to work/be used. Some details are included below.
like what are the abstract states/lattice values,
This is up to the individual abstract attributes (which are introduced in the other patches). Generally, the Attributor assumes the AbstractAttribute::updateImpl method will be monotone and the underlying lattice is of finite depth. That being said, the iteration threshold and the fact that abstract attributes have to allow to fall back to a known correct solution, should make this framework behave correctly (and terminate) even if the properties are not fulfilled.
how do we transition between them, what is the strategy for iterating over functions, how are we applying & materializing updates, how expensive is this analysis?
The Attributor calls the AbstractAttribute::updateImpl method on all abstract attributes that might have been impacted by changes in the last iteration of the analysis. There is currently no particular order specified but it is important to note that we look at all "potentially impacted abstract attributes" not the underlying IR. That means we often do not need to rerun analysis on the whole module every iteration.
The set of "potentially impacted abstract attributes" is collected on the fly through a hook in the Attributor::getAAFor method. If an abstract attribute P queries the abstract attribute Q (for whatever reason! Note that Q could be P itself), a dependence from P to Q is recorded. If Q ever changes in the future, the updateImpl method of P is invoked in the next iteration.
If an optimistic fixpoint is reached, because no changes happened in an iteration, or a pessimistic fixpoint is enforced, due to the iteration threshold, we manifest all abstract attributes that are improving the status quo.
Materialization is generally up to the abstract attributes. There are default methods in the AbstractAttribute class that, with minimal overloading effort, cover general cases, e.g., one attribute on a function, argument, return value, etc.. Slight variations from this norm can be achieved by overloading the helper functions, e.g., AbstractAttribute:getDeducedAttributes, for more elaborate schemes the AbstractAttribute::manifest method can be overloaded.
The cost is "empirically non-existent" on LLVM-TS and SPEC2006 when I run with all the abstract attributes that are up for review. This patch adds virtually no overhead on its own.
For my experiments, I collected samples of (I think) 11 runs of a fully loaded system with 112 threads. There was always (= for each of the patches in this patch set) at most one significant compile and/or runtime change reported by lnt, the difference was very small and it sometimes improved and sometimes regressed.
As this seems to be a generic framework it would be helpful to describe how to hook up a new attribute for inference.
Again agreed. As part of the overview which I forgot to include and mentioned above, I'll describe the necessary steps. For now, if you are interested, you can take a look at D60074, probably the smallest patch I have that adds a new attribute.
And finally, what can we do to check correctness? As the current attribute definitions are a bit fuzzy, it seems this could lead to problems down the road, in case we infer invalid attributes, which only get used (and cause problems) a bit later. Assuming we have a clear definition of a set of attributes, how hard would it be to verify the attributes in a module (maybe we are doing it already)?
I'm unsure I understand this question. If a fixpoint is reached, as reported by the AbstractAttribute::updateImpl methods, we should have derived "correct" information. Obviously, there might be a bug in the logic of the AbstractAttribute::updateImpl function, or the associated materialization, the dependence tracking in the attributor, etc. but I do not have a better solution than more (stress) tests (see D59903). I open a few bugs while I was developing attribute deductions in this framework and compare the results to the existing one. We current derive different things we should not as described in PR41336 and PR41328. More attributes will also trigger, or better expose, "bugs" down the line (see D59917) and we will probably have to manually look into the root cause as they pop up, was the attribute not correct or later reasoning based on it.
It looks like this is a completely new approach to attribute inference compared to the post-order function atters pass?
It'd be really good to actually spell it out nicely. Is this just version #2 of that? Can we use the same name, and put the code for this behind a flag or something to allow experimenting?
Generally, and I think this goes to the comments already made, I think this needs a design overview. This could be in the commit log, but increasingly I think we'd benefit from having a more high level practice of including design documents with commits like this. So either in the file comment of the header file for this framework, or if long/complex enough, in a markdown document referenced form the file comment, I'd like to see an overview of the design and approach you're pursuing to do more complete attribute deduction.
I also think we should be clear what is happening, so it would be good to come up with terms and define them precisely. I think "deduction" should be used more for the *computing* of attributes by reasoning about the properties of the IR. And "inference" should be adding attributes due to some extrinsic knowledge such as target library info, command line flags, etc. But totally open to different / better terminology.
llvm/include/llvm/Transforms/IPO/Attributor.h | ||
---|---|---|
256–262 | Note that these use doxygen syntax but aren't actually doxygen comments or terribly useful for that IMO. I'd much rather more detailed doxygen comments on the specific types than trying to partition up the code into sections. Just my preference. |
It is, one of the reasons I put it in a separate file. " post-order function atters" is actually 5-7 separate "passes" over an SCC just living in the same LLVM-IR pass.
It'd be really good to actually spell it out nicely.
Good point. I guess I can put that in the commit message and also work on other descriptions.
Is this just version #2 of that? Can we use the same name, and put the code for this behind a flag or something to allow experimenting?
I "kinda" could, I prefer not to. There are various justifications, let me give two now and it that is not sufficient I'll provide more:
- "functionattrs" outgrow "function attributes" a while ago when it started to derive argument attributes. The attributor framework is designed to derive everything, from function, to argument and return attributes, over instruction "attributes/metadata" to things that do not have an LLVM-IR equivalent.
- The design, as well as the capabilities, of the function attribute deduction pass is fairly specific. I would like to prevent "reuse" in either direction as the Attributor framework is supposed to replace the existing code with better deduction across the board.
Generally, and I think this goes to the comments already made, I think this needs a design overview. This could be in the commit log, but increasingly I think we'd benefit from having a more high level practice of including design documents with commits like this. So either in the file comment of the header file for this framework, or if long/complex enough, in a markdown document referenced form the file comment, I'd like to see an overview of the design and approach you're pursuing to do more complete attribute deduction.
I totally agree, as I did with Florian who requested better documentation. I added a first draft of an high-level file comment today. I will also provide a more elaborate commit message and class comments asap. Some clang cuda bugs kept me busy all day :(
I also think we should be clear what is happening, so it would be good to come up with terms and define them precisely.
Seems very reasonable to me. I won't claim I have a clear terminology throughout the commits/comments.
I think "deduction" should be used more for the *computing* of attributes by reasoning about the properties of the IR. And "inference" should be adding attributes due to some extrinsic knowledge such as target library info, command line flags, etc. But totally open to different / better terminology.
Fine with me. I think I exclusively used "deduction" (so far) as it the expected usage model. I should probably add some Attributor documentation which could also specify how we talk about things. Until then, please let me know if there is anything you want me to clarify!
llvm/include/llvm/Transforms/IPO/Attributor.h | ||
---|---|---|
256–262 | I did so in the header now, except the last "section" comment. I thought/hoped these will help read and extend the files later on. If that hope is not shared I can remove them everywhere. | |
llvm/lib/Transforms/IPO/Attributor.cpp | ||
180 | I'm still debating with myself if this is necessary/useful or not. |
Right, my thinking for checking correctness was along the following lines: Given some input IR with attributes, it should be fairly easy to check if (some of the) provided attributes are consistent with each other and the IR, right? At least easier than deducing them ;) I was wondering if we could implement a function/module verifier to check certain attributes. That way, we could check that certain types of attributes deduced are consistent with the underlying IR, without the attributes being used for some other optimization.
llvm/include/llvm/Transforms/IPO/Attributor.h | ||
---|---|---|
30 | nit: not sure if automagically is the best way to put it. | |
272 | it's defined as struct? | |
llvm/lib/Transforms/IPO/Attributor.cpp | ||
79 | This doesn't do anything at the moment? | |
107 | Would it make sense to put this in Attribute? | |
111 | could be shortened to just return Old.getValueAsInt() >= New.getValueAsInt() | |
320 | Passing in the whole Attributor seems a bit heavy handed. Conceptually, IIUC, to update an attribute, we just need a way to query other attributes (and potentially some other IR info) as input from the attributor, right? | |
452 | Is the plan here for the Attributor to provide an interface for the Attribute implementations to query "interesting" instructions, instead of the Attribute implementations looking at the IR themselves? It seems this might impose a tighter coupling of Attributes <-> Attributor than intended with the flexible attribute registration system. Related to the comment at line 319, it might be worth splitting this into a more restricted query/info interface that Attribute::update can use. It seems not related to the fixed point iteration in Attributor. |
Yes, we could implement this check, and yes doing so in this framework would probably help. Unfortunately, the logic we need for determining attributes and the one we need to "verify" attributes is only similar but often not the same. That said, we can most probably share a lot of it. So if that is something we want to look into I would propose to add AbstractAttribute::verify functionality which, if overloaded, will try to "disprove" existing attributes and report them as errors/warnings/notes/remarks/... Some of the logic in the updateImpl and verify is then overlapping for most attributes. It is unclear to me right now if you would have any need for the fixpoint framework though. I guess you'll end up with a separate "AttributeVerify" pass instead.
llvm/include/llvm/Transforms/IPO/Attributor.h | ||
---|---|---|
30 | I'll remove the magic :( | |
272 | True, but there is little difference anyway, right? I use structs here as there are no members. | |
llvm/lib/Transforms/IPO/Attributor.cpp | ||
107 | Potentially, yes. I don't right now where we would need similar functionality but I can most certainly move it there either way. | |
111 | Will do. | |
320 | Correct, query other in-flight attributes and cached IR information. I pass the Attributor because the above functionality is pretty much everything it exposes as public interface anyway (I think). Would you like to have another thin interface layer on top? | |
452 | The first part goes into a very interesting direction I have thought about as well. Without doing performance tests (mostly because I didn't have any attributes etc.) I choose this solution for "performance" reasons. If we invert it, which we still can and which I do not oppose, every attribute is queried for every instruction in order to decide if it is interesting to cache it, at least in the naive implementation I have in mind right now. If we can come up with a good interface to expose IR information, probably cached, that would be great. If we put that, and the attribute query interface outside the Attributor, that would be OK. Though, the attributor will need to know about attribute queries (now through Attributor::getAAFor<...>(...) to track dependencies). So far I decided this coupling is not too bad even though we have to hardcode "interesting" here. If no one has super strong feelings about this, I'd argue we go with this for now and once we have a good handle on the use cases we try to design a clean interface. But again, I will not oppose any proposal for such an interface right now. |
@fhahn, I could split the Attributor and the part that identifies the AbstractAttributes in the IR.
This way the Attributor becomes more lightweight, there is less coupling, and its easier to reuse the framework for other AbstractAttributes.
Introduce a separate information cache (slims down the Attributor). Allow reuse
of the Attributor framework for a subset of the "default" attributes as well as
custom ones.
llvm/lib/Transforms/IPO/Attributor.cpp | ||
---|---|---|
79 | It has a call site but since there are no attributes yet, nothing is done. | |
320 | I slimmed down the Attributor further. What do you think now? | |
452 | The new separation of InformationCache allows you to extend it for custom attributes. I kept the old way for "default" attributes for now. |
llvm/include/llvm/Transforms/IPO/Attributor.h | ||
---|---|---|
241 | I am not sure "Determine opportunities to derive 'default' attributes ' is very clear. This is where we add the initial abstract attributes to the state and also collect the IR info, right? Maybe worth stating that a bit more explicitly. I am not sure what the long term plan for the distinction between 'default' and other attributes is, but it might be worth to only add this notation once it is required? | |
276 | 32 entries per attribute seems a bit large at a first look. Might be interesting to gather some statistics on the average number of dependencies. | |
llvm/lib/Transforms/IPO/Attributor.cpp | ||
274 | QueriedAAs is a SmallPtrSet, so the order we insert items into the worklist here is not deterministic. Is there anything preventing us from reaching different fixed points when iterating over items in different orders? | |
435 | nit: information. | |
450 | spurious newline? | |
461 | IMO this and above comment does not add any meaningful information. | |
464 | IMO this can also be dropped, as the string in the assert gives enough details about the check. | |
468 | te -> the | |
486 | runAttributorOnSCC(SCCFunctions) returns true if it changed the IR, right? Shouldn't the return values be switched, i.e. do not preserve anything, if something changed? As a side note, shouldn't we be able to preserve any analysis that does not make use of attributes? From a correctness perspective, we should be able to preserve all analysis, but we would like to invalidate those that could make use of the additional info. AFAIK at least in the new pass manager, it should be possible to preserve function-level analysis in a module/SCC pass. |
I'll update the patch with the requested (small) changes asap.
Are there more design comments? (@chandlerc?)
llvm/include/llvm/Transforms/IPO/Attributor.h | ||
---|---|---|
241 | Mh, fair point. Though, I think it makes sense to have a set of attributes that live here and the ability to add some that do not. Would it help if I list the "default" attributes? Do you think I should hide this interface for now? | |
276 | Fair point. I can do that with the attributes we have. (or ask the GSoC students to do it). | |
llvm/lib/Transforms/IPO/Attributor.cpp | ||
274 | Good catch. Short answer, if the attribute update methods are sane and an optimistic fixpoint is reached, we should get a unique one. If not, there might be different ones reached. I can make it all deterministic. | |
486 | Good catch.
I should be, and we should list some. I'll look into it. I *think*, right now the placement ensures that we do not invalidate anything. |
As additional verification, we could run the attributor again, after materializing the changes from the initial run and check if no new facts are discovered, if the iteration limit is not exceeded. We have something similar in NewGVN and it has been helpful to surface missing links in the virtual representation/ non-determinism (links between the abstract attributes in the Attributor case for example).
I like this idea very much. I'll implement it (as a EXPENSIVE_CHECK) a verify method. Though, I can only check if no new facts were materialized because virtual facts might be rediscovered.
llvm/include/llvm/Transforms/IPO/Attributor.h | ||
---|---|---|
180 | I'm actually not happy anymore with the fact that AAType::ID is supposed to be of AttributeKind type because of string attributes. Since it is an integer in the maps already we can work around that but proposals to change this are welcome. |
llvm/include/llvm/Transforms/IPO/Attributor.h | ||
---|---|---|
26 | An example would be useful here for explaining the abstract attributes "(e,g,, something)". | |
49 | I'm not yet sure how to think about this. Many current attributes are derives with the help of LLVM utility functions which walk use-def chains, etc. How can an attributor use those utilities and know on what else it depends? Or are we going to enhance all utilities to return an optional dependency vector or similar? Or is the idea that we don't change the IR until the end, so all such utilities see only pessimistic information? | |
llvm/lib/Transforms/IPO/Attributor.cpp | ||
366 | This is true even if we did't reach the fixpoint and has to reset some attributes to their pessimistic state? | |
402 | Are there supposed to be some other cases here? |
llvm/include/llvm/Transforms/IPO/Attributor.h | ||
---|---|---|
26 | I have an example in the description of getAAFor below. I will add "(see theAttributor::getAAFor description)" here. | |
49 |
I strongly urge us *not* to change the IR until the end. That is what I have done so far in all the following attributor patches. Trying to clean up after optimistically modifying the IR seems like a very fragile approach. It breaks with caches all over the place and with the common concept of the IR as the origin of truth.
We can do that if necessary. I haven't so far. Just for the record, current way of using utilities doesn't benefit from "in-flight" information either. All new deductions have been strictly more powerful on the LLVM-TS and SPEC2006 benchmarks (if I remember correctly).
I have various patches to derive attributes already on phabricator so we can actually look at them in detail (see the Stack button above). Some design decisions are described in the class comment of the Attributor but let me rephrase with regards to your question: If you use utility functions you get potentially pessimistic answers. You can walk def-use chains, etc., just fine but you should query attributes in flight whenever possible. By construction, these attributes should provide the best possible result. An attribute can rely on utility functions as well but should only do so after other means failed to provide justification for a better state. I don't exclude that we might want to re-implement/enhance utility functions at some point such that they can, in turn, use in-flight attributes. | |
llvm/lib/Transforms/IPO/Attributor.cpp | ||
366 | [I read it as "Is this true...".] No it is not. If we do not reach an optimistic fixpoint we force a pessimistic one. If we modify the IR, we can, in the re-run, again find a better stable state (which could be an optimistic or pessimistic fixpoint) that allows again to enhance the IR. After thinking about the use of utility functions a bit more, I guess this can actually also happen if we have a fixpoint based on pessimistic information. Maybe we have to weaken the "llvm_unreachable" at some point, or we have to avoid/extend utility functions. For now, and with up to 32 iterations, I haven't seen a single non-optimistic fixpoint in all the test runs on LLVM-TS and SPEC2006. | |
402 |
Yes, at some point. In this patch *no* attributes are registered or derived and we also do not have "interesting" instructions yet. They are introduced with the attributes patches (see dependent patches). I didn't know a better way to split this. I opted to keep the framework together and put the attributes in separate patches. If you prefer it any other way, I'd be more than happy do change it. |
Make the Attributor a Module Pass and move it earlier in the pipeline. Disable it by default.
A couple of things, but otherwise this LGTM. We can iterate and add this this in tree (my preference is to get his basic infrastructure committed so that the commits that add deduction for particular attributes can all be separate).
llvm/lib/Transforms/IPO/Attributor.cpp | ||
---|---|---|
366 | Okay. I don't like theoretically-unsound checks (that aren't explicitly labeled as such), however, and the fact that this is enabled by default when EXPENSIVE_CHECKS is defined isn't good if this isn't sound. Please update the comment in the cl::opt to indicate that this might spuriously fail for pessimistic fixed points, and don't enable this by default with EXPENSIVE_CHECKS. | |
400 | Please add a comment here explaining what gets added here (so that it's clear why it's here and empty). |
I think it is still overly aggressive with analysis invalidation, but I agree it would be good to iterate on those things once the basics are in. Maybe it's worth a FIXME comment. Otherwise, looks good, thanks for the patience.
I added the FIXME and I verified that at the new point in the (old PM) pipeline there is no pass that is destroyed because of the Attributor. It is not part of the new PM pipeline right now.
llvm/lib/Transforms/IPO/Attributor.cpp | ||
---|---|---|
366 | You are right. I did that now. I did not before because for the first few attributes this should be sound. They do use utility functions but such that they should be able to get the best possible result even if the assumed information is not manifested in IR. The first attribute, returned on arguments, is one of them. I will find a way to run this such that only attributes with this property are verified and then I'll go back to "auto-verify" it when expensive checks are enabled. | |
400 | Done |
An example would be useful here for explaining the abstract attributes "(e,g,, something)".