Porting nonnull attribute to attributor.
Details
Diff Detail
Event Timeline
Currently, nonnull is deduced if all the return values are defined by nonnull return function.
I'll make it see function arguments later.
| llvm/include/llvm/Transforms/IPO/Attributor.h | ||
|---|---|---|
| 676 | copy and past comment, also above | |
| llvm/lib/Transforms/IPO/Attributor.cpp | ||
| 720 | Undef values are fine. undef basically allows you to "chose" a value, so we can "chose" something other than null. | |
| 725 | You should (also) ask for the AANonNull attribute for the value and if you get an abstract attribute that says nonnull we are fine. That should subsume some of the code below and also trigger ones we have AANonNullArgument. | |
| 732 | You should ask value tracking via isKnownNonZero here as well. It will need to be improved still to use optimistic information but it is better than nothing. | |
| 744 | The predicate above will be reused later on. Maybe "generate" it in a function in AANonNullImpl for a given Attributor. | |
| llvm/lib/Transforms/IPO/Attributor.cpp | ||
|---|---|---|
| 683 | Explain the behavior of the generate better, what does it return, true for nonnull? | |
| 728 | Why would you indicate a fixpoint here? If there is no attribute present in the IR we can still deduce it. | |
| 739 | No braces around a single statement | |
| 760 | Why do we need to duplicate the A.getAAFor ...  code and the ceck afterwards? | |
| 768 | In that case, move the "indicatePessimisticFixpoint" from the init method here. Than one can at least see what is going on. | |
| llvm/test/Transforms/FunctionAttrs/nonnull.ll | ||
| 107 | One more with inbounds (below, untested!) and maybe some without inbounds while we are at it. ; CHECK: define i8* @test10
; FIXME: missing nonnull
; ATTRIBUTOR: define i8* @test10
define i8* @test10(i8* %a, i64 %n) {
  %cmp = icmp ne i64 %n, null 
  call void @llvm.assume(i1 %cmp)
  %b = getelementptr inbounds i8, i8* %a, i64 %n
  ret i8* %b
} | |
| llvm/lib/Transforms/IPO/Attributor.cpp | ||
|---|---|---|
| 687 | Why isn't isAssumedNonNull() sufficient here? | |
| 740 | Please add here something like: FIXME: The `AAReturnedValues` should provide the predicate with the `ReturnInst` vector as well such that we can use the control flow sensitive version of `isKnownNonZero`. This should fix `test11` in `test/Transforms/FunctionAttrs/nonnull.ll` | |
| 764 | CHANGED | |
| 770 | I don't think you need the "Impl" part. | |
- Deduce nonnull on functions argument if we have nonnull in whole call sites (mostly based on D59202).
- Deduce nonnull on a call site argument if the passed value is assumed nonnull.
I think this is almost ready.
| llvm/include/llvm/Transforms/IPO/Attributor.h | ||
|---|---|---|
| 261 | No newline and "sites" in the first line. | |
| llvm/lib/Transforms/IPO/Attributor.cpp | ||
| 682 | "Generate a predicate that checks if a given value is assumed nonnull". | |
| 705 | This comment is a bit out of place. I don't think you need it at all. | |
| 835 | You can pass this value to the AbstractAttribute constructor (via a new AANonNullImpl and AANonNull constructor so there is no need to overload this method and keep track of the argument number. | |
| 854 | isKnowNonNull(&V, ...) ? | |
| 973 | Could you move this to the remaining Attributor impl, please. | |
| llvm/test/Transforms/FunctionAttrs/nonnull.ll | ||
| 283 | These are a lot of copies. Could we somehow make the "default" check and only specialize/copy when the two (-funcattr and -attributor) generate different code? Or is this already what you did and I just missed it? Also, could you add FIXMEs where we fail to deduce something with the attributor? | |
| llvm/lib/Transforms/IPO/Attributor.cpp | ||
|---|---|---|
| 854 | 
 Could you explain more detail? | |
| llvm/lib/Transforms/IPO/Attributor.cpp | ||
|---|---|---|
| 854 | Sorry. Any reason you do not use the isKnownNonNull functionality here (as you did above)? The check for an AANonNullImpl attribute will only succeed if the values is a call or argument. | |
Hello,
Maybe you remember https://reviews.llvm.org/D53342 ? Should we do it “InstCombine”-way or some alternative one ?
It could fit Atributor nicely, but not sure if nonnull dropping isn’t something weird for Attributor.
I did not, thanks for reminding me. What do you mean by the "InstCombine"-way and what alternatives do you have in mind?
Sorry that I'm a bit lost at the moment, I looked at D53342 and I seemed there are still problems with that patch even though the idea is very useful and we should add it.
I want to ask, whether it is okay to set/drop nonnull attribute in InstCombine or you would like to see this setting/dropping in Attributor.
If you are OK with “InstCombine”-way, I can rebase/update my patch.
I think placing attributes for known library functions should not be part of the Attributor. So, rebase & update your patch and I'll review it.
| llvm/lib/Transforms/IPO/Attributor.cpp | ||
|---|---|---|
| 854 | I understand you are saying about isKnownNonZero :). | |
define internal i32* @f1(i32* %arg) {
bb:
  %tmp = icmp eq i32* %arg, null
  br i1 %tmp, label %bb9, label %bb1
bb1:                                              ; preds = %bb
  %tmp2 = load i32, i32* %arg, align 4
  %tmp3 = icmp eq i32 %tmp2, 0
  br i1 %tmp3, label %bb6, label %bb4
bb4:                                              ; preds = %bb1
  %tmp5 = getelementptr inbounds i32, i32* %arg, i64 1
  %tmp5b = tail call i32* @f3(i32* %tmp5)
  br label %bb9
bb6:                                              ; preds = %bb1
  %tmp7 = tail call i32* @f2(i32* %arg)
  ret i32* %tmp7
bb9:                                              ; preds = %bb4, %bb
  %tmp10 = phi i32* [ %tmp5, %bb4 ], [ inttoptr (i64 4 to i32*), %bb ]
  ret i32* %tmp10
}
define internal i32* @f2(i32* %arg) {
bb:
  %tmp = tail call i32* @f1(i32* %arg)
  ret i32* %tmp
}
define dso_local noalias i32* @f3(i32* %arg) {
bb:
  %tmp = call i32* @f1(i32* %arg)
  ret i32* null
}Another test case to show some shortcomings of the current approach.
| llvm/lib/Transforms/IPO/Attributor.cpp | ||
|---|---|---|
| 676 | Copy & Paste | |
| 801 | We have to make sure we do not get the AANonNullImpl for the argument back when none exists for the call site argument. So you have to test that the associated value of NonNullCSArgAA is the call site operand, or you check that NonNullCSArgAA is not this. | |
| 835 | It seems without the argument number you have to do a linear search. I didn't realize that. Can we store the argument number but still pass the CS.getArgOperand(ArgNo) to the AANonNullImpl constructor as well? | |
| 838 | Add a strongly worded comment here telling people they should not (explicitly) look at the argument of the callee in this method. | |
| 843 | It's only about the assumed state. So, UNCHANGED as we assumed non-null already otherwise updateImpl wouldn't be called. | |
| 854 | That is what I meant all along ;) | |
| llvm/test/Transforms/FunctionAttrs/nonnull.ll | ||
| 152 | We derive nonnull here because test13 is dead, correct? Maybe that is not what you wanted to test ;) | |
We accumulated some unnecessary virtual declarations in the Attributor. I created D64637 to remove them in already committed patches. It would be good if you remove them in your active patches too.
| llvm/lib/Transforms/IPO/Attributor.cpp | ||
|---|---|---|
| 801 | 
 I could understand the previous implementation didn't work well. But I couldn't understand the latter. I think it is sufficient to query CS.getArgOperand(ArgNo) to Attributor, isn't it? | |
| llvm/lib/Transforms/IPO/Attributor.cpp | ||
|---|---|---|
| 978 | registerAA(*new AANonNullCallSiteArgument(CS, i, InfoCache), i); | |
Small nits that should be fixed but other than that this looks good to me.
| llvm/lib/Transforms/IPO/Attributor.cpp | ||
|---|---|---|
| 833 | I think we should switch the order here. If we have an AANonNull for the call site we should use it for sure. Otherwise, look at the paramAttr and ask isKnownNonZero. The latter is not necessarily cheap so we should use information computed already first. You want CallBase not CallInst, or better, compare the anchoredvalue with the CS inst. | |
| 846 | The whole "direction" concept is something I talked about but I don't think we actually need it. Let's just say we need to do it to avoid circular conclusions. | |
| 853 | This seems like something we can move in the initialize method as it should not change over time. | |
| 857 | I improved getAAFor such that whenever it returns a non-null pointer the state of that abstract attribute is valid. That means we do not need to check for a valid state at the call sites. | |
| llvm/test/Transforms/FunctionAttrs/nonnull.ll | ||
| 4 | We need a target layout here. The test is now dependent on the sizes of pointer (I think) and without target layout it might take the host values which would be bad for 32bit machines. | |
| 175 | Will be fixed with D64708. | |
No newline and "sites" in the first line.