This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
docs/
3/3
LangRef.rst
-
include/llvm/IR/
-
llvm/
-
IR/
-
Argument.h
-
lib/
-
Analysis/
1/1
ValueTracking.cpp
-
IR/
-
Function.cpp
-
Transforms/IPO/
-
IPO/
1/1
FunctionAttrs.cpp
-
test/
-
Analysis/ValueTracking/
-
ValueTracking/
1/1
known-nonnull-at.ll
-
Transforms/
-
Attributor/
-
align.ll
-
nonnull.ll
-
FunctionAttrs/
-
nonnull.ll
-
InstCombine/
-
call_nonnull_arg.ll
-
unused-nonnull.ll

Differential D90529

Allow nonnull/align attribute to accept poison
ClosedPublic

Authored by aqjune on Oct 31 2020, 6:56 AM.

Download Raw Diff

Details

Reviewers

jdoerfert
efriedma
nikic
fhahn
hfinkel
sstefan1
baziotis

Commits

rG4479c0c2c0be: Allow nonnull/align attribute to accept poison

Summary

Currently LLVM is relying on ValueTracking's isKnownNonZero to attach nonnull, which can return true when the value is poison.
To make the semantics of nonnull consistent with the behavior of isKnownNonZero, this makes the semantics of nonnull to accept poison, and return poison if the input pointer isn't null.
This makes many transformations like below legal:

%p = gep inbounds %x, 1 ; % p is non-null pointer or poison
call void @f(%p)        ; instcombine converts this to call void @f(nonnull %p)

This semantics makes propagation of nonnull to caller illegal.
The reason is that, passing poison to nonnull does not immediately raise UB anymore, so such program is still well defined, if the callee does not use the argument.
Having noundef attribute there re-allows this.

define void @f(i8* %p) {       ; functionattr cannot mark %p nonnull here anymore
  call void @g(i8* nonnull %p) ; .. because @g never raises UB if it never uses %p.
  ret void
}

Another attribute that needs to be updated is align. This patch updates the semantics of align to accept poison as well.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

aqjune created this revision.Oct 31 2020, 6:56 AM

Herald added subscribers: llvm-commits, dexonsmith, hiraditya. · View Herald TranscriptOct 31 2020, 6:56 AM

Herald added a project: Restricted Project. · View Herald TranscriptOct 31 2020, 6:56 AM

aqjune requested review of this revision.Oct 31 2020, 6:56 AM

I tried to update Attributor as well, but the infrastructure was quite big; I'll update Attributor as well if someone points where to fix.

Harbormaster completed remote builds in B77139: Diff 302087.Oct 31 2020, 7:51 AM

make LangRef explicitly describe non-ub

Harbormaster completed remote builds in B77145: Diff 302096.Oct 31 2020, 9:09 AM

To share my understanding about how nonnull is used in optimizations:

It is mainly used to fold away null pointer checks.

define void @f(i8* nonnull %p) {
  if (icmp eq %p, null) ret void         ; this check can be folded to true
  ...
}

Note that the new nonnull-or-poison semantics is fine in the above case because if %p is poison we can fold icmp eq %p, null into false.

A similar but slightly different case:

...
call void @f(i8* nonnull %p)
if (icmp eq %p, null) ret void

In this case, the nonnull-or-poison semantics cannot make the comparison after call folded, because f may not use the poison argument at all, making the program well-defined.
But, if f has noundef attribute as well, this is fine because passing null to f is now UB:

...
call void @f(i8* noundef nonnull %p)
if (icmp eq %p, null) ret void            ; %p is always non-null pointer!

It is combined with dereferenceable_or_null to infer dereferenceable.

This is still fine with the new nonnull-or-poison because passing poison to dereferenceable_or_null is still UB, guaranteeing that the pointer is never poison.

I think we have two choices here:

Don't raise UB when "value attributes" are passed a "wrong value", e.g., null for a nonnull attribute, but make the value poison. Use nonull + noundef to make it UB.
Make all "value attributes" accept poison without raising UB.

I was in the past pushing for 1), I don't have the link handy but we can probably find it. I think the last time I brought this up was the noundef discussion actually.
One of my examples was the gep one shown in the commit message,
I vaguely remember one where the user "broke the contract" but in a way they would assume to be harmless, e.g., they did not cause any side-effect, maybe something like:

void foo(bool valid, X& x) {
  if (!valid) return;
  ...
}

obj_ptr = null;
foo(obj_ptr != null, *obj_ptr);

I remember @efriedma was not a fan of 1) at the time, unsure if that is still the case with noundef in place.

If we don't do 1), we should talk about 2) before we make nonnull special. I fail to see the reason it is different from any other (or at least most) "value attributes".

This revision now requires changes to proceed.Oct 31 2020, 3:35 PM

In D90529#2366723, @jdoerfert wrote:

I think we have two choices here:

Don't raise UB when "value attributes" are passed a "wrong value", e.g., null for a nonnull attribute, but make the value poison. Use nonull + noundef to make it UB.

Make all "value attributes" accept poison without raising UB.

If we don't do 1), we should talk about 2) before we make nonnull special.

It seems these two choices are entangled.
If f(nonnull poison) is okay (in other words, not UB), then f(nonnull null) shouldn't be UB as well.
The reason is that poison can be folded into null in any time.
For example, when inbounds is stripped from v = gep inbounds ..., v can be transformed from poison to null pointer.

I fail to see the reason it is different from any other (or at least most) "value attributes".

I had a thought about this, and here's what I think:
Each value attribute has a primary transformation to target.
For example, nonnull is for null comparison folding, and dereferenceable is for reordering of load instructions by guaranteeing that dereferencing the pointer never raises UB (unless the pointer is freed).

I think nonnull is different from dereferenceable in that allowing it to be poison doesn't break the intended optimization.
This folding is still okay even if the pointer is poison.
By allowing poison, further optimizations can be done as well because gep inbounds ptr, 1 can be marked as nonnull.
However, allowing dereferenceable to be poison will block such optimizations, which doesn't seem quite desirable IMO.

This causes the properties of attributes slightly heterogeneous, but I think it is fine if the goal is to support more optimizations.

uenoku added a subscriber: uenoku.Nov 1 2020, 10:12 AM

In D90529#2366897, @aqjune wrote:

In D90529#2366723, @jdoerfert wrote:

I think we have two choices here:

Don't raise UB when "value attributes" are passed a "wrong value", e.g., null for a nonnull attribute, but make the value poison. Use nonull + noundef to make it UB.

Make all "value attributes" accept poison without raising UB.

If we don't do 1), we should talk about 2) before we make nonnull special.

It seems these two choices are entangled.
If f(nonnull poison) is okay (in other words, not UB), then f(nonnull null) shouldn't be UB as well.
The reason is that poison can be folded into null in any time.
For example, when inbounds is stripped from v = gep inbounds ..., v can be transformed from poison to null pointer.

Agreed. So let's go with 1) and we change all the "value attribute" semantics to produce poison on violation.
We have noundef for the UB case, that was the reason I wanted it ;)

I fail to see the reason it is different from any other (or at least most) "value attributes".

I had a thought about this, and here's what I think:
Each value attribute has a primary transformation to target.
For example, nonnull is for null comparison folding, and dereferenceable is for reordering of load instructions by guaranteeing that dereferencing the pointer never raises UB (unless the pointer is freed).

I think nonnull is different from dereferenceable in that allowing it to be poison doesn't break the intended optimization.
This folding is still okay even if the pointer is poison.
By allowing poison, further optimizations can be done as well because gep inbounds ptr, 1 can be marked as nonnull.
However, allowing dereferenceable to be poison will block such optimizations, which doesn't seem quite desirable IMO.

This causes the properties of attributes slightly heterogeneous, but I think it is fine if the goal is to support more optimizations.

I don't think we should argue this way, at least it should be the last resort. An attribute describes a property, doesn't matter if it is
nonnull, align, or dereferenceable, it's a property of the value. Now, due to the way things are it is possible to violate
that property and then we need to define what is happening. If we do this on a case-by-case basis and with specific transformations
in mind, we will make the IR more complex, the attributes confusing, and in the end non-composeable.

What is the reason not to go with 1) above, so to remove the UB behavior and produce poison. We write an RFC and do the change.
FWIW, that is what AAUndefinedBehavior in the Attributor already does, only with noundef it will deduce UB for nonnull null.

In D90529#2367050, @jdoerfert wrote:

Agreed. So let's go with 1) and we change all the "value attribute" semantics to produce poison on violation.
We have noundef for the UB case, that was the reason I wanted it ;)

Thank you, glad to hear that moving towards the change!

In D90529#2367050, @jdoerfert wrote:

I don't think we should argue this way, at least it should be the last resort. An attribute describes a property, doesn't matter if it is
nonnull, align, or dereferenceable, it's a property of the value. Now, due to the way things are it is possible to violate
that property and then we need to define what is happening. If we do this on a case-by-case basis and with specific transformations
in mind, we will make the IR more complex, the attributes confusing, and in the end non-composeable.

I think the discussion finally falls into which attribute is a value attribute and which is not.
Value attributes should have the same semantics as you said.
Sometimes value attribute and non-value attribute can interact with each other (e.g., byval implies nonnull unless null is a valid pointer), but such complexity should be only at there.

FWIW, that is what AAUndefinedBehavior in the Attributor already does, only with noundef it will deduce UB for nonnull null.

Aha, thanks for the info.

In D90529#2367694, @aqjune wrote:

In D90529#2367050, @jdoerfert wrote:

Agreed. So let's go with 1) and we change all the "value attribute" semantics to produce poison on violation.
We have noundef for the UB case, that was the reason I wanted it ;)

Thank you, glad to hear that moving towards the change!

Wasn't that my position for more than a year now ;)

In D90529#2367050, @jdoerfert wrote:

I don't think we should argue this way, at least it should be the last resort. An attribute describes a property, doesn't matter if it is
nonnull, align, or dereferenceable, it's a property of the value. Now, due to the way things are it is possible to violate
that property and then we need to define what is happening. If we do this on a case-by-case basis and with specific transformations
in mind, we will make the IR more complex, the attributes confusing, and in the end non-composeable.

I think the discussion finally falls into which attribute is a value attribute and which is not.
Value attributes should have the same semantics as you said.
Sometimes value attribute and non-value attribute can interact with each other (e.g., byval implies nonnull unless null is a valid pointer), but such complexity should be only at there.

Fair. Though, I think we want to produce poison for one set of attributes for which the name "value attribute" was not well chosen.
So far, the things I think should produce poison not UB are:

(pure) value attributes:

nonnull
align
[used_bits] (not existing yet)

(context) value attributes:

dereferenceable
dereferenceable_or_null
[object_size] (as proposed on the list)

WDYT?

FWIW, that is what AAUndefinedBehavior in the Attributor already does, only with noundef it will deduce UB for nonnull null.

Aha, thanks for the info.

In D90529#2368303, @jdoerfert wrote:

Fair. Though, I think we want to produce poison for one set of attributes for which the name "value attribute" was not well chosen.
So far, the things I think should produce poison not UB are:

(pure) value attributes:

nonnull

align

[used_bits] (not existing yet)

(context) value attributes:

dereferenceable

dereferenceable_or_null

[object_size] (as proposed on the list)

WDYT?

In case of dereferenceable, my opinion is still slightly different: it represents the property of the memory at the point.
If a memory block is freed, a same pointer value won't be dereferenceable after the deallocation.

Other than the conceptual reason, my practical concern about dereferenceable is that dereferenceable is hard to use unless it is with noundef.
Since loading poison is UB, the attribute can't still give a guarantee that loading the pointer is well defined.
Furthermore, noundef is hard to infer from the context. For example,

store i32 0, i32* %p
call void @f(i32* %p) ; can we infer %p's noundef & dereferenceable(4) from store?

store i32 0, i32* %p does not guarantee that %p is noundef because it can be partially undef but still dereferenceable (D87994 is a relevant LangRef patch).

The example above was propagating the attribute from caller to callee, but the inverse direction (callee -> caller) is the same: If @f contains a store, it guarantees that the pointer is dereferenceable but not noundef.

In order to regain optimization power, we need a way to attach noundef more aggressively.
D81678 will attach noundef to arguments, and additionally I think it is valid for clang to attach !noundef when lowering non-char typed lvalues to load.

In D90529#2369382, @aqjune wrote:

In D90529#2368303, @jdoerfert wrote:

Fair. Though, I think we want to produce poison for one set of attributes for which the name "value attribute" was not well chosen.
So far, the things I think should produce poison not UB are:

(pure) value attributes:

nonnull

align

[used_bits] (not existing yet)

(context) value attributes:

dereferenceable

dereferenceable_or_null

[object_size] (as proposed on the list)

WDYT?

In case of dereferenceable, my opinion is still slightly different: it represents the property of the memory at the point.
If a memory block is freed, a same pointer value won't be dereferenceable after the deallocation.

That is why it is not listed in the "pure" category ;). It is not only a property of the value, but it still should be split wrt UB behavior. See below for my reasoning based on your example.

Partially related:
It is, nowadays, unclear to me if your interpretation of dereferenceable is what we want though. It is/was my interpretation as well FWIW.
@reames suggested that dereferenceability is not about it pointing to an allocated object but to a memory location that can be loaded without causing observable behavior, i.a., a trap.
Unsure if we want a new attribute for that but certainly there are reasons that you want that information which is a "value property" and not tied to the "allocation status".

Other than the conceptual reason, my practical concern about dereferenceable is that dereferenceable is hard to use unless it is with noundef.

Regardless of my argument below, nobody said you cannot use dereferenceable with noundef "all the time".
I mean, if you argue a current use of dereferenceable would actually require the "UB if violated" behavior, then simply also add noundef as well.

Since loading poison is UB, the attribute can't still give a guarantee that loading the pointer is well defined.

Can you clarify "loading poison is UB". I'm unsure I follow/agree.

FWIW, loading a dereferenceable pointer might result in poison but it's even not clear to me that out-of-bounds accesses are UB as of now (in IR).
Maybe I just forgot where it says I cannot do load i32, i32* (bitcast i8* [alloca i8] to i32*). I certainly would get 3 poison bytes if not UB.

Furthermore, noundef is hard to infer from the context. For example,
store i32 0, i32* %p
call void @f(i32* %p) ; can we infer %p's noundef & dereferenceable(4) from store?
store i32 0, i32* %p does not guarantee that %p is noundef because it can be partially undef but still dereferenceable (D87994 is a relevant LangRef patch).

The example above was propagating the attribute from caller to callee, but the inverse direction (callee -> caller) is the same: If @f contains a store, it guarantees that the pointer is dereferenceable but not noundef.

OK, I'm unsure I get the point though. We can derive dereferenceable in either case, agreed? What is currently better?

(@hideto @stefan1 @okura, I think the Attributor might derive noundef here but should not, wdyt?)

In order to regain optimization power, we need a way to attach noundef more aggressively.

This is correct but orthogonal. Having noundef "backed-in" to other attributes is not helping us to derive noundef it just prevents us to derive other attributes.

D81678 will attach noundef to arguments, and additionally I think it is valid for clang to attach !noundef when lowering non-char typed lvalues to load.

Great.

In D90529#2369578, @jdoerfert wrote:

In order to regain optimization power, we need a way to attach noundef more aggressively.

This is correct but orthogonal. Having noundef "backed-in" to other attributes is not helping us to derive noundef it just prevents us to derive other attributes.

I think dereferenceable(current UB semantics) != dereferenceable(new poison semantics) + noundef. In the past I thought they were equal, but here is the reason...

The reason is that the current dereferenceable accepts partially undefined pointers, IIUC. At least, LangRef does not prohibit it.
noundef allows accepting well-defined pointers only, so there is a slight difference.

Note that this also happens to nonnull: nonnull (current UB sem.) != nonnull(new poison sem.) + noundef.
However, the situation is slightly different, because the change allows more analysis.
gep inbounds p, 1 is now nonnull, which is certainly a benefit.

In order to regain equivalence, nopoison should be introduced. nopoison allows the value to be (partially) undef but not poison.
dereferenceable(current UB semantics) == dereferenceable(new poison semantics) + nopoison holds.

This is really due to the complexity of undef.... :(

Can you clarify "loading poison is UB". I'm unsure I follow/agree.

FWIW, loading a dereferenceable pointer might result in poison but it's even not clear to me that out-of-bounds accesses are UB as of now (in IR).
Maybe I just forgot where it says I cannot do load i32, i32* (bitcast i8* [alloca i8] to i32*). I certainly would get 3 poison bytes if not UB.

I meant 'loading a poison pointer', sorry. load i8* poison raises UB.

In D90529#2369578, @jdoerfert wrote:

OK, I'm unsure I get the point though. We can derive dereferenceable in either case, agreed? What is currently better?

The current dereferenceable definition gives a guarantee that the pointer is not poison. :)
It is good because it guarantees that loads and stores to the pointer don't trap.

BTW, there are !nonnull metadata and llvm.assume with nonnull operand bundle as well - the semantics of those should be updated too.
Maybe we can just follow the semantics of nonnull attribute here.

(reviving an old discussion) to summarize the issue:
The issue was whether it is okay to make dereferenceable %p non-UB if %p was not dereferenceable.

In order to make the attribute useful, noundef %p should be accompanied.
The problem is that we cannot infer noundef %p from store or load in general; noundef can be inferred in limited cases only.

@jdoerfert Kindly ping, what do you think?

I think making align and nonnull to follow poison semantics is still a great thing.
The update of nonnull makes its semantics and existing optimizations about nonnull consistent.
I did not deeply investigate align yet, but I have a good feeling that that moving towards poison semantics will explain more optimizations about align that involve gep inbounds with aligned offsets.

Some nits. I think the direction is right. Update the commit message though. And please wait for an OK by @efriedma .

llvm/docs/LangRef.rst
1231	Mention the combination with `noundef` please.
llvm/lib/Analysis/ValueTracking.cpp
2091–2092	`/* AllowUndefOrPoison */ false`
llvm/lib/Transforms/IPO/FunctionAttrs.cpp
645	see above.
llvm/test/Analysis/ValueTracking/known-nonnull-at.ll
5	maybe rename to make it clear at the call site

This revision is now accepted and ready to land.Dec 2 2020, 8:41 AM

Address comments

aqjune marked 4 inline comments as done.Dec 2 2020, 10:42 PM

aqjune edited the summary of this revision. (Show Details)Dec 2 2020, 10:46 PM

aqjune edited the summary of this revision. (Show Details)

minor fix to a test

Harbormaster completed remote builds in B80911: Diff 309157.Dec 2 2020, 11:22 PM

Harbormaster completed remote builds in B80913: Diff 309159.Dec 2 2020, 11:43 PM

@efriedma gentle ping

Slightly related to this patch: should llvm.assume's nonnull/align operand bundles be updated to follow this semantics as well?
I'll follow any semantics that would be convenient for Attributor's implementation or anything related to this.
I can make a LangRef update patch as well.

Should we have clarification about the semantics of align/nonnull bundle for the assume intrinsic?
When a function call is inlined, the attributes of the parameters may remain as assume's bundles:

call f(nonnull %p)
->
llvm.assume [ "nonnull"(%p) ]

Here are two possible semantics:

When "nonnull"(%p) is at the operand bundle, there also should be "noundef"(%p), otherwise it is a no-op.

; A correct transformation
call f(nonnull noundef %p)
->
llvm.assume [ "nonnull"(%p), "noundef"(%p) ] ; since noundef and nonnull are both in the bundle, it is UB if %p is null

"nonnull"(%p) diverges from nonnull, and immediately raises UB if %p is null. This means that the lowering should be done only when noundef existed.

; A correct transformation
call f(nonnull noundef %p)
->
llvm.assume [ "nonnull"(%p) ] ; this is UB if %p is null

The former makes attribute -> op bundle conversion simpler, whereas the latter makes reasoning from op bundle simpler (well, maybe both are simple enough). :)
I think either option is fine.

In D90529#2449341, @aqjune wrote:

Should we have clarification about the semantics of align/nonnull bundle for the assume intrinsic?

Probably true, good catch. Why not make it the same as the attribute? So,

; A correct transformation
call f(nonnull %p, nonnull noundef %q)
->
llvm.assume [ "nonnull"(%p), "nonnull"(%q), "noundef"(%q)]
; where a `null` value for %p can be assumed to be a poison value and
; a `null` value for %q can be assumed to cause UB because `noundef` of
; poison is UB.

In D90529#2449524, @jdoerfert wrote:
In D90529#2449341, @aqjune wrote:

Should we have clarification about the semantics of align/nonnull bundle for the assume intrinsic?

Probably true, good catch. Why not make it the same as the attribute? So,
; A correct transformation
call f(nonnull %p, nonnull noundef %q)
->
llvm.assume [ "nonnull"(%p), "nonnull"(%q), "noundef"(%q)]
; where a `null` value for %p can be assumed to be a poison value and
; a `null` value for %q can be assumed to cause UB because `noundef` of
; poison is UB.

+1, I'll make a separate patch for this.

I like where this is going. Most of LLVM's alias analysis produce information that only holds if the value is not poison. Since these attributes are derived from said analysis, then it makes sense then they have the same "X is poison or foo(X) holds" semantics.
I agree that certain attributes are different, like dereferenceable. It is useless if the value might be poison as well. Though we may go with the same semantics and then require the noundef attribute to make it useful. Seems like a good way to go as well.

For the partial undef memory access example that Juneyoung gave.. Well, maybe we need to make it UB to dereference a non-deterministic value. Doesn't seem like it's a very useful thing to do, and this non-determinism comes from some previous undefined behavior, so it seems fine to just make dereference of partial undef UB. Simplifies things.

I'll make this patch include the updates in the align attribute first because it helps making fewer patches.
The next patch (operand bundle) will include changes in nonnull/align operand bundles as well.

In D90529#2449703, @nlopes wrote:

For the partial undef memory access example that Juneyoung gave.. Well, maybe we need to make it UB to dereference a non-deterministic value. Doesn't seem like it's a very useful thing to do, and this non-determinism comes from some previous undefined behavior, so it seems fine to just make dereference of partial undef UB. Simplifies things.

There was a discussion for this: https://groups.google.com/g/llvm-dev/c/2Qk4fOHUoAE/m/OxZa3bIhAgAJ
This partially undef thing is a bit painful.. :/

In D90529#2449703, @nlopes wrote:

For the partial undef memory access example that Juneyoung gave.. Well, maybe we need to make it UB to dereference a non-deterministic value. Doesn't seem like it's a very useful thing to do, and this non-determinism comes from some previous undefined behavior, so it seems fine to just make dereference of partial undef UB. Simplifies things.

There was a discussion for this: https://groups.google.com/g/llvm-dev/c/2Qk4fOHUoAE/m/OxZa3bIhAgAJ
This partially undef thing is a bit painful.. :/

If we allow partial undef pointers to be dereferenced, then it's nearly impossible to derive noundef for pointers. So we will have to rely on frontends to annotate. So the question is whether there's any benefit of allowing such accesses and whether that breaks any frontend if not. Can't think of any reason to allow them and that thread doesn't really provide any either AFAICT.

update langref align

I tried to fix Attributor to propagate nonnull/align only when there is noundef, but couldn't: https://godbolt.org/z/Yzsc14
Where should I start?

In D90529#2450746, @nlopes wrote:

If we allow partial undef pointers to be dereferenced, then it's nearly impossible to derive noundef for pointers. So we will have to rely on frontends to annotate. So the question is whether there's any benefit of allowing such accesses and whether that breaks any frontend if not. Can't think of any reason to allow them and that thread doesn't really provide any either AFAICT.

I agree that any frontend won't emit undef when compiling a valid program, but I think the point is that the semantics requires a transformation that hoists a memory instruction to prove that the pointer isn't partially undef, which isn't done by LLVM now.

void f(i64 %offset) {
  p = alloca [16 x i8]
  assume(0 <= offset < 16)
  init(p)
  for (..) {
    v = load p[offset] // if offset is partially undef, this load cannot be hoisted out of the loop
    use(v)
  }
}

aqjune retitled this revision from Allow nonnull attribute to accept poison to Allow nonnull/align attribute to accept poison.Dec 14 2020, 4:03 AM

aqjune edited the summary of this revision. (Show Details)

Harbormaster completed remote builds in B82245: Diff 311540.Dec 14 2020, 4:17 AM

@efriedma ping

ping @efriedma

@jdoerfert Would it be okay if I land this after a week? D91480 also uses the semantics described in this patch as well.

For the attributor, add the tests that don't work with FIXMEs into the attributor test folder please, potentially just into the respective test files, e.g. nonnull.ll

I proposed some wording changes, but no semantic change.

llvm/docs/LangRef.rst
1163–1169	`poison` is not aligned, and can well be null. Let's state the intent and not "include" poison as valid input. It is not and that is why it triggers the output to be poison.
1228–1233

Address comments, leave tests with FIXME

Herald added a reviewer: sstefan1. · View Herald TranscriptJan 12 2021, 7:13 PM

Herald added a reviewer: baziotis. · View Herald Transcript

Herald added subscribers: bbn, kuter. · View Herald Transcript

aqjune marked 2 inline comments as done.Jan 12 2021, 7:13 PM

Harbormaster completed remote builds in B84967: Diff 316308.Jan 12 2021, 8:08 PM

This revision was landed with ongoing or failed builds.Jan 19 2021, 6:54 PM

Closed by commit rG4479c0c2c0be: Allow nonnull/align attribute to accept poison (authored by aqjune). · Explain Why

This revision was automatically updated to reflect the committed changes.

aqjune added a commit: rG4479c0c2c0be: Allow nonnull/align attribute to accept poison.

aqjune mentioned this in D95238: [LangRef] Update memory access ops to raise UB if ptrs are not well defined.Jan 24 2021, 6:06 PM

In D90529#2449558, @aqjune wrote:
In D90529#2449524, @jdoerfert wrote:
In D90529#2449341, @aqjune wrote:

Should we have clarification about the semantics of align/nonnull bundle for the assume intrinsic?

Probably true, good catch. Why not make it the same as the attribute? So,
; A correct transformation
call f(nonnull %p, nonnull noundef %q)
->
llvm.assume [ "nonnull"(%p), "nonnull"(%q), "noundef"(%q)]
; where a `null` value for %p can be assumed to be a poison value and
; a `null` value for %q can be assumed to cause UB because `noundef` of
; poison is UB.
+1, I'll make a separate patch for this.

I'll start working on this soon.

I got a bit confused by the summary until I realized that I should read it this way:) Perhaps you can still edit the summary like the following:

This propagates the nonnull attribute to the caller in the following case:

%p = gep inbounds %x, 1 ; % p is non-null pointer or poison
call void @f(%p)        ; instcombine converts this to call void @f(nonnull %p)

<del>Instead, </del>This patch makes it illegal to propagate the nonnull attribute to the caller, e.g.

define void @f(i8* %p) {       ; functionattr cannot mark %p nonnull here anymore
  call void @g(i8* nonnull %p) ; .. because @g never raises UB if it never uses %p.
  ret void
}

To re-allow nonnull propagation to the caller, the caller should have the noundef attribute.

In D90529#2584092, @MaskRay wrote:
I got a bit confused by the summary until I realized that I should read it this way:) Perhaps you can still edit the summary like the following:

This propagates the nonnull attribute to the caller in the following case:
%p = gep inbounds %x, 1 ; % p is non-null pointer or poison
call void @f(%p)        ; instcombine converts this to call void @f(nonnull %p)
<del>Instead, </del>This patch makes it illegal to propagate the nonnull attribute to the caller, e.g.
define void @f(i8* %p) {       ; functionattr cannot mark %p nonnull here anymore
  call void @g(i8* nonnull %p) ; .. because @g never raises UB if it never uses %p.
  ret void
}
To re-allow nonnull propagation to the caller, the caller should have the noundef attribute.

Edited the summary, thanks :)

In D90529#2449558, @aqjune wrote:
In D90529#2449524, @jdoerfert wrote:
In D90529#2449341, @aqjune wrote:

Should we have clarification about the semantics of align/nonnull bundle for the assume intrinsic?

Probably true, good catch. Why not make it the same as the attribute? So,
; A correct transformation
call f(nonnull %p, nonnull noundef %q)
->
llvm.assume [ "nonnull"(%p), "nonnull"(%q), "noundef"(%q)]
; where a `null` value for %p can be assumed to be a poison value and
; a `null` value for %q can be assumed to cause UB because `noundef` of
; poison is UB.
+1, I'll make a separate patch for this.

While writing a LangRef patch for the semantic changes in nonnull & align attributes in assume operand bundle, I found one interesting case. :)

Creating assume from these two calls yields the same result:

call void @hi1(i8* nonnull %val, i8* noundef %val)
call void @hi2(i8* nonnull noundef %val)

They both generate:

call void @llvm.assume(true)["nonnull"(i8* %val), "noundef"(i8* %val)]

A question is whether this assume is UB if, say, %val is inttoptr(1).
Creating this assume from call void @hi2(%val) is okay because call void @hi2(%val) is already UB.
But, creating the assume from call void @hi1(%val, %val) isn't okay! %val is non-null and noundef, so the call is fine.

To distinguish these two, what about this?
(1) simply use UB semantics for nonnull attribute in assume operand bundle
(2) lower nonnull attribute in the call-site into nonnull bundle only if it is accompanied with noundef.
Then, we don't need to change anything about nonnull from LangRef: the current text is defining it as UB (https://llvm.org/docs/LangRef.html#assume-operand-bundles ).

In D90529#2594026, @aqjune wrote:
While writing a LangRef patch for the semantic changes in nonnull & align attributes in assume operand bundle, I found one interesting case. :)

Creating assume from these two calls yields the same result:
call void @hi1(i8* nonnull %val, i8* noundef %val)
call void @hi2(i8* nonnull noundef %val)
They both generate:
call void @llvm.assume(true)["nonnull"(i8* %val), "noundef"(i8* %val)]
A question is whether this assume is UB if, say, %val is inttoptr(1).
Creating this assume from call void @hi2(%val) is okay because call void @hi2(%val) is already UB.
But, creating the assume from call void @hi1(%val, %val) isn't okay! %val is non-null and noundef, so the call is fine.

To distinguish these two, what about this?
(1) simply use UB semantics for nonnull attribute in assume operand bundle
(2) lower nonnull attribute in the call-site into nonnull bundle only if it is accompanied with noundef.
Then, we don't need to change anything about nonnull from LangRef: the current text is defining it as UB (https://llvm.org/docs/LangRef.html#assume-operand-bundles ).

I thought at first (2) is the way to go but then I asked myself if @hi1(%val, %val) is really not already UB. The question is, does val become poison for this use or is val poison in the scope of the use. I feel the latter is what we actually want. If we assume this is the only call of hi1 and we then do interprocedural transformations to replace the use of the first argument with the second or vice versa, would that be illegal because of the attribute mismatch or allowed? If it is the latter, the call must have been UB.

In D90529#2594464, @jdoerfert wrote:
In D90529#2594026, @aqjune wrote:
While writing a LangRef patch for the semantic changes in nonnull & align attributes in assume operand bundle, I found one interesting case. :)

Creating assume from these two calls yields the same result:
call void @hi1(i8* nonnull %val, i8* noundef %val)
call void @hi2(i8* nonnull noundef %val)
They both generate:
call void @llvm.assume(true)["nonnull"(i8* %val), "noundef"(i8* %val)]
A question is whether this assume is UB if, say, %val is inttoptr(1).
Creating this assume from call void @hi2(%val) is okay because call void @hi2(%val) is already UB.
But, creating the assume from call void @hi1(%val, %val) isn't okay! %val is non-null and noundef, so the call is fine.

To distinguish these two, what about this?
(1) simply use UB semantics for nonnull attribute in assume operand bundle
(2) lower nonnull attribute in the call-site into nonnull bundle only if it is accompanied with noundef.
Then, we don't need to change anything about nonnull from LangRef: the current text is defining it as UB (https://llvm.org/docs/LangRef.html#assume-operand-bundles ).
I thought at first (2) is the way to go but then I asked myself if @hi1(%val, %val) is really not already UB. The question is, does val become poison for this use or is val poison in the scope of the use. I feel the latter is what we actually want. If we assume this is the only call of hi1 and we then do interprocedural transformations to replace the use of the first argument with the second or vice versa, would that be illegal because of the attribute mismatch or allowed? If it is the latter, the call must have been UB.

Hmm, I think the issue happens with a call without noundef as well:

call void @hi1(nonnull %val, %val) ; let's assume that this is the only call

define void @hi(i8* %x, i8* %y) {
  use(%x, %y) ; this cannot be use(%x, %x)!
}

To do this replacement, nonnull at the call site should be dropped first.

Besides this, I think expanding the scope of being poison to the same variables makes optimizations like CSE hard. For example:

call void @hi1(<attr> %val, <attr2> %val2)

If CSE concludes that %val is %val2, it can remove %val2 and optimize it into:

call void @hi1(<attr> %val, <attr2> %val)

.. and this can unexpectedly introduce UB due to the synergy between <attr> and <attr2>. I guess fixing optimizations to consider attributes might be a bit costly.. :/

In D90529#2594677, @aqjune wrote:
In D90529#2594464, @jdoerfert wrote:
In D90529#2594026, @aqjune wrote:
While writing a LangRef patch for the semantic changes in nonnull & align attributes in assume operand bundle, I found one interesting case. :)

Creating assume from these two calls yields the same result:
call void @hi1(i8* nonnull %val, i8* noundef %val)
call void @hi2(i8* nonnull noundef %val)
They both generate:
call void @llvm.assume(true)["nonnull"(i8* %val), "noundef"(i8* %val)]
A question is whether this assume is UB if, say, %val is inttoptr(1).
Creating this assume from call void @hi2(%val) is okay because call void @hi2(%val) is already UB.
But, creating the assume from call void @hi1(%val, %val) isn't okay! %val is non-null and noundef, so the call is fine.

To distinguish these two, what about this?
(1) simply use UB semantics for nonnull attribute in assume operand bundle
(2) lower nonnull attribute in the call-site into nonnull bundle only if it is accompanied with noundef.
Then, we don't need to change anything about nonnull from LangRef: the current text is defining it as UB (https://llvm.org/docs/LangRef.html#assume-operand-bundles ).
I thought at first (2) is the way to go but then I asked myself if @hi1(%val, %val) is really not already UB. The question is, does val become poison for this use or is val poison in the scope of the use. I feel the latter is what we actually want. If we assume this is the only call of hi1 and we then do interprocedural transformations to replace the use of the first argument with the second or vice versa, would that be illegal because of the attribute mismatch or allowed? If it is the latter, the call must have been UB.
Hmm, I think the issue happens with a call without noundef as well:
call void @hi1(nonnull %val, %val) ; let's assume that this is the only call

define void @hi(i8* %x, i8* %y) {
  use(%x, %y) ; this cannot be use(%x, %x)!
}
To do this replacement, nonnull at the call site should be dropped first.

Besides this, I think expanding the scope of being poison to the same variables makes optimizations like CSE hard. For example:
call void @hi1(<attr> %val, <attr2> %val2)
If CSE concludes that %val is %val2, it can remove %val2 and optimize it into:
call void @hi1(<attr> %val, <attr2> %val)
.. and this can unexpectedly introduce UB due to the synergy between <attr> and <attr2>. I guess fixing optimizations to consider attributes might be a bit costly.. :/

That is/was already the case, right? So, is the current behavior OK or not, potentially in light of making nonnull produce poison not UB.
I think it is/was OK, and, TBH I think it has to be. If we look at it scoped based it all makes sense (I think/hope).
On a practical note, you might not see all the call sites so "dropping first" doesn't work if you know that x == y from a local assume (in the hi example above).

If the attribute holds for the value in the scope, CSE is free to change val to val2 because the attributes did already conflict even if you used val and val2.
Do you see a problem when the attribute is scope based?

We might need range based assumptions (ref. section D and 3 in [0]) to encode this information properly when we go from calls to llvm.assume but that is doable.

[0] https://lists.llvm.org/pipermail/llvm-dev/2019-December/137632.html]

In D90529#2594755, @jdoerfert wrote:
In D90529#2594677, @aqjune wrote:
In D90529#2594464, @jdoerfert wrote:
In D90529#2594026, @aqjune wrote:
While writing a LangRef patch for the semantic changes in nonnull & align attributes in assume operand bundle, I found one interesting case. :)

Creating assume from these two calls yields the same result:
call void @hi1(i8* nonnull %val, i8* noundef %val)
call void @hi2(i8* nonnull noundef %val)
They both generate:
call void @llvm.assume(true)["nonnull"(i8* %val), "noundef"(i8* %val)]
A question is whether this assume is UB if, say, %val is inttoptr(1).
Creating this assume from call void @hi2(%val) is okay because call void @hi2(%val) is already UB.
But, creating the assume from call void @hi1(%val, %val) isn't okay! %val is non-null and noundef, so the call is fine.

To distinguish these two, what about this?
(1) simply use UB semantics for nonnull attribute in assume operand bundle
(2) lower nonnull attribute in the call-site into nonnull bundle only if it is accompanied with noundef.
Then, we don't need to change anything about nonnull from LangRef: the current text is defining it as UB (https://llvm.org/docs/LangRef.html#assume-operand-bundles ).
I thought at first (2) is the way to go but then I asked myself if @hi1(%val, %val) is really not already UB. The question is, does val become poison for this use or is val poison in the scope of the use. I feel the latter is what we actually want. If we assume this is the only call of hi1 and we then do interprocedural transformations to replace the use of the first argument with the second or vice versa, would that be illegal because of the attribute mismatch or allowed? If it is the latter, the call must have been UB.
Hmm, I think the issue happens with a call without noundef as well:
call void @hi1(nonnull %val, %val) ; let's assume that this is the only call

define void @hi(i8* %x, i8* %y) {
  use(%x, %y) ; this cannot be use(%x, %x)!
}
To do this replacement, nonnull at the call site should be dropped first.

Besides this, I think expanding the scope of being poison to the same variables makes optimizations like CSE hard. For example:
call void @hi1(<attr> %val, <attr2> %val2)
If CSE concludes that %val is %val2, it can remove %val2 and optimize it into:
call void @hi1(<attr> %val, <attr2> %val)
.. and this can unexpectedly introduce UB due to the synergy between <attr> and <attr2>. I guess fixing optimizations to consider attributes might be a bit costly.. :/
That is/was already the case, right? So, is the current behavior OK or not, potentially in light of making nonnull produce poison not UB.
I think it is/was OK, and, TBH I think it has to be. If we look at it scoped based it all makes sense (I think/hope).
On a practical note, you might not see all the call sites so "dropping first" doesn't work if you know that x == y from a local assume (in the hi example above).

I don't understand this sentence, I thought we could see all the call sites and assume that it was the only call of hi1.

If we assume this is the only call of `hi1` and we then do interprocedural transformations to replace the use of the first argument with the second or vice versa, would that be illegal because of the attribute mismatch or allowed?

If the attribute holds for the value in the scope, CSE is free to change val to val2 because the attributes did already conflict even if you used val and val2.
Do you see a problem when the attribute is scope based?

Before diving into further discussion, let's clarify my understanding about the semantics that you want:

Calling f(nonnull null, noundef null) is equivalent to f(nonnull noundef null, nonnull noundef null) because an attribute applies to all 'equivalent' values in the arguments. Is my understanding correct?
Similarly, calling f(nonnull null, null) is equivalent to f(poison, poison)?
What about f(nonnull poison, null)? Since poison can be folded into any value, we can make it f(nonnull null, null), which is again f(poison, poison).

Besides this, merging two assumes might require something. A notion of scope should be carefully defined in this case.

call void @llvm.assume(true) ["attr1"(p)]
call void @llvm.assume(true) ["attr2"(p)]
  =>
call void @llvm.assume(true) ["attr1"(p), "attr2"(p)] ; synergy between attributes may introduce UB

TBH, making things simple and introducing no special rule seems to be the best practice for avoiding possible miscompilations or glitches in spec I think.
I see that a similar thing is happening in relaxed concurrency and it is really hard to write at least medium-scale program correctly without relying on commonly-used patterns. :(

In D90529#2596221, @aqjune wrote:

If the attribute holds for the value in the scope, CSE is free to change val to val2 because the attributes did already conflict even if you used val and val2.
Do you see a problem when the attribute is scope based?

Before diving into further discussion, let's clarify my understanding about the semantics that you want:

Calling f(nonnull null, noundef null) is equivalent to f(nonnull noundef null, nonnull noundef null) because an attribute applies to all 'equivalent' values in the arguments. Is my understanding correct?

Similarly, calling f(nonnull null, null) is equivalent to f(poison, poison)?

What about f(nonnull poison, null)? Since poison can be folded into any value, we can make it f(nonnull null, null), which is again f(poison, poison).

You need to start with the same value for this to make sense (to me), so these are what I want:

f(nonnull %p, noundef %p) is equivalent to f(nonnull noundef %p, nonnull noundef %p)
%p = %q; f(nonnull %p, noundef %q) is equivalent to f(nonnull noundef %p, nonnull noundef %q)
%p = null; f(nonnull %p, %p) is equivalent to f(poison, poison)

I don't understand how the values were set up in the last example.

[...]

I don't understand this sentence, I thought we could see all the call sites and assume that it was the only call of hi1.

My bad, I forgot we are still assuming this and therefore I interpreted your suggested to scrub the call site in a general setting (which doesn't work).

In D90529#2596245, @aqjune wrote:
Besides this, merging two assumes might require something. A notion of scope should be carefully defined in this case.

assumes need scopes for this to make sense. the current ones have "point-scope" and you could not simply merge them, right.

call void @llvm.assume(true) ["attr1"(p)]
call void @llvm.assume(true) ["attr2"(p)]
=>
call void @llvm.assume(true) ["attr1"(p), "attr2"(p)] ; synergy between attributes may introduce UB
TBH, making things simple and introducing no special rule seems to be the best practice for avoiding possible miscompilations or glitches in spec I think.
I see that a similar thing is happening in relaxed concurrency and it is really hard to write at least medium-scale program correctly without relying on commonly-used patterns. :(

I think I missed your "simple" no-special rule suggestion. Could you repeat that or link it?

In D90529#2596260, @jdoerfert wrote:
In D90529#2596245, @aqjune wrote:
call void @llvm.assume(true) ["attr1"(p)]
call void @llvm.assume(true) ["attr2"(p)]
  =>
call void @llvm.assume(true) ["attr1"(p), "attr2"(p)] ; synergy between attributes may introduce UB
TBH, making things simple and introducing no special rule seems to be the best practice for avoiding possible miscompilations or glitches in spec I think.
I see that a similar thing is happening in relaxed concurrency and it is really hard to write at least medium-scale program correctly without relying on commonly-used patterns. :(
I think I missed your "simple" no-special rule suggestion. Could you repeat that or link it?

My suggestion is to keep the LangRef text of assume bundle as it is, and add nonnull/align to a bundle when creating assume only if it was paired with noundef.

call void @f(nonnull %p)         ; => llvm.assume(true) []
call void @f(nonnull noundef %p) ; => llvm.assume(true) ["nonnull"(%p)]

Well, it is still doing something (the lowering needs to check the existence of noundef), but I think the principle is already applied to the nonnull/align attributes; they can raise UB only if it is paired with noundef, as LangRef says.
I'm sorry if it sounded a bit aggressive :(

In D90529#2596299, @aqjune wrote:
In D90529#2596260, @jdoerfert wrote:
In D90529#2596245, @aqjune wrote:
call void @llvm.assume(true) ["attr1"(p)]
call void @llvm.assume(true) ["attr2"(p)]
  =>
call void @llvm.assume(true) ["attr1"(p), "attr2"(p)] ; synergy between attributes may introduce UB
TBH, making things simple and introducing no special rule seems to be the best practice for avoiding possible miscompilations or glitches in spec I think.
I see that a similar thing is happening in relaxed concurrency and it is really hard to write at least medium-scale program correctly without relying on commonly-used patterns. :(
I think I missed your "simple" no-special rule suggestion. Could you repeat that or link it?
My suggestion is to keep the LangRef text of assume bundle as it is, and add nonnull/align to a bundle when creating assume only if it was paired with noundef.
call void @f(nonnull %p)         ; => llvm.assume(true) []
call void @f(nonnull noundef %p) ; => llvm.assume(true) ["nonnull"(%p)]
Well, it is still doing something (the lowering needs to check the existence of noundef), but I think the principle is already applied to the nonnull/align attributes; they can raise UB only if it is paired with noundef, as LangRef says.

I still like this, it seems like a natural next step, let's do that first.
We can revisit the scope discussion and I think will have to because of the GVN example, among other things.

I'm sorry if it sounded a bit aggressive :(

No worries.

Thank you! :)
I'll implement the corresponding semantics in Alive2 and see whether there is regression.

There was no regression in intraprocedural optimizations (and interprocedural optimizations that Alive2 could support)
But I think I need to look into Attributor's implementation as well; maybe I can have a look tomorrow

A fix in AssumeBundleBuilder to make it comply LangRef: D98228

BTW, I found that "align" can take two operands: "align"(i8* ptr, i64 a, i64 b) What is the meaning of the second index (b)?

In D90529#2612794, @aqjune wrote:

A fix in AssumeBundleBuilder to make it comply LangRef: D98228

BTW, I found that "align" can take two operands: "align"(i8* ptr, i64 a, i64 b) What is the meaning of the second index (b)?

the meaning of the second index is the offset of the alignment. so with "align"(i8* ptr, i64 16, i64 12), ptr is aligned on 4 but ptr + 4 is aligned on 16.
I introduced this to support __builtin_assume_aligned which has similar semantics.

In D90529#2613303, @Tyker wrote:

In D90529#2612794, @aqjune wrote:

A fix in AssumeBundleBuilder to make it comply LangRef: D98228

BTW, I found that "align" can take two operands: "align"(i8* ptr, i64 a, i64 b) What is the meaning of the second index (b)?

the meaning of the second index is the offset of the alignment. so with "align"(i8* ptr, i64 16, i64 12), ptr is aligned on 4 but ptr + 4 is aligned on 16.
I introduced this to support __builtin_assume_aligned which has similar semantics.

Hi, thanks for the info.

But I think an example in Transforms/AlignmentFromAssumptions/simple.ll conflicts with your definition. It has:

28 define i32 @foo2a(i32* nocapture %a) nounwind uwtable readonly {
29 entry:
30   tail call void @llvm.assume(i1 true) ["align"(i32* %a, i32 32, i32 28)]
31   %arrayidx = getelementptr inbounds i32, i32* %a, i64 -1
32   %0 = load i32, i32* %arrayidx, align 4
33   ret i32 %0
34 
35 ; CHECK-LABEL: @foo2a
36 ; CHECK: load i32, i32* {{[^,]+}}, align 32
37 ; CHECK: ret i32
38 }

"align"(i32* %a, i32 32, i32 28) means %a + 4 is 32-bytes aligned, IIUC. Then, %a - 4 cannot be 32-bytes aligned, is it?

From https://gcc.gnu.org/onlinedocs/gcc/Other-Builtins.html , the definition you've explained seems to be correct because it is saying

void *x = __builtin_assume_aligned (arg, 32, 8);
means that the compiler can assume for x, set to arg, that (char *) x - 8 is 32-byte aligned.

Should we specify this in LangRef as well to make its definition explicit?

In D90529#2619492, @aqjune wrote:
In D90529#2613303, @Tyker wrote:

In D90529#2612794, @aqjune wrote:

A fix in AssumeBundleBuilder to make it comply LangRef: D98228

BTW, I found that "align" can take two operands: "align"(i8* ptr, i64 a, i64 b) What is the meaning of the second index (b)?

the meaning of the second index is the offset of the alignment. so with "align"(i8* ptr, i64 16, i64 12), ptr is aligned on 4 but ptr + 4 is aligned on 16.
I introduced this to support __builtin_assume_aligned which has similar semantics.

Hi, thanks for the info.

But I think an example in Transforms/AlignmentFromAssumptions/simple.ll conflicts with your definition. It has:
28 define i32 @foo2a(i32* nocapture %a) nounwind uwtable readonly {
29 entry:
30   tail call void @llvm.assume(i1 true) ["align"(i32* %a, i32 32, i32 28)]
31   %arrayidx = getelementptr inbounds i32, i32* %a, i64 -1
32   %0 = load i32, i32* %arrayidx, align 4
33   ret i32 %0
34 
35 ; CHECK-LABEL: @foo2a
36 ; CHECK: load i32, i32* {{[^,]+}}, align 32
37 ; CHECK: ret i32
38 }
"align"(i32* %a, i32 32, i32 28) means %a + 4 is 32-bytes aligned, IIUC. Then, %a - 4 cannot be 32-bytes aligned, is it?

Yes, this looks like a bug.

From https://gcc.gnu.org/onlinedocs/gcc/Other-Builtins.html , the definition you've explained seems to be correct because it is saying
void *x = __builtin_assume_aligned (arg, 32, 8);
means that the compiler can assume for x, set to arg, that (char *) x - 8 is 32-byte aligned.
Should we specify this in LangRef as well to make its definition explicit?

Yes

aqjune mentioned this in D98684: [LangRef] state that align assume op bundle may take an extra argument.Mar 16 2021, 1:11 AM

aqjune mentioned this in D98759: [AssumeBundles] offset should be added to correctly calculate align.Mar 16 2021, 10:04 PM

aqjune mentioned this in rGc6647693300b: [AssumeBundles] offset should be added to correctly calculate align.Apr 1 2021, 8:32 PM

Revision Contents

Path

Size

llvm/

docs/

LangRef.rst

14 lines

include/

llvm/

IR/

Argument.h

4 lines

lib/

Analysis/

ValueTracking.cpp

3 lines

IR/

Function.cpp

6 lines

Transforms/

IPO/

FunctionAttrs.cpp

2 lines

test/

Analysis/

ValueTracking/

known-nonnull-at.ll

18 lines

Transforms/

Attributor/

align.ll

13 lines

nonnull.ll

13 lines

FunctionAttrs/

nonnull.ll

15 lines

InstCombine/

call_nonnull_arg.ll

33 lines

unused-nonnull.ll

4 lines

Diff 317750

llvm/docs/LangRef.rst

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 1,154 Lines • ▼ Show 20 Lines ``sret(<ty>)``

for return values. for return values.

The sret type argument specifies the in memory type, which must be The sret type argument specifies the in memory type, which must be

the same as the pointee type of the argument. the same as the pointee type of the argument.

.. _attr_align: .. _attr_align:

``align <n>`` or ``align(<n>)`` ``align <n>`` or ``align(<n>)``

This indicates that the pointer value may be assumed by the optimizer to This indicates that the pointer value has the specified alignment.

have the specified alignment. If the pointer value does not have the If the pointer value does not have the specified alignment,

specified alignment, behavior is undefined. ``align 1`` has no effect on :ref:`poison value <poisonvalues>` is returned or passed instead. The

non-byval, non-preallocated arguments. ``align`` attribute should be combined with the ``noundef`` attribute to

ensure a pointer is aligned, or otherwise the behavior is undefined. Note

that ``align 1`` has no effect on non-byval, non-preallocated arguments.

jdoerfertUnsubmitted

Done

``align <n>`` or ``align(<n>)``

- This indicates that the pointer value is :ref:`poison value <poisonvalues>`

- or has the specified alignment. If the pointer value does not have the

- specified alignment, :ref:`poison value <poisonvalues>` is returned or

- passed instead of undefined behavior. If ``noundef`` attribute exists, the

- value should be a well-defined aligned pointer, otherwise it is undefined

- behavior. ``align 1`` has no effect on non-byval, non-preallocated

- arguments.

+ This indicates that the pointer value has the specified alignment.

+ If the pointer value does not have the specified alignment, :ref:`poison value <poisonvalues>` is returned or passed instead. The ``align`` attribute should

+ be combined with the ``noundef`` attribute to ensure a pointer is aligned,

+ or otherwise the behavior is undefined. Note that ``align 1`` has no effect

+ on non-byval, non-preallocated arguments.

Note that this attribute has additional semantics when combined with the

poison is not aligned, and can well be null. Let's state the intent and not "include" poison as valid input. It is not and that is why it triggers the output to be poison.

jdoerfert: `poison` is not aligned, and can well be null. Let's state the intent and not "include" poison…

Note that this attribute has additional semantics when combined with the Note that this attribute has additional semantics when combined with the

``byval`` or ``preallocated`` attribute, which are documented there. ``byval`` or ``preallocated`` attribute, which are documented there.

.. _noalias: .. _noalias:

``noalias`` ``noalias``

This indicates that memory locations accessed via pointer values This indicates that memory locations accessed via pointer values

:ref:`based <pointeraliasing>` on the argument or return value are not also :ref:`based <pointeraliasing>` on the argument or return value are not also

▲ Show 20 Lines • Show All 42 Lines • ▼ Show 20 Lines ``returned``

and omission of register saves and restores in some cases; it is not and omission of register saves and restores in some cases; it is not

checked or enforced when generating the callee. The parameter and the checked or enforced when generating the callee. The parameter and the

function return type must be valid operands for the function return type must be valid operands for the

:ref:`bitcast instruction <i_bitcast>`. This is not a valid attribute for :ref:`bitcast instruction <i_bitcast>`. This is not a valid attribute for

return values and can only be applied to one parameter. return values and can only be applied to one parameter.

``nonnull`` ``nonnull``

This indicates that the parameter or return pointer is not null. This This indicates that the parameter or return pointer is not null. This

attribute may only be applied to pointer typed parameters. This is not attribute may only be applied to pointer typed parameters. This is not

checked or enforced by LLVM; if the parameter or return pointer is null, checked or enforced by LLVM; if the parameter or return pointer is null,

the behavior is undefined. :ref:`poison value <poisonvalues>` is returned or passed instead.

The ``nonnull`` attribute should be combined with the ``noundef`` attribute

jdoerfertUnsubmitted

Done

Mention the combination with noundef please.

jdoerfert: Mention the combination with `noundef` please.

to ensure a pointer is not null or otherwise the behavior is undefined.

jdoerfertUnsubmitted

Done

``nonnull``

- This indicates that the parameter or return pointer is poison or not null.

+ This indicates that the parameter or return pointer is not null.

This attribute may only be applied to pointer typed parameters. This is not

checked or enforced by LLVM; if the parameter or return pointer is null,

- :ref:`poison value <poisonvalues>` is returned or passed instead of

- undefined behavior. If ``noundef`` attribute exists, the value should be a

- well-defined non-null value, otherwise it is undefined behavior.

+ :ref:`poison value <poisonvalues>` is returned or passed instead.

+ The ``nonnull`` attribute should be combined with the ``noundef`` attribute

+ to ensure a pointer is not null or otherwise the behavior is undefined.

``dereferenceable(<n>)``

jdoerfert:

``dereferenceable(<n>)`` ``dereferenceable(<n>)``

This indicates that the parameter or return pointer is dereferenceable. This This indicates that the parameter or return pointer is dereferenceable. This

attribute may only be applied to pointer typed parameters. A pointer that attribute may only be applied to pointer typed parameters. A pointer that

is dereferenceable can be loaded from speculatively without a risk of is dereferenceable can be loaded from speculatively without a risk of

trapping. The number of bytes known to be dereferenceable must be provided trapping. The number of bytes known to be dereferenceable must be provided

in parentheses. It is legal for the number of bytes to be less than the in parentheses. It is legal for the number of bytes to be less than the

size of the pointee type. The ``nonnull`` attribute does not imply size of the pointee type. The ``nonnull`` attribute does not imply

dereferenceability (consider a pointer to one element past the end of an dereferenceability (consider a pointer to one element past the end of an

▲ Show 20 Lines • Show All 20,113 Lines • Show Last 20 Lines

llvm/include/llvm/IR/Argument.h

Show First 20 Lines • Show All 46 Lines • ▼ Show 20 Lines	public:
unsigned getArgNo() const {		unsigned getArgNo() const {
assert(Parent && "can't get number of unparented arg");		assert(Parent && "can't get number of unparented arg");
return ArgNo;		return ArgNo;
}		}

/// Return true if this argument has the nonnull attribute. Also returns true		/// Return true if this argument has the nonnull attribute. Also returns true
/// if at least one byte is known to be dereferenceable and the pointer is in		/// if at least one byte is known to be dereferenceable and the pointer is in
/// addrspace(0).		/// addrspace(0).
bool hasNonNullAttr() const;		/// If AllowUndefOrPoison is true, respect the semantics of nonnull attribute
		/// and return true even if the argument can be undef or poison.
		bool hasNonNullAttr(bool AllowUndefOrPoison = true) const;

/// If this argument has the dereferenceable attribute, return the number of		/// If this argument has the dereferenceable attribute, return the number of
/// bytes known to be dereferenceable. Otherwise, zero is returned.		/// bytes known to be dereferenceable. Otherwise, zero is returned.
uint64_t getDereferenceableBytes() const;		uint64_t getDereferenceableBytes() const;

/// If this argument has the dereferenceable_or_null attribute, return the		/// If this argument has the dereferenceable_or_null attribute, return the
/// number of bytes known to be dereferenceable. Otherwise, zero is returned.		/// number of bytes known to be dereferenceable. Otherwise, zero is returned.
uint64_t getDereferenceableOrNullBytes() const;		uint64_t getDereferenceableOrNullBytes() const;
▲ Show 20 Lines • Show All 105 Lines • Show Last 20 Lines

llvm/lib/Analysis/ValueTracking.cpp

Show First 20 Lines • Show All 2,082 Lines • ▼ Show 20 Lines	for (auto *U : V->users()) {
NumUsesExplored++;		NumUsesExplored++;

// If the value is used as an argument to a call or invoke, then argument		// If the value is used as an argument to a call or invoke, then argument
// attributes may provide an answer about null-ness.		// attributes may provide an answer about null-ness.
if (const auto *CB = dyn_cast<CallBase>(U))		if (const auto *CB = dyn_cast<CallBase>(U))
if (auto *CalledFunc = CB->getCalledFunction())		if (auto *CalledFunc = CB->getCalledFunction())
for (const Argument &Arg : CalledFunc->args())		for (const Argument &Arg : CalledFunc->args())
if (CB->getArgOperand(Arg.getArgNo()) == V &&		if (CB->getArgOperand(Arg.getArgNo()) == V &&
Arg.hasNonNullAttr() && DT->dominates(CB, CtxI))		Arg.hasNonNullAttr(/* AllowUndefOrPoison */ false) &&
		DT->dominates(CB, CtxI))
		jdoerfertUnsubmitted Done Reply Inline Actions `/* AllowUndefOrPoison / false` jdoerfert:* `/* AllowUndefOrPoison */ false`
return true;		return true;

// If the value is used as a load/store, then the pointer must be non null.		// If the value is used as a load/store, then the pointer must be non null.
if (V == getLoadStorePointerOperand(U)) {		if (V == getLoadStorePointerOperand(U)) {
const Instruction *I = cast<Instruction>(U);		const Instruction *I = cast<Instruction>(U);
if (!NullPointerIsDefined(I->getFunction(),		if (!NullPointerIsDefined(I->getFunction(),
V->getType()->getPointerAddressSpace()) &&		V->getType()->getPointerAddressSpace()) &&
DT->dominates(I, CtxI))		DT->dominates(I, CtxI))
▲ Show 20 Lines • Show All 4,718 Lines • Show Last 20 Lines

llvm/lib/IR/Function.cpp

Show First 20 Lines • Show All 81 Lines • ▼ Show 20 Lines	Argument::Argument(Type Ty, const Twine &Name, Function Par, unsigned ArgNo)
: Value(Ty, Value::ArgumentVal), Parent(Par), ArgNo(ArgNo) {		: Value(Ty, Value::ArgumentVal), Parent(Par), ArgNo(ArgNo) {
setName(Name);		setName(Name);
}		}

void Argument::setParent(Function *parent) {		void Argument::setParent(Function *parent) {
Parent = parent;		Parent = parent;
}		}

bool Argument::hasNonNullAttr() const {		bool Argument::hasNonNullAttr(bool AllowUndefOrPoison) const {
if (!getType()->isPointerTy()) return false;		if (!getType()->isPointerTy()) return false;
if (getParent()->hasParamAttribute(getArgNo(), Attribute::NonNull))		if (getParent()->hasParamAttribute(getArgNo(), Attribute::NonNull) &&
		(AllowUndefOrPoison \|\|
		getParent()->hasParamAttribute(getArgNo(), Attribute::NoUndef)))
return true;		return true;
else if (getDereferenceableBytes() > 0 &&		else if (getDereferenceableBytes() > 0 &&
!NullPointerIsDefined(getParent(),		!NullPointerIsDefined(getParent(),
getType()->getPointerAddressSpace()))		getType()->getPointerAddressSpace()))
return true;		return true;
return false;		return false;
}		}

▲ Show 20 Lines • Show All 1,683 Lines • Show Last 20 Lines

llvm/lib/Transforms/IPO/FunctionAttrs.cpp

Show First 20 Lines • Show All 636 Lines • ▼ Show 20 Lines	static bool addArgumentAttrsFromCallsites(Function &F) {
// TODO: This could be enhanced by testing if the callsite post-dominates the		// TODO: This could be enhanced by testing if the callsite post-dominates the
// entry block or by doing simple forward walks or backward walks to the		// entry block or by doing simple forward walks or backward walks to the
// callsite.		// callsite.
BasicBlock &Entry = F.getEntryBlock();		BasicBlock &Entry = F.getEntryBlock();
for (Instruction &I : Entry) {		for (Instruction &I : Entry) {
if (auto *CB = dyn_cast<CallBase>(&I)) {		if (auto *CB = dyn_cast<CallBase>(&I)) {
if (auto *CalledFunc = CB->getCalledFunction()) {		if (auto *CalledFunc = CB->getCalledFunction()) {
for (auto &CSArg : CalledFunc->args()) {		for (auto &CSArg : CalledFunc->args()) {
if (!CSArg.hasNonNullAttr())		if (!CSArg.hasNonNullAttr(/* AllowUndefOrPoison */ false))
		jdoerfertUnsubmitted Done Reply Inline Actions see above. jdoerfert: see above.
continue;		continue;

// If the non-null callsite argument operand is an argument to 'F'		// If the non-null callsite argument operand is an argument to 'F'
// (the caller) and the call is guaranteed to execute, then the value		// (the caller) and the call is guaranteed to execute, then the value
// must be non-null throughout 'F'.		// must be non-null throughout 'F'.
auto *FArg = dyn_cast<Argument>(CB->getArgOperand(CSArg.getArgNo()));		auto *FArg = dyn_cast<Argument>(CB->getArgOperand(CSArg.getArgNo()));
if (FArg && !FArg->hasNonNullAttr()) {		if (FArg && !FArg->hasNonNullAttr()) {
FArg->addAttr(Attribute::NonNull);		FArg->addAttr(Attribute::NonNull);
▲ Show 20 Lines • Show All 1,036 Lines • Show Last 20 Lines

llvm/test/Analysis/ValueTracking/known-nonnull-at.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt -S -instsimplify < %s \| FileCheck %s			; RUN: opt -S -instsimplify < %s \| FileCheck %s

	declare void @bar(i8* %a, i8* nonnull %b)			declare void @bar(i8* %a, i8* nonnull noundef %b)
				declare void @bar_without_noundef(i8* %a, i8* nonnull %b)
				jdoerfertUnsubmitted Done Reply Inline Actions maybe rename to make it clear at the call site jdoerfert: maybe rename to make it clear at the call site

	; 'y' must be nonnull.			; 'y' must be nonnull.

	define i1 @caller1(i8* %x, i8* %y) {			define i1 @caller1(i8* %x, i8* %y) {
	; CHECK-LABEL: @caller1(			; CHECK-LABEL: @caller1(
	; CHECK-NEXT: call void @bar(i8* [[X:%.]], i8 [[Y:%.*]])			; CHECK-NEXT: call void @bar(i8* [[X:%.]], i8 [[Y:%.*]])
	; CHECK-NEXT: ret i1 false			; CHECK-NEXT: ret i1 false
	;			;
	call void @bar(i8* %x, i8* %y)			call void @bar(i8* %x, i8* %y)
	%null_check = icmp eq i8* %y, null			%null_check = icmp eq i8* %y, null
	ret i1 %null_check			ret i1 %null_check
	}			}

	; Don't know anything about 'y'.			; Don't know anything about 'y'.

				define i1 @caller1_maybepoison(i8* %x, i8* %y) {
				; CHECK-LABEL: @caller1_maybepoison(
				; CHECK-NEXT: call void @bar_without_noundef(i8* [[X:%.]], i8 [[Y:%.*]])
				; CHECK-NEXT: [[NULL_CHECK:%.]] = icmp eq i8 [[Y]], null
				; CHECK-NEXT: ret i1 [[NULL_CHECK]]
				;
				call void @bar_without_noundef(i8* %x, i8* %y)
				%null_check = icmp eq i8* %y, null
				ret i1 %null_check
				}

				; Don't know anything about 'y'.

	define i1 @caller2(i8* %x, i8* %y) {			define i1 @caller2(i8* %x, i8* %y) {
	; CHECK-LABEL: @caller2(			; CHECK-LABEL: @caller2(
	; CHECK-NEXT: call void @bar(i8* [[Y:%.]], i8 [[X:%.*]])			; CHECK-NEXT: call void @bar(i8* [[Y:%.]], i8 [[X:%.*]])
	; CHECK-NEXT: [[NULL_CHECK:%.]] = icmp eq i8 [[Y]], null			; CHECK-NEXT: [[NULL_CHECK:%.]] = icmp eq i8 [[Y]], null
	; CHECK-NEXT: ret i1 [[NULL_CHECK]]			; CHECK-NEXT: ret i1 [[NULL_CHECK]]
	;			;
	call void @bar(i8* %y, i8* %x)			call void @bar(i8* %y, i8* %x)
	%null_check = icmp eq i8* %y, null			%null_check = icmp eq i8* %y, null
	▲ Show 20 Lines • Show All 154 Lines • ▼ Show 20 Lines

	define i8* @test_load_store_after_check(i8* %0) {			define i8* @test_load_store_after_check(i8* %0) {
	; CHECK-LABEL: @test_load_store_after_check(			; CHECK-LABEL: @test_load_store_after_check(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[TMP1:%.]] = call i8 @func(i64 0)			; CHECK-NEXT: [[TMP1:%.]] = call i8 @func(i64 0)
	; CHECK-NEXT: [[NULL_CHECK:%.]] = icmp eq i8 [[TMP1]], null			; CHECK-NEXT: [[NULL_CHECK:%.]] = icmp eq i8 [[TMP1]], null
	; CHECK-NEXT: br i1 [[NULL_CHECK]], label [[RETURN:%.]], label [[IF_END:%.]]			; CHECK-NEXT: br i1 [[NULL_CHECK]], label [[RETURN:%.]], label [[IF_END:%.]]
	; CHECK: if.end:			; CHECK: if.end:
	; CHECK-NEXT: store i8 7, i8* [[TMP1]]			; CHECK-NEXT: store i8 7, i8* [[TMP1]], align 1
	; CHECK-NEXT: br label [[RETURN]]			; CHECK-NEXT: br label [[RETURN]]
	; CHECK: return:			; CHECK: return:
	; CHECK-NEXT: [[RETVAL_0:%.]] = phi i8 [ [[TMP1]], [[IF_END]] ], [ null, [[ENTRY:%.*]] ]			; CHECK-NEXT: [[RETVAL_0:%.]] = phi i8 [ [[TMP1]], [[IF_END]] ], [ null, [[ENTRY:%.*]] ]
	; CHECK-NEXT: ret i8* [[RETVAL_0]]			; CHECK-NEXT: ret i8* [[RETVAL_0]]
	;			;
	entry:			entry:
	%1 = call i8* @func(i64 0)			%1 = call i8* @func(i64 0)
	%null_check = icmp eq i8* %1, null			%null_check = icmp eq i8* %1, null
	Show All 10 Lines

llvm/test/Transforms/Attributor/align.ll

Show First 20 Lines • Show All 1,035 Lines • ▼ Show 20 Lines	if.then: ; preds = %entry
br label %return		br label %return

return: ; preds = %entry, %if.then		return: ; preds = %entry, %if.then
%retval.0 = phi i32* [ %call, %if.then ], [ %p, %entry ]		%retval.0 = phi i32* [ %call, %if.then ], [ %p, %entry ]
call void @user_i32_ptr(i32* %retval.0)		call void @user_i32_ptr(i32* %retval.0)
ret i32* %retval.0		ret i32* %retval.0
}		}

		; FIXME: align 4 should not be propagated to the caller's p unless there is noundef
		define void @align4_caller(i8* %p) {
		; CHECK-LABEL: define {{[^@]+}}@align4_caller
		; CHECK-SAME: (i8* align 4 [[P:%.*]]) {
		; CHECK-NEXT: call void @align4_callee(i8* align 4 [[P]])
		; CHECK-NEXT: ret void
		;
		call void @align4_callee(i8* %p)
		ret void
		}

		declare void @align4_callee(i8* align(4) %p)


attributes #0 = { nounwind uwtable noinline }		attributes #0 = { nounwind uwtable noinline }
attributes #1 = { uwtable noinline }		attributes #1 = { uwtable noinline }
attributes #2 = { null_pointer_is_valid }		attributes #2 = { null_pointer_is_valid }

llvm/test/Transforms/Attributor/nonnull.ll

	Show First 20 Lines • Show All 1,648 Lines • ▼ Show 20 Lines
	; IS__CGSCC____-SAME: () [[ATTR1]] {			; IS__CGSCC____-SAME: () [[ATTR1]] {
	; IS__CGSCC____-NEXT: [[BC:%.]] = bitcast i8 ()* @function_decl to i8*			; IS__CGSCC____-NEXT: [[BC:%.]] = bitcast i8 ()* @function_decl to i8*
	; IS__CGSCC____-NEXT: ret i8* [[BC]]			; IS__CGSCC____-NEXT: ret i8* [[BC]]
	;			;
	%bc = bitcast i8() @function_decl to i8*			%bc = bitcast i8() @function_decl to i8*
	ret i8* %bc			ret i8* %bc
	}			}

				; FIXME: nonnull should not be propagated to the caller's p unless there is noundef
				define void @nonnull_caller(i8* %p) {
				; CHECK-LABEL: define {{[^@]+}}@nonnull_caller
				; CHECK-SAME: (i8* nonnull [[P:%.*]]) {
				; CHECK-NEXT: call void @nonnull_callee(i8* nonnull [[P]])
				; CHECK-NEXT: ret void
				;
				call void @nonnull_callee(i8* %p)
				ret void
				}

				declare void @nonnull_callee(i8* nonnull %p)

	attributes #0 = { null_pointer_is_valid }			attributes #0 = { null_pointer_is_valid }
	attributes #1 = { nounwind willreturn}			attributes #1 = { nounwind willreturn}

llvm/test/Transforms/FunctionAttrs/nonnull.ll

	Show First 20 Lines • Show All 321 Lines • ▼ Show 20 Lines
	}			}

	; Test propagation of nonnull callsite args back to caller.			; Test propagation of nonnull callsite args back to caller.

	declare void @use1(i8* %x)			declare void @use1(i8* %x)
	declare void @use2(i8* %x, i8* %y);			declare void @use2(i8* %x, i8* %y);
	declare void @use3(i8* %x, i8* %y, i8* %z);			declare void @use3(i8* %x, i8* %y, i8* %z);

	declare void @use1nonnull(i8* nonnull %x);			declare void @use1nonnull(i8* nonnull noundef %x);
	declare void @use2nonnull(i8* nonnull %x, i8* nonnull %y);			declare void @use1nonnull_without_noundef(i8* nonnull %x);
	declare void @use3nonnull(i8* nonnull %x, i8* nonnull %y, i8* nonnull %z);			declare void @use2nonnull(i8* nonnull noundef %x, i8* nonnull noundef %y);
				declare void @use3nonnull(i8* nonnull noundef %x, i8* nonnull noundef %y, i8* nonnull noundef %z);

	declare i8 @use1safecall(i8* %x) readonly nounwind ; readonly+nounwind guarantees that execution continues to successor			declare i8 @use1safecall(i8* %x) readonly nounwind ; readonly+nounwind guarantees that execution continues to successor

				; Without noundef, nonnull cannot be propagated to the parent

				define void @parent_poison(i8* %a) {
				; FNATTR-LABEL: @parent_poison(i8* %a)
				call void @use1nonnull_without_noundef(i8* %a)
				ret void
				}

	; Can't extend non-null to parent for any argument because the 2nd call is not guaranteed to execute.			; Can't extend non-null to parent for any argument because the 2nd call is not guaranteed to execute.

	define void @parent1(i8* %a, i8* %b, i8* %c) {			define void @parent1(i8* %a, i8* %b, i8* %c) {
	; FNATTR-LABEL: @parent1(i8* %a, i8* %b, i8* %c)			; FNATTR-LABEL: @parent1(i8* %a, i8* %b, i8* %c)
	; FNATTR-NEXT: call void @use3(i8* %c, i8* %a, i8* %b)			; FNATTR-NEXT: call void @use3(i8* %c, i8* %a, i8* %b)
	; FNATTR-NEXT: call void @use3nonnull(i8* %b, i8* %c, i8* %a)			; FNATTR-NEXT: call void @use3nonnull(i8* %b, i8* %c, i8* %a)
	; FNATTR-NEXT: ret void			; FNATTR-NEXT: ret void
	call void @use3(i8* %c, i8* %a, i8* %b)			call void @use3(i8* %c, i8* %a, i8* %b)
	▲ Show 20 Lines • Show All 431 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/call_nonnull_arg.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -instcombine -S \| FileCheck %s			; RUN: opt < %s -instcombine -S \| FileCheck %s

	; InstCombine should mark null-checked argument as nonnull at callsite			; InstCombine should mark null-checked argument as nonnull at callsite
	declare void @dummy(i32*, i32)			declare void @dummy(i32*, i32)

	define void @test(i32* %a, i32 %b) {			define void @test(i32* %a, i32 %b) {
	; CHECK-LABEL: @test(			; CHECK-LABEL: @test(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[COND1:%.]] = icmp eq i32 %a, null			; CHECK-NEXT: [[COND1:%.]] = icmp eq i32 [[A:%.*]], null
	; CHECK-NEXT: br i1 [[COND1]], label %dead, label %not_null			; CHECK-NEXT: br i1 [[COND1]], label [[DEAD:%.]], label [[NOT_NULL:%.]]
	; CHECK: not_null:			; CHECK: not_null:
	; CHECK-NEXT: [[COND2:%.*]] = icmp eq i32 %b, 0			; CHECK-NEXT: [[COND2:%.]] = icmp eq i32 [[B:%.]], 0
	; CHECK-NEXT: br i1 [[COND2]], label %dead, label %not_zero			; CHECK-NEXT: br i1 [[COND2]], label [[DEAD]], label [[NOT_ZERO:%.*]]
	; CHECK: not_zero:			; CHECK: not_zero:
	; CHECK-NEXT: call void @dummy(i32* nonnull %a, i32 %b)			; CHECK-NEXT: call void @dummy(i32* nonnull [[A]], i32 [[B]])
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	; CHECK: dead:			; CHECK: dead:
	; CHECK-NEXT: unreachable			; CHECK-NEXT: unreachable
	;			;
	entry:			entry:
	%cond1 = icmp eq i32* %a, null			%cond1 = icmp eq i32* %a, null
	br i1 %cond1, label %dead, label %not_null			br i1 %cond1, label %dead, label %not_null
	not_null:			not_null:
	%cond2 = icmp eq i32 %b, 0			%cond2 = icmp eq i32 %b, 0
	br i1 %cond2, label %dead, label %not_zero			br i1 %cond2, label %dead, label %not_zero
	not_zero:			not_zero:
	call void @dummy(i32* %a, i32 %b)			call void @dummy(i32* %a, i32 %b)
	ret void			ret void
	dead:			dead:
	unreachable			unreachable
	}			}

	; The nonnull attribute in the 'bar' declaration is			; The nonnull attribute in the 'bar' declaration is
	; propagated to the parameters of the 'baz' callsite.			; propagated to the parameters of the 'baz' callsite.

	declare void @bar(i8, i8 nonnull)			declare void @bar(i8, i8 nonnull noundef)
				declare void @bar_without_noundef(i8, i8 nonnull)
	declare void @baz(i8, i8)			declare void @baz(i8, i8)

	define void @deduce_nonnull_from_another_call(i8* %a, i8* %b) {			define void @deduce_nonnull_from_another_call(i8* %a, i8* %b) {
	; CHECK-LABEL: @deduce_nonnull_from_another_call(			; CHECK-LABEL: @deduce_nonnull_from_another_call(
	; CHECK-NEXT: call void @bar(i8* %a, i8* %b)			; CHECK-NEXT: call void @bar(i8* [[A:%.]], i8 [[B:%.*]])
	; CHECK-NEXT: call void @baz(i8* nonnull %b, i8* nonnull %b)			; CHECK-NEXT: call void @baz(i8* nonnull [[B]], i8* nonnull [[B]])
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	call void @bar(i8* %a, i8* %b)			call void @bar(i8* %a, i8* %b)
	call void @baz(i8* %b, i8* %b)			call void @baz(i8* %b, i8* %b)
	ret void			ret void
	}			}


				define void @deduce_nonnull_from_another_call2(i8* %a, i8* %b) {
				; CHECK-LABEL: @deduce_nonnull_from_another_call2(
				; CHECK-NEXT: call void @bar_without_noundef(i8* [[A:%.]], i8 [[B:%.*]])
				; CHECK-NEXT: call void @baz(i8* [[B]], i8* [[B]])
				; CHECK-NEXT: ret void
				;
				call void @bar_without_noundef(i8* %a, i8* %b)
				call void @baz(i8* %b, i8* %b)
				ret void
				}

llvm/test/Transforms/InstCombine/unused-nonnull.ll

Show All 29 Lines	null:
call void @call_if_null(i8* %ptr)		call void @call_if_null(i8* %ptr)
br label %done		br label %done

done:		done:
%retval = phi i32 [0, %entry], [%1, %do_work], [%1, %null]		%retval = phi i32 [0, %entry], [%1, %do_work], [%1, %null]
ret i32 %retval		ret i32 %retval
}		}

define i32 @compute(i8* nonnull %ptr, i32 %x) #1 {		define i32 @compute(i8* noundef nonnull %ptr, i32 %x) #1 {
; CHECK-LABEL: define {{[^@]+}}@compute		; CHECK-LABEL: define {{[^@]+}}@compute
; CHECK-SAME: (i8* nocapture nonnull readnone [[PTR:%.]], i32 returned [[X:%.]]) local_unnamed_addr #1		; CHECK-SAME: (i8* nocapture noundef nonnull readnone [[PTR:%.]], i32 returned [[X:%.]]) local_unnamed_addr #1
; CHECK-NEXT: ret i32 [[X]]		; CHECK-NEXT: ret i32 [[X]]
;		;
ret i32 %x		ret i32 %x
}		}

declare void @call_if_null(i8* %ptr) #0		declare void @call_if_null(i8* %ptr) #0

attributes #0 = { nounwind }		attributes #0 = { nounwind }
attributes #1 = { noinline nounwind readonly }		attributes #1 = { noinline nounwind readonly }

This is an archive of the discontinued LLVM Phabricator instance.

Allow nonnull/align attribute to accept poisonClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 317750

llvm/docs/LangRef.rst

llvm/include/llvm/IR/Argument.h

llvm/lib/Analysis/ValueTracking.cpp

llvm/lib/IR/Function.cpp

llvm/lib/Transforms/IPO/FunctionAttrs.cpp

llvm/test/Analysis/ValueTracking/known-nonnull-at.ll

llvm/test/Transforms/Attributor/align.ll

llvm/test/Transforms/Attributor/nonnull.ll

llvm/test/Transforms/FunctionAttrs/nonnull.ll

llvm/test/Transforms/InstCombine/call_nonnull_arg.ll

llvm/test/Transforms/InstCombine/unused-nonnull.ll

Allow nonnull/align attribute to accept poison
ClosedPublic