This is an archive of the discontinued LLVM Phabricator instance.

Add speculatable function attribute
ClosedPublic

Authored by arsenm on May 10 2016, 10:42 AM.

Details

Summary

This attribute tells the optimizer that the function may be speculated.

Diff Detail

Event Timeline


Rename the attribute to speculatable. This simplifies the patch a lot since
we are no longer trying to solve the problem where specifying one of
the memory properties for intrinsics implies that it has no side effects
(this should probably still be addressed in another patch).

I wasn't sure exactly how to define the speculatable attribute
in the language ref, so I copied the definition from the
isSafeToSpeculativelyExecute() function.

Makes sense to me. Is the idea to have isSafeToSpeculativelyExecute() return true on functions with this attribute?

Yes, this is the idea.
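To make the intended hookup concrete, here is a minimal C++ sketch of how such a query could consult the new attribute; this is illustrative only, not the code from this patch, and the Attribute::Speculatable enumerator name is an assumption:

#include "llvm/IR/Attributes.h"
#include "llvm/IR/Function.h"
#include "llvm/IR/Instructions.h"

// Sketch: treat a direct call as safe to speculate when the callee carries
// the speculatable attribute (the real isSafeToSpeculativelyExecute() also
// handles many other instruction kinds).
static bool calleeIsSpeculatable(const llvm::CallInst &CI) {
  const llvm::Function *Callee = CI.getCalledFunction();
  return Callee && Callee->hasFnAttribute(llvm::Attribute::Speculatable);
}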

hfinkel added inline comments.Jul 14 2016, 7:34 PM
docs/LangRef.rst
1494–1496

We should say something to indicate that speculatable does not imply CSE-able. Unless I am mistaken, it is possible for a function to be speculatable but return different results given the same parameters.

True; only a readnone speculatable function can be CSE'd. It might be readonly, but then you can't CSE it unless you know more about the memory it might access.
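A small hypothetical illustration of that distinction (not from the patch): a readonly function can be perfectly safe to execute speculatively and still not be mergeable, because the memory it reads may change between calls.

// Hypothetical example: imagine read_counter is readonly and speculatable.
// Executing it early or redundantly is harmless, but two calls may return
// different values, so they must not be CSE'd into one.
int counter;                       // written elsewhere in the program

int read_counter() { return counter; }

int use() {
  int a = read_counter();
  counter = 42;                    // clobbers the memory read_counter reads
  int b = read_counter();          // not equivalent to the first call
  return a + b;
}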

Rename the attribute to speculatable. This simplifies the patch a lot since
we are no longer trying to solve the problem where specifying one of
the memory properties for intrinsics implies that it has no side effects
(this should probably still be addressed in another patch).

I wasn't sure exactly how to define the speculatable attribute
in the language ref, so I copied the definition from the
isSafeToSpeculativelyExecute() function.

Makes sense to me. Is the idea to have isSafeToSpeculativelyExecute() return true on functions with this attribute? I'd find it strange if this were not the plan. Do we plan to have FuncAttrs infer the attribute? I'm wondering if we should say something about cost and how that will be handled. We don't want to speculate expensive-to-execute functions even if it is legal.

The SpeculativeExecution pass already considers the cost

Rename the attribute to speculatable. This simplifies the patch a lot since
we are no longer trying to solve the problem where specifying one of
the memory properties for intrinsics implies that it has no side effects
(this should probably still be addressed in another patch).

I wasn't sure exactly how to define the speculatable attribute
in the language ref, so I copied the definition from the
isSafeToSpeculativelyExecute() function.

Makes sense to me. Is the idea to have isSafeToSpeculativelyExecute() return true on functions with this attribute? I'd find it strange if this were not the plan. Do we plan to have FuncAttrs infer the attribute? I'm wondering if we should say something about cost and how that will be handled. We don't want to speculate expensive-to-execute functions even if it is legal.

I don't think we should gate inferring speculatable on a cost model, in the same way that we don't slap noinline on huge functions.
IMO, speculating function calls should likely be done via an IPO pass which is driven by some cost model.

We should probably have another, symmetric, attribute: sinkable. But that shouldn't be done with this patch.

We sort of already have this: convergent. Owen was talking about splitting convergent so that it really means do not sink, and then adding a speculatable attribute. The current convergent semantics would be the combination of no-sink and no-speculate.

-Matt

tstellarAMD edited edge metadata.

Fix wording in language ref and add a note about speculatable not implying
the function can be CSE'd.

tstellarAMD marked 2 inline comments as done.Jul 15 2016, 6:37 AM
hfinkel added inline comments.Jul 15 2016, 10:11 AM
docs/LangRef.rst
1497

I don't think that we should use "CSE'd" here as a term. We should say something about this attribute not being enough to conclude that the number of calls executed along any particular execution path will not be externally observable, or something along those lines. Akin, perhaps, to what we say for volatile.

Replace the term CSE'd in the language ref with a more precise definition.

I reworded Hal's suggested description slightly to try to emphasize the
fact that it's talking about the *number* of calls being externally
observable, not just that the call itself is externally observable.

hfinkel added inline comments.Jul 15 2016, 4:55 PM
docs/LangRef.rst
1498

Do you mean "will not be"?

Fix typo in language ref.

tstellarAMD marked an inline comment as done.Jul 18 2016, 3:57 AM
tstellarAMD added inline comments.
docs/LangRef.rst
1498

Yes, I did. This is fixed now.

mehdi_amini added inline comments.Jul 18 2016, 3:30 PM
docs/LangRef.rst
1499

"does not have any effects besides calculating its result" and "speculatable is not enough to conclude that [...] the number of calls to this function will not be externally observable." seem contradictory to me.

(Also you have a typo with exection instead of execution)

tstellarAMD marked an inline comment as done.Nov 15 2016, 5:08 AM
tstellarAMD added inline comments.
docs/LangRef.rst
1499

I would like to try to revive this discussion. We've gone back and forth a lot on the attribute description. The intention is that speculatable allows something to be speculatively executed, but is not enough by itself to determine whether or not the function can be CSE'd.

Would it make sense to replace:
'does not have any effects besides calculating its result'
with
'does not have any effects other than possibly reading/writing memory and calculating its result'

hfinkel added inline comments.Nov 15 2016, 6:59 AM
docs/LangRef.rst
1499

It also can't have any undefined behavior.

mehdi_amini added inline comments.Nov 15 2016, 8:56 AM
docs/LangRef.rst
1499

Isn't it enough to say: "This function attribute indicates that the function does not have undefined behavior, for any possible combination of arguments or global memory state"?

hfinkel added inline comments.Nov 15 2016, 2:10 PM
docs/LangRef.rst
1499

Yes, I think that sounds right (the comma is not necessary).
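As a concrete, hypothetical illustration of that wording: a function that is defined for every input and memory state qualifies, while one that can trap on some inputs does not.

// Hypothetical illustration only. ok() has no undefined behavior for any
// argument or memory state, so the proposed definition admits it; bad()
// divides by its argument and has UB when b == 0 (or INT_MIN / -1), so it
// could not be marked speculatable.
int ok(unsigned x) {
  int n = 0;
  while (x) { n += x & 1; x >>= 1; }   // population count, always defined
  return n;
}

int bad(int a, int b) { return a / b; }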

Update definition in LangRef.

mehdi_amini accepted this revision.Feb 1 2017, 5:16 PM
mehdi_amini added reviewers: sanjoy, chandlerc.

LGTM. But let's wait a little to give a chance to @sanjoy or @chandlerc to comment if they feel the need.

I repeat here one of the important earlier comments from @tstellarAMD, since it is not in the description and is easy to miss:

I added two new Intrinsic attributes IntrNoSideEffects and IntrHasSideEffects,
which make it possible to specify all the possible memory interaction / side effect
combinations. With these properties in place, it should be possible in the future
to drop the 'no side effect' portion of the intrinsic memory properties once targets
have been updated to use these new properties.
This revision is now accepted and ready to land.Feb 1 2017, 5:18 PM

LGTM. But let's wait a little to give a chance to @sanjoy or @chandlerc to comment if they feel the need.

I repeat here one of the important earlier comments from @tstellarAMD, since it is not in the description and is easy to miss:

I added two new Intrinsic attributes IntrNoSideEffects and IntrHasSideEffects,
which make it possible to specify all the possible memory interaction / side effect
combinations. With these properties in place, it should be possible in the future
to drop the 'no side effect' portion of the intrinsic memory properties once targets
have been updated to use these new properties.

This comment is actually from an earlier version of the patch. I've dropped this part and wrote this patch instead: https://reviews.llvm.org/D22459

sanjoy edited edge metadata.Mar 21 2017, 9:52 AM

Sorry, I didn't know that this was blocked on me. I'll review this by end of day today.

I'm okay with this as long as this is allowed only on specific intrinsics (i.e. it cannot be used by external clients, but we know that certain intrinsics are speculatable). I don't think we can allow this as a generic function attribute, since that allows dead code to affect program behavior. E.g.:

if (false)
  puts("hi") speculatable;

which can get transformed to

puts("hi") speculatable;
if (false)
  ;

The speculatable attribute on puts is incorrect, but we need to allow such "bogus" IR down dead paths. For instance, the original program could have been:

void do_call(fnptr f, bool is_speculatable) {
  if (is_speculatable)
    f("hi") speculatable;
  else
    f("hi");
}

// Later
do_call(puts, false);

I'm okay with this as long as this is allowed only on specific intrinsics (i.e. it cannot be used by external clients, but we know that certain intrinsics are speculatable). I don't think we can allow this as a generic function attribute, since that allows dead code to affect program behavior. E.g.:

if (false)
  puts("hi") speculatable;

which can get transformed to

puts("hi") speculatable;
if (false)
  ;

The speculatable attribute on puts is incorrect, but we need to allow such "bogus" IR down dead paths. For instance, the original program could have been:

void do_call(fnptr f, bool is_speculatable) {
  if (is_speculatable)
    f("hi") speculatable;
  else
    f("hi");
}

// Later
do_call(puts, false);

Then the program is broken to begin with? Why shouldn't speculatable apply to functions that only call other speculatable instructions, similar to how FunctionAttrs can infer this for readnone? Do you mean not expose a user visible attribute in the frontend?

I'm okay with this as long as this is allowed only on specific intrinsics (i.e. it cannot be used by external clients, but we know that certain intrinsics are speculatable). I don't think we can allow this as a generic function attribute, since that allows dead code to affect program behavior. E.g.:

if (false)
  puts("hi") speculatable;

which can get transformed to

puts("hi") speculatable;
if (false)
  ;

The speculatable attribute on puts is incorrect, but we need to allow such "bogus" IR down dead paths. For instance, the original program could have been:

void do_call(fnptr f, bool is_speculatable) {
  if (is_speculatable)
    f("hi") speculatable;
  else
    f("hi");
}

// Later
do_call(puts, false);

Then the program is broken to begin with? Why shouldn't speculatable apply to functions that only call other speculatable instructions, similar to how FunctionAttrs can infer this for readnone? Do you mean not expose a user visible attribute in the frontend?

readnone etc. are different from speculatable, in that once you mark a call site as speculatable you've marked that call site as speculatable throughout the lifetime of the program (since, by definition, it can be arbitrarily speculated). readnone, readonly etc. do not have that property.

But thinking about it a bit, I concede that speculatable as a generic (i.e. both intrinsic and non-intrinsic) function attribute is fine. However, it doesn't make sense as a call site attribute: being speculatable only down a control flow path is basically the antithesis of speculatable.

readnone etc. are different from speculatable, in that once you mark a call site as speculatable you've marked that call site as speculatable throughout the lifetime of the program (since, by definition, it can be arbitrarily speculated). readnone, readonly etc. do not have that property.

I don't follow why readnone and readonly based transformations don't need the same guarantee?

I'm okay with this as long as this is allowed only on specific intrinsics (i.e. it cannot be used by external clients, but we know that certain intrinsics are speculatable). I don't think we can allow this as a generic function attribute, since that allows dead code to affect program behavior. E.g.:

if (false)
  puts("hi") speculatable;

which can get transformed to

puts("hi") speculatable;
if (false)
  ;

The speculatable attribute on puts is incorrect, but we need to allow such "bogus" IR down dead paths. For instance, the original program could have been:

void do_call(fnptr f, bool is_speculatable) {
  if (is_speculatable)
    f("hi") speculatable;
  else
    f("hi");
}

// Later
do_call(puts, false);

Then the program is broken to begin with? Why shouldn't speculatable apply to functions that only call other speculatable instructions, similar to how FunctionAttrs can infer this for readnone? Do you mean not expose a user visible attribute in the frontend?

readnone etc. are different from speculatable, in that once you mark a call site as speculatable you've marked that call site as speculatable throughout the lifetime of the program (since, by definition, it can be arbitrarily speculated). readnone, readonly etc. do not have that property.

But thinking about it a bit, I concede that speculatable as a generic (i.e. both intrinsic and non-intrinsic) function attribute is fine. However, it doesn't make sense as a call site attribute: being speculatable only down a control flow path is basically the antithesis of speculatable.

True, but it can still be a callsite attribute. It can represent a data dependency:

int div(int a, int b) {
  return a/b;
}

div(q, 1); /* speculatable */

readnone etc. are different from speculatable, in that once you mark a call site as speculatable you've marked that call site as speculatable throughout the lifetime of the program (since, by definition, it can be arbitrarily speculated). readnone, readonly etc. do not have that property.

But thinking about it a bit, I concede that speculatable as a generic (i.e. both intrinsic and non-intrinsic) function attribute is fine. However, it doesn't make sense as a call site attribute: being speculatable only down a control flow path is basically the antithesis of speculatable.

True, but it can still be a callsite attribute. It can represent a data dependency:

int div(int a, int b) {
  return a/b;
}

div(q, 1); /* speculatable */

But div is not well defined for any possible combination of arguments or global memory state. I think I understand what you were getting at (that for all values of q the expression div(q, 1) is well defined), but the LangRef definition mentioned in this change does not phrase things that way.

However, this should be fine IMO:

int div_specialized(int a) speculatable {
  return div(a, 1);
}

readnone etc. are different from speculatable, in that once you mark a call site as speculatable you've marked that call site as speculatable throughout the lifetime of the program (since, by definition, it can be arbitrarily speculated). readnone, readonly etc. do not have that property.

I don't follow why readnone and readonly based transformations don't need the same guarantee?

I'm talking about cases like this:

void f(bool to_write, int* ptr) {
  if (to_write) *ptr = 20;
}

void g() {
  int a;
  f(false, &a) readnone;
}

Now f is not readnone generally (there are argument values for which it writes to memory), but in that specific call site we know that it does not.

What I was trying to say before is that, because of how we've defined speculatable, "speculatable for this specific argument / control dependence" does not make sense -- if you expand it out, the statement would be "for this specific set of arguments, global memory state and control dependence, this function does not have undefined behavior for any possible combination of arguments or global memory state", which contradicts itself.

Of course, due to this attribute, we will start being able to hoist calls and invokes. If those are marked readnone etc. we will have to strip those attributes if we hoisted said call or invoke out of control flow. But that's a different story.

readnone etc. are different from speculatable, in that once you mark a call site as speculatable you've marked that call site as speculatable throughout the lifetime of the program (since, by definition, it can be arbitrarily speculated). readnone, readonly etc. do not have that property.

But thinking about it a bit, I concede that speculatable as a generic (i.e. both intrinsic and non-intrinsic) function attribute is fine. However, it doesn't make sense as a call site attribute: being speculatable only down a control flow path is basically the antithesis of speculatable.

True, but it can still be a callsite attribute. It can represent a data dependency:

int div(int a, int b) {
  return a/b;
}

div(q, 1); /* speculatable */

But div is not well defined for any possible combination of arguments or global memory state. I think I understand what you were getting at (that for all values of q the expression div(q, 1) is well defined), but the LangRef definition mentioned in this change does not phrase things that way.

We might want to adjust the wording. When we say "any possible", for a call site attribute, we mean the values that can possibly present themselves at *that* call site (which may be constrained to be a subset of the values allowed for the input types by the semantics of the program). We mean the same thing for other call site attributes (e.g. readnone).

However, this should be fine IMO:

int div_specialized(int a) speculatable {
  return div(a, 1);
}

readnone etc. are different from speculatable, in that once you mark a call site as speculatable you've marked that call site as speculatable throughout the lifetime of the program (since, by definition, it can be arbitrarily speculated). readnone, readonly etc. do not have that property.

I don't follow why readnone and readonly based transformations don't need the same guarantee?

I'm talking about cases like this:

void f(bool to_write, int* ptr) {
  if (to_write) *ptr = 20;
}

void g() {
  int a;
  f(false, &a) readnone;
}

Now f is not readnone generally (there are argument values for which it writes to memory), but in that specific call site we know that it does not.

What I was trying to say before is that, because of how we've defined speculatable, "speculatable for this specific argument / control dependence" does not make sense -- if you expand it out, the statement would be "for this specific set of arguments, global memory state and control dependence, this function does not have undefined behavior for any possible combination of arguments or global memory state", which contradicts itself.

Of course, due to this attribute, we will start being able to hoist calls and invokes. If those are marked readnone etc. we will have to strip those attributes if we hoisted said call or invoke out of control flow. But that's a different story.

But I think this is exactly the point. We can consider value constraints due to data dependencies, but not from control dependencies. The same infects the other attributes as well (if it is tagged readnone and speculatable, the readnone can also not be control dependent - attributes are not metadata; we shouldn't be stripping them).

I can phrase my concerns as a question -- what is the intended behavior of the following program if condition is false?

int f(int a, int b) { return a / b; }

void main() {
  if (condition)
    f(5, 0) speculatable;
}

As far as I can tell, there are two possibilities:

  1. It is well defined -- main is a no-op.
  2. It has undefined behavior, due to the incorrect speculatable attribute.

If we go with (1), then we cannot hoist the call to f above the if -- we'd introduce UB if we do that. This is the interpretation I'm leaning towards, but this interpretation makes the speculatable attribute useless for general functions, since we're prevented from doing what it was supposed to enable in the first place.

If we go with (2), then we've admitted that "dead code" (that is, code that is not run) can influence program behavior, which I find troubling. In particular, I think foo1 and foo2 should be equivalent:

void foo1() {
}

void foo2() {
  if (false) {
    // Arbitrary syntactically valid stuff
  }
}

In other words, adding dead code should not change program behavior.

Even

int f(int a, int b) speculatable { return a / b; }

void main() {
  if (condition)
    f(5, 0);
}

has the same problem, so I take back my earlier concession on function annotations.

Now it is fine for us to have a speculatable analysis which would infer speculatable in limited cases, like int f() { return 10; }. But that's a module analysis, and not an attribute.
Having them on intrinsics is fine too, since we axiomatically know that certain intrinsics are speculatable, just like we know the add instruction is speculatable.

  1. It has undefined behavior, due to the incorrect speculatable attribute.

I don't see it as being anything other than this. Marking it as speculatable is saying it has no undefined behavior that a condition could possibly be avoiding. If you are lying about this, you get what you get.

  1. It has undefined behavior, due to the incorrect speculatable attribute.

I don't see it as being anything other than this. Marking it as speculatable is saying it has no undefined behavior that a condition could possibly be avoiding. If you are lying about this, you get what you get.

I agree.

If we go with (2), then we've admitted that "dead code" (that is, code that is not run) can influence program behavior, which I find troubling.

That troubles (and worries) me as well.

If we go with (2), then we've admitted that "dead code" (that is, code that is not run) can influence program behavior, which I find troubling.

That troubles (and worries) me as well.

Why? That's part of the promise we make when we tag the code as speculatable. We promise that doing the operation early, even in cases where it would not otherwise have been performed, is fine. Furthermore, we're promising that any properties the call has (e.g. promising that a certain argument is nonnull) must not be control dependent. As a result, looking at it as dead code affecting live code is suboptimal; any other properties are just a kind of global assertion.

That troubles (and worries) me as well.

Why?

Irrational fear? ;-)
It seems unusual, and I'm cautious about introducing unusual properties in the compiler in general; it makes it harder to reason about "stuff" when there aren't "simple" rules to guide the logic.
Are there other existing cases of UB induced by unreachable/dead code?

That troubles (and worries) me as well.

Why?

Irrational fear? ;-)

;-)

It seems unusual, and I'm cautious about introducing unusual properties in the compiler in general; it makes it harder to reason about "stuff" when there aren't "simple" rules to guide the logic.
Are there other existing cases of UB induced by unreachable/dead code?

Not as far as I know, but this is because we normally need to conservatively assume there might be control dependencies on just about everything we can't completely understand (i.e. a call to a function). This is novel because we're explicitly saying that there aren't such dependencies. This makes validity assertions more "global" (this is how I look at it), meaning that it matters less where in the CFG you put them (even in dead regions). The point is that, in some cases, this is exactly what we want and mean.

That troubles (and worries) me as well.

Why?

Irrational fear? ;-)
It seems unusual, and I'm cautious about introducing unusual properties in the compiler in general; it makes it harder to reason about "stuff" when there aren't "simple" rules to guide the logic.
Are there other existing cases of UB induced by unreachable/dead code?

I suppose it's the same as speculating a load from a pointer marked as dereferenceable that isn't really dereferenceable, which is already done.

That troubles (and worries) me as well.

Why?

Irrational fear? ;-)
It seems unusual, and I'm cautious about introducing unusual properties in the compiler in general; it makes it harder to reason about "stuff" when there aren't "simple" rules to guide the logic.
Are there other existing cases of UB induced by unreachable/dead code?

I suppose it's the same as speculating a load from a pointer marked as dereferenceable that isn't really dereferenceable, which is already done.

My understanding of what gives people pause is that:

if (p != nullptr)
  something();

if (false)
  foo(p /*nonnull*/) /* speculatable */

depending on the pass ordering, we might end up hoisting the call to foo and then using the nonnull assumption to simplify the if condition (even though that call is dynamically dead). We might not, however (and probably won't because we pretty eagerly remove dead code).

The thing to realize about speculatable is that it promotes all argument restrictions to properties of the argument values themselves. This might certainly seem surprising.
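Spelled out, the transformation sequence being worried about looks roughly like this (a hypothetical restatement of the snippet above, with declarations added only to make the shape concrete):

// After hoisting the dead speculatable call, its nonnull annotation on p can
// be (mis)used to fold the live null check away.
extern void foo(int *p);     // imagine: speculatable, p annotated nonnull
extern void something();

void example(int *p) {
  foo(p);                    // hoisted above both branches
  something();               // the p != nullptr test was folded to true
  if (false)
    ;                        // the original dead call site is now empty
}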

I suppose it's the same as speculating a load from a pointer marked as dereferenceable that isn't really dereferenceable, which is already done.

As far as I remember, dereferenceable and !dereferenceable are carefully designed to avoid this UB-from-dead-code situation.

If we go with (2), then we've admitted that "dead code" (that is, code that is not run) can influence program behavior, which I find troubling.

That troubles (and worries) me as well.

Why? That's part of the promise we make when we tag the code as speculatable. We promise that doing the operation early, even in cases where it would not otherwise have been performed, is fine. Furthermore, we're promising that any properties the call has (e.g. promising that a certain argument is nonnull) must not be control dependent. As a result, looking at it as dead code affecting live code is suboptimal; any other properties are just a kind of global assertion.

Making the behavior of a program depend even on instructions that are never executed seems like a fundamentally new thing, and I'm not yet convinced that that's safe. It may be possible to come up with a consistent model of this new thing, but I think the model will still be tricky to work with.

For instance, certain kinds of devirtualization optimizations are wrong in that model. Say we had:

void f() { }
void g() { 1 / 0; } // unused

void main() {
  fnptr p = f;
  (*p)() speculatable;
}

Now you're saying you can't specialize the call via function pointer like this:

void f() { }
void g() { 1 / 0; } // unused

void main() {
  fnptr p = f;
  if (p == g)
    g() speculatable;  // incorrect
  else
    (*p)() speculatable;
}

which seems odd.

There are also (somewhat more complex) cases like this that do not involve indirect speculatable calls:

struct Base {
  int k;
  Base(int k) : k(k) {}
};

struct S : public Base {
  S(bool c) : Base(c ? 10 : 20) {}
  virtual void f() {
    div(1, k) speculatable;  // k is either 10 or 20
  }
};

struct T : public Base {
  T() : Base(0) {}
  virtual void f() { }
};

void bug(Base* b) {
  b->f();
}

We have a problem in bug if we ever devirtualize, inline and hoist a load:

void bug(Base *b) {
  int k = b->k;
  if (b->type == S) {
    div(1, k) speculatable;
  } else if (b->type == T) {
  }
}

It also breaks "code compression" type optimizations:

if (a == 1 || a == 2) {
  switch (a) {
  case 1:
    div(m, 1) speculatable;
    break;
  case 2:
    div(m, 2) speculatable;
    break;
  }
}

to

if (a == 1 || a == 2) {
  div(m, a) speculatable;
}

to

if (a == 1 || a == 2) {
  if (a == 0)
    div(m, 0) speculatable;
  else
    div(m, a) speculatable;
}

It may be possible to come up with a consistent model of this new thing, but I think the model will still be tricky to work with.

This ^ ended up sounding more FUD'dy than I intended. What I meant to say is that I'm being cautious because this isn't obviously okay, not because this is obviously not okay.

If we go with (2), then we've admitted that "dead code" (that is, code that is not run) can influence program behavior, which I find troubling.

That troubles (and worries) me as well.

Why? That's part of the promise we make when we tag the code as speculatable. We promise that doing the operation early, even in cases where it would not otherwise have been performed, is fine. Furthermore, we're promising that any properties the call has (e.g. promising that a certain argument is nonnull) must not be control dependent. As a result, looking at it as dead code affecting live code is suboptimal; any other properties are just a kind of global assertion.

Making the behavior of a program depend even on instructions that are never executed seems like a fundamentally new thing, and I'm not yet convinced that that's safe. It may be possible to come up with a consistent model of this new thing, but I think the model will still be tricky to work with.

For instance, certain kinds of devirtualization optimizations are wrong in that model. Say we had:

void f() { }
void g() { 1 / 0; } // unused

void main() {
  fnptr p = f;
  (*p)() speculatable;
}

Now you're saying you can't specialize the call via function pointer like this:

void f() { }
void g() { 1 / 0; } // unused

void main() {
  fnptr p = f;
  if (p == g)
    g() speculatable;  // incorrect
  else
    (*p)() speculatable;
}

which seems odd.

There are also (somewhat more complex) cases like this that do not involve indirect speculatable calls:

struct Base {
  int k;
  Base(int k) : k(k) {}
};

struct S : public Base {
  S(bool c) : Base(c ? 10 : 20) {}
  virtual void f() {
    div(1, k) speculatable;  // k is either 10 or 20
  }
};

struct T : public Base {
  T() : Base(0) {}
  virtual void f() { }
};

void bug(Base* b) {
  b->f();
}

We have a problem in bug if we ever devirtualize, inline and hoist a load:

void bug(Base *b) {
  int k = b->k;
  if (b->type == S) {
    div(1, k) speculatable;
  } else if (b->type == T) {
  }
}

It also breaks "code compression" type optimizations:

if (a == 1 || a == 2) {
  switch (a) {
  case 1:
    div(m, 1) speculatable;
    break;
  case 2:
    div(m, 2) speculatable;
    break;
  }
}

to

if (a == 1 || a == 2) {
  div(m, a) speculatable;
}

to

if (a == 1 || a == 2) {
  if (a == 0)
    div(m, 0) speculatable;

Your "code compression" optimization just introduced dead code ;)

else
  div(m, a) speculatable;

}

I think that all of this is right, you can't apply some of these optimizations to call sites with the speculatable attribute. I agree, however, that we need to think carefully about how to define what speculatable means on an individual call site. Perhaps they're like convergent functions in this regard: you can't introduce new control dependencies (at least not in general).

It also breaks "code compression" type optimizations:

if (a == 1 || a == 2) {
  switch (a) {
  case 1:
    div(m, 1) speculatable;
    break;
  case 2:
    div(m, 2) speculatable;
    break;
  }
}

to

if (a == 1 || a == 2) {
  div(m, a) speculatable;
}

to

if (a == 1 || a == 2) {
  if (a == 0)
    div(m, 0) speculatable;

Your "code compression" optimization just introduced dead code ;)

Yea, I don't even know why I called it "code compression". :)

else
  div(m, a) speculatable;

}

I think that all of this is right, you can't apply some of these optimizations to call sites with the speculatable attribute. I agree, however, that we need to think carefully about how to define what speculatable means on an individual call site. Perhaps they're like convergent functions in this regard: you can't introduce new control dependencies (at least not in general).

I'd say that, as an initial step, support for intrinsics that we _know_ are speculatable (like we _know_ that add is speculatable) can land without any further discussion.

As for a generic speculatable attribute -- I need some time to think about this. Perhaps if done with sufficient care, it is possible.

I'd also like to ping @whitequark for comments. I had blocked D18738 some time back on similar grounds, but if we can reach a conclusion on speculatable, perhaps some of that can be transferred to !unconditionally_dereferenceable as well.

It also breaks "code compression" type optimizations:

if (a == 1 || a == 2) {
  switch (a) {
  case 1:
    div(m, 1) speculatable;
    break;
  case 2:
    div(m, 2) speculatable;
    break;
  }
}

to

if (a == 1 || a == 2) {
  div(m, a) speculatable;
}

to

if (a == 1 || a == 2) {
  if (a == 0)
    div(m, 0) speculatable;

Your "code compression" optimization just introduced dead code ;)

Yea, I don't even know why I called it "code compression". :)

else
  div(m, a) speculatable;

}

I think that all of this is right, you can't apply some of these optimizations to call sites with the speculatable attribute. I agree, however, that we need to think carefully about how to define what speculatable means on an individual call site. Perhaps they're like convergent functions in this regard: you can't introduce new control dependencies (at least not in general).

I'd say that, as an initial step, support for intrinsics that we _know_ are speculatable (like we _know_ that add is speculatable) can land without any further discussion.

I'm fine with restricting speculatable to only appear on a function declaration/definition unless/until we can figure out semantics for it on a call site in general. I don't want it restricted to intrinsics specifically, but I don't think that's the problem.

As for a generic speculatable attribute -- I need some time to think about this. Perhaps if done with sufficient care, it is possible.

I'd also like to ping @whitequark for comments. I had blocked D18738 some time back on similar grounds, but if we can reach a conclusion on speculatable, perhaps some of that can be transferred to !unconditionally_dereferenceable as well.

I'm fine with restricting speculatable to only appear on a function declaration/definition unless/until we can figure out semantics for it on a call site in general. I don't want it restricted to intrinsics specifically, but I don't think that's the problem.

Only function-level speculatable (and no call-site-specific speculatable) seems less problematic. It would mean that having a function declaration or definition incorrectly marked as speculatable, even if it is never called, is UB; but I can live with that as long as that is properly documented.

nlopes added a subscriber: nlopes.Mar 26 2017, 8:17 AM

Let me maybe zoom out and give a different perspective:
Right now call site and function attributes are an AND of predicates that are always guaranteed to hold for that specific call site or for all call sites, respectively.
Predicates include things like doesn't write to memory, only writes to memory that is not observable, etc. Using attributes we can state that several of these predicates hold.
In the ideal world, predicates wouldn't overlap, although since we can only state ANDs of predicates and not ORs, some overlap may be needed in practice.

Then we have an orthogonal concern, which is what precondition is sufficient to justify an optimization. For example, whether a function call can be executed speculatively is one such precondition. It can be derived by looking at the function attributes. For example, for speculative execution we probably need to know that the function doesn't write to memory and that it terminates.

So I feel that this speculatable attribute is not the right answer. It should be a helper function that derives its result from a set of function attributes, but shouldn't be an attribute on its own.
The only reason I could see to have it as an attribute would be to carry cost information. Just because a function can be executed speculatively, it doesn't mean it should be. We have infrastructure to record cost of intra-procedural edges, but not across functions AFAIK. That may need a more general solution for LTO anyway.

I'm just slightly concerned that the set of function attributes is growing pretty quickly without a more thorough discussion of the whole set of attributes rather than adding a new attribute every time someone wants to fix a problem. Some attributes are not well supported across the pipeline. It really feels like we need to stop for a moment and refactor function attributes.
Probably what we are missing is a "halts"/"terminating" attribute (stating that the function always returns). It has been discussed multiple times, and now we have another use case.

Then we have an orthogonal concern, which is what precondition is sufficient to justify an optimization. For example, whether a function call can be executed speculatively is one such precondition. It can be derived by looking at the function attributes. For example, for speculative execution we probably need to know that the function doesn't write to memory and that it terminates.

It is not enough (for example division by zero).

So I feel that this speculatable attribute is not the right answer. It should be a helper function that derives its result from a set of function attributes, but shouldn't be an attribute on its own.

Feel free to propose an alternative solution; right now there is no combination of attributes that is enough.

The only reason I could see to have it as an attribute would be to carry cost information.

I'm not convinced that attributes should carry "cost" information; is there a precedent for this?

dberlin added a subscriber: dberlin.EditedMar 26 2017, 2:53 PM

I think that all of this is right, you can't apply some of these optimizations to call sites with the speculatable attribute.

A lot of these (but not all of these) amount to "you cannot clone speculatable", i.e. if you clone the call, you must remove the attribute.

I believe the set of conditions under which you could clone the attribute are:

  1. The new call is CDEQ the original call (i.e. the set of conditions under which it executes is identical). If you are cloning from one function to another, it must be CDEQ using the interprocedural control dependence.
  2. The arguments are identical.
  3. The function called is identical or marked speculatable

Note this is not entirely shocking.
The "no cloning" is true of other attributes (you can't clone and apply readonly like is done in the devirt example above); it's just that, being an attribute about control dependence, the effects relate to control dependence.

Note: a lot of this is premature anyway. There is no possible way you could ever apply any of these optimizations correctly to speculatable, at all, until we fix post-dom to not be broken.

I agree, however, that we need to think carefully about how to define what speculatable means on an individual call site. Perhaps they're like convergent functions in this regard: you can't introduce new control dependencies (at least not in general).

Definitely true.

Either the CDEQ set of the call must not change, or you must be able to prove that changes cannot impact the function (i.e. you don't make it any less dead, etc.).

const int foo = bar = fred = 0;
if (foo)
  if (bar)
    if (fred)
      call baz() speculatable

You can prove hoisting into if (foo) cannot make it any less dead.
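The hoist being described would look roughly like this (a hypothetical restatement of the snippet above):

// baz() moves up to the if (foo) level, where it is provably still never
// executed because foo is a constant zero.
const int foo = 0, bar = 0, fred = 0;
extern void baz();           // imagine: marked speculatable

void example() {
  if (foo)
    baz();                   // hoisted past the bar and fred tests
}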

Then we have an orthogonal concern, which is what precondition is sufficient to justify an optimization. For example, whether a function call can be executed speculatively is one such precondition. It can be derived by looking at the function attributes. For example, for speculative execution we probably need to know that the function doesn't write to memory and that it terminates.

It is not enough (for example division by zero).

Sure; I didn't mean to enumerate a complete list.

So I feel that this speculatable attribute is not the right answer. It should be a helper function that derives its result from a set of function attributes, but shouldn't be an attribute on its own.

Feel free to propose an alternative solution; right now there is no combination of attributes that is enough.

My point is that attributes should be about the function behavior: they should be about the *how*, and not what kind of transformations you can do with respective call sites.
In the same way, today we don't have attributes like "call can be removed if result unused", "call can be removed if multiple calls with same argument", "call can be sunk into a loop", etc.
Instead we have things like "doesn't write to memory", "only writes to memory that is not observable by the caller", etc.
From our set of attributes we can infer if a transformation is valid or not. This approach scales much better: it's one attribute per behavior we want to capture instead of one attribute per transformation we want to do, and the number of transformations is surely much higher.

The other point is that this approach separates concerns: we have an analysis pass that decorates functions with attributes about how they behave, and then we have transformations that consume this information in some way. To me it seems useful to have this separation: it makes the analysis part much easier to reason about and to implement correctly.

My biggest concern with speculatable is that there's no intuitive semantics for it. If people already have different opinions of what "readonly" means (and it was supposed to have trivial semantics), then something as complex as speculatable seems like a can of worms.
And a month from now people will want more and more speculatable attributes. For example, "can be speculated across stores", "can be speculated across stores and function calls", etc. Doesn't seem to scale.

The only reason I could see to have it as an attribute would be to carry cost information.

I'm not convinced that attributes should carry "cost" information; is there a precedent for this?

No, and I didn't propose we do that. But at some point for applications like ThinLTO and PGO it seems that an inter-proc cost/information framework will be needed. ThinLTO needs to summarize what functions do. Imagine extending ThinLTO summaries to include information like "simplifies a lot if 2nd argument is false", "returns a number in range [0, 4]", etc. So it feels that eventually we will need an additional set of information to be attached to functions that we can probably not cover with the attribute framework we have today.
Anyway, that's a separate discussion.

Then we have an orthogonal concern, which is what precondition is sufficient to justify an optimization. For example, whether a function call can be executed speculatively is one such precondition. It can be derived by looking at the function attributes. For example, for speculative execution we probably need to know that the function doesn't write to memory and that it terminates.

It is not enough (for example division by zero).

Sure; I didn't mean to enumerate a complete list.

Still: my point was you *can't* provide a complete list because we're not capturing everything with the existing attributes.

So I feel that this speculatable attribute is not the right answer. It should be a helper function that derives its result from a set of function attributes, but shouldn't be an attribute on its own.

Feel free to propose an alternative solution; right now there is no combination of attributes that is enough.

My point is that attributes should be about the function behavior: they should be about the *how*, and not what kind of transformations you can do with respective call sites.
In the same way, today we don't have attributes like "call can be removed if result unused", "call can be removed if multiple calls with same argument", "call can be sunk into a loop", etc.
Instead we have things like "doesn't write to memory", "only writes to memory that is not observable by the caller", etc.
From our set of attributes we can infer if a transformation is valid or not. This approach scales much better: it's one attribute per behavior we want to capture instead of one attribute per transformation we want to do, and the number of transformations is surely much higher.

I get that, but I'm still not sure what you're suggesting *in this case*.
Is it just the name that bothers you?
Looking at the description of the attribute, it fits your definition somehow: "This function attribute indicates that the function does not have undefined behavior for any possible combination of arguments or global memory state."

The other point is that this approach separates concerns: we have an analysis pass that decorates functions with attributes about how they behave, and then we have transformations that consume this information in some way. To me it seems useful to have this separation: it makes the analysis part much easier to reason about and to implement correctly.

I don't see how this is related to this attribute in any way.

My biggest concern with speculatable is that there's no intuitive semantics for it. If people already have different opinions of what "readonly" means (and it was supposed to have trivial semantics), then something as complex as speculatable seems like a can of worms.
And a month from now people will want more and more speculatable attributes. For example, "can be speculated across stores", "can be speculated across stores and function calls", etc. Doesn't seem to scale.

I don't have this impression. If you feel some parts of the definition need to be more carefully specified, that's fine. But the way you're putting it forward above seems like FUD to me.

The only reason I could see to have it as an attribute would be to carry cost information.

I'm not convinced that attributes should carry "cost" information; is there a precedent for this?

No, and I didn't propose we do that. But at some point for applications like ThinLTO and PGO it seems that an inter-proc cost/information framework will be needed. ThinLTO needs to summarize what functions do. Imagine extending ThinLTO summaries to include information like "simplifies a lot if 2nd argument is false", "returns a number in range [0, 4]", etc. So it feels that eventually we will need an additional set of information to be attached to functions that we can probably not cover with the attribute framework we have today.

The way this is structured in ThinLTO is using in-memory analyses that don't attach their results to the IR. This is then part of the summaries directly. These are just a serialization of the in-memory analyses; they don't need any IR construct like attributes or metadata.

Let me maybe zoom out and give a different perspective:
Right now call site and function attributes are an AND of predicates that are always guaranteed to hold for that specific call site or for all call sites, respectively.
Predicates include things like doesn't write to memory, only writes to memory that is not observable, etc. Using attributes we can state that several of these predicates hold.
In the ideal world, predicates wouldn't overlap, although since we can only state ANDs of predicates and not ORs, some overlap may be needed in practice.

It behaves as an OR right now? Last time I checked the implementation, a call site just scans its own attributes and then checks the set on the declaration.

arsenm commandeered this revision.Mar 27 2017, 9:21 AM
arsenm updated this revision to Diff 93142.
arsenm added a reviewer: tstellarAMD.

Disallow on call sites

sanjoy added a comment.EditedMar 27 2017, 3:56 PM

I think that all of this is right, you can't apply some of these optimizations to call sites with the speculatable attribute.

A lot of these (but not all of these) amount to "you cannot clone speculatable", i.e. if you clone the call, you must remove the attribute.

I believe the set of conditions under which you could clone the attribute are:

  1. The new call is CDEQ the original call (i.e. the set of conditions under which it executes is identical). If you are cloning from one function to another, it must be CDEQ using the interprocedural control dependence.
  2. The arguments are identical.
  3. The function called is identical or marked speculatable

Note this is not entirely shocking.

(Caveat: I think we're arguing some subtle points here, so apologies if I misunderstood your intent.)

I think (1) is somewhat shocking. I'd say normally we follow a weaker constraint: "The new call is executed only if the old call was executed", not "The new call is executed if and only [edit: previously this incorrectly said "only if and only if"] if the old call was executed".

For instance, if we unroll loops (say by a factor of 2):

for (i = 0; i < N; i++)
  f(i) readonly;      // X

it is reasonable that the result be:

for (i = 0; i < N / 2; i += 2) {
  f(i) readonly;       // A
  f(i + 1) readonly;   // B
}

if (i < N)
  f(i++) readonly;     // C

Assuming I correctly understood what you meant by "the set of conditions under which it executes is identical", we won't be able to keep any of the readonly attributes.

  • X is executed for all i < N.
  • A is executed for all even i if N > 1
  • B is executed for all odd i if N > 1
  • C is executed if N is 1.

Of course, in the program trace the instances in which f was executed stay the same in the pre-unroll and post-unroll program; and that's related to what I was trying to say earlier: with the speculatable attribute, the behavior of a program is no longer a property of its trace -- there could be a "hidden" speculatable call somewhere that influences the behavior of the program without actually being executed (in the initial program), by getting hoisted into the executable bits of the program. In order to mark these programs as "buggy", we will have to rule such "hidden" speculatable calls (that nevertheless have side effects) as tainting the program with UB, despite not being present in the program trace.

the "no cloning" is true of other attributes (you can't clone and apply readonly like is done in the devirt example above)

Why can't you apply readonly to the devirtualization example?

it's just that, being an attribute about control dependence, the effects relate to control dependence.

Yes -- IMO that's the key problem with speculatable. Since it says "there is no control dependence", we cannot apply it in a control dependent manner.

I agree, however, that we need to think carefully about how to define what speculatable means on an individual call site. Perhaps they're like convergent functions in this regard: you can't introduce new control dependencies (at least not in general).

Definitely true.

Either the CDEQ set of the call must not change, or you must be able to prove that changes cannot impact the function (i.e. you don't make it any less dead, etc.).

What exactly do you mean by "the CDEQ set"? I could not find anything easily on Google. [edit: by CDEQ did you mean the cdequiv (control dependence equivalent) set? If so I think I know what you mean.]

Btw, is "make it any less dead" == "speculatively execute it", or something more subtle?

const int foo = bar = fred = 0;
if (foo)
  if (bar)
    if (fred)
      call baz() speculatable

You can prove hoisting into if (foo) cannot make it any less dead.

In the ideal world, predicates wouldn't overlap, although since we can only state ANDs of predicates and not ORs, some overlap may be needed in practice.

It behaves as an OR right now? Last time I checked the implementation, a call site just scans its own attributes and then checks the set on the declaration.

That behavior precisely means it is an AND -- a call site marked as readonly nounwind is both readonly AND nounwind.

One concrete example illustrating Nuno's point is the dereferenceable_or_null attribute. If we had an is_null attribute (or emulated it via !range), then we would not need a new attribute to denote that a value is either dereferenceable or null.

In the ideal world, predicates wouldn't overlap, although since we can only state ANDs of predicates and not ORs, some overlap may be needed in practice.

It behaves as an OR right now? Last time I checked the implementation, a call site just scans its own attributes and then checks the set on the declaration.

That behavior precisely means it is an AND -- a call site marked as readonly nounwind is both readonly AND nounwind.

One concrete example illustrating Nuno's point is the dereferenceable_or_null attribute. If we had an is_null attribute (or emulated it via !range), then we would not need a new attribute to denote that a value is either dereferenceable or null.

OK, I was thinking about it like: Is this call readnone? It is if the call site OR the declaration is readnone.
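A rough C++ sketch of that reading of the query (not the actual LLVM implementation; the helper name is invented, and in current LLVM CallInst::hasFnAttr already consults the callee as well, so the second check here is only spelling out the logic):

#include "llvm/IR/Attributes.h"
#include "llvm/IR/Function.h"
#include "llvm/IR/Instructions.h"

// Sketch of the "OR" view: a call counts as readnone if either the call-site
// attributes or the callee declaration say so.
static bool callIsReadNone(const llvm::CallInst &CI) {
  if (CI.hasFnAttr(llvm::Attribute::ReadNone))
    return true;
  const llvm::Function *F = CI.getCalledFunction();
  return F && F->hasFnAttribute(llvm::Attribute::ReadNone);
}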

I think that all of this is right, you can't apply some of these optimizations to call sites with the speculatable attribute.

A lot of these (but not all of these) amount to "you cannot clone speculatable", i.e. if you clone the call, you must remove the attribute.

I believe the set of conditions under which you could clone the attribute are:

  1. The new call is CDEQ the original call (i.e. the set of conditions under which it executes is identical). If you are cloning from one function to another, it must be CDEQ using the interprocedural control dependence.
  2. The arguments are identical.
  3. The function called is identical or marked speculatable

Note this is not entirely shocking.

(Caveat: I think we're arguing some subtle points here, so apologies if I misunderstood your intent.)

I think (1) is somewhat shocking. I'd say normally we follow a weaker constraint: "The new call is executed only if the old call was executed", not "The new call is executed if and only [edit: previously this incorrectly said "only if and only if"] if the old call was executed".

For instance, if we unroll loops (say by a factor of 2):

for (i = 0; i < N; i++)
  f(i) readonly;      // X

it is reasonable that the result be:

for (i = 0; i < N / 2; i += 2) {
  f(i) readonly;       // A
  f(i + 1) readonly;   // B
}

I'm presuming you meant this loop to run half as many iterations but still cover the same values in the same order in calls to f?
(It doesn't.)

If so, I agree we allow it in practice, but that's just in practice.
Here is an implementation of f that is, IMHO, valid for a readonly call-site attribute; it's not legally doable in C, but we're talking about LLVM IR anyway.

void f(int a)
{
  // go scrobbling through instruction stream to see what the increment is
  if (increment == 2)
    write some memory
  otherwise
    do nothing
}

Now, obviously, I'm cheating, but the point is, f can read whatever state it wants here, detect what you've done, and not be readonly in that case.
Again, if I'm understanding things right (the langref really is somewhat confusing to me here), I assume it's legal for f to not be readonly except for that call. That's what it seems like we are talking about here.

Given they can also unwind and read the state of other functions, I'm pretty positive you can come up with easier implementations than what I just did that detect and write memory differently in the two cases, without resorting to this trickery.

If you meant to change the loop iteration space, I disagree even harder that it's allowed :)

Assuming I correctly understood what you meant by "the set of conditions under which it executes is identical", we won't be able to keep any of the readonly attributes.

  • X is executed for all i < N.
  • A is executed for all even i if N > 1
  • B is executed for all odd i if N > 1
  • C is executed if N is 1.

Of course, in the program trace the instances in which f was executed stay the same in the pre-unroll and post-unroll program; and that's related to what I was trying to say earlier: with the speculatable attribute, the behavior of a program is no longer a property of its trace -- there could be a "hidden" speculatable call somewhere that influences the behavior of the program without actually being executed (in the initial program), by getting hoisted into the executable bits of the program. In order to mark these programs as "buggy", we will have to rule such "hidden" speculatable calls (that nevertheless have side effects) as tainting the program with UB, despite not being present in the program trace.

Again, this is literally what it means to play with control dependence, so I *don't* find it shocking.

the "no cloning" is true of other attributes (you can't clone and apply readonly like is done in the devirt example above)

Why can't you apply readonly to the devirtualization example?

Sorry, I meant to delete it. You can because it's dead code; otherwise, you can't.

it's just that, being an attribute about control dependence, the effects relate to control dependence.

Yes -- IMO that's the key problem with speculatable. Since it says "there is no control dependence", we cannot apply it in a control dependent manner.

Sure, I'd agree with that, but you really cannot work around that.

I agree, however, that we need to think carefully about how to define what speculatable means on an individual call site. Perhaps they're like convergent functions in this regard: you can't introduce new control dependencies (at least not in general).

Definitely true.

Either the CDEQ set of the call must not change, or you must be able to prove that changes cannot impact the function (i.e., you don't make it any less dead, etc.).

What exactly do you mean by "the CDEQ set"? I could not find anything easily on Google. [edit: by CDEQ did you mean the cdequiv (control dependence equivalent) set? If so I think I know what you mean.]

Yes, cdequiv, sorry. It's called CDEQ by some papers, and cdequiv by others.

It's the equivalence classes of the control dependence graph, basically. The set of blocks/statements/etc (depends on what level you look) that execute only under the same conditions.
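As a minimal LLVM IR sketch of that notion (the callees @A, @B, and @C are hypothetical):

define void @example(i1 %c) {
entry:
  br i1 %c, label %then, label %join

then:
  call void @A()        ; @A and @B are CDEQ: each executes exactly when %c is true
  call void @B()
  br label %join

join:
  call void @C()        ; @C is not CDEQ with @A/@B: it executes unconditionally
  ret void
}

declare void @A()
declare void @B()
declare void @C()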

Btw, is "make it any less dead" == "speculatively execute it", or something more subtle?

yes, that's what we'd call it. but it's not executed in any case, so i guess it'd be "speculatively hoist it".
:)

const int foo = 0, bar = 0, fred = 0;
if (foo)
  if (bar)
    if (fred)
      call baz() speculatable

You can prove hoisting into if (foo) cannot make it any less dead.

I think that all of this is right, you can't apply some of these optimizations to call sites with the speculatable attribute.

A lot of these (but not all of these) amount to "you cannot clone speculatable", ie if you clone the call, you must remove the attribute.

I believe the set of conditions under which you could clone the attribute are:

  1. The new call is CDEQ with the original call (i.e., the set of conditions under which it executes is identical). If you are cloning from one function to another, it must be CDEQ using the interprocedural control dependence.
  2. The arguments are identical.
  3. The function called is identical or marked speculatable

Note this is not entirely shocking.

(Caveat: I think we're arguing some subtle points here, so apologies if I misunderstood your intent.)

I think (1) is somewhat shocking. I'd say normally we follow a weaker constraint: "The new call is executed only if the old call was executed", not "The new call is executed if and only [edit: previously this incorrectly said "only if and only if"] if the old call was executed".

For instance, if we unroll loops (say by a factor of 2):

for (i = 0; i < N; i++)
  f(i) readonly;      // X

it is reasonable that the result be:

for (i = 0; i < N / 2; i += 2) {
  f(i) readonly;       // A
  f(i + 1) readonly;   // B
}

I'm presuming you meant this loop to go half as much but still cover the same values in the same order in calls to f?
(it doesn't)

D'oh. I did a split-brain there. :)

If so, I agree we allow it in practice, but that's in practice.
Here is an implementation of f that is, IMHO, valid under a callsite readonly attribute; it's not legally doable in C, but we're talking about LLVM IR anyway.

void f(int a)
{
  // go scrobbling through instruction stream to see what the increment is
  if (increment == 2)
    write some memory
  otherwise
    do nothing
}

Now, obviously, i'm cheating, but the point is, f can read whatever state it wants here, and detect what you've done, and not be readonly in that case.

I don't think we can reasonably allow functions to base their behavior on the instruction stream of their callers (or anywhere else, for that matter), in IR or in C.

For instance, if we allowed things like that, even basic optimizations like this:

void caller() {
  int a = 2, b = 3;
  f(a + b);
}

to

void caller() {
  f(5);
}

would not be valid, since an implementation of f could be:

void f(int x) {
  cond = does the caller's instruction stream have an add instruction?
  if (cond)
    print("hi");
}

and we'd change observable behavior by optimizing out the add instruction.

Again, if I'm understanding things right (the langref really is somewhat confusing to me here), I assume it's legal for f to not be readonly except at that call site; that's what it seems like we are talking about here.

I think code like this is fine:

void f(bool do_store, int* ptr) {
  if (do_store)
    *ptr = 42;
}

...
f(false, ptr) readnone
...

The example you gave above seems odd to me because the callee has different behavior based on the instruction stream of the caller, but the conditionally-readonly aspect of it is fine.

Given they can also unwind, and read state of other functions, i'm pretty positive you can come up with implementations easier than what i just did to detect and write memory in the two cases differently, without resorting to this trickery.

If there was a legal way to detect a difference between the two cases above, then loop unrolling would be illegal. So even if there is some way to detect the difference, I'd consider that a bug in the LLVM IR semantics since it disallows an important optimization.

If you meant to make the loop iteration space change, i disagree with you that it's allowed even harder :)

Yes, we can't transform the iteration space, since if, instead of executing f(0) first, we execute f(N - 1) first, we'd break an f that was implemented as:

void f(int i) {
  // I don't think this is possible to do without writing memory in C++,
  // but it may be possible in other languages.
  throw i;
}

in which case the pre-transformed program would throw 0, but the post-transform program would throw N - 1.

Assuming I correctly understood what you meant by "the set of conditions under which it executes is identical", we won't be able to keep any of the readonly attributes.

  • X is executed for all i < N.
  • A is executed for all even i if N > 1
  • B is executed for all odd i if N > 1
  • C (a remainder call, not shown in the snippet above) is executed if N is 1.

Of course, in the program trace the instances in which f was executed stay the same in the pre-unroll and post-unroll program; and that's related to what I was trying to say earlier: with the speculatable attribute, the behavior of a program is no longer a property of its trace -- there could be a "hidden" speculatable call somewhere that influences the behavior of the program without actually being executed (in the initial program), by getting hoisted into the executable bits of the program. In order to mark these programs as "buggy", we will have to rule such "hidden" speculatable calls (that nevertheless have side effects) as tainting the program with UB, despite not being present in the program trace.

Again, this is literally what it means to play with control dependence, so i *don't* find it shocking.

I'm not entirely sure what you mean here -- did you mean that it is okay to have stuff outside the program trace affect program definedness? Or did you mean the converse?

the "no cloning" is true of other attributes (you can't clone and apply readonly like is done in the devirt example above)

Why can't you apply readonly to the devirtualization example?

Sorry, I meant to delete it. You can because it's dead code; otherwise, you can't.

it's just that, being an attribute about control dependence, the effects relate to control dependence.

Yes -- IMO that's the key problem with speculatable. Since it says "there is no control dependence", we cannot apply it in a control dependent manner.

Sure, i'd agree with that, but you really cannot work around that.

Yes!

I think that all of this is right, you can't apply some of these optimizations to call sites with the speculatable attribute.

A lot of these (but not all of these) amount to "you cannot clone speculatable", ie if you clone the call, you must remove the attribute.

I believe the set of conditions under which you could clone the attribute are:

  1. The new call is CDEQ with the original call (i.e., the set of conditions under which it executes is identical). If you are cloning from one function to another, it must be CDEQ using the interprocedural control dependence.
  2. The arguments are identical.
  3. The function called is identical or marked speculatable

Note this is not entirely shocking.

(Caveat: I think we're arguing some subtle points here, so apologies if I misunderstood your intent.)

I think (1) is somewhat shocking. I'd say normally we follow a weaker constraint: "The new call is executed only if the old call was executed", not "The new call is executed if and only [edit: previously this incorrectly said "only if and only if"] if the old call was executed".

For instance, if we unroll loops (say by a factor of 2):

for (i = 0; i < N; i++)
  f(i) readonly;      // X

it is reasonable that the result be:

for (i = 0; i < N / 2; i += 2) {
  f(i) readonly;       // A
  f(i + 1) readonly;   // B
}

I'm presuming you meant this loop to go half as much but still cover the same values in the same order in calls to f?
(it doesn't)

D'oh. I did a split-brain there. :)

If so, I agree we allow it in practice, but that's in practice.
Here is an implementation of f that is, IMHO, valid under a callsite readonly attribute; it's not legally doable in C, but we're talking about LLVM IR anyway.

void f(int a)
{
  // go scrobbling through instruction stream to see what the increment is
  if (increment == 2)
    write some memory
  otherwise
    do nothing
}

Now, obviously, i'm cheating, but the point is, f can read whatever state it wants here, and detect what you've done, and not be readonly in that case.

I don't think we can reasonably allow functions to base their behavior on the instruction stream of their callers (or anywhere else, for that matter), in IR or in C.

As mentioned, i don't think that's strictly necessary to screw up what you did. But i'm willing to admit those are likely semantic bugs.
:)
In any case, I was just half-pointing out that I don't think the semantics of the other attributes are so cut and dried in their control dependence, etc., when applied to callsites, that speculatable on callsites is that bad.

For instance, if we allowed things like that, even basic optimizations like this:

void caller() {
  int a = 2, b = 3;
  f(a + b);
}

to

void caller() {
  f(5);
}

would not be valid, since an implementation of f could be:

void f(int x) {
  cond = does the caller's instruction stream have an add instruction?
  if (cond)
    print("hi");
}

and we'd change observable behavior by optimizing out the add instruction.

Right, my point was more on the side of "we are pretending we've got the semantics of these other instructions down really well and they are easy to understand", when, honestly, there is nothing in the langref that outlaws what i wrote :)

The closest it comes is "when called with the same set of arguments and global state"
But you aren't calling it with the same global state depending on the definition of global state, which .. we don't define anywhere.

The example you gave above seems odd to me because the callee has different behavior based on the instruction stream of the caller, but the conditionally-readonly aspect of it is fine.

As mentioned, I'm pretty positive I could construct an example just as bad given the limitations expressed in the langref :)

Given they can also unwind, and read state of other functions, i'm pretty positive you can come up with implementations easier than what i just did to detect and write memory in the two cases differently, without resorting to this trickery.

If there was a legal way to detect a difference between the two cases above, then loop unrolling would be illegal.

FWIW, I'm honestly too lazy to go construct one, but I'm basically positive you can. At least, legal given the semantics and restrictions on LLVM IR as it exists today, because LLVM IR is not *that* well defined.
I would not believe it well-defined in C, but only because unwinders, etc are usually not well defined.
This is basically an exercise in the moral equivalent of Gödel numbering.
I'm going to go with "yes"
Is it possible to define our IR well enough to outlaw that: Maybe?
I'm not going to tug at this thread any longer though :P

So even if there is some way to detect the difference, I'd consider that a bug in the LLVM IR semantics since it disallows an important optimization.

I guess my basic point was exactly this - the semantics of our other attributes, compared to speculatable, are not so easy or non-shocking that I think artificially limiting speculatable to non-callsites only makes sense. Particularly because those attributes may not be so well defined that their behavior makes complete and total sense when applied to callsites.
Instead, when we discover they are wrong we fix them. So why not just mark speculatable on callsites as experimental, see what happens, and go from there?

Bottom line: IMHO, We are unlikely to ever figure out the specific set of issues we will hit on callsites with speculatable if we don't allow it there. Yeah, we'll figure out if it's completely broken if we don't, but we all seem to agree that it's probably not?

I'm going to drop the rest of this, fwiw, because i don't think it's worth pushing the readonly example any further.
I'm positive i can come up with a program that meets whatever requirements you throw at it and still breaks in your example, precisely because we don't define our IR well enough to prevent it (I'm on vacation, so i'm not going to think about whether i could prove that you can't make the IR well defined enough to prevent it and still have it express useful programs :P)

Right, my point was more on the side of "we are pretending we've got the semantics of these other instructions down really well and they are easy to understand", when, honestly, there is nothing in the langref that outlaws what i wrote :)

The closest it comes is "when called with the same set of arguments and global state"
But you aren't calling it with the same global state depending on the definition of global state, which .. we don't define anywhere.

Of course. I'm not claiming that the LangRef is a mathematically precise document (though parts of it could be, if we incorporated Vellvm). However, the converse of "it is mathematically precise" is not "anything goes". :)

In other words, I think despite the semantics of LLVM IR being only informally specified, we can still reasonably draw *some* boundaries on what the semantics of various constructs *can* be, based on the optimizations we want to be correct.

So even if there is some way to detect the difference, I'd consider that a bug in the LLVM IR semantics since it disallows an important optimization.

I guess my basic point was exactly this - the semantics of our other attributes, compared to speculatable, are not so easy or non-shocking that i think artificially limiting speculatable to non-callsites only makes sense.

I'm not sure if I agree with the "artificially" characterization. My objection was based around (what I think are) concrete problems that we'll have if we allow this.

Particularly because those attributes may not be so well defined that their behavior makes complete and total sense when applied to callsites.

I agree we have room to improve today. However, I don't see how two wrongs make a right here.

Instead, when we discover they are wrong we fix them. So why not just mark speculatable on callsites as experimental, see what happens, and go from there?
Bottom line: IMHO, We are unlikely to ever figure out the specific set of issues we will hit on callsites with speculatable if we don't allow it there. Yeah, we'll figure out if it's completely broken if we don't, but we all seem to agree that it's probably not?

As I said, my objection was based on my opinion that they're already discovered to be wrong. Of course, my (counter-)examples may not stand up to scrutiny, in which case my objection is moot.

Having said all this: while I won't exactly be happy with it, I would be fine with allowing speculatable on normal call sites if that helps some really compelling use case. But I want to make it clear that we're making a tradeoff here.

I'm going to drop the rest of this, fwiw, because i don't think it's worth pushing the readonly example any further.
I'm positive i can come up with a program that meets whatever requirements you throw at it and still breaks in your example, precisely because we don't define our IR well enough to prevent it (I'm on vacation, so i'm not going to think about whether i could prove that you can't make the IR well defined enough to prevent it and still have it express useful programs :P)

So even if there is some way to detect the difference, I'd consider that a bug in the LLVM IR semantics since it disallows an important optimization.

I guess my basic point was exactly this - the semantics of our other attributes, compared to speculatable, are not so easy or non-shocking that i think artificially limiting speculatable to non-callsites only makes sense.

I'm not sure if I agree with the "artificially" characterization. My objection was based around (what I think are) concrete problems that we'll have if we allow this.

Sorry, let me try to explain:
You have identified concrete problems.
IMHO, your concrete problems are not with the definition of the attribute, but instead are "things wanting to optimize callsites and keep this attribute have to understand control dependence".
But this is pretty much a truism to me given *any* sane definition of the attribute on callsites.
Additionally, IMHO, there can be *no* sensible way to define this attribute such that the problems go away.
So from my perspective, the concrete problems you've identified are going to get solved in exactly the same way whether we add them for callsites today, tomorrow, or five years from now.
Hence I see the limitation as fairly artificial. We aren't going to solve these problems other than by auditing all the callsite optimizations and either making them drop this attribute, or deeply understand control dependence enough to do a correct thing.

As I said, my objection was based on my opinion that they're already discovered to be wrong.

I guess this is where we disagree.
I do not believe you have pointed out that there is something wrong with the semantic, at least in a way that you could solve.
I believe you have pointed out it will interact badly if we don't audit our code.

I'd be fine if our answer was "hey, start a patch with all the fixes necessary to make this work properly on callsites".
But otherwise, what's the path forward?
IE what do you see as a definition of these attributes on callsites that is sane but doesn't have the issues you foresee?

So from my perspective, the concrete problems you've identified are going to get solved in exactly the same way whether we add them for callsites today, tomorrow, or five years from now.

I agree.

Hence I see the limitation as fairly artificial. We aren't going to solve these problems other than by auditing all the callsite optimizations and either making them drop this attribute, or deeply understand control dependence enough to do a correct thing.

I guess my English isn't strong enough to quite grok that use of "artificial". :) I partially agree with "We aren't going to solve these problems other ... enough to do a correct thing.", but please see below.

As I said, my objection was based on my opinion that they're already discovered to be wrong.

I guess this is where we disagree.
I do not believe you have pointed out that there is something wrong with the semantic, at least in a way that you could solve.
I believe you have pointed out it will interact badly if we don't audit our code.

What I was *trying* to show was that the context-sensitive-speculatable semantic introduces a fundamentally new behavior -- that dead code can (now) affect the semantics of a program. I don't think we have this in LLVM today (today the behavior of an LLVM program can be deduced solely from the *trace* of the instructions that were actually executed), and the impact of changing LLVM to allow dead code to have this "action at a distance" is unknown to me.

That ^ is really the core of my objection. Almost everything else I said can be traced back to the above.

And making dead code affect behavior sets off all sorts of alarms in my head. I've mentioned some of the more concrete problems earlier, but the fundamental thing that is bugging me is that if (false) { X } will not be a NOP for some values of X.

I guess what you're saying is that there is nothing fundamental about the dead-code-influencing-behavior problem, and that it is just a matter of fixing passes to do the right thing?

If so, I do not have a better answer to that than a vague sense of unease. :)

I'd be fine if our answer was "hey, start a patch with all the fixes necessary to make this work properly on callsites".
But otherwise, what's the path forward?
IE what do you see as a definition of these attributes on callsites that is sane but doesn't have the issues you foresee?

If I was able to sell you on semantically-relevant dead code being a bad thing, then there is no path forward with a call-site-specific speculatable that is as strong as is implied in this patch. We can probably do something weaker though (say, allow it only on calls with all-constant arguments).

If you're okay with semantically-relevant-dead-code and its consequences, then it is a matter of fixing all of the passes that assume arbitrary dead code is "okay".

I'm leaning towards the former, but I'd understand if people wanted to do the latter.
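For illustration, a rough LLVM IR sketch of the weaker, constant-arguments-only variant mentioned above (this is only a hypothetical rule; @baz and the callsite marking are made up):

declare i32 @baz(i32)

define void @caller(i32 %n, i1 %c) {
entry:
  br i1 %c, label %then, label %exit

then:
  %a = call i32 @baz(i32 42) speculatable  ; all-constant arguments: the weaker rule would allow this
  %b = call i32 @baz(i32 %n)               ; non-constant argument: no callsite attribute
  br label %exit

exit:
  ret void
}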

FWIW, I don't see much of a practical need for speculatable to apply to individual call sites. There are a few cases where I think it might be useful on specific intrinsic call sites, but they aren't really interesting enough to worry about.

arsenm updated this revision to Diff 97117.Apr 28 2017, 10:13 AM

Only allow for intrinsics

Only allow for intrinsics

Why only for intrinsics? I thought we had concluded that we'd only allow it for declarations and not on call sites (which may technically mean on call sites, but only matching the declaration). I think it is important that we can apply it to regular functions.

Only allow for intrinsics

Why only for intrinsics? I thought we had concluded that we'd only allow it for declarations and not on call sites (which may technically mean on call sites, but only matching the declaration). I think it is important that we can apply it to regular functions.

I think so too, but @sanjoy said to restrict it to intrinsics for now. Intrinsics are the important part. I also ran into one minor issue with the call site restriction in D32655 where speculatable intrinsic calls are sometimes replaced with non-speculatable libcalls.

Why only for intrinsics? I thought we had concluded that we'd only allow it for declarations and not on call sites (which may technically mean on call sites, but only matching the declaration). I think it is important that we can apply it to regular functions.

I think a general speculatable attribute that is allowed only on function decls is *less problematic*[0] than a context-sensitive one, but I think speculatable intrinsics are clearly okay. Therefore my opinion (which I expressed on IRC to Matt) is to first land the intrinsic variant of this, since that's what he's blocked on; and then we can go ahead with more aggressive variants in subsequent patches.

[0] https://reviews.llvm.org/D20116#709352

Only allow for intrinsics

Why only for intrinsics? I thought we had concluded that we'd only allow it for declarations and not on call sites (which may technically mean on call sites, but only matching the declaration). I think it is important that we can apply it to regular functions.

I think so too, but @sanjoy said to restrict it to intrinsics for now.

He suggested that, and I said that I did not want that restriction, and he said that he was fine with that:

I'm fine with restricting speculatable to only appear where it appears on a function declaration/definition unless/until we can figure out semantics for it on a call site in general. I don't want it restricted to intrinsics specifically, but I don't think that's the problem.

Only function-level speculatable (and no call-site-specific speculatable) seems less problematic. It would mean that having a function declaration or definition incorrectly marked as speculatable, even if it is never called, is UB; but I can live with that as long as it is properly documented.
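For reference, a sketch of the declaration-only form being discussed (the function name is hypothetical; only the placement of the attribute matters):

; Marking the declaration asserts the callee is safe to execute speculatively on
; any arguments at any program point; if that is false, the module is broken (UB)
; even if no call to it is ever executed.
declare float @pure_table_lookup(i32) #0

attributes #0 = { nounwind readnone speculatable }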

Intrinsics are the important part.

Not to me ;)

I also ran into one minor issue with the call site restriction in D32655 where speculatable intrinsic calls are sometimes replaced with non-speculatable libcalls.

This seems like it is an unfortunate information loss that we should fix, but why is that a problem?

Why only for intrinsics? I thought we had concluded that we'd only allow it for declarations and not on call sites (which may technically mean on call sites, but only matching the declaration). I think it is important that we can apply it to regular functions.

I think a general speculatable attribute that is allowed only on function decls is *less problematic*[0] than a context-sensitive one, but I think speculatable intrinsics are clearly okay. Therefore my opinion (which I expressed on IRC to Matt) is to first land the intrinsic variant of this, since that's what he's blocked on; and then we can go ahead with more aggressive variants in subsequent patches.

[0] https://reviews.llvm.org/D20116#709352

Okay, unfortunately, this is only useful to me if we allow it on function declarations, and I don't see how kicking this can down the road helps in this regard. If he adds this for intrinsics and I immediately turn around and propose a patch to remove the restriction, that's a waste of everyone's time. I thought that we had agreed that allowing it on function declarations was okay so long as we documented the fact that this introduces potential UB just by declaring such a function, so let's do that.

Okay, unfortunately, this is only useful to me if we allow it on function declarations

I had somehow missed this bit ^ and I was under the impression that the main motivation for a general attribute was more completeness than anything else.

I thought that we had agreed that allowing it on function declarations was okay so long as we documented the fact that this introduces potential UB just by declaring such a function, so let's do that.

I had not phrased my concession clearly. :)

Just to be clear, I don't think they're okay, but I can live with them in the spirit of being pragmatic.

So yes, if this attribute will be useless to you without the generalization to non-intrinsics, then I won't object to checking in the previous version of this patch.

Okay, unfortunately, this is only useful to me if we allow it on function declarations

I had somehow missed this bit ^ and I was under the impression that the main motivation for a general attribute was more completeness than anything else.

No problem.

I thought that we had agreed that allowing it on function declarations was okay so long as we documented the fact that this introduces potential UB just by declaring such a function, so let's do that.

I had not phrased my concession clearly. :)

Just to be clear, I don't think they're okay, but I can live with them in the spirit of being pragmatic.

I understand. In a theoretical sense, I see adding them on function declarations as the same as adding them to intrinsics. Obviously there are practical differences; however, I'm not sure that in practice you're more likely to introduce a call to an arbitrary function, just because it happens to have been declared as speculatable, than you are to an intrinsic, just because it happens to be similarly available. I can't imagine any general transformation doing anything for each declared function just on the basis of it being declared. You'd need to be operating in a very restricted environment for that to make sense, and in such an environment, you should reasonably have the power not to mark functions as speculatable in a problematic way.

So yes, if this attribute will be useless to you without the generalization to non-intrinsics, then I won't object to checking in the previous version of this patch.

Thanks!

arsenm closed this revision.Apr 28 2017, 1:38 PM

r301680