This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
docs/
-
LangRef.rst
-
include/llvm/
-
llvm/
-
Bitcode/
-
LLVMBitCodes.h
-
IR/
-
Attributes.td
-
GlobalValue.h
-
lib/
-
Bitcode/
-
Reader/
-
BitcodeReader.cpp
-
Writer/
-
BitcodeWriter.cpp
-
IR/
3/11
Globals.cpp
-
Transforms/
-
IPO/
-
FunctionAttrs.cpp
-
Utils/
-
CodeExtractor.cpp
-
test/Bitcode/
-
Bitcode/
-
attributes.ll
-
unittests/IR/
-
IR/
-
FunctionTest.cpp
-
utils/
-
emacs/
-
llvm-mode.el
-
kate/
-
llvm.xml
-
vim/syntax/
-
syntax/
-
llvm.vim
-
vscode/llvm/syntaxes/
-
llvm/
-
syntaxes/
-
ll.tmLanguage.yaml

Differential D101011

[Attr] Add "noipa" function attribute
Needs ReviewPublic

Authored by dblaikie on Apr 21 2021, 7:25 PM.

Download Raw Diff

Details

Reviewers

rnk
jdoerfert
mehdi_amini
probinson
lebedev.ri

Summary

This is mostly inspired by the patch that added willreturn as a
reference point ( 3b77583e95236761d8741fd6df375975a8ca5d83 ).

Inspired by this thread:
https://lists.llvm.org/pipermail/llvm-dev/2021-April/149960.html

Solves PR41474

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

dblaikie created this revision.Apr 21 2021, 7:25 PM

Herald added subscribers: dexonsmith, jdoerfert, steven_wu, hiraditya. · View Herald TranscriptApr 21 2021, 7:25 PM

dblaikie requested review of this revision.Apr 21 2021, 7:25 PM

Herald added a project: Restricted Project. · View Herald TranscriptApr 21 2021, 7:25 PM

Herald added a subscriber: llvm-commits. · View Herald Transcript

Harbormaster completed remote builds in B100134: Diff 339445.Apr 21 2021, 8:44 PM

dblaikie added reviewers: rnk, jdoerfert, lebedev.ri, mehdi_amini, probinson.Apr 22 2021, 11:48 AM

GCC docs: This attribute implies noinline, noclone and no_icf attributes. So for example:

diff --git a/clang/lib/CodeGen/CodeGenModule.cpp b/clang/lib/CodeGen/CodeGenModule.cpp
index 6b966e7ca133..a13b1755cedf 100644
--- a/clang/lib/CodeGen/CodeGenModule.cpp
+++ b/clang/lib/CodeGen/CodeGenModule.cpp
@@ -1757,6 +1757,10 @@ void CodeGenModule::SetLLVMFunctionAttributesForDefinition(const Decl *D,
     // Naked implies noinline: we should not be inlining such functions.
     B.addAttribute(llvm::Attribute::Naked);
     B.addAttribute(llvm::Attribute::NoInline);
+  } else if (D->hasAttr<NoIPAAttr>()) {
+    // NoIPA implies noinline: we should not be inlining such functions.
+    B.addAttribute(llvm::Attribute::NoIPA);
+    B.addAttribute(llvm::Attribute::NoInline);
   } else if (D->hasAttr<NoDuplicateAttr>()) {
     B.addAttribute(llvm::Attribute::NoDuplicate);
   } else if (D->hasAttr<NoInlineAttr>() && !F->hasFnAttribute(llvm::Attribute::AlwaysInline)) {

(just PoC, not tested)

In D101011#2709748, @xbolva00 wrote:

GCC docs: This attribute implies noinline, noclone and no_icf attributes. So for example:

diff --git a/clang/lib/CodeGen/CodeGenModule.cpp b/clang/lib/CodeGen/CodeGenModule.cpp
index 6b966e7ca133..a13b1755cedf 100644
--- a/clang/lib/CodeGen/CodeGenModule.cpp
+++ b/clang/lib/CodeGen/CodeGenModule.cpp
@@ -1757,6 +1757,10 @@ void CodeGenModule::SetLLVMFunctionAttributesForDefinition(const Decl *D,
     // Naked implies noinline: we should not be inlining such functions.
     B.addAttribute(llvm::Attribute::Naked);
     B.addAttribute(llvm::Attribute::NoInline);
+  } else if (D->hasAttr<NoIPAAttr>()) {
+    // NoIPA implies noinline: we should not be inlining such functions.
+    B.addAttribute(llvm::Attribute::NoIPA);
+    B.addAttribute(llvm::Attribute::NoInline);
   } else if (D->hasAttr<NoDuplicateAttr>()) {
     B.addAttribute(llvm::Attribute::NoDuplicate);
   } else if (D->hasAttr<NoInlineAttr>() && !F->hasFnAttribute(llvm::Attribute::AlwaysInline)) {

(just PoC, not tested)

I think there's a reasonable argument to be made for keeping the attributes orthogonal - to implement the GCC compatible support in Clang we can always add both attributes in Clang's IRGen.

Check leaf attribute: https://reviews.llvm.org/D90275

I think you miss similar changes in SemaDeclAttr.cpp and CGCall.cpp and some testcases with

__attribute__((noipa))

In D101011#2709755, @dblaikie wrote:

In D101011#2709748, @xbolva00 wrote:

GCC docs: This attribute implies noinline, noclone and no_icf attributes. So for example:

diff --git a/clang/lib/CodeGen/CodeGenModule.cpp b/clang/lib/CodeGen/CodeGenModule.cpp
index 6b966e7ca133..a13b1755cedf 100644
--- a/clang/lib/CodeGen/CodeGenModule.cpp
+++ b/clang/lib/CodeGen/CodeGenModule.cpp
@@ -1757,6 +1757,10 @@ void CodeGenModule::SetLLVMFunctionAttributesForDefinition(const Decl *D,
     // Naked implies noinline: we should not be inlining such functions.
     B.addAttribute(llvm::Attribute::Naked);
     B.addAttribute(llvm::Attribute::NoInline);
+  } else if (D->hasAttr<NoIPAAttr>()) {
+    // NoIPA implies noinline: we should not be inlining such functions.
+    B.addAttribute(llvm::Attribute::NoIPA);
+    B.addAttribute(llvm::Attribute::NoInline);
   } else if (D->hasAttr<NoDuplicateAttr>()) {
     B.addAttribute(llvm::Attribute::NoDuplicate);
   } else if (D->hasAttr<NoInlineAttr>() && !F->hasFnAttribute(llvm::Attribute::AlwaysInline)) {

(just PoC, not tested)

I think there's a reasonable argument to be made for keeping the attributes orthogonal - to implement the GCC compatible support in Clang we can always add both attributes in Clang's IRGen.

This also avoids the awkwardness of the optnone-requires-noinline situation (where adding optnone means validation failures until you add noinline too) - or if we made it implied like your patch does - then things get weird on roundtrip (the attribute gets added when parsing the IR? so the output IR is different from the input IR).

Hmm, I guess the naked-implies-noinline code above is a pretty good existence proof if we went that route, though. So probably not the worst design choice. Oh, hmm - if we only add the "implies" when parsing - what happens if someone makes an IR module in-memory via the C++ API? Looks like Clang has to intentionally add both attributes... not especially ergonomic.

(though that does mean if we later wanted to separate these ideas it would be difficult - because there could be code only adding Naked without noinline and now we'd be changing the behavior of that. (at least with the optnone-requires-noinline if we do remove that constraint existing users won't be adversely effected because they have to add both... well, in theory - I guess that's probably only enforced on reading, so if someone makes only in-memory IR they wouldn't see the constraint and they'd have problems)

ugh. Yeah, more reasons not to tie attributes together like this, I suspect.

In D101011#2709757, @xbolva00 wrote:
Check leaf attribute: https://reviews.llvm.org/D90275

I think you miss similar changes in SemaDeclAttr.cpp and CGCall.cpp and some testcases with
__attribute__((noipa))

I'm not planning on adding the C noipa attribute to Clang (or at least not planning on doing it in this patch) - generally LLVM and Clang changes should be separated when possible, as they can in this case - the implementation and testing of the LLVM IR attribute can be done without changes to Clang, and should be done that way. Then Clang functionality can be built on top of that work in independent patches.

so the output IR is different from the input IR

No I dont mean input IR. I mean

int foo(int x) __attribute__((noipa)) {
    return x * 2;
}

int a(int x) {
    return foo(x);
}

Clang would codegen:

define dso_local i32 @foo(i32 %0) #0 {
  %2 = shl nsw i32 %0, 1
  ret i32 %2
}

attributes #0 = { noipa noinline }

WDYT?

In D101011#2709797, @dblaikie wrote:
In D101011#2709757, @xbolva00 wrote:
Check leaf attribute: https://reviews.llvm.org/D90275

I think you miss similar changes in SemaDeclAttr.cpp and CGCall.cpp and some testcases with
__attribute__((noipa))
I'm not planning on adding the C noipa attribute to Clang (or at least not planning on doing it in this patch) - generally LLVM and Clang changes should be separated when possible, as they can in this case - the implementation and testing of the LLVM IR attribute can be done without changes to Clang, and should be done that way. Then Clang functionality can be built on top of that work in independent patches.

Oh, okay. I can take it then.

In D101011#2709808, @xbolva00 wrote:
so the output IR is different from the input IR

No I dont mean input IR. I mean
int foo(int x) __attribute__((noipa)) {
    return x * 2;
}

int a(int x) {
    return foo(x);
}
Clang would codegen:
define dso_local i32 @foo(i32 %0) #0 {
  %2 = shl nsw i32 %0, 1
  ret i32 %2
}

attributes #0 = { noipa noinline }
WDYT?

Oh, sure - that'll happen in Clang (& I agree it should be done - both to match the GCC behavior, and because it seems like good behavior/likely what the user expects regardless), not in this LLVM patch,.

Great! (Sorry for small confusion)

The patch itself looks fine

jdoerfert added inline comments.Apr 22 2021, 1:14 PM

llvm/lib/IR/Globals.cpp
440	The only thing I'm not 100% convinced by is this part. I can see the appeal, but I can also imagine problems in the future. Not everything looking at the linkage might care about `noipa`. I'd rather introduce a helper that checks `noipa` and linkage, and maybe also the alwaysinline attribute. Basically, /// Determine whether the function \p F is IPO amendable /// /// If a function is exactly defined or it has alwaysinline attribute /// and is viable to be inlined, we say it is IPO amendable bool isFunctionIPOAmendable(const Function &F) { ┊ return !F->hasNoIPA() && (F.hasExactDefinition() \|\| F.isAlwaysInlined()); } whereas the "always inlined" function does not exist yet.

dblaikie added inline comments.Apr 22 2021, 1:25 PM

llvm/lib/IR/Globals.cpp
440	Yep - I mean this is the guts of it, so the bit that's interesting/debatable from a design perspective (but glad to hear the rest of it/the mechanical stuff is about right) I worry about implementing this as a separate property and having to update every pass/place that queries this sort of thing. That'll mean likely changing every call to hasExactDefinition (admittedly there are fewer calls than I was expecting, which also sort of worries me a bit - about 20, most of them in FunctionAttrs) and adding new test coverage for every use? & all the future uses that might find hasExactDefinition and think that's enough. Perhaps we could rename hasExactDefinition to something that would capture this new use case and reduce the chance it'd be used for something unsuitable later?

jdoerfert added inline comments.Apr 22 2021, 1:42 PM

llvm/lib/IR/Globals.cpp
440	I think "isFunctionIPOAmendable", or similar, makes it much clearer to the observer what is happening. We can also put more things in there, the always_inline logic, we can check for `naked`, etc. Since any new name/function requires to go to all callers, I don't think there is much to gain/loose (assuming we don't overload hasExactDefinition).

dblaikie added inline comments.Apr 22 2021, 2:26 PM

llvm/lib/IR/Globals.cpp
440	You mean not much to gain/lose regarding whether the function is renamed V a new function is added? Yeah, it's a bit of a stretch on my side, but I feel like renaming and adding some functionality to the function probably doesn't merit adding testing to al callers - but adding a new function and porting the passes over to it, I'd feel compelled to test each pass. How do you feel about either of those perspectives, or some other perspective on the change and testing of it?

jdoerfert added a subscriber: arsenm.Apr 22 2021, 2:37 PM

jdoerfert added inline comments.

llvm/lib/IR/Globals.cpp
440	yes, renaming vs new function seems to be little different. I favor a new function as there might be unknown uses downstream, but I'm not opposing renaming the existing one so we can make it do something else/more. I would create a new helper, select the passes we know should deal with `noipa` explicitly, move them over to the new helper, and add tests. It is unclear how we could get away with less tests if we do something else, I mean without just not testing if a pass honors `noipa` now. Function-attrs (and I can port them to the Attributor) is the top consumer. From reading through the uses of "exact definition" I'd say IPSCCP (via canTrackReturnsInterprocedurally) is another one. @arsenm might want to look at `AMDGPUAnnotateKernelFeatures.cpp`. PruneEH and DeadArgElim need updates and tests, though that should be easy. I have no idea what `ObjCARCAPElim.cpp` does, we probably should ping someone. That should cover everything in-tree, I think. That said, I'm sure we have some volunteers to do some of the porting testing so it's not all on you. (Looking at @xbolva00 ;) )

dblaikie added inline comments.Apr 22 2021, 2:48 PM

llvm/lib/IR/Globals.cpp
440	Yeah, I worry about keeping the existing one in place due to the risk of it being misused out of momentum/existing familiarity, rendering noipa less accurate. Admittedly partly inspired by the GCC implementation ( https://gcc.gnu.org/legacy-ml/gcc/2016-12/msg00064.html ) that implemented noipa by way of marking such functions as interposable - which I tend to agree with. The idea that we have an existing property that this maps to (though there's reasonable disagreement about how accurately) - now we're adding a new way that that property can be expressed. That doesn't necessitate testing all functionality that depends on the property - only testing that the new expression of the property is working correctly.

jdoerfert added inline comments.Apr 22 2021, 5:18 PM

llvm/lib/IR/Globals.cpp
440	Yeah, I worry about keeping the existing one in place due to the risk of it being misused out of momentum/existing familiarity, rendering noipa less accurate Fair. Maybe we rename it and introduce a new helper ;) Renaming so downstream users notice, new helper to encapsulate all the "can we do IPA across this call edge" logic without polutting the "is this derefinable" code path. I generally would value clear naming/design over the desire to "auto-port" existing users to what might be the right thing. While the latter has short term advantages it often is long term painful, and at some point someone will come along and split the linkage logic and the rest apart, or introduce an argument with default value, rendering all these thoughts mood. [Site note: I imagine we will actually have to find a way to split it to update our internalization optimization in which we want to work around derefinable linkage through a TU internal copy but not around `noipa`.] That said, if people really think hiding `noipa` behind this function is the way to go, I'm not going to block that.

dblaikie added inline comments.Apr 22 2021, 7:08 PM

llvm/lib/IR/Globals.cpp
440	Actually, looking at this further - maybe it makes sense to actually move it further /down/ rather than up. Closer to how GCC modeled this: one layer down, at `GlobalValue::isInterposable` I found a test for the `getParent() && getParent()->getSemanticInterposition()`, the latter uses module metadata to flag the module as semantically interposable. The documentation for "isInterposable" seems more accurate to the semantics I want to implement here: /// Whether the definition of this global may be replaced by something /// non-equivalent at link time. For example, if a function has weak linkage /// then the code defining it may be replaced by different code. (where's `isExactDefinition` says "Inlining is okay across non-exact linkage types as long as they're not interposable" and `mayBeDerefined` says "Returns true if the definition of this global may be replaced by a differently optimized variant of the same source level function at link time." - while the latter is less precise, it still has that worrying "of the same source level function" - which could suggest that a function that may be derefined could still be inlined) In fact, adding this noipa attribute could supersede the SemanticInterposition module metadata entirely - frontends could put noipa on every function (as they do with optnone at -O0 today). Should we call it something different than noipa, then? Should it be called semantic_interposition?

jdoerfert added inline comments.Apr 22 2021, 10:01 PM

llvm/lib/IR/Globals.cpp
440	On first thought I can see us adding an attribute to replace that metadata, it depends on how it is supposed to work in an LTO setting at the end of the day. That said, `noipa` should exist on its own. I also still think we should not intertwine two things that are similar but different: "do not perform ipa/ipo" and "the semantics do not allow you to do ipa/ipo, among other things". Long story short, I'll continue to be in favor of a new `canIdoIPO(CallBase/Function)` helper which we employ in our passes, though others should chime in if you prefer a solution in which `noipa` is checked in existing lookup functions.

@MaskRay @serge-sans-paille - you folks have any thoughts on this (see also the specific discussion thread in this review with @JDevlieghere). It looks like this attribute could allow per-function support for "-fsemantic-interposition" that would potentially replace the existing module metadata support for Semantic Interposition, perhaps? Is that feasible, would this be the right behavior? the right design/direction?

(also, I'm considering renaming this to "nointeropt" and changing "optnone" to "nointraopt" for symmetry/clarity (& then implementing clang optnone as "nointeropt+nointraopt"), in case that helps make the names more general/useful for different use cases)

In D101011#2713514, @dblaikie wrote:

@MaskRay @serge-sans-paille - you folks have any thoughts on this (see also the specific discussion thread in this review with @JDevlieghere). It looks like this attribute could allow per-function support for "-fsemantic-interposition" that would potentially replace the existing module metadata support for Semantic Interposition, perhaps? Is that feasible, would this be the right behavior? the right design/direction?

(also, I'm considering renaming this to "nointeropt" and changing "optnone" to "nointraopt" for symmetry/clarity (& then implementing clang optnone as "nointeropt+nointraopt"), in case that helps make the names more general/useful for different use cases)

(oh, asking because I came across the SemanticInterposition work done in D72829 while looking at where to implement this)

Wouldn't mind some thoughts from the other folks on the original thread, @rnk @mehdi_amini

I'm considering renaming this to "nointeropt" and changing "optnone" to "nointraopt" for symmetry/clarity (& then implementing clang optnone as "nointeropt+nointraopt"), in case that helps make the names more general/useful for different use cases)

I dont think this is a good step. This would be a very invasive major change to rename optnone. You can always improve the documentation for the attributes to make it more clear..

In D101011#2713621, @xbolva00 wrote:

I'm considering renaming this to "nointeropt" and changing "optnone" to "nointraopt" for symmetry/clarity (& then implementing clang optnone as "nointeropt+nointraopt"), in case that helps make the names more general/useful for different use cases)

I dont think this is a good step. This would be a very invasive major change to rename optnone. You can always improve the documentation for the attributes to make it more clear..

Eh, mechanically it'd be a big patch, but not a lot of work I'd expect. But yeah - we'll figure that out in another patch if/when it comes to that.

In D101011#2713514, @dblaikie wrote:

@MaskRay @serge-sans-paille - you folks have any thoughts on this (see also the specific discussion thread in this review with @JDevlieghere). It looks like this attribute could allow per-function support for "-fsemantic-interposition" that would potentially replace the existing module metadata support for Semantic Interposition, perhaps? Is that feasible, would this be the right behavior? the right design/direction?

(also, I'm considering renaming this to "nointeropt" and changing "optnone" to "nointraopt" for symmetry/clarity (& then implementing clang optnone as "nointeropt+nointraopt"), in case that helps make the names more general/useful for different use cases)

I think the module flag metadata "SemanticInterposition" is more of a workaround for the existing (30+) uses for GlobalValue::isInterposable.
Many probably should respect the proposed LLVM IR noipa (subject to rename) by using a helper function (similar to isDefinitionExact).
If we simply make GlobalValue::isInterposable return false if the global value doesn't have dso_local, we may regress some optimization passes.
Ideally the frontend has set dso_local/dso_preemptable correctly so GlobalValue::isInterposable shouldn't need to check "SemanticInterposition".

Instrumentation passes can create functions on the fly. They are usually internal. If not (I don't know such a case), a module flag metadata serves as the purpose for setting the default dso_local/dso_preemptable.
I don't think such synthesized functions care about dso_local optimizations so this argument retaining "SemanticInterposition" is very weak.
If we simply make GlobalValue::isInterposable return false if the global value doesn't have dso_local, we may regress some optimization passes.
Ideally the frontend has set dso_local/dso_preemptable correctly so GlobalValue::isInterposable shouldn't need to check "SemanticInterposition".

llvm/lib/IR/Globals.cpp
417	Sent D101264 to refactor this function a bit.

MaskRay added inline comments.Apr 25 2021, 3:14 PM

llvm/lib/IR/Globals.cpp
440	+1 for adding a helper named `isFunctionIPOAmendable` or similar. `"SemanticInterposition"` is more of a workaround to not regress existing `GlobalValue::isInterposable` call sites which might not be appropriate. If the call sites are fixed/confirmed ok, `noipa` can supersede `"SemanticInterposition"`.

This attribute will be useful:) Thanks for working on it.

I agree with @jdoerfert (https://lists.llvm.org/pipermail/llvm-dev/2021-April/150062.html) that the proposed attribute, noinline, and optnone should be orthogonal: no one implies the other(s).
If my reading from the long mailing list thread is correct, it is still undecided how the clang attribute should behave.
(Personally I'd hope that the GCC attribute noipa ("This attribute is supported mainly for the purpose of testing the compiler.") were different from the GCC attribute noinline.)

If the clang attribute would end up combining two attributes, we should better make the IR/clang attribute names different.
nointeropt may be a better name anyway because analysis in ipa does not necessarily mean optimization? :)
If our clang attribute is likely to have different semantics (I'd favor orthogonal semantics), noipa would be inappropriate as it will conflict with the GCC semantics.
Naming it sometime different will be nice.
For the Linux kernel https://github.com/ClangBuiltLinux/linux/issues/1302 , the kernel can use the clang attribute based on __clang__

xbolva00 edited the summary of this revision. (Show Details)May 2 2021, 9:16 AM

ychen added a subscriber: ychen.May 15 2021, 11:37 PM

Re-ping

Herald added a project: Restricted Project. · View Herald TranscriptMar 15 2022, 12:51 AM

Herald added a subscriber: ormris. · View Herald Transcript

In D101011#2715555, @MaskRay wrote:

This attribute will be useful:) Thanks for working on it.

I agree with @jdoerfert (https://lists.llvm.org/pipermail/llvm-dev/2021-April/150062.html) that the proposed attribute, noinline, and optnone should be orthogonal: no one implies the other(s).

At the IR level, I think that's fine/fair.

If my reading from the long mailing list thread is correct, it is still undecided how the clang attribute should behave.

I think Clang optnone -> IR optnone + noipa.

(Personally I'd hope that the GCC attribute noipa ("This attribute is supported mainly for the purpose of testing the compiler.") were different from the GCC attribute noinline.)

Sure - I forget the context, but I don't think there's any reason/need to have noipa and noinline alias each other, they are distinct features. (though, unfortunately, sometimes IPA produces the equivalent of inlining ("hey, this function always returns 5, we can just use 5 at the call site - oh, also the function has no side effects, so we can remove the now-unused call... and we've essentially inlined, even though we didn't use the inliner") - not sure if there are some IPA features we could intentionally break that'd stop LLVM doing "the equivalent of inlining, without using the inliner")

noipa will effectively imply the equivalent of noinline - because it'd be impossible to inline in the absence of a definition. But the inverse isn't true. I don't think we necessarily have to lower a clang-level noipa to an IR noipa+noinline, I think just using noipa would be sufficient to get the non-inlining semantics (again: how could you inline if you can't see the definition of a function).

If the clang attribute would end up combining two attributes, we should better make the IR/clang attribute names different.

Perhaps. Though probably the thing to rename would be optnone, rather than noipa. I think noipa is the right name. I think optnone as a name should/does include noipa. (I think this whole rabbit hole we got down to here is unfortunate and previous understanding of optnone did include noipa-like behavior (I think I've referenced previous patches that were consistent with that understanding & the original author (& myself as one of the reviewers of the feature) have stated that that was the intent of optnone, to also be noipa-like) - but I can appreciate that having distinct attributes can provide greater flexibility)

nointeropt may be a better name anyway because analysis in ipa does not necessarily mean optimization? :)

Eh, I don't think I'd want to go down that path - I think it's important to model this as noipa, as "imagine this function definition were not available at all" (as though it were defined in another translation unit entirely) - no analysis, no optimization, nothing - you can't see the definition at all. It's an easy to explain, robust concept.

If our clang attribute is likely to have different semantics (I'd favor orthogonal semantics), noipa would be inappropriate as it will conflict with the GCC semantics.

I don't think there's any need for our noipa to differ from GCC's semantics - their documentation that says noipa implies noinline can be seen as a description of behavior, not a constraint on how that behavior is achieved. noipa should logically disable inlining as a consequnce of disallowing interprodecural analysis/information.

In D101011#3381751, @xbolva00 wrote:

Re-ping

Sorry I haven't picked this up again. I lost steam when folks suggested that to implement this we could/should revisit/rewrite/retest every call site that checks for interposability... that just doesn't seem productive to me/hard to justify the time to do all that work and doing it partially I think would be really harmful (leaving interposability unaligned/inconsistent with noipa in various subtle ways).

Happy to talk about how we could move this forward perhaps with more hands to do all that work, but I still just don't feel great about that as a direction, really.

I don't think there's any need for our noipa to differ from GCC's semantics - their documentation that says noipa implies noinline can be seen as a description of behavior, not a constraint on how that behavior is achieved. noipa should logically disable inlining as a consequnce of disallowing interprodecural analysis/information.

GCC’s attribute(noipa) (frontend) -> noipa, noinline, noclone, no_icf (midend).

https://github.com/gcc-mirror/gcc/commit/036ea39917b0ef6f07a7c3c3c06002c73fd238f5

Any good reason why not do it same way as well? Avoid hidden logical assumption that noipa is basically noinline + something.

And then llvm optimizers can check - if (attrs.contains (“noipa”) do not propagate constants, etc…

I lost steam when folks suggested that to implement this we could/should revisit/rewrite/retest every call site that checks for interposability... that just doesn't seem productive to me/hard to justify the time to do all that work and doing it partially I think would be really harmful (leaving interposability unaligned/inconsistent with noipa in various subtle ways).

:/ not productive approach to work on one perfect patch.

This work could be split easily. 1) Basic llvm support, 2) clang attribute, 3) complete llvm support.

I agree, this is useful functionality, we should add it.

I would be OK with an all-in-one non-orthogonal noipa attribute. The only use case I can come up with for orthogonal attributes is for constructing fine-grained compiler test cases, or trying to carefully convince the optimizer to do one transform or another. The original use case seems more important.

I would also like to suggest another use case for the original attribute, which is that this feature supports hotpatching. If you block IPO across a particular function boundary, you can more reliably recompile and hotpatch in new code, without going to great lengths to break up modules into smaller object files and relinking that way.

In D101011#3383003, @xbolva00 wrote:

I don't think there's any need for our noipa to differ from GCC's semantics - their documentation that says noipa implies noinline can be seen as a description of behavior, not a constraint on how that behavior is achieved. noipa should logically disable inlining as a consequnce of disallowing interprodecural analysis/information.

GCC’s attribute(noipa) (frontend) -> noipa, noinline, noclone, no_icf (midend).

That has some appeal, but what about future optimizations? This supports testability (you can block each transform individually), but it requires frontends to keep track of these unpacked attributes that block all interprocedural optimization. And there's a bitcode auto-upgrade problem: if the frontend intended to block all IPO, how to you upgrade that intention? If you ignore that, you can solve the frontend problem with a utility like AttrBuilder::addNoIpaAttrs() which adds all the relevant attributes (noinline, noicf, whatever).

In D101011#3382926, @dblaikie wrote:

Sorry I haven't picked this up again. I lost steam when folks suggested that to implement this we could/should revisit/rewrite/retest every call site that checks for interposability... that just doesn't seem productive to me/hard to justify the time to do all that work and doing it partially I think would be really harmful (leaving interposability unaligned/inconsistent with noipa in various subtle ways).

Happy to talk about how we could move this forward perhaps with more hands to do all that work, but I still just don't feel great about that as a direction, really.

I'm sympathetic. :) Are other reviewers mostly concerned with the naming of "derefined" here? I think I agree that the fewer states and special cases we allow, the better, so aligning interposability and noipa is appealing to me.

if llvm attribute noipa also adds logical assumption about noinline, we should not then audit all check of noinline whether they should be extended for noipa check, or?

If we clang’s noipa translates to noipa + noinline + …, we dont have this issue.

In D101011#3383003, @xbolva00 wrote:

I don't think there's any need for our noipa to differ from GCC's semantics - their documentation that says noipa implies noinline can be seen as a description of behavior, not a constraint on how that behavior is achieved. noipa should logically disable inlining as a consequnce of disallowing interprodecural analysis/information.

GCC’s attribute(noipa) (frontend) -> noipa, noinline, noclone, no_icf (midend).

https://github.com/gcc-mirror/gcc/commit/036ea39917b0ef6f07a7c3c3c06002c73fd238f5

Any good reason why not do it same way as well? Avoid hidden logical assumption that noipa is basically noinline + something.

Because I don't think it should be necessary - noipa semantics should be strictly more powerful than noinline, so there shouldn't be a need to put noinline on the function.

If noinline ever makes a difference on a noipa function, I think that's a pretty serious bug.

In D101011#3383020, @xbolva00 wrote:

I lost steam when folks suggested that to implement this we could/should revisit/rewrite/retest every call site that checks for interposability... that just doesn't seem productive to me/hard to justify the time to do all that work and doing it partially I think would be really harmful (leaving interposability unaligned/inconsistent with noipa in various subtle ways).

:/ not productive approach to work on one perfect patch.

I don't think the approach of revisiting/retesting all these call sites is the right way to go about things. So I stopped working on this because I couldn't think of/find a way to approach this that addressed both my perspectives and those of reviewers here.

This work could be split easily. 1) Basic llvm support, 2) clang attribute, 3) complete llvm support.

The intermediate state created by pieces of (3) I have two concerns with:

I don't think it's a productive use of time to retest and rewrite every check for interposability
It introduces (certainly initially) divergence between these two properties which I don't think should ever diverge

In D101011#3383088, @xbolva00 wrote:

if llvm attribute noipa also adds logical assumption about noinline, we should not then audit all check of noinline whether they should be extended for noipa check, or?

No, I don't believe we do - if interposability is already well tested, that was the point of leveraging that definition/functionality - noipa should be very similar/the same as interposability. But if we have to revisit every check for interposability and rewrite the check - then, yes, we'd end up adding more test coverage and as a consequence we would end up adding testing for cases where noipa subsumes noinline, probably. (maybe the interposability testing is somewhere else/not in noinline directly - in which case we wouldn't add inlining testing, we'd test that other piece that happens to feed into inlining)

In D101011#3383074, @rnk wrote:

I agree, this is useful functionality, we should add it.

I would be OK with an all-in-one non-orthogonal noipa attribute. The only use case I can come up with for orthogonal attributes is for constructing fine-grained compiler test cases, or trying to carefully convince the optimizer to do one transform or another. The original use case seems more important.

I would also like to suggest another use case for the original attribute, which is that this feature supports hotpatching. If you block IPO across a particular function boundary, you can more reliably recompile and hotpatch in new code, without going to great lengths to break up modules into smaller object files and relinking that way.

Ah, indeed.

In D101011#3383003, @xbolva00 wrote:

I don't think there's any need for our noipa to differ from GCC's semantics - their documentation that says noipa implies noinline can be seen as a description of behavior, not a constraint on how that behavior is achieved. noipa should logically disable inlining as a consequnce of disallowing interprodecural analysis/information.

GCC’s attribute(noipa) (frontend) -> noipa, noinline, noclone, no_icf (midend).

That has some appeal, but what about future optimizations? This supports testability (you can block each transform individually), but it requires frontends to keep track of these unpacked attributes that block all interprocedural optimization. And there's a bitcode auto-upgrade problem: if the frontend intended to block all IPO, how to you upgrade that intention? If you ignore that, you can solve the frontend problem with a utility like AttrBuilder::addNoIpaAttrs() which adds all the relevant attributes (noinline, noicf, whatever).

I'm just not sure it makes sense to decompose these - like I don't know what noipa without noinline or no_icf would mean - if we're saying it's invalid IR /not/ to combine them, then I'm still confused by how we could/would implement noipa that wouldn't implicitly subsume toinline/no_icf functionality anyway. (if there was an implementation of noipa that didn't implicitly subsume the functionality of noinline/no_icf, then I'd consider that feature brittle/buggy in a way I don't think it could/should be - if noipa is tested at the same level as interposable, it should flow through inlining and icf naturally without the need for specific attributes to disable them)

dblaikie mentioned this in D100353: Support optnone in SCCP.May 25 2022, 2:50 PM

This review may be stuck/dead, consider abandoning if no longer relevant.
Removing myself as reviewer in attempt to clean dashboard.

Herald added a subscriber: StephenFan. · View Herald TranscriptJan 12 2023, 5:30 PM

sync/update based on recent discussions https://discourse.llvm.org/t/force-optimizations-even-when-optnone-is-present/74216/19

Rename mayBeDerefined to mayBeDerefinedOrNoIPA

Harbormaster completed remote builds in B257916: Diff 557848.Oct 23 2023, 2:34 PM

Add docs

Harbormaster completed remote builds in B257937: Diff 557886.Oct 25 2023, 6:06 PM

Revision Contents

Path

Size

llvm/

docs/

LangRef.rst

5 lines

include/

llvm/

Bitcode/

LLVMBitCodes.h

1 line

IR/

Attributes.td

3 lines

GlobalValue.h

10 lines

lib/

Bitcode/

Reader/

BitcodeReader.cpp

2 lines

Writer/

BitcodeWriter.cpp

2 lines

IR/

Globals.cpp

7 lines

Transforms/

IPO/

FunctionAttrs.cpp

10 lines

Utils/

CodeExtractor.cpp

1 line

test/

Bitcode/

attributes.ll

7 lines

unittests/

IR/

FunctionTest.cpp

12 lines

utils/

emacs/

llvm-mode.el

2 lines

kate/

llvm.xml

1 line

vim/

syntax/

llvm.vim

1 line

vscode/

llvm/

syntaxes/

ll.tmLanguage.yaml

1 line

Diff 557886

llvm/docs/LangRef.rst

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 1,937 Lines • ▼ Show 20 Lines	``noimplicitfloat``
Also inhibits optimizations that create SIMD/vector code and registers from		Also inhibits optimizations that create SIMD/vector code and registers from
scalar code such as vectorization or memcpy/memset optimization. This		scalar code such as vectorization or memcpy/memset optimization. This
includes integer vectors. Vector instructions present in IR may still cause		includes integer vectors. Vector instructions present in IR may still cause
vector code to be generated.		vector code to be generated.
``noinline``		``noinline``
This attribute indicates that the inliner should never inline this		This attribute indicates that the inliner should never inline this
function in any situation. This attribute may not be used together		function in any situation. This attribute may not be used together
with the ``alwaysinline`` attribute.		with the ``alwaysinline`` attribute.
		``noipa``
		Disables any interprocedural analysis that inspects the definition of this
		function. Equivalent to moving this function definition to a separate,
		optimizer-opaque, module. Any attributes on the function are still respected
		(as they would be if they remained on a function declaration in this module).
``nomerge``		``nomerge``
This attribute indicates that calls to this function should never be merged		This attribute indicates that calls to this function should never be merged
during optimization. For example, it will prevent tail merging otherwise		during optimization. For example, it will prevent tail merging otherwise
identical code sequences that raise an exception or terminate the program.		identical code sequences that raise an exception or terminate the program.
Tail merging normally reduces the precision of source location information,		Tail merging normally reduces the precision of source location information,
making stack traces less useful for debugging. This attribute gives the		making stack traces less useful for debugging. This attribute gives the
user control over the tradeoff between code size and debug information		user control over the tradeoff between code size and debug information
precision.		precision.
▲ Show 20 Lines • Show All 25,746 Lines • Show Last 20 Lines

llvm/include/llvm/Bitcode/LLVMBitCodes.h

Show First 20 Lines • Show All 708 Lines • ▼ Show 20 Lines	enum AttributeKindCodes {
ATTR_KIND_ALLOCATED_POINTER = 81,		ATTR_KIND_ALLOCATED_POINTER = 81,
ATTR_KIND_ALLOC_KIND = 82,		ATTR_KIND_ALLOC_KIND = 82,
ATTR_KIND_PRESPLIT_COROUTINE = 83,		ATTR_KIND_PRESPLIT_COROUTINE = 83,
ATTR_KIND_FNRETTHUNK_EXTERN = 84,		ATTR_KIND_FNRETTHUNK_EXTERN = 84,
ATTR_KIND_SKIP_PROFILE = 85,		ATTR_KIND_SKIP_PROFILE = 85,
ATTR_KIND_MEMORY = 86,		ATTR_KIND_MEMORY = 86,
ATTR_KIND_NOFPCLASS = 87,		ATTR_KIND_NOFPCLASS = 87,
ATTR_KIND_OPTIMIZE_FOR_DEBUGGING = 88,		ATTR_KIND_OPTIMIZE_FOR_DEBUGGING = 88,
		ATTR_KIND_NO_INTERPROCEDURAL_ANALYSIS = 89,
};		};

enum ComdatSelectionKindCodes {		enum ComdatSelectionKindCodes {
COMDAT_SELECTION_KIND_ANY = 1,		COMDAT_SELECTION_KIND_ANY = 1,
COMDAT_SELECTION_KIND_EXACT_MATCH = 2,		COMDAT_SELECTION_KIND_EXACT_MATCH = 2,
COMDAT_SELECTION_KIND_LARGEST = 3,		COMDAT_SELECTION_KIND_LARGEST = 3,
COMDAT_SELECTION_KIND_NO_DUPLICATES = 4,		COMDAT_SELECTION_KIND_NO_DUPLICATES = 4,
COMDAT_SELECTION_KIND_SAME_SIZE = 5,		COMDAT_SELECTION_KIND_SAME_SIZE = 5,
Show All 14 Lines

llvm/include/llvm/IR/Attributes.td

	Show First 20 Lines • Show All 151 Lines • ▼ Show 20 Lines
	def NoFree : EnumAttr<"nofree", [FnAttr, ParamAttr]>;			def NoFree : EnumAttr<"nofree", [FnAttr, ParamAttr]>;

	/// Disable implicit floating point insts.			/// Disable implicit floating point insts.
	def NoImplicitFloat : EnumAttr<"noimplicitfloat", [FnAttr]>;			def NoImplicitFloat : EnumAttr<"noimplicitfloat", [FnAttr]>;

	/// inline=never.			/// inline=never.
	def NoInline : EnumAttr<"noinline", [FnAttr]>;			def NoInline : EnumAttr<"noinline", [FnAttr]>;

				/// Do not do interprocedural analysis or optimization including this function
				def NoIPA : EnumAttr<"noipa", [FnAttr]>;

	/// Function is called early and/or often, so lazy binding isn't worthwhile.			/// Function is called early and/or often, so lazy binding isn't worthwhile.
	def NonLazyBind : EnumAttr<"nonlazybind", [FnAttr]>;			def NonLazyBind : EnumAttr<"nonlazybind", [FnAttr]>;

	/// Disable merging for specified functions or call sites.			/// Disable merging for specified functions or call sites.
	def NoMerge : EnumAttr<"nomerge", [FnAttr]>;			def NoMerge : EnumAttr<"nomerge", [FnAttr]>;

	/// Pointer is known to be not null.			/// Pointer is known to be not null.
	def NonNull : EnumAttr<"nonnull", [ParamAttr, RetAttr]>;			def NonNull : EnumAttr<"nonnull", [ParamAttr, RetAttr]>;
	▲ Show 20 Lines • Show All 212 Lines • Show Last 20 Lines

llvm/include/llvm/IR/GlobalValue.h

Show First 20 Lines • Show All 124 Lines • ▼ Show 20 Lines	private:
friend class Constant;		friend class Constant;

void destroyConstantImpl();		void destroyConstantImpl();
Value handleOperandChangeImpl(Value From, Value *To);		Value handleOperandChangeImpl(Value From, Value *To);

/// Returns true if the definition of this global may be replaced by a		/// Returns true if the definition of this global may be replaced by a
/// differently optimized variant of the same source level function at link		/// differently optimized variant of the same source level function at link
/// time.		/// time.
bool mayBeDerefined() const {		bool mayBeDerefinedOrNoIPA() const {
switch (getLinkage()) {		switch (getLinkage()) {
case WeakODRLinkage:		case WeakODRLinkage:
case LinkOnceODRLinkage:		case LinkOnceODRLinkage:
case AvailableExternallyLinkage:		case AvailableExternallyLinkage:
return true;		return true;

case WeakAnyLinkage:		case WeakAnyLinkage:
case LinkOnceAnyLinkage:		case LinkOnceAnyLinkage:
case CommonLinkage:		case CommonLinkage:
case ExternalWeakLinkage:		case ExternalWeakLinkage:
case ExternalLinkage:		case ExternalLinkage:
case AppendingLinkage:		case AppendingLinkage:
case InternalLinkage:		case InternalLinkage:
case PrivateLinkage:		case PrivateLinkage:
// Optimizations may assume builtin semantics for functions defined as		// Optimizations may assume builtin semantics for functions defined as
// nobuiltin due to attributes at call-sites. To avoid applying IPO based		// nobuiltin due to attributes at call-sites. To avoid applying IPO based
// on nobuiltin semantics, treat such function definitions as maybe		// on nobuiltin semantics, treat such function definitions as maybe
// derefined.		// derefined.
return isInterposable() \|\| isNobuiltinFnDef();		return isInterposable() \|\| isNobuiltinFnDef() \|\| isNoipaFnDef();
}		}

llvm_unreachable("Fully covered switch above!");		llvm_unreachable("Fully covered switch above!");
}		}

/// Returns true if the global is a function definition with the nobuiltin		/// Returns true if the global is a function definition with the nobuiltin
/// attribute.		/// attribute.
bool isNobuiltinFnDef() const;		bool isNobuiltinFnDef() const;

		/// Returns true if the global is a function definition with the noipa
		/// attribute.
		bool isNoipaFnDef() const;

protected:		protected:
/// The intrinsic ID for this subclass (which must be a Function).		/// The intrinsic ID for this subclass (which must be a Function).
///		///
/// This member is defined by this class, but not used for anything.		/// This member is defined by this class, but not used for anything.
/// Subclasses can use it to store their intrinsic ID, if they have one.		/// Subclasses can use it to store their intrinsic ID, if they have one.
///		///
/// This is stored here to save space in Function on 64-bit hosts.		/// This is stored here to save space in Function on 64-bit hosts.
Intrinsic::ID IntID = (Intrinsic::ID)0U;		Intrinsic::ID IntID = (Intrinsic::ID)0U;
▲ Show 20 Lines • Show All 308 Lines • ▼ Show 20 Lines	public:
/// undefined behavior if the linker replaces the actual call destination with		/// undefined behavior if the linker replaces the actual call destination with
/// the unoptimized `foo`.		/// the unoptimized `foo`.
///		///
/// Inlining is okay across non-exact linkage types as long as they're not		/// Inlining is okay across non-exact linkage types as long as they're not
/// interposable (see \c isInterposable), since in such cases the currently		/// interposable (see \c isInterposable), since in such cases the currently
/// visible variant is a correct implementation of the original source		/// visible variant is a correct implementation of the original source
/// function; it just isn't the only correct implementation.		/// function; it just isn't the only correct implementation.
bool isDefinitionExact() const {		bool isDefinitionExact() const {
return !mayBeDerefined();		return !mayBeDerefinedOrNoIPA();
}		}

/// Return true if this global has an exact defintion.		/// Return true if this global has an exact defintion.
bool hasExactDefinition() const {		bool hasExactDefinition() const {
// While this computes exactly the same thing as		// While this computes exactly the same thing as
// isStrongDefinitionForLinker, the intended uses are different. This		// isStrongDefinitionForLinker, the intended uses are different. This
// function is intended to help decide if specific inter-procedural		// function is intended to help decide if specific inter-procedural
// transforms are correct, while isStrongDefinitionForLinker's intended use		// transforms are correct, while isStrongDefinitionForLinker's intended use
▲ Show 20 Lines • Show All 179 Lines • Show Last 20 Lines

llvm/lib/Bitcode/Reader/BitcodeReader.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 2,052 Lines • ▼ Show 20 Lines	static Attribute::AttrKind getAttrFromCode(uint64_t Code) {
case bitc::ATTR_KIND_BYREF:		case bitc::ATTR_KIND_BYREF:
return Attribute::ByRef;		return Attribute::ByRef;
case bitc::ATTR_KIND_MUSTPROGRESS:		case bitc::ATTR_KIND_MUSTPROGRESS:
return Attribute::MustProgress;		return Attribute::MustProgress;
case bitc::ATTR_KIND_HOT:		case bitc::ATTR_KIND_HOT:
return Attribute::Hot;		return Attribute::Hot;
case bitc::ATTR_KIND_PRESPLIT_COROUTINE:		case bitc::ATTR_KIND_PRESPLIT_COROUTINE:
return Attribute::PresplitCoroutine;		return Attribute::PresplitCoroutine;
		case bitc::ATTR_KIND_NO_INTERPROCEDURAL_ANALYSIS:
		return Attribute::NoIPA;
}		}
}		}

Error BitcodeReader::parseAlignmentValue(uint64_t Exponent,		Error BitcodeReader::parseAlignmentValue(uint64_t Exponent,
MaybeAlign &Alignment) {		MaybeAlign &Alignment) {
// Note: Alignment in bitcode files is incremented by 1, so that zero		// Note: Alignment in bitcode files is incremented by 1, so that zero
// can be used for default alignment.		// can be used for default alignment.
if (Exponent > Value::MaxAlignmentExponent + 1)		if (Exponent > Value::MaxAlignmentExponent + 1)
▲ Show 20 Lines • Show All 6,150 Lines • Show Last 20 Lines

llvm/lib/Bitcode/Writer/BitcodeWriter.cpp

Show First 20 Lines • Show All 817 Lines • ▼ Show 20 Lines	static uint64_t getAttrKindEncoding(Attribute::AttrKind Kind) {
case Attribute::NoUndef:		case Attribute::NoUndef:
return bitc::ATTR_KIND_NOUNDEF;		return bitc::ATTR_KIND_NOUNDEF;
case Attribute::ByRef:		case Attribute::ByRef:
return bitc::ATTR_KIND_BYREF;		return bitc::ATTR_KIND_BYREF;
case Attribute::MustProgress:		case Attribute::MustProgress:
return bitc::ATTR_KIND_MUSTPROGRESS;		return bitc::ATTR_KIND_MUSTPROGRESS;
case Attribute::PresplitCoroutine:		case Attribute::PresplitCoroutine:
return bitc::ATTR_KIND_PRESPLIT_COROUTINE;		return bitc::ATTR_KIND_PRESPLIT_COROUTINE;
		case Attribute::NoIPA:
		return bitc::ATTR_KIND_NO_INTERPROCEDURAL_ANALYSIS;
case Attribute::EndAttrKinds:		case Attribute::EndAttrKinds:
llvm_unreachable("Can not encode end-attribute kinds marker.");		llvm_unreachable("Can not encode end-attribute kinds marker.");
case Attribute::None:		case Attribute::None:
llvm_unreachable("Can not encode none-attribute.");		llvm_unreachable("Can not encode none-attribute.");
case Attribute::EmptyKey:		case Attribute::EmptyKey:
case Attribute::TombstoneKey:		case Attribute::TombstoneKey:
llvm_unreachable("Trying to encode EmptyKey/TombstoneKey");		llvm_unreachable("Trying to encode EmptyKey/TombstoneKey");
}		}
▲ Show 20 Lines • Show All 4,409 Lines • Show Last 20 Lines

llvm/lib/IR/Globals.cpp

Show First 20 Lines • Show All 264 Lines • ▼ Show 20 Lines

bool GlobalValue::isNobuiltinFnDef() const {		bool GlobalValue::isNobuiltinFnDef() const {
const Function *F = dyn_cast<Function>(this);		const Function *F = dyn_cast<Function>(this);
if (!F \|\| F->empty())		if (!F \|\| F->empty())
return false;		return false;
return F->hasFnAttribute(Attribute::NoBuiltin);		return F->hasFnAttribute(Attribute::NoBuiltin);
}		}

		bool GlobalValue::isNoipaFnDef() const {
		const Function *F = dyn_cast<Function>(this);
		if (!F \|\| F->empty())
		return false;
		return F->hasFnAttribute(Attribute::NoIPA);
		}

bool GlobalValue::isDeclaration() const {		bool GlobalValue::isDeclaration() const {
// Globals are definitions if they have an initializer.		// Globals are definitions if they have an initializer.
if (const GlobalVariable *GV = dyn_cast<GlobalVariable>(this))		if (const GlobalVariable *GV = dyn_cast<GlobalVariable>(this))
return GV->getNumOperands() == 0;		return GV->getNumOperands() == 0;

// Functions are definitions if they have a body.		// Functions are definitions if they have a body.
if (const Function *F = dyn_cast<Function>(this))		if (const Function *F = dyn_cast<Function>(this))
return F->empty() && !F->isMaterializable();		return F->empty() && !F->isMaterializable();
▲ Show 20 Lines • Show All 121 Lines • ▼ Show 20 Lines	bool GlobalValue::canBeOmittedFromSymbolTable() const {
// objects.		// objects.
if (auto *Var = dyn_cast<GlobalVariable>(this))		if (auto *Var = dyn_cast<GlobalVariable>(this))
if (!Var->isConstant())		if (!Var->isConstant())
return false;		return false;

return hasAtLeastLocalUnnamedAddr();		return hasAtLeastLocalUnnamedAddr();
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
		MaskRayUnsubmitted Not Done Reply Inline Actions Sent D101264 to refactor this function a bit. MaskRay: Sent D101264 to refactor this function a bit.
// GlobalVariable Implementation		// GlobalVariable Implementation
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

GlobalVariable::GlobalVariable(Type *Ty, bool constant, LinkageTypes Link,		GlobalVariable::GlobalVariable(Type *Ty, bool constant, LinkageTypes Link,
Constant *InitVal, const Twine &Name,		Constant *InitVal, const Twine &Name,
ThreadLocalMode TLMode, unsigned AddressSpace,		ThreadLocalMode TLMode, unsigned AddressSpace,
bool isExternallyInitialized)		bool isExternallyInitialized)
: GlobalObject(Ty, Value::GlobalVariableVal,		: GlobalObject(Ty, Value::GlobalVariableVal,
OperandTraits<GlobalVariable>::op_begin(this),		OperandTraits<GlobalVariable>::op_begin(this),
InitVal != nullptr, Link, Name, AddressSpace),		InitVal != nullptr, Link, Name, AddressSpace),
isConstantGlobal(constant),		isConstantGlobal(constant),
isExternallyInitializedConstant(isExternallyInitialized) {		isExternallyInitializedConstant(isExternallyInitialized) {
assert(!Ty->isFunctionTy() && PointerType::isValidElementType(Ty) &&		assert(!Ty->isFunctionTy() && PointerType::isValidElementType(Ty) &&
"invalid type for global variable");		"invalid type for global variable");
setThreadLocalMode(TLMode);		setThreadLocalMode(TLMode);
if (InitVal) {		if (InitVal) {
assert(InitVal->getType() == Ty &&		assert(InitVal->getType() == Ty &&
"Initializer should be the same type as the GlobalVariable!");		"Initializer should be the same type as the GlobalVariable!");
Op<0>() = InitVal;		Op<0>() = InitVal;
}		}
}		}

GlobalVariable::GlobalVariable(Module &M, Type *Ty, bool constant,		GlobalVariable::GlobalVariable(Module &M, Type *Ty, bool constant,
		jdoerfertUnsubmitted Not Done Reply Inline Actions The only thing I'm not 100% convinced by is this part. I can see the appeal, but I can also imagine problems in the future. Not everything looking at the linkage might care about `noipa`. I'd rather introduce a helper that checks `noipa` and linkage, and maybe also the alwaysinline attribute. Basically, /// Determine whether the function \p F is IPO amendable /// /// If a function is exactly defined or it has alwaysinline attribute /// and is viable to be inlined, we say it is IPO amendable bool isFunctionIPOAmendable(const Function &F) { ┊ return !F->hasNoIPA() && (F.hasExactDefinition() \|\| F.isAlwaysInlined()); } whereas the "always inlined" function does not exist yet. jdoerfert: The only thing I'm not 100% convinced by is this part. I can see the appeal, but I can also…
		dblaikieAuthorUnsubmitted Done Reply Inline Actions Yep - I mean this is the guts of it, so the bit that's interesting/debatable from a design perspective (but glad to hear the rest of it/the mechanical stuff is about right) I worry about implementing this as a separate property and having to update every pass/place that queries this sort of thing. That'll mean likely changing every call to hasExactDefinition (admittedly there are fewer calls than I was expecting, which also sort of worries me a bit - about 20, most of them in FunctionAttrs) and adding new test coverage for every use? & all the future uses that might find hasExactDefinition and think that's enough. Perhaps we could rename hasExactDefinition to something that would capture this new use case and reduce the chance it'd be used for something unsuitable later? dblaikie: Yep - I mean this is the guts of it, so the bit that's interesting/debatable from a design…
		jdoerfertUnsubmitted Not Done Reply Inline Actions I think "isFunctionIPOAmendable", or similar, makes it much clearer to the observer what is happening. We can also put more things in there, the always_inline logic, we can check for `naked`, etc. Since any new name/function requires to go to all callers, I don't think there is much to gain/loose (assuming we don't overload hasExactDefinition). jdoerfert: I think "isFunctionIPOAmendable", or similar, makes it much clearer to the observer what is…
		dblaikieAuthorUnsubmitted Done Reply Inline Actions You mean not much to gain/lose regarding whether the function is renamed V a new function is added? Yeah, it's a bit of a stretch on my side, but I feel like renaming and adding some functionality to the function probably doesn't merit adding testing to al callers - but adding a new function and porting the passes over to it, I'd feel compelled to test each pass. How do you feel about either of those perspectives, or some other perspective on the change and testing of it? dblaikie: You mean not much to gain/lose regarding whether the function is renamed V a new function is…
		jdoerfertUnsubmitted Not Done Reply Inline Actions yes, renaming vs new function seems to be little different. I favor a new function as there might be unknown uses downstream, but I'm not opposing renaming the existing one so we can make it do something else/more. I would create a new helper, select the passes we know should deal with `noipa` explicitly, move them over to the new helper, and add tests. It is unclear how we could get away with less tests if we do something else, I mean without just not testing if a pass honors `noipa` now. Function-attrs (and I can port them to the Attributor) is the top consumer. From reading through the uses of "exact definition" I'd say IPSCCP (via canTrackReturnsInterprocedurally) is another one. @arsenm might want to look at `AMDGPUAnnotateKernelFeatures.cpp`. PruneEH and DeadArgElim need updates and tests, though that should be easy. I have no idea what `ObjCARCAPElim.cpp` does, we probably should ping someone. That should cover everything in-tree, I think. That said, I'm sure we have some volunteers to do some of the porting testing so it's not all on you. (Looking at @xbolva00 ;) ) jdoerfert: yes, renaming vs new function seems to be little different. I favor a new function as there…
		dblaikieAuthorUnsubmitted Not Done Reply Inline Actions Yeah, I worry about keeping the existing one in place due to the risk of it being misused out of momentum/existing familiarity, rendering noipa less accurate. Admittedly partly inspired by the GCC implementation ( https://gcc.gnu.org/legacy-ml/gcc/2016-12/msg00064.html ) that implemented noipa by way of marking such functions as interposable - which I tend to agree with. The idea that we have an existing property that this maps to (though there's reasonable disagreement about how accurately) - now we're adding a new way that that property can be expressed. That doesn't necessitate testing all functionality that depends on the property - only testing that the new expression of the property is working correctly. dblaikie: Yeah, I worry about keeping the existing one in place due to the risk of it being misused out…
		jdoerfertUnsubmitted Not Done Reply Inline Actions Yeah, I worry about keeping the existing one in place due to the risk of it being misused out of momentum/existing familiarity, rendering noipa less accurate Fair. Maybe we rename it and introduce a new helper ;) Renaming so downstream users notice, new helper to encapsulate all the "can we do IPA across this call edge" logic without polutting the "is this derefinable" code path. I generally would value clear naming/design over the desire to "auto-port" existing users to what might be the right thing. While the latter has short term advantages it often is long term painful, and at some point someone will come along and split the linkage logic and the rest apart, or introduce an argument with default value, rendering all these thoughts mood. [Site note: I imagine we will actually have to find a way to split it to update our internalization optimization in which we want to work around derefinable linkage through a TU internal copy but not around `noipa`.] That said, if people really think hiding `noipa` behind this function is the way to go, I'm not going to block that. jdoerfert: > Yeah, I worry about keeping the existing one in place due to the risk of it being misused out…
		dblaikieAuthorUnsubmitted Done Reply Inline Actions Actually, looking at this further - maybe it makes sense to actually move it further /down/ rather than up. Closer to how GCC modeled this: one layer down, at `GlobalValue::isInterposable` I found a test for the `getParent() && getParent()->getSemanticInterposition()`, the latter uses module metadata to flag the module as semantically interposable. The documentation for "isInterposable" seems more accurate to the semantics I want to implement here: /// Whether the definition of this global may be replaced by something /// non-equivalent at link time. For example, if a function has weak linkage /// then the code defining it may be replaced by different code. (where's `isExactDefinition` says "Inlining is okay across non-exact linkage types as long as they're not interposable" and `mayBeDerefined` says "Returns true if the definition of this global may be replaced by a differently optimized variant of the same source level function at link time." - while the latter is less precise, it still has that worrying "of the same source level function" - which could suggest that a function that may be derefined could still be inlined) In fact, adding this noipa attribute could supersede the SemanticInterposition module metadata entirely - frontends could put noipa on every function (as they do with optnone at -O0 today). Should we call it something different than noipa, then? Should it be called semantic_interposition? dblaikie: Actually, looking at this further - maybe it makes sense to actually move it further /down/…
		jdoerfertUnsubmitted Not Done Reply Inline Actions On first thought I can see us adding an attribute to replace that metadata, it depends on how it is supposed to work in an LTO setting at the end of the day. That said, `noipa` should exist on its own. I also still think we should not intertwine two things that are similar but different: "do not perform ipa/ipo" and "the semantics do not allow you to do ipa/ipo, among other things". Long story short, I'll continue to be in favor of a new `canIdoIPO(CallBase/Function)` helper which we employ in our passes, though others should chime in if you prefer a solution in which `noipa` is checked in existing lookup functions. jdoerfert: On first thought I can see us adding an attribute to replace that metadata, it depends on how…
		MaskRayUnsubmitted Not Done Reply Inline Actions +1 for adding a helper named `isFunctionIPOAmendable` or similar. `"SemanticInterposition"` is more of a workaround to not regress existing `GlobalValue::isInterposable` call sites which might not be appropriate. If the call sites are fixed/confirmed ok, `noipa` can supersede `"SemanticInterposition"`. MaskRay: +1 for adding a helper named `isFunctionIPOAmendable` or similar. `"SemanticInterposition"` is…
LinkageTypes Link, Constant *InitVal,		LinkageTypes Link, Constant *InitVal,
const Twine &Name, GlobalVariable *Before,		const Twine &Name, GlobalVariable *Before,
ThreadLocalMode TLMode,		ThreadLocalMode TLMode,
std::optional<unsigned> AddressSpace,		std::optional<unsigned> AddressSpace,
bool isExternallyInitialized)		bool isExternallyInitialized)
: GlobalVariable(Ty, constant, Link, InitVal, Name, TLMode,		: GlobalVariable(Ty, constant, Link, InitVal, Name, TLMode,
AddressSpace		AddressSpace
? *AddressSpace		? *AddressSpace
▲ Show 20 Lines • Show All 140 Lines • Show Last 20 Lines

llvm/lib/Transforms/IPO/FunctionAttrs.cpp

Show First 20 Lines • Show All 737 Lines • ▼ Show 20 Lines

/// Deduce returned attributes for the SCC.		/// Deduce returned attributes for the SCC.
static void addArgumentReturnedAttrs(const SCCNodeSet &SCCNodes,		static void addArgumentReturnedAttrs(const SCCNodeSet &SCCNodes,
SmallSet<Function *, 8> &Changed) {		SmallSet<Function *, 8> &Changed) {
// Check each function in turn, determining if an argument is always returned.		// Check each function in turn, determining if an argument is always returned.
for (Function *F : SCCNodes) {		for (Function *F : SCCNodes) {
// We can infer and propagate function attributes only when we know that the		// We can infer and propagate function attributes only when we know that the
// definition we'll get at link time is exactly the definition we see now.		// definition we'll get at link time is exactly the definition we see now.
// For more details, see GlobalValue::mayBeDerefined.		// For more details, see GlobalValue::mayBeDerefinedOrNoIPA.
if (!F->hasExactDefinition())		if (!F->hasExactDefinition())
continue;		continue;

if (F->getReturnType()->isVoidTy())		if (F->getReturnType()->isVoidTy())
continue;		continue;

// There is nothing to do if an argument is already marked as 'returned'.		// There is nothing to do if an argument is already marked as 'returned'.
if (F->getAttributes().hasAttrSomewhere(Attribute::Returned))		if (F->getAttributes().hasAttrSomewhere(Attribute::Returned))
▲ Show 20 Lines • Show All 100 Lines • ▼ Show 20 Lines	static void addArgumentAttrs(const SCCNodeSet &SCCNodes,
SmallSet<Function *, 8> &Changed) {		SmallSet<Function *, 8> &Changed) {
ArgumentGraph AG;		ArgumentGraph AG;

// Check each function in turn, determining which pointer arguments are not		// Check each function in turn, determining which pointer arguments are not
// captured.		// captured.
for (Function *F : SCCNodes) {		for (Function *F : SCCNodes) {
// We can infer and propagate function attributes only when we know that the		// We can infer and propagate function attributes only when we know that the
// definition we'll get at link time is exactly the definition we see now.		// definition we'll get at link time is exactly the definition we see now.
// For more details, see GlobalValue::mayBeDerefined.		// For more details, see GlobalValue::mayBeDerefinedOrNoIPA.
if (!F->hasExactDefinition())		if (!F->hasExactDefinition())
continue;		continue;

if (addArgumentAttrsFromCallsites(*F))		if (addArgumentAttrsFromCallsites(*F))
Changed.insert(F);		Changed.insert(F);

// Functions that are readonly (or readnone) and nounwind and don't return		// Functions that are readonly (or readnone) and nounwind and don't return
// a value can't capture arguments. Don't analyze them.		// a value can't capture arguments. Don't analyze them.
▲ Show 20 Lines • Show All 234 Lines • ▼ Show 20 Lines	static void addNoAliasAttrs(const SCCNodeSet &SCCNodes,
// pointers.		// pointers.
for (Function *F : SCCNodes) {		for (Function *F : SCCNodes) {
// Already noalias.		// Already noalias.
if (F->returnDoesNotAlias())		if (F->returnDoesNotAlias())
continue;		continue;

// We can infer and propagate function attributes only when we know that the		// We can infer and propagate function attributes only when we know that the
// definition we'll get at link time is exactly the definition we see now.		// definition we'll get at link time is exactly the definition we see now.
// For more details, see GlobalValue::mayBeDerefined.		// For more details, see GlobalValue::mayBeDerefinedOrNoIPA.
if (!F->hasExactDefinition())		if (!F->hasExactDefinition())
return;		return;

// We annotate noalias return values, which are only applicable to		// We annotate noalias return values, which are only applicable to
// pointer types.		// pointer types.
if (!F->getReturnType()->isPointerTy())		if (!F->getReturnType()->isPointerTy())
continue;		continue;

▲ Show 20 Lines • Show All 95 Lines • ▼ Show 20 Lines	static void addNonNullAttrs(const SCCNodeSet &SCCNodes,
// pointers.		// pointers.
for (Function *F : SCCNodes) {		for (Function *F : SCCNodes) {
// Already nonnull.		// Already nonnull.
if (F->getAttributes().hasRetAttr(Attribute::NonNull))		if (F->getAttributes().hasRetAttr(Attribute::NonNull))
continue;		continue;

// We can infer and propagate function attributes only when we know that the		// We can infer and propagate function attributes only when we know that the
// definition we'll get at link time is exactly the definition we see now.		// definition we'll get at link time is exactly the definition we see now.
// For more details, see GlobalValue::mayBeDerefined.		// For more details, see GlobalValue::mayBeDerefinedOrNoIPA.
if (!F->hasExactDefinition())		if (!F->hasExactDefinition())
return;		return;

// We annotate nonnull return values, which are only applicable to		// We annotate nonnull return values, which are only applicable to
// pointer types.		// pointer types.
if (!F->getReturnType()->isPointerTy())		if (!F->getReturnType()->isPointerTy())
continue;		continue;

▲ Show 20 Lines • Show All 439 Lines • ▼ Show 20 Lines	if (!canReturn(*F)) {
Changed.insert(F);		Changed.insert(F);
}		}
}		}
}		}

static bool functionWillReturn(const Function &F) {		static bool functionWillReturn(const Function &F) {
// We can infer and propagate function attributes only when we know that the		// We can infer and propagate function attributes only when we know that the
// definition we'll get at link time is exactly the definition we see now.		// definition we'll get at link time is exactly the definition we see now.
// For more details, see GlobalValue::mayBeDerefined.		// For more details, see GlobalValue::mayBeDerefinedOrNoIPA.
if (!F.hasExactDefinition())		if (!F.hasExactDefinition())
return false;		return false;

// Must-progress function without side-effects must return.		// Must-progress function without side-effects must return.
if (F.mustProgress() && F.onlyReadsMemory())		if (F.mustProgress() && F.onlyReadsMemory())
return true;		return true;

// Can only analyze functions with a definition.		// Can only analyze functions with a definition.
▲ Show 20 Lines • Show All 252 Lines • Show Last 20 Lines

llvm/lib/Transforms/Utils/CodeExtractor.cpp

Show First 20 Lines • Show All 940 Lines • ▼ Show 20 Lines	if (Attr.isStringAttribute()) {
case Attribute::NoRedZone:		case Attribute::NoRedZone:
case Attribute::NoUnwind:		case Attribute::NoUnwind:
case Attribute::NoSanitizeBounds:		case Attribute::NoSanitizeBounds:
case Attribute::NoSanitizeCoverage:		case Attribute::NoSanitizeCoverage:
case Attribute::NullPointerIsValid:		case Attribute::NullPointerIsValid:
case Attribute::OptimizeForDebugging:		case Attribute::OptimizeForDebugging:
case Attribute::OptForFuzzing:		case Attribute::OptForFuzzing:
case Attribute::OptimizeNone:		case Attribute::OptimizeNone:
		case Attribute::NoIPA:
case Attribute::OptimizeForSize:		case Attribute::OptimizeForSize:
case Attribute::SafeStack:		case Attribute::SafeStack:
case Attribute::ShadowCallStack:		case Attribute::ShadowCallStack:
case Attribute::SanitizeAddress:		case Attribute::SanitizeAddress:
case Attribute::SanitizeMemory:		case Attribute::SanitizeMemory:
case Attribute::SanitizeThread:		case Attribute::SanitizeThread:
case Attribute::SanitizeHWAddress:		case Attribute::SanitizeHWAddress:
case Attribute::SanitizeMemTag:		case Attribute::SanitizeMemTag:
▲ Show 20 Lines • Show All 942 Lines • Show Last 20 Lines

llvm/test/Bitcode/attributes.ll

	Show First 20 Lines • Show All 511 Lines • ▼ Show 20 Lines
	define void @f88() skipprofile { ret void }			define void @f88() skipprofile { ret void }

	define void @f89() optdebug			define void @f89() optdebug
	; CHECK: define void @f89() [[OPTDEBUG:#[0-9]+]]			; CHECK: define void @f89() [[OPTDEBUG:#[0-9]+]]
	{			{
	ret void;			ret void;
	}			}

				define void @f90() noipa
				; CHECK: define void @f90() [[NOIPA:#[0-9]+]]
				{
				ret void
				}

	; CHECK: attributes #0 = { noreturn }			; CHECK: attributes #0 = { noreturn }
	; CHECK: attributes #1 = { nounwind }			; CHECK: attributes #1 = { nounwind }
	; CHECK: attributes #2 = { memory(none) }			; CHECK: attributes #2 = { memory(none) }
	; CHECK: attributes #3 = { memory(read) }			; CHECK: attributes #3 = { memory(read) }
	; CHECK: attributes #4 = { noinline }			; CHECK: attributes #4 = { noinline }
	; CHECK: attributes #5 = { alwaysinline }			; CHECK: attributes #5 = { alwaysinline }
	; CHECK: attributes #6 = { optsize }			; CHECK: attributes #6 = { optsize }
	; CHECK: attributes #7 = { ssp }			; CHECK: attributes #7 = { ssp }
	Show All 40 Lines
	; CHECK: attributes #48 = { nosanitize_coverage }			; CHECK: attributes #48 = { nosanitize_coverage }
	; CHECK: attributes #49 = { noprofile }			; CHECK: attributes #49 = { noprofile }
	; CHECK: attributes #50 = { disable_sanitizer_instrumentation }			; CHECK: attributes #50 = { disable_sanitizer_instrumentation }
	; CHECK: attributes #51 = { uwtable(sync) }			; CHECK: attributes #51 = { uwtable(sync) }
	; CHECK: attributes #52 = { nosanitize_bounds }			; CHECK: attributes #52 = { nosanitize_bounds }
	; CHECK: attributes [[FNRETTHUNKEXTERN]] = { fn_ret_thunk_extern }			; CHECK: attributes [[FNRETTHUNKEXTERN]] = { fn_ret_thunk_extern }
	; CHECK: attributes [[SKIPPROFILE]] = { skipprofile }			; CHECK: attributes [[SKIPPROFILE]] = { skipprofile }
	; CHECK: attributes [[OPTDEBUG]] = { optdebug }			; CHECK: attributes [[OPTDEBUG]] = { optdebug }
				; CHECK: attributes [[NOIPA]] = { noipa }
	; CHECK: attributes #[[NOBUILTIN]] = { nobuiltin }			; CHECK: attributes #[[NOBUILTIN]] = { nobuiltin }

llvm/unittests/IR/FunctionTest.cpp

Show First 20 Lines • Show All 480 Lines • ▼ Show 20 Lines	)");
EXPECT_EQ(&*It++, BB3);		EXPECT_EQ(&*It++, BB3);
EXPECT_EQ(&*It++, BB4);		EXPECT_EQ(&*It++, BB4);
EXPECT_EQ(&*It++, BB5);		EXPECT_EQ(&*It++, BB5);

// Erase all BBs.		// Erase all BBs.
It = F->erase(F->begin(), F->end());		It = F->erase(F->begin(), F->end());
EXPECT_EQ(F->size(), 0u);		EXPECT_EQ(F->size(), 0u);
}		}

		TEST(FunctionTest, NoIPAInexact) {
		LLVMContext Ctx;
		std::unique_ptr<Module> M = parseIR(Ctx, R"(
		define void @foo() { bb1: ret void }
		define void @bar() #0 { bb1: ret void }
		attributes #0 = { noipa }
		)");
		EXPECT_TRUE(M->getFunction("foo")->isDefinitionExact());
		EXPECT_FALSE(M->getFunction("bar")->isDefinitionExact());
		}

} // end namespace		} // end namespace

llvm/utils/emacs/llvm-mode.el

Show All 18 Lines	(defvar llvm-mode-syntax-table
"Syntax table used while in LLVM mode.")		"Syntax table used while in LLVM mode.")

(defvar llvm-font-lock-keywords		(defvar llvm-font-lock-keywords
(list		(list
;; Attributes		;; Attributes
`(,(regexp-opt		`(,(regexp-opt
'("alwaysinline" "argmemonly" "allocsize" "builtin" "cold" "convergent" "dereferenceable" "dereferenceable_or_null" "hot" "immarg" "inaccessiblememonly"		'("alwaysinline" "argmemonly" "allocsize" "builtin" "cold" "convergent" "dereferenceable" "dereferenceable_or_null" "hot" "immarg" "inaccessiblememonly"
"inaccessiblemem_or_argmemonly" "inalloca" "inlinehint" "jumptable" "minsize" "mustprogress" "naked" "nobuiltin" "nonnull" "nocapture"		"inaccessiblemem_or_argmemonly" "inalloca" "inlinehint" "jumptable" "minsize" "mustprogress" "naked" "nobuiltin" "nonnull" "nocapture"
"nocallback" "nocf_check" "noduplicate" "nofree" "noimplicitfloat" "noinline" "nomerge" "nonlazybind" "noprofile" "noredzone" "noreturn"		"nocallback" "nocf_check" "noduplicate" "nofree" "noimplicitfloat" "noinline" "noipa" "nomerge" "nonlazybind" "noprofile" "noredzone" "noreturn"
"norecurse" "nosync" "noundef" "nounwind" "nosanitize_bounds" "nosanitize_coverage" "null_pointer_is_valid" "optdebug" "optforfuzzing" "optnone" "optsize" "preallocated" "readnone" "readonly" "returned" "returns_twice"		"norecurse" "nosync" "noundef" "nounwind" "nosanitize_bounds" "nosanitize_coverage" "null_pointer_is_valid" "optdebug" "optforfuzzing" "optnone" "optsize" "preallocated" "readnone" "readonly" "returned" "returns_twice"
"shadowcallstack" "signext" "speculatable" "speculative_load_hardening" "ssp" "sspreq" "sspstrong" "safestack" "sanitize_address" "sanitize_hwaddress" "sanitize_memtag"		"shadowcallstack" "signext" "speculatable" "speculative_load_hardening" "ssp" "sspreq" "sspstrong" "safestack" "sanitize_address" "sanitize_hwaddress" "sanitize_memtag"
"sanitize_thread" "sanitize_memory" "strictfp" "swifterror" "uwtable" "vscale_range" "willreturn" "writeonly" "zeroext") 'symbols) . font-lock-constant-face)		"sanitize_thread" "sanitize_memory" "strictfp" "swifterror" "uwtable" "vscale_range" "willreturn" "writeonly" "zeroext") 'symbols) . font-lock-constant-face)
;; Variables		;; Variables
'("%[-a-zA-Z$._][-a-zA-Z$._0-9]*" . font-lock-variable-name-face)		'("%[-a-zA-Z$._][-a-zA-Z$._0-9]*" . font-lock-variable-name-face)
;; Labels		;; Labels
'("[-a-zA-Z$._0-9]+:" . font-lock-variable-name-face)		'("[-a-zA-Z$._0-9]+:" . font-lock-variable-name-face)
;; Unnamed variable slots		;; Unnamed variable slots
▲ Show 20 Lines • Show All 86 Lines • Show Last 20 Lines

llvm/utils/kate/llvm.xml

Show First 20 Lines • Show All 98 Lines • ▼ Show 20 Lines	<list name="function-attributes">
<item> naked </item>		<item> naked </item>
<item> nobuiltin </item>		<item> nobuiltin </item>
<item> nocallback </item>		<item> nocallback </item>
<item> nocf_check </item>		<item> nocf_check </item>
<item> noduplicate </item>		<item> noduplicate </item>
<item> nofree </item>		<item> nofree </item>
<item> noimplicitfloat </item>		<item> noimplicitfloat </item>
<item> noinline </item>		<item> noinline </item>
		<item> noipa </item>
<item> nomerge </item>		<item> nomerge </item>
<item> noprofile </item>		<item> noprofile </item>
<item> noredzone </item>		<item> noredzone </item>
<item> noreturn </item>		<item> noreturn </item>
<item> nosync </item>		<item> nosync </item>
<item> nounwind </item>		<item> nounwind </item>
<item> null_pointer_is_valid </item>		<item> null_pointer_is_valid </item>
<item> optdebug </item>		<item> optdebug </item>
▲ Show 20 Lines • Show All 195 Lines • Show Last 20 Lines

llvm/utils/vim/syntax/llvm.vim

Show First 20 Lines • Show All 123 Lines • ▼ Show 20 Lines	syn keyword llvmKeyword
\ nocallback		\ nocallback
\ nocapture		\ nocapture
\ nocf_check		\ nocf_check
\ no_cfi		\ no_cfi
\ noduplicate		\ noduplicate
\ nofree		\ nofree
\ noimplicitfloat		\ noimplicitfloat
\ noinline		\ noinline
		\ noipa
\ nomerge		\ nomerge
\ nonlazybind		\ nonlazybind
\ nonnull		\ nonnull
\ noprofile		\ noprofile
\ norecurse		\ norecurse
\ noredzone		\ noredzone
\ noreturn		\ noreturn
\ nosync		\ nosync
▲ Show 20 Lines • Show All 133 Lines • Show Last 20 Lines

llvm/utils/vscode/llvm/syntaxes/ll.tmLanguage.yaml

Show First 20 Lines • Show All 222 Lines • ▼ Show 20 Lines	- match: "\\bacq_rel\\b\|\
\\bnobuiltin\\b\|\		\\bnobuiltin\\b\|\
\\bnocallback\\b\|\		\\bnocallback\\b\|\
\\bnocapture\\b\|\		\\bnocapture\\b\|\
\\bnocf_check\\b\|\		\\bnocf_check\\b\|\
\\bnoduplicate\\b\|\		\\bnoduplicate\\b\|\
\\bnofree\\b\|\		\\bnofree\\b\|\
\\bnoimplicitfloat\\b\|\		\\bnoimplicitfloat\\b\|\
\\bnoinline\\b\|\		\\bnoinline\\b\|\
		\\bnoipa\\b\|\
\\bnomerge\\b\|\		\\bnomerge\\b\|\
\\bnonlazybind\\b\|\		\\bnonlazybind\\b\|\
\\bnonnull\\b\|\		\\bnonnull\\b\|\
\\bnoprofile\\b\|\		\\bnoprofile\\b\|\
\\bnorecurse\\b\|\		\\bnorecurse\\b\|\
\\bnoredzone\\b\|\		\\bnoredzone\\b\|\
\\bnoreturn\\b\|\		\\bnoreturn\\b\|\
\\bnosync\\b\|\		\\bnosync\\b\|\
▲ Show 20 Lines • Show All 128 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[Attr] Add "noipa" function attributeNeeds ReviewPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 557886

llvm/docs/LangRef.rst

llvm/include/llvm/Bitcode/LLVMBitCodes.h

llvm/include/llvm/IR/Attributes.td

llvm/include/llvm/IR/GlobalValue.h

llvm/lib/Bitcode/Reader/BitcodeReader.cpp

llvm/lib/Bitcode/Writer/BitcodeWriter.cpp

llvm/lib/IR/Globals.cpp

llvm/lib/Transforms/IPO/FunctionAttrs.cpp

llvm/lib/Transforms/Utils/CodeExtractor.cpp

llvm/test/Bitcode/attributes.ll

llvm/unittests/IR/FunctionTest.cpp

llvm/utils/emacs/llvm-mode.el

llvm/utils/kate/llvm.xml

llvm/utils/vim/syntax/llvm.vim

llvm/utils/vscode/llvm/syntaxes/ll.tmLanguage.yaml

[Attr] Add "noipa" function attribute
Needs ReviewPublic